Voice Control in a Real Flight Deck Environment

Trzos, Michal; Dostl, Martin; Machkov, Petra; Eitlerov, Jana

doi:10.1007/978-3-030-00794-2_42

Michal Trzos¹⁹,
Martin Dostl¹⁹,
Petra Machkov¹⁹ &
…
Jana Eitlerov¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11107))

Included in the following conference series:

International Conference on Text, Speech, and Dialogue

1403 Accesses
1 Citations

Abstract

In this paper, we present a methodology on how to implement multimodal voice controlled systems by means of automatic speech recognition. The real flight deck environment brings many challenges such as high accuracy requirements, high noise conditions, non-native English-speaking users or limited hardware and software resources. We present the design of an automatic speech recognition system based on a freely available AMI Meeting Corpus and a proprietary corpus provided by Airbus. Then we describe how we trained and evaluated the speech recognition models in a simulated environment using the anechoic chamber laboratory. The tuned speech recognition models were tested in real flight environment on two Honeywell experimental airplanes: Dassault Falcon 900 and Boeing 757.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Dostal, M., Kolcarek, P.: Multimodal navigation display. In: 2015 IEEE/AIAA 34th Digital Avionics Systems Conference (DASC), Prague, pp. 3B1-1–3B1-11 (2015)
Google Scholar
Swearingen, P.A.: United States Patent No. 8,234,121 B1. U.S. Patent and Trademark Office, Washington, DC (2012)
Google Scholar
Mccowan, I., et al.: The AMI meeting corpus. In: Proceedings Measuring Behavior 2005, 5th International Conference on Methods and Techniques in Behavioral Research. In: Noldus, L.P.J.J., Grieco, F., Loijens, L.W.S., Zimmerman, P.H. (eds.) Noldus Information Technology, Wageningen (2005)
Google Scholar
Povey, D., et al.: The Kaldi speech recognition toolkit. In: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding. IEEE Signal Processing Society (2011)
Google Scholar
Peddinti, V., Povey, D., Khudanpur, S.: A time delay neural network architecture for efficient modeling of long temporal contexts. In: proceedings of INTERSPEECH 2015, Dresden, Germany, pp. 3214–3218 (2015)
Google Scholar
Airband. https://en.wikipedia.org/wiki/Airband. Accessed 20 Mar 2018
Srinivasamurthy, A., Motlicek, P., Himawan, I., Szaszk, G., Oualil, Y., Helmke, H.: Semi-supervised learning with semantic knowledge extraction for improved speech recognition in air traffic control. In: Proceedings of the Interspeech 2017, pp. 2406–2410 (2017). https://doi.org/10.21437/Interspeech.2017-1446
Oualil, Y., Klakow, D., Szaszk, G., Srinivasamurthy, A., Helmke, H., Motlicek, P.: A context-aware speech recognition and understanding system for air traffic control domain. In: 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Okinawa, pp. 404–408 (2017)
Google Scholar
Delpech, E., et al.: A real-life, french-accented corpus of air traffic control communications. In: Proceedings of the 11th Language Resources and Evaluation Conference (LREC 2018), Miyazaki, Japan (2018)
Google Scholar
Ranzenberger, T., Hacker, Ch., Gallwitz, F.: Integration of a Kaldi speech recognizer into a speech dialog system for automotive infotainment applications. In: Conference on Electronic Speech Signal Processing (ESSV 2018), Ulm (2018)
Google Scholar
Word Error Rate. https://en.wikipedia.org/wiki/Word_error_rate. Accessed 20 Mar 2018
JSpeech Grammar Format. http://www.w3.org/TR/jsgf. Accessed 20 Mar 2018
ICAO. Manual of Radiotelephony. Document 9432-AN/925, 4th edn (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

Honeywell International, Aerospace Advanced Technology Europe, Tuřanka 1387/100, Brno, Czech Republic
Michal Trzos, Martin Dostl, Petra Machkov & Jana Eitlerov

Authors

Michal Trzos
View author publications
You can also search for this author in PubMed Google Scholar
Martin Dostl
View author publications
You can also search for this author in PubMed Google Scholar
Petra Machkov
View author publications
You can also search for this author in PubMed Google Scholar
Jana Eitlerov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Michal Trzos .

Editor information

Editors and Affiliations

Faculty of Informatics, Masaryk University, Brno, Czech Republic
Petr Sojka
Faculty of Informatics, Masaryk University, Brno, Czech Republic
Aleš Horák
Faculty of Informatics, Masaryk University, Brno, Czech Republic
Ivan Kopeček
Faculty of Informatics, Masaryk University, Brno, Czech Republic
Karel Pala

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Trzos, M., Dostl, M., Machkov, P., Eitlerov, J. (2018). Voice Control in a Real Flight Deck Environment. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech, and Dialogue. TSD 2018. Lecture Notes in Computer Science(), vol 11107. Springer, Cham. https://doi.org/10.1007/978-3-030-00794-2_42

Download citation

DOI: https://doi.org/10.1007/978-3-030-00794-2_42
Published: 08 September 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00793-5
Online ISBN: 978-3-030-00794-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics