Abstract
One of the most significant features in applied linguistics and recognition technologies used in methods of spoken language recognition is speech signal which includes some primary tasks: preprocessing, processing and recognition regarding next main important features of acoustic analysis of spoken language: \(F_{0i}, I_{{i}}, t_{{i}}, F_{{ni}}\). This paper presents one of the human machine methods with regard to continuous speech detection on the basis of formant Fni analysis. There are many ways to perform acoustic analysis, but the acoustic-phonetic recognition functions at the phoneme and prosody level seem to be one of the classical speech recognition methods.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Anzalone, S.M., Yoshikawa, Y., Ishiguro, H., Menegatti, E., Pagello, E., Sorbello, R.: Towards partners profiling in human robot interaction contexts. In: Noda, I., Ando, N., Brugali, D., Kuffner, J.J. (eds.) SIMPAR 2012. LNCS, vol. 7628, pp. 4–15. Springer, Heidelberg (2012)
Bertau, M.-C.: Voice as heuristic device to integrate biological and social sciences. A comment to Sidtis & Kreimans in the beginning was the familiar voice. Integr. Psychol. Behav. Sci. 46(2), 160–171 (2012)
Khnel, C.: Introduction and motivation. In: Quantifying Quality Aspects of Multimodal Interactive Systems. Part of the series T-Labs Series in Telecommunication Services, pp. 1–11 (2012)
Markowitz, J.A.: Using Speech Recognition. Prentice Hall PTR, Upper Saddle River (1996)
Porta, A., Deru, M., Bergweiler, S., Herzog, G., Poller, P.: Building multimodal dialog user interfaces in the context of the internet of services. In: Towards the Internet of Services: The THESEUS Research Program. Part of the series Cognitive Technologies. Springer International Publishing, Heidelberg, pp. 145–162 (2014)
Potapov, V.: Speech rhythmic patterns of the Slavic languages. In: Ronzhin, A., Potapova, R., Delic, V. (eds.) SPECOM 2014. LNCS, vol. 8773, pp. 425–434. Springer, Heidelberg (2014)
Potapova, R.: Priority trends of the present day applied linguistics. J. Convers. Mach. Build. Russ., pp. 3–4 (2004) (in Russian)
Potapova, R.K.: On Natural Language Processing Technology on the Domain of Science & Industry. Russian Academy of Sciences, Ozyorsk (1992). (in Russian)
Potapova, R.K.: Speech Driving of Robots, 2nd edn. KomKniga, Moscow (2005). Revised and corrected (in Russian)
Potapova, R.K.: Speech: Communication, Information, Cybernetics, 4th edn. Librokom, Moscow (2010). (in Russian)
Rigoll, G.: Multimodal human-robot interaction from the perspective of a speech scientist. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 3–10. Springer, Heidelberg (2015)
Saraclar, M., Dikici, E., Arisoy, E.: A decade of discriminative language modeling for automatic speech recognition. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 11–22. Springer, Heidelberg (2015)
Saveliev, A., Basov, O., Ronzhin, A., Ronzhin, A.: Algorithms for low bit-rate coding with adaptation to statistical characteristics of speech signal. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 65–72. Springer, Heidelberg (2015)
Schmandt, C.: Voice Communication with Machines. Conversational Systems. Van Nostrand Reinhold, New York (1994)
Wechsung, I.: What are multimodal systems? Why do they need evaluation? Theoretical background. An evaluation framework for multimodal interaction. Part of the series T-Labs Series in Telecommunication Services, pp. 7–22 (2014)
Acknowledgments
The research was financially supported by the Russian Foundation for Basic Research, grant No. 14-06-00363.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Potapova, R. (2016). Speech Dialog as a Part of Interactive “Human-Machine” Systems. In: Ronzhin, A., Rigoll, G., Meshcheryakov, R. (eds) Interactive Collaborative Robotics. ICR 2016. Lecture Notes in Computer Science(), vol 9812. Springer, Cham. https://doi.org/10.1007/978-3-319-43955-6_25
Download citation
DOI: https://doi.org/10.1007/978-3-319-43955-6_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-43954-9
Online ISBN: 978-3-319-43955-6
eBook Packages: Computer ScienceComputer Science (R0)