Speech Dialog as a Part of Interactive “Human-Machine” Systems

Potapova, Rodmonga

doi:10.1007/978-3-319-43955-6_25

Speech Dialog as a Part of Interactive “Human-Machine” Systems

Rodmonga Potapova¹⁶

Conference paper
First Online: 14 August 2016

1124 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9812))

Abstract

One of the most significant features in applied linguistics and recognition technologies used in methods of spoken language recognition is speech signal which includes some primary tasks: preprocessing, processing and recognition regarding next main important features of acoustic analysis of spoken language: \(F_{0i}, I_{{i}}, t_{{i}}, F_{{ni}}\). This paper presents one of the human machine methods with regard to continuous speech detection on the basis of formant Fni analysis. There are many ways to perform acoustic analysis, but the acoustic-phonetic recognition functions at the phoneme and prosody level seem to be one of the classical speech recognition methods.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Anzalone, S.M., Yoshikawa, Y., Ishiguro, H., Menegatti, E., Pagello, E., Sorbello, R.: Towards partners profiling in human robot interaction contexts. In: Noda, I., Ando, N., Brugali, D., Kuffner, J.J. (eds.) SIMPAR 2012. LNCS, vol. 7628, pp. 4–15. Springer, Heidelberg (2012)
Chapter Google Scholar
Bertau, M.-C.: Voice as heuristic device to integrate biological and social sciences. A comment to Sidtis & Kreimans in the beginning was the familiar voice. Integr. Psychol. Behav. Sci. 46(2), 160–171 (2012)
Article Google Scholar
Khnel, C.: Introduction and motivation. In: Quantifying Quality Aspects of Multimodal Interactive Systems. Part of the series T-Labs Series in Telecommunication Services, pp. 1–11 (2012)
Google Scholar
Markowitz, J.A.: Using Speech Recognition. Prentice Hall PTR, Upper Saddle River (1996)
Google Scholar
Porta, A., Deru, M., Bergweiler, S., Herzog, G., Poller, P.: Building multimodal dialog user interfaces in the context of the internet of services. In: Towards the Internet of Services: The THESEUS Research Program. Part of the series Cognitive Technologies. Springer International Publishing, Heidelberg, pp. 145–162 (2014)
Google Scholar
Potapov, V.: Speech rhythmic patterns of the Slavic languages. In: Ronzhin, A., Potapova, R., Delic, V. (eds.) SPECOM 2014. LNCS, vol. 8773, pp. 425–434. Springer, Heidelberg (2014)
Google Scholar
Potapova, R.: Priority trends of the present day applied linguistics. J. Convers. Mach. Build. Russ., pp. 3–4 (2004) (in Russian)
Google Scholar
Potapova, R.K.: On Natural Language Processing Technology on the Domain of Science & Industry. Russian Academy of Sciences, Ozyorsk (1992). (in Russian)
Google Scholar
Potapova, R.K.: Speech Driving of Robots, 2nd edn. KomKniga, Moscow (2005). Revised and corrected (in Russian)
Google Scholar
Potapova, R.K.: Speech: Communication, Information, Cybernetics, 4th edn. Librokom, Moscow (2010). (in Russian)
Google Scholar
Rigoll, G.: Multimodal human-robot interaction from the perspective of a speech scientist. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 3–10. Springer, Heidelberg (2015)
Chapter Google Scholar
Saraclar, M., Dikici, E., Arisoy, E.: A decade of discriminative language modeling for automatic speech recognition. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 11–22. Springer, Heidelberg (2015)
Chapter Google Scholar
Saveliev, A., Basov, O., Ronzhin, A., Ronzhin, A.: Algorithms for low bit-rate coding with adaptation to statistical characteristics of speech signal. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 65–72. Springer, Heidelberg (2015)
Chapter Google Scholar
Schmandt, C.: Voice Communication with Machines. Conversational Systems. Van Nostrand Reinhold, New York (1994)
MATH Google Scholar
Wechsung, I.: What are multimodal systems? Why do they need evaluation? Theoretical background. An evaluation framework for multimodal interaction. Part of the series T-Labs Series in Telecommunication Services, pp. 7–22 (2014)
Google Scholar

Download references

Acknowledgments

The research was financially supported by the Russian Foundation for Basic Research, grant No. 14-06-00363.

Author information

Authors and Affiliations

Institute of Applied and Mathematical Linguistics, Moscow State Linguistic University, Ostozhenka 38, Moscow, 119034, Russia
Rodmonga Potapova

Authors

Rodmonga Potapova
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Rodmonga Potapova .

Editor information

Editors and Affiliations

Russian Academy of Sciences, SPIIRAS , St. Petersburg, Russia
Andrey Ronzhin
TU Munich , München, Germany
Gerhard Rigoll
Tomsk State University , Tomsk, Russia
Roman Meshcheryakov

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Potapova, R. (2016). Speech Dialog as a Part of Interactive “Human-Machine” Systems. In: Ronzhin, A., Rigoll, G., Meshcheryakov, R. (eds) Interactive Collaborative Robotics. ICR 2016. Lecture Notes in Computer Science(), vol 9812. Springer, Cham. https://doi.org/10.1007/978-3-319-43955-6_25

Download citation

DOI: https://doi.org/10.1007/978-3-319-43955-6_25
Published: 14 August 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-43954-9
Online ISBN: 978-3-319-43955-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics