Skip to main content

Speech Dialog as a Part of Interactive “Human-Machine” Systems

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9812))

Abstract

One of the most significant features in applied linguistics and recognition technologies used in methods of spoken language recognition is speech signal which includes some primary tasks: preprocessing, processing and recognition regarding next main important features of acoustic analysis of spoken language: \(F_{0i}, I_{{i}}, t_{{i}}, F_{{ni}}\). This paper presents one of the human machine methods with regard to continuous speech detection on the basis of formant Fni analysis. There are many ways to perform acoustic analysis, but the acoustic-phonetic recognition functions at the phoneme and prosody level seem to be one of the classical speech recognition methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Anzalone, S.M., Yoshikawa, Y., Ishiguro, H., Menegatti, E., Pagello, E., Sorbello, R.: Towards partners profiling in human robot interaction contexts. In: Noda, I., Ando, N., Brugali, D., Kuffner, J.J. (eds.) SIMPAR 2012. LNCS, vol. 7628, pp. 4–15. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  2. Bertau, M.-C.: Voice as heuristic device to integrate biological and social sciences. A comment to Sidtis & Kreimans in the beginning was the familiar voice. Integr. Psychol. Behav. Sci. 46(2), 160–171 (2012)

    Article  Google Scholar 

  3. Khnel, C.: Introduction and motivation. In: Quantifying Quality Aspects of Multimodal Interactive Systems. Part of the series T-Labs Series in Telecommunication Services, pp. 1–11 (2012)

    Google Scholar 

  4. Markowitz, J.A.: Using Speech Recognition. Prentice Hall PTR, Upper Saddle River (1996)

    Google Scholar 

  5. Porta, A., Deru, M., Bergweiler, S., Herzog, G., Poller, P.: Building multimodal dialog user interfaces in the context of the internet of services. In: Towards the Internet of Services: The THESEUS Research Program. Part of the series Cognitive Technologies. Springer International Publishing, Heidelberg, pp. 145–162 (2014)

    Google Scholar 

  6. Potapov, V.: Speech rhythmic patterns of the Slavic languages. In: Ronzhin, A., Potapova, R., Delic, V. (eds.) SPECOM 2014. LNCS, vol. 8773, pp. 425–434. Springer, Heidelberg (2014)

    Google Scholar 

  7. Potapova, R.: Priority trends of the present day applied linguistics. J. Convers. Mach. Build. Russ., pp. 3–4 (2004) (in Russian)

    Google Scholar 

  8. Potapova, R.K.: On Natural Language Processing Technology on the Domain of Science & Industry. Russian Academy of Sciences, Ozyorsk (1992). (in Russian)

    Google Scholar 

  9. Potapova, R.K.: Speech Driving of Robots, 2nd edn. KomKniga, Moscow (2005). Revised and corrected (in Russian)

    Google Scholar 

  10. Potapova, R.K.: Speech: Communication, Information, Cybernetics, 4th edn. Librokom, Moscow (2010). (in Russian)

    Google Scholar 

  11. Rigoll, G.: Multimodal human-robot interaction from the perspective of a speech scientist. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 3–10. Springer, Heidelberg (2015)

    Chapter  Google Scholar 

  12. Saraclar, M., Dikici, E., Arisoy, E.: A decade of discriminative language modeling for automatic speech recognition. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 11–22. Springer, Heidelberg (2015)

    Chapter  Google Scholar 

  13. Saveliev, A., Basov, O., Ronzhin, A., Ronzhin, A.: Algorithms for low bit-rate coding with adaptation to statistical characteristics of speech signal. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 65–72. Springer, Heidelberg (2015)

    Chapter  Google Scholar 

  14. Schmandt, C.: Voice Communication with Machines. Conversational Systems. Van Nostrand Reinhold, New York (1994)

    MATH  Google Scholar 

  15. Wechsung, I.: What are multimodal systems? Why do they need evaluation? Theoretical background. An evaluation framework for multimodal interaction. Part of the series T-Labs Series in Telecommunication Services, pp. 7–22 (2014)

    Google Scholar 

Download references

Acknowledgments

The research was financially supported by the Russian Foundation for Basic Research, grant No. 14-06-00363.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rodmonga Potapova .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Potapova, R. (2016). Speech Dialog as a Part of Interactive “Human-Machine” Systems. In: Ronzhin, A., Rigoll, G., Meshcheryakov, R. (eds) Interactive Collaborative Robotics. ICR 2016. Lecture Notes in Computer Science(), vol 9812. Springer, Cham. https://doi.org/10.1007/978-3-319-43955-6_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-43955-6_25

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-43954-9

  • Online ISBN: 978-3-319-43955-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics