The Prosody Module

Part of the book series: Artificial Intelligence (AI)

Abstract

We describe the acoustic-prosodic and syntactic-prosodic annotation and classification of boundaries, accents, and sentence mood as integrated in the Verbmobil system for three languages: German, English, and Japanese. Acoustic-prosodic classification uses a large feature vector of normalized prosodic features. A single multilingual prosody module was developed for the three languages; it reduces memory requirements considerably compared with three monolingual modules. Classification is performed with neural networks and statistical language models.

© 2000 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Batliner, A., Buckow, J., Niemann, H., Nöth, E., Warnke, V. (2000). The Prosody Module. In: Wahlster, W. (ed.) Verbmobil: Foundations of Speech-to-Speech Translation. Artificial Intelligence. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-04230-4_8

  • DOI: https://doi.org/10.1007/978-3-662-04230-4_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-08730-1

  • Online ISBN: 978-3-662-04230-4

  • eBook Packages: Springer Book Archive
