The Prosody Module

Batliner, Anton; Buckow, Jan; Niemann, Heinrich; Nöth, Elmar; Warnke, Volker

doi:10.1007/978-3-662-04230-4_8

The Prosody Module

Anton Batliner³,
Jan Buckow³,
Heinrich Niemann³,
Elmar Nöth³ &
…
Volker Warnke³

Chapter

264 Accesses
28 Citations

Part of the book series: Artificial Intelligence ((AI))

Abstract

We describe the acoustic-prosodic and syntactic-prosodic annotation and classification of boundaries, accents and sentence mood integrated in the Verbmobil system for the three languages German, English, and Japanese. For the acoustic-prosodic classification, a large feature vector with normalized prosodic features is used. For the three languages, a multilingual prosody module was developed that reduces memory requirement considerably, compared to three monolingual modules. For classification, neural networks and statistic language models are used.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Alexandersson, J., Engel, R., Kipp, M., Koch, S., Küssner, U., Reithinger, N., and Stede, M. Modeling Negotiation Dialogs. In this volume.
Google Scholar
Bagshaw, P. C. (1994). Automatic Prosodic Analysis for Computer Aided Pronunciation Teaching. PhD thesis, University of Edinburgh.
Google Scholar
Batliner, A., Huber, R., Niemann, H., Nöth, E., Spilker, J., and Fischer, K. The Recognition of Emotion. In this volume.
Google Scholar
Batliner, A., Kompe, R., Kießling, A., Mast, M., Niemann, H., and Nöth, E. (1998). M = Syntax + Prosody: a Syntactic-Prosodic Labelling Scheme for Large Spontaneous Speech Databases. Speech Communication 25(4):193–222.
Article Google Scholar
Batliner, A., Nutt, M., Warnke, V., Nöth, E., Buckow, J., Huber, R., and Niemann, H. (1999). Automatic Annotation and Classification of Phrase Accents in Spontaneous Speech. In Proceedings of the European Conference on Speech Communication and Technology (EUROSPEECH 99), 519–522.
Google Scholar
Block, H. (1997). The Language Components in Verbmobil. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, volume 1, 79–82.
Google Scholar
Heine, J., and Bos, J. Discourse and Dialog Semantics for Translation. In this volume.
Google Scholar
Jekat, S., Klein, A., Maier, E., Maleck, I., Mast, M., and Quantz, J. (1995). Dialogue Acts in Verbmobil. Verbmobil Report 65.
Google Scholar
Kiefer, B., Krieger, H.-U., and Nederhof, M.-J. Efficient and Robust HPSG Parsing of Word Graphs. In this volume.
Google Scholar
Kießling, A. (1997). Extraktion und Klassifikation prosodischer Merkmale in der automatischen Sprachverarbeitung. Berichte aus der Informatik. Aachen: Shaker Verlag.
Google Scholar
Kipp, M., Alexandersson, J., Reithinger, N., and Engel, R. Dialog Processing. In this volume.
Google Scholar
Kompe, R. (1997). Prosody in Speech Understanding Systems. Lecture Notes for Artificial Intelligence. Berlin: Springer-Verlag.
Book Google Scholar
Mast, M., Maier, E., and Schmitz, B. (1995). Criteria for the Segmentation of Spoken Input into Individual Utterances. Verbmobil Report 97.
Google Scholar
Klüter, A., Ndiaye, A., and Kirchmann H. Verbmobil from a Software Engineering Point of View: System Design and Software Integration. In this volume.
Google Scholar
Price, P., Ostendorf, M., Shattuck-Hufnagel, S., and Fong, C. (1991). The Use of Prosody in Syntactic Disambiguation. Journal of the Acoustic Society of America 90:2956–2970.
Article Google Scholar
Reithinger, N., and Engel, R. Robust Content Extraction for Translation and Dialog Processing. In this volume.
Google Scholar
Schukat-Talamazzini, E., Gallwitz, F., Harbeck, S., and Warnke, V. (1997). Rational Interpolation of Maximum Likelihood Predictors in Stochastic Language Modeling. In Proc. European Conf. on Speech Communication and Technology, volume 5, 2731–2734.
Google Scholar
Searle, J. (1969). Speech Acts. An Essay in the Philosophy of Language. Cambridge: Cambridge University Press.
Book Google Scholar
Shriberg, E., Bates, R., Taylor, P., Stolcke, A., Jurafsky, D., Ries, K., Cocarro, N., Martin, R., Meteer, M., and Ess-Dykema, C. V. (1998). Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech? Language and Speech 41:439–487.
Google Scholar
Spilker, J., Klarner, M., and Görz, G. Processing Self Corrections in a Speech-to-Speech System. In this volume.
Google Scholar
Vogel, S., Och, F.J., Tillmann, C., Niessen, S., Sawaf, H., and Ney, H. Statistical Methods for Machine Translation. In this volume.
Google Scholar
Wang, M., and Hirschberg, J. (1992). Automatic Classification of Intonational Phrase Boundaries. Computer Speech & Language 6(2):175–196.
Article Google Scholar
Warnke, V., Gallwitz, F., Batliner, A., Buckow, J., Huber, R., Nöth, E., and Höthker, A. (1999). Integrating Multiple Knowledge Sources for Word Hypotheses Graph Interpretation. In Proceedings of the European Conference on Speech Communication and Technology (EUROSPEECH 99), 235–239.
Google Scholar
Wightman, C. (1992). Automatic Detection of Prosodic Constituents. PhD thesis, Boston University Graduate School.
Google Scholar
Zell, A., Mache, N., Sommer, T., and Korb, T. (1991a). Design of the SNNS Neural Network Simulator. In Proceedings of the Österreichische Artificial-Intelligence-Tagung, Informatik-Fachberichte 287, 93–102. Springer Verlag.
Google Scholar
Zell, A., Mache, N., Sommer, T., and Korb, T. (1991b). The SNNS Neural Network Simulator. In Proceedings of the 15. Fachtagung für Künstliche Intelligenz, 254–263. Springer Verlag.
Google Scholar

Download references

Author information

Authors and Affiliations

Lehrstuhl für Mustererkennung, Universität Erlangen-Nürnberg, Germany
Anton Batliner, Jan Buckow, Heinrich Niemann, Elmar Nöth & Volker Warnke

Authors

Anton Batliner
View author publications
You can also search for this author in PubMed Google Scholar
Jan Buckow
View author publications
You can also search for this author in PubMed Google Scholar
Heinrich Niemann
View author publications
You can also search for this author in PubMed Google Scholar
Elmar Nöth
View author publications
You can also search for this author in PubMed Google Scholar
Volker Warnke
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

German Research Center for Artificial Intelligence (DFKI), Stuhlsatzenhausweg 3, 66123, Saarbrücken, Germany
Wolfgang Wahlster
Department of Computer Science, University of Saarland, Stuhlsatzenhausweg 3, 66123, Saarbrücken, Germany
Wolfgang Wahlster

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Batliner, A., Buckow, J., Niemann, H., Nöth, E., Warnke, V. (2000). The Prosody Module. In: Wahlster, W. (eds) Verbmobil: Foundations of Speech-to-Speech Translation. Artificial Intelligence. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-04230-4_8

Download citation

DOI: https://doi.org/10.1007/978-3-662-04230-4_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-08730-1
Online ISBN: 978-3-662-04230-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics