Skip to main content

Reliable Detection of Important Word Boundaries Using Prosodic Features

  • Conference paper
Text, Speech and Dialogue (TSD 2011)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6836))

Included in the following conference series:

  • 926 Accesses

Abstract

Natural input dialog systems for entering address data in modern GPS units demand a significantly more robust extraction of slot information. In previous work, prosody was used to detect phrase boundaries (PB) which separate particular address parts. Only input without filler words was used. In this work, carrier sentences are allowed. A boosting approach provides twenty strong prosodic features which model the characteristic of PBs. We introduce the concept of a prosodically marked word boundary (PMB), which enables a better location of the provided information in natural input. Our results on a dataset of 5883 input samples reveal that about 67% of the found PBs indicate a PMB, while most of the remaining boundaries occur within compound words.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Kaufhold, C., Nöth, E.: Using prosodic features for predicting phrase boundaries. In: Speech Prosody (2010)

    Google Scholar 

  2. Gorin, A.L., Riccardi, G., Wright, J.H.: How may I help you? Speech Communication 23, 113–127 (1997)

    Article  MATH  Google Scholar 

  3. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.: The WEKA Data Mining Software: An Update. SIGKDD Explorations 11 (2009)

    Google Scholar 

  4. Kießling, A.: Extraktion und Klassifikation prosodischer Merkmale in der automatischen Sprachverarbeitung. PhD thesis, Pattern Recognition Lab, University of Erlangen-Nuremberg (1996)

    Google Scholar 

  5. Batliner, A., Buckow, J., Niemann, H., Nöth, E., Warnke, V.: The prosody module. In: Wahlster, W. (ed.) Verbmobil: Foundations of Speech-to-Speech Translations, pp. 106–121. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  6. Wahlster, W.: Verbmobil: Foundations of Speech-to-Speech Translation. Springer, Germany (2000)

    Book  MATH  Google Scholar 

  7. Maier, A., Hönig, F., Zeissler, V., Batliner, A., Körner, E., Yamanaka, N., Ackerman, P., Nöth, E.: A language-independent feature set for the automatic evaluation of prosody. In: Interspeech (2009)

    Google Scholar 

  8. Hönig, F., Batliner, A., Weilhammer, K., Nöth, E.: Islands of failure: Employing word accent information for pronunciation quality assessment of English l2 learners. In: Interspeech (2009)

    Google Scholar 

  9. Selting, M.: Lists as embedded structures and the prosody of list construction as an interactional resource. Journal of Pragmatics 39, 483–526 (2007)

    Article  Google Scholar 

  10. Shukla, M., Nespor, M., Mehler, J.: An interaction between prosody and statistics in the segmentation of fluent speech. Cognitive Psychology 54, 1–32 (2007)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kaufhold, C., Stemmer, G., Nöth, E. (2011). Reliable Detection of Important Word Boundaries Using Prosodic Features. In: Habernal, I., Matoušek, V. (eds) Text, Speech and Dialogue. TSD 2011. Lecture Notes in Computer Science(), vol 6836. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23538-2_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-23538-2_33

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23537-5

  • Online ISBN: 978-3-642-23538-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics