Skip to main content

Implications of Acoustic Variation for the Segmentation of the Czech Trill /r/

  • Conference paper
Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5641))

  • 1571 Accesses

Abstract

The Czech alveolar sonorant trill /r/, like liquids generally, constitutes a challenge from the point of view of locating its boundaries in the acoustic stream. As it is desirable to label and segment a phonetic corpus uniformly and also to facilitate a high degree of inter-labeller agreement, rules for specifying speechsound boundaries should be unambiguous and as straightforward as possible. In this study, we examined various acoustic forms of Czech /r/ – from the trill and a flap to strongly reduced instances – and their implications for segmentation. The above-mentioned requirements resulted in the necessity to treat the segmentation of intervocalic items of /r/ differently from items occurring in consonant clusters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Wester, M., Kessens, J.M., Cucchiarini, C., Strik, H.: Obtaining phonetic transcriptions: a comparison between expert listeners and a continuous speech recognizer. Language and Speech 44, 377–403 (2001)

    Article  Google Scholar 

  2. Volín, J., Skarnitzl, R., Pollák, P.: Confronting HMM-based Phone Labelling with Human Evaluation of Speech Production. In: Proceedings of Interspeech 2005, pp. 1541–1544. ISCA, Lisbon (2005)

    Google Scholar 

  3. Kominek, J., Bennett, C., Black, A.W.: Evaluating and Correcting Phoneme Segmentation for Unit Selection Synthesis. In: Proceedings of Eurospeech 2003, pp. 313–316. ISCA, Geneva (2003)

    Google Scholar 

  4. Pitt, M., Johnson, K., Hume, E., Kiesling, S., Raymond, W.: The Buckeye corpus of conversational speech: labeling conventions and a test of transcriber reliability. Speech Communication 45, 89–95 (2005)

    Article  Google Scholar 

  5. Wesenick, M.-B., Kipp, A.: Estimating the quality of phonetic transcriptions and segmentations of speech signals. In: Proceedings of ICSLP 1996, pp. 129–132. ISCA, Philadelphia (1996)

    Google Scholar 

  6. Machač, P., Skarnitzl, R., Volín, J.: Inter-labeller agreement in segmental boundary placement. In: Vích, R. (ed.) 17th Czech-German Workshop - Speech Processing, Prague, pp. 57–61 (2007)

    Google Scholar 

  7. Pauws, S., Kamp, Y., Willems, L.: A hierarchical method of automatic speech segmentation for synthesis applications. Speech Communication 19, 207–220 (1996)

    Article  Google Scholar 

  8. Boersma, P., Weenink, D.: Praat, version 4.4.20 (2006), http://www.praat.org

  9. Palková, Z., Volín, J.: The role of F0 contours in determining foot boundaries in Czech. In: Proceedings of the 15th ICPhS, pp. 1783–1786. Organizing Committee, Barcelona (2003)

    Google Scholar 

  10. Machač, P.: Stabilita zvukových charakteristik fonémů ve spontánních mluvených projevech. In: Hladká, Z., Karlík, P. (eds.) Čeština - univerzália a specifika 5, pp. 427–435. Lidové noviny, Praha (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Machač, P. (2009). Implications of Acoustic Variation for the Segmentation of the Czech Trill /r/. In: Esposito, A., Vích, R. (eds) Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions. Lecture Notes in Computer Science(), vol 5641. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03320-9_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-03320-9_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-03319-3

  • Online ISBN: 978-3-642-03320-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics