Skip to main content

Speech Segmentation Aspects of Phone Transition Acoustical Modelling

  • Conference paper
  • First Online:
Book cover Text, Speech and Dialogue (TSD 1999)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1692))

Included in the following conference series:

  • 479 Accesses

Abstract

The paper presents our experiences with the phone transition acoustical models. The phone transition models were compared to the traditional context dependent phone models. We put special attention on the speech signal segmentation analysis to provide a better insight into certain segmentation effects when using the different acoustical models. Experiments with the HMM-based models were performed using the HTK toolkit, which was extended to provide proper state parameter tying for the phone transition models. All the model parameters were estimated on the GOPOLIS speech database. The annotation confusions concerning two-phone speech units are also discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Dobrišek, S. (1999). Analysis and Recognition of Phones in Speech Signals. Ph.D. Thesis in preparation, (In Slovenian). University of Ljubljana, Faculty of Electrical Engineering, Ljubljana Slovenia.

    Google Scholar 

  2. Dobrišek, S., Gros, J., Mihelič, F., and Pavešić, N. (1998), Recording and labelling of the GOPOLIS Slovenian speech database. Proc. 1st Int. Conf. on Language Resources & Evaluation, Vol. 2, ESCA, pp. 1089–1096.

    Google Scholar 

  3. Gros, J., Pavešić, N., Mihelič, F. (1997), Text-to-Speech Synthesis: A Complete System for the Slovenian Language. Jurnal of Computing and Information Technology, Vol. 5(1), pp. 11–19.

    Google Scholar 

  4. Young, S., Odell, J., Ollason, D., Vatchev, V., and Woodland, P. (1997), The HTK Book. Cambridge University, Entropic Cambridge Research Laboratory Ltd.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Dobrišek, S., Mihelič, F., Pavešić, N. (1999). Speech Segmentation Aspects of Phone Transition Acoustical Modelling. In: Matousek, V., Mautner, P., Ocelíková, J., Sojka, P. (eds) Text, Speech and Dialogue. TSD 1999. Lecture Notes in Computer Science(), vol 1692. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48239-3_45

Download citation

  • DOI: https://doi.org/10.1007/3-540-48239-3_45

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-66494-9

  • Online ISBN: 978-3-540-48239-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics