Speech Segmentation Aspects of Phone Transition Acoustical Modelling

Dobrišek, Simon; Mihelič, France; Pavešić, Nikola

doi:10.1007/3-540-48239-3_45

Simon Dobrišek³,
France Mihelič³ &
Nikola Pavešić³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1692))

Included in the following conference series:

International Workshop on Text, Speech and Dialogue

479 Accesses

Abstract

The paper presents our experiences with the phone transition acoustical models. The phone transition models were compared to the traditional context dependent phone models. We put special attention on the speech signal segmentation analysis to provide a better insight into certain segmentation effects when using the different acoustical models. Experiments with the HMM-based models were performed using the HTK toolkit, which was extended to provide proper state parameter tying for the phone transition models. All the model parameters were estimated on the GOPOLIS speech database. The annotation confusions concerning two-phone speech units are also discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Dobrišek, S. (1999). Analysis and Recognition of Phones in Speech Signals. Ph.D. Thesis in preparation, (In Slovenian). University of Ljubljana, Faculty of Electrical Engineering, Ljubljana Slovenia.
Google Scholar
Dobrišek, S., Gros, J., Mihelič, F., and Pavešić, N. (1998), Recording and labelling of the GOPOLIS Slovenian speech database. Proc. 1st Int. Conf. on Language Resources & Evaluation, Vol. 2, ESCA, pp. 1089–1096.
Google Scholar
Gros, J., Pavešić, N., Mihelič, F. (1997), Text-to-Speech Synthesis: A Complete System for the Slovenian Language. Jurnal of Computing and Information Technology, Vol. 5(1), pp. 11–19.
Google Scholar
Young, S., Odell, J., Ollason, D., Vatchev, V., and Woodland, P. (1997), The HTK Book. Cambridge University, Entropic Cambridge Research Laboratory Ltd.
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Electrical Engineering, Laboratory of Artificial Perception, University of Ljubljana, Tržaška 25, SI-1000, Ljubljana, Slovenia
Simon Dobrišek, France Mihelič & Nikola Pavešić

Authors

Simon Dobrišek
View author publications
You can also search for this author in PubMed Google Scholar
France Mihelič
View author publications
You can also search for this author in PubMed Google Scholar
Nikola Pavešić
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineerig, Faculty of Applied Sciences, University of West Bohemia in Plzeň, Universitní 22, 306 14, Pizeň, Czech Republic
Václav Matousek , Pavel Mautner & Jana Ocelíková , &
Department of Programming Systems and Communication, Faculty of Informatics, Masaryk University Brno, Botanická 68a, 602 00, Brno, Czech Republic
Petr Sojka

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dobrišek, S., Mihelič, F., Pavešić, N. (1999). Speech Segmentation Aspects of Phone Transition Acoustical Modelling. In: Matousek, V., Mautner, P., Ocelíková, J., Sojka, P. (eds) Text, Speech and Dialogue. TSD 1999. Lecture Notes in Computer Science(), vol 1692. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48239-3_45

Download citation

DOI: https://doi.org/10.1007/3-540-48239-3_45
Published: 01 October 1999
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66494-9
Online ISBN: 978-3-540-48239-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics