Abstract
Traditional concatenative speech synthesizers equipped with a small amount of speech segments suffer from the lack of naturalness. On the other hand, corpus-based speech synthesizers are able to produce much more natural speech. This paper presents a comparison of two new unit-selection methods in the corpus-based speech synthesis. An experimental comparison of comprehensibility and naturalness of all three approaches is provided here. The results are compared with one widely-used unit-selection method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Batůšek, R.: An objective measure for assessment of the concatenative tts segment inventories. In: Proceedings of Eurospeech 2001 — Scandinavia, Aalborg, Denmark (September 2001)
Batůšek, R.: Symbolic segment dissimilarity measure and its applications in speech synthesis. In: Proceedings of IEEE 2002 Workshop on Speech Synthesis, Santa Monica, USA (September 2002)
Black, A.W., Campbell, N.: Optimising selection of units from speech databases for concatenative synthesis. In: Eurospeech, pp. 581–584 (1995)
Hunt, A., Black, A.W.: Unit selection in a concatenative speech synthesis system using a large speech database. In: Proceedings of ICASSP 1996, pp. 373–376, Atlanta, Georgia, USA (1996)
Sagisaka, Y.: Speech synthesis by rule using an optimal selection of non-uniform synthesis units. In: Proceedings of ICASSP 1998, pp. 679–682, NewYork, USA (1988)
Yi, J., Glass, J., Hetherington, I.: A flexible, scalable finite-state transducer architecture for corpus-based concatenative speech synthesis. In: Proceedings of ICSLP, Beijing, China (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Batůšek, R., Gaura, P. (2003). A Comparison of Unit Selection Techniques in Limited Domain Speech Synthesis. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2003. Lecture Notes in Computer Science(), vol 2807. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39398-6_35
Download citation
DOI: https://doi.org/10.1007/978-3-540-39398-6_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20024-6
Online ISBN: 978-3-540-39398-6
eBook Packages: Springer Book Archive