A Comparison of Unit Selection Techniques in Limited Domain Speech Synthesis

Batůšek, Robert; Gaura, Pavel

doi:10.1007/978-3-540-39398-6_35

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2807)

Included in the following conference series:

International Conference on Text, Speech and Dialogue

Abstract

Traditional concatenative speech synthesizers equipped with a small number of speech segments suffer from a lack of naturalness. Corpus-based speech synthesizers, on the other hand, are able to produce much more natural speech. This paper presents two new unit-selection methods for corpus-based speech synthesis and compares them with one widely used unit-selection method. An experimental comparison of the comprehensibility and naturalness of all three approaches is provided.

Author information

Authors and Affiliations

Faculty of Informatics, Masaryk University, Brno, Czech Republic
Robert Batůšek & Pavel Gaura

Editor information

Editors and Affiliations

Department of Computer Science, University of West Bohemia in Pilsen, Univerzitni 8, 30614, Plzen, Czech Republic
Václav Matoušek & Pavel Mautner

Cite this paper

Batůšek, R., Gaura, P. (2003). A Comparison of Unit Selection Techniques in Limited Domain Speech Synthesis. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2003. Lecture Notes in Computer Science, vol 2807. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39398-6_35

DOI: https://doi.org/10.1007/978-3-540-39398-6_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20024-6
Online ISBN: 978-3-540-39398-6
eBook Packages: Springer Book Archive