FlexVoice: A Parametric Approach to High-Quality Speech Synthesis

Balogh, György; Dobler, Ervin; Grőbler, Tamás; Smodics, Béla; Szepesvári, Csaba

doi:10.1007/3-540-45323-7_32

FlexVoice: A Parametric Approach to High-Quality Speech Synthesis

György Balogh³,
Ervin Dobler³,
Tamás Grőbler³,
Béla Smodics³ &
…
Csaba Szepesvári³

Conference paper
First Online: 01 January 2002

364 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1902))

Abstract

FlexVoice, an integrated text-to-speech (TTS) system is presented in this paper. Its most distinctive feature is its low memory and CPU load while preserving the high quality of leading TTS systems. FlexVoice uses a hybrid approach that combines diphone concatenation with LPC-based parametric synthesis. Major improvements of speech quality are achieved by the careful design of each module at all synthesis levels (such as selection of training data for the various machine learning methods and that of the basic synthesis units for the parametric synthesiser). FlexVoice currently supports US English with two male and two female voices.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Dutoit, T.: An Introduction to Text-To-Speech Synthesis. Kluwer Acad. Publ., Dordrecht (1997).
Google Scholar
Klatt, D.H., Klatt, L.C.: Analysis, Synthesis, and Perception of Voice Quality Variations among Female and Male Talkers. J. Acoust. Soc. Am. 87 (1990) 820–857.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Mindmaker Ltd., Budapest, Hungary
György Balogh, Ervin Dobler, Tamás Grőbler, Béla Smodics & Csaba Szepesvári

Authors

György Balogh
View author publications
You can also search for this author in PubMed Google Scholar
Ervin Dobler
View author publications
You can also search for this author in PubMed Google Scholar
Tamás Grőbler
View author publications
You can also search for this author in PubMed Google Scholar
Béla Smodics
View author publications
You can also search for this author in PubMed Google Scholar
Csaba Szepesvári
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Informatics Department of Programming Systems and Communication, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Petr Sojka
Faculty of Informatics, Department of Information Technologies, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Ivan Kopeček & Karel Pala &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Balogh, G., Dobler, E., Grőbler, T., Smodics, B., Szepesvári, C. (2000). FlexVoice: A Parametric Approach to High-Quality Speech Synthesis. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2000. Lecture Notes in Computer Science(), vol 1902. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45323-7_32

Download citation

DOI: https://doi.org/10.1007/3-540-45323-7_32
Published: 15 August 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41042-3
Online ISBN: 978-3-540-45323-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics