Abstract
In this paper we present a novel approach, called "Text to Pronunciation (TtP)", for the proper normalization of Non-Standard Words (NSWs) in unrestricted texts. The methodology handles inflection so that the expanded NSWs remain consistent with the syntactic structure of the utterances they belong to. Moreover, to achieve an augmented auditory representation of NSWs in Text-to-Speech (TtS) systems, we couple the standard normalizer with: i) a language generator that compiles pronunciation formats and ii) VoiceXML attributes that guide the underlying TtS to imitate the human speaking style in the case of numbers. To evaluate the model for the Greek language we used a 158K-word corpus containing 4499 numerical expressions. We achieved an internal error rate of 7.67%; however, only 1.02% were perceivable errors, due to the nature of the language.
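To illustrate the inflection issue the abstract refers to, the following is a minimal sketch of gender-aware number expansion for Greek. It is not the TtP method itself: the function names, the tiny table of cardinals (nominative forms for 1 to 5 only), and the `noun_gender` lookup are all assumptions introduced here for illustration.

```python
# Toy illustration only; NOT the paper's TtP implementation.
# Nominative forms of the Greek cardinals 1-5, keyed by grammatical gender.
CARDINALS = {
    1: {"masc": "ένας", "fem": "μία", "neut": "ένα"},
    2: {"masc": "δύο", "fem": "δύο", "neut": "δύο"},
    3: {"masc": "τρεις", "fem": "τρεις", "neut": "τρία"},
    4: {"masc": "τέσσερις", "fem": "τέσσερις", "neut": "τέσσερα"},
    5: {"masc": "πέντε", "fem": "πέντε", "neut": "πέντε"},
}


def expand_number(token, gender):
    """Return the word form of a digit token that agrees with `gender`.

    Numbers outside the toy table are returned unchanged.
    """
    value = int(token)
    if value not in CARDINALS:
        return token
    return CARDINALS[value][gender]


def normalize(text, noun_gender):
    """Expand digit tokens so they agree with the noun that follows them.

    `noun_gender` is a hypothetical lookup from a noun's surface form to
    "masc", "fem" or "neut"; a real system would obtain this from
    morphological or syntactic analysis. Unknown nouns default to neuter.
    """
    words = text.split()
    out = []
    for i, word in enumerate(words):
        if word.isascii() and word.isdigit():
            following = words[i + 1] if i + 1 < len(words) else ""
            gender = noun_gender.get(following, "neut")
            out.append(expand_number(word, gender))
        else:
            out.append(word)
    return " ".join(out)


genders = {"παιδιά": "neut", "γυναίκες": "fem", "άντρας": "masc"}
print(normalize("3 παιδιά", genders))
print(normalize("3 γυναίκες", genders))
print(normalize("1 άντρας", genders))
```

The same digit "3" is rendered differently depending on the gender of the following noun, which is the kind of agreement a normalizer for an inflected language has to resolve; case and larger numbers are deliberately left out of this sketch.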
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xydas, G., Karberis, G., Kouroupetroglou, G. (2004). Text Normalization for the Pronunciation of Non-standard Words in an Inflected Language. In: Vouros, G.A., Panayiotopoulos, T. (eds) Methods and Applications of Artificial Intelligence. SETN 2004. Lecture Notes in Computer Science, vol 3025. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24674-9_41
DOI: https://doi.org/10.1007/978-3-540-24674-9_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21937-8
Online ISBN: 978-3-540-24674-9
eBook Packages: Springer Book Archive