Abstract
An extensive rule set for grapheme-to-allophone conversion of Slovene texts has been defined and evaluated. Another rule set has been developed for pronunciation of names. The efficiency of both S5 rule sets was compared to the one of the Onomastica rule set on two manually transcribed test data sets.
A performance test applied on the S5 pronunciation dictionary showed error rates of about 30% in the stress assignment and consequently in the phonetic transcription. In case stress assignment and the transcriptions of graphemic /e/ and /o/ in stressed syllables had been marked in advance a transcription success rate of nearly 100% was achieved both on names and on standard words with the S5 names rule sets and the S5 standard words rule set, respectively.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Gros, J.: Samodejno pretvarjanje besedil v govor. PhD Thesis. University of Ljubljana (1997).
Lindstrom, A., Ljungquvist, M., Gustafson, K.: A modular architecture supporting multiple hypotheses for conversion of text to phonetic and linguistic entities. Proceedings of the EUROSPEECH93. Berlin (1993) 1463–1466.
Toporišić, J.: Slovenska Slovenica (Slovene Grammar). Založba Obzorja Maribor (1991) (in Slovene).
Gros, J., Pavešić, N., Mihelić, F.: A text-to-speech system for the Slovenian language. Proceedings of the EUSIPCO’96, Trieste (1996) 1043–1046.
Hribar, J.: Sinteza umetnega govora iz teksta. MSc Thesis. University of Ljubljana (1984).
Weilguny, S.: Grafemsko-fonsmeki modul za sintezo izoliranih besed slovenskega jezika. MSc Thesis. University of Ljubljana (1993).
Belhoula, K., Kraft, V., Rinscheid, A., Ruehl, H.W.: Extension of a TTS system to rule-based pronunciation of names. Proceedings of the CRIM/FORWISSWorkshop on Progress and Prospects of Speech Research and Technology. Munich (1994) 249–251.
ONOMASTICA-Copernicus Database. CD-ROM. EU Project COP-58. Distributed by the European Language Resources Association. ELRA (1997).
Kačič, Z.: Definiranje leksikona izgovarjav lastnih imen za slovenski jezik. Proceedings of the ERK’98 Conference. Portorož Slovenia (1998) 185–188.
Dedina, M.J., Nusbaum, H.C.: Pronounce: a program for pronunciation by analogy. Computer Speech and Language 5 (1991) 55–64.
Mannell R., Clark, J.E.: Text-to-speech rule and dictionary development. Speech Communication 6 (1987) 317–324.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gros, J., Mihelić, F., Pavešić, N. (1999). Rules for Automatic Grapheme-to-Allophone Transcription in Slovene. In: Matousek, V., Mautner, P., Ocelíková, J., Sojka, P. (eds) Text, Speech and Dialogue. TSD 1999. Lecture Notes in Computer Science(), vol 1692. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48239-3_44
Download citation
DOI: https://doi.org/10.1007/3-540-48239-3_44
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66494-9
Online ISBN: 978-3-540-48239-0
eBook Packages: Springer Book Archive