Abstract
The domain of spoken language technologies ranges from speech input and output systems to complex understanding and generation systems, including multi-modal systems of widely differing complexity (such as automatic dictation machines) and multilingual systems (for example, automatic dialogue and translation systems). The definition of standards and evaluation methodologies for such systems involves the specification and development of highly specific spoken language corpus and lexicon resources, and measurement and evaluation tools [5]. This paper presents the MobiLuz spoken resources of the Slovene language, which will be made freely available for research purposes in speech technology and linguistics.
References
Brants, T. (2000): TnT - A Statistical Part-of-Speech Tagger. Proceedings of the ANLP-NAACL, in print, Seattle.
Dobrišek S., Kačič Z., Gros J., Horvat B. and Mihelič R. (1996): An Initiative for Standardisation of Phonetic Transcription of Slovenian Speech. Proceedings of the Fifth Electrotechnical and Computer Science Conference ERK’96, pp. 247–250, Portorož, Slovenia.
Dimitrova, L., Erjavec, T., Ide, N., Kaalep, H.J., Petkevič, V. and Tufis, D. (1998): Multext-East: Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European Languages. COLING-ACL’98 Proceedings, pp. 315–319.
Dobrišek S., Gros J., Mihelič F. and Pavešić N. (1998): Recording and Labelling of the GOPOLIS Slovenian Speech Database. Proceedings of the First International Conference on Language Resources and Evaluation, pp. 1089–1096, Granada, Spain.
EAGLES Handbook (1997): Handbook of Standards and Resources for Spoken Language Systems. Editors D. Gibbon, Roger Moore and Richard Winski. Berlin: Mouton de Gruyter.
Erjavec T. (1998): The MULTEXT-East Slovene Lexicon. Proceedings of the ERK’98 Conference, Portorož, Slovenia, pp. 189–192.
Gros J., Mihelič F. and Pavešić N. (1995): Sentence Hypothesisation Using N-Gram Models. In Proceedings of the Fourth European Conference on Speech Communication and Technology, pp. 1759–1762, Madrid, Spain.
Gros J., Ipšić I., Mihelič F. and Pavešić N. (1996): Segmentation and labelling of Slovenian diphone inventories. COLING’96, pp. 298–303, Copenhagen, Denmark.
Gros, J., Pavešić, N. and Mihelič, F. (1997): Text-to-speech synthesis: a complete system for the Slovenian language. Journal of Computing and Information Technology. 5(1). pp. 11–19.
Ide, N., Tufis, D. and Erjavec, T. (1998): Development and Assessment of Common Lexical Specifications for Six Central and Eastern European Languages. Proceedings of the First International Conference on Language Resources and Evaluation, LREC’98, Granada, pp. 233–240.
Ipšić I., Mihelič F., Dobrišek S., Gros J. and Pavešić N. (1998): An overview of the spoken queries in European languages: the Slovenian spoken dialogue system. Proceedings of the scientific conference Artificial Intelligence in Industry from Theory to Practice and 3rd SQEL Workshop on Multi-Lingual Information Retrieval Dialogues, High Tatras, Slovakia, pp. 431–438.
Kačič Z., Horvat B. and Derlič R. (1994): Zasnova baze izgovorjav slovenskega jezika SNABI. Proceedings of the ERK’94, Portorož, Slovenia.
Kačič Z. and Horvat B. (1998): Izgradnja infrastrukture, potrebne za razvoj govorne tehnologije za slovenski jezik. Proceedings of the Conference on Language Technologies for the Slovene Language. Ljubljana. pp. 100–104.
Kaiser J. and Kačič Z. (1998): Development of Slovenian SpeechDat Database. Proceedings of the Workshop on Speech Database Development for Central and Eastern European Languages, Granada, Spain.
Sperberg-McQueen, C.M., and Burnard, L., eds. (1994): Guidelines for Electronic Text Encoding and Interchange. Chicago and Oxford.
Šuštaršič R., Komar S. and Petek B. (1998): Slovene IPA Symbols, Illustrations of the IPA.
Zemljak M., Kačič Z., Dobrišek S. and Gros J. (2000): A Machine-readable Phonetic Transcription of the Slovene Speech, in preparation.
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gros, J., Mihelič, F., Dobrišek, S., Erjavec, T., Žganec, M. (2000). Rules for Automatic Grapheme-to-Allophone Transcription in Slovene. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2000. Lecture Notes in Computer Science, vol 1902. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45323-7_29
DOI: https://doi.org/10.1007/3-540-45323-7_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41042-3
Online ISBN: 978-3-540-45323-9
eBook Packages: Springer Book Archive