Skip to main content

Rules for Automatic Grapheme-to-Allophone Transcription in Slovene

  • Conference paper
  • First Online:
Book cover Text, Speech and Dialogue (TSD 2000)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1902))

Included in the following conference series:

  • 361 Accesses

Abstract

The domain of spoken language technologies ranges from speech input and output systems to complex understanding and generation systems, including multi-modal systems of widely differing complexity (such as automatic dictation machines) and multilingual systems (for example, automatic dialogue and translation systems). The definition of standards and evaluation methodologies for such systems involves the specification and development of highly specific spoken language corpus and lexicon resources, and measurement and evaluation tools [5]. This paper presents the MobiLuz spoken resources of the Slovene language, which will be made freely available for research purposes in speech technology and linguistics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Brants, T. (2000): TnT-A Statistical Part-of-Speech Tagger. Proceedings of the ANLP-NAACL, in print, Seattle.

    Google Scholar 

  2. Dobrišek S., Kačič Z., Gros J., Horvat B. and Mihelič R, (1996): An Initiative for Standardisation of Phonetic Transcription of Slovenian Speech, Proceedings of the Fifth Electro technical and Computer Science Conference ERK’96, pp. 247–250, Portorož, Slovenia, 1996.

    Google Scholar 

  3. Dimitrova, L., Erjavec, T. Ide, N. Kaalep, H.J., Petkevič, V. and Tufis, D. (1998): Multext-East: Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European Languages. COLING-ACL’ 98 Proceedings, pp. 315–319.

    Google Scholar 

  4. Dobrisek S., Gros J., Mihelič F. and Pavešić N. (1998): Recording and Labelling of the GOPOLIS Slovenian Speech Database, Proceedings of the First International Conference on Language Resources and Evaluation, pp. 1089–1096. Granada, Spain.

    Google Scholar 

  5. EAGLES Handbook (1997): Handbook of Standards and Resources for Spoken Language Systems. Editors D. Gibbon, Roger Moore and Richard Winski. Berlin: Mouton de Gruyter.

    Google Scholar 

  6. Erjavec T. (1998): The MULTEXT-East Slovene Lexicon. Proceedings of the ERK’98 Conference, Portorož, Slovenia, pp. 189–192.

    Google Scholar 

  7. Gros J., Mihelič F. and Pavešić N.,(1995): Sentence Hypothesisation Using Ng-Gram Models, In Proceedings of the the Fourth European Conference On Speech Communication and Technology, pp. 1759–1762, Madrid, Spain.

    Google Scholar 

  8. Gros J., Ipšić I., Mihelič F. and Pavešić N. (1996): Segmentation and labelling of Slovenian diphone inventories, COLING’ 96, pp. 298–303, Copenhagen, Denmark.

    Google Scholar 

  9. Gros, J., Pavešić, N. and Mihelič, F. (1997): Text-to-speech synthesis: a complete system for the Slovenian language. Journal of Computing and Information Technology. 5(1). pp. 11–19.

    Google Scholar 

  10. Ide, N., Tufis, D. and Erjavec, T. (1998): Development and Assessment of Common Lexical Specifications for Six Central and Eastern European Languages. Proceedings of the First International Conference on Language Resources and Evaluation, LREC’ 98, Granada, pp. 233–240.

    Google Scholar 

  11. Ipšić I., Mihelič F., Dobrišek S., Gros J. and Pavešić N. (1998): An overview of the spoken queries in European languages: the Slovenian spoken dialogue system. Proceedings of the scientific conference Artificial Intelligence in Industry from Theory to Practice and 3rd SQEL Workshop on Multi-Lingual Information Retrieval Dialogues, High Tatras, Slovakia, pp. 431–438.

    Google Scholar 

  12. Kačič Z. and Horvat B. and Derlič R. (1994): Zasnova baze izgovorjav slovenskega jezika SNABI. Proceedings of the ERK’ 94. Portorož, Slovenia.

    Google Scholar 

  13. Kačič Z. and Horvat B. (1998): Izgradnja infrastrukture, potrebne za razvoj govorne tehnologije za slovenski jezik. Proceedings of the Conference on Language Technologies for the Slovene Language. Ljubljana. pp. 100–104.

    Google Scholar 

  14. Kaiser J. and Kačič Z. (1998): Development of Slovenian SpeechDat Database. Proceedings of the Workshop On Speech Database Development for Central and Eastern European Languages, Granada, Spain, 1998.

    Google Scholar 

  15. Sperberg-McQueen, C.M., and Burnard, L., eds. (1994): Guidelines for Electronic Text Encoding and Interchange. Chicago and Oxford.

    Google Scholar 

  16. Šuštaršič R., Komar S. and Petek B. (1998): Slovene IPA Symbols, Illustrations of the IPA.

    Google Scholar 

  17. Zemljak M., Kačič Z., Dobrišek S. and Gros J. (2000): A Machine-readable Phonetic Transcription of the Slovene Speech, in preparation.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gros, J., Mihelič, F., Dobrišek, S., Erjavec, T., Žganec, M. (2000). Rules for Automatic Grapheme-to-Allophone Transcription in Slovene. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2000. Lecture Notes in Computer Science(), vol 1902. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45323-7_29

Download citation

  • DOI: https://doi.org/10.1007/3-540-45323-7_29

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41042-3

  • Online ISBN: 978-3-540-45323-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics