Abstract
We describe here the process of implementing a morphological generator for Romanian, together with references to the resources and previous work that made this achievement possible. The generator receives a lemma and a morpho-lexical description of the intended word-form and it is supposed to generate that very word-form. It uses a paradigmatic description of Romanian morphology and a large dictionary of lemmas associated with their inflectional paradigms. The work is motivated by an ongoing example-based machine translation system and by its utility in extending the lexical resources created in our group. The generation process distinguishes two major situations, depending on the lemma being recorded or not into the generator’s dictionary. In the first case, the generated inflected form is guaranteed to be a real word of Romanian language while in the second case the inflected form might be a potential but not extant word of Romanian.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Brown, P., Cocke, J., Della Pietra, S., Della Pietra, V., Jelinek, F., Mercer, R.: A statistical approach to French/English translation. In: Second International Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages, p. 16. Carnegie Mellon University, Center for Machine Translation, Pittsburgh (1988)
Tufis, D.: Tiered tagging and combined language models classifiers. In: Matoušek, V., Mautner, P., Ocelíková, J., Sojka, P. (eds.) TSD 1999. LNCS (LNAI), vol. 1692, pp. 28–33. Springer, Heidelberg (1999)
Ion, R.: Methods for Word Sense Disambiguation; Applications for English and Romanian. Phd Thesis, Romanian Academy (2007)
Tufiş, D., Ion, R., Ceauşu, A.l., Ştefănescu, D.: Improved Lexical Alignment by Combining Multiple Reified Alignments. In: Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2006), Trento, Italy, pp. 153–160 (2006)
Erjavec, T.: MULTEXT-East Version 3: Multilingual Morphosyntactic Specifications, Lexicons and Corpora. In: Proceedings of the Fourth International Conference on Language Resources and Evaluation, LREC 2004, ELRA, Paris, pp. 1535–1538 (2004)
Tufiş, D.: It Would Be Much Easier If WENT Were GOED. In: Somers, H., Wood, M.M. (eds.) Proceedings of the 4th European Conference of the Association for Computational Linguistics, Manchester (1989)
Tufiş, D.: Paradigmatic Morphology Learning. In: Miklosko, J. (ed.) Computers and Artificial Intelligence. VEDA Publishing House, Bratislava (1990)
Estival, D., Tufis, D., Popescu, O.: Devélopment d’outils et des données linguistiques pour le traîtement du langage naturel. Rapport Final – Projet EST (7RUPJO38421) ISSCO, Geneve (1994)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Irimia, E. (2009). ROG – A Paradigmatic Morphological Generator for Romanian. In: Vetulani, Z., Uszkoreit, H. (eds) Human Language Technology. Challenges of the Information Society. LTC 2007. Lecture Notes in Computer Science(), vol 5603. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04235-5_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-04235-5_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04234-8
Online ISBN: 978-3-642-04235-5
eBook Packages: Computer ScienceComputer Science (R0)