Minimal Dependency Translation: A Framework for Computer-Assisted Translation for Under-Resourced Languages

  • Michael Gasser
Conference paper
Part of the Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering book series (LNICST, volume 244)

Abstract

This paper introduces Minimal Dependency Translation (MDT), an ongoing project to develop a rule-based framework for creating rudimentary bilingual lexicon-grammars for machine translation and computer-assisted translation into and out of under-resourced languages. It also describes initial steps towards an implementation of MDT for English-to-Amharic translation. The basic units in MDT, called groups, are headed multi-item sequences. In addition to wordforms, groups may contain lexemes, syntactic-semantic categories, and grammatical features. Each group is associated with one or more translations, each of which is a group in a target language. During translation, constraint satisfaction is used to select a set of source-language groups for the input sentence and to sequence the words in the associated target-language groups.
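
To make the notion of a group more concrete, the following is a minimal illustrative sketch, not the MDT implementation. It assumes that groups match contiguous tokens, that a category item is written with a leading "$" (a notation invented here), and that selection can be approximated by an exhaustive search for a non-overlapping set of groups covering the most tokens, which stands in for the constraint satisfaction the paper describes. All names (Group, match, select) and the placeholder translation strings are hypothetical.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Group:
    """Hypothetical stand-in for an MDT group: a headed multi-item sequence."""
    items: tuple             # wordforms, or "$category" items (notation assumed here)
    head: int                # index of the head item within `items`
    translations: tuple = ()  # placeholder labels for target-language groups


def match(group, tokens, categories):
    """Return every start position where the group matches contiguous tokens."""
    n = len(group.items)
    starts = []
    for i in range(len(tokens) - n + 1):
        window = tokens[i:i + n]
        if all(
            tok in categories.get(item[1:], set()) if item.startswith("$")
            else item == tok
            for item, tok in zip(group.items, window)
        ):
            starts.append(i)
    return starts


def select(groups, tokens, categories):
    """Brute-force search (an assumption, not the paper's solver): pick the
    non-overlapping group instances that cover the most tokens, preferring
    fewer groups when coverage is tied."""
    candidates = [(g, s) for g in groups for s in match(g, tokens, categories)]
    best, best_cover = [], 0
    for r in range(1, len(candidates) + 1):
        for combo in combinations(candidates, r):
            covered = [p for g, s in combo for p in range(s, s + len(g.items))]
            if len(covered) == len(set(covered)) and len(covered) > best_cover:
                best, best_cover = list(combo), len(covered)
    return best


# Toy English-only lexicon; translation strings are placeholders, not real data.
categories = {"pronoun": {"she", "he"}}
groups = [
    Group(("kicked", "the", "bucket"), head=0, translations=("<target group: die>",)),
    Group(("the", "bucket"), head=1, translations=("<target group: bucket>",)),
    Group(("$pronoun",), head=0, translations=("<target group: pronoun>",)),
    Group(("kicked",), head=0, translations=("<target group: kick>",)),
]
tokens = "she kicked the bucket".split()
chosen = select(groups, tokens, categories)
print([g.items for g, _ in chosen])
```

In this toy lexicon the idiomatic three-word group competes with the literal one-word and two-word groups for the same tokens. The search prefers the smaller set of groups when coverage is equal, which is one simple way to favour longer multi-word units. It does not address sequencing the target-language words.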


Copyright information

© ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 2018

Authors and Affiliations

  1. Indiana University, Bloomington, USA
