Minimal Dependency Translation: A Framework for Computer-Assisted Translation for Under-Resourced Languages

  • Conference paper

Abstract

This paper introduces Minimal Dependency Translation (MDT), an ongoing project to develop a rule-based framework for creating rudimentary bilingual lexicon-grammars for machine translation and computer-assisted translation into and out of under-resourced languages, and describes initial steps towards an implementation of MDT for English-to-Amharic translation. The basic units in MDT, called groups, are headed multi-item sequences. In addition to wordforms, groups may contain lexemes, syntactic-semantic categories, and grammatical features. Each group is associated with one or more translations, each of which is a group in a target language. During translation, constraint satisfaction is used to select a set of source-language groups for the input sentence and to sequence the words in the associated target-language groups.
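As a rough illustration of the ideas in the abstract, the following Python sketch models a group as a headed sequence of wordforms with attached translations and selects a set of non-overlapping groups for an input sentence. It is not the MDT implementation: the names `Group`, `select_groups`, `translate`, and `LEXICON` are hypothetical, the target-side tokens are placeholders rather than real Amharic, groups here contain wordforms only (no lexemes, categories, or features), and an exhaustive search over candidate matches stands in for the constraint satisfaction the paper describes.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Group:
    """A headed multi-item sequence with one or more translations (toy version)."""
    items: tuple         # source-language wordforms, in order
    head: int            # index of the head within items
    translations: tuple  # each translation is a tuple of target-side tokens

    @property
    def head_word(self):
        return self.items[self.head]


def match_positions(group, tokens):
    """Return every start index where the group's items match contiguously."""
    n = len(group.items)
    return [i for i in range(len(tokens) - n + 1)
            if tuple(tokens[i:i + n]) == group.items]


def select_groups(tokens, lexicon):
    """Choose non-overlapping matches covering the most tokens with the fewest groups.

    Brute-force search, used here only as a stand-in for constraint satisfaction.
    """
    candidates = [(start, g) for g in lexicon
                  for start in match_positions(g, tokens)]
    best, best_score = [], (0, 0)
    for r in range(1, len(candidates) + 1):
        for combo in combinations(candidates, r):
            covered = [p for start, g in combo
                       for p in range(start, start + len(g.items))]
            if len(covered) != len(set(covered)):
                continue  # constraint: a token belongs to at most one group
            score = (len(covered), -len(combo))
            if score > best_score:
                best, best_score = list(combo), score
    return sorted(best, key=lambda c: c[0])


def translate(tokens, lexicon):
    """Emit the first translation of each selected group, in source order."""
    out = []
    for _, group in select_groups(tokens, lexicon):
        out.extend(group.translations[0])
    return out


# Hypothetical lexicon; target tokens are placeholders, not real Amharic.
LEXICON = [
    Group(("the", "boy"), 1, (("<boy-DEF>",),)),
    Group(("kicked", "the", "bucket"), 0, (("<die-PERF>",),)),
    Group(("kicked",), 0, (("<kick-PERF>",),)),
    Group(("the", "bucket"), 1, (("<bucket-DEF>",),)),
]

if __name__ == "__main__":
    print(translate("the boy kicked the bucket".split(), LEXICON))
```

Preferring fewer, larger groups lets the idiomatic multi-word group win over a word-by-word analysis in this toy example. Target-side word order is simply copied from the source order here, whereas the abstract states that sequencing the target-language words is itself handled by constraint satisfaction.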

Notes

  1. In the figures, heads are enclosed in brackets.

  2. For simplification, Amharic verb lexemes are given in their usual citation form, the third person singular masculine perfect form. In fact they are represented internally in terms of their abstract roots, the sequence of consonants that characterize words in Semitic languages.

  3. In fact, the system would also generate the (incorrect) feminine form of the verb in this case, since the group does not include the subject itself.

Author information

Corresponding author

Correspondence to Michael Gasser.

Copyright information

© 2018 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper

Cite this paper

Gasser, M. (2018). Minimal Dependency Translation: A Framework for Computer-Assisted Translation for Under-Resourced Languages. In: Mekuria, F., Nigussie, E., Dargie, W., Edward, M., Tegegne, T. (eds) Information and Communication Technology for Development for Africa. ICT4DA 2017. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 244. Springer, Cham. https://doi.org/10.1007/978-3-319-95153-9_20

  • DOI: https://doi.org/10.1007/978-3-319-95153-9_20

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-95152-2

  • Online ISBN: 978-3-319-95153-9

  • eBook Packages: Computer Science, Computer Science (R0)
