Abstract
Turkish belongs to the Turkic family of languages and these languages exhibit tremendous similarity when it comes to morphological and grammatical structure but have somewhat different lexicons owing to various historical, geographical, and cultural interactions with neighboring languages. In this chapter we briefly cover the similarities and differences of these languages and introduce a machine translation methodology that exploits the similarities among these languages. This methodology relies on rule-based and statistical components and can be applicable for not only Turkic languages but also any other cognate language pairs.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
For example, en.wikipedia.org/wiki/Turkiclanguages (Accessed Sept. 14, 2017).
- 2.
Note that the minor variants of these languages are not shown for the sake of clarity.
- 3.
Second person plural possessive agreement suffix.
References
Altıntaş K (2000) Turkish to Crimean-Tatar machine translation system. Master’s thesis, Bilkent University, Ankara
Altıntaş K, Güvenir HA (2003) Learning translation templates for closely related languages. In: Proceedings of knowledge-based intelligent information and engineering systems, Oxford, pp 756–762
Bouamor H, Alshikhabobakr H, Mohit B, Oflazer K (2014) A human judgement corpus and a metric for Arabic MT evaluation. In: Proceedings of EMNLP, Doha, pp 207–213
Corbi-Bellot AM, Forcada ML, Ortíz-Rojas S, Pérez-Ortiz JA, Ramírez-Sánchez G, Sánchez-Martínez F, Alegria I, Mayor A, Sarasola K (2005) An open-source shallow-transfer machine translation engine for the romance languages of Spain. In: Proceedings of EAMT, Budapest
Dvořák B, Homola P, Kuboň V (2006) Exploiting similarity in the MT into a minority language. In: Proceedings of the workshop on minority languages: “strategies for developing machine translation for minority languages”, Genoa, pp 59–64
El Kholy A, Habash N (2011) Automatic error analysis for morphologically rich languages. In: Proceedings of the MT summit, Xiamen, pp 225–232
Fatullayev A, Shagavatov S (2008) Turkish-Azerbaijani translation module of Dilmanc MT system. In: Proceedings of the second international conference on problems of cybernetics and informatics, Baku, Azerbaijan, pp 150–153
Garrido-Alenda A, Gilabert-Zarco P, Pérez-Ortiz JA, Pertusa-Ibáñez A, Ramírez-Sánchez G, Sánchez-Martínez F, Scalco MA, Forcada ML (2003) Shallow parsing for Portuguese-Spanish machine translation. In: Proceedings of the workshop on tagging and shallow processing of Portuguese, Lisbon, pp 135–144
Hajič J (1987) RUSLAN - an MT system between closely related languages. In: Proceedings of EACL, Copenhagen, pp 113–117
Hajič J, Hric J, Kuboň V (2000) Machine translation of very close languages. In: Proceedings of NAACL-ANLP, Seattle, WA, pp 7–12
Hajič J, Homola P, Kuboň V (2003) A simple multilingual machine translation system. In: Proceedings of the MT summit, New Orleans, LA
Hamzaoğlu İ (1993) Machine translation from Turkish to other Turkic languages and an implementation for the Azeri language. Master’s thesis, Boğaziçi University, Istanbul
Homola P, Kuboň V (2008) Improving machine translation between closely related romance languages. In: Proceedings of EAMT, Hamburg, pp 72–77
Karttunen L, Gaal T, Kempe A (1997) Xerox finite-state tool. Technical report, Xerox Research Centre Europe, Meylan
Kos K, Bojar O (2009) Evaluation of machine translation metrics for Czech as the target language. Prague Bull Math Ling 92:135–148
Nakov P, Tiedemann J (2012) Combining word-level and character-level models for machine translation between closely-related languages. In: Proceedings of ACL, Jeju, pp 301–305
Oflazer K (1994) Two-level description of Turkish morphology. Lit Ling Comput 9(2):137–148
Oflazer K, Çetinoğlu Ö, Say B (2004) Integrating morphology with multi-word expression processing in Turkish. In: Proceedings of the ACL workshop on multiword expressions: integrating processing, Barcelona, pp 64–71
Oller CA, Forcada ML (2006) Open-source machine translation between small languages: Catalan and Aranese Occitan. In: Proceedings of the workshop on minority languages: “strategies for developing machine translation for minority languages”, Genoa, pp 51–54
Salimzyanov I, Washington JN, Tyers FM (2013) A free/open-source Kazakh-Tatar machine translation system. In: Proceedings of the MT summit, Nice, pp 174–182
Scannell KP (2006) Machine translation for closely related language pairs. In: Proceedings of the workshop on minority languages: “strategies for developing machine translation for minority languages”, Genoa, pp 103–109
Tantuğ AC, Adalı E, Oflazer K (2007) A MT system from Turkmen to Turkish employing finite state and statistical methods. In: Proceedings of the MT summit, Copenhagen, pp 459–465
Tantuğ AC, Oflazer K, Durgar-El Kahlout İ (2008) BLEU+: a tool for fine-grained BLEU computation. In: Proceedings of LREC, Marrakesh, pp 1493–1499
Tiedemann J (2009) Character-based PSMT for closely related languages. In: Proceedings of EAMT, Barcelona, pp 12–19
Tyers FM, Washington JN, Salimzyanov I, Batalov R (2012) A prototype machine translation system for Tatar and Bashkir based on free/open-source components. In: Proceedings of the workshop on language resources and technologies for Turkic languages, Istanbul, pp 11–14
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Tantuğ, A.C., Adalı, E. (2018). Machine Translation Between Turkic Languages. In: Oflazer, K., Saraçlar, M. (eds) Turkish Natural Language Processing. Theory and Applications of Natural Language Processing. Springer, Cham. https://doi.org/10.1007/978-3-319-90165-7_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-90165-7_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-90163-3
Online ISBN: 978-3-319-90165-7
eBook Packages: Computer ScienceComputer Science (R0)