Abstract
This paper presents the experimental results of several approaches to machine translation evaluation to determine the quality of Thai to English translation. We compare automatic metrics and human-based evaluation that includes error classification, reading comprehension and analysis from a professional translator. The research compares translation systems that are available to end users in Thailand to provide an understanding of the quality of translation in general use. Both the rate of 47.2 % error words per text and the BLEU score of 0.21 indicate the difficulty of Thai to English translation. Despite a high error rate for the translations, users were able to successfully answer about 60 % of the questions using the output of the machine translation systems in the reading comprehension tests.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Avramidis, E., Burchardt, A., Federmann, C., Popović, M., Tscherwinka, C., Vilar, D.: Involving language professionals in the evaluation of machine translation. In: LREC, pp. 1127–1130 (2012)
Callison-Burch, C.: Fast, cheap, and creative: evaluating translation quality using amazon’s mechanical turk. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, vol. 1, pp. 286–295. Association for Computational Linguistics (2009)
Chimsuk, T., Auwatanamongkol, S.: A Thai to English machine translation system using Thai LFG tree structure as interlingua. World Academy of Science, Engineering and Technology, pp. 690–695 (2009)
Koehn, P.: Open problems in machine translation (2013). https://www.youtube.com/watch?v=6UVgFjJeFGY
Jones, D., Shen, W., Granoien, N., Herzog, M., Weinstein, C.: Measuring translation quality by testing English speakers with a new defense language proficiency test for Arabic. Massachusetts Institute of Technology, Lexington Lincoln Lab (2005)
Labutsri, N., Chamchong, R., Booth, R., Rodtook, A.: English syntactic reordering for English-Thai phrase-based statistical machine translation. In: Proceedings of the 6th International Joint Conference on Computer Science and Software Engineering (JCSSE 2009) (2009)
Lyons, S.: A survey of the use of mobile technology and translation tools by students at secondary school in Thailand. Payap Univ. J. 26(1) (2016)
Luekhong, P., Ruangrajitpakorn, T., Supnithi, T., Sukhahuta, R.: Pooja: similarity-based bilingual word alignment framework for SMT. In: Proceedings of the 10th International Symposium on Natural Language Processing, Phuket, Thailand (2013)
Netjinda, N., Facundes, N., Sirinaovakul, B.: Toward statistical machine translation for Thai and English. In: International Symposium On Digital Libraries, Albuquerque, New Mexico, USA, 27–28 October 2009 (2005)
Noyunsan, C., Poltree, C.H.S., Saikeaw, K.R.: A multi-aspect comparison and evaluation on Thai word segmentation programs. In: JIST (Workshops & Posters) 2014, pp. 132–135 (2014)
Nathalang, S., Porkeaw, P., Supnithi, T.: Don’t use big words with me: an evaluation of English-Thai statistical-based machine translation. In: Proceedings of the International Symposium on Using Corpora in Contrastive and Translation Studies (UCCTS2010) (2010)
Papineni, P., Roukos, S., Ward, T., Zhu, W.-J.: BLEU: a method of automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 311–318. Association for Computational Linguistics (2002)
Popović, M.: Hjerson: an open source tool for automatic error classification of machine translation output. Prague Bull. Math. Linguist. 96, 59–67 (2011)
Popović, M., Burchardt, A.: From human to automatic error classification for machine translation output. In: 15th International Conference of the European Association for Machine Translation (EAMT 2011) (2011)
Popović, M., Ney, H.: Word error rates: decomposition over POS classes and applications for error analysis. In: Proceedings of the Second Workshop on Statistical Machine Translation, pp. 48–55. Association for Computational Linguistics (2007)
Popović, M., Ney, H.: Towards automatic error analysis of machine translation output. Comput. Linguist. 37(4), 657–688 (2011)
Przybocki, M.A., Peterson, K., Bronsart, S.: Translation adequacy and preference evaluation tool (TAP-ET). In: LREC, vol. 2008, p. 6 (2008)
Slayden, G., Hwang, M.Y., Schwartz, L.: Thai sentence-breaking for large-scale SMT. In: 23rd International Conference on Computational Linguistics, p. 8 (2010)
Sornlertlamvanich, V., Charoenpornsawat, P., Boriboon, M., Boonmana, L.: ParSit: English-Thai machine translation services on internet. In: 12th Annual Conference, ECTI and New Economy. National Electronics and Computer Technology Center, Bangkok (2000)
Supnithi, T., Sornlertlamvanich, V., Charoenporn, T.: A cross system machine translation. In: Proceedings of the 2002 COLING Workshop on Machine Translation in Asia (COLING-MTIA 2002), vol. 16 (2002)
Vilar, D., Xu, J., d’Haro, L.F., Ney, H.: Error analysis of statistical machine translation output. In: Proceedings of LREC, pp. 697–702 (2006)
Wutiwiwatchai, C.: Language and speech translation activities in Thailand. ASEAN-NICT Round Table – Feb 2015 (2015)
Wutiwiwatchai, C., Supnithi, T., Kosawat, K.: Speech-to-speech translation activities in Thailand. In: Workshop on Technologies and Corpora for Asia-Pacific Speech Translation (TCAST), p. 7 (2007)
Wutiwiwatchai, C., Supnithi, T., Porkaew, P., Thatphithakkul, N.: Improvement issues in English-Thai speech translation. In: Proceedings of TCAST Workshop 2009 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Lyons, S. (2016). Quality of Thai to English Machine Translation. In: Ohwada, H., Yoshida, K. (eds) Knowledge Management and Acquisition for Intelligent Systems . PKAW 2016. Lecture Notes in Computer Science(), vol 9806. Springer, Cham. https://doi.org/10.1007/978-3-319-42706-5_20
Download citation
DOI: https://doi.org/10.1007/978-3-319-42706-5_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42705-8
Online ISBN: 978-3-319-42706-5
eBook Packages: Computer ScienceComputer Science (R0)