Abstract
Large amounts of bilingual data and monolingual data in the target language are usually used to train statistical machine translation systems. In this paper we propose several semi-supervised techniques within a Bengali English Phrase-based Statistical Machine Translation (SMT) System in order to improve translation quality. We conduct experiments on a Bengali-English dataset and our initial experimental results show improvement in translation quality.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Sarkar, A., Haffari, G., Ueffing, N.: Transductive learning for statistical machine translation. In: Proc. ACL (2007)
Ueffing, N., Simard, M., Larkin, S., Johnson, J.H.: NRC’s Portage system for WMT 2007. In: Proc. ACL Workshop on SMT (2007)
Stolcke, A.: SRILM - an extensible language modeling toolkit. In: Proc. ICSLP (2002)
Och, F.J.: Minimum error rate training in statistical machine translation. In: Proc. ACL (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Roy, M. (2009). A Semi-supervised Approach to Bengali-English Phrase-Based Statistical Machine Translation. In: Gao, Y., Japkowicz, N. (eds) Advances in Artificial Intelligence. Canadian AI 2009. Lecture Notes in Computer Science(), vol 5549. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01818-3_45
Download citation
DOI: https://doi.org/10.1007/978-3-642-01818-3_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01817-6
Online ISBN: 978-3-642-01818-3
eBook Packages: Computer ScienceComputer Science (R0)