Skip to main content

Learning Bilingual Sentence Representations for Quality Estimation of Machine Translation

  • Conference paper
  • First Online:
Book cover Machine Translation (CWMT 2016)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 668))

Included in the following conference series:

Abstract

In this paper, we propose a novel approach learning bilingual representations to predict quality estimation of machine translation. We use two bi-directional Long Short-Term Memory (LSTM) based architectures map the source sentence and target sentence to two context vector of a fixed dimensionality, then we compute the weighted cosine distance of the two vectors to estimate the translation quality of the target sentence. Our experimental results show that our model improve the performance over a baseline system with 17 features in the English-to-Spanish sentence-level quality estimation task of WMT15.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Blatz, J., Fitzgerald, E., Foster, G., Gandrabur, S., Goutte, C., Kulesza, A., Sanchis, A., Ueffing, N.: Confidence estimation for machine translation. In: Proceedings of the 20th International Conference on Computational Linguistics, p. 315. Association for Computational Linguistics (2004)

    Google Scholar 

  2. Bojar, O., Buck, C., Callison-Burch, C., Federmann, C., Haddow, B., Koehn, P., Monz, C., Post, M., Soricut, R., Specia, L.: Findings of the 2013 workshop on statistical machine translation. In: Proceedings of the Eighth Workshop on Statistical Machine Translation, pp. 1–44. Association for Computational Linguistics, Sofia, Bulgaria (2013)

    Google Scholar 

  3. Bojar, O., Chatterjee, R., Federmann, C., Haddow, B., Huck, M., Hokamp, C., Koehn, P., Logacheva, V., Monz, C., Negri, M., Post, M., Scarton, C., Specia, L., Turchi, M.: Findings of the 2015 workshop on statistical machine translation. In: Proceedings of the Tenth Workshop on Statistical Machine Translation, pp. 1–46. Association for Computational Linguistics, Lisbon, Portugal (2015)

    Google Scholar 

  4. Bojar, O., Buck, C., Federmann, C., Haddow, B., Koehn, P., Leveling, J., Monz, C., Pecina, P., Post, M., Saint-Amand, H., Soricut, R., Specia, L., Tamchyna, A.: Findings of the 2014 workshop on statistical machine translation. In: Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 12–58. Association for Computational Linguistics, Baltimore, Maryland, USA (2014)

    Google Scholar 

  5. Callison-Burch, C., Koehn, P., Monz, C., Post, M., Soricut, R., Specia, L.: Findings of the 2012 workshop on statistical machine translation. In: Proceedings of the Seventh Workshop on Statistical Machine Translation, pp. 10–51. Association for Computational Linguistics, Montréal, Canada (2012)

    Google Scholar 

  6. Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. In: ICLR 2015, pp. 1–15 (2014). http://arxiv.org/abs/1409.0473v3

  7. Gers, F.A., Schmidhuber, J., Cummins, F.: Learning to forget: continual prediction with LSTM. Neural Comput. 12(10), 2451–2471 (2000)

    Article  Google Scholar 

  8. Gers, F.A., Schraudolph, N.N., Schmidhuber, J.: Learning precise timing with LSTM recurrent networks. J. Mach. Learn. Res. 3, 115–143 (2002)

    MATH  MathSciNet  Google Scholar 

  9. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)

    Article  Google Scholar 

  10. Koehn, P.: Europarl: a parallel corpus for statistical machine translation. In: MT summit, vol. 5, pp. 79–86 (2005)

    Google Scholar 

  11. Langlois, D.: LORIA system for the WMT15 quality estimation shared task. In: Proceedings of the Tenth Workshop on Statistical Machine Translation, pp. 323–329. Association for Computational Linguistics, Lisbon, Portugal, September 2015

    Google Scholar 

  12. Luong, M.T., Pham, H., Manning, C.D.: Effective approaches to attention-based neural machine translation (2015)

    Google Scholar 

  13. Shah, K., Logacheva, V., Paetzold, G., Blain, F., Beck, D., Bougares, F., Specia, L.: SHEF-NN: translation quality estimation with neural networks. In: Proceedings of the Tenth Workshop on Statistical Machine Translation, pp. 342–347, no. September. Association for Computational Linguistics, September 2015

    Google Scholar 

  14. Snover, M., Dorr, B., Schwartz, R., Micciulla, L., Makhoul, J.: A study of translation edit rate with targeted human annotation. In: Proceedings of Association for Machine Translation in the Americas, pp. 223–231 (2006)

    Google Scholar 

  15. Specia, L., Cancedda, N., Dymetman, M., Turchi, M., Cristianini, N.: Estimating the sentence-level quality of machine translation systems. In: EAMT-2009: 13th Annual Conference of the European Association for Machine Translation, pp. 28–35 (2009)

    Google Scholar 

  16. Specia, L., Paetzold, G., Scarton, C.: Multi-level translation quality prediction with quest++. In: Proceedings of ACL-IJCNLP 2015 System Demonstrations, pp. 115–120. Association for Computational Linguistics and The Asian Federation of Natural Language Processing, Beijing, China, July 2015. http://www.aclweb.org/anthology/P15-4020

  17. Specia, L., Shah, K., de Souza, J.G.C., Cohn, T., Kessler, F.B.: Quest a translation quality estimation framework. In: Proceedings of the 51st ACL: System Demonstrations, pp. 79–84 (2013)

    Google Scholar 

  18. Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Advances in Neural Information Processing Systems (NIPS), pp. 3104–3112 (2014). http://papers.nips.cc/paper/5346-sequence-to-sequence-learning-with-neural

  19. Zeiler, M.D.: Adadelta: an adaptive learning rate method. arXiv preprint arXiv:1212.5701 (2012)

Download references

Acknowledgements

This paper is supported by the project of Natural Science Foundation of China (Grant No. 61272384 & 61402134 & 61370170).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Muyun Yang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Zhu, J., Yang, M., Li, S., Zhao, T. (2016). Learning Bilingual Sentence Representations for Quality Estimation of Machine Translation. In: Yang, M., Liu, S. (eds) Machine Translation. CWMT 2016. Communications in Computer and Information Science, vol 668. Springer, Singapore. https://doi.org/10.1007/978-981-10-3635-4_4

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-3635-4_4

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-3634-7

  • Online ISBN: 978-981-10-3635-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics