Phrase-Based Statistical Machine Translation

  • Richard Zens
  • Franz Josef Och
  • Hermann Ney
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2479)


This paper is based on work carried out within the Verbmobil project, a limited-domain speech translation task (German-English). In the final evaluation, the statistical approach performed best among five competing approaches.

In this paper, we further investigate the statistical translation models used in that work. A shortcoming of the single-word-based model is that it does not take contextual information into account when making translation decisions. We present a translation model based on bilingual phrases that explicitly models the local context, and we show that this model performs better than the single-word-based model. We compare monotone and non-monotone search for this model, and we investigate the benefit of using the sum criterion instead of the maximum approximation.
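The difference between the sum criterion and the maximum approximation can be illustrated with a small sketch. It is not the authors' implementation: the function name `segmentation_score`, the `max_len` limit, and the toy phrase table with its probabilities are all assumptions made for illustration. For a fixed sentence pair, the code enumerates monotone segmentations into bilingual phrase pairs by dynamic programming and either sums their scores or keeps only the best one.

```python
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical phrase table: (source phrase, target phrase) -> probability.
PhraseTable = Dict[Tuple[Tuple[str, ...], Tuple[str, ...]], float]


def segmentation_score(
    source: Sequence[str],
    target: Sequence[str],
    table: PhraseTable,
    combine: Callable[[List[float]], float],
    max_len: int = 3,
) -> float:
    """Score a sentence pair over all monotone phrase segmentations.

    combine=max gives the maximum approximation (best segmentation only);
    combine=sum gives the sum criterion (all segmentations).
    """
    J, I = len(source), len(target)
    # q[j][i]: combined score of covering the first j source words
    # and the first i target words with phrase pairs in monotone order.
    q = [[0.0] * (I + 1) for _ in range(J + 1)]
    q[0][0] = 1.0
    for j in range(1, J + 1):
        for i in range(1, I + 1):
            candidates: List[float] = []
            for jp in range(max(0, j - max_len), j):
                for ip in range(max(0, i - max_len), i):
                    pair = (tuple(source[jp:j]), tuple(target[ip:i]))
                    p = table.get(pair, 0.0)
                    if p > 0.0 and q[jp][ip] > 0.0:
                        candidates.append(q[jp][ip] * p)
            q[j][i] = combine(candidates) if candidates else 0.0
    return q[J][I]


# Toy data (assumed values, not taken from the paper).
toy_table: PhraseTable = {
    (("guten",), ("good",)): 0.5,
    (("morgen",), ("morning",)): 0.4,
    (("guten", "morgen"), ("good", "morning")): 0.3,
}
src = ["guten", "morgen"]
tgt = ["good", "morning"]

print("maximum approximation:", segmentation_score(src, tgt, toy_table, max))
print("sum criterion:", segmentation_score(src, tgt, toy_table, sum))
```

In this toy case two segmentations exist, the word-by-word one and the single two-word phrase pair. The maximum approximation keeps only the better of the two, while the sum criterion accumulates both, so the sum is never smaller than the maximum.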


Keywords (added by machine, not by the authors): Target Sentence, Translation Model, Word Error Rate, Sentence Pair, Source Sentence





Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • Richard Zens (1)
  • Franz Josef Och (1)
  • Hermann Ney (1)
  1. Human Language Technology and Pattern Recognition, Lehrstuhl für Informatik VI, Computer Science Department, RWTH Aachen, University of Technology, Germany
