Stat-XFER: A General Search-Based Syntax-Driven Framework for Machine Translation

Lavie, Alon

doi:10.1007/978-3-540-78135-6_31

Alon Lavie¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4919))

Included in the following conference series:

International Conference on Intelligent Text Processing and Computational Linguistics

1478 Accesses
10 Citations

Abstract

The CMU Statistical Transfer Framework (Stat-XFER) is a general framework for developing search-based syntax-driven machine translation (MT) systems. The framework consists of an underlying syntax-based transfer formalism along with a collection of software components designed to facilitate the development of a broad range of MT research systems. The main components are a general language-independent runtime transfer engine and decoder, along with several different tools for creating the various underlying language-pair-specific resources that are required for building a specific MT system for any given language pair. We describe the general framework, its unique properties and features, and its application to the construction of MT research prototype systems for a diverse collection of language pairs.

Download to read the full chapter text

Chapter PDF

Parse and Corpus-Based Machine Translation

Controlled Ascent: Imbuing Statistical MT with Linguistic Knowledge

Getting Past the Language Gap: Innovations in Machine Translation

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Koehn, P., Och, F.J., Marcu, D.: Statistical Phrase-based Translation. In: Proceedings of HLT-NAACL 2003, Association for Computational Linguistics, Edmonton, Alberta, Canada, pp. 127–133 (2003)
Google Scholar
Venugopal, A., Vogel, S., Waibel, A.: Effective Phrase Translation Extraction from Alignment Models. In: Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics (ACL-2003), Sapporo, Japan, pp. 319–326 (2003)
Google Scholar
Chiang, D.: A Hierarchical Phrase-based Model for Statistical Machine Translation. In: Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL-2005), Ann Arbor, Michigan, pp. 263–270 (2005)
Google Scholar
Imamura, K., et al.: Example-based Machine Translation Based on Syntactic Transfer with Statistical Models. In: Proceedings of COLING-2004, Geneva, Switzerland, pp. 99–105 (2004)
Google Scholar
Galley, M., et al.: Scalable Inference and Training of Context-Rich Syntactic Translation Models. In: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, Australia, pp. 961–968 (2006)
Google Scholar
Lavie, A., et al.: Experiments with a Hindi-to-English Transfer-based MT System under a Miserly Data Scenario. Transactions on Asian Language Information Processing (TALIP) 2 (2003)
Google Scholar
Lavie, A., et al.: A Trainable Transfer-based Machine Translation Approach for Languages with Limited Resources. In: Proceedings of Workshop of the European Association for Machine Translation (EAMT-2004), Valletta, Malta (2004)
Google Scholar
Peterson, E.: Adapting a Transfer Engine for Rapid Machine Translation Development. Master’s thesis, Georgetown University (2002)
Google Scholar
Zhang, Y., Vogel, S.: Suffix Array and its Applications in Empirical Natural Language Processing. Technical Report CMU-LTI-06-010, Language Technologies Institute, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA (2006)
Google Scholar
Lavie, A., et al.: Rapid Prototyping of a Transfer-based Hebrew-to-English Machine Translation System. In: Proceedings of the 10th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI-2004), Baltimore, MD, pp. 1–10 (2004)
Google Scholar
Wintner, S.: Hebrew Computational Linguistics: Past and Future. Artificial Intelligence Review 21, 113–138 (2004)
Article MATH Google Scholar
Wintner, S., Yona, S.: Resources for Processing Hebrew. In: Proceedings of the MT-Summit IX workshop on Machine Translation for Semitic Languages, New Orleans (2003)
Google Scholar
Yona, S., Wintner, S.: A Finite-State Morphological Grammar of Hebrew. Natural Language Engineering (to appear, 2007)
Google Scholar
Wintner, S.: Finite-state Technology as a Programming Environment. In: Gelbukh, A. (ed.) CICLing 2007. LNCS, vol. 4394, pp. 97–106. Springer, Heidelberg (2007)
Chapter Google Scholar
Adler, M., Elhadad, M.: An Unsupervised Morpheme-Based HMM for Hebrew Morphological Disambiguation. In: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, Australia, pp. 665–672. Association for Computational Linguistics (2006)
Google Scholar
Shacham, D.: Morphological Disambiguation of Hebrew. Master’s thesis, University of Haifa (2007)
Google Scholar
Dahan, H.: Hebrew–English English–Hebrew Dictionary. Academon, Jerusalem (1997)
Google Scholar
Papineni, K., et al.: BLEU: a Method for Automatic Evaluation of Machine Translation. In: Proceedings of 40th Annual Meeting of the Association for Computational Linguistics (ACL), Philadelphia, PA, pp. 311–318 (2002)
Google Scholar
Doddington, G.: Automatic Evaluation of Machine Translation Quality Using N-gram Co-Occurrence Statistics. In: Proceedings of the Second Conference on Human Language Technology (HLT-2002) (2002)
Google Scholar
Efron, B., Tibshirani, R.: Bootstrap Methods for Standard Errors, Confidence Intervals and Other Measures of Statistical Accuracy. Statistical Science 1, 54–77 (1986)
Article MathSciNet Google Scholar
Probst, K., et al.: MT for Resource-Poor Languages Using Elicitation-Based Learning of Syntactic Transfer Rules. Machine Translation 17 (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
Alon Lavie

Authors

Alon Lavie
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Alexander Gelbukh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lavie, A. (2008). Stat-XFER: A General Search-Based Syntax-Driven Framework for Machine Translation. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2008. Lecture Notes in Computer Science, vol 4919. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78135-6_31

Download citation

DOI: https://doi.org/10.1007/978-3-540-78135-6_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78134-9
Online ISBN: 978-3-540-78135-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Stat-XFER: A General Search-Based Syntax-Driven Framework for Machine Translation

Abstract

Chapter PDF

Similar content being viewed by others

Parse and Corpus-Based Machine Translation

Controlled Ascent: Imbuing Statistical MT with Linguistic Knowledge

Getting Past the Language Gap: Innovations in Machine Translation

Keywords

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Stat-XFER: A General Search-Based Syntax-Driven Framework for Machine Translation

Abstract

Chapter PDF

Similar content being viewed by others

Parse and Corpus-Based Machine Translation

Controlled Ascent: Imbuing Statistical MT with Linguistic Knowledge

Getting Past the Language Gap: Innovations in Machine Translation

Keywords

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation