Abstract
We present an Example-Based Machine Translation approach which recursively divides the sentence to be translated, and translates each part separately. The sentence is divided according to the structure of similar examples extracted during the matching process. The approach is especially intended for languages where resources and tools are pretty much unavailable. POS taggers are the only tools utilized, and the bilingual corpus the only resource employed. In addition, the translation system contains an analogy-based sub-sentential alignment module, which predicts word correspondences between new pairs of sentences. This module causes the corpus to grow because new examples can be appended automatically. Consequently, a relatively small initial corpus is sufficient for the translation system to start. The approach has been tested on a French-Japanese corpus of spoken language and produced promising results worthy of further investigation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Andriamanankasina, T., K. Araki and K. Tochinai. 1999. Sub-Sentential Alignment Method by Analogy. In PACLIC13: Proceedings of the 13th Pacific Asia Conference on Language, Information and Computation, Taipei, Taiwan, pp.277–284.
Araki, K., Y. Takahashi, Y. Momouchi and K. Tochinai. 1996. Non-Segmented Kana-Kanji Translation Using Inductive Learning. Transactions of the IEICE J97-D-II(4):391–402.
Brill, E. 1993. A Corpus-Based Approach To Language Learning. PhD Dissertation, University of Pensylvania, Philadelphia, PA.
Brown, P., S. Delia Pietra, V. Delia Pietra and R. Mercer. 1993. The Mathematics of Statistical Machine Translation: Parameter Estimation. Computational Linguistics 19(2):263–309.
Cranias, L., H. Papageorgiou and S. Piperidis. 1994. A Matching Technique in Example-Based Machine Translation. In COLING-94: Proceedings of 15th International Conference on Computational Linguistics, Kyoto, Japan, pp.100–104.
Kitamura M., and Y. Matsumoto. 1997. Automatic Extraction of Translation Patterns in Parallel Corpora. Transactions of the IPSJ 38(4): 727–736 (cf. Chapter 13, this volume).
Kitano, H. 1993. A Comprehensive and Practical Model of Memory-Based Machine Translation. In Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence, Chambéry, Prance, pp.1276–1282.
Meguro, S. 1987. Manuel de Conversation Française. Hakusuisha, Tokyo, Japan.
Melamed, D. 1997. A Word-to-Word Model of Translational Equivalence. In 35th Conference of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics, Madrid, Spain, pp.490–497.
Nirenburg S., S. Beale and K. Domashnev. 1994. A Full-Text Experiment in Example-Based Machine Translation. In NeMLaP: Proceedings of the International Conference on New Methodologies in Language Processing, Manchester, UK, pp.95–103.
Nirenburg, S., D. Grannes and K. Domashnev. 1993. Two Approaches to Matching in Example-Based Machine Translation. In Proceedings of the Fifth International Conference on Theoretical and Methodological issues in Machine Translation TMI’ 93: MT in the Next Generation, Kyoto, Japan, pp.47–57.
Sato, F. 1990. Locutions de base. Hakusuisha, Tokyo, Japan.
Veale, T. and A. Way. 1997. Gaijin: A Bootstrapping Approach to Example-Based Machine Translation. In International Conference, Recent Advances in Natural Language Processing, Tzigov Chark, Bulgaria, pp.239–244.
Yamashita, T. 1996. ChaSen Technical Report. Technical Report, Nara Advanced Institute of Science and Technology, Nara, Japan.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Andriamanankasina, T., Araki, K., Tochinai, K. (2003). EBMT of POS-Tagged Sentences by Recursive Division Via Inductive Learning. In: Carl, M., Way, A. (eds) Recent Advances in Example-Based Machine Translation. Text, Speech and Language Technology, vol 21. Springer, Dordrecht. https://doi.org/10.1007/978-94-010-0181-6_8
Download citation
DOI: https://doi.org/10.1007/978-94-010-0181-6_8
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-1401-7
Online ISBN: 978-94-010-0181-6
eBook Packages: Springer Book Archive