EBMT of POS-Tagged Sentences by Recursive Division Via Inductive Learning

Andriamanankasina, Tantely; Araki, Kenji; Tochinai, Koji

doi:10.1007/978-94-010-0181-6_8

Tantely Andriamanankasina,
Kenji Araki &
Koji Tochinai

Part of the book series: Text, Speech and Language Technology ((TLTB,volume 21))

199 Accesses
2 Citations

Abstract

We present an Example-Based Machine Translation approach which recursively divides the sentence to be translated, and translates each part separately. The sentence is divided according to the structure of similar examples extracted during the matching process. The approach is especially intended for languages where resources and tools are pretty much unavailable. POS taggers are the only tools utilized, and the bilingual corpus the only resource employed. In addition, the translation system contains an analogy-based sub-sentential alignment module, which predicts word correspondences between new pairs of sentences. This module causes the corpus to grow because new examples can be appended automatically. Consequently, a relatively small initial corpus is sufficient for the translation system to start. The approach has been tested on a French-Japanese corpus of spoken language and produced promising results worthy of further investigation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Andriamanankasina, T., K. Araki and K. Tochinai. 1999. Sub-Sentential Alignment Method by Analogy. In PACLIC13: Proceedings of the 13th Pacific Asia Conference on Language, Information and Computation, Taipei, Taiwan, pp.277–284.
Google Scholar
Araki, K., Y. Takahashi, Y. Momouchi and K. Tochinai. 1996. Non-Segmented Kana-Kanji Translation Using Inductive Learning. Transactions of the IEICE J97-D-II(4):391–402.
Google Scholar
Brill, E. 1993. A Corpus-Based Approach To Language Learning. PhD Dissertation, University of Pensylvania, Philadelphia, PA.
Google Scholar
Brown, P., S. Delia Pietra, V. Delia Pietra and R. Mercer. 1993. The Mathematics of Statistical Machine Translation: Parameter Estimation. Computational Linguistics 19(2):263–309.
Google Scholar
Cranias, L., H. Papageorgiou and S. Piperidis. 1994. A Matching Technique in Example-Based Machine Translation. In COLING-94: Proceedings of 15th International Conference on Computational Linguistics, Kyoto, Japan, pp.100–104.
Google Scholar
Kitamura M., and Y. Matsumoto. 1997. Automatic Extraction of Translation Patterns in Parallel Corpora. Transactions of the IPSJ 38(4): 727–736 (cf. Chapter 13, this volume).
Google Scholar
Kitano, H. 1993. A Comprehensive and Practical Model of Memory-Based Machine Translation. In Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence, Chambéry, Prance, pp.1276–1282.
Google Scholar
Meguro, S. 1987. Manuel de Conversation Française. Hakusuisha, Tokyo, Japan.
Google Scholar
Melamed, D. 1997. A Word-to-Word Model of Translational Equivalence. In 35th Conference of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics, Madrid, Spain, pp.490–497.
Google Scholar
Nirenburg S., S. Beale and K. Domashnev. 1994. A Full-Text Experiment in Example-Based Machine Translation. In NeMLaP: Proceedings of the International Conference on New Methodologies in Language Processing, Manchester, UK, pp.95–103.
Google Scholar
Nirenburg, S., D. Grannes and K. Domashnev. 1993. Two Approaches to Matching in Example-Based Machine Translation. In Proceedings of the Fifth International Conference on Theoretical and Methodological issues in Machine Translation TMI’ 93: MT in the Next Generation, Kyoto, Japan, pp.47–57.
Google Scholar
Sato, F. 1990. Locutions de base. Hakusuisha, Tokyo, Japan.
Google Scholar
Veale, T. and A. Way. 1997. Gaijin: A Bootstrapping Approach to Example-Based Machine Translation. In International Conference, Recent Advances in Natural Language Processing, Tzigov Chark, Bulgaria, pp.239–244.
Google Scholar
Yamashita, T. 1996. ChaSen Technical Report. Technical Report, Nara Advanced Institute of Science and Technology, Nara, Japan.
Google Scholar

Download references

Authors

Tantely Andriamanankasina
View author publications
You can also search for this author in PubMed Google Scholar
Kenji Araki
View author publications
You can also search for this author in PubMed Google Scholar
Koji Tochinai
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Applications, Dublin City University, Dublin, Ireland
Andy Way
Institut der Gesellschaft zur Forderung der Angewandten Informationsforschung e. V. an der Universität des Saarlandes, Saarbrücken, Germany
Michael Carl

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Andriamanankasina, T., Araki, K., Tochinai, K. (2003). EBMT of POS-Tagged Sentences by Recursive Division Via Inductive Learning. In: Carl, M., Way, A. (eds) Recent Advances in Example-Based Machine Translation. Text, Speech and Language Technology, vol 21. Springer, Dordrecht. https://doi.org/10.1007/978-94-010-0181-6_8

Download citation

DOI: https://doi.org/10.1007/978-94-010-0181-6_8
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-1401-7
Online ISBN: 978-94-010-0181-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics