Skip to main content

EBMT of POS-Tagged Sentences by Recursive Division Via Inductive Learning

  • Chapter
Recent Advances in Example-Based Machine Translation

Part of the book series: Text, Speech and Language Technology ((TLTB,volume 21))

Abstract

We present an Example-Based Machine Translation approach which recursively divides the sentence to be translated, and translates each part separately. The sentence is divided according to the structure of similar examples extracted during the matching process. The approach is especially intended for languages where resources and tools are pretty much unavailable. POS taggers are the only tools utilized, and the bilingual corpus the only resource employed. In addition, the translation system contains an analogy-based sub-sentential alignment module, which predicts word correspondences between new pairs of sentences. This module causes the corpus to grow because new examples can be appended automatically. Consequently, a relatively small initial corpus is sufficient for the translation system to start. The approach has been tested on a French-Japanese corpus of spoken language and produced promising results worthy of further investigation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Andriamanankasina, T., K. Araki and K. Tochinai. 1999. Sub-Sentential Alignment Method by Analogy. In PACLIC13: Proceedings of the 13th Pacific Asia Conference on Language, Information and Computation, Taipei, Taiwan, pp.277–284.

    Google Scholar 

  • Araki, K., Y. Takahashi, Y. Momouchi and K. Tochinai. 1996. Non-Segmented Kana-Kanji Translation Using Inductive Learning. Transactions of the IEICE J97-D-II(4):391–402.

    Google Scholar 

  • Brill, E. 1993. A Corpus-Based Approach To Language Learning. PhD Dissertation, University of Pensylvania, Philadelphia, PA.

    Google Scholar 

  • Brown, P., S. Delia Pietra, V. Delia Pietra and R. Mercer. 1993. The Mathematics of Statistical Machine Translation: Parameter Estimation. Computational Linguistics 19(2):263–309.

    Google Scholar 

  • Cranias, L., H. Papageorgiou and S. Piperidis. 1994. A Matching Technique in Example-Based Machine Translation. In COLING-94: Proceedings of 15th International Conference on Computational Linguistics, Kyoto, Japan, pp.100–104.

    Google Scholar 

  • Kitamura M., and Y. Matsumoto. 1997. Automatic Extraction of Translation Patterns in Parallel Corpora. Transactions of the IPSJ 38(4): 727–736 (cf. Chapter 13, this volume).

    Google Scholar 

  • Kitano, H. 1993. A Comprehensive and Practical Model of Memory-Based Machine Translation. In Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence, Chambéry, Prance, pp.1276–1282.

    Google Scholar 

  • Meguro, S. 1987. Manuel de Conversation Française. Hakusuisha, Tokyo, Japan.

    Google Scholar 

  • Melamed, D. 1997. A Word-to-Word Model of Translational Equivalence. In 35th Conference of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics, Madrid, Spain, pp.490–497.

    Google Scholar 

  • Nirenburg S., S. Beale and K. Domashnev. 1994. A Full-Text Experiment in Example-Based Machine Translation. In NeMLaP: Proceedings of the International Conference on New Methodologies in Language Processing, Manchester, UK, pp.95–103.

    Google Scholar 

  • Nirenburg, S., D. Grannes and K. Domashnev. 1993. Two Approaches to Matching in Example-Based Machine Translation. In Proceedings of the Fifth International Conference on Theoretical and Methodological issues in Machine Translation TMI’ 93: MT in the Next Generation, Kyoto, Japan, pp.47–57.

    Google Scholar 

  • Sato, F. 1990. Locutions de base. Hakusuisha, Tokyo, Japan.

    Google Scholar 

  • Veale, T. and A. Way. 1997. Gaijin: A Bootstrapping Approach to Example-Based Machine Translation. In International Conference, Recent Advances in Natural Language Processing, Tzigov Chark, Bulgaria, pp.239–244.

    Google Scholar 

  • Yamashita, T. 1996. ChaSen Technical Report. Technical Report, Nara Advanced Institute of Science and Technology, Nara, Japan.

    Google Scholar 

Download references

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Andriamanankasina, T., Araki, K., Tochinai, K. (2003). EBMT of POS-Tagged Sentences by Recursive Division Via Inductive Learning. In: Carl, M., Way, A. (eds) Recent Advances in Example-Based Machine Translation. Text, Speech and Language Technology, vol 21. Springer, Dordrecht. https://doi.org/10.1007/978-94-010-0181-6_8

Download citation

  • DOI: https://doi.org/10.1007/978-94-010-0181-6_8

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-1-4020-1401-7

  • Online ISBN: 978-94-010-0181-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics