Advertisement

Chunk-Based Dependency-to-String Model with Japanese Case Frame

  • Jinan Xu
  • Peihao Wu
  • Jun Xie
  • Yujie Zhang
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 493)

Abstract

This paper proposes an idea to integrate Japanese case frame into chunk-based dependency-to-string model. At first, case frames are acquired from Japanese chunk-based dependency analysis results. Then case frames are used to constraint rule extraction and decoding in chunk-based dependency-to-string model. Experimental results show that the proposed method performs well on long structural reordering and lexical translation, and achieves better performance than hierarchical phrase-based model and word-based dependency-to-string model on Japanese to Chinese test sets.

Keywords

Statistical Machine Translation Japanese Case Frame syntax structure chunk-based dependency-to-string model 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Yamada, K., Knight, K.: A syntax-based statistical translation model. In: Proceedings of the 39th Annual Meeting on Association for Computational Linguistics, pp. 523–530. Association for Computational Linguistics (2001)Google Scholar
  2. 2.
    Liu, Y., Liu, Q., Lin, S.: Tree-to-string alignment template for statistical machine translation. In: Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics, pp. 609–616. Association for Computational Linguistics (2006)Google Scholar
  3. 3.
    Liu, Y., Huang, Y., Liu, Q., et al.: Forest-to-string statistical translation rules. In: Annual Meeting-Association For Computational Linguistics, vol. 45(1), p. 704 (2007)Google Scholar
  4. 4.
    Mi, H., Huang, L., Liu, Q.: Forest-Based Translation. In: ACL, pp. 192–199 (2008)Google Scholar
  5. 5.
    Xie, J., Mi, H., Liu, Q.: A novel dependency-to-string model for statistical machine translation. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 216–226. Association for Computational Linguistics (2011)Google Scholar
  6. 6.
    Watanabe, T., Sumita, E., Okuno, H.G.: Chunk-based statistical translation. In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, vol. 1, pp. 303–310. Association for Computational Linguistics (2003)Google Scholar
  7. 7.
    Hoshino, S., Miyao, Y., Sudoh, K., et al.: Two-Stage Pre-ordering for Japanese-to-English Statistical Machine Translation. In: Proceedings of the 6th International Joint Conference on Natural Language Processing (2013)Google Scholar
  8. 8.
    Wu, X., Sudoh, K., Duh, K., et al.: Extracting Pre-ordering Rules from Predicate-Argument Structures. In: IJCNLP, pp. 29–37 (2011)Google Scholar
  9. 9.
    Wu, X., Sudoh, K., Duh, K., et al.: Extracting preordering rules from chunk-based dependency trees for Japanese-to-English translation. In: Proceedings of the 13th Machine Translation Summit, pp. 300–307 (2011)Google Scholar
  10. 10.
    Wu, D., Fung, P.: Semantic roles for smt: a hybrid two-pass model. In: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers, pp. 13–16. Association for Computational Linguistics (2009)Google Scholar
  11. 11.
    Kawahara, D., Kurohashi, S.: Case frame compilation from the web using high-performance computing. In: Proceedings of the 5th International Conference on Language Resources and Evaluation, pp. 1344–1347 (2006)Google Scholar
  12. 12.
    Kawahara, D., Kurohashi, S.: A fully-lexicalized probabilistic model for Japanese syntactic and case structure analysis. In: Proceedings of the Main Conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, pp. 176–183. Association for Computational Linguistics (2006)Google Scholar
  13. 13.
    Sasano, R., Kurohashi, S.: A Discriminative Approach to Japanese Zero Anaphora Resolution with Large-scale Lexicalized Case Frames. In: IJCNLP, pp. 758–766 (2011)Google Scholar
  14. 14.
    Koehn, P., Och, F.J., Marcu, D.: Statistical phrase-based translation. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, vol. 1, pp. 48–54. Association for Computational Linguistics (2003)Google Scholar
  15. 15.
    Och, F.J.: Minimum error rate training in statistical machine translation. In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, vol. 1, pp. 160–167. Association for Computational Linguistics (2003)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2014

Authors and Affiliations

  • Jinan Xu
    • 1
    • 2
  • Peihao Wu
    • 1
    • 2
  • Jun Xie
    • 1
    • 2
  • Yujie Zhang
    • 1
    • 2
  1. 1.School of Computer and Information TechnologyBeijing Jiaotong UniversityBeijingChina
  2. 2.Beijing Samsung Telecom R&D CenterBeijingChina

Personalised recommendations