Entity Translation with Collective Inference in Knowledge Graph

Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9362)

Abstract

Nowadays knowledge base (KB) has been viewed as one of the important infrastructures for many web search applications and NLP tasks. However, in practice the availability of KB data varies from language to language, which greatly limits potential usage of knowledge base. In this paper, we propose a novel method to construct or enrich a knowledge base by entity translation with help of another KB but compiled in a different language. In our work, we concentrate on two key tasks: 1) collecting translation candidates with as good coverage as possible from various sources such as web or lexicon; 2) building an effective disambiguation algorithm based on collective inference approach over knowledge graph to find correct translation for entities in the source knowledge base. We conduct experiments on movie domain of our in-house knowledge base from English to Chinese, and the results show the proposed method can achieve very high translation precision compared with classical translation methods, and significantly increase the volume of Chinese knowledge base in this domain.

Keywords

Knowledge base Machine translation Collective learning 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Al-onaizan, Y., Knight, K.: Translating named entities using monolingual and bilingual resources. In: Proceedings of the 40th Meeting of the Association for Computational Linguistics (ACL02), pp. 400–408 (2002)Google Scholar
  2. 2.
    Bao, J., Duan, N., Zhou, M., Zhao, T.: Knowledge-based question answering as machine translation. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. Long Papers, vol. 1, pp. 967–976. Association for Computational Linguistics, Baltimore, June 2014Google Scholar
  3. 3.
    Berant, J., Chou, A., Frostig, R., Liang, P.: Semantic parsing on Freebase from questionanswer pairs. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 1533–1544. Association for Computational Linguistics, Seattle, Washington, October 2013Google Scholar
  4. 4.
    Brown, P.F., Pietra, S.D., Pietra, V.J.D., Mercer, R.L.: The mathematics of statistical machine translation: Parameter estimation. Computational Linguistics 19(2), 263–311 (1993)Google Scholar
  5. 5.
    de Melo, G., Weikum, G.: UWN: a large multilingual lexical knowledge base. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, pp. 151–156. Association for Computational Linguistics, Stroudsburg (2012)Google Scholar
  6. 6.
    Han, X., Sun, L., Zhao, J.: Collective entity linking in web text: a graph-based method. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’11, pp. 765–774. ACM, New York (2011)Google Scholar
  7. 7.
    Huang, F., Vogel, S.: Improved named entity translation and bilingual named entity extraction. In: Proceedings of Fourth IEEE International Conference on Multimodal Interfaces. Institute of Electrical & Electronics Engineers (IEEE) (2002)Google Scholar
  8. 8.
    Jiang, L., Zhou, M., Chien, L.F., Niu, C.: Named entity translation with Web mining and transliteration. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence, pp. 1629–1634 (2007)Google Scholar
  9. 9.
    Lin, D., Zhao, S., Van Durme, B., Paşca, M.: Mining parenthetical translations from the Web by word alignment. In: Proceedings of ACL-08: HLT, pp. 994–1002. Association for Computational Linguistics, Columbus, June 2008Google Scholar
  10. 10.
    Liu, S., Li, C.-H., Li, M., Zhou, M.: Learning translation consensus with structured label propagation. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics. Long Papers, vol. 1, pp. 302–310. Association for Computational Linguistics, Jeju Island, July 2012Google Scholar
  11. 11.
    Subramanya, A., Petrov, S., Pereira, F.: Efficient graph-based semi-supervised learning of structured tagging models. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 167–176. Association for Computational Linguistics, Cambridge, October 2010Google Scholar
  12. 12.
    Talukdar, P.P., Pereira, F.: Experiments in graph-based semi-supervised learning methods for class-instance acquisition. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 1473–1481. Association for Computational Linguistics, Uppsala, July 2010Google Scholar
  13. 13.
    Wick, M., Rohanimanesh, K., Culotta, A., Mccallum, A.: Samplerank: learning preference from atomic gradients. In: NIPS WS on Advances in Ranking (2009)Google Scholar
  14. 14.
    Yao, X., Van Durme, B.: Information extraction over structured data: question answering with Freebase. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. Long Papers, vol. 1, pp. 956–966. Association for Computational Linguistics, Baltimore, June 2014Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Qinglin Li
    • 1
  • Shujie Liu
    • 2
  • Rui Lin
    • 3
  • Mu Li
    • 2
  • Ming Zhou
    • 2
  1. 1.Shanghai Jiaotong UniversityShanghaiChina
  2. 2.Microsoft Research AsiaBeijingChina
  3. 3.Harbin Institute of TechnologyHarbinChina

Personalised recommendations