ZhishiLink: Entity Linking on Zhishi.me

  • Chenyang Wu
  • Haofen Wang
  • Jun Qu
  • Yong Yu
Part of the Communications in Computer and Information Science book series (CCIS, volume 406)

Abstract

Entity linking, which aims to identify the entities mentioned in a given text, plays an important role in the shift from the Web of documents to the Web of knowledge. In this paper, we present ZhishiLink, an entity linking system targeting zhishi.me, the largest Chinese linked open data source. ZhishiLink performs domain-specific disambiguation, leveraging domain topic models to capture the implicit semantics of entity mentions; the domains are collected from the categories of zhishi.me. We evaluate the system on two manually tagged text corpora, drawn from Sina News and Sina Weibo. Experimental results show that ZhishiLink resolves most of the ambiguities arising in both text media with high efficiency. RESTful APIs and a web user interface are also provided for external use and browsing.
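The idea of domain-specific disambiguation can be illustrated with a minimal sketch. The abstract does not describe the actual models, so everything below is an assumption made for illustration: each domain is reduced to a toy unigram word distribution (a stand-in for a learned domain topic model), and the candidate entities, domain names, and probabilities are hypothetical. A candidate is chosen by how well the words around the mention fit the candidate's domain.

```python
import math

# Hypothetical per-domain word probabilities, standing in for learned
# domain topic models. The values are invented for illustration.
DOMAIN_WORD_PROBS = {
    "technology": {"phone": 0.4, "company": 0.3, "release": 0.2,
                   "eat": 0.05, "fruit": 0.05},
    "food": {"phone": 0.05, "company": 0.05, "release": 0.05,
             "eat": 0.4, "fruit": 0.45},
}

# Hypothetical candidate entities for a mention, each tagged with a domain.
CANDIDATES = {
    "apple": [("Apple Inc.", "technology"), ("apple (fruit)", "food")],
}

SMOOTHING = 1e-3  # probability assigned to words unseen in a domain


def domain_log_likelihood(context_words, domain):
    """Log-probability of the context words under one domain's distribution."""
    probs = DOMAIN_WORD_PROBS[domain]
    return sum(math.log(probs.get(word, SMOOTHING)) for word in context_words)


def disambiguate(mention, context_words):
    """Return the candidate whose domain best explains the context, or None."""
    candidates = CANDIDATES.get(mention, [])
    if not candidates:
        return None
    best = max(candidates,
               key=lambda cand: domain_log_likelihood(context_words, cand[1]))
    return best[0]


if __name__ == "__main__":
    print(disambiguate("apple", ["company", "release", "phone"]))
    print(disambiguate("apple", ["eat", "fruit"]))
```

A unigram model is used here only because it keeps the sketch short; a topic model such as LDA would instead infer a topic mixture for the context and compare it with each domain.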

Keywords

Entity Linking, Zhishi.me, Topic Model, Disambiguation

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Chenyang Wu (1)
  • Haofen Wang (1)
  • Jun Qu (1)
  • Yong Yu (1)
  1. Apex Data & Knowledge Management Lab, Shanghai Jiao Tong University, China
