Abstract
With the rapid development of Internet, lots of web data are published by internet users. This situation causes tremendous entities appear on the web. However, because of variety and ambiguity of natural language, one entity usually has multiple expressions. To know the actual meaning of one document, it is important to solve the problem of entity ambiguity. Entity linking is a good solution for entity disambiguation. It links one entity to one entrance of a resource to help users grasp the actual meaning of this entity. For the reason that traditional entity linking methods cannot acquire high performance in both accuracy and efficiency, we propose a novel entity linking algorithm. This algorithm is mainly divided into three steps. It first generates candidate entities for each mention in documents via heuristic-based rule. Then we leverage the relationship between entities in the knowledge base and use them to construct a semantic entity graph to connect all the related candidate entities. Finally we give a score to measure the possibility of one entity to be an entrance for one mention and choose the one with the highest score as the best assignment. Experimental results show that our entity linking algorithm performs well in both accuracy and efficiency.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Shen, W., Wang, J., Han, J.: Entity linking with a knowledge base: issues, techniques, and solutions. IEEE Trans. Knowl. Data Eng. 27(2), 443–460 (2015)
Cucerzan, S.: Large-scale named entity disambiguation based on wikipedia data. In: EMNLP-CoNLL, vol. 7, pp. 708–716 (2007)
Han, X., Sun, L.: A generative entity-mention model for linking entities with knowledge base. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 945–954. Association for Computational Linguistics (2011)
Zuo, Z., Kasneci, G., Gruetze, T., et al.: BEL: Bagging for Entity Linking. In: COLING, pp. 2075–2086 (2014)
Bunescu, R.C., Pasca, M.: Using encyclopedic knowledge for named entity disambiguation. In: EACL, vol. 6, pp. 9–16 (2006)
Daiber, J., Jakob, M., Hokamp, C., et al.: Improving efficiency and accuracy in multilingual entity extraction. In: Proceedings of the 9th International Conference on Semantic Systems, pp. 121–124. ACM (2013)
Schäfer, B., Bizer, C.: Exploiting DBpedia for graph-based entity linking to wikipedia. FakultätfürWirtschaftsinformatik und Wirtschaftsmathematik (2014)
Alhelbawy, A., Gaizauskas, R.: Named entity disambiguation using HMMs. In: 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), vol. 3, pp. 159–162. IEEE (2013)
Hoffart, J., Yosef, M.A., Bordino, I., et al.: Robust disambiguation of named entities in text. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 782–792. Association for Computational Linguistics (2011)
Hachey, B., Radford, W., Curran, J.R.: Graph-based named entity linking with wikipedia. In: Bouguettaya, A., Hauswirth, M., Liu, L. (eds.) WISE 2011. LNCS, vol. 6997, pp. 213–226. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-24434-6_16
Ceccarelli, D., Lucchese, C., Orlando, S., et al.: Learning relatedness measures for entity linking. In: Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, pp. 139–148. ACM (2013)
Usbeck, R., Ngomo, A.C.N., Röder, M., et al.: AGDISTIS-agnostic disambiguation of named entities using Linked Open Data. In: ECAI, pp. 1113–1114 (2014)
Sil, A., Florian, R.: The IBM systems for trilingual entity discovery and linking at TAC 2015. In: Proceedings of Text Analysis Conference (TAC 2015) (2015)
Kalloubi, F., Nfaoui, E.H., El Beqqali, O.: Graph based tweet entity linking using DBpedia. In: 2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA), pp. 501–506. IEEE (2014)
Bhattacharya, I., Getoor, L.: Online collective entity resolution. In: Proceedings of the National Conference on Artificial Intelligence, Menlo Park, CA, vol. 22, no. 2, p. 1606. MIT Press, Cambridge, AAAI Press, London (2007)
Ratinov, L., Roth, D., Downey, D., et al.: Local and global algorithms for disambiguation to wikipedia. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 1375–1384. Association for Computational Linguistics (2011)
Blanco, R., Ottaviano, G., Meij, E.: Fast and space-efficient entity linking for queries. In: Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, pp. 179–188. ACM (2015)
Nguyen, D.B., Hoffart, J., Theobald, M., et al.: AIDA-light: high-throughput named-entity disambiguation. In: LDOW (2014)
Luo, G., Huang, X., Lin, C.Y., et al.: Joint named entity recognition and disambiguation. In: Proceedings of EMNLP (2015)
Ji, H., Nothman, J., Hachey, B.: Overview of TAC-KBP2014 entity discovery and linking tasks. In: Proceedings of Text Analysis Conference (TAC 2014) (2014)
Acknowledgements
The research in this paper is supported by National Natural Science Foundation of China (No. 61300114, 61572151), CCF-Tencent Open Fund (No. IAGR20160109), HIT-Tecncent (No. AGR201601).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zheng, G., Liu, M., Liu, B. (2017). Collective Entity Linking Based on DBpedia. In: Li, J., Zhou, M., Qi, G., Lao, N., Ruan, T., Du, J. (eds) Knowledge Graph and Semantic Computing. Language, Knowledge, and Intelligence. CCKS 2017. Communications in Computer and Information Science, vol 784. Springer, Singapore. https://doi.org/10.1007/978-981-10-7359-5_8
Download citation
DOI: https://doi.org/10.1007/978-981-10-7359-5_8
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-7358-8
Online ISBN: 978-981-10-7359-5
eBook Packages: Computer ScienceComputer Science (R0)