Abstract
In this paper, we study the problem of personal name disambiguation (NED). We develop a framework to address the three challenges in personal name disambiguation: (1) identification of referential ambiguity; (2) identification of lexical ambiguity; and (3) predicting the NIL value, that is the value when a named entity cannot be mapped to a knowledge base. Our framework includes extractor, searcher and disambiguator. Experimental results evaluated on real-world data sets show that our framework and algorithm provide accuracy in personal name linking up to 92%, which is higher than the accuracy of previously developed algorithms.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Curcezan, S.: Large-scale named entity disambiguation based on Wikipedia data. In: Proceedings EMNLP-CoNLL, pp. 708–716 (2007)
Bunescu, R., Pasca, M.: Using encyclopedic knowledge for named entity disambiguation. In: Proceedings of EACL, pp. 9–16 (2006)
Shen, W., Wang, J., Luo, P., Wang, M.: Linden: linking named entities with knowledge base via semantic knowledge. In: Proceedings of WWW 2012, pp. 449–458. ACM (2012)
Hachey, B., Radford, W., Nothman, J., Honnibal, M., Curran, J.R.: Evaluating entity linking with wik-ipedia. Artif. Intell. 194, 130–150 (2013)
Christen, P., Churches, T., Zhu, J.X.: Probabilistic name and address cleaning and standardisation (2002)
Arasu, A., Kaushik, R.: A grammar-based entity representation framework for data cleaning. In: Proceedings of SIGMOD, pp. 233–244. ACM (2009)
Dredze, M., McNamee, P., Rao, D., Gerber, A., Finin, T.: Entity disambiguation for knowledge base population. In: Proceedings of COLING, pp. 277–285 (2010)
Li, Y., Wang, C., Han, F., Han, J., Roth, D., Yan, X.: Mining evidences for named entity disambiguation. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD, pp. 1070–1078 (2013)
Yang, W.: Identifying syntactic differences between two programs. Softw. Pract. Exper. 21(7), 739–755 (1991)
Zhai, Y., Liu, B.: Structured data extraction from the web based on partial tree alignment. IEEE Trans. Knowl. Data Eng. 18(12), 1614–1628 (2006)
Mann, G.S., Yarowsky, D.: Unsupervised personal name disambiguation. In: Proceedings of Conference on Natural Language Learning at HLT-NAACL- Volume 4, CONLL, pp. 33–40 (2003)
Demidova, E., Oelze, I., Nejdl, W.: Aligning freebase with the YAGO ontology. In: Proceedings of ACM International Conference on Information & Knowledge Management, CIKM, pp. 579–588 (2013)
Han, X., Zhao, J.: Web personal name disambiguation based on reference entity tables mined from the web. In: Proceedings of International Workshop on Web Information and Data Management, WIDM, pp. 75–82 (2009)
Raman, V., Hellerstein, J.M.: Potter’s wheel: an interactive data cleaning system. In: Proceedings of the International Conference on Very Large Data Bases, VLDB, pp. 381–390. Morgan Kaufmann Publishers (2001)
Lehman, J., et al.: Dbpedia–a large-scale, multilingual knowledge base extracted from wikipedia. Semant. Web 6(2), 167–195 (2013)
WordNet. https://wordnet.princeton.edu/
Wikipedia. https://en.wikipedia.org/wiki/Main_Page
FreeBase. https://en.wikipedia.org/wiki/Freebase
DBPedia. http://wiki.dbpedia.org/
Bilenko, M., Mooney, R., Cohen, W., Ravikumar, P., Fienberg, S.: Adaptive name matching in information integration. IEEE Intell. Syst. 18(5), 16–23 (2003)
Elmagarmid, A., Ipeirotis, P., Verykios, V.: Duplicate record detection: a survey. IEEE Trans. Knowl. Data Eng. 19(1), 1–16 (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Georgieva, L., Buatongkue, S. (2019). A New Framework for Personal Name Disambiguation. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Intelligent Computing. SAI 2018. Advances in Intelligent Systems and Computing, vol 858. Springer, Cham. https://doi.org/10.1007/978-3-030-01174-1_76
Download citation
DOI: https://doi.org/10.1007/978-3-030-01174-1_76
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01173-4
Online ISBN: 978-3-030-01174-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)