Skip to main content

A New Framework for Personal Name Disambiguation

  • Conference paper
  • First Online:
  • 1277 Accesses

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 858))

Abstract

In this paper, we study the problem of personal name disambiguation (NED). We develop a framework to address the three challenges in personal name disambiguation: (1) identification of referential ambiguity; (2) identification of lexical ambiguity; and (3) predicting the NIL value, that is the value when a named entity cannot be mapped to a knowledge base. Our framework includes extractor, searcher and disambiguator. Experimental results evaluated on real-world data sets show that our framework and algorithm provide accuracy in personal name linking up to 92%, which is higher than the accuracy of previously developed algorithms.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Curcezan, S.: Large-scale named entity disambiguation based on Wikipedia data. In: Proceedings EMNLP-CoNLL, pp. 708–716 (2007)

    Google Scholar 

  2. Bunescu, R., Pasca, M.: Using encyclopedic knowledge for named entity disambiguation. In: Proceedings of EACL, pp. 9–16 (2006)

    Google Scholar 

  3. Shen, W., Wang, J., Luo, P., Wang, M.: Linden: linking named entities with knowledge base via semantic knowledge. In: Proceedings of WWW 2012, pp. 449–458. ACM (2012)

    Google Scholar 

  4. Hachey, B., Radford, W., Nothman, J., Honnibal, M., Curran, J.R.: Evaluating entity linking with wik-ipedia. Artif. Intell. 194, 130–150 (2013)

    Article  Google Scholar 

  5. Christen, P., Churches, T., Zhu, J.X.: Probabilistic name and address cleaning and standardisation (2002)

    Google Scholar 

  6. Arasu, A., Kaushik, R.: A grammar-based entity representation framework for data cleaning. In: Proceedings of SIGMOD, pp. 233–244. ACM (2009)

    Google Scholar 

  7. Dredze, M., McNamee, P., Rao, D., Gerber, A., Finin, T.: Entity disambiguation for knowledge base population. In: Proceedings of COLING, pp. 277–285 (2010)

    Google Scholar 

  8. Li, Y., Wang, C., Han, F., Han, J., Roth, D., Yan, X.: Mining evidences for named entity disambiguation. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD, pp. 1070–1078 (2013)

    Google Scholar 

  9. Yang, W.: Identifying syntactic differences between two programs. Softw. Pract. Exper. 21(7), 739–755 (1991)

    Article  Google Scholar 

  10. Zhai, Y., Liu, B.: Structured data extraction from the web based on partial tree alignment. IEEE Trans. Knowl. Data Eng. 18(12), 1614–1628 (2006)

    Article  Google Scholar 

  11. Mann, G.S., Yarowsky, D.: Unsupervised personal name disambiguation. In: Proceedings of Conference on Natural Language Learning at HLT-NAACL- Volume 4, CONLL, pp. 33–40 (2003)

    Google Scholar 

  12. Demidova, E., Oelze, I., Nejdl, W.: Aligning freebase with the YAGO ontology. In: Proceedings of ACM International Conference on Information & Knowledge Management, CIKM, pp. 579–588 (2013)

    Google Scholar 

  13. Han, X., Zhao, J.: Web personal name disambiguation based on reference entity tables mined from the web. In: Proceedings of International Workshop on Web Information and Data Management, WIDM, pp. 75–82 (2009)

    Google Scholar 

  14. Raman, V., Hellerstein, J.M.: Potter’s wheel: an interactive data cleaning system. In: Proceedings of the International Conference on Very Large Data Bases, VLDB, pp. 381–390. Morgan Kaufmann Publishers (2001)

    Google Scholar 

  15. Lehman, J., et al.: Dbpedia–a large-scale, multilingual knowledge base extracted from wikipedia. Semant. Web 6(2), 167–195 (2013)

    MathSciNet  Google Scholar 

  16. WordNet. https://wordnet.princeton.edu/

  17. Wikipedia. https://en.wikipedia.org/wiki/Main_Page

  18. FreeBase. https://en.wikipedia.org/wiki/Freebase

  19. DBPedia. http://wiki.dbpedia.org/

  20. Bilenko, M., Mooney, R., Cohen, W., Ravikumar, P., Fienberg, S.: Adaptive name matching in information integration. IEEE Intell. Syst. 18(5), 16–23 (2003)

    Article  Google Scholar 

  21. Elmagarmid, A., Ipeirotis, P., Verykios, V.: Duplicate record detection: a survey. IEEE Trans. Knowl. Data Eng. 19(1), 1–16 (2007)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to L. Georgieva .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Georgieva, L., Buatongkue, S. (2019). A New Framework for Personal Name Disambiguation. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Intelligent Computing. SAI 2018. Advances in Intelligent Systems and Computing, vol 858. Springer, Cham. https://doi.org/10.1007/978-3-030-01174-1_76

Download citation

Publish with us

Policies and ethics