Skip to main content

NAYOSE: A System for Reference Disambiguation of Proper Nouns Appearing on Web Pages

  • Conference paper
Information Retrieval Technology (AIRS 2006)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4182))

Included in the following conference series:

Abstract

We are developing a reference disambiguation system called NAYOSE System. In order to cope with the case the same person name or place name appears over two or more Web pages, we propose a system classifying each page into a cluster which corresponds to the same entity in the real world. For this purpose, we propose two new methods involving algorithms to classify these pages. In our evaluation, the combination of local text matching and named entities matching outperformed the previous baseline algorithm used in simple document classification method by 0.22 in the overall F-measure.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bagga, A., Baldwin, B.: Entity-Based Cross-Document Coreferencing Using the Vector Space Model. In: Proceedings of COLING-ACL 1998, pp. 79–85 (1998)

    Google Scholar 

  2. Bekkerman, R., MacCallum, A.: Disambiguating Web Appearances of People in a Social Network. In: Proceedings of WWW 2005, pp. 463–470 (2005)

    Google Scholar 

  3. Larsen, B., Aone, C.: Fast and effective text mining using linear-time document clustering. In: Proceedings of the 5th ACM SIGKDD, pp. 16–22 (1999)

    Google Scholar 

  4. Mann, G.S., Yarowsky, D.: Unsupervised Personal Name Disambiguation. In: Proceedings of CoNLL 2003, pp. 33–40 (2003)

    Google Scholar 

  5. Morton, T.S.: Coreference for NLP Applications. In: Proceedings of ACL 2000, pp. 173–180 (2000)

    Google Scholar 

  6. Niu, C., Li, W., Srihari, R.K.: Weakly Supervised Learning for Cross-document Person Name Disambiguation Supported by Information Extraction. In: Proceedings of ACL 2004, pp. 598–605 (2004)

    Google Scholar 

  7. Wan, X., Gao, J., Li, M., Ding, B.: Person Resolution in Person Search Results: WebHawk. In: Proceedings of CIKM 2005, pp. 163–170 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ono, S., Yoshida, M., Nakagawa, H. (2006). NAYOSE: A System for Reference Disambiguation of Proper Nouns Appearing on Web Pages. In: Ng, H.T., Leong, MK., Kan, MY., Ji, D. (eds) Information Retrieval Technology. AIRS 2006. Lecture Notes in Computer Science, vol 4182. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11880592_26

Download citation

  • DOI: https://doi.org/10.1007/11880592_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-45780-0

  • Online ISBN: 978-3-540-46237-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics