Skip to main content

Doing More with Named Entities

Turning Text into a Linked Data Hub

  • Conference paper
  • First Online:
  • 896 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 416))

Abstract

The usability, disclosure and value of digitized full text collections can be improved by linking named entities or events in text to linked data [1]. These data can be used to obtain additional information and it can be used in queries for expressing conditions in terms of semantic relations. When the links are obtained automatically, we need disambiguation by users or by sophisticated algorithms. Providers of full text data will have to deal with all of these aspects. In this paper we discuss our plans and work in progress and propose a common approach to increase interoperability between text data from different providers. We feel that it should be the ambition of every provider of text collections to have as many named entities as possible in text identified with globally unique persistent identifiers, linked to one or more resource descriptions to guide users through the enormous amount of digitized text. Notice: This work reflects the personal view of the authors and does not yet reflect current policies of the KB.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Linked Data. http://www.w3.org/standards/semanticweb/data

  2. SPARQL. Query Language for RDF. http://www.w3.org/TR/rdf-sparql-query/

  3. SRU. Search and Retrieval via URL’s. http://www.loc.gov/standards/sru/

  4. DBpedia. http://dbpedia.org/About

  5. Freebase. http://www.freebase.com/

  6. VIAF. Virtual International Authority File. http://viaf.org

  7. SIWA. Schema for the Integration of Web Applications. http://www.kbresearch.nl/SIWA

  8. Europeana Newspapers. http://www.europeana-newspapers.eu/

  9. Stanford Named Entity Recognizer. http://nlp.stanford.edu/software/CRF-NER.shtml

  10. Sil, A., Cronin, E., et al.: Linking named entities in any database. In: EMNLP-CoNLL ‘12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning

    Google Scholar 

  11. Shen, W., Wang, J., et al.: LINDEN: linking named entities with knowledge base via semantic knowledge. In: WWW 2012, Lyon, France, 16–20 Apr 2012

    Google Scholar 

  12. Bunescu, R., Pasca, M.: Using encyclopedic knowledge for named entity disambiguation. In Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL-06), pp. 9–16, Trento, Italy (2006)

    Google Scholar 

  13. van Hooland, S., De Wilde, M.: Named-entity recognition, a gateway drug for cultural heritage collections to the linked data cloud ? Literary and Linguistic Computing, 01/2013

    Google Scholar 

  14. Godby, C.J., Hswe, P., et al.: Who’s who in your digital collection: developing a tool for name disambiguation and identity resolution. J. Chic. Colloq. Dig. Human. Comput. Sci. 1(2), 116–127 (2011)

    Google Scholar 

  15. MacKay, A.W.: Enriching the digital library experience: innovations with named entity recognition and geographic information system technologies (2008)

    Google Scholar 

  16. Sealinc. Socially-enriched access to linked cultural media. http://www.commit-nl.nl/projects/socially-enriched-access-to-linked-cultural-media

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Theo van Veen or Michel Koppelaar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

van Veen, T., Koppelaar, M. (2014). Doing More with Named Entities. In: Bolikowski, Ł., Casarosa, V., Goodale, P., Houssos, N., Manghi, P., Schirrwagen, J. (eds) Theory and Practice of Digital Libraries -- TPDL 2013 Selected Workshops. TPDL 2013. Communications in Computer and Information Science, vol 416. Springer, Cham. https://doi.org/10.1007/978-3-319-08425-1_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-08425-1_16

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-08424-4

  • Online ISBN: 978-3-319-08425-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics