Skip to main content

Preliminary Analysis of Data Sources Interlinking

Data Searchery: A Case Study

  • Conference paper
  • First Online:
Theory and Practice of Digital Libraries -- TPDL 2013 Selected Workshops (TPDL 2013)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 416))

Included in the following conference series:

Abstract

The novel e-Science’s data-centric paradigm has proved that interlinking publications and research data objects coming from different realms and data sources (e.g. publication repositories, data repositories) makes dissemination, re-use, and validation of research activities more effective. Scholarly Communication Infrastructures (SCIs) are advocated for bridging such data sources by offering an overlay of services for identification, creation, and navigation of relationships among objects of different nature. Since realization and maintenance of such infrastructures is in general very cost-consuming, in this paper we propose a lightweight approach for “preliminary analysis of data source interlinking” to help practitioners at evaluating whether and to what extent realizing them can be effective. We present Data Searchery, a configurable tool delivering a service for relating objects across data sources, be them publications or research data, by identifying relationships between their metadata descriptions in real-time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Mendeley, http://www.mendeley.com/.

  2. 2.

    ORCID, http://orcid.org/.

  3. 3.

    The Research Data Alliance, http://europe.rd-alliance.org/.

  4. 4.

    The DataCite Initiative, http://datacite.org.

  5. 5.

    NARCIS, http://www.narcis.nl/.

  6. 6.

    The OpenAIRE project, http://www.openaire.eu/en.

  7. 7.

    The DRIVER repository, http://www.driver-repository.eu/.

  8. 8.

    Google Scholar, http://scholar.google.com.

  9. 9.

    Apache Solr, http://lucene.apache.org/solr/.

  10. 10.

    Elasticsearch, http://www.elasticsearch.org/.

  11. 11.

    D-Net Software Toolkit, http://www.d-net.research-infrastructures.eu.

  12. 12.

    WhatIzIt - EBI, http://www.ebi.ac.uk/webservices/whatizit/.

  13. 13.

    PANGAEA - Data Publisher for Earth & Environmental Science, http://www.pangaea.de.

  14. 14.

    Figshare.com, http://figshare.com/.

References

  1. Bourne, P.E., Clark, T.W., Dale, R., de Waard, A., Herman, I., Hovy, E.H., Shotton, D.: Improving the future of research communications and e-scholarship (Dagstuhl perspectives workshop 11331). Dagstuhl Manifestos 1(1), 41–60 (2012)

    Google Scholar 

  2. Hogenaar, A.: What is an enhanced publication? http://www.openaire.eu/en/component/content/article/76-highlights/344-a-short-introduction-to-enhanced-publications

  3. Gray, J.: A transformed scientific method. In: Hey, T., Tansley, S., Tolle, K. (eds.) The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research, Redmond (2009)

    Google Scholar 

  4. Reilly, S., Schallier, W., Schrimpf, S., Smit, E., Wilkinson, M.: Report on integration of data and publications. ODE Opportunities for Data Exchange

    Google Scholar 

  5. Callaghan, S., Donegan, S.: Making data a first class scientific output: data citation and publication by NERC’s environmental data centres. Int. J. Digit. Curation 7(1), 107–113 (2012)

    Article  Google Scholar 

  6. Chavan, V., Penev, L.: The data paper: a mechanism to incentivize data publishing in biodiversity science. BMC Bioinform. 12(Suppl 15), S2 (2011)

    Article  Google Scholar 

  7. Hoogerwerf, M., Lösch, M., Schirrwagen, J., Callaghan, S., Manghi, P., Iatropoulou, K., Keramida, D., Rettberg, N.: Linking data and publications: towards a cross-disciplinary approach. Int. J. Digit. Curation 8(1), 244–254 (2013)

    Article  Google Scholar 

  8. Wallis, J.C., Rolando, E., Borgman, C.L.: If we share data, will anyone use them? data sharing and reuse in the long tail of science and technology. PLoS ONE 8(7), e67332 (2013)

    Article  Google Scholar 

  9. Castelli, D., Manghi, P., Thanos, C.: A vision towards scientific communication infrastructures - on bridging the realms of research digital libraries and scientific data centers. J. Digit. Libr. 13(3/4), 155–169 (2013)

    Article  Google Scholar 

  10. Manghi, P., Bolikowski, L., Manola, N., Shirrwagen, J., Smith, T.: Openaireplus: the European scholarly communication data infrastructure. D-Lib Mag. 18(9–10) (2012). http://www.bibsonomy.org/bibtex/23435c839e8f925c6ca94a4c2972015b1/dblp

  11. Manghi, P., Manola, N., Horstmann, W., Peters, D.: An infrastructure for managing EC funded research output - the openaire project. Grey J. (TGJ): Int. J. Grey Lit. 6(1), 31–40 (2010)

    Google Scholar 

  12. Attwood, T.K., Kell, D.B., McDermott, P., Marsh, J., Pettifer, S.R., Thorne, D.: Utopia documents: linking scholarly literature with research data. Bioinformatics 26(18), 568–574 (2010)

    Article  Google Scholar 

  13. Bruce, T.R., Hillmann, D.: The Continuum of Metadata Quality: Defining, Expressing, Exploiting. American Library Association, Chicago (2004)

    Google Scholar 

  14. Tani, A., Candela, L., Castelli, D.: Dealing with metadata quality: the legacy of digital library efforts. Inf. Process. Manag. 49(6), 1194–1205 (2013)

    Article  Google Scholar 

  15. Feijen, M., Horstmann, W., Manghi, P., Robinson, M., Russell, R.: DRIVER: Building the Network for Accessing Digital Repositories across Europe. In: Ariadne Magazine, vol. 53, pp. 1–4, Ariadne (2007). http://puma.isti.cnr.it/dfdownload.php?ident=/cnr.isti/2007-A0-047

  16. Manghi, P., Mikulicic, M., Candela, L., Castelli, D., Pagano, P.: Realizing and maintaining aggregative digital library systems: D-net software toolkit and oaister system. D-Lib Mag. 16(3/4) (2010). http://www.bibsonomy.org/bib/bibtex/2d5fb59f6245dc730c4d86882d7bfb18d/dblp

  17. Berners-Lee, T.: Linked data. http://www.w3.org/DesignIssues/LinkedData.html

  18. Wölger, S., Siorpaes, K., Bürger, T., Simperl, E., Thaler, S., Hofer, C.: A survey on data interlinking methods. Technical report, Semantic Technology Institute (STI), University of Insbruck (March 2011)

    Google Scholar 

  19. Nikolaidou, P.T., Shaeles, S.N., Karakos, A.S.: MusicPedia: retrieving and merging-interlinking music metadata. Int. J. Comput. 3(8) (2011)

    Google Scholar 

  20. Rinke Hoekstra, P.G.: Linkitup: Link discovery for research data. In: Proceedings of the AAAI Fall Symposium on Discovery Informatics (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Andrea Mannocci or Paolo Manghi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Mannocci, A., Manghi, P. (2014). Preliminary Analysis of Data Sources Interlinking. In: Bolikowski, Ł., Casarosa, V., Goodale, P., Houssos, N., Manghi, P., Schirrwagen, J. (eds) Theory and Practice of Digital Libraries -- TPDL 2013 Selected Workshops. TPDL 2013. Communications in Computer and Information Science, vol 416. Springer, Cham. https://doi.org/10.1007/978-3-319-08425-1_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-08425-1_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-08424-4

  • Online ISBN: 978-3-319-08425-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics