Data Retrieval = Text Retrieval?

  • Maryam BugajeEmail author
  • Gobinda Chowdhury
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10766)


Due to the comparatively more recent emergence of data retrieval systems than text-based search engines, the former have still yet to achieve similar maturity in terms of standards and techniques. Most of the existing solutions for data retrieval are more or less makeshift adaptations of text retrieval systems rather than purpose-built solutions specially designed to cater to the particular peculiarities, subtleties, and unique requirements of research datasets. In this paper we probe into the key differences between text and data retrieval that bear practical relevance to the retrieval question; these differences we demonstrate by evaluating some representative examples of research data repositories as well as presenting findings from previous studies.


Data retrieval Text retrieval Research data management Research data repositories 


  1. 1.
    Borgman, C.: Big Data, Little Data, No Data: Scholarship in the networked world. MIT Press, Cambridge (2015)Google Scholar
  2. 2.
    Weber, A., Piesche, C.: Requirements on long-term accessibility and preservation of research results with particular regard to their provenance. ISPRS Int. J. Geo-Inf. 5, 49 (2016)CrossRefGoogle Scholar
  3. 3.
    Bugaje, M., Chowdhury, G.: Is data retrieval different from text retrieval? An exploratory study. In: Choemprayong, S., Crestani, F., Cunningham, S.J. (eds.) ICADL 2017. LNCS, vol. 10647, pp. 97–103. Springer, Cham (2017). CrossRefGoogle Scholar
  4. 4.
    Borgman, C.L.: The conundrum of sharing research data. J. Am. Soc. Inf. Sci. Technol. 63(6), 1059–1078 (2012)CrossRefGoogle Scholar
  5. 5.
    Borgman, C.L., Wallis, J.C., Mayernik, M.S.: Who’s got the data? Interdependencies in science and technology collaborations. Comput. Support. Coop. Work 21(6), 485–523 (2012)CrossRefGoogle Scholar
  6. 6.
    The data harvest: how sharing research data can yield knowledge, jobs and growth. An RDA Europe report, December 2014. Accessed 06 Nov 2017
  7. 7.
    Boru, D., Kliazovich, D., Granelli, F., Bouvry, P., Zomaya, A.Y.: Energy-efficient data replication in cloud computing datacenters. Clust. Comput. 18(1), 385–402 (2015)CrossRefGoogle Scholar
  8. 8.
    Chowdhury, G.G.: Sustainability of Scholarly Information. Facet Publishing, London (2014)Google Scholar
  9. 9.
    Chowdhury, G.G.: How to improve the sustainability of digital libraries and information services? J. Assoc. Inf. Sci. Technol. 67(10), 2379–2391 (2016)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Faculty of Engineering and Environment, iSchoolNorthumbria UniversityNewcastleUK

Personalised recommendations