Skip to main content

Limits to the Pursuit of Reproducibility: Emergent Data-Scarce Domains of Science

  • Conference paper
  • First Online:
Transforming Digital Worlds (iConference 2018)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10766))

Included in the following conference series:

Abstract

Recommendations and interventions to promote reproducibility in science have so far largely been formulated in the context of well-established domains characterized by data- and computationally-intensive methods. However, much promising research occurs in little data domains that are emergent and experience data scarcity. This paper presents a longitudinal study of such a domain, deep subseafloor biosphere research. Two important challenges this domain faces in establishing itself are increasing production and circulation of data, and strengthening relationships between domain researchers. Some potential interventions to promote reproducibility may also help the domain to establish itself. However, other potential interventions could profoundly damage the domain’s long-term prospects of maturation by impeding production of new data and undermining critical relationships between researchers. This paper challenges the dominant framing of the pursuit of reproducible science as identifying, and overcoming, barriers to reproducibility. Instead, those interested in pursuing reproducibility in a domain should take into account multiple aspects of that domain’s epistemic culture to avoid negative unintended consequences. Further, pursuing reproducibility is premature for emergent, data-scarce domains: scarce resources should instead be invested to help these domains to mature, for instance by addressing data scarcity.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Borgman, C.L.: Big Data, Little Data, No Data: Scholarship in the Networked World. The MIT Press, Cambridge (2015)

    Google Scholar 

  2. Vitale, C.R.: Is research reproducibility the new data management for libraries? Bull. Assoc. Inf. Sci. Technol. 42(3), 38–41 (2016)

    Google Scholar 

  3. Baker, M.: 1,500 scientists lift the lid on reproducibility. Nat. News 533(7604), 452 (2016)

    Article  Google Scholar 

  4. Pellizzari, E., Lohr, K.N., Blatecky, A., Creel, D.: Reproducibility: A Primer on Semantics and Implications for Research, 1st edn. RTI Press/RTI International, Research Triangle Park (2017)

    Book  Google Scholar 

  5. Stodden, V., Leisch, F., Peng, R.D. (eds.): Implementing Reproducible Research. CRC Press, Boca Raton (2014)

    Google Scholar 

  6. Knorr-Cetina, K.: Epistemic Cultures: How the Sciences Make Knowledge. Harvard University Press, Cambridge (1999)

    Google Scholar 

  7. Lenoir, T.: Instituting Science: The Cultural Production of Scientific Disciplines. Stanford University Press, Stanford (1997)

    Google Scholar 

  8. Wallis, J.C., Rolando, E., Borgman, C.L.: If we share data, will anyone use them? Data sharing and reuse in the long tail of science and technology. PLoS ONE 8(7), e67332 (2013)

    Article  Google Scholar 

  9. Stodden, V.: Resolving irreproducibility in empirical and computational research. IMS Bull. Online (2013)

    Google Scholar 

  10. Ram, K., Marwick, B.: Building towards a future where reproducible, open science is the norm. In: Kitzes, J., Turek, D., Deniz, F. (eds.) The Practice of Reproducible Research: Case Studies and Lessons from the Data-Intensive Sciences, pp. 69–78. University of California Press, Oakland (2018)

    Google Scholar 

  11. Ince, D.C., Hatton, L., Graham-Cumming, J.: The case for open computer programs. Nature 482(7386), 485–488 (2012)

    Article  Google Scholar 

  12. Stodden, V., et al.: Enhancing reproducibility for computational methods. Science 354(6317), 1240–1241 (2016)

    Article  Google Scholar 

  13. Kahneman, D.: A new etiquette for replication. Soc. Psychol. 45(4), 310 (2014)

    Google Scholar 

  14. Marwick, B.: Computational reproducibility in archaeological research: basic principles and a case study of their implementation. J. Archaeol. Method Theory 24(2), 424–450 (2017)

    Article  Google Scholar 

  15. Kitzes, J., Turek, D., Deniz, F. (eds.): The Practice of Reproducible Research: Case Studies and Lessons from the Data-Intensive Sciences. Univ of California Press, Oakland (2018)

    Google Scholar 

  16. Darch, P.T., Borgman, C.L.: Ship space to database: emerging infrastructures for studies of the deep subseafloor biosphere. PeerJ Comput. Sci. 2, e97 (2016)

    Article  Google Scholar 

  17. Hammersley, M., Atkinson, P.: Ethnography: Principles in Practice, 3rd edn. Routledge, London (2007). Reprinted

    Google Scholar 

  18. Darch, P.T., Borgman, C.L., Traweek, S., Cummings, R.L., Wallis, J.C., Sands, A.E.: What lies beneath?: knowledge infrastructures in the subseafloor biosphere and beyond. Int. J. Digit. Libr. 16(1), 61–77 (2015)

    Article  Google Scholar 

Download references

Acknowledgements

This work is funded by the Alfred P. Sloan Foundation (Awards #20113194, #201514001). Thank you to current members of UCLA Center for Knowledge Infrastructures (CKI) for comments on earlier drafts of this paper (Christine L. Borgman, Bernie Boscoe, Milena S. Golshan, Irene Pasquetto, and Michael J. Scroggins), to past members of CKI (Ashley E. Sands and Sharon Traweek) for discussion of ideas, and to Rebekah L. Cummings for assistance with data collection. Thank you also to the C-DEBI and IODP personnel who were observed and interviewed.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Peter T. Darch .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Darch, P.T. (2018). Limits to the Pursuit of Reproducibility: Emergent Data-Scarce Domains of Science. In: Chowdhury, G., McLeod, J., Gillet, V., Willett, P. (eds) Transforming Digital Worlds. iConference 2018. Lecture Notes in Computer Science(), vol 10766. Springer, Cham. https://doi.org/10.1007/978-3-319-78105-1_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-78105-1_21

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-78104-4

  • Online ISBN: 978-3-319-78105-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics