Exploring Reproducibility and FAIR Principles in Data Science Using Ecological Niche Modeling as a Case Study

  • Conference paper
Advances in Conceptual Modeling (ER 2019)

Part of the book series: Lecture Notes in Computer Science (LNPSE, volume 11787)

Abstract

Reproducibility is a fundamental requirement of the scientific process since it enables outcomes to be replicated and verified. Computational scientific experiments can benefit from improved reproducibility for many reasons, including validation of results and reuse by other scientists. However, designing reproducible experiments remains a challenge, hence the need for methodologies and tools that support this process. Here, we propose a conceptual model for reproducibility that specifies its main attributes and properties, along with a framework that allows computational experiments to be findable, accessible, interoperable, and reusable (FAIR). We present a case study in ecological niche modeling to demonstrate and evaluate this framework.

Notes

  1. https://fairsharing.org/

  2. https://repositoryfinder.datacite.org/

  3. https://github.com/mmondelli/reproduceR

  4. https://github.com/Model-R/modelr_pkg

Acknowledgments

This work was supported by CNPq, CAPES, and FAPERJ. We thank Marinez Ferreira, Andrea Sánchez-Tapia and Sara Mortara, from the Botanic Garden of Rio de Janeiro, for their contributions.

Author information

Corresponding authors

Correspondence to Maria Luiza Mondelli or Luiz M. R. Gadelha Jr.

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Cite this paper

Mondelli, M.L., Townsend Peterson, A., Gadelha, L.M.R. (2019). Exploring Reproducibility and FAIR Principles in Data Science Using Ecological Niche Modeling as a Case Study. In: Guizzardi, G., Gailly, F., Suzana Pitangueira Maciel, R. (eds) Advances in Conceptual Modeling. ER 2019. Lecture Notes in Computer Science, vol 11787. Springer, Cham. https://doi.org/10.1007/978-3-030-34146-6_3

  • DOI: https://doi.org/10.1007/978-3-030-34146-6_3

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-34145-9

  • Online ISBN: 978-3-030-34146-6

  • eBook Packages: Computer Science, Computer Science (R0)
