Abstract
Reproducibility is a fundamental requirement of the scientific process since it enables outcomes to be replicated and verified. Computational scientific experiments can benefit from improved reproducibility for many reasons, including validation of results and reuse by other scientists. However, designing reproducible experiments remains a challenge and hence the need for developing methodologies and tools that can support this process. Here, we propose a conceptual model for reproducibility to specify its main attributes and properties, along with a framework that allows for computational experiments to be findable, accessible, interoperable, and reusable. We present a case study in ecological niche modeling to demonstrate and evaluate this framework.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Baker, M.: 1,500 scientists lift the lid on reproducibility. Nature 533(7604), 452–454 (2016)
Borregaard, M.K., Hart, E.M.: Towards a more reproducible ecology. Ecography 39(4), 349–353 (2016)
Brinckman, A., et al.: Computing environments for reproducibility: capturing the “whole tale”. Future Gener. Comput. Syst. 94, 854–867 (2019)
Chirigati, F., Rampin, R., Shasha, D., Freire, J.: ReproZip: computational reproducibility with ease. In: Proceedings of the 2016 International Conference on Management of Data, pp. 2085–2088. ACM (2016)
De Prins, J.: Global open biodiversity data: future vision of fair biodiversity data access, management, use and stewardship. Biodivers. Inf. Sci. Stand. 3, e37190 (2019)
Deelman, E., et al.: Workflows and e-Science: an overview of workflow system features and capabilities. Future Gener. Comput. Syst. 25(5), 528–540 (2009)
Freire, J., Chirigati, F.: Provenance and the different flavors of computational reproducibility. Bull. Tech. Comm. Data Eng. 41(1), 15–26 (2018)
Goble, C., Cohen-Boulakia, S., et al.: Fair computational workflows (2019). https://doi.org/10.5281/zenodo.3268653
Ivie, P., Thain, D.: Reproducibility in scientific computing. ACM Comput. Surv. (CSUR) 51(3), 63 (2018)
Madduri, R., Chard, K., D’Arcy, M., et al.: Reproducible big data science: a case study in continuous fairness. PloS one 14(4), e0213013 (2019)
Mondelli, M.L., et al.: BioWorkbench: a high-performance framework for managing and analyzing bioinformatics experiments. PeerJ 6, e5551 (2018)
Pimentel, J.F., Murta, L., Braganholo, V., Freire, J.: noWorkflow: a tool for collecting, analyzing, and managing provenance from python scripts. Proc. VLDB Endow. 10, 1841–1844 (2017)
Sánchez-Tapia, A., et al.: Model-R: a framework for scalable and reproducible ecological niche modeling. In: Mocskos, E., Nesmachnow, S. (eds.) CARLA 2017. CCIS, vol. 796, pp. 218–232. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-73353-1_15
Stodden, V., et al.: Toward reproducible computational research: an empirical analysis of data and code policy adoption by journals. PLoS One 8(6), e67111 (2013)
Thomas, P., et al.: Sharing and preserving computational analyses for posterity with encapsulator. Comput. Sci. Eng. 20(4), 111 (2018)
Wilkinson, M.D., Dumontier, M., Aalbersberg, I.J., et al.: The fair guiding principles for scientific data management and stewardship. Sci. Data 3, 160018 (2016)
Acknowledgments
This work was supported by CNPq, CAPES, and FAPERJ. We thank Marinez Ferreira, Andrea Sánchez-Tapia and Sara Mortara, from the Botanic Garden of Rio de Janeiro, for their contributions.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Mondelli, M.L., Townsend Peterson, A., Gadelha, L.M.R. (2019). Exploring Reproducibility and FAIR Principles in Data Science Using Ecological Niche Modeling as a Case Study. In: Guizzardi, G., Gailly, F., Suzana Pitangueira Maciel, R. (eds) Advances in Conceptual Modeling. ER 2019. Lecture Notes in Computer Science(), vol 11787. Springer, Cham. https://doi.org/10.1007/978-3-030-34146-6_3
Download citation
DOI: https://doi.org/10.1007/978-3-030-34146-6_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-34145-9
Online ISBN: 978-3-030-34146-6
eBook Packages: Computer ScienceComputer Science (R0)