Skip to main content

Data Provenance in Citizen Science Databases

  • Conference paper
  • First Online:
  • 1246 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 909))

Abstract

Today, more and more scientific groups are developing citizen science applications. Citizen science is a relatively new domain of science that has already proved to be as beneficial as classical science. One of the major challenges citizen science face is the data quality assurance. It uses several techniques to verify the data quality based on expert evaluation, voting systems, etc. Data provenance is used in many scientific systems and provides reliable mechanism for tracking data history. It includes history of origin, changes, and all interactions between different parts of data. Data provenance by itself has many types such as “Why provenance”, “When provenance”, and “What provenance”. The purpose of this work is to build a prototype of a database with built-in data provenance. Several databases systems and models such as Relational databases, NoSQL databases are taken into consideration. Experiments are been conducted to test limitations of proposed prototype.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Ellul, C., Francis, L., Haklay, M.: A flexible database-centric platform for citizen science data capture. In: 2011 IEEE Seventh International Conference on E-Science Workshops (eScienceW) (2011)

    Google Scholar 

  2. McKinley, D.C., et al.: Citizen science can improve conservation science, natural resource management, and environmental protection. Biol. Conserv. 208, 15–28 (2017)

    Article  Google Scholar 

  3. Wiggins, A., He, Y.: Community-based data validation practices in citizen science. In: CSCW (2016)

    Google Scholar 

  4. Sheppard, S.A., Wiggins, A., Terveen, L.: Capturing quality: retaining provenance for curated volunteer monitoring data. In: Proceedings of the 17th ACM Conference on Computer Supported Cooperative Work and Social Computing (2014)

    Google Scholar 

  5. Memarsadeghi, N.: Citizen science [Guest editors’ introduction]. Comput. Sci. Eng. 17(4), 8–10 (2015)

    Article  Google Scholar 

  6. Bonney, R., et al.: Next steps for citizen science. Science 343(6178), 1436–1437 (2014)

    Article  Google Scholar 

  7. Understanding Citizen Science and Environmental Monitoring. Final Report. https://www.ceh.ac.uk/sites/default/files/citizensciencereview.pdf

  8. Jambeck, J.R., Johnsen, K.: Citizen-based litter and marine debris data collection and mapping. Comput. Sci. Eng. 17(4), 20–26 (2015)

    Article  Google Scholar 

  9. Dickinson, J.L., Zuckerberg, B., Bonter, D.N.: Citizen Science as an ecological research tool: challenges and benefits. Ann. Rev. Ecol. Evol. Syst. 41, 149–172 (2010)

    Article  Google Scholar 

  10. Danielsen, F., et al.: A multicountry assessment of tropical resource monitoring by local communities. Bioscience 64(3), 236–251 (2014)

    Article  Google Scholar 

  11. Smith, A., Lynn, S., Lintott, C.J.: Human Computation and Crowdsourcing: Works in Progress and Demonstrations (2013)

    Google Scholar 

  12. Buneman, P., Khanna, S., Wang-Chiew, T.: Why and where: a characterization of data provenance. In: Van den Bussche, J., Vianu, V. (eds.) ICDT 2001. LNCS, vol. 1973, pp. 316–330. Springer, Heidelberg (2001). https://doi.org/10.1007/3-540-44503-X_20

    Chapter  Google Scholar 

  13. Wang, Y.R., Madnick, S.E.: A polygen model for heterogeneous database systems: the source tagging perspective (1990)

    Google Scholar 

  14. Woodruff, A., Stonebraker, M.: Supporting fine-grained data lineage in a database visualization environment. In: 1997 Proceedings of the 13th International Conference on Data Engineering (1997)

    Google Scholar 

  15. Ioannou, E., Garofalakis, M.: Query analytics over probabilistic databases with unmerged duplicates. IEEE Trans. Knowled. Data Eng. 27(8), 2245–2260 (2015)

    Article  Google Scholar 

  16. Ioannou, E., Garofalakis, M.: Query analytics over probabilistic databases with unmerged duplicates. IEEE Trans. Knowl. Data Eng. 27(8), 2245–2260 (2015)

    Article  Google Scholar 

  17. Stonebraker, M.: SQL databases v. NoSQL databases. Commun. ACM 53(4), 10–11 (2010)

    Article  Google Scholar 

  18. Kulkarni, D.: A fine-grained access control model for key-value systems. In: Proceedings of the Third ACM Conference on Data and Application Security and Privacy (2013)

    Google Scholar 

  19. Kitchenham, B., et al.: Systematic literature reviews in software engineering – a tertiary study. Inf. Softw. Technol. 52, 792–805 (2010)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ajantha Dahanayake .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Tiufiakov, N., Dahanayake, A., Zudilova, T. (2018). Data Provenance in Citizen Science Databases. In: Benczúr, A., et al. New Trends in Databases and Information Systems. ADBIS 2018. Communications in Computer and Information Science, vol 909. Springer, Cham. https://doi.org/10.1007/978-3-030-00063-9_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-00063-9_23

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-00062-2

  • Online ISBN: 978-3-030-00063-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics