Provenance Management for Data Exploration

  • Conference paper
Data Integration in the Life Sciences (DILS 2010)

Part of the book series: Lecture Notes in Computer Science (LNBI, volume 6254)

Abstract

Computing has been an enormous accelerator for science and industry alike, and it has led to an information explosion in many fields. The unprecedented volume of data acquired by sensors, derived by simulations and analysis processes, and shared on the Web opens up new opportunities, but it also creates many challenges in managing and analyzing these data.

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Freire, J. (2010). Provenance Management for Data Exploration. In: Lambrix, P., Kemp, G. (eds) Data Integration in the Life Sciences. DILS 2010. Lecture Notes in Computer Science, vol 6254. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15120-0_1

  • DOI: https://doi.org/10.1007/978-3-642-15120-0_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15119-4

  • Online ISBN: 978-3-642-15120-0

  • eBook Packages: Computer Science (R0)
