Abstract
The importance of tracking the provenance of electronic data becomes apparent when data set providers need to also provide metadata describing where the data came from. This need has driven the development of a practical oceanographic data provenance system at the Monterey Bay Aquarium Research Institute. MBARI’s Shore Side Data System is designed to manage data collected, processed, and archived from oceanographic observatories. We describe the provenance tracking aspects of this system and the lessons learned from its implementation in an operational environment.
Chapter PDF
Similar content being viewed by others
References
Glenn, S., Schofield, O.: Observing the Oceans from the COOL Room: Our History, Experience, and Opinions. Oceanography 16(4), 37–52 (2003)
Baptista, A., Howe, B., Freire, J., Maier, D., Silva, C.T.: Scientific Exploration in the Era of Ocean Observatories. Computing in Science and Engineering 10(3), 53–58 (2008)
Freire, J., Silva, C.T., Callahan, S.P., Santos, E., Scheidegger, C.E., Vo, H.T.: Managing Rapidly-Evolving Scientific Workflows. In: Moreau, L., Foster, I. (eds.) IPAW 2006. LNCS, vol. 4145, pp. 10–18. Springer, Heidelberg (2006)
Chapman, A., Jagadish, H.V.: Issues in Building Practical Provenance Systems. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering 30(4), 38–43 (2007)
MOOS: Monterey Ocean Observing System, http://www.mbari.org/moos/
SIAM: Software infrastructure and application for MOOS, http://www.mbari.org/moos/siam/siam.htm
MUSE: MOOS Upper-Water-Column Science Experiment, http://www.mbari.org/muse/
FGDC: Federal Geographic Data Committee, http://www.fgdc.gov/
Graybeal, J., Gomes, K., McCann, M., Schlining, B., Schramm, R., Wilkin, D.: MBARI’s Operational, extensible data management for ocean observatories. In: The Third International Workshop on Scientific Use of Submarine Cables and Related Technologies, Tokyo, pp. 288–292 (2003)
The Shore Side Data System, http://www.mbari.org/ssds/
Agile software development, http://en.wikipedia.org/wiki/Agile_software_development
NetCDF: Network Common Data Form, http://www.unidata.ucar.edu/software/netcdf/
OPeNDAP: Open-source Project for a Network Data Access Protocol, http://www.opendap.org/
CIMT: Center for Integrated Marine Technologies, http://cimt.ucsc.edu/
Gomes, K., OReilly, T., Graybeal, J.: Issues in data management in observing systems and lessons learned. In: Proceedings of the Marine Technology Society/Institute of Electrical and Electronics Engineers Oceans Conference, Boston, Massachusetts (2006)
PUCK: Programmable Underwater Connector with Knowledge, http://www.mbari.org/pw/puck.htm
NetCDF Climate and Forecast (CF) Metadata Convention, http://cf-pcmdi.llnl.gov/
Moreau, L., Groth, P., Miles, S., Vazquez-Salceda, J., Ibbotson, J., Jiang, S., Munroe, S., Rana, O., Schreiber, A., Tan, V., Varga, L.: The Provenance of Electronic Data. Communications of the ACM 51(4), 52–58 (2008)
Simmhan, Y.L., Plale, B., Gannon, D.: A survey of Data Provenance in e-science. SIGMOD 34(3), 31–36 (2005)
Foster, I., Vockler, J., Wilde, M., Yong, Z.: Chimera: a virtual data system for representing, querying, and automating data derivation. In: Proceedings of 14th International Conference on Scientific and Statistical Database Management, 2002, pp. 37–46 (2002)
Frew, J., Bose, R.: Earth System Science Workbench: A Data Management Infrastructure for Earth Science Products. In: Proceedings of the 13th International Conference on Scientific and Statistical Database Management, Fairfax, VA, pp. 180–189 (2001)
Abbott, M., Sears, C.: The Always-Connected World and Its Impacts on Ocean research. Oceanography 19(1), 14–21 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
McCann, M., Gomes, K. (2008). Oceanographic Data Provenance Tracking with the Shore Side Data System. In: Freire, J., Koop, D., Moreau, L. (eds) Provenance and Annotation of Data and Processes. IPAW 2008. Lecture Notes in Computer Science, vol 5272. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89965-5_30
Download citation
DOI: https://doi.org/10.1007/978-3-540-89965-5_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89964-8
Online ISBN: 978-3-540-89965-5
eBook Packages: Computer ScienceComputer Science (R0)