Abstract
Managing scientific data requires tools that can track complex provenance information about digital resources and workflows. RDF triples are a convenient abstraction for combining independently generated factual statements, including statements about provenance [1]. Harvesting is a strategy for asynchronously acquiring distributed information for the purposes of aggregation and analysis [2]. Harvesting typically requires that information be temporally scoped and attributed to some creator or information source. An RDF triple asserts a fact without attributing it to any actor or period of time, so the abstraction must be extended to support typical harvesting scenarios. This paper compares standard, conventional, and non-standard means of extending RDF triples to associate them with attribution and timing information. It then considers the implications of these techniques for harvesting and presents some implementation sketches based on a journaling strategy.
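To make the problem concrete, the following is a minimal sketch of one way a journal could pair each triple with attribution and timing information so that a harvester can ask for everything recorded after a given point. It is an illustration under stated assumptions, not the paper's implementation: the names JournalEntry, Journal, assert_triple, and harvest, the integer timestamps, and the ex: identifiers are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple

# A bare RDF triple: (subject, predicate, object). It carries no attribution or time.
Triple = Tuple[str, str, str]


@dataclass(frozen=True)
class JournalEntry:
    """A triple extended with who asserted it and when (hypothetical structure)."""
    triple: Triple
    source: str      # identifier of the asserting actor or information source
    timestamp: int   # time of assertion, e.g. seconds since some epoch


class Journal:
    """Append-only log of attributed, timestamped triples (illustrative only)."""

    def __init__(self) -> None:
        self._entries: List[JournalEntry] = []

    def assert_triple(self, triple: Triple, source: str, timestamp: int) -> None:
        # Entries are only ever appended, never modified or removed.
        self._entries.append(JournalEntry(triple, source, timestamp))

    def harvest(self, since: int) -> List[JournalEntry]:
        # Return entries recorded strictly after `since`.
        return [e for e in self._entries if e.timestamp > since]


journal = Journal()
journal.assert_triple(("ex:dataset1", "ex:derivedFrom", "ex:dataset0"), "ex:alice", 100)
journal.assert_triple(("ex:dataset2", "ex:derivedFrom", "ex:dataset1"), "ex:bob", 200)

# A harvester that last synchronized at time 100 receives only the later entry.
new_entries = journal.harvest(since=100)
```

In this sketch the attribution and timestamp live alongside the triple rather than inside the RDF graph itself; the paper compares several ways of expressing the same association.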
References
1. Wong, S.C., Miles, S., Fang, W., Groth, P., Moreau, L.: Provenance-based validation of e-science experiments. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729, pp. 801–815. Springer, Heidelberg (2005)
2. Lagoze, C., de Sompel, H.V.: The Open Archives Initiative: Building a low-barrier interoperability framework, http://www.cs.cornell.edu/lagoze/papers/oai-jcdl.pdf, http://citeseer.ist.psu.edu/lagoze01open.htm
3. Heymans, S., Nieuwenborgh, D.V., Vermeir, D.: Preferential reasoning on a web of trust. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729, pp. 368–382. Springer, Heidelberg (2005)
4. Huang, Z., Stuckenschmidt, H.: Reasoning with multi-version ontologies: a temporal logic approach. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729, pp. 398–412. Springer, Heidelberg (2005)
5. Network Time Protocol, IETF RFC 958
6. RDF Semantics. W3C Recommendation (February 10, 2004), http://www.w3.org/TR/rdf-mt/#Reif
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Futrelle, J. (2006). Harvesting RDF Triples. In: Moreau, L., Foster, I. (eds) Provenance and Annotation of Data. IPAW 2006. Lecture Notes in Computer Science, vol 4145. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11890850_8
DOI: https://doi.org/10.1007/11890850_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-46302-3
Online ISBN: 978-3-540-46303-0
eBook Packages: Computer Science (R0)