User Trust and Judgments in a Curated Database with Explicit Provenance

Chapter in the book In Search of Elegance in the Theory and Practice of Computation

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 8000)

Abstract

We focus on human-in-the-loop, information-integration settings where users gather and evaluate data from a broad variety of sources and where the levels of trust in sources and users change dynamically. In such settings, users must use their judgment as they collect and modify data. As an example, a battlefield information officer preparing a report to inform his or her superiors about the current state of affairs must gather and integrate data from many (including non-computerized) sources. By tracking multiple sources for individual values, the officer may eliminate a value from the current state whenever all of the sources where this value was found are no longer trusted. We define a conceptual model for a curated database with provenance for such settings, the Multi-granularity, Multi-provenance Model (MMP), which supports multiple insertions and multiple (copy-and-)paste operations for a single database element, captures the external source for all operations, and includes a Data Confidence Language that allows users to confirm or doubt values to record their atomic judgments about the data. In this paper, we briefly summarize the MMP model and show how it can be extended to support potentially complex operations including compound judgment operators (such as merging tuples to achieve entity resolution), while capturing a complete record of data provenance.
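The rule described above, under which a value stays in the current state only while at least one of its sources is trusted, can be illustrated with a minimal sketch. This is not the MMP model itself: the `Value` class, the `current_state` function, and the source names are hypothetical and chosen only for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Value:
    """A database value together with every source it was found in (hypothetical type)."""
    content: str
    sources: set[str] = field(default_factory=set)


def current_state(values: list[Value], trusted: set[str]) -> list[str]:
    """Keep a value only while at least one of its sources is still trusted."""
    return [v.content for v in values if v.sources & trusted]


# Illustrative data only: contents and source names are assumptions.
values = [
    Value("bridge intact", {"scout_report", "radio_intercept"}),
    Value("convoy delayed", {"radio_intercept"}),
]

trusted = {"scout_report", "radio_intercept"}
before = current_state(values, trusted)

# Trust in one source is withdrawn; values backed only by it drop out.
trusted.discard("radio_intercept")
after = current_state(values, trusted)
```

In this sketch the value with two sources survives the loss of one of them, while the value backed by a single source is eliminated once that source is no longer trusted.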




Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Archer, D.W., Delcambre, L.M.L., Maier, D. (2013). User Trust and Judgments in a Curated Database with Explicit Provenance. In: Tannen, V., Wong, L., Libkin, L., Fan, W., Tan, WC., Fourman, M. (eds) In Search of Elegance in the Theory and Practice of Computation. Lecture Notes in Computer Science, vol 8000. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41660-6_5

  • DOI: https://doi.org/10.1007/978-3-642-41660-6_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41659-0

  • Online ISBN: 978-3-642-41660-6

  • eBook Packages: Computer Science, Computer Science (R0)
