Abstract
Uncertainty management and information integration have been challenging issues in AI and database research. The literature is vast and rich on either of these two issues, however, they have not been studied simultaneously in the same setting. In this work, we make a first attempt and propose a framework for information integration with uncertainty, which uses the information source tracking(IST) model [9] as the underlying certainty model. The IST model is an extension of the relational data model in which every tuple t is annotated with (a set of) fixed length vectors, called agent vectors, representing the (human or sensor) agents which confirmed t or contributed to it. Our framework consists of a dynamic collection of autonomous but cooperating IST databases, called the information sources or sites, in which each relation r is annotated with a site vector, indicating which sites contributed to the definition of r. We extend the relational algebra from the basic IST model accordingly to manipulate agent and site vectors. We also extend the reliability calculation algorithm from the basic model to compute the certainty of each answer tuple as a function of the reliabilities of the contributing agents and sites. We have developed a running prototype of the proposed framework for which we mainly used SQL programming for query rewriting and manipulation of agent and site vectors.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Anokhin, P., Motro, A.: Data integration: Inconsistency detection and resolution based on source properties. In: Proc. FMI 2001 (2001)
Bernstein, P.: Generic model management: A database infrastructure for schema manipulation. In: Batini, C., Giunchiglia, F., Giorgini, P., Mecella, M. (eds.) CoopIS 2001. LNCS, vol. 2172, p. 1. Springer, Heidelberg (2001)
Daswani, N., Garcia-Molina, H., Yang, B.: Open problems in datasharing peer-to-peer systems. In: Calvanese, D., Lenzerini, M., Motwani, R. (eds.) ICDT 2003. LNCS, vol. 2572, pp. 1–15. Springer, Heidelberg (2002)
Halevy, A.Y.: Answering queries using views: A survey. The VLDB Journal 10(4), 270–294 (2001)
Lakshmanan, Laks, V.S., Shiri, N.: Logic programming and deductive databases with uncertainty: A survey. In: Enclyclopedia of Computer Science and Technology, vol. 45, pp. 153–176. Marcel Dekker, Inc, New York (2001)
Parsons, S.: Current approaches to handling imperfect information in data and knowledge bases. Knowledge and Data Engineering 8(3), 353–372 (1996)
Popa, L., Velegrakis, Y., Miller, R.J., Hernandez, M.A., Fagin, R.: Translating web data. In: Proceedings of VLDB 2002, Hong Kong SAR, China, pp. 598–609 (2002)
Rahm, E., Bernstein, P.A.: A survey of approaches to automatic schema matching. VLDB Journal 10(4), 334–350 (2001)
Sadri, F.: Modeling uncertainty in databases. In: Proc. 7th IEEE Intl. Conf. on Data Eng., pp. 122–131 (April 1991)
Sen, S., Wong, J.: Analyzing peer-to-peer traffic across large networks. In: Second Annual ACM Internet Measurement Workshop (November 2002)
Avi, S., Michael, S., Ullman, J.D.: Database systems: Achievements and opportunities. In: The Lagunita report of the NSF workshop on the future of database systems research held in Palo Alto, California (1990)
Ullman, J.D.: Information integration using logical views. Theoretical Computer Science 239(2), 189–210 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kiani, A., Shiri, N. (2005). A Framework for Information Integration with Uncertainty. In: Ramos, F.F., Larios Rosillo, V., Unger, H. (eds) Advanced Distributed Systems. ISSADS 2005. Lecture Notes in Computer Science, vol 3563. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11533962_17
Download citation
DOI: https://doi.org/10.1007/11533962_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28063-7
Online ISBN: 978-3-540-31674-9
eBook Packages: Computer ScienceComputer Science (R0)