Synonyms
Bayesian networks; Correlated databases; Markov networks; Probabilistic databases
Definition
Uncertain data appears naturally in many real-world applications for a variety of reasons, ranging from inherent limitations of the measurement or monitoring infrastructures to widespread use of statistical analysis and probabilistic inference. Further, the uncertainties associated with different entities or facts in the data are often correlated with each other. For instance, two facts may be known to be mutually exclusive, i.e., even if we are uncertain about which of the two are true, we may know that both the facts cannot be simultaneously true. Oftentimes the correlations are more complex; for example, given two uncertain facts, we may know that if one of them is true, then the probability for the other being true is higher and vice versa. To manage such correlated data in a principled manner, the uncertain data model must be expressive enough to allow capturing such...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aggarwal CC. Managing and mining uncertain data. New York: Springer Incorporated; 2009.
Cheng R, Kalashnikov D, Prabhakar S. Evaluating probabilistic queries over imprecise data. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2003.
Cowell RG, Philip Dawid A, Lauritzen SL, Spiegelhater DJ. Probabilistic networks and expert systems. New York: Springer; 1999.
Dalvi N, Suciu D. Efficient query evaluation on probabilistic databases. In: Proceedings of the 32nd International Conference on Very Large Data Bases; 2006.
Das Sarma A, Benjelloun O, Halevy A, Widom J. Working models for uncertain data. In: Proceedings of the 22nd International Conference on Data Engineering; 2006.
Deshpande A, Guestrin C, Madden S, Hellerstein JM, Hong W. Model-driven data acquisition in sensor networks. In: Proceedings of the 30th International Conference on Very Large Data Bases; 2004.
Deshpande A, Getoor L, Sen P. Graphical models for uncertain data. In: Aggarwal C, editor. Managing and mining uncertain data. New York: Springer; 2009.
Fuhr N, Rolleke T. A probabilistic relational algebra for the integration of information retrieval and database systems. ACM Trans Inf Syst. 1997;15(1):32.
Getoor L, Taskar B, editors. Introduction to statistical relational learning. Cambridge: MIT Press; 2007.
Jayram TS, Krishnamurthy R, Raghavan S, Vaithyanathan S, Zhu H. Avatar information extraction system. In: IEEE Data Engineering Bulletin; 2006.
Jha A, Suciu D. Probabilistic databases with markoviews. Proc VLDB Endowment. 2012;5(11): 1160–71.
Jordan MI, editor. Learning in graphical models. Cambridge: MIT Press; 1999.
Jordan MI, Ghahramani Z, Jaakkola TS, Saul LK. An introduction to variational methods for graphical models. Mach Learn. 1999;37(2):183–233.
Kanagal B, Deshpande A. Online filtering, smoothing and probabilistic modeling of streaming data. In: Proceedings of the 24th International Conference on Data Engineering; 2008.
Kanagal B, Deshpande A. Indexing correlated probabilistic databases. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2009. p. 455–68.
Kanagal B, Deshpande A. Lineage processing on correlated probabilistic databases. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2010.
Murphy KP, Weiss Y, Jordan MI. Loopy belief propagation for approximate inference: an empirical study. In: Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence; 1999. p. 467–75.
Pearl J. Probabilistic reasoning in intelligent systems. San Mateo: Morgan Kaufmann; 1988.
Poole D. First-order probabilistic inference. In: Proceedings of the 18th International Joint Conference on Artificial Intelligence; 2003.
Rekatsinas T, Deshpande A, Getoor L. Local structure and determinism in probabilistic databases. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2012. p. 373–84.
Sen P, Deshpande A, Getoor L. Exploiting shared correlations in probabilistic databases. In: Proceedings of the 34th International Conference on Very Large Data Bases; 2008.
Sen P, Deshpande A, Getoor L. PrDB: managing and exploiting rich correlations in probabilistic databases. VLDB J. 2009;18(5):1065–90.
Suciu D, Olteanu D, Ré C, Koch C. Probabilistic databases. Synth Lect Data Manag. 2011;3(2): 1–180.
Wick M, McCallum A, Miklau G. Scalable probabilistic databases with factor graphs and MCMC. Proc VLDB Endowment. 2010;3(1–2): 794–804.
Zhe Wang D, Michelakis E, Garofalakis M, Hellerstein JM. Bayesstore: managing large, uncertain data repositories with probabilistic graphical models. Proc VLDB Endowment. 2008;1(1):340–51.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this entry
Cite this entry
Deshpande, A. (2018). Graphical Models for Uncertain Data Management. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_80741
Download citation
DOI: https://doi.org/10.1007/978-1-4614-8265-9_80741
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer ScienceReference Module Computer Science and Engineering