Abstract
Conditional Functional Dependencies (CFD) are an extension of Functional Dependencies (FDs) that capture rules about the data consistency. Existing work on discovering CFDs focused on centralized data. Here, we extend this work to horizontally distributed relations. Given an embedded functional dependency, we generate a pattern tableau that represents a CFD. The original feature of our work is generating CFD pattern tableaux from a distributed relation, without merging all the distributed tuples in a centralized relation. We propose a distributed algorithm based on the concept of pattern summary that minimizes data shipping between the sites of distributed relation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, S., Deb, S., Naidu, K.V.M., Rastogi, R.: Efficient detection of distributed constraint violations. In: ICDE, pp. 1320–1324 (2007)
Bohannon, P., Fan, W., Geerts, F., Jia, X., Kementsietsidis, A.: Conditional functional dependencies for data cleaning. In: ICDE, pp. 746–755 (2007)
Bravo, L., Fan, W., Ma, S.: Extending dependencies with conditions. In: VLDB, pp. 243–254 (2007)
Chiang, F., Miller, R.J.: Discovering data quality rules. PVLDB 1(1), 1166–1177 (2008)
Cong, G., Fan, W., Geerts, F., Jia, X., Ma, S.: Improving data quality: Consistency and accuracy. In: VLDB, pp. 315–326 (2007)
Conrad, S., Schmitt, I., Türker, C.: Considering integrity constraints during federated database design. In: Embury, S.M., Fiddian, N.J., Gray, W.A., Jones, A.C. (eds.) BNCOD 1998. LNCS, vol. 1405, pp. 119–133. Springer, Heidelberg (1998)
Fan, W., Geerts, F., Lakshmanan, L.V.S., Xiong, M.: Discovering conditional functional dependencies. In: ICDE, pp. 1231–1234 (2009)
Fan, W., Geerts, F., Ma, S., Müller, H.: Detecting inconsistencies in distributed data. In: ICDE, pp. 64–75 (2010)
Fan, W., Li, J., Tang, N., Yu, W.: Incremental detection of inconsistencies in distributed data. In: ICDE, pp. 318–329 (2012)
Golab, L., Karloff, H.J., Korn, F., Srivastava, D., Yu, B.: On generating near-optimal tableaux for conditional functional dependencies. PVLDB 1(1), 376–390 (2008)
Gupta, A., Sagiv, Y., Ullman, J.D., Widom, J.: Constraint checking with partial information. In: PODS, pp. 45–55 (1994)
Huyn, N.: Maintaining global integrity constraints in distributed databases. Constraints 2(3/4), 377–399 (1997)
Reyya, S., Prameela, M., Yadav, G.V., Rani, K.S., Bhargavi, A.V.: An efficient extension of conditional functional dependencies in distributed databases. Database 5, 6
Song, S., Chen, L., Yu, P.S.: Comparable dependencies over heterogeneous data. VLDB J. 22(2), 253–274 (2013)
Wyss, C.M., Giannella, C.M., Robertson, E.L.: FastFDs: A heuristic-driven, depth-first algorithm for mining functional dependencies from relation instances - extended abstract. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds.) DaWaK 2001. LNCS, vol. 2114, pp. 101–110. Springer, Heidelberg (2001)
Ye, F., Liu, J., Qian, J., Xue, X.: A framework for mining functional dependencies from large distributed databases. In: Proceedings of the 2010 International Conference on Artificial Intelligence and Computational Intelligence, AICI 2010, vol. 03, pp. 109–113. IEEE Computer Society, Washington, DC (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Sahri, S., Ouziri, M., Benbernou, S. (2014). Summary-Based Pattern Tableaux Generation for Conditional Functional Dependencies in Distributed Data. In: Decker, H., Lhotská, L., Link, S., Spies, M., Wagner, R.R. (eds) Database and Expert Systems Applications. DEXA 2014. Lecture Notes in Computer Science, vol 8644. Springer, Cham. https://doi.org/10.1007/978-3-319-10073-9_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-10073-9_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10072-2
Online ISBN: 978-3-319-10073-9
eBook Packages: Computer ScienceComputer Science (R0)