Abstract
Solving conflicts between overlapping databases requires an understanding of the reasons that lead to the inconsistencies. Provided that conflicts do not occur randomly but follow certain regularities, patterns in the form of “If condition Then conflict” provide a valuable means to facilitate their understanding. In previous work, we adopt existing association rule mining algorithms to identify such patterns. Within this paper we discuss extensions to our initial approach aimed at identifying possible update operations that caused the conflicts between the databases. This is done by restricting the items used for pattern mining. We further propose a classification of patterns based on mappings between the contradicting values to represent special cases of conflict generating updates.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
FAN, W., LU, H., MADNICK, S.E. and CHEUNG, D. (2001): Discovering and Reconciling Value Conflicts for Numerical Data Integration. Information Systems, 26, 635–656.
FELLEGI, P. and HOLT, D. (1976): A Systematic Approach to Automatic Edit and Imputation. Journal of the American Statistical Association, 71, 17–35.
HERNANDEZ, M.A. and STOLFO, S.J. (1995): The Merge/Purge Problem for Large Databases. Proc. Int. Conf. Management of Data (SIGMOD). San Jose, California.
MÜLLER, H., LESER, U. and FREYTAG, J.-C. (2004): Mining for Patterns in Contradictory Data. Proc. SIGMOD Int. Workshop on Information Quality for Information Systems (IQIS’04). Paris, France.
PAN, F., CONG, G., TUNG, A.K.H., YANG, J. and ZAKI, M.J. (2003): CARPENTER: Finding Closed Patterns in Long Biological Datasets. Proc. Int. Conf. on Knowledge Discovery and Data Mining (SIGKDD). Washington DC.
ZAKI, M.J. (2002): CHARM: An Efficient Algorithm for Closed Itemset Mining. Proc. of the Second SIAM Int. Conf. on Data Mining. Arlington, VA.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Müller, H., Leser, U., Freytag, JC. (2007). Classification of Contradiction Patterns. In: Decker, R., Lenz, H.J. (eds) Advances in Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70981-7_20
Download citation
DOI: https://doi.org/10.1007/978-3-540-70981-7_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70980-0
Online ISBN: 978-3-540-70981-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)