Abstract
A number of important decisions are based on a set of specific items in a database called select items. Thus the analysis of select items in multiple databases becomes of primordial relevance. In this chapter, we focus on the following issues. First, a model of mining global patterns of select items from multiple databases is presented. Second, a measure of quantifying an overall association between two items in a database is discussed. Third, we present an algorithm that is based on the proposed overall association between two items in a database for the purpose of grouping the frequent items in multiple databases. Each group contains a select item called the nucleus item and the group grows while being centered around the nucleus item. Experimental results are concerned with some synthetic and real-world databases.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Adhikari A, Rao PR (2008a) Synthesizing heavy association rules from different real data sources. Pattern Recogn Lett 29(1):59–71
Adhikari A, Rao PR (2008b) Efficient clustering of databases induced by local patterns. Decis Support Syst 44(4):925–943
Adhikari A, Ramachandrarao P, Pedrycz W (2011) Study of select items in different data sources by grouping. Knowl Inf Syst 27(1):23–43
Aggarwal C, Yu P (1998) A new framework for itemset generation. In: Proceedings of the 17th symposium on principles of database systems, pp 18–24
Agrawal R, Shafer J (1999) Parallel mining of association rules. IEEE Trans Knowl Data Eng 8(6):962–969
Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. In: Proceedings of ACM SIGMOD conference, pp 207–216
Barte RG (1976) The elements of real analysis, 2nd edn. Wiley, London
Chattratichat J, Darlington J, Ghanem M, Guo Y, Hüning H, Köhler M, Sutiwaraphun J, To HW, Yang D (1997) Large scale data mining: challenges, and responses. In: Proceedings of the third international conference on knowledge discovery and data mining, pp 143–146
Cheung D, Ng V, Fu A, Fu Y (1996) Efficient mining of association rules in distributed databases. IEEE Trans Knowl Data Eng 8(6):911–922
Frequent itemset mining dataset repository (2004) http://fimi.cs.helsinki.fi/data
Galambos J, Simonelli I (1996) Bonferroni-type inequalities with applications. Springer, New York
Jaroszewicz S, Simovici DA (2002) Support approximations using Bonferroni-type inequalities. In: Proceedings of sixth European conference on principles of data mining and knowledge discovery, pp 212–223
Klemettinen M, Mannila H, Ronkainen P, Toivonen T, Verkamo A (1994) Finding interesting rules from large sets of discovered association rules. In: Proceedings of the 3rd international conference on information and knowledge management, pp 401–407
Lin Y, Hu X, Li X, Wu X (2013) Mining stable patterns in multiple correlated databases. Decision Support Systems
Liu B, Hsu W, Ma Y (1999) Pruning and summarizing the discovered associations. In: Proceedings of the 5th international conference on knowledge discovery and data mining, pp 125–134
Pavlov D, Mannila H, Smyth P (2000) Probabilistics models for query approximation with large sparse binary data sets. In: Proceedings of sixteenth conference on uncertainty in artificial intelligence, pp 465–472
Proefschrift (2004) Multi-relational data mining, PhD thesis, Dutch Graduate School for Information and Knowledge Systems, Aan de Universiteit Utrecht
Pyle D (1999) Data preparation for data mining. Morgan Kufmann, San Francisco
Silberschatz A, Tuzhilin A (1996) What makes patterns interesting in knowledge discovery systems. IEEE Trans Knowl Data Eng 8(6):970–974
Silverstein C, Brin S, Motwani R (1998) Beyond market baskets: generalizing association rules to dependence rules. Data Min Knowl Disc 2(1):39–68
Tan P-N, Kumar V, Srivastava J (2002) Selecting the right interestingness measure for association patterns. In: Proceedings of SIGKDD conference, pp 32–41
Wu X, Zhang S (2003) Synthesizing high-frequency rules from different data sources. IEEE Trans Knowl Data Eng 14(2):353–367
Wu X, Zhang C, Zhang S (2005) Database classification for multi-database mining. Inf Syst 30(1):71–88
Xin D, Han J, Yan X, Cheng H (2005) Mining compressed frequent-pattern sets. In: Proceedings of the 31st VLDB conference, pp 709–720
Zhang S (2002) Knowledge discovery in multi-databases by analyzing local instances. PhD thesis, Deakin University
Zhang S, Wu X, Zhang C (2003) Multi-database mining. IEEE computational intelligence. Bulletin 2(1):5–13
Zhang C, Liu M, Nie W, Zhang S (2004a) Identifying global exceptional patterns in multi-database mining. IEEE Comput Intel Bull 3(1):19–24
Zhang S, Zhang C, Wu X (2004b) Knowledge discovery in multiple databases. Springer, New York
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Adhikari, A., Adhikari, J., Pedrycz, W. (2014). Mining Patterns of Select Items in Different Data Sources. In: Data Analysis and Pattern Recognition in Multiple Databases. Intelligent Systems Reference Library, vol 61. Springer, Cham. https://doi.org/10.1007/978-3-319-03410-2_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-03410-2_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03409-6
Online ISBN: 978-3-319-03410-2
eBook Packages: EngineeringEngineering (R0)