Mining Patterns of Select Items in Different Data Sources

Adhikari, Animesh; Adhikari, Jhimli; Pedrycz, Witold

doi:10.1007/978-3-319-03410-2_6

Animesh Adhikari⁶,
Jhimli Adhikari⁷ &
Witold Pedrycz⁸

Part of the book series: Intelligent Systems Reference Library ((ISRL,volume 61))

1440 Accesses

Abstract

A number of important decisions are based on a set of specific items in a database called select items. Thus the analysis of select items in multiple databases becomes of primordial relevance. In this chapter, we focus on the following issues. First, a model of mining global patterns of select items from multiple databases is presented. Second, a measure of quantifying an overall association between two items in a database is discussed. Third, we present an algorithm that is based on the proposed overall association between two items in a database for the purpose of grouping the frequent items in multiple databases. Each group contains a select item called the nucleus item and the group grows while being centered around the nucleus item. Experimental results are concerned with some synthetic and real-world databases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Adhikari A, Rao PR (2008a) Synthesizing heavy association rules from different real data sources. Pattern Recogn Lett 29(1):59–71
Article Google Scholar
Adhikari A, Rao PR (2008b) Efficient clustering of databases induced by local patterns. Decis Support Syst 44(4):925–943
Article Google Scholar
Adhikari A, Ramachandrarao P, Pedrycz W (2011) Study of select items in different data sources by grouping. Knowl Inf Syst 27(1):23–43
Article Google Scholar
Aggarwal C, Yu P (1998) A new framework for itemset generation. In: Proceedings of the 17th symposium on principles of database systems, pp 18–24
Google Scholar
Agrawal R, Shafer J (1999) Parallel mining of association rules. IEEE Trans Knowl Data Eng 8(6):962–969
Article Google Scholar
Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. In: Proceedings of ACM SIGMOD conference, pp 207–216
Google Scholar
Barte RG (1976) The elements of real analysis, 2nd edn. Wiley, London
Google Scholar
Chattratichat J, Darlington J, Ghanem M, Guo Y, Hüning H, Köhler M, Sutiwaraphun J, To HW, Yang D (1997) Large scale data mining: challenges, and responses. In: Proceedings of the third international conference on knowledge discovery and data mining, pp 143–146
Google Scholar
Cheung D, Ng V, Fu A, Fu Y (1996) Efficient mining of association rules in distributed databases. IEEE Trans Knowl Data Eng 8(6):911–922
Article Google Scholar
Frequent itemset mining dataset repository (2004) http://fimi.cs.helsinki.fi/data
Galambos J, Simonelli I (1996) Bonferroni-type inequalities with applications. Springer, New York
Google Scholar
Jaroszewicz S, Simovici DA (2002) Support approximations using Bonferroni-type inequalities. In: Proceedings of sixth European conference on principles of data mining and knowledge discovery, pp 212–223
Google Scholar
Klemettinen M, Mannila H, Ronkainen P, Toivonen T, Verkamo A (1994) Finding interesting rules from large sets of discovered association rules. In: Proceedings of the 3^rd international conference on information and knowledge management, pp 401–407
Google Scholar
Lin Y, Hu X, Li X, Wu X (2013) Mining stable patterns in multiple correlated databases. Decision Support Systems
Google Scholar
Liu B, Hsu W, Ma Y (1999) Pruning and summarizing the discovered associations. In: Proceedings of the 5th international conference on knowledge discovery and data mining, pp 125–134
Google Scholar
Pavlov D, Mannila H, Smyth P (2000) Probabilistics models for query approximation with large sparse binary data sets. In: Proceedings of sixteenth conference on uncertainty in artificial intelligence, pp 465–472
Google Scholar
Proefschrift (2004) Multi-relational data mining, PhD thesis, Dutch Graduate School for Information and Knowledge Systems, Aan de Universiteit Utrecht
Google Scholar
Pyle D (1999) Data preparation for data mining. Morgan Kufmann, San Francisco
Google Scholar
Silberschatz A, Tuzhilin A (1996) What makes patterns interesting in knowledge discovery systems. IEEE Trans Knowl Data Eng 8(6):970–974
Article Google Scholar
Silverstein C, Brin S, Motwani R (1998) Beyond market baskets: generalizing association rules to dependence rules. Data Min Knowl Disc 2(1):39–68
Article Google Scholar
Tan P-N, Kumar V, Srivastava J (2002) Selecting the right interestingness measure for association patterns. In: Proceedings of SIGKDD conference, pp 32–41
Google Scholar
Wu X, Zhang S (2003) Synthesizing high-frequency rules from different data sources. IEEE Trans Knowl Data Eng 14(2):353–367
Google Scholar
Wu X, Zhang C, Zhang S (2005) Database classification for multi-database mining. Inf Syst 30(1):71–88
Article MATH Google Scholar
Xin D, Han J, Yan X, Cheng H (2005) Mining compressed frequent-pattern sets. In: Proceedings of the 31st VLDB conference, pp 709–720
Google Scholar
Zhang S (2002) Knowledge discovery in multi-databases by analyzing local instances. PhD thesis, Deakin University
Google Scholar
Zhang S, Wu X, Zhang C (2003) Multi-database mining. IEEE computational intelligence. Bulletin 2(1):5–13
Google Scholar
Zhang C, Liu M, Nie W, Zhang S (2004a) Identifying global exceptional patterns in multi-database mining. IEEE Comput Intel Bull 3(1):19–24
Google Scholar
Zhang S, Zhang C, Wu X (2004b) Knowledge discovery in multiple databases. Springer, New York
Book MATH Google Scholar

Download references

Author information

Authors and Affiliations

Parvatibai Chowgule College, P. O. Fatorda, Margao, Goa, 403 602, India
Animesh Adhikari
Narayan Zantye College, P. O. Bicholim Industrial Estate, Bicholim, 403 529, India
Jhimli Adhikari
Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, ABT6G2V4, Canada
Witold Pedrycz

Authors

Animesh Adhikari
View author publications
You can also search for this author in PubMed Google Scholar
Jhimli Adhikari
View author publications
You can also search for this author in PubMed Google Scholar
Witold Pedrycz
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Animesh Adhikari .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Adhikari, A., Adhikari, J., Pedrycz, W. (2014). Mining Patterns of Select Items in Different Data Sources. In: Data Analysis and Pattern Recognition in Multiple Databases. Intelligent Systems Reference Library, vol 61. Springer, Cham. https://doi.org/10.1007/978-3-319-03410-2_6

Download citation

DOI: https://doi.org/10.1007/978-3-319-03410-2_6
Published: 07 December 2013
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03409-6
Online ISBN: 978-3-319-03410-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics