Mining Patterns of Select Items in Multiple Databases

Adhikari, Animesh; Ramachandrarao, Pralhad; Pedrycz, Witold

doi:10.1007/978-1-84996-044-1_4

Part of the book series: Advanced Information and Knowledge Processing (AI&KP)

Abstract

Many important decisions depend on a specific set of items in a database, called the select items, so the analysis of select items across multiple databases is of primary importance. This chapter addresses three issues. First, we present a model for mining global patterns of select items from multiple databases. Second, we discuss a measure that quantifies the overall association between two items in a database. Third, we present an algorithm, based on this measure, for grouping the frequent items in multiple databases. Each group contains a select item, called the nucleus item, and the group grows around it. Experimental results are reported for synthetic and real-world databases.
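The abstract does not give the chapter's formulas, so the following is only a rough sketch of the grouping idea, not the authors' algorithm. It assumes a stand-in association score (transactions containing both items minus transactions containing exactly one, divided by transactions containing at least one), and it simply pools the local databases rather than synthesizing local patterns. The names `overall_association`, `group_around_nucleus`, `min_support`, and `min_association` are hypothetical.

```python
def overall_association(transactions, x, y):
    """Stand-in score in [-1, 1]; an assumption, not the chapter's measure.

    Counts transactions containing both items minus those containing
    exactly one, relative to transactions containing at least one.
    """
    both = sum(1 for t in transactions if x in t and y in t)
    either = sum(1 for t in transactions if x in t or y in t)
    if either == 0:
        return 0.0
    only_one = either - both
    return (both - only_one) / either


def group_around_nucleus(databases, nucleus, min_support, min_association):
    """Collect frequent items whose score with the nucleus meets a threshold.

    `databases` is a list of databases, each a list of transactions
    (iterables of items). Pooling them is a simplification for illustration.
    """
    transactions = [set(t) for db in databases for t in db]
    total = len(transactions)
    if total == 0:
        return {}

    # Count how many transactions contain each item.
    counts = {}
    for t in transactions:
        for item in t:
            counts[item] = counts.get(item, 0) + 1

    group = {}
    for item, count in counts.items():
        if item == nucleus or count / total < min_support:
            continue  # skip the nucleus itself and infrequent items
        score = overall_association(transactions, nucleus, item)
        if score >= min_association:
            group[item] = score
    return group


databases = [
    [{"a", "b"}, {"a", "b", "c"}, {"c"}],
    [{"a", "b"}, {"a"}, {"b", "c"}],
]
group = group_around_nucleus(databases, "a", min_support=0.5, min_association=0.0)
```

With these toy databases, item `b` joins the group of nucleus `a`, while `c` is frequent but scores below the threshold and is left out.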

Author information

Authors and Affiliations

Department of Computer Science, Smt. Parvatibai Chowgule College, Margao, 403602, India
Animesh Adhikari
Department of Computer Science & Technology, Goa University, Goa, 403206, India
Pralhad Ramachandrarao
Department of Electrical & Computer Engineering, University of Alberta, 9107 116 Street, Edmonton, AB, T6G 2V4, Canada
Witold Pedrycz

Corresponding author

Correspondence to Animesh Adhikari.

About this chapter

Cite this chapter

Adhikari, A., Ramachandrarao, P., Pedrycz, W. (2010). Mining Patterns of Select Items in Multiple Databases. In: Developing Multi-Database Mining Applications. Advanced Information and Knowledge Processing. Springer, London. https://doi.org/10.1007/978-1-84996-044-1_4

DOI: https://doi.org/10.1007/978-1-84996-044-1_4
Published: 03 March 2010
Publisher Name: Springer, London
Print ISBN: 978-1-84996-043-4
Online ISBN: 978-1-84996-044-1
eBook Packages: Computer Science (R0)
