Mining Patterns of Select Items in Multiple Databases

  • Animesh Adhikari
  • Pralhad Ramachandrarao
  • Witold Pedrycz
Chapter
Part of the Advanced Information and Knowledge Processing book series (AI&KP)

Abstract

Many important decisions are based on a specific set of items in a database, called the select items, so the analysis of select items in multiple databases is of primary importance. This chapter addresses three issues. First, it presents a model for mining global patterns of select items from multiple databases. Second, it discusses a measure that quantifies the overall association between two items in a database. Third, it presents an algorithm, based on this measure, for grouping the frequent items in multiple databases. Each group contains a select item, called the nucleus item, and grows around that item. Experimental results are reported on synthetic and real-world databases.
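
The grouping idea can be illustrated with a minimal sketch. The abstract does not define the association measure or the algorithm, so everything below is an assumption made for illustration: the score (transactions containing both items minus transactions containing exactly one, divided by transactions containing at least one), the function names `overall_association` and `group_around_nucleus`, the two thresholds, and the restriction to a single toy database rather than multiple databases.

```python
from typing import Hashable, List, Set

Transaction = Set[Hashable]


def overall_association(db: List[Transaction], x: Hashable, y: Hashable) -> float:
    """Hypothetical score in [-1, 1]; NOT taken from the chapter.

    (transactions with both items - transactions with exactly one)
    divided by transactions with at least one of the two items.
    """
    both = sum(1 for t in db if x in t and y in t)
    either = sum(1 for t in db if x in t or y in t)
    if either == 0:
        return 0.0  # neither item occurs; treat as no association
    exactly_one = either - both
    return (both - exactly_one) / either


def group_around_nucleus(
    db: List[Transaction],
    nucleus: Hashable,
    min_support: float,
    min_assoc: float,
) -> Set[Hashable]:
    """Collect frequent items whose assumed score with the nucleus is high enough."""
    if not db:
        return {nucleus}
    n = len(db)
    items = set().union(*db)
    # An item is frequent if its relative support reaches min_support.
    frequent = {i for i in items if sum(1 for t in db if i in t) / n >= min_support}
    group = {nucleus}
    for item in frequent:
        if item != nucleus and overall_association(db, nucleus, item) >= min_assoc:
            group.add(item)
    return group


# Toy database of five transactions (made up for this sketch).
toy_db = [{"a", "b"}, {"a", "b"}, {"a", "c"}, {"b"}, {"a", "b", "c"}]
group_a = group_around_nucleus(toy_db, "a", min_support=0.4, min_assoc=0.1)
```

In this toy run the select item `a` acts as the nucleus, and an item joins its group only if it is frequent and its assumed score with `a` reaches the threshold. How the chapter pools local databases, and the exact form of its measure, are not recoverable from the abstract.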

Keywords

Frequent Itemset, Global Pattern, Frequent Item, Central Office, Local Database
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag London 2010

Authors and Affiliations

  • Animesh Adhikari (1), Email author
  • Pralhad Ramachandrarao (2)
  • Witold Pedrycz (3)

  1. Department of Computer Science, Smt. Parvatibai Chowgule College, Margao, India
  2. Department of Computer Science & Technology, Goa University, Goa, India
  3. Department of Electrical & Computer Engineering, University of Alberta, Edmonton, Canada
