Skip to main content

Mining Optimal Class Association Rule Set

  • Conference paper
  • First Online:
Advances in Knowledge Discovery and Data Mining (PAKDD 2001)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2035))

Included in the following conference series:

Abstract

We define an optimal class association rule set to be the minimum rule set with the same prediction power of the complete class association rule set. Using this rule set instead of the complete class association rule set we can avoid redundant computation that would otherwise be required for mining predictive association rules and hence improve the efficiency of the mining process significantly. We present an efficient algorithm for mining the optimal class association rule set using an upward closure property of pruning weak rules before they are actually generated. We have implemented the algorithm and our experimental results show that our algorithm generates the optimal class association rule set, whose size is smaller than 1/17 of the complete class association rule set on average, in significantly less time than generating the complete class association rule set. Our proposed criterion has been shown very effective for pruning weak rules in dense databases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Agrawal, T. Imielinski, and A. Swami. Mining associations between sets of items in massive databases. In Proc. of the ACM SIGMOD Int’l Conference on Management of Data, 1993.

    Google Scholar 

  2. R. Agrawal, H. Mannila, R. Srikant, H. Toivonen, and A.I. Verkamo. Fast discovery of association rules. In Fayyad U. and et al, editors, Advances in Knowledge Discovery and Data Mining. MIT Press, 1996.

    Google Scholar 

  3. Kamal Ali, Stefanos Manganaris, and Ramakrishnan Srikant. Partial classification using association rules. In David Heckerman, Heikki Mannila, Daryl Pregibon, and Ramasamy Uthurusamy, editors, Proceedings of the Third International Conference on Knowledge Discovery and Data Mining (KDD-97), page 115. AAAI Press, 1997.

    Google Scholar 

  4. Roberto Bayardo and Rakesh Agrawal. Mining the most interesting rules. In Surajit Chaudhuri and David Madigan, editors, Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 145–154, N.Y., August 1518 1999. ACM Press.

    Google Scholar 

  5. Roberto Bayardo, Rakesh Agrawal, and Dimitrios Gunopulos. Constraint-based rule mining in large, dense database. In Proc. of the 15th Int’l Conf. on Data Engineering, pages 188–197, 1999.

    Google Scholar 

  6. E. Keogh C. Blake and C.J. Merz. UCI repository of machine learning databases, http://www.ics.uci.edu/~mlearn/MLRepository.html, 1998.

  7. Sergey Brin, Rajeev Motwani, and Craig Silverstein. Beyond market baskets: Generalizing association rules to correlations. SIGMOD Record (ACM Special Interest Group on Management of Data), 26(2):265.

    Google Scholar 

  8. P. Clark and R. Boswell. Rule induction with CN2: Some recent improvements. In Y. Kodratoff, editor, Machine Learning-EWSL-91, 1991.

    Google Scholar 

  9. H. Toivonene M. Klemettinen P RonKainen K Hatonen and H Mannila. Pruning and grouping discovered association rules. Technical report, Department of Computer Science, University of Helsinki, Finland (ftp://cs.helsinki.fi/pub/Reports/by Project/PMDM/), 1998.

  10. M. Houtsma and A. Swami. Set-oriented mining of association rules in relational databases. In 11th Intl. Conf. data Engineering, 1995.

    Google Scholar 

  11. Ron Kohavi, Dan Sommerfield, and James Dougherty. Data mining using MLC++: A machine learning library in C++. In Tools with Artificial Intelligence, pages 234–245. IEEE Computer Society Press, 1996. Received the best paper award.

    Google Scholar 

  12. Bing Liu, Wynne Hsu, and Yiming Ma. Integrating classification and association rule mining. In SIGKDD 98, pages 80–86, 1998.

    Google Scholar 

  13. Bing Liu, Wynne Hsu, and Yiming Ma. Pruning and summarizing the discovered associations. In SIGKDD 99, 1999.

    Google Scholar 

  14. H. Mannila, H. Toivonen, and I. Verkamo. Efficient algorithms for discovering association rules. InAAAI Wkshp. Knowledge Discovery in Databases, July 1994.

    Google Scholar 

  15. Tom M. Mitchell. Machine Learning. McGraw-Hill, 1997.

    Google Scholar 

  16. Raymond T. Ng, Laks V.S. Lakshmanan, Jiawei Han, and Alex Pang. Exploratory mining and pruning optimizations of constrained associations rules. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD-98), volume 27, 2 of ACM SIGMOD Record, pages 13–24, New York, June 1-4 1998. ACM Press.

    Article  Google Scholar 

  17. J.S. Park, M. Chen, and P.S. Yu. An effective hash based algorithm for mining association rules. In ACM SIGMOD Intl. Conf. Management of Data, May 1995.

    Google Scholar 

  18. J.R. Quinlan. C4.5: Programs for Machine Learning. MK, San Mateo, CA, 1993.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Li, J., Shen, H., Topor, R. (2001). Mining Optimal Class Association Rule Set. In: Cheung, D., Williams, G.J., Li, Q. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2001. Lecture Notes in Computer Science(), vol 2035. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45357-1_39

Download citation

  • DOI: https://doi.org/10.1007/3-540-45357-1_39

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41910-5

  • Online ISBN: 978-3-540-45357-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics