Skip to main content

Constraint-Based Discovery and Inductive Queries: Application to Association Rule Mining

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2447))

Abstract

Recently inductive databases (IDBs) have been proposed to afford the problem of knowledge discovery from huge databases. Querying these databases needs for primitives to: (1) select, manipulate and query data, (2) select, manipulate and query “interesting” patterns (i.e., those patterns that satisfy certain constraints), and (3) cross over patterns and data (e.g., selecting the data in which some patterns hold). Designing such query languages is a long-term goal and only preliminary approaches have been studied, mainly for the association rule mining task. Starting from a discussion on the MINE RULE operator, we identify several open issues for the design of inductive databases dedicated to these descriptive rules. These issues concern not only the offered primitives but also the availability of efficient evaluation schemes. We emphasize the need for primitives that work on more or less condensed representations for the frequent itemsets, e.g., the (frequent) β-free and closed itemsets. It is useful not only for optimizing single association rule mining queries but also for sophisticated post-processing and interactive rule mining.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of items in large databases. In Proceedings SIGMOD’93, pages 207–216, Washington, USA, 1993. ACM Press.

    Google Scholar 

  2. R. Agrawal, H. Mannila, R. Srikant, H. Toivonen, and A. I. Verkamo. Fast discovery of association rules. In Advances in Knowledge Discovery and Data Mining, pages 307–328. AAAI Press, 1996.

    Google Scholar 

  3. Y. Bastide, N. Pasquier, R. Taouil, G. Stumme, and L. Lakhal. Mining minimal non-redundant association rules using frequent closed itemsets. In Proceedings CL 2000, volume 1861 of LNCS, pages 972–986, London, UK, 2000. Springer-Verlag.

    Google Scholar 

  4. Y. Bastide, R. Taouil, N. Pasquier, G. Stumme, and L. Lakhal. Mining frequent patterns with counting inference. SIGKDD Explorations, 2(2):66–75, Dec. 2000.

    Article  Google Scholar 

  5. M. Botta, J.-F. Boulicaut, C. Masson, and R. Meo. A comparison between query languages for the extraction of association rules. In Proceedings DaWaK’02, Aix en Provence, F, 2002. Springer-Verlag. To appear.

    Google Scholar 

  6. J.-F. Boulicaut and A. Bykowski. Frequent closures as a concise representation for binary data mining. In Proceedings PAKDD’00, volume 1805 of LNAI, pages 62–73, Kyoto, JP, 2000. Springer-Verlag.

    Google Scholar 

  7. J.-F. Boulicaut, A. Bykowski, and B. Jeudy. Towards the tractable discovery of association rules with negations. In Proceedings FQAS’00, Advances in Soft Computing series, pages 425–434, Warsaw, PL, Oct. 2000. Springer-Verlag.

    Google Scholar 

  8. J.-F. Boulicaut, A. Bykowski, and C. Rigotti. Approximation of frequency queries by means of free-sets. In Proceedings PKDD’00, volume 1910 of LNAI, pages 75–85, Lyon, F, 2000. Springer-Verlag.

    Google Scholar 

  9. J.-F. Boulicaut and B. Jeudy. Using constraint for itemset mining: should we prune or not? In Proceedings BDA’00, pages 221–237, Blois, F, 2000.

    Google Scholar 

  10. J.-F. Boulicaut, M. Klemettinen, and H. Mannila. Modeling KDD processes within the inductive database framework. In Proceedings DaWaK’99, volume 1676 of LNCS, pages 293–302, Florence, I, 1999. Springer-Verlag.

    Google Scholar 

  11. A. Bykowski and C. Rigotti. A condensed representation to find frequent patterns. In Proceedings PODS’01, pages 267–273, Santa Barbara, USA, 2001. ACM Press.

    Google Scholar 

  12. M. M. Garofalakis, R. Rastogi, and K. Shim. SPIRIT: Sequential pattern mining with regular expression constraints. In Proceedings VLDB’99, pages 223–234, Edinburgh, UK, 1999. Morgan Kaufmann.

    Google Scholar 

  13. B. Goethals and J. van den Bussche. On implementing interactive association rule mining. In Proceedings of the ACM SIGMOD Workshop DMKD’99, Philadelphia, USA, 1999.

    Google Scholar 

  14. J. Han and M. Kamber. Data Mining: Concepts and techniques. Morgan Kaufmann Publishers, San Francisco, USA, 2000. 533 pages.

    Google Scholar 

  15. T. Imielinski and H. Mannila. A database perspective on knowledge discovery. Communications of the ACM, 39(11):58–64, 1996.

    Article  Google Scholar 

  16. T. Imielinski and A. Virmani. MSQL: A query language for database mining. Data Mining and Knowledge Discovery, 3(4):373–408, 1999.

    Article  Google Scholar 

  17. B. Jeudy and J.-F. Boulicaut. Optimization of association rule mining queries. Intelligent Data Analysis, 6(5), 2002. To appear.

    Google Scholar 

  18. B. Jeudy and J.-F. Boulicaut. Using condensed representations for interactive association rule mining. In Proceedings PKDD’02, Helsinki, FIN, 2002. Springer-Verlag. To appear.

    Google Scholar 

  19. M. Kryszkiewicz. Concise representations of association rules. In Proceedings of the ESF Exploratory Workshop on Pattern Detection and Discovery, London, UK, 2002. Springer-Verlag. To appear in this volume.

    Google Scholar 

  20. H. Mannila and P. Smyth. Approximate query answering with frequent sets and maximum entropy. In Proceedings ICDE’00, page 309, San Diego, USA, 2000. IEEE Computer Press.

    Google Scholar 

  21. H. Mannila and H. Toivonen. Multiple uses of frequent sets and condensed representations. In Proceedings KDD’96, pages 189–194, Portland, USA, 1996. AAAI Press.

    Google Scholar 

  22. H. Mannila and H. Toivonen. Levelwise search and borders of theories in knowledge discovery. Data Mining and Knowledge Discovery, 1(3):241–258, 1997.

    Article  Google Scholar 

  23. R. Meo, G. Psaila, and S. Ceri. An extension to SQL for mining association rules. Data Mining and Knowledge Discovery, 2(2):195–224, 1998.

    Article  Google Scholar 

  24. P. Moen. Attribute, Event Sequence, and Event Type Similarity Notions for Data Mining. PhD thesis, Department of Computer Science, P.O. Box 26, FIN-00014 University of Helsinki, Jan. 2000.

    Google Scholar 

  25. B. Nag, P. M. Deshpande, and D. J. DeWitt. Using a knowledge cache for interactive discovery of association rules. In Proceedings SIGKDD’99, pages 244–253. ACM Press, 1999.

    Google Scholar 

  26. R. Ng, L. V. Lakshmanan, J. Han, and A. Pang. Exploratory mining and pruning optimizations of constrained associations rules. In Proceedings SIGMOD’98, pages 13–24, Seattle, USA, 1998. ACM Press.

    Google Scholar 

  27. N. Pasquier, Y. Bastide, R. Taouil, and L. Lakhal. Efficient mining of association rules using closed itemset lattices. Information Systems, 24(1):25–46, 1999.

    Article  Google Scholar 

  28. J. Pei, J. Han, and R. Mao. CLOSET an efficient algorithm for mining frequent closed itemsets. In Proceedings of the ACM SIGMOD Workshop DMKD’00, pages 21–30, Dallas, USA, 2000.

    Google Scholar 

  29. P. Smyth and R. M. Goodman. An information theoretic approach to rule induction from databases. IEEE Transactions on Knowledge and Data Engineering, 4(4):301–316, 1992.

    Article  Google Scholar 

  30. R. Srikant, Q. Vu, and R. Agrawal. Mining association rules with item constraints. In Proceedings KDD’97, pages 67–73, Newport Beach, USA, 1997. AAAI Press.

    Google Scholar 

  31. M. J. Zaki. Generating non-redundant association rules. In Proceedings ACM SIGKDD’00, pages 34–43, Boston, USA, 2000. ACM Press.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Jeudy, B., Jean-François, B. (2002). Constraint-Based Discovery and Inductive Queries: Application to Association Rule Mining. In: Hand, D.J., Adams, N.M., Bolton, R.J. (eds) Pattern Detection and Discovery. Lecture Notes in Computer Science(), vol 2447. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45728-3_9

Download citation

  • DOI: https://doi.org/10.1007/3-540-45728-3_9

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44148-9

  • Online ISBN: 978-3-540-45728-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics