Abstract
Recently inductive databases (IDBs) have been proposed to afford the problem of knowledge discovery from huge databases. Querying these databases needs for primitives to: (1) select, manipulate and query data, (2) select, manipulate and query “interesting” patterns (i.e., those patterns that satisfy certain constraints), and (3) cross over patterns and data (e.g., selecting the data in which some patterns hold). Designing such query languages is a long-term goal and only preliminary approaches have been studied, mainly for the association rule mining task. Starting from a discussion on the MINE RULE operator, we identify several open issues for the design of inductive databases dedicated to these descriptive rules. These issues concern not only the offered primitives but also the availability of efficient evaluation schemes. We emphasize the need for primitives that work on more or less condensed representations for the frequent itemsets, e.g., the (frequent) β-free and closed itemsets. It is useful not only for optimizing single association rule mining queries but also for sophisticated post-processing and interactive rule mining.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of items in large databases. In Proceedings SIGMOD’93, pages 207–216, Washington, USA, 1993. ACM Press.
R. Agrawal, H. Mannila, R. Srikant, H. Toivonen, and A. I. Verkamo. Fast discovery of association rules. In Advances in Knowledge Discovery and Data Mining, pages 307–328. AAAI Press, 1996.
Y. Bastide, N. Pasquier, R. Taouil, G. Stumme, and L. Lakhal. Mining minimal non-redundant association rules using frequent closed itemsets. In Proceedings CL 2000, volume 1861 of LNCS, pages 972–986, London, UK, 2000. Springer-Verlag.
Y. Bastide, R. Taouil, N. Pasquier, G. Stumme, and L. Lakhal. Mining frequent patterns with counting inference. SIGKDD Explorations, 2(2):66–75, Dec. 2000.
M. Botta, J.-F. Boulicaut, C. Masson, and R. Meo. A comparison between query languages for the extraction of association rules. In Proceedings DaWaK’02, Aix en Provence, F, 2002. Springer-Verlag. To appear.
J.-F. Boulicaut and A. Bykowski. Frequent closures as a concise representation for binary data mining. In Proceedings PAKDD’00, volume 1805 of LNAI, pages 62–73, Kyoto, JP, 2000. Springer-Verlag.
J.-F. Boulicaut, A. Bykowski, and B. Jeudy. Towards the tractable discovery of association rules with negations. In Proceedings FQAS’00, Advances in Soft Computing series, pages 425–434, Warsaw, PL, Oct. 2000. Springer-Verlag.
J.-F. Boulicaut, A. Bykowski, and C. Rigotti. Approximation of frequency queries by means of free-sets. In Proceedings PKDD’00, volume 1910 of LNAI, pages 75–85, Lyon, F, 2000. Springer-Verlag.
J.-F. Boulicaut and B. Jeudy. Using constraint for itemset mining: should we prune or not? In Proceedings BDA’00, pages 221–237, Blois, F, 2000.
J.-F. Boulicaut, M. Klemettinen, and H. Mannila. Modeling KDD processes within the inductive database framework. In Proceedings DaWaK’99, volume 1676 of LNCS, pages 293–302, Florence, I, 1999. Springer-Verlag.
A. Bykowski and C. Rigotti. A condensed representation to find frequent patterns. In Proceedings PODS’01, pages 267–273, Santa Barbara, USA, 2001. ACM Press.
M. M. Garofalakis, R. Rastogi, and K. Shim. SPIRIT: Sequential pattern mining with regular expression constraints. In Proceedings VLDB’99, pages 223–234, Edinburgh, UK, 1999. Morgan Kaufmann.
B. Goethals and J. van den Bussche. On implementing interactive association rule mining. In Proceedings of the ACM SIGMOD Workshop DMKD’99, Philadelphia, USA, 1999.
J. Han and M. Kamber. Data Mining: Concepts and techniques. Morgan Kaufmann Publishers, San Francisco, USA, 2000. 533 pages.
T. Imielinski and H. Mannila. A database perspective on knowledge discovery. Communications of the ACM, 39(11):58–64, 1996.
T. Imielinski and A. Virmani. MSQL: A query language for database mining. Data Mining and Knowledge Discovery, 3(4):373–408, 1999.
B. Jeudy and J.-F. Boulicaut. Optimization of association rule mining queries. Intelligent Data Analysis, 6(5), 2002. To appear.
B. Jeudy and J.-F. Boulicaut. Using condensed representations for interactive association rule mining. In Proceedings PKDD’02, Helsinki, FIN, 2002. Springer-Verlag. To appear.
M. Kryszkiewicz. Concise representations of association rules. In Proceedings of the ESF Exploratory Workshop on Pattern Detection and Discovery, London, UK, 2002. Springer-Verlag. To appear in this volume.
H. Mannila and P. Smyth. Approximate query answering with frequent sets and maximum entropy. In Proceedings ICDE’00, page 309, San Diego, USA, 2000. IEEE Computer Press.
H. Mannila and H. Toivonen. Multiple uses of frequent sets and condensed representations. In Proceedings KDD’96, pages 189–194, Portland, USA, 1996. AAAI Press.
H. Mannila and H. Toivonen. Levelwise search and borders of theories in knowledge discovery. Data Mining and Knowledge Discovery, 1(3):241–258, 1997.
R. Meo, G. Psaila, and S. Ceri. An extension to SQL for mining association rules. Data Mining and Knowledge Discovery, 2(2):195–224, 1998.
P. Moen. Attribute, Event Sequence, and Event Type Similarity Notions for Data Mining. PhD thesis, Department of Computer Science, P.O. Box 26, FIN-00014 University of Helsinki, Jan. 2000.
B. Nag, P. M. Deshpande, and D. J. DeWitt. Using a knowledge cache for interactive discovery of association rules. In Proceedings SIGKDD’99, pages 244–253. ACM Press, 1999.
R. Ng, L. V. Lakshmanan, J. Han, and A. Pang. Exploratory mining and pruning optimizations of constrained associations rules. In Proceedings SIGMOD’98, pages 13–24, Seattle, USA, 1998. ACM Press.
N. Pasquier, Y. Bastide, R. Taouil, and L. Lakhal. Efficient mining of association rules using closed itemset lattices. Information Systems, 24(1):25–46, 1999.
J. Pei, J. Han, and R. Mao. CLOSET an efficient algorithm for mining frequent closed itemsets. In Proceedings of the ACM SIGMOD Workshop DMKD’00, pages 21–30, Dallas, USA, 2000.
P. Smyth and R. M. Goodman. An information theoretic approach to rule induction from databases. IEEE Transactions on Knowledge and Data Engineering, 4(4):301–316, 1992.
R. Srikant, Q. Vu, and R. Agrawal. Mining association rules with item constraints. In Proceedings KDD’97, pages 67–73, Newport Beach, USA, 1997. AAAI Press.
M. J. Zaki. Generating non-redundant association rules. In Proceedings ACM SIGKDD’00, pages 34–43, Boston, USA, 2000. ACM Press.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jeudy, B., Jean-François, B. (2002). Constraint-Based Discovery and Inductive Queries: Application to Association Rule Mining. In: Hand, D.J., Adams, N.M., Bolton, R.J. (eds) Pattern Detection and Discovery. Lecture Notes in Computer Science(), vol 2447. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45728-3_9
Download citation
DOI: https://doi.org/10.1007/3-540-45728-3_9
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44148-9
Online ISBN: 978-3-540-45728-2
eBook Packages: Springer Book Archive