Abstract
Finding association rules is an important data mining problem, and the rules can be derived from large frequent candidate sets. This paper proposes a new algorithm, called the Matrix Algorithm, for efficiently generating large frequent candidate sets. The algorithm makes a single pass over the raw database to generate a matrix whose entries are 1 or 0, and the frequent candidate sets are then obtained from the resulting matrix. Finally, association rules are mined from the frequent candidate sets. Numerical experiments and a comparison with the Apriori algorithm are carried out on 4 randomly generated test problems of small, medium, and large size. The experimental results confirm that the proposed algorithm is more effective than the Apriori algorithm.
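The abstract does not spell out how the matrix is built or searched, so the following is only a minimal sketch of the general idea, not the authors' procedure. It assumes each matrix column (one per item) is stored as a Python integer bitmask with one bit per transaction, and it assumes a level-wise join in which the support of a candidate set is the number of 1 bits in the bitwise AND of its columns. The function names `build_columns`, `frequent_itemsets`, and `mine_rules`, and the `min_count` and `min_conf` parameters, are hypothetical.

```python
from itertools import combinations


def build_columns(transactions):
    """Single pass over the database: build one 0/1 column per item.

    Each column is an int whose bit r is 1 when transaction r contains the item.
    """
    columns = {}
    for row, transaction in enumerate(transactions):
        for item in set(transaction):
            columns[item] = columns.get(item, 0) | (1 << row)
    return columns


def popcount(bits):
    """Number of 1 entries in a column, i.e. the support count."""
    return bin(bits).count("1")


def frequent_itemsets(columns, min_count):
    """Derive frequent itemsets from the matrix only (assumed level-wise join)."""
    current = {
        frozenset([item]): bits
        for item, bits in columns.items()
        if popcount(bits) >= min_count
    }
    supports = {}
    while current:
        supports.update({s: popcount(b) for s, b in current.items()})
        next_level = {}
        for (a, a_bits), (b, b_bits) in combinations(current.items(), 2):
            union = a | b
            # Join two k-sets that differ in exactly one item.
            if len(union) == len(a) + 1 and union not in next_level:
                bits = a_bits & b_bits  # rows containing every item in union
                if popcount(bits) >= min_count:
                    next_level[union] = bits
        current = next_level
    return supports


def mine_rules(supports, min_conf):
    """Emit (antecedent, consequent, confidence) for each qualifying rule."""
    rules = []
    for itemset, count in supports.items():
        for k in range(1, len(itemset)):
            for lhs in combinations(sorted(itemset), k):
                lhs = frozenset(lhs)
                conf = count / supports[lhs]
                if conf >= min_conf:
                    rules.append((lhs, itemset - lhs, conf))
    return rules
```

After `build_columns` returns, the transactions are never read again; every support count comes from the stored columns, which is the property the abstract emphasizes. Whether the paper stores the matrix as bitmasks or joins candidates this way is an assumption of this sketch.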
Supported by the Youth Key Foundations of Univ. of Electronic Science and Technology of China (Jx04042).
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yuan, Y., Huang, T. (2005). A Matrix Algorithm for Mining Association Rules. In: Huang, DS., Zhang, XP., Huang, GB. (eds) Advances in Intelligent Computing. ICIC 2005. Lecture Notes in Computer Science, vol 3644. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11538059_39
DOI: https://doi.org/10.1007/11538059_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28226-6
Online ISBN: 978-3-540-31902-3
eBook Packages: Computer Science, Computer Science (R0)