Improved Implementation and Performance Analysis of Association Rule Mining in Large Databases

Nagesh, H. R.; Bharath Kumar, M.; Ravinarayana, B.

doi:10.1007/978-3-642-36321-4_9

H. R. Nagesh⁴,
M. Bharath Kumar⁴ &
B. Ravinarayana⁴

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 361))

Included in the following conference series:

International Conference on Advances in Computing, Communication and Control

2840 Accesses
1 Citations

Abstract

Data mining is the process of extracting interesting and previously unknown patterns and correlations from data stored in Data Base Management Systems (DBMSs). Association Rule Mining is the process of discovering items, which tend to occur together in transactions. Efficient algorithms to mine frequent patterns are crucial to many tasks in data mining. The task of mining association rules consists of two main steps. The first involves finding the set of all frequent itemsets. The second step involves testing and generating all high confidence rules among itemsets. Our paper deals with obtaining both the frequent itemsets as well as generating association rules among them.

In this paper we implement the FORC (Fully Organized Candidate Generation) algorithm, which is a constituent of the Viper algorithm for generating our candidates and subsequently our frequent itemsets. Our implementation is an improvement over Apriori, the most common algorithm used for frequent item set mining.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Shenoy, P., Bawa, M., Shah, D.: Turbo-charging Vertical Mining of large databases. In: Proceedings of ACM SIGMOD Intl. Conference on Mgmt of Data (2000)
Google Scholar
Mobasher, B., Cooley, R., Srivastava, J.: Web mining: Information and pattern discovery on the World Wide Web. In: 9th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 1997 (1997)
Google Scholar
Agrawal, R., Imielinski, T., Swamy, A.: Mining association rules between sets of items in large databases. In: Proceedings of ACM SIGMOD Intl. Conference on Management of Data (1993)
Google Scholar
Bodon, F.: A Fast Apriori Implementation. Computer and Automation Research Institute. Hungarian Academy of Sciences, Budapest (2012)
Google Scholar
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proc. of 20th Intl. Conf. Very Large Databases, VLDB (1994)
Google Scholar
Golomb, S.W.: Run Length Encoding. IEEE Transactions on Information Theory 12(3) (1966)
Google Scholar
Savasere, A., Omiecinski, E., Navathe, S.: An efficient algorithm for mining association rules in large databases. In: Proc. of 21st Intl. Conf. on Very Large Databases, VLDB (1995)
Google Scholar
Chen, M., Han, J., Yu, P.S.: Data Mining: An Overview from a Database Perspective. TKDE 8(6) (December 1996)
Google Scholar
Adriaans, P., Zantinge, D.: Data Mining. Addison-Wesley (1996)
Google Scholar
Agrawal, R., Imielinski, T., Swami, A.: Database Mining: A Performance Perspective. TKDE 5(6) (December 1993)
Google Scholar
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann (2001)
Google Scholar
Han, J., Pei, J., Yin, Y.: Mining Frequent Patterns without Candidate Generation. SIGMOD (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computer Science & Engineering, MITE, Mangalore, India
H. R. Nagesh, M. Bharath Kumar & B. Ravinarayana

Authors

H. R. Nagesh
View author publications
You can also search for this author in PubMed Google Scholar
M. Bharath Kumar
View author publications
You can also search for this author in PubMed Google Scholar
B. Ravinarayana
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Fr. Conceicao Rodrigues College of Engineering, Bandstand, Bandra (W), 400 050, Mumbai, Maharashtra, India
Srija Unnikrishnan
Fr. Conceicao Rodrigues College of Engineering, Bandstand, Bandra (W), 400 050, Mumbai, India
Sunil Surve
Dept. of Electronics Engineering, Fr. Conceicao Rodrigues College of Engineering, Bandstand, Bandra (West), 400 050, Mumbai, India
Deepak Bhoir

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nagesh, H.R., Bharath Kumar, M., Ravinarayana, B. (2013). Improved Implementation and Performance Analysis of Association Rule Mining in Large Databases. In: Unnikrishnan, S., Surve, S., Bhoir, D. (eds) Advances in Computing, Communication, and Control. ICAC3 2013. Communications in Computer and Information Science, vol 361. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36321-4_9

Download citation

DOI: https://doi.org/10.1007/978-3-642-36321-4_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-36320-7
Online ISBN: 978-3-642-36321-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics