Skip to main content

A New Improved Apriori Algorithm Based on Compression Matrix

  • Conference paper
Advanced Data Mining and Applications (ADMA 2014)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8933))

Included in the following conference series:

Abstract

The existing Apriori algorithm based on matrix still has the problems that the candidate itemsets are too large and matrix takes up too much memory space. To solve these problems, an improved Apriori algorithm based on compression matrix is proposed. The improvement ideas of this algorithm are as follows: (1) reducing the times of scanning matrix set during compressing by adding two arrays to record the counts of 1 in the row and column; (2) minimizing the scale of matrix and improving space utilization by deleting the itemsets which cannot be connected and the infrequent itemsets in compressing matrix; (3) decreasing the errors of the mining result by changing the condition of deleting the unnecessary transaction column;(4) reducing the cycling number of algorithm by changing the stopping condition of program. Instance analysis and experimental results show that the proposed algorithm can accurately and efficiently mines all frequent itemsets in transaction database, and improves the efficiency of mining association rules.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Imielinaki, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proceedings of the ACM SIGMOD Conference on Management of Date, pp. 207–216. ACM press, Washington, D.C (1993)

    Google Scholar 

  2. Han, J., Pei, J., Yin, Y.: Mining Frequent Patterns without Candidate Generations. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pp. 1–12. ACM Press, Dallas (2000)

    Chapter  Google Scholar 

  3. Park, J.S., Chen, M.S., Yu, P.S.: An effective hash-based algorithms for mining association rules. In: Proceedings of the 1995 ACM SIGMOD International Conference on Management of Data, pp. 175–186. ACM Press, San Jose (1995)

    Chapter  Google Scholar 

  4. Savasere, A., Omiecinski, E., Navathe, S.: An Efficient Algorithm for Mining Association rules. In: Proceedings of the 21st International Conference on Very Large Database, pp. 432–444. ACM Press, New York (1995)

    Google Scholar 

  5. Tovionen, H.: Sampling large databases for association rules. In: 22th International Conference on Very Large Database, Bombay, India, pp. 1–12 (1996)

    Google Scholar 

  6. Brin, S., Motwan, R.I., Ullman, J.D., et al.: Dynamic Itemset Counting and Implication Rules for Market Basket Data. In: 1997 ACM SIGMOD Conference on Management of Data, pp. 255–264. ACM Press, New York (1997)

    Chapter  Google Scholar 

  7. Zaki, M.J.: Scalable Algorithms for Association Mining. IEEE Transactions on Knowledge and Data Engineering 12(3), 372–390 (2000)

    Article  MathSciNet  Google Scholar 

  8. Huang, L.S., Chen, H.P., Wang, X., et al.: A Fast Algorithm for Mining Association Rules. Journal of Computer Science and Technology 15(6), 619–624 (2000)

    Article  MATH  Google Scholar 

  9. Yuan, Y., Huang, T.: A Matrix Algorithm for Mining Association Rules. In: Huang, D.-S., Zhang, X.-P., Huang, G.-B. (eds.) ICIC 2005. LNCS, vol. 3644, pp. 370–379. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  10. Zhang, Z.L., Liu, J., Zhang, J.: A Fast Algorithm for Mining association Rules Based on Boolean Matrix. In: 2008 International Conference on Wireless Communications, Networking and Mobile Computing, Dalian, China, pp. 1–3 (2008)

    Google Scholar 

  11. Wand, F., Li, Y.H.: An improved Apriori algorithm based on the matix. In: 2008 International Seminar on Future BioMedical Information Engineering, Wuhan, China, pp. 152–155 (2008)

    Google Scholar 

  12. Liu, H.Z., Dai, S.P., Jiang, H.: Quantitative association rules mining algorithm based on matrix. In: 2009 International Conference on Computational Intelligence and Software Engineering, pp. 1–4. IEEE Computer Society, Wuhan (2009)

    Google Scholar 

  13. Khare, N., Adlakha, N., Pardasani, K.R.: An Algorithm for Mining Multidimensional Association Rules using Boolean Matrix. In: 2010 International Conference on Recent Trends in Information, Telecommunication and Computing, pp. 95–99. IEEE Computer Society, Kochi (2010)

    Chapter  Google Scholar 

  14. Luo, X.W., Wang, W.Q.: Improved Algorithms Research for Association Rule Based on Matrix. In: 2010 International Conference on Intelligent Computing and Cognitive Informatics, pp. 415–419. IEEE Computer Society, Kuala Lumpur (2010)

    Chapter  Google Scholar 

  15. Krajca, P., Outrata, J., Vychodil, V.: Using frequent closed itemsets for data dimensionality reduction. In: 11th IEEE Internatinal Conference on Data Mining, pp. 1128–1133. Institute of Electrical and Electronics Engineers Inc, Vancouver (2011)

    Google Scholar 

  16. Chen, J.M., Lin, G.F., Yang, Z.H.: Extracting spatial association rules from the maximum frequent itemsets based on Boolean matrix. In: 19th International Conference on Geoinformatics, pp. 1–5. IEEE Computer Society, Shanghai (2011)

    Google Scholar 

  17. Zhou, Z.P., Wang, J.F.: An improved matrix sorting index association rule data mining algorithm. In: 33rd Chinese Control Conference, pp. 500–505. IEEE Computer Society, Nanjing (2014)

    Chapter  Google Scholar 

  18. Yang, X.Y., Zhen, L., Fu, Y.: MapReduce as a programming model for association rules algorithm on Hadoop. In: 3rd International Conference on Information Sciences and Interaction Sciences, pp. 99–102. IEEE Computer Society, Chengdu (2010)

    Chapter  Google Scholar 

  19. Wang, Z.Q., Li, H.L.: Research of Massive Web Log Data Mining Based on Cloud Computing. In: Fifth International Conference on Computational and Information Sciences, pp. 591–594. IEEE Computer Society, Shiyang (2013)

    Google Scholar 

  20. Mannila, H., Toivonen, H., Verkamo, A.I.: Efficient algorithms for discovering association rules. In: AAAI Workshop on Knowledge Discovery in Databases (KDD 1994), pp. 181–192 (1994)

    Google Scholar 

  21. Liu, B.Z.: Improved appriori mining frequent items algorithm. Application Research of computation 29(2), 475–477 (2012) (in Chinese)

    Google Scholar 

  22. Li, R., Kang, L.Y., Geng, H.: New optimization association rule algorithm based on array. Science Technology and Engineering 8(21), 5846–5849 (2008) (in Chinese)

    Google Scholar 

  23. Fu, S., Liao, M.H., Song, D.: An improved Appriori algorithm based on compression matrix approach. Microelectronics and Computer 29(6), 28–32 (2010) (in Chinese)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Li, T., Luo, D. (2014). A New Improved Apriori Algorithm Based on Compression Matrix. In: Luo, X., Yu, J.X., Li, Z. (eds) Advanced Data Mining and Applications. ADMA 2014. Lecture Notes in Computer Science(), vol 8933. Springer, Cham. https://doi.org/10.1007/978-3-319-14717-8_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-14717-8_1

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-14716-1

  • Online ISBN: 978-3-319-14717-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics