A Survey on Representation for Itemsets in Association Rule Mining

  • Conference paper
Advanced Computing and Intelligent Engineering

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 1082)

Abstract

Mining frequent itemsets is one of the main challenges in association rule mining. The efficiency of frequent itemset mining depends on the computation time and on the data structure used to store the itemsets, which largely determines the space requirement. Most algorithms work well on sparse datasets. However, when the dataset is large, computation becomes difficult and execution time increases, which limits the scalability of the algorithm. With a compact and concise representation, the itemsets can fit in memory and hence require no I/O operations. The data structures most commonly used are the array, the tree, and the trie. In this paper, we present a comparison of the different data structures used by the mining algorithms.
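
As a rough illustration of one of the structures named above, the following Python sketch stores itemsets in a trie, so that itemsets sharing a prefix share nodes. It is a minimal sketch and not the paper's method: the names `TrieNode`, `ItemsetTrie`, and `count_itemsets`, the sample transactions, and the brute-force subset enumeration are all assumptions made for illustration.

```python
from itertools import combinations


class TrieNode:
    """One node per item; the path from the root spells out an itemset."""

    def __init__(self):
        self.children = {}  # item -> TrieNode
        self.count = 0      # support of the itemset ending at this node


class ItemsetTrie:
    """Hypothetical trie keyed on items in sorted order."""

    def __init__(self):
        self.root = TrieNode()

    def add(self, itemset):
        """Increment the support count of one itemset."""
        node = self.root
        for item in sorted(itemset):
            node = node.children.setdefault(item, TrieNode())
        node.count += 1

    def support(self, itemset):
        """Return the stored count of an itemset, or 0 if it was never added."""
        node = self.root
        for item in sorted(itemset):
            node = node.children.get(item)
            if node is None:
                return 0
        return node.count


def count_itemsets(transactions, size):
    """Count every `size`-item subset of every transaction (brute force)."""
    trie = ItemsetTrie()
    for transaction in transactions:
        for subset in combinations(sorted(set(transaction)), size):
            trie.add(subset)
    return trie


# Made-up sample data, for illustration only.
transactions = [
    ["bread", "milk"],
    ["bread", "butter", "milk"],
    ["butter", "milk"],
]
pairs = count_itemsets(transactions, 2)
print(pairs.support(["bread", "milk"]))    # 2
print(pairs.support(["butter", "milk"]))   # 2
print(pairs.support(["bread", "butter"]))  # 1
```

Because items are sorted before insertion, `{bread, butter}` and `{bread, milk}` share the `bread` node, which is where the space saving of a prefix structure comes from. A flat array or hash map of itemsets would store each itemset in full.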

Author information

Corresponding author

Correspondence to Carynthia Kharkongor.

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Kharkongor, C., Nath, B. (2020). A Survey on Representation for Itemsets in Association Rule Mining. In: Pati, B., Panigrahi, C., Buyya, R., Li, KC. (eds) Advanced Computing and Intelligent Engineering. Advances in Intelligent Systems and Computing, vol 1082. Springer, Singapore. https://doi.org/10.1007/978-981-15-1081-6_14

  • DOI: https://doi.org/10.1007/978-981-15-1081-6_14

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-1080-9

  • Online ISBN: 978-981-15-1081-6

  • eBook Packages: Engineering, Engineering (R0)
