Parallel Implantation of Frequent Itemset Mining Using Inverted Matrix Based on OpenCL

  • Conference paper
Proceedings of the International Congress on Information and Communication Technology

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 438)

Abstract

Extracting knowledge in the form of frequent itemsets and association rules is of great importance in data mining. The Apriori algorithm requires multiple scans of the database and is therefore heavily memory dependent. The frequent pattern tree (FP-tree) growth algorithm, on the other hand, becomes impractical for large databases because it relies on a memory-based data structure. An efficient approach that combines an inverted matrix with COFI (co-occurrence frequent item) trees alleviates the disadvantages of both algorithms. For massively large computations, modern GPUs provide a large set of parallel processors that facilitate general-purpose computing. General-purpose graphics processing unit (GPGPU) computing is a way of using the existing GPU for general-purpose work. The Open Computing Language (OpenCL) provides a standard for cross-platform programming on modern processors such as many-core CPUs and GPUs. Because the inverted matrix approach has advantages over the other algorithms, it is desirable to parallelize it with OpenCL. We propose a new technique, CLInverted matrix itemset mining, which advances existing techniques and contributes to load sharing. The architecture proposed in this paper implements the inverted matrix approach on the OpenCL framework. In the experiments, we compare the results of the serial and parallel versions of the proposed approach on various OpenCL devices.
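To make the terms in the abstract concrete, the following is a minimal Python sketch of frequent itemset mining over an inverted (item to transaction IDs) index. It is an illustration only and not the authors' CLInverted matrix method: it uses brute-force candidate enumeration instead of COFI-trees, it runs serially, and the function names (`build_inverted_index`, `frequent_itemsets`), the `max_size` parameter, and the toy transactions are all assumptions introduced here.

```python
from collections import defaultdict
from itertools import combinations


def build_inverted_index(transactions):
    """Map each item to the set of transaction ids that contain it."""
    index = defaultdict(set)
    for tid, items in enumerate(transactions):
        for item in items:
            index[item].add(tid)
    return dict(index)


def frequent_itemsets(transactions, min_support, max_size=3):
    """Return {itemset (sorted tuple): support count} for itemsets
    that occur in at least min_support transactions.

    Hypothetical helper for illustration: candidates are enumerated
    by brute force, not grown with COFI-trees as in the paper.
    """
    index = build_inverted_index(transactions)
    # Only items that are frequent on their own can appear in a frequent itemset.
    frequent_items = sorted(
        item for item, tids in index.items() if len(tids) >= min_support
    )
    result = {}
    for size in range(1, max_size + 1):
        for candidate in combinations(frequent_items, size):
            # Support = number of transactions containing every item in the candidate.
            tids = set.intersection(*(index[item] for item in candidate))
            if len(tids) >= min_support:
                result[candidate] = len(tids)
    return result


# Toy data (made up for this sketch).
transactions = [
    {"A", "B", "C"},
    {"A", "B"},
    {"A", "C"},
    {"B", "C"},
    {"A", "B", "C"},
]
result = frequent_itemsets(transactions, min_support=3)
for itemset, support in sorted(result.items()):
    print(itemset, support)
```

Each candidate's support is computed independently of the others, which is the kind of work that could in principle be distributed across parallel processing elements; how the paper actually maps the inverted matrix onto OpenCL devices is not described in the abstract.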

Author information

Corresponding author

Correspondence to Pratipalsinh Zala.

Copyright information

© 2016 Springer Science+Business Media Singapore

About this paper

Cite this paper

Pratipalsinh Zala, Hiren Kotadiya, Sanjay Bhanderi (2016). Parallel Implantation of Frequent Itemset Mining Using Inverted Matrix Based on OpenCL. In: Satapathy, S., Bhatt, Y., Joshi, A., Mishra, D. (eds) Proceedings of the International Congress on Information and Communication Technology. Advances in Intelligent Systems and Computing, vol 438. Springer, Singapore. https://doi.org/10.1007/978-981-10-0767-5_9

  • DOI: https://doi.org/10.1007/978-981-10-0767-5_9

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-0766-8

  • Online ISBN: 978-981-10-0767-5

  • eBook Packages: Engineering, Engineering (R0)
