Frequent Itemsets as Descriptors of Textual Records

  • Conference paper
Computational Collective Intelligence (ICCCI 2019)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11684)

Abstract

The analysis of numerical data, whether structured, semi-structured, or raw, is of paramount importance in many sectors of economic, scientific, or simply social activity. The process of extracting association rules depends on the lexical quality of the text and on the minimum support set by the user. In this paper, we propose to use frequent itemsets as descriptors and to classify them using the K-Medoids algorithm and hierarchical clustering. We present how they can be identified and used to define a level of similarity between several segments. The experiments conducted demonstrate the potential of the proposed approach for defining similarity between segments.
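To make the idea of itemsets as descriptors concrete, the following is a minimal sketch and not the authors' implementation. It assumes that each segment is treated as a set of lowercase whitespace-separated words, that itemsets are enumerated by brute force up to a small size (rather than with Apriori or FP-Growth), and that segment similarity is measured with the Jaccard index over shared frequent itemsets. The function names `frequent_itemsets`, `descriptors`, and `jaccard`, and the parameter `max_size`, are hypothetical.

```python
from collections import Counter
from itertools import combinations


def frequent_itemsets(segments, min_support, max_size=2):
    """Return {itemset: support} for word sets whose support >= min_support.

    Assumption: brute-force enumeration of all word combinations up to
    max_size per segment; suitable only for small illustrative inputs.
    """
    transactions = [frozenset(s.lower().split()) for s in segments]
    counts = Counter()
    for words in transactions:
        for k in range(1, max_size + 1):
            for combo in combinations(sorted(words), k):
                counts[frozenset(combo)] += 1
    n = len(transactions)
    return {s: c / n for s, c in counts.items() if c >= min_support * n}


def descriptors(segment, itemsets):
    """Frequent itemsets fully contained in the segment's word set."""
    words = set(segment.lower().split())
    return {s for s in itemsets if s <= words}


def jaccard(a, b):
    """Jaccard similarity of two descriptor sets (0.0 if both are empty)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


# Toy segments, invented purely for illustration.
segments = [
    "data mining finds patterns",
    "mining frequent patterns in data",
    "clustering groups similar segments",
]
itemsets = frequent_itemsets(segments, min_support=0.6)
descs = [descriptors(s, itemsets) for s in segments]
print(jaccard(descs[0], descs[1]))  # 1.0: same frequent itemsets
print(jaccard(descs[0], descs[2]))  # 0.0: no shared frequent itemsets
```

A pairwise distance such as `1 - jaccard(a, b)` could then be passed to a K-Medoids or hierarchical clustering routine. Whether the paper uses this particular distance is not stated in the abstract.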

Author information

Corresponding author

Correspondence to Ismaïl Biskri.

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Cite this paper

Bokhabrine, A., Biskri, I., Ghazzali, N. (2019). Frequent Itemsets as Descriptors of Textual Records. In: Nguyen, N., Chbeir, R., Exposito, E., Aniorté, P., Trawiński, B. (eds) Computational Collective Intelligence. ICCCI 2019. Lecture Notes in Computer Science, vol 11684. Springer, Cham. https://doi.org/10.1007/978-3-030-28374-2_4

  • DOI: https://doi.org/10.1007/978-3-030-28374-2_4

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-28373-5

  • Online ISBN: 978-3-030-28374-2

  • eBook Packages: Computer Science, Computer Science (R0)
