Frequent Itemsets as Descriptors of Textual Records

Bokhabrine, Ayoub; Biskri, Ismaïl; Ghazzali, Nadia

doi:10.1007/978-3-030-28374-2_4

Ayoub Bokhabrine¹³,
Ismaïl Biskri¹³ &
Nadia Ghazzali¹³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11684))

Included in the following conference series:

International Conference on Computational Collective Intelligence

1740 Accesses
2 Citations

Abstract

The analysis of numerical data, whether structured, semi-structured, or raw, is of paramount importance in many sectors of economic, scientific, or simply social activity. the process of extraction of association rules is based on the lexical quality of the text and on the minimum support set by the user. In this paper, we propose to use frequent itemsets as descriptors and classifying them by using K-Medoids algorithm and Hierarchical cluster. We present how they can be identified and used to define a level of similarity between several segments. The experiments conducted demonstrate the potential of the proposed approach for defining similarity between segments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Agrawal, R., Imielinski T., Swami, A.: Minning association rules between sets of items in large databases. In: Proceedings of the SIGMOD Conference on Management of Data, pp 207–216 (1993)
Google Scholar
Agrawal, R., Srikant, R.: Fast Algorithms for mining association rules. In: Proceedings of the 20th International Conference on Very Large Database, pp. 487–499 (1994)
Google Scholar
Alghamdi, R.A., Taileb, M., Ameen, M.: A new multimodal fusion method based on association rules mining for image retrieval. In: 2014 17th IEEE Mediterranean Electrotechnical Conference (MELECON), pp. 493–499. IEEE (2014)
Google Scholar
Bahri, E., Lallich, S.: Proposition d’une méthode de classification associative adaptative. 10eme journées Francophones d’Extraction et Gestion des Connaissances, EGC 2010, pp. 501–512 (2010)
Google Scholar
Bin Tong, H.S., Suzuki, E.: A feature-free and parameter-light multi-task clustering framework. In: Knowledge and Information Systems, pp. 20, 17, 42 (2013)
Google Scholar
Fournier-Viger, P., Lin, J.C.W., Vo, B., Chi, T.T., Zhang, J., Le, H.B.: A survey of itemsets mining. Data Mining and Knowledge Discovery, Wiley Interdisciplinary Reviews (2017)
Book Google Scholar
Geng, L., Hamilton, H.J.: Interestingness measures for data mining: a survey. ACM Comput. Surv. (CSUR) 38(3), 9 (2006)
Article Google Scholar
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: ACM Sigmod Record, vol. 29, No. 2, pp. 1–12. ACM (2000)
Google Scholar
Le Bras, Y., Meyer, P., Lenca, P., Lallich, S.: Mesure de la robustesse de règles d’association. QDC 2010 (2010)
Google Scholar
Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In: Knowledge Discovery and Data Mining, pp. 80–86 (1998)
Google Scholar
McCallum, A., Nigam, K.: A comparison of event models for naive bayes text classification. In: AAAI-98 Workshop on Learning for Text Categorization, vol. 752, pp. 41–48 (1998)
Google Scholar
Mittal, K., Aggarwal, G., Mahajan, P.: A comparative study of association rule mining techniques and predictive mining approaches for association classification. Int. J. Adv. Res. Comput. Sci. 8(9), 365–372 (2017)
Article Google Scholar
Rompré, L., Biskri, I., Meunier, J.-G.: Using association rules mining for retrieving genre-specific music files. In: Proceedings of FLAIRS 2017, pp. 706–711 (2017)
Google Scholar
Tan, P.N., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for association patterns. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 32–41. ACM (2002)
Google Scholar
Zaïane, O.R., Antonie, M.L.: Classifying text documents by associating terms with text categories. In: Australian Computer Science Communications, vol. 24, No. 2, pp. 215–222 (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

Université du Québec à Trois-Rivières, Trois-Rivières, Canada
Ayoub Bokhabrine, Ismaïl Biskri & Nadia Ghazzali

Authors

Ayoub Bokhabrine
View author publications
You can also search for this author in PubMed Google Scholar
Ismaïl Biskri
View author publications
You can also search for this author in PubMed Google Scholar
Nadia Ghazzali
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ismaïl Biskri .

Editor information

Editors and Affiliations

Wrocław University of Science and Technology, Wrocław, Poland
Ngoc Thanh Nguyen
University of Pau and Pays de l'Adour, Pau, France
Richard Chbeir
University of Pau and Pays de l'Adour, Pau, France
Ernesto Exposito
University of Pau and Pays de l'Adour, Pau, France
Philippe Aniorté
Wrocław University of Science and Technology, Wrocław, Poland
Bogdan Trawiński

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bokhabrine, A., Biskri, I., Ghazzali, N. (2019). Frequent Itemsets as Descriptors of Textual Records. In: Nguyen, N., Chbeir, R., Exposito, E., Aniorté, P., Trawiński, B. (eds) Computational Collective Intelligence. ICCCI 2019. Lecture Notes in Computer Science(), vol 11684. Springer, Cham. https://doi.org/10.1007/978-3-030-28374-2_4

Download citation

DOI: https://doi.org/10.1007/978-3-030-28374-2_4
Published: 09 August 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-28373-5
Online ISBN: 978-3-030-28374-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics