Abstract
The analysis of numerical data, whether structured, semi-structured, or raw, is of paramount importance in many sectors of economic, scientific, or simply social activity. the process of extraction of association rules is based on the lexical quality of the text and on the minimum support set by the user. In this paper, we propose to use frequent itemsets as descriptors and classifying them by using K-Medoids algorithm and Hierarchical cluster. We present how they can be identified and used to define a level of similarity between several segments. The experiments conducted demonstrate the potential of the proposed approach for defining similarity between segments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agrawal, R., Imielinski T., Swami, A.: Minning association rules between sets of items in large databases. In: Proceedings of the SIGMOD Conference on Management of Data, pp 207–216 (1993)
Agrawal, R., Srikant, R.: Fast Algorithms for mining association rules. In: Proceedings of the 20th International Conference on Very Large Database, pp. 487–499 (1994)
Alghamdi, R.A., Taileb, M., Ameen, M.: A new multimodal fusion method based on association rules mining for image retrieval. In: 2014 17th IEEE Mediterranean Electrotechnical Conference (MELECON), pp. 493–499. IEEE (2014)
Bahri, E., Lallich, S.: Proposition d’une méthode de classification associative adaptative. 10eme journées Francophones d’Extraction et Gestion des Connaissances, EGC 2010, pp. 501–512 (2010)
Bin Tong, H.S., Suzuki, E.: A feature-free and parameter-light multi-task clustering framework. In: Knowledge and Information Systems, pp. 20, 17, 42 (2013)
Fournier-Viger, P., Lin, J.C.W., Vo, B., Chi, T.T., Zhang, J., Le, H.B.: A survey of itemsets mining. Data Mining and Knowledge Discovery, Wiley Interdisciplinary Reviews (2017)
Geng, L., Hamilton, H.J.: Interestingness measures for data mining: a survey. ACM Comput. Surv. (CSUR) 38(3), 9 (2006)
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: ACM Sigmod Record, vol. 29, No. 2, pp. 1–12. ACM (2000)
Le Bras, Y., Meyer, P., Lenca, P., Lallich, S.: Mesure de la robustesse de règles d’association. QDC 2010 (2010)
Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In: Knowledge Discovery and Data Mining, pp. 80–86 (1998)
McCallum, A., Nigam, K.: A comparison of event models for naive bayes text classification. In: AAAI-98 Workshop on Learning for Text Categorization, vol. 752, pp. 41–48 (1998)
Mittal, K., Aggarwal, G., Mahajan, P.: A comparative study of association rule mining techniques and predictive mining approaches for association classification. Int. J. Adv. Res. Comput. Sci. 8(9), 365–372 (2017)
Rompré, L., Biskri, I., Meunier, J.-G.: Using association rules mining for retrieving genre-specific music files. In: Proceedings of FLAIRS 2017, pp. 706–711 (2017)
Tan, P.N., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for association patterns. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 32–41. ACM (2002)
Zaïane, O.R., Antonie, M.L.: Classifying text documents by associating terms with text categories. In: Australian Computer Science Communications, vol. 24, No. 2, pp. 215–222 (2002)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Bokhabrine, A., Biskri, I., Ghazzali, N. (2019). Frequent Itemsets as Descriptors of Textual Records. In: Nguyen, N., Chbeir, R., Exposito, E., Aniorté, P., Trawiński, B. (eds) Computational Collective Intelligence. ICCCI 2019. Lecture Notes in Computer Science(), vol 11684. Springer, Cham. https://doi.org/10.1007/978-3-030-28374-2_4
Download citation
DOI: https://doi.org/10.1007/978-3-030-28374-2_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-28373-5
Online ISBN: 978-3-030-28374-2
eBook Packages: Computer ScienceComputer Science (R0)