Abstract
This work presents a novel deterministic method for obtaining rules in Subgroup Discovery tasks. It does not discretize numeric attributes beforehand; instead, the conditions on those attributes are obtained dynamically. The final rules are selected according to their AUC value. An experimental study, supported by appropriate statistical tests, showed good results in comparison with the classic deterministic algorithms CN2-SD and APRIORI-SD. The best results were obtained in the number of induced rules, where a significant reduction was achieved. In addition, better coverage and fewer attributes were obtained in comparison with CN2-SD.
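As a rough illustration of selecting rules by AUC, the sketch below scores a single rule from its confusion counts. It assumes the common single-point ROC formula, AUC = (1 + TPR - FPR) / 2, which the abstract does not confirm as the authors' exact definition. The names `RuleCounts`, `rule_auc`, `coverage`, and `select_rules`, as well as the `min_auc` threshold, are hypothetical and introduced only for this example.

```python
from dataclasses import dataclass


@dataclass
class RuleCounts:
    """Confusion counts of one rule for a fixed target class (hypothetical helper)."""
    tp: int   # target-class examples covered by the rule
    fp: int   # other-class examples covered by the rule
    pos: int  # all target-class examples in the data set
    neg: int  # all other-class examples in the data set


def rule_auc(c: RuleCounts) -> float:
    """Area under the ROC curve through (0,0), (FPR,TPR), (1,1).

    Assumption: single-point ROC AUC, not necessarily the paper's definition.
    """
    tpr = c.tp / c.pos
    fpr = c.fp / c.neg
    return (1.0 + tpr - fpr) / 2.0


def coverage(c: RuleCounts) -> float:
    """Fraction of all examples that satisfy the rule's conditions."""
    return (c.tp + c.fp) / (c.pos + c.neg)


def select_rules(candidates: dict, min_auc: float) -> list:
    """Keep the names of candidate rules whose AUC reaches min_auc."""
    return [name for name, c in candidates.items() if rule_auc(c) >= min_auc]


if __name__ == "__main__":
    candidates = {
        "age > 40.5": RuleCounts(tp=3, fp=1, pos=4, neg=4),
        "age > 10.0": RuleCounts(tp=4, fp=4, pos=4, neg=4),
    }
    for name, counts in candidates.items():
        print(name, rule_auc(counts), coverage(counts))
    print(select_rules(candidates, min_auc=0.6))
```

Under this formula a rule that covers every example scores 0.5, the same as random guessing, so an AUC threshold favours rules that separate the target class over rules that are merely general.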
Acknowledgments
This work was partially funded by the Regional Government of Andalusia (Junta de Andalucía), grant number TIC-7629.
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Domínguez-Olmedo, J.L., Vázquez, J.M., Pachón, V. (2015). Deterministic Extraction of Compact Sets of Rules for Subgroup Discovery. In: Jackowski, K., Burduk, R., Walkowiak, K., Wozniak, M., Yin, H. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2015. IDEAL 2015. Lecture Notes in Computer Science, vol 9375. Springer, Cham. https://doi.org/10.1007/978-3-319-24834-9_17
DOI: https://doi.org/10.1007/978-3-319-24834-9_17
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24833-2
Online ISBN: 978-3-319-24834-9
eBook Packages: Computer Science, Computer Science (R0)