Abstract
We define a class of patterns generalizing the jumping emerging patterns which have been used successfully for classification problems but which are often absent in complex or sparse databases and which are often very specific. In supervised learning, the objects in a database are classified a priori into one class called positive – a target class – and the remaining classes, called negative. Each pattern, or set of attributes, has support in the positive class and in the negative class, and the ratio of these is the emergence of that pattern; the stimulating patterns are those patterns a, such that for many closed patterns b, adding the attributes of a to b reduces the support in the negative class much more than in the positive class. We present methods for comparing and attributing stimulation of closed patterns. We discuss the complexity of enumerating stimulating patterns. We discuss in particular the discovery of highly stimulating patterns and the discovery of patterns which capture contrasts. We extract these two types of stimulating patterns from UCI machine learning databases.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Dong, G., Li, J.: Applications of Emerging Patterns for Microarray Gene Expression Data Analysis. In: Liu, L., Tamer Özsu, M. (eds.) Encyclopedia of Database Systems, vol. 107. Springer, Heidelberg (2009)
Dong, G., Li, J.: Efficient mining of emerging patterns: discovering trends and differences. In: KDD 1999: Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 43–52. ACM, New York (1999)
Dong, G., Zhang, X., Wong, L., Li, J.: CAEP: Classification by Aggregating Emerging Patterns. In: Arikawa, S., Furukawa, K. (eds.) DS 1999. LNCS (LNAI), vol. 1721, pp. 30–42. Springer, Heidelberg (1999)
Fayyad, U.M., Irani, K.B.: The Attribute Selection Problem in Decision Tree Generation. In: AAAI, pp. 104–110 (1992)
Ganter, B., Stumme, G., Wille, R.: Formal Concept Analysis: Foundations and Applications. LNCS (LNAI). Springer, New York (2005)
Ganter, B., Wille, R.: Formal Concept Analysis: Mathematical Foundations. In: Trans. C. Franzke. Springer, New York (1997)
Harrell Jr., Frank, E.: Regression Modeling Strategies. Springer, New York (2006)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning. Springer Series in Statistics (2001)
Huang, H.-j., Qin, Y., Zhu, X., Zhang, J., Zhang, S.: Difference Detection Between Two Contrast Sets. In: Tjoa, A.M., Trujillo, J. (eds.) DaWaK 2006. LNCS, vol. 4081, pp. 481–490. Springer, Heidelberg (2006)
Li, J., Dong, G., Ramamohanarao, K.: Making use of the most expressive jumping emerging patterns for classification. Knowledge and Information Systems 3(2), 131–145 (2001)
Li, J., Wong, L.: Emerging patterns and gene expression data. Genome Informatics 12, 3–13 (2001)
Li, J., Yang, Q.: Strong Compound-Risk Factors: Efficient Discovery Through Emerging Patterns and Contrast Sets. IEEE Transactions on Information Technology in Biomedicine 5(11), 544–552 (2007)
Loekito, E., Bailey, J.: Using Highly Expressive Contrast Patterns for Classification - Is It Worthwhile? In: Theeramunkong, T., Kijsirikul, B., Cercone, N., Ho, T.-B. (eds.) PAKDD 2009. LNCS, vol. 5476, pp. 483–490. Springer, Heidelberg (2009)
Loekito, E., Bailey, J.: Mining influential attributes that capture class and group contrast behaviour. In: Shanahan, J.G., Amer-Yahia, S., Manolescu, I., Zhang, Y., Evans, D.A., Kolcz, A., Choi, K.-S., Chowdhury, A. (eds.) Proceedings of the 17th ACM Conference on Information and Knowledge Management, CIKM, Napa CA, USA, pp. 971–980 (2008)
Loekito, E., Bailey, J.: Fast mining of high dimensional expressive contrast patterns using zero-suppressed binary decision diagrams. In: KDD 2006: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 307–316, 1-59593-339-5 (2006)
Poezevara, G., Cuissart, B., Crémilleux, B.: Discovering Emerging Graph Patterns from Chemicals. In: Rauch, J., Raś, Z.W., Berka, P., Elomaa, T. (eds.) Foundations of Intelligent Systems. LNCS, vol. 5722, pp. 45–55. Springer, Heidelberg (2009)
Ramamohanarao, K., Bailey, J., Fan, H.: Efficient Mining of Contrast Patterns and Their Applications to Classification. In: ICISIP 2005: Proceedings of the 2005 3rd International Conference on Intelligent Sensing and Information Processing, Washington, DC, USA, pp. 39–47, 0-7803-9588-3. IEEE Computer Society, Los Alamitos (2005)
Ramamohanarao, K., Fan, H.: Patterns Based Classifiers. In: World Wide Web, Hingham, MA, USA, vol. 1(10), pp. 71–83, 1386-145X. Kluwer Academic Publishers, Dordrecht (2007)
Ting, R.M.H., Bailey, J.: In: Ghosh, J., Lambert, D., Skillicorn, D.B., Srivastava, J. (eds.) Proceedings of the Sixth SIAM International Conference on Data Mining, SDM, Bethesda, MD, USA, April 20-22. SIAM, Philadelphia (2006)
Valtchev, P., Grosser, D., Roume, C., Hacene, M.R.: Galicia: An Open Platform for Lattices. In: Using Conceptual Structures: Contributions to the 11th Intl. Conference on Conceptual Structures, pp. 241–254 (2003)
Webb, G., Butler, S., Newlands, D.: On detecting differences between groups. In: KDD 2003: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 256–265. ACM, New York (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bissell-Siders, R., Cuissart, B., Crémilleux, B. (2010). On the Stimulation of Patterns. In: Croitoru, M., Ferré, S., Lukose, D. (eds) Conceptual Structures: From Information to Intelligence. ICCS 2010. Lecture Notes in Computer Science(), vol 6208. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14197-3_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-14197-3_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14196-6
Online ISBN: 978-3-642-14197-3
eBook Packages: Computer ScienceComputer Science (R0)