Abstract
We study classification over a slow stream of complex objects such as customers or students. The learning task must take into account that an object's label is influenced by incoming data from adjoint, fast streams of transactions, e.g., customer purchases or student exams, and that this label may change over time. The task involves combining the streams and exploiting associations between the target label and attribute values in the fast streams. We propose a method for discovering classification rules over such a confederation of streams, and we use it to enhance a decision tree classifier. We show that the new approach has competitive predictive power while building much smaller decision trees than the original classifier.
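To make the idea of associating fast-stream attribute values with a slow-stream label concrete, the following is a minimal illustrative sketch and not the method proposed in the paper. It assumes a toy setting in which each transaction is an `(object_id, attribute, value)` triple and each object carries one current label; the function names `fold_transactions` and `rule_stats`, and the sample data, are hypothetical.

```python
from collections import defaultdict


def fold_transactions(transactions):
    """Fold fast-stream transactions into per-object counts of (attribute, value) pairs.

    Assumption: each transaction is an (object_id, attribute, value) triple.
    """
    profile = defaultdict(lambda: defaultdict(int))
    for obj_id, attribute, value in transactions:
        profile[obj_id][(attribute, value)] += 1
    return profile


def rule_stats(profile, labels, item, target):
    """Return (support, confidence) of the candidate rule `item -> target`.

    support    = fraction of labelled objects whose transactions contain `item`
    confidence = fraction of those objects whose current label is `target`
    """
    covered = [o for o in labels if profile.get(o, {}).get(item, 0) > 0]
    if not covered:
        return 0.0, 0.0
    hits = sum(1 for o in covered if labels[o] == target)
    return len(covered) / len(labels), hits / len(covered)


# Hypothetical data: a fast stream of purchases and a slow stream of customers.
transactions = [
    ("c1", "category", "books"),
    ("c1", "category", "games"),
    ("c2", "category", "books"),
    ("c3", "category", "games"),
]
labels = {"c1": "loyal", "c2": "loyal", "c3": "churn"}

profile = fold_transactions(transactions)
books_rule = rule_stats(profile, labels, ("category", "books"), "loyal")
games_rule = rule_stats(profile, labels, ("category", "games"), "loyal")
```

In a real stream setting the counts would have to be maintained incrementally and aged as labels drift; this sketch recomputes them from scratch for clarity.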
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Siddiqui, Z.F., Spiliopoulou, M. (2011). Classification Rule Mining for a Stream of Perennial Objects. In: Bassiliades, N., Governatori, G., Paschke, A. (eds) Rule-Based Reasoning, Programming, and Applications. RuleML 2011. Lecture Notes in Computer Science, vol 6826. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22546-8_22
DOI: https://doi.org/10.1007/978-3-642-22546-8_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22545-1
Online ISBN: 978-3-642-22546-8
eBook Packages: Computer Science, Computer Science (R0)