Abstract
In this paper we propose a new incremental Gene Expression Programming (GEP) ensemble classifier. Our base classifiers are induced from a chunk of data instances using GEP. Size of the chunk controls the number of instances with known class labels used to induce base classifiers iteratively. Instances with unknown class label are classified in sequence, one by one. It is assumed that after a decision as to the class label of the new instance has been taken its true class label is revealed. From a set of base classifier a metagene is induced and used to predict class label of instances with unknown class labels. To validate the approach an extensive computational experiment has been carried-out.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
MOA Analysis: UCI Machine Learning Repository (2013). http://moa.cms.waikato.ac.nz/datasets/
Bertini, J.R.J., Zhao, L., Lopes, A.A.: An incremental learning algorithm based on the k-associated graph for non-stationary data classification. Inf. Sci. 246, 52–68 (2013)
Cohen, L., Avrahami, G., Last, M., Kandel, A.: Info-fuzzy algorithms for mining dynamic data streams. Appl. Soft Comput. 8(4), 1283–1294 (2008). http://dx.doi.org/10.1016/j.asoc.2007.11.003
Fern, A., Givan, R.: Online ensemble learning: an empirical study. Mach. Learn. 53(1-2), 71–109 (2003). http://dx.doi.org/10.1023/A:1025619426553
Ferreira, C.: Gene expression programming: a new adaptive algorithm for solving problems. CoRR cs.AI/0102027 (2001). http://arxiv.org/abs/cs.AI/0102027
Ferreira, C.: Gene Expression Programming: Mathematical Modeling by an Artificial Intelligence. SCI, vol. 21. Springer, Heidelberg (2006)
Hulten, G., Spencer, L., Domingos, P.M.: Mining time-changing data streams. In: Lee, D., Schkolnick, M., Provost, F.J., Srikant, R. (eds.) Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 26–29 August, pp. 97–106. ACM (2001). http://portal.acm.org/citation.cfm?id=502512.502529
Jedrzejowicz, J., Jedrzejowicz, P.: GEP-induced expression trees as weak classifiers. In: Perner, P. (ed.) ICDM 2008. LNCS, vol. 5077, pp. 129–141. Springer, Heidelberg (2008). doi:10.1007/978-3-540-70720-2_10
Jedrzejowicz, J., Jedrzejowicz, P.: A family of GEP-induced ensemble classifiers. In: Nguyen, N.T., Kowalczyk, R., Chen, S.-M. (eds.) ICCCI 2009. LNCS, vol. 5796, pp. 641–652. Springer, Heidelberg (2009). doi:10.1007/978-3-642-04441-0_56
Jedrzejowicz, J., Jedrzejowicz, P.: Experimental evaluation of two new gep-based ensemble classifiers. Expert Syst. Appl. 38(9), 10932–10939 (2011). http://dx.doi.org/10.1016/j.eswa.2011.02.135
Jedrzejowicz, J., Jedrzejowicz, P.: Distance-based online classifiers. Expert Syst. Appl. 60, 249–257 (2016). http://dx.doi.org/10.1016/j.eswa.2016.05.015
Kotsiantis, S.B.: An incremental ensemble of classifiers. Artif. Intell. Rev. 36(4), 249–266 (2011). http://dx.doi.org/10.1007/s10462-011-9211-4
Lichman, M.: UCI Machine Learning Repository (2013). http://archive.ics.uci.edu/ml/
Liu, D., Wu, Y., Jiang, H.: FP-ELM: an online sequential learning algorithm for dealing with concept drift. Neurocomputing 207, 322–334 (2016). http://www.sciencedirect.com/science/article/pii/S0925231216303125
Mldata.org: Machine learning data set repository (2013). http://mldata.org/repository/tags/data/
Moreno-Torres, J.G., Sáez, J.A., Herrera, F.: Study on the impact of partition-induced dataset shift on k -fold cross-validation. IEEE Trans. Neural Netw. Learning Syst. 23(8), 1304–1312 (2012). http://dx.doi.org/10.1109/TNNLS.2012.2199516
Schlimmer, J.C., Granger, R.H.: Incremental learning from noisy data. Mach. Learn. 1(3), 317–354 (1986). http://dx.doi.org/10.1023/A:1022810614389
Todorovski, L., Dzeroski, S.: Combining classifiers with meta decision trees. Mach. Learn. 50(3), 223–249 (2003). http://dx.doi.org/10.1023/A:1021709817809
Torres, D.M., Aguilar-Ruiz, J.S.: A similarity-based approach for data stream classification. Expert Syst. Appl. 41(9), 4224–4234 (2014). http://dx.doi.org/10.1016/j.eswa.2013.12.041
Turkov, P., Krasotkina, O., Mottl, V.: Dynamic programming for bayesian logistic regression learning under concept drift. In: Maji, P., Ghosh, A., Murty, M.N., Ghosh, K., Pal, S.K. (eds.) PReMI 2013. LNCS, vol. 8251, pp. 190–195. Springer, Heidelberg (2013). doi:10.1007/978-3-642-45062-4_26
Utgoff, P.E., Berkman, N.C., Clouse, J.A.: Decision tree induction based on efficient tree restructuring. Mach. Learn. 29(1), 5–44 (1997). http://dx.doi.org/10.1023/A:1007413323501
Wang, L., Ji, H., Jin, Y.: Fuzzy passive-aggressive classification: a robust and efficient algorithm for online classification problems. Inf. Sci. 220, 46–63 (2013)
Wei, X., Huang, H.: Granular twin support vector machines based on mixture kernel function. In: Huang, D.-S., Han, K. (eds.) ICIC 2015. LNCS, vol. 9227, pp. 43–54. Springer, Cham (2015). doi:10.1007/978-3-319-22053-6_5
Wisaeng, K.: A comparison of different classification techniques for bank direct marketing. Int. J. Soft Comput. Eng. 3(4), 116–119 (2013)
Wolpert, D.H.: Stacked generalization. Neural Netw. 5(2), 241–259 (1992). http://dx.doi.org/10.1016/S0893-6080(05)80023-1
Xu, S., Wang, J.: A fast incremental extreme learning machine algorithm for data streams classification. Expert Syst. Appl. 65(C), 332–344 (2016). https://doi.org/10.1016/j.eswa.2016.08.052
Zliobaite, I.: Combining similarity in time and space for training set formation under concept drift. Intell. Data Anal. 15(4), 589–611 (2011). http://dx.doi.org/10.3233/IDA-2011-0484
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Jedrzejowicz, J., Jedrzejowicz, P. (2018). Incremetal GEP-Based Ensemble Classifier. In: Czarnowski, I., Howlett, R., Jain, L. (eds) Intelligent Decision Technologies 2017. IDT 2017. Smart Innovation, Systems and Technologies, vol 72. Springer, Cham. https://doi.org/10.1007/978-3-319-59421-7_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-59421-7_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59420-0
Online ISBN: 978-3-319-59421-7
eBook Packages: EngineeringEngineering (R0)