A Fuzzy Classifier for Data Streams with Infinitely Delayed Labels

  • Tiago Pinho da SilvaEmail author
  • Vinicius Mourão Alves Souza
  • Gustavo Enrique Almeida Prado Alves Batista
  • Heloisa de Arruda Camargo
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11401)


In data stream learning, classification is a prominent task which aims to predict the class labels of incoming examples. However, in classification, most of the approaches from literature make assumptions that limit the usefulness of the methods in real scenarios such as the supposition that the label of an example will be available right after its prediction, i.e., there is no time delay to acquiring actual labels. It is a very optimistic assumption, since labeling the entire data stream is usually not feasible. Some recent approaches overcome this limitation, considering unsupervised learning methods to deal with delayed labels. Also, some proposals explore concepts of fuzzy set theory to add more flexibility to the learning process, although restricted to data streams with no delayed labels. In this paper, we propose a fuzzy classifier for data streams with infinitely delayed labels called FuzzMiC. Our algorithm generates a model based on fuzzy micro-clusters that provides flexible class boundaries and allows the classification of evolving data streams. Experiments show that our approach is promising in dealing with incremental changes.


Data streams Classification Delayed labels Fuzzy 


  1. 1.
    Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Springer US, New York (1981). Scholar
  2. 2.
    Dyer, K.B., Capo, R., Polikar, R.: COMPOSE: a semisupervised learning framework for initially labeled nonstationary streaming data. TNNLS 25(1), 12–26 (2014)Google Scholar
  3. 3.
    Gama, J., Žliobaitė, I., Bifet, A., Pechenizkiy, M., Bouchachia, A.: A survey on concept drift adaptation. CSUR 46(4), 44 (2014)CrossRefGoogle Scholar
  4. 4.
    Hashemi, S., Yang, Y.: Flexible decision tree for data stream classification in the presence of concept change, noise and missing values. Data Min. Knowl. Disc. 19(1), 95–131 (2009)MathSciNetCrossRefGoogle Scholar
  5. 5.
    Hulten, G., Spencer, L., Domingos, P.: Mining time-changing data streams. In: ACM SIGKDD, pp. 97–106. ACM (2001)Google Scholar
  6. 6.
    Krempl, G.: The algorithm APT to classify in concurrence of latency and drift. In: Gama, J., Bradley, E., Hollmén, J. (eds.) IDA 2011. LNCS, vol. 7014, pp. 222–233. Springer, Heidelberg (2011). Scholar
  7. 7.
    Lopes, P.A., Camargo, H.A.: Fuzzstream: fuzzy data stream clustering based on the online-offline framework. In: FUZZ-IEEE (2017)Google Scholar
  8. 8.
    Marrs, G.R., Hickey, R.J., Black, M.M.: The impact of latency on online classification learning with concept drift. In: Bi, Y., Williams, M.-A. (eds.) KSEM 2010. LNCS (LNAI), vol. 6291, pp. 459–469. Springer, Heidelberg (2010). Scholar
  9. 9.
    Silva, T.P., Urban, G.A., Lopes, P.A., Camargo, H.A.: A fuzzy variant for on-demand data stream classification. In: BRACIS, pp. 67–72 (2017)Google Scholar
  10. 10.
    Souza, V.M.A., Silva, D.F., Batista, G.E.A.P.A., Gama, J.: Classification of evolving data streams with infinitely delayed labels. In: ICMLA, pp. 214–219 (2015)Google Scholar
  11. 11.
    Souza, V.M.A., Silva, D.F., Gama, J., Batista, G.E.A.P.A.: Data stream classification guided by clustering on nonstationary environments and extreme verification latency. In: SIAM SDM, pp. 873–881 (2015)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Tiago Pinho da Silva
    • 1
    Email author
  • Vinicius Mourão Alves Souza
    • 1
  • Gustavo Enrique Almeida Prado Alves Batista
    • 1
  • Heloisa de Arruda Camargo
    • 2
  1. 1.Universidade de São PauloSão CarlosBrazil
  2. 2.Universidade Federal de São CarlosSão CarlosBrazil

Personalised recommendations