Skip to main content

Part of the book series: Studies in Big Data ((SBD,volume 56))

  • 965 Accesses

Abstract

Although ensembles of classifiers are one of the most popular tools to deal with data streams classification task [1,2,3,4,5], in the literature there is a lack of new approaches to creating ensembles of regression estimators [6, 7]. Most of the latest developments focus on the application of the regression estimators to solve very important real-world problems. In [8] the authors propose to create an ensemble composed of decision trees, gradient boosted trees and random forest to forecast electricity consumptions. The algorithm uses different weights for each component based on its previous performance. As it was shown in [9], the regression can be applied to enhanced prediction of occurrence of the concept-drift. The authors propose an ensemble method which utilizes constrained penalized regression as a combiner to track a drifting concept in a classification setting. The data stream approach to system fault prediction has been examined in [10]. In this paper different data-stream-based linear regression prediction methods have been tested and compared with a newly developed fault detection system. The applied and evaluated data stream mining algorithms were: grid-based classifier, polygon-based method, and one-class support vector machines. The results showed that the linear regression method generally achieved good performance in predicting short-term data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Pietruczuk, L., Rutkowski, L., Jaworski, M., Duda, P.: A method for automatic adjustment of ensemble size in stream data mining. In: 2016 International Joint Conference on Neural Networks (IJCNN), pp. 9–15 (2016)

    Google Scholar 

  2. Pietruczuk, L., Rutkowski, L., Jaworski, M., Duda, P.: How to adjust an ensemble size in stream data mining? Inf. Sci. 381, 46–54 (2017)

    Article  MathSciNet  Google Scholar 

  3. Street, W.N., Kim, Y.: A streaming ensemble algorithm (sea) for large-scale classification. In: Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 377–382. ACM (2001)

    Google Scholar 

  4. Minku, L.L., Yao, X.: DDD: a new ensemble approach for dealing with concept drift. IEEE Trans. Knowl. Data Eng. 24(4), 619–633 (2012)

    Article  Google Scholar 

  5. Sun, J., Fujita, H., Chen, P., Li, H.: Dynamic financial distress prediction with concept drift based on time weighting combined with adaboost support vector machine ensemble. Knowl. Based Syst. 120, 4–14 (2017)

    Article  Google Scholar 

  6. Ikonomovska, E., Gama, J., Džeroski, S.: Online tree-based ensembles and option trees for regression on evolving data streams. Neurocomputing 150, 458–470 (2015)

    Article  Google Scholar 

  7. Duda, P., Jaworski, M., Rutkowski, L.: Online GRNN-based ensembles for regression on evolving data streams. In: International Symposium on Neural Networks, pp. 221–228. Springer, Berlin (2018)

    Google Scholar 

  8. Galicia, A., Talavera-Llames, R., Troncoso, A., Koprinska, I., Martínez-Álvarez, F.: Multi-step forecasting for big data time series based on ensemble learning. Knowl. Based Syst. (2018)

    Google Scholar 

  9. Wang, L.-Y., Park, C., Choi, H., Yeon, K.: A classifier ensemble for concept drift using a constrained penalized regression combiner. Procedia Comput. Sci. 91, 252–259 (2016)

    Article  Google Scholar 

  10. Alzghoul, A., Löfstrand, M., Backe, B.: Data stream forecasting for system fault prediction. Comput. Ind. Eng. 62(4), 972–978 (2012)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Leszek Rutkowski .

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Rutkowski, L., Jaworski, M., Duda, P. (2020). Regression. In: Stream Data Mining: Algorithms and Their Probabilistic Properties. Studies in Big Data, vol 56. Springer, Cham. https://doi.org/10.1007/978-3-030-13962-9_14

Download citation

Publish with us

Policies and ethics