Skip to main content

A Context-Driven Data Weighting Approach for Handling Concept Drift in Classification

  • Conference paper
  • First Online:
Proceedings of the 9th International Conference on Computer Recognition Systems CORES 2015

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 403))

  • 991 Accesses

Abstract

Adapting classification models to concept drift is one of the main challenges associated with applying these models in dynamic environments. In particular, the learned concept is not static and may change over time under the influence of varying conditions (i.e. varying context). Unlike existing approaches where only the most recent data are considered for adapting the model, we propose incorporating context awareness into the adaptation process. The goal is to utilise knowledge of relevant context variables to facilitate the selection of more relevant training data. Specifically, we propose to weight each training example based on the degree of similarity with the current context. To detect such similarity, we utilise two approaches: a simple difference between the context variable values and a distribution-based distance metric. The experimental analyses show that such explicit context utilisation results in a more effective data selection strategy and enables to produce more accurate predictions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Barakat, L.: Context identification and exploitation in dynamic data mining—an application to classifying electricity price changes. In: Bouchachia, A. (ed.) Adaptive and Intelligent Systems. Lecture Notes in Computer Science, vol. 8779, pp. 80–89. Springer, Heidelberg (2014)

    Google Scholar 

  2. Du, L., Song, Q., Jia, X.: Detecting concept drift: an information entropy based method using an adaptive sliding window. Intell. Data Anal. 18, 337–364 (2014)

    Google Scholar 

  3. Frank, E., Hall, M., Pfahringer, B.: Locally weighted naive bayes. In: Proceedings of the 19th Conference on Uncertainty in Artificial Intelligence, pp. 249–256 (2003)

    Google Scholar 

  4. Gama, J., Medas, P., Castillo, G., Rodrigues, P.: Learning with drift detection. In: Proceedings of the 17th Brazilian Symposium on Artificial Intelligence, LNAI 3171, pp. 286–295 (2004)

    Google Scholar 

  5. Gama, J., Bifet, A., Pechenizkiy, M., Bouchachia, A.: A survey on concept drift adaptation. ACM Comput. Surv. 46(4), 1–37 (2014)

    Article  MATH  Google Scholar 

  6. Garcia, M.B., del Campo-Avila, J., Fidalgo, R., Bifet, A., Gavalda, R., Morales-Bueno, R.: Early drift detection method. In: ECML PKDD 2006 Workshop on Knowledge Discovery from Data Streams (2006)

    Google Scholar 

  7. Harries, M., Sammut, C., Horn, K.: Extracting hidden context. Mach. Learn. 32(2), 101–126 (1998)

    Article  MATH  Google Scholar 

  8. Harries, M.: Splice-2 Comparative Evaluation: Electricity Pricing. Technical report, University of New South Wales (1999)

    Google Scholar 

  9. Hulten, G., Spencer, L., Domingos, P.: Mining time-changing data streams. In: Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 97–106 (2001)

    Google Scholar 

  10. Katakis, I., Tsoumakas, G., Vlahavas, I.: Tracking recurring contexts using ensemble classifiers: an application to email filtering. Knowl. Inf. Syst. 22(3), 371–391 (2010)

    Article  Google Scholar 

  11. Koychev, I.: Gradual forgetting for adaptation to concept drift. In: Proceedings of ECAI Workshop Current Issues in Spatio-Temporal Reasoning, pp. 101–106 (2000)

    Google Scholar 

  12. Kubat, M.: Floating approximation in time-varying knowledge bases. Pattern Recognit. Lett. 10(4), 223–227 (1989)

    Article  MATH  Google Scholar 

  13. Pavlidis, N.G., Tasoulis, D.K., Adams, N.M., Hand, D.J.: \(\lambda \)-Perceptron: an adaptive classifier for data streams. Pattern Recognit. 44(1), 78–96 (2011)

    Article  MATH  Google Scholar 

  14. Pavlidis, N.G., Tasoulis, D.K., Adams, N.M., Hand, D.J.: Adaptive consumer credit classification. J. Oper. Res. Soc. 63(12), 1645–1654 (2012)

    Article  Google Scholar 

  15. Schlimmer, J., Granger, R.: Incremental learning from noisy data. Mach. Learn. 1(3), 317–354 (1986)

    Google Scholar 

  16. Sebastiao, R., Gama, J.: Change detection in learning histograms from data streams. In: Proceedings of the Portuguese Conference on Artificial Intelligence. LNAI 4874, pp. 112–123 (2007)

    Google Scholar 

  17. Tsymbal, A.: The Problem of Concept Drift: Definitions and Related Work. Computer Science Department, Trinity College Dublin (2004)

    Google Scholar 

  18. Wang, H., Fan, W., Yu, P. S., Han, J.: Mining concept-drifting data streams using ensemble classifiers. In: Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 226–235 (2003)

    Google Scholar 

  19. Widmer, G., Kubat, M.: Learning in the presence of concept drift and hidden contexts. Mach. Learn. 23(1), 69–101 (1996)

    Google Scholar 

  20. Zliobaite, I., Kuncheva, L.: Determining the training window for small sample size classification with concept drift. In: Proceedings of the IEEE International Conference on Data Mining Workshop, pp. 447–452 (2009)

    Google Scholar 

  21. Zliobaite, I.: How Good is the Electricity Benchmark for Evaluating Concept Drift Adaptation. CoRR, abs/1301-3524 (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lida Barakat .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Barakat, L. (2016). A Context-Driven Data Weighting Approach for Handling Concept Drift in Classification. In: Burduk, R., Jackowski, K., Kurzyński, M., Woźniak, M., Żołnierek, A. (eds) Proceedings of the 9th International Conference on Computer Recognition Systems CORES 2015. Advances in Intelligent Systems and Computing, vol 403. Springer, Cham. https://doi.org/10.1007/978-3-319-26227-7_36

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-26227-7_36

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-26225-3

  • Online ISBN: 978-3-319-26227-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics