Skip to main content

A Weightless Neural Network-Based Approach for Stream Data Clustering

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7435))

Abstract

One of the major data mining tasks is to cluster similar data, because of its usefulness, providing means of summarizing large ammounts of raw data into handy information. Clustering data streams is particularly challenging, because of the constraints imposed when dealing with this kind of input. Here we report our work, in which it was investigated the use of WiSARD discriminators as primary data synthesizing units. An analysis of StreamWiSARD, a new sliding-window stream data clustering system, the benefits and the drawbacks of its use and a comparison to other approaches are all presented.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zhou, A., Cao, F., Qian, W., Jin, C.: Tracking clusters in evolving data streams over sliding windows. Knowl. Inf. Syst. 15(2), 181–214 (2008)

    Article  Google Scholar 

  2. Wan, L., Ng, W.K., Dang, X.H., Yu, P.S., Zhang, K.: Density-based clustering of data streams at multiple resolutions. ACM Trans. Knowl. Discov. Data 3, 14:1–14:28 (2009)

    Google Scholar 

  3. Aggarwal, C.C., Han, J., Wang, J., Yu, P.S.: A framework for clustering evolving data streams. In: VLDB, pp. 81–92 (2003)

    Google Scholar 

  4. Aleksander, I., Gregorio, M.D., França, F.M.G., Lima, P.M.V., Morton, H.: A brief introduction to weightless neural systems. In: Proceedings of the 17th European Symposium on Artificial Neural Networks, ESANN 2009, Bruges, Belgium, April 22-24 (2009)

    Google Scholar 

  5. Bifet, A., Holmes, G., Pfahringer, B., Read, J., Kranen, P., Kremer, H., Jansen, T., Seidl, T.: Moa: A real-time analytics open source framework. In: [7], pp. 617–620

    Google Scholar 

  6. Brodley, C.E. (ed.): Proceedings of the Twenty-first International Conference on Machine Learning (ICML 2004), Banff, Alberta, Canada, July 4-8. ACM International Conference Proceeding Series, vol. 69. ACM (2004)

    Google Scholar 

  7. Gunopulos, D., Hofmann, T., Malerba, D., Vazirgiannis, M. (eds.): ECML PKDD 2011. LNCS, vol. 6913. Springer, Heidelberg (2011)

    Google Scholar 

  8. Kranen, P., Assent, I., Baldauf, C., Seidl, T.: The clustree: indexing micro-clusters for anytime stream mining. Knowl. Inf. Syst. 29(2), 249–272 (2011)

    Article  Google Scholar 

  9. Lühr, S., Lazarescu, M.: Incremental clustering of dynamic data streams using connectivity based representative points. Data Knowl. Eng. 68(1), 1–27 (2009)

    Article  Google Scholar 

  10. Moise, G., Sander, J., Ester, M.: Robust projected clustering. Knowl. Inf. Syst. 14, 273–298 (2008)

    Article  MATH  Google Scholar 

  11. Bandeira, L.C.: NC-WISARD: Uma interpretação sem pesos do modelo neural neocognitron. M.Sc. thesis, Rio de Janeiro, RJ, Brasil (2010) (in Portuguese)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cardoso, D., De Gregorio, M., Lima, P., Gama, J., França, F. (2012). A Weightless Neural Network-Based Approach for Stream Data Clustering. In: Yin, H., Costa, J.A.F., Barreto, G. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2012. IDEAL 2012. Lecture Notes in Computer Science, vol 7435. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32639-4_40

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-32639-4_40

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-32638-7

  • Online ISBN: 978-3-642-32639-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics