Skip to main content

Data Stream Mining Based-Outlier Prediction for Cloud Computing

  • Conference paper
  • First Online:
Digital Economy. Emerging Technologies and Business Innovation (ICDEc 2017)

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 290))

Included in the following conference series:

Abstract

The cloud computing is the dream of computing used as utility that became true. It is currently emerging as a hot topic due to the important services it provides. Ensuring high quality services is a challenging task especially with the considerable increase of the user’s requests coming continuously in real time to the data center servers and consuming its resources. Abnormal users requests may contribute to the system failure. Thus, it’s crucial to detect these abnormalities for further analysis and prediction. To do that, we propose the use of the outlier detection techniques in the context of the data stream mining due to the similarity between the nature of the data streams and the users requests which require analysis and mining in real time. The main contribution of this paper consists of: first, the formulation of the users requests as well as the server state as a stream of data. This data is generated from \(CSG^+\) a cloud stream generator that we extended from CSG [1]. Second, the creation of a framework for the detection of the abnormal users requests in terms of the CPU and memory by using AnyOut and MCOD algoithms implemented within MOA (Massive Online Analysis) (http://moa.cms.waikato.ac.nz/) framework. Third, the comparison between them in this context.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. toumi, H., Brahmi, Z., Ben Arfa, Z.: Server load prediction using stream mining. In: The 31st International Conference on Information Networking (ICOIN) (2017)

    Google Scholar 

  2. Hassen, F.B., Brahmi, Z., Toumi, H.: VM placement algorithm based on recruitment process within ant colonies. In: The 1st International Conference on Digital Economy Emerging Technologies and Business Innovation (2016)

    Google Scholar 

  3. Souiden, I., Brahmi, Z., Toumi, H.: A survey on outlier detection in the context of stream mining: review of existing approaches and recommendations. In: The 16th International Conference on Intelligent Systems Design and Applications, Porto, Portugal (2016)

    Google Scholar 

  4. Assent, I., Kranen, P., Baldauf, C., Seidl, T.: AnyOut: anytime outlier detection on streaming data. In: The 17th International Conference on DASFAA, pp. 228–242 (2012)

    Google Scholar 

  5. Kontaki, M., Gounarisn, A., Papadopoulos, A.N., Tsichlas, K., Manolopoulos, Y.: Efficient and flexible algorithms for monitoring distance based outliers over data streams. Inf. Syst. 55(3), 37–53 (2016)

    Article  Google Scholar 

  6. Wilkes, J.: googleclusterdata. https://github.com/google/cluster-data (2013)

  7. Zhang, Q., Cheng, L., Boutaba, R.: Cloud computing: state-of-the-art and research challenges. J. Internet Serv. Appl. 1(1), 7–18 (2010)

    Article  Google Scholar 

  8. Solaimani, M., Iftekhar, M., Khan, L.: Statistical technique for online anomaly detection using spark over heterogeneous data from multi-source vmware performance data. In: IEEE International Conference on Big Data (2014)

    Google Scholar 

  9. Sauvanaud, C., Silvestre, G., Kaaniche, M., Kanoun, K.: Data stream clustering for online anomaly detection in cloud applications. In: The 11th European Dependable Computing Conference, September 2015

    Google Scholar 

  10. Uddin, M.S., Kuh, A.: Online least-squares one-class support vector machine for outlier detection in power grid data. In: IEEE International Conference on Acoustics, Speech and Signal Processing (2016)

    Google Scholar 

  11. Uddin, M.S., Kuh, A., Weng, Y., Ilic, M.: Online bad data detection using kernel density estimation. In: IEEE Power and Energy Society and General Meeting (2015)

    Google Scholar 

  12. Kale, A., Ingle, M.D.: SVM based feature extraction for novel class detection from streaming data. Wirel. Personal Commun. J. 110(9) 2015

    Google Scholar 

  13. Pokrajac, D., Lazarevic, A., Latecki, L.J.: Incremental local outlier detection for data streams. In: IEEE Symposium on CIDM, pp. 504–515 (2007)

    Google Scholar 

  14. Karimian, S.H., Kelarestaghi, M., Hashemi, S.: I-inclof: improved incremental local outlier detection for data streams. In: The 16th CSI International Symposium on Artificial Intelligence and Signal Processing (2012)

    Google Scholar 

  15. Cao, L., Yangt, Di., Wang, Q., Yu, Y., Wang, J., Rundensteiner, E.A.: Scalable distance based outlier detection over high-volume data streams. In: The 30th International Conference on Data Engineering (2014)

    Google Scholar 

  16. Lin, F., Le, W., Bo, J.: Research on maximal frequent pattern outlier factor for online high dimensional time-series outlier detection. J. Convergence Inf. Technol. 5(10), 66–71 (2010)

    Article  Google Scholar 

  17. Said, A.M., Dominic, P.D.D., Faye, L.: Data stream outlier detection approach based on frequent pattern mining technique. Int. J. Bus. Inf. Syst. 20(1), 55–70 (2015)

    Google Scholar 

  18. Bhaduri, K., Das, K., Matthews, B.L.: Detecting Abnormal Machine Characteristics in Cloud Infrastructures. In: Proceedings of the IEEE 11th International Conference on Data Mining Workshops, pp. 137–144 (2011)

    Google Scholar 

  19. Hawkins, D.M.: Identification of Outliers. Springer, Heidelberg (1980)

    Book  MATH  Google Scholar 

  20. Yogita, Toshniwal, D.: Unsupervised outlier detection in streaming data using weighted clustering. In: The 12th International Conference on ISDA, pp. 160–164 (2012)

    Google Scholar 

  21. Sheng, D., Derrick, K., Walfredo, C.: Characterization and comparison of cloud versus grid workloads. In: IEEE International Conference on Cluster Computing (2012)

    Google Scholar 

  22. Tan, Y., Nguyen, H., Shen, Z., Gu, X., Venkatramani, C., Rajan, D.: Prepare: predictive performance anomaly prevention for virtualized cloud systems. In: The IEEE 32nd International Conference on Distributed Computing Systems (ICDCS) (2012)

    Google Scholar 

  23. Vijayarani, S., Jothi, P.: Detecting outliers in Data streams using Clustering Algorithms. Int. J. Innov. Res. Comput. Commun. Eng. 1(8) (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Imen Souiden or Zaki Brahmi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Souiden, I., Brahmi, Z., Lafi, L. (2017). Data Stream Mining Based-Outlier Prediction for Cloud Computing. In: Jallouli, R., Zaïane, O., Bach Tobji, M., Srarfi Tabbane, R., Nijholt, A. (eds) Digital Economy. Emerging Technologies and Business Innovation. ICDEc 2017. Lecture Notes in Business Information Processing, vol 290. Springer, Cham. https://doi.org/10.1007/978-3-319-62737-3_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-62737-3_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-62736-6

  • Online ISBN: 978-3-319-62737-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics