Abstract
The cloud computing is the dream of computing used as utility that became true. It is currently emerging as a hot topic due to the important services it provides. Ensuring high quality services is a challenging task especially with the considerable increase of the user’s requests coming continuously in real time to the data center servers and consuming its resources. Abnormal users requests may contribute to the system failure. Thus, it’s crucial to detect these abnormalities for further analysis and prediction. To do that, we propose the use of the outlier detection techniques in the context of the data stream mining due to the similarity between the nature of the data streams and the users requests which require analysis and mining in real time. The main contribution of this paper consists of: first, the formulation of the users requests as well as the server state as a stream of data. This data is generated from \(CSG^+\) a cloud stream generator that we extended from CSG [1]. Second, the creation of a framework for the detection of the abnormal users requests in terms of the CPU and memory by using AnyOut and MCOD algoithms implemented within MOA (Massive Online Analysis) (http://moa.cms.waikato.ac.nz/) framework. Third, the comparison between them in this context.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
toumi, H., Brahmi, Z., Ben Arfa, Z.: Server load prediction using stream mining. In: The 31st International Conference on Information Networking (ICOIN) (2017)
Hassen, F.B., Brahmi, Z., Toumi, H.: VM placement algorithm based on recruitment process within ant colonies. In: The 1st International Conference on Digital Economy Emerging Technologies and Business Innovation (2016)
Souiden, I., Brahmi, Z., Toumi, H.: A survey on outlier detection in the context of stream mining: review of existing approaches and recommendations. In: The 16th International Conference on Intelligent Systems Design and Applications, Porto, Portugal (2016)
Assent, I., Kranen, P., Baldauf, C., Seidl, T.: AnyOut: anytime outlier detection on streaming data. In: The 17th International Conference on DASFAA, pp. 228–242 (2012)
Kontaki, M., Gounarisn, A., Papadopoulos, A.N., Tsichlas, K., Manolopoulos, Y.: Efficient and flexible algorithms for monitoring distance based outliers over data streams. Inf. Syst. 55(3), 37–53 (2016)
Wilkes, J.: googleclusterdata. https://github.com/google/cluster-data (2013)
Zhang, Q., Cheng, L., Boutaba, R.: Cloud computing: state-of-the-art and research challenges. J. Internet Serv. Appl. 1(1), 7–18 (2010)
Solaimani, M., Iftekhar, M., Khan, L.: Statistical technique for online anomaly detection using spark over heterogeneous data from multi-source vmware performance data. In: IEEE International Conference on Big Data (2014)
Sauvanaud, C., Silvestre, G., Kaaniche, M., Kanoun, K.: Data stream clustering for online anomaly detection in cloud applications. In: The 11th European Dependable Computing Conference, September 2015
Uddin, M.S., Kuh, A.: Online least-squares one-class support vector machine for outlier detection in power grid data. In: IEEE International Conference on Acoustics, Speech and Signal Processing (2016)
Uddin, M.S., Kuh, A., Weng, Y., Ilic, M.: Online bad data detection using kernel density estimation. In: IEEE Power and Energy Society and General Meeting (2015)
Kale, A., Ingle, M.D.: SVM based feature extraction for novel class detection from streaming data. Wirel. Personal Commun. J. 110(9) 2015
Pokrajac, D., Lazarevic, A., Latecki, L.J.: Incremental local outlier detection for data streams. In: IEEE Symposium on CIDM, pp. 504–515 (2007)
Karimian, S.H., Kelarestaghi, M., Hashemi, S.: I-inclof: improved incremental local outlier detection for data streams. In: The 16th CSI International Symposium on Artificial Intelligence and Signal Processing (2012)
Cao, L., Yangt, Di., Wang, Q., Yu, Y., Wang, J., Rundensteiner, E.A.: Scalable distance based outlier detection over high-volume data streams. In: The 30th International Conference on Data Engineering (2014)
Lin, F., Le, W., Bo, J.: Research on maximal frequent pattern outlier factor for online high dimensional time-series outlier detection. J. Convergence Inf. Technol. 5(10), 66–71 (2010)
Said, A.M., Dominic, P.D.D., Faye, L.: Data stream outlier detection approach based on frequent pattern mining technique. Int. J. Bus. Inf. Syst. 20(1), 55–70 (2015)
Bhaduri, K., Das, K., Matthews, B.L.: Detecting Abnormal Machine Characteristics in Cloud Infrastructures. In: Proceedings of the IEEE 11th International Conference on Data Mining Workshops, pp. 137–144 (2011)
Hawkins, D.M.: Identification of Outliers. Springer, Heidelberg (1980)
Yogita, Toshniwal, D.: Unsupervised outlier detection in streaming data using weighted clustering. In: The 12th International Conference on ISDA, pp. 160–164 (2012)
Sheng, D., Derrick, K., Walfredo, C.: Characterization and comparison of cloud versus grid workloads. In: IEEE International Conference on Cluster Computing (2012)
Tan, Y., Nguyen, H., Shen, Z., Gu, X., Venkatramani, C., Rajan, D.: Prepare: predictive performance anomaly prevention for virtualized cloud systems. In: The IEEE 32nd International Conference on Distributed Computing Systems (ICDCS) (2012)
Vijayarani, S., Jothi, P.: Detecting outliers in Data streams using Clustering Algorithms. Int. J. Innov. Res. Comput. Commun. Eng. 1(8) (2013)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Souiden, I., Brahmi, Z., Lafi, L. (2017). Data Stream Mining Based-Outlier Prediction for Cloud Computing. In: Jallouli, R., Zaïane, O., Bach Tobji, M., Srarfi Tabbane, R., Nijholt, A. (eds) Digital Economy. Emerging Technologies and Business Innovation. ICDEc 2017. Lecture Notes in Business Information Processing, vol 290. Springer, Cham. https://doi.org/10.1007/978-3-319-62737-3_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-62737-3_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-62736-6
Online ISBN: 978-3-319-62737-3
eBook Packages: Computer ScienceComputer Science (R0)