Cognitive Informatics and Soft Computing pp 299-306 | Cite as
A Survey: Classification of Big Data
Abstract
In the current decades large data sets are mostly available from the source, extraction and analysis of data is an interesting and challenging task. Big Data relate to expansive bulk size, developing datasets that are intricate and have numerous self-ruling spring. Prior advances were not ready to deal with capacity and handling of enormous dataset in this manner Big Data idea appears. This is a monotonous employment for clients to distinguish precise data from enormous unstructured data. Along these lines, there ought to be some system which characterize unstructured data into sorted out shape which causes client to effectively get to required data. Arrangement systems over big value-based database give expected dataset to the clients from huge datasets further straightforward way. There are two primary arrangement procedures, administered and unsupervised. In this paper we concentrated on to investigation of various administered characterization methods. Encourage this paper demonstrates use of every system and their points of interest and confinements.
Keywords
Big data Classification Structured UnstructuredReferences
- 1.Wu, X., Zhu, X., Wu, G., Ding, W.: Data mining with big data. IEEE Trans. Knowl. Data Eng. 26(1), 1041–4347 (2014)Google Scholar
- 2.Wang, D., Liu, X., Wang M.: A DT-SVM strategy for stock futures prediction with big data. In: IEEE International Conference on Computational Science and Engineering. 978-0-76955096, pp. 1/13 (2013)Google Scholar
- 3.Ghemawat, S., Gobioff, H., Leung, S.-T. The Google file system. In: ACM SIGOPS Operating Systems Review, ACM, vol. 37, pp 29–43 (2003)Google Scholar
- 4.Dean, J., Ghemawat, S.: Mapreduce: simplified data processing on large clusters. Commun. ACM 51(1), 107–113 (2008)CrossRefGoogle Scholar
- 5.Kesavaraj, G., Sukumaran, S.: A study on classification techniques in data mining. In: IEEE ICCCNT, pp. 4–6 (2013)Google Scholar
- 6.Suthaharan, S.: Big Data Classification: Problems and Challenges in Network Intrusion Prediction with Machine Learning. University of North Carolina at Greensboro, USA (2012)Google Scholar
- 7.Kotsiantis, S.B.: Supervised Machine Learning: A Review of Classification Techniques. vol. 31, pp. 249–268, Informatica, USA (2007)Google Scholar
- 8.Yu, H., Yang, J., Han, J.: Classifying large data sets using SVMs with hierarchical clusters. In: SIGKDD ’03 Washington, DC, 1581137370/03/0008 (2003)Google Scholar
- 9.Piao, Y., Park, H.W., Jin, C.H., Ryu, K.H.: Ensemble Method for Classification of High-Dimensional Data. IEEE. 978-1-4799-3919-0/14 (2014)Google Scholar
- 10.Yenkar, V., Bartere, M.: Review on data mining with big data. Int. J. Comput. Sci. Mob. Comput. 3(4), 97–102 (2014)Google Scholar
- 11.Mohammed, G.H., Zamil, A.L.: The application of semantic-based classification on big data. In: IEEE International Conference on Information and Communication Systems (ICICS). 978-1-4799-3023 4/14 (2014)Google Scholar
- 12.Dai, W., Ji, W.: A map reduce implementation of C4. 5 decision tree algorithm. Int. J. Database Theor. Appl. SERSC 7(1), 49–60 (2014)Google Scholar
- 13.Mohanty, M.N., Kumar, A., Routray, A., Kabisatpathy, P.: Evolutionary algorithm based optimization for PQ disturbances classification using SVM. Int. J. Control Autom. Syst. 8(6), 1306–1312 (2010)Google Scholar
- 14.Jeyakumar, V., Li, G., Suthaharan, S.: Support vector machine classifiers with uncertain knowledge sets via robust convex optimization. Optim. J. Math. Program. Oper. Res. 1–18 (2012). https://doi.org/10.1080/02331934.2012.703667