A Survey: Classification of Big Data

  • Dinesh Kumar
  • Mihir Narayan MohantyEmail author
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 768)


In the current decades large data sets are mostly available from the source, extraction and analysis of data is an interesting and challenging task. Big Data relate to expansive bulk size, developing datasets that are intricate and have numerous self-ruling spring. Prior advances were not ready to deal with capacity and handling of enormous dataset in this manner Big Data idea appears. This is a monotonous employment for clients to distinguish precise data from enormous unstructured data. Along these lines, there ought to be some system which characterize unstructured data into sorted out shape which causes client to effectively get to required data. Arrangement systems over big value-based database give expected dataset to the clients from huge datasets further straightforward way. There are two primary arrangement procedures, administered and unsupervised. In this paper we concentrated on to investigation of various administered characterization methods. Encourage this paper demonstrates use of every system and their points of interest and confinements.


Big data Classification Structured Unstructured 


  1. 1.
    Wu, X., Zhu, X., Wu, G., Ding, W.: Data mining with big data. IEEE Trans. Knowl. Data Eng. 26(1), 1041–4347 (2014)Google Scholar
  2. 2.
    Wang, D., Liu, X., Wang M.: A DT-SVM strategy for stock futures prediction with big data. In: IEEE International Conference on Computational Science and Engineering. 978-0-76955096, pp. 1/13 (2013)Google Scholar
  3. 3.
    Ghemawat, S., Gobioff, H., Leung, S.-T. The Google file system. In: ACM SIGOPS Operating Systems Review, ACM, vol. 37, pp 29–43 (2003)Google Scholar
  4. 4.
    Dean, J., Ghemawat, S.: Mapreduce: simplified data processing on large clusters. Commun. ACM 51(1), 107–113 (2008)CrossRefGoogle Scholar
  5. 5.
    Kesavaraj, G., Sukumaran, S.: A study on classification techniques in data mining. In: IEEE ICCCNT, pp. 4–6 (2013)Google Scholar
  6. 6.
    Suthaharan, S.: Big Data Classification: Problems and Challenges in Network Intrusion Prediction with Machine Learning. University of North Carolina at Greensboro, USA (2012)Google Scholar
  7. 7.
    Kotsiantis, S.B.: Supervised Machine Learning: A Review of Classification Techniques. vol. 31, pp. 249–268, Informatica, USA (2007)Google Scholar
  8. 8.
    Yu, H., Yang, J., Han, J.: Classifying large data sets using SVMs with hierarchical clusters. In: SIGKDD ’03 Washington, DC, 1581137370/03/0008 (2003)Google Scholar
  9. 9.
    Piao, Y., Park, H.W., Jin, C.H., Ryu, K.H.: Ensemble Method for Classification of High-Dimensional Data. IEEE. 978-1-4799-3919-0/14 (2014)Google Scholar
  10. 10.
    Yenkar, V., Bartere, M.: Review on data mining with big data. Int. J. Comput. Sci. Mob. Comput. 3(4), 97–102 (2014)Google Scholar
  11. 11.
    Mohammed, G.H., Zamil, A.L.: The application of semantic-based classification on big data. In: IEEE International Conference on Information and Communication Systems (ICICS). 978-1-4799-3023 4/14 (2014)Google Scholar
  12. 12.
    Dai, W., Ji, W.: A map reduce implementation of C4. 5 decision tree algorithm. Int. J. Database Theor. Appl. SERSC 7(1), 49–60 (2014)Google Scholar
  13. 13.
    Mohanty, M.N., Kumar, A., Routray, A., Kabisatpathy, P.: Evolutionary algorithm based optimization for PQ disturbances classification using SVM. Int. J. Control Autom. Syst. 8(6), 1306–1312 (2010)Google Scholar
  14. 14.
    Jeyakumar, V., Li, G., Suthaharan, S.: Support vector machine classifiers with uncertain knowledge sets via robust convex optimization. Optim. J. Math. Program. Oper. Res. 1–18 (2012).

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  1. 1.Department of CSESri Sai College of Engineering and TechnologyBadhani, PathankotIndia
  2. 2.Department of Electronics and Communication EngineeringSiksha ‘O’ Anusandhan UniversityBhubaneswarIndia

Personalised recommendations