Big data analytic diabetics using map reduce and classification techniques

  • Ahmad Ali AlZubi


Diabetes is more severe in women, according to various medical reports and surveys. Sometimes diabetes is difficult to identify due to various common symptoms, such as headache, fatigue, slow healing of cuts and blurry vision. Thus, this paper introduces novel big data and classification techniques such as effective map reducing technologies are used to recognize the diabetes. Initially, the data were collected from a large dataset, and the map reducing concept is applied to compose the small chunk of data efficiently. Following this process, the noise present in the collected dataset is removed using the normalization process. After that, the statistical features are selected using the ant bee colony approach that uses the ant characteristics such as wandering. The selected features are trained with the help of the support vector machine with multilayer neural network. The trained or learned features are efficiently classified using the associated neural network, and the efficiency of the system is evaluated with the help of experimental results in terms of error rate, sensitivity, specificity and accuracy.


Big data Map reduce Classification Diabetes Diabetics Ant bee colony SVM-trained multilayer neural network Associated neural network 



This project was supported by King Saud University, Deanship of Scientific Research, Community College Research Unit.


  1. 1.
    Pugoy RA, Mariano V (2011) Automated rice leaf disease detection using shape image analysis. In: 3rd International Conference on Digital Image Processing (ICDIP 2011), Chengdu, China, 15–17 April 2011Google Scholar
  2. 2.
    Orillo JW et al (2014) Identification of diseases in rice plant (Oryza sativa) using back propagation artificial neural network. IEEE.
  3. 3.
    Revenaz A, Ruggeri M, Martelli M (2010) Wireless communication protocol for agricultural machines synchronization and fleet management. In: Proc. IEEE Intl Symp Industrial Electronics, Bari, Italy, 04–07 Jul. 2010, pp 3498–3504Google Scholar
  4. 4.
    Abdul Aziz ID et al (2009) Remote monitoring in agricultural greenhouse using wireless sensor and short message service (SMS). Intl J Eng Technol 9(9):35–43Google Scholar
  5. 5.
    Nambiar R, Bhardwaj R, Sethi A, Vargheese R (2013) A look at challenges and opportunities of big data analytics in healthcare, Big data. In: 2013 IEEE International Conference on 17–22Google Scholar
  6. 6.
    Makandar A, Patrot A (2015) Computation pre-processing techniques for image restoration. Int J Comput Appl 113(4):11–17Google Scholar
  7. 7.
    Shyni S, Shantha Mary Joshitta R, Arockiam L (2016) Applications of big data analytics for diagnosing diabetic mellitus: issues and challenges. Int J Recent Trends Eng Res (IJRTER) 02(06):454–461Google Scholar
  8. 8.
    Viceconti M, Hunter P, Hose R (2015) Big data, big knowledge: big data for personalized healthcare. IEEE J Biomed Health Inform 19(4):1209–1215CrossRefGoogle Scholar
  9. 9.
    Yogamangalam R, Karthikeyan B (2013) Segmentation techniques comparison in image processing. Int J Eng Technol 5(1):307–313Google Scholar
  10. 10., cucumber, tomato, cotton. Accessed 12 Feb 2017
  11. 11.
  12. 12.
    Yogamangalam R (2013) Segmentation techniques comparison in image processing. Int J Eng Technol IJET 5(1):307–313Google Scholar
  13. 13.
  14. 14.
  15. 15. Accessed 11 Mar 2017
  16. 16.
    Francis J et al (2016) Identification of leaf diseases in pepper plants using soft computing techniques. In: Conference on emerging devices and smart systems in IEEEGoogle Scholar
  17. 17.
  18. 18.
    Lin C, Liu K, Chen M (2005) Dual clustering: integrating data clustering over optimization and constraint domains. IEEE 17(5):628–637Google Scholar
  19. 19.
    Chen Y et al (2016) Deep feature extraction and classification of hyperspectral images based on convolutional neural networks. IEEE Trans Geosci Remote Sens 54(10):6232–6251CrossRefGoogle Scholar
  20. 20.
    Pushpavalli R, Sivarajde G (2013) Image enhancement using adaptive neuro-fuzzy inference system. Int J Sci Technol Res 2(6):256–262Google Scholar
  21. 21.
    William J, Dela Cruz J, Agapito L (2014) Identification of diseases in rice plant (Oryza Sativa) using back propagation artificial neural network. In: 7th IEEE International Conference Humanoid, Nanotechnology, Information Technology Communication and Control, Environment and Management (HNICEM), IEEEGoogle Scholar
  22. 22.
    Prasad S, Peddoju SK, Ghosh D (2014) Energy efficient mobile vision system for plant leaf disease identification, IEEE.
  23. 23.
    Khirade SD, Patil AB (2015) Plant disease detection using image processing. In: 2015 International Conference on Computing Communication Control and Automation, IEEEGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Computer Science Department, Community CollegeKing Saud UniversityRiyadhSaudi Arabia

Personalised recommendations