The Effect of Vitamin B12 Deficiency on Blood Count Using Data Mining

  • Nada Almugren
  • Nafla Alrumayyan
  • Rabiah Alnashwan
  • Abeer Alfutamani
  • Isra Al-Turaiki
  • Omar Almugren
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 753)


Healthcare systems create vast amount of data collected from medical examination. Data mining techniques are widely used in healthcare systems to detect diseases in early stages. In this paper, we applied four data mining techniques to find the relation between vitamin B12 levels and blood cell count. Four data mining techniques were applied to real patients’ dataset: Neural Networks (MLP), Naïve Bayes, J48, and JRip. The resulting models were evaluated using the real datasets obtained from King Khalid University Hospital (KKUH), Riyadh, Saudi Arabia. Experimental results showed that both MLP and JRip techniques were capable of classifying the dataset correctly regardless of the size of the dataset.


Healthcare Data mining Data mining techniques Neural networks Naïve Bayesian J48 JRip Vitamin B12 


  1. 1.
    Haykin, S.: Neural Networks: A Comprehensive Foundation, 1st edn. Prentice Hall PTR, Upper Saddle River (1994)zbMATHGoogle Scholar
  2. 2.
    Cichosz, P.: Naïve Bayes classifier. In: Data Mining Algorithms, pp. 118–133. Wiley (2015)Google Scholar
  3. 3.
    Patil, T., Sherekar, S.: Performance analysis of Naive Bayes and J48 classification algorithm for data classification. TechRepublic. Accessed 07 Dec 2017
  4. 4.
    Rajput, A., Aharwal, R., Dubey, M., Raghuvanshi, M.: J48 and JRIP rules for E-governance data. Accessed 07 Dec 2017
  5. 5.
    Saichanma, S., Chulsomlee, S., Thangrua, N., Pongsuchart, P., Sanmun, D.: The observation report of red blood cell morphology in thailand teenager by using data mining technique. In: Advances in Hematology (2014)Google Scholar
  6. 6.
    Abdullah, M., Al-Asmari, S.: Anemia types prediction based on data mining classification algorithms, November 2016Google Scholar
  7. 7.
    Sanap, S.A., Nagori, M., Kshirsagar, V.: Classification of anemia using data mining techniques. In: Swarm, Evolutionary, and Memetic Computing, pp. 113–121 (2011)Google Scholar
  8. 8.
    Vijayarani, S., Sudha, S.: An efficient clustering algorithm for predicting diseases from hemogram blood test samples. Indian J. Sci. Technol. 8(17) (2015)Google Scholar
  9. 9.
    Jain, A.K.: Data clustering: 50 years beyond K-means. Pattern Recognit. Lett. 31(8), 651–666 (2010)CrossRefGoogle Scholar
  10. 10.
    Ferraz, A., Brito, J.H., Carvalho, V., Machado, J.: Blood type classification using computer vision and machine learning. Neural Comput. Appl. 28(8), 2029–2040 (2017)CrossRefGoogle Scholar
  11. 11.
    Alshami, I.: Automated diagnosis of thalassemia based on datamining classifiers. In: International Conference Information Application, ICIA 2012, pp. 440–445 (2012)Google Scholar
  12. 12.
    Dinakaran, K., Preethi, R.: A novel approach to uncover the patient blood related diseases using data mining techniques. J. Med. Sci. 13(2), 95–102 (2013)CrossRefGoogle Scholar
  13. 13.
    Minnie, D., Srinivasan, S.: Clustering the preprocessed automated blood cell counter data using modified K-means algorithms and generation of association rules. Int. J. Comput. Appl. 52, 38–42 (2012)Google Scholar
  14. 14.
    El-Halees, A., Shurrab, A.: Blood tumor prediction using data mining techniques. In: Hematology Diseases, Blood Tumor, Rule Induction, Association Rules, Deep Learning, vol. 6, pp. 23–30 (2017)Google Scholar
  15. 15.
    Matsuoka, K., Yokoyama, S., Watanabe, K., Tsumoto, S.: Data mining analysis of relationship between blood stream infection and clinical background in patients undergoing lactobacillus therapy, vol. 6, pp. 1940–1945. Springer (2007)Google Scholar
  16. 16.
    Soley-Bori, M.: Dealing with missing data: key assumptions and methods for applied analysis. School of Public Health Department of Health Policy & Management, Boston University, Technical report 4, May 2013Google Scholar
  17. 17.
    Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. ArXiv11061813 Cs, June 2011Google Scholar
  18. 18.
    Neukart, F., Grigorescu, C.M., Moraru, S.A.: High order computational intelligence in data mining a generic approach to systemic intelligent data mining. In: 2011 6th Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp. 1–9 (2011)Google Scholar
  19. 19.
    Dimitoglou, G., Adams, J.A., Jim, C.M.: Comparison of the C4.5 and a Naive Bayes classifier for the prediction of lung cancer survivability. ArXiv12061121 Cs, June 2012Google Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. 1.King Saud UniversityRiyadhSaudi Arabia

Personalised recommendations