Skip to main content

Majority Voting Algorithm for Diagnosing of Imbalanced Malaria Disease

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computational Vision and Biomechanics ((LNCVB,volume 30))

Abstract

Vector borne diseases like malaria fever is one of the most elevating issues in medical domain. Accurate identification of a patient from the given set of samples and classification becomes one of the challenging task when dealing with imbalanced datasets. Many conventional machine learning and data mining algorithms are shows poor performance to classify skewed distributed data because they are trained very well with the majority class samples only. Proposing an ensemble method called majority voting defined with a set of machine learning algorithms namely decision tree—C4.5, Naive Bayesian and K-Nearest Neighbor (KNN) classifiers. Classification of samples can be done based on the majority voting of classifiers. Experiment results stating that voting ensemble method shows classification accuracy of 95.2% on imbalanced malaria disease data whereas dealing with balanced malaria disease data voting ensembler shows 92.1% of accuracy. Consequently voting shows 100% classification report on precision, Recall and F1-Score on imbalanced malaria disease data sets whereas on balanced malaria disease data voting shows 96% of Precision, Recall and F1-Score metrics.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   44.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD   59.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Bui TQ, Pham HM (2016) Web based GIS for spatial pattern detection: application to malaria incidence in Vietnam. Bui Pham Springer Plus 5(1014):1–14

    Google Scholar 

  2. MacLeod DA, Jones A, Di Giuseppe F, Caminade C, Morse AP (2015) Demonstration of successful malaria forecasts for Botswana using an operational seasonal climate model. Environ Res Lett 10:044005, 1–11 (IOP Publishing)

    Google Scholar 

  3. Rahman MZ, Roytman L, Kadik A, Rosy DA (2015) Environmental data analysis and remote sensing for early detection of dengue and malaria. In: Proceedings of SPIE, vol 9112, pp 1–9

    Google Scholar 

  4. WHO Malaria Report (2016) http://www.who.int/mediacentre/factsheets/fs387/en/

  5. Pengfei J, Chunkai Z, Zhenyu H (2014) A new sampling approach for classification of imbalanced data sets with high density. In: IEEE—BigComp, pp 217–222

    Google Scholar 

  6. Ditzler G, Polikar R (2012) Incremental learning of concept drift from streaming imbalanced data. IEEE Trans Knowl Data Eng, pp 1–30

    Google Scholar 

  7. Nugroho HA, Akbar SA, Murhandarwati EEH (2015) Feature extraction and classification for detection malaria parasites in thin blood smear. In: IEEE 2nd international conference on information technology, computer and electrical engineering (ICITACEE), pp 197–201

    Google Scholar 

  8. Das DK, Maiti AK, Chakraborty C (2015) Automated system for characterization and classification of malaria-infected stages using light microscopic images of thin blood smears. J Microsc 257(3):238–252

    Article  Google Scholar 

  9. Ruiz D, Brun C, Connor SJ, Omumbo JA, Lyon B, Thomson MC (2014) Testing a multi-malaria-model ensemble against 30 years of data in the Kenyan highlands. Malaria J 13:206, 1–14

    Google Scholar 

  10. Smith T, Ross A, Maire N, Chitnis N, Studer A, Hardy D, Brooks A, Penny M, Tanner M (2012) Ensemble modeling of the likely public health impact of pre-erythrocytic malaria vaccine. PLOS Med 9(1):1–20

    Article  Google Scholar 

  11. Pandit P, Anand A (2016, August) Artificial neural networks for detection of malaria in RBCs. ArXiv: 1608.06627)

    Google Scholar 

  12. Bbosa F, Wesonga R, Jehopio P (2016) Clinical malaria diagnosis: rule based Classification statistical prototype. Springer Plus 5:939

    Article  Google Scholar 

  13. Wu C, Wong PJY (2016) Multi-dimensional discrete Halanay inequalities and the global stability of the disease free equilibrium of a discrete delayed malaria model. Adv Differ Equ 2016:113

    Article  MathSciNet  MATH  Google Scholar 

  14. Tsai M-H, Tsai M-H, Yu S-S, Chan Y-K, Jen C-C (2015) Blood smear image based malaria parasite and infected-erythrocyte detection and segmentation. Transactional Processing Systems. J Med Syst 39:118. https://doi.org/10.1007/s10916-015-0280-9

    Article  Google Scholar 

  15. Rahmanti FZ, Ningrum NK, Imania NK, Purnomo MH (2015, November) Plasmodium vivax classification from digitalization microscopic thick blood film using combination of second order statistical feature extraction and K-Nearest Neighbour (K-NN) classifier method. In: IEEE 4th international conference on instrumentation, communications, information technology, and biomedical engineering (ICICI-BME), Bandung, pp 2–3

    Google Scholar 

  16. Charpe KC, Bairagi V (2015) Automated malaria parasite and there stage detection in microscopic blood images. In: IEEE sponsored 9th international conference on intelligent systems and control (ISCO)

    Google Scholar 

  17. Somasekar J, Reddy BE (2015) Segmentation of erythrocytes infected with malaria parasites for the diagnosis using microscopy imaging. Comput Electr Eng, pp 336–351 (Elsevier)

    Google Scholar 

  18. Cameron E, Battle KE, Bhatt S, Weiss DJ, Bisanzio D, Mappin B, Dalrymple U, Hay SI, Smith DL, Griffin JT, Wenger EA, Eckhoff PA, Smith TA, Penny MA, Gething PW (2015) Defining the relationship between infection prevalence and clinical incidence of Plasmodium falciparum malaria. Nat Commun 6:8170, 1–10

    Google Scholar 

  19. Krawczyk B (2016) Learning from imbalanced data: open challenges and future directions. Prog Artif Intell, pp 1–12

    Google Scholar 

  20. Deng X, Zhong W, Ren J, Zeng D, Zhang H (2016) An imbalanced data classification method based on automatic clustering under-sampling. IEEE Trans, pp 1–8

    Google Scholar 

  21. Ali A, Shamsuddin SM, Ralescu AL (2013) Classification with class imbalance problem: a review. Int J Adv Soft Comput Appl 5(3):1–30

    Google Scholar 

  22. Poolsawad N, Kambhampati C, Cleland JGF (2014) Balancing class for performance of classification with a clinical dataset. In: Proceedings of the World Congress on engineering, vol 1, pp 1–6

    Google Scholar 

  23. Rahman MM, Davis DN (2013) Addressing the class imbalance problem in medical datasets. Int J Mach Learn Comput 3(2):224–228

    Article  Google Scholar 

  24. Haixiang G, Yijing L, Shang J, Mingyun G, Yuanyue H, Bing G (2016) Learning from class-imbalanced data: review of methods and applications. Expert Syst Appl, pp 1–49

    Google Scholar 

  25. Jamal S, Periwal V, Scaria V (2013) Predictive modeling of anti-malarial molecules inhibiting apicoplast formation. BMC Bioinform 14:55, 1–8

    Google Scholar 

  26. Andrade BB, Reis-Filho A, Souza-Neto SM, Clarencio J, Carmargo LMA, Barral A, Barral-Netto M (2010) Severe Plasmodium vivax malaria exhibits marked inflammatory imbalance. Malaria J 9:13, 1–8

    Google Scholar 

  27. Dubey R, Zhou J, Wanga Y, Thompson PM, Ye J (2014) Analysis of sampling techniques for imbalanced data: An n = 648 ADNI study. Elsevier Neuro Image 87:220–241

    Google Scholar 

  28. Ng WWY, Hu J, Yeung DS, Yin S, Roli F (2015) Diversified sensitivity-based under sampling for imbalance classification problems. IEEE Trans Cybern, pp 1–11

    Google Scholar 

  29. Roumani YF, May JH, Strum DP, Vargas LG (2013) Classifying highly imbalanced ICU data. Health care Manag Sci 16:119–128

    Article  Google Scholar 

  30. Pengfei J, Chunkai Z, Zhenyu H (2014) A new sampling approach for classification of imbalanced data sets with high density. In: IEEE transaction, pp 217–222

    Google Scholar 

  31. Garcia V, Sanchez JS, Mollineda RA (2012) On the effectiveness of preprocessing methods when dealing with different levels of class imbalance. Knowl Based Syst 25:13–21 (Elsevier)

    Article  Google Scholar 

  32. Thongkam J, Xu G, Zhang Y, Huang F (2009) Toward breast cancer survivability prediction model through improving training space. Expert Syst Appl 36:12200–12209 (Elsevier)

    Article  Google Scholar 

  33. Zhao X-M, Li X, Chen L, Aihara K (2007) Protein classification with imbalanced data. Wiley InterSci 70:125–1132

    Google Scholar 

  34. López V, Fernandez A, Garcia S, Palade V, Herrera F (2013) An insight into classification with imbalanced data: empirical results and current trends on using data intrinsic characteristics. Inf Sci 250:113–141 (Elsevier)

    Article  Google Scholar 

  35. Ma L, Fan S (2017) CURE-SMOTE algorithm and hybrid algorithm for feature selection and parameter optimization based on random forests. BMC Bioinform 18:169

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to T. Sajana .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sajana, T., Narasingarao, M.R. (2019). Majority Voting Algorithm for Diagnosing of Imbalanced Malaria Disease. In: Pandian, D., Fernando, X., Baig, Z., Shi, F. (eds) Proceedings of the International Conference on ISMAC in Computational Vision and Bio-Engineering 2018 (ISMAC-CVB). ISMAC 2018. Lecture Notes in Computational Vision and Biomechanics, vol 30. Springer, Cham. https://doi.org/10.1007/978-3-030-00665-5_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-00665-5_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-00664-8

  • Online ISBN: 978-3-030-00665-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics