Skip to main content

Likelihood Prediction of Diabetes at Early Stage Using Data Mining Techniques

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 992))

Abstract

Diabetes is one of the fastest growing chronic life threatening diseases that have already affected 422 million people worldwide according to the report of World Health Organization (WHO), in 2018. Due to the presence of a relatively long asymptomatic phase, early detection of diabetes is always desired for a clinically meaningful outcome. Around 50% of all people suffering from diabetes are undiagnosed because of its long-term asymptomatic phase. The early diagnosis of diabetes is only possible by proper assessment of both common and less common sign symptoms, which could be found in different phases from disease initiation up to diagnosis. Data mining classification techniques have been well accepted by researchers for risk prediction model of the disease. To predict the likelihood of having diabetes requires a dataset, which contains the data of newly diabetic or would be diabetic patient. In this work, we have used such a dataset of 520 instances, which has been collected using direct questionnaires from the patients of Sylhet Diabetes Hospital in Sylhet, Bangladesh. We have analyzed the dataset with Naive Bayes Algorithm, Logistic Regression Algorithm, and Random Forest Algorithm and after applying tenfold Cross- Validation and Percentage Split evaluation techniques, Random forest has been found having best accuracy on this dataset. Finally, a commonly accessible, user-friendly tool for the end user to check the risk of having diabetes from assessing the symptoms and useful tips to control over the risk factors has been proposed.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. The 6 Different Types of Diabetes: (5 Mar 2018). The diabetic journey. https://thediabeticjourney.com/the-6-different-types-of-diabetes

  2. Statistics About Diabetes: American Diabetes Association, 22 Mar 2018. https://www.diabetes.org

  3. Diabetes, World Health Organization (WHO): 30 Oct 2018. https://www.who.int/news-room/fact-sheets/detail/diabetes

  4. Failure to detect type 2 diabetes early costing \$700 million per year, Diabetes Australia, 8 July 2018. https://www.diabetesaustralia.com.au

  5. Harris, M.I., et al.: Onset of NIDDM occurs at least 4–7 yr before clinical diagnosis. Diabetes Care 15(7), 815–819 (1992)

    Article  Google Scholar 

  6. Akter, S., et al.: Prevalence of diabetes and prediabetes and their risk factors among Bangladeshi adults: a nationwide survey. Bull. World Health Organ. 92, 204–213A (2014)

    Article  Google Scholar 

  7. Ramachandran, A.: Know the signs and symptoms of diabetes. Indian J. Med. Res. 140(5), 579 (2014)

    Google Scholar 

  8. Kumar, V., Valide, L.: A data mining approach for prediction and treatment of diabetes disease. Int. J. Sci. Invent. Today (2014). ISSN 2319-5436

    Google Scholar 

  9. Agrawal, P., Dewangan, A.: A brief survey on the techniques used for the diagnosis of diabetes-mellitus. Int. Res. J. Eng. Technol. (IRJET). 02(03) (2015). e-ISSN: 2395-0056; p-ISSN: 2395-0072

    Google Scholar 

  10. Joshi, T.N. Chawan, P.M.: Diabetes prediction using machine learning techniques. Dewangan, S. et.al. Int. J. Eng. Res. Appl. (Part -II) 8(1), 09–13 (2018). ISSN: 2248-9622

    Google Scholar 

  11. Sapon, M.A., Ismail, K., Zainudin, S.: Prediction of diabetes by using artificial neural network. In: 2011 International Conference on Circuits, System and Simulation IPCSIT, vol. 7. IACSIT Press, Singapore (2011)

    Google Scholar 

  12. Asir, A.G., Singh, E.J., Leavline, Baig, B.S.: Diabetes prediction using medical data. J. Comput. Intell. Bioinform. 10(1), 1–8 (2017)

    Google Scholar 

  13. Ahmed: Developing a predicted model for diabetes type 2 treatment plans by using data mining (2016b)

    Google Scholar 

  14. Rabina1, Er. Anshu Chopra2: Diabetes prediction by supervised and unsupervised learning with feature selection, 2(5). ISSN: 2454-132

    Google Scholar 

  15. Mishra, V., Samuel, C., Sharma, S.K.: Use of machine learning to predict the onset of diabetes. Int. J. Recent Adv. Mech. Eng. (IJMECH) 4(2) (2015)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to M. M. Faniqul Islam or Rahatara Ferdousi .

Editor information

Editors and Affiliations

Ethics declarations

Ethical Approval

All procedures performed in studies involving human were in accordance with the ethical standards of the institution at which the studies were conducted and ethical approval was obtained from Sylhet Diabetic Hospital, Sylhet Bangladesh. Ref: S.D.A/88

Informed Consent

Informed consent was obtained from all individual participants included in the study.

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Islam, M.M.F., Ferdousi, R., Rahman, S., Bushra, H.Y. (2020). Likelihood Prediction of Diabetes at Early Stage Using Data Mining Techniques. In: Gupta, M., Konar, D., Bhattacharyya, S., Biswas, S. (eds) Computer Vision and Machine Intelligence in Medical Image Analysis. Advances in Intelligent Systems and Computing, vol 992. Springer, Singapore. https://doi.org/10.1007/978-981-13-8798-2_12

Download citation

Publish with us

Policies and ethics