Abstract
Diabetes is one of the fastest growing chronic life threatening diseases that have already affected 422 million people worldwide according to the report of World Health Organization (WHO), in 2018. Due to the presence of a relatively long asymptomatic phase, early detection of diabetes is always desired for a clinically meaningful outcome. Around 50% of all people suffering from diabetes are undiagnosed because of its long-term asymptomatic phase. The early diagnosis of diabetes is only possible by proper assessment of both common and less common sign symptoms, which could be found in different phases from disease initiation up to diagnosis. Data mining classification techniques have been well accepted by researchers for risk prediction model of the disease. To predict the likelihood of having diabetes requires a dataset, which contains the data of newly diabetic or would be diabetic patient. In this work, we have used such a dataset of 520 instances, which has been collected using direct questionnaires from the patients of Sylhet Diabetes Hospital in Sylhet, Bangladesh. We have analyzed the dataset with Naive Bayes Algorithm, Logistic Regression Algorithm, and Random Forest Algorithm and after applying tenfold Cross- Validation and Percentage Split evaluation techniques, Random forest has been found having best accuracy on this dataset. Finally, a commonly accessible, user-friendly tool for the end user to check the risk of having diabetes from assessing the symptoms and useful tips to control over the risk factors has been proposed.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
The 6 Different Types of Diabetes: (5 Mar 2018). The diabetic journey. https://thediabeticjourney.com/the-6-different-types-of-diabetes
Statistics About Diabetes: American Diabetes Association, 22 Mar 2018. https://www.diabetes.org
Diabetes, World Health Organization (WHO): 30 Oct 2018. https://www.who.int/news-room/fact-sheets/detail/diabetes
Failure to detect type 2 diabetes early costing \$700 million per year, Diabetes Australia, 8 July 2018. https://www.diabetesaustralia.com.au
Harris, M.I., et al.: Onset of NIDDM occurs at least 4–7 yr before clinical diagnosis. Diabetes Care 15(7), 815–819 (1992)
Akter, S., et al.: Prevalence of diabetes and prediabetes and their risk factors among Bangladeshi adults: a nationwide survey. Bull. World Health Organ. 92, 204–213A (2014)
Ramachandran, A.: Know the signs and symptoms of diabetes. Indian J. Med. Res. 140(5), 579 (2014)
Kumar, V., Valide, L.: A data mining approach for prediction and treatment of diabetes disease. Int. J. Sci. Invent. Today (2014). ISSN 2319-5436
Agrawal, P., Dewangan, A.: A brief survey on the techniques used for the diagnosis of diabetes-mellitus. Int. Res. J. Eng. Technol. (IRJET). 02(03) (2015). e-ISSN: 2395-0056; p-ISSN: 2395-0072
Joshi, T.N. Chawan, P.M.: Diabetes prediction using machine learning techniques. Dewangan, S. et.al. Int. J. Eng. Res. Appl. (Part -II) 8(1), 09–13 (2018). ISSN: 2248-9622
Sapon, M.A., Ismail, K., Zainudin, S.: Prediction of diabetes by using artificial neural network. In: 2011 International Conference on Circuits, System and Simulation IPCSIT, vol. 7. IACSIT Press, Singapore (2011)
Asir, A.G., Singh, E.J., Leavline, Baig, B.S.: Diabetes prediction using medical data. J. Comput. Intell. Bioinform. 10(1), 1–8 (2017)
Ahmed: Developing a predicted model for diabetes type 2 treatment plans by using data mining (2016b)
Rabina1, Er. Anshu Chopra2: Diabetes prediction by supervised and unsupervised learning with feature selection, 2(5). ISSN: 2454-132
Mishra, V., Samuel, C., Sharma, S.K.: Use of machine learning to predict the onset of diabetes. Int. J. Recent Adv. Mech. Eng. (IJMECH) 4(2) (2015)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Ethics declarations
Ethical Approval
All procedures performed in studies involving human were in accordance with the ethical standards of the institution at which the studies were conducted and ethical approval was obtained from Sylhet Diabetic Hospital, Sylhet Bangladesh. Ref: S.D.A/88
Informed Consent
Informed consent was obtained from all individual participants included in the study.
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Islam, M.M.F., Ferdousi, R., Rahman, S., Bushra, H.Y. (2020). Likelihood Prediction of Diabetes at Early Stage Using Data Mining Techniques. In: Gupta, M., Konar, D., Bhattacharyya, S., Biswas, S. (eds) Computer Vision and Machine Intelligence in Medical Image Analysis. Advances in Intelligent Systems and Computing, vol 992. Springer, Singapore. https://doi.org/10.1007/978-981-13-8798-2_12
Download citation
DOI: https://doi.org/10.1007/978-981-13-8798-2_12
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-8797-5
Online ISBN: 978-981-13-8798-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)