Predictive Analysis and Prognostic Approach of Diabetes Prediction with Machine Learning Techniques


Medical experts indulge in numerous strategies for efficient and predictive measures to model the health status of patients and formulate the patterns that are formed in test results. Most patients would dream of their betterments of their health conditions and thus preventing the progression of any disease. When diabetics is considered in the model, or highly intervening methodology would be required for pre-diabetic individuals. Hidden Markov models have been modified into variant models to derive predictions that accurately produce expected results by investigating patterns of clinical observations from a detailed sample of patient’s dataset. There are yet unanswered and concerning challenges to derive an absolute model for predicting diabetes. The datasets from which the patterns are derived from, still holds levels of in completeness, irregularity and obvious clinical interventions during the diagnosis. The Electronic Medical Records are not furnished with all requisite information in all conditions and scenarios. Due to these irregularities prediction has become highly challenging and there is increase in misclassification rate. Newton’s Divide Difference Method (NDDM) is a conventional model for filling the irregularity in electronic datasets through divided differences. The classical approach considers a polynomial approximation approach, thus leading to Runge Phenomenon. If the interval between data fields id higher, severity of finding the irregularities is even higher. By using this type of technique it helps in improving the accuracy thereby bringing in high level prediction without any error and misclassification. In this technique proposed, a novel approximation technique is implemented using the Euclidean distance parameter over the NDDM approximation to predict the outcomes or risk of Type 2 Diabetes Mellitus among patients. Real world entities in CPCSSN are considered for this study and proposed method is tested. The proposed method filled the irregularity in the data components of EMR with better approximations and the quality of prediction has improved significantly.

This is a preview of subscription content, access via your institution.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5


  1. 1.

    American Diabetes Association. (2004). Diagnosis and classification of diabetes mellitus. Diabetes Care, 27(1), S5.

    Google Scholar 

  2. 2.

    Arulananth, T. S., Balaji, L., Baskar, M., et al. (2020). PCA Based Dimensional Data Reduction and Segmentation for DICOM Images. Neural Processing Letters.

    Article  Google Scholar 

  3. 3.

    Y. Y. Liu, S. Li, F. Li, L. Song and J. M. Rehg (2015) ``Efficient learning of continuous-time hidden Markov models for disease progression,'' in Proc.Adv. Neural Inf. Process. Syst. 21; 3600-3608.

  4. 4.

    Alberti, K. G. M. M., Zimmet, P. Z., & nition, De. (1998). Diagnosis and classification of diabetes mellitus and its complications” Part 1: Diagnosis and classification of diabetes mellitus Provisional report of a WHO consultation. Diabetic Med., 15(7), 539–553.

    Article  Google Scholar 

  5. 5.

    Ekhlaspour, L., Mondesir, D., Lautsch, N., Balliro, C., Hillard, M., Magyar, K., et al. (2017). Comparative accuracy of 17 point-of-care glucose meters. J. Diabetes Sci. Technol., 11(3), 558–566.

    Article  Google Scholar 

  6. 6.

    Perveen, S., Shahbaz, M., Keshavjee, K., & Guergachi, A. (2019). Metabolic syndrome and development of diabetes mellitus: Predictive modelling based on machine learning techniques. IEEE Access., 7, 1365–1375.

    Article  Google Scholar 

  7. 7.

    Cho, N., Shaw, J., Karuranga, S., Huang, Y., Fernandes, J. D. R., Ohlrogge, A., & Malanda, B. (2018). IDF diabetes Atlas: Global estimates of diabetes prevalence for 2017 and projections for 2045. Diabetes Res. Clin. Pract., 2(138), 271–281.

    Article  Google Scholar 

  8. 8.

    Li, Y., Swift, S., & Tucker, A. (2013). Modelling and analysing the dynamics of disease progression from cross-sectional studies. J. Biomedical Information., 46(2), 266–274.

    Article  Google Scholar 

  9. 9.

    Saraçoglu, R. D. (2012). Hidden Markov model-based classification of heart valve disease with PCA for dimension reduction. Eng. Appl. Artificial Intelligence., 25(7), 1523–1528.

    Article  Google Scholar 

  10. 10.

    Ramkumar, J., Baskar, M., Viswak, M., & Ashish, M. D. (2020). Smart Shopping with Integrated Secure System based on IoT. International Journal of Advanced Science and Technology., 29(5), 301–312.

    Google Scholar 

  11. 11.

    Suchithra, M., Baskar, M., Ramkumar, J. P., & Kalyanasundaram, B. (2020). Amutha, “Invariant packet feature with network conditions for efficient low rate attack detection in multimedia networks for improved QoS.” J Ambient Intell Human Comput.

    Article  Google Scholar 

  12. 12.

    Baskar, M., Ramkumar, J., Karthikeyan, C., et al. (2021). Low rate DDoS mitigation using real-time multi threshold traffic monitoring system. J Ambient Intell Human Comput.

    Article  Google Scholar 

  13. 13.

    Mhetre, N. A., Deshpande, A. V., & Mahalle, P. N. (2016). Trust management model based on fuzzy approach for ubiquitous computing. Int. J. Ambient Computing Intelligence., 7(2), 33–46.

    Article  Google Scholar 

  14. 14.

    El, M., Nahas, S. Kassim., & Shikoun, N. (2012). Profile hidden Markov model for detection and prediction of hepatitis C virus mutation. Int. JournalComput. Science. Issues., 9(5), 251.

    Google Scholar 

  15. 15.

    Dong, X., Chen, S., & Pan, S. J. (2017). “Learning to prune deep neural networks via layer-wise optimal brain surgeon’’. Proc. 31st Int.” Conf. Neural Inf. Process. Syst., 43, 4860–4874.

    Google Scholar 

  16. 16.

    Zhu, T., Li, K., Herrero, P., Chen, J., & Georgiou, P. (2018). A deep learning algorithm for personalized blood glucose prediction. Proc. Int. Workshop Knowledge Discovery Healthcare Data., 87, 1–5.

    Google Scholar 

  17. 17.

    Rodbard, D. (2016). “Continuous glucose monitoring: A review of successes, challenges and opportunities’’’ in Diabetes Technol.” Therapeutics, 18(S2), S2-3.

    Google Scholar 

  18. 18.

    Abbott Diabetes Care Division. (2018). WELCOME to the Forefront of Diabetes Care. [Online]. Available:

  19. 19.

    D. Madigan (2005) ``Bayesian data mining for health surveillance,'' in Spatial and Syndromic Surveillance for Public Health. A. B. Lawson and K. Klienman, Eds. Chichester, U.K.: Wiley. 203_221.

  20. 20.

    D. Chen, Z. Runtong, S. Xiaopu, W. V. Li and H. Zhao (2018) “Predicting the interaction between treatment processes and disease progression by using hidden Markov model,'' Symmetry.

  21. 21.

    Pugazhenthi, S., Qin, L., & Reddy, P. H. (2017). Common neurodegenerative pathways in obesity, diabetes and Alzheimer’s disease. Biochim. Bio-phys. Acta-Mol. Basis Disease., 1863(5), 1037–1045.

    Article  Google Scholar 

Download references

Author information



Corresponding author

Correspondence to J. Omana.

Ethics declarations

Conflicts of interest

We authors not having any conflict of interest among ourselves to submit and publish our articles in Wireless Personal Communications journal.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Omana, J., Moorthi, M. Predictive Analysis and Prognostic Approach of Diabetes Prediction with Machine Learning Techniques. Wireless Pers Commun (2021).

Download citation


  • Prognostic modelling
  • Prediction
  • Automated modelling
  • Type 2 diabetes mellitus
  • Sparse data handling
  • Approximation
  • Machine learning algorithm