Skip to main content

An Efficient Prediction Model for Diabetic Database Using Soft Computing Techniques

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5908))

Abstract

Organizations aim at harnessing predictive insights, using the vast real-time data stores that they have accumulated through the years, using data mining techniques. Health sector, has an extremely large source of digital data - patient-health related data-store, which can be effectively used for predictive analytics. This data, may consists of missing, incorrect and sometimes incomplete values sets that can have a detrimental effect on the decisions that are outcomes of data analytics. Using the PIMA Indians Diabetes dataset, we have proposed an efficient imputation method using a hybrid combination of CART and Genetic Algorithm, as a preprocessing step. The classical neural network model is used for prediction, on the preprocessed dataset. The accuracy achieved by the proposed model far exceeds the existing models, mainly because of the soft computing preprocessing adopted. This approach is simple, easy to understand and implement and practical in its approach.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Mitra, S., Acharya, T.: Data Mining, Multimedia, Soft-computing and Bioinformatics. Wiley Interscience, Hoboken (2004)

    Google Scholar 

  2. Little, R.J.A., Rubin, D.B.: Statistical Analysis with Missing Data. John Wiley, New York (1987)

    MATH  Google Scholar 

  3. Zhang, S., Qin, Z., Ling, C.X., Sheng, S.: Missing is Useful: Missing Values in Cost Sensitive Decision Trees. IEEE Transactions on Knowledge and Data Engineering 17(12) (2005)

    Google Scholar 

  4. Rady, E.A., Abd, El-Monsef, M.M.E., Abd, El-Latif, W.A.: A Modified Rough Set Approach to Incomplete Information Systems. Journal of Applied Mathematics and Decision Sciences 2007, article ID 58248 (2007)

    Google Scholar 

  5. Satya Kumar, D.V.R., Sriram, K., Rao, K.M., Murty, U.S.: Management of Filariasis Using Prediction Rules Derived from Data Mining. In: Bioinformation by Biomedical Informatics Publishing Group (2005)

    Google Scholar 

  6. Palaniappan, S., Awang, R.: Intelligent Heart Disease Prediction System using Data Mining Techniques. International Journal of Computer Science and Network Security 8(8) (2008)

    Google Scholar 

  7. Liu, P., Lei, L.: A Review of Missing Data Treatment Methods. Intelligent Information Management Systems and Technologies 1(3), 412–419 (2005)

    Google Scholar 

  8. Mehala, B., Ranjit Jeba Thangaiah, P., Vivekanandan, K.: Selecting Scalable Algorithms to Deal with Missing Values. International Journal of Recent Trends in Engineering 1(2) (2009)

    Google Scholar 

  9. Adbdella, M., Marwala, T.: Treatment of Missing Data Using Neural Networks and Genetic Algorithms. In: International Joint Conference on Neural Networks, Canada (2005)

    Google Scholar 

  10. Magnani, M.: Techniques for Dealing with Missing Data in Knowledge Discovery Tasks, Department of Computer Science, University of Bologna (2004)

    Google Scholar 

  11. Acuna, E., Rodriguez, C.: The Treatment of Missing Values and its Effect in the Classifier Accuracy. In: Multiscale Methods in Science and Engineering, pp. 639–647. Springer, Heidelberg (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bhat, V.H., Rao, P.G., Shenoy, P.D., Venugopal, K.R., Patnaik, L.M. (2009). An Efficient Prediction Model for Diabetic Database Using Soft Computing Techniques. In: Sakai, H., Chakraborty, M.K., Hassanien, A.E., Ślęzak, D., Zhu, W. (eds) Rough Sets, Fuzzy Sets, Data Mining and Granular Computing. RSFDGrC 2009. Lecture Notes in Computer Science(), vol 5908. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10646-0_40

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-10646-0_40

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-10645-3

  • Online ISBN: 978-3-642-10646-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics