Skip to main content

A Classification method based on principal component analysis and differential evolution algorithm applied for prediction diagnosis from clinical EMR heart data sets

  • Chapter
Book cover Computational Intelligence in Optimization

Part of the book series: Adaptation, Learning, and Optimization ((ALO,volume 7))

Abstract

In this article we have studied the usage of a classification method based on preprocessing the data first using principal component analysis, and then using the compressed data in actual classification process which is based on differential evolution algorithm, an evolutionary optimization algorithm. This method is applied here for prediction diagnosis from clinical data sets with chief complaint of chest pain using classical Electronic Medical Record (EMR), heart data sets. For experimentation we used a set of five frequently applied benchmark data sets including Cleveland, Hungarian, Long Beach, Switzerland and Statlog data sets. These data sets are containing demographic properties, clinical symptoms, clinical findings, laboratory test results specific electrocardiography (ECG), results pertaining to angina and coronary infarction, etc. In other words, classical EMR data pertaining to the evaluation of a chest pain patient and ruling out angina and/or Coronary Artery Disease, (CAD). The prediction diagnosis results with the proposed classification approach were found promisingly accurate. For example, the Switzerland data set was classified with 94.5 % ±0.4 % accuracy. Combining all these data sets resulted in the classification accuracy of 82.0 % ±0.5 %. We compared the results of the proposed method with the corresponding results of the other methods reported in the literature that have demonstrated relatively high classification performance in solving this problem. Depending on the case, the results of the proposed method were of equal level with the best compared methods, or outperformed their classification accuracy clearly. In general, the results are suggesting that the proposed method has potential in this task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abbass, H.A.: An evolutionary artificial neural networks approach for breast cancer diagnosis. Artificial Intelligence in Medicine 25, 265–281 (2002)

    Article  Google Scholar 

  2. Abdel-Aal, R.E.: GMDH-based Feature Ranking and Selection for Improved Classification of Medical Data. Journal of Biomedical Informatics 38, 456–468 (2005)

    Article  Google Scholar 

  3. Bacardit, J., Krasnogor, N.: Smart Crossover Operator with Multiple Parents for a Pittsburgh Learning Classifier System. In: Proceedings of the 8th Annual Conference on Genetic and Evolutionary Computation (GECCO 2006), pp. 1441–1448. ACM Press, New York (2006)

    Chapter  Google Scholar 

  4. Booker, L.: Improving the performance of generic algorithms in classifier systems. In: Grefenstette, J.J. (ed.) Proc. 1st Int. Conf. on Genetic Algorithms, Pittsburgh, PA, July 1985, pp. 80–92 (1985)

    Google Scholar 

  5. Cauwenberghs, G., Poggio, T.: Incremental and decremental support vector machine learning. In: Advanced Neural Information Processing Systems, vol. 13. MIT Press, Cambridge (2001)

    Google Scholar 

  6. Detrano, R., Janosi, A., Steinbrunn, W., Pfisterer, M., Sandhu, S., Guppy, K., Lee, S., Froelicher, V.: International application of a new probability algorithm for the diagnosis of coronary artery disease. Americal Journal of Cardiology 64, 304–310 (1989)

    Article  Google Scholar 

  7. Donoho, D.: High-dimensional data analysis: The curses and blessings of dimensionality. In: Lecture at the “Mathematical Challenges of the 21st Century” conference of the American Math. Society, Los Angeles, August 6-11 (2000)

    Google Scholar 

  8. Duda, R., Hart, P.: Pattern Classification and Scene Analysis. John Wiley & Sons, Chichester (1973)

    MATH  Google Scholar 

  9. Fodor, I.K.: A Survey of Dimension Reduction Techniques, LLNL technical report (June 2002)

    Google Scholar 

  10. Fogarty, T.C.: Co-evolving co-operative populations of rules in learning control systems. In: Fogarty, T.C. (ed.) AISB-WS 1994. LNCS, vol. 865, pp. 195–209. Springer, Heidelberg (1994)

    Google Scholar 

  11. Giacobini, M., Brabazon, A., Cagnoni, S., Gianni, A.D., Drechsler, R.: Automatic Recognition of Hand Gestures with Differential Evolution - Applications of Evolutionary Computing: Evoworkshops (2008)

    Google Scholar 

  12. Gomes-Skarmeta, A.F., Valdes, M., Jimenez, F., Marin-Blazquez, J.G.: Approximative fuzzy rules approaches for classification with hybrid-GA technigues. Information Sciences 136, 193–214 (2001)

    Article  Google Scholar 

  13. Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publisher, San Francisco (2000)

    Google Scholar 

  14. Herbrich, R., Graepel, T., Campbell, C.: Bayes point machines. J. Machine Learning Res. 1, 245–279 (2001)

    Article  MATH  MathSciNet  Google Scholar 

  15. Hervas-Martinez, C., Martinez-Estudillo, F.: Logistic Regression Using Covariates Obtained by Product-unit Neural Network Models. Pattern Recognition 40, 52–64 (2007)

    Article  MATH  Google Scholar 

  16. Holland, J.H.: Properties of the bucket-brigade algorithm. In: Grefenstette, J.J. (ed.) Proc. 1st Int. Conf. on Genetic Algorithms, Pittsburgh, PA, July 1985, pp. 1–7 (1985)

    Google Scholar 

  17. Holland, J.H.: Genetic algorithms and classifier systems: foundations and future directions. In: Proc. 2nd Int. Conf. on Genetic Algorithms, pp. 82–89 (1987)

    Google Scholar 

  18. Holland, J.H., Holyoak, K.J., Nisbett, R.E., Thagard, P.R.: Classifier systems, Q-morphisms and induction. In: Davis, L. (ed.) Genetic algorithms and Simulated Annealing, ch. 9, pp. 116–128 (1987)

    Google Scholar 

  19. Jolliffe, I.: Principal Component Analysis. Springer, Heidelberg (1986)

    Google Scholar 

  20. King, R.D., Feng, C., Sutherland, A.: Statlog: Comparison of Classification Algorithms on Large Real-World Problems. Applied Artificial Intelligence 9(3), 256–287 (1995)

    Article  Google Scholar 

  21. Li, Q., Li, T., Zhu, S., Kambhamettu, C.: Improving Medical/Biological Data Classification Performance by Wavelet Preprocessing. In: Proceedings of IEEE International Conference on Data mining (ICDM), pp. 657–660 (2002)

    Google Scholar 

  22. Łeski, J.M.: An ε− Margin Nonlinear Classifier Based on Fuzzy If-Then Rules. IEEE Transactions on Systems, Man and Cybernetics-Part B: Cybernetics 34(1), 68–76 (2004)

    Article  Google Scholar 

  23. Luukka, P., Sampo, J.: Similarity Classifier Using Differential Evolution and Genetic Algorithm in Weight Optimization. Journal of Advanced Computational Intelligence and Intelligent Informatics 8(6), 591–598 (2004)

    Google Scholar 

  24. Martens, H., Naes, T.: Multivariate Calibration. John Wiley, Chichester (1989)

    MATH  Google Scholar 

  25. Newman, D.J., Hettich, S., Blake, C.L., Merz, C.J.: UCI Repository of machine learning databases. University of California, Department of Information and Computer Science, Irvine, CA, http://www.ics.uci.edu/~mlearn/MLRepository.html (Cited 30 November 2008)

  26. Omran, M., Engelbrecht, A.P., Salman, A.: Differential Evolution Methods for Unsupervised Image Classification. In: Proceedings of the Seventh Congress on Evolutionary Computation (CEC 2005), Edinburgh, Scotland. IEEE Press, Los Alamitos (2005)

    Google Scholar 

  27. Pedreira, C.E.: Learning Vector Quantization with Training Data Selection. IEEE Transactions on Pattern Analysis and Machine Intelligence 28(1), 157–162 (2006)

    Article  Google Scholar 

  28. Polat, K., Sahan, S., Günes, S.: Automatic detection of heart disease using an artificial immune recognition system (AIRS) with fuzzy resource allocation mechanism and k-nn (nearest neighbour) based weighting preprocessing. Expert Systems with Applications 32, 625–631 (2007)

    Article  Google Scholar 

  29. Price, K.V.: New Ideas in Optimization. In: An Introduction to Differential Evolution, ch. 6, pp. 79–108. McGraw-Hill, London (1999)

    Google Scholar 

  30. Price, K., Storn, R., Lampinen, J.: Differential Evolution - A Practical Approach to Global Optimization. Springer, Heidelberg (2005)

    MATH  Google Scholar 

  31. Robertson, G.: Parallel implementation of genetic algorithms in a classifier system. In: Davis, L. (ed.) Genetic algorithms and Simulated Annealing, ch. 10, pp. 129–140 (1987)

    Google Scholar 

  32. Sirin, I., Güvenir, H.A.: An Algorithm for Classification by Feature Partitioning Technical Report CIS-9301, Bilkent University, Dept. of Computer Engineering and Information Science, Ankara (1993)

    Google Scholar 

  33. Storn, R., Price, K.V.: Differential Evolution - a Simple and Efficient Heuristic for Global Optimization over Continuous Space. Journal of Global Optimization 11(4), 341–359 (1997)

    Article  MATH  MathSciNet  Google Scholar 

  34. Vapnik, V.: The Nature of Statistical Learning Theory. Springer, Heidelberg (1995)

    MATH  Google Scholar 

  35. Wilson, S.W.: Hierarchical credit allocation in a classifier system. In: Davis, L. (ed.) Genetic algorithms and Simulated Annealing, ch. 8, pp. 104–115 (1987)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Luukka, P., Lampinen, J. (2010). A Classification method based on principal component analysis and differential evolution algorithm applied for prediction diagnosis from clinical EMR heart data sets. In: Tenne, Y., Goh, CK. (eds) Computational Intelligence in Optimization. Adaptation, Learning, and Optimization, vol 7. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12775-5_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-12775-5_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12774-8

  • Online ISBN: 978-3-642-12775-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics