Abstract
In this article we have studied the usage of a classification method based on preprocessing the data first using principal component analysis, and then using the compressed data in actual classification process which is based on differential evolution algorithm, an evolutionary optimization algorithm. This method is applied here for prediction diagnosis from clinical data sets with chief complaint of chest pain using classical Electronic Medical Record (EMR), heart data sets. For experimentation we used a set of five frequently applied benchmark data sets including Cleveland, Hungarian, Long Beach, Switzerland and Statlog data sets. These data sets are containing demographic properties, clinical symptoms, clinical findings, laboratory test results specific electrocardiography (ECG), results pertaining to angina and coronary infarction, etc. In other words, classical EMR data pertaining to the evaluation of a chest pain patient and ruling out angina and/or Coronary Artery Disease, (CAD). The prediction diagnosis results with the proposed classification approach were found promisingly accurate. For example, the Switzerland data set was classified with 94.5 % ±0.4 % accuracy. Combining all these data sets resulted in the classification accuracy of 82.0 % ±0.5 %. We compared the results of the proposed method with the corresponding results of the other methods reported in the literature that have demonstrated relatively high classification performance in solving this problem. Depending on the case, the results of the proposed method were of equal level with the best compared methods, or outperformed their classification accuracy clearly. In general, the results are suggesting that the proposed method has potential in this task.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abbass, H.A.: An evolutionary artificial neural networks approach for breast cancer diagnosis. Artificial Intelligence in Medicine 25, 265–281 (2002)
Abdel-Aal, R.E.: GMDH-based Feature Ranking and Selection for Improved Classification of Medical Data. Journal of Biomedical Informatics 38, 456–468 (2005)
Bacardit, J., Krasnogor, N.: Smart Crossover Operator with Multiple Parents for a Pittsburgh Learning Classifier System. In: Proceedings of the 8th Annual Conference on Genetic and Evolutionary Computation (GECCO 2006), pp. 1441–1448. ACM Press, New York (2006)
Booker, L.: Improving the performance of generic algorithms in classifier systems. In: Grefenstette, J.J. (ed.) Proc. 1st Int. Conf. on Genetic Algorithms, Pittsburgh, PA, July 1985, pp. 80–92 (1985)
Cauwenberghs, G., Poggio, T.: Incremental and decremental support vector machine learning. In: Advanced Neural Information Processing Systems, vol. 13. MIT Press, Cambridge (2001)
Detrano, R., Janosi, A., Steinbrunn, W., Pfisterer, M., Sandhu, S., Guppy, K., Lee, S., Froelicher, V.: International application of a new probability algorithm for the diagnosis of coronary artery disease. Americal Journal of Cardiology 64, 304–310 (1989)
Donoho, D.: High-dimensional data analysis: The curses and blessings of dimensionality. In: Lecture at the “Mathematical Challenges of the 21st Century” conference of the American Math. Society, Los Angeles, August 6-11 (2000)
Duda, R., Hart, P.: Pattern Classification and Scene Analysis. John Wiley & Sons, Chichester (1973)
Fodor, I.K.: A Survey of Dimension Reduction Techniques, LLNL technical report (June 2002)
Fogarty, T.C.: Co-evolving co-operative populations of rules in learning control systems. In: Fogarty, T.C. (ed.) AISB-WS 1994. LNCS, vol. 865, pp. 195–209. Springer, Heidelberg (1994)
Giacobini, M., Brabazon, A., Cagnoni, S., Gianni, A.D., Drechsler, R.: Automatic Recognition of Hand Gestures with Differential Evolution - Applications of Evolutionary Computing: Evoworkshops (2008)
Gomes-Skarmeta, A.F., Valdes, M., Jimenez, F., Marin-Blazquez, J.G.: Approximative fuzzy rules approaches for classification with hybrid-GA technigues. Information Sciences 136, 193–214 (2001)
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publisher, San Francisco (2000)
Herbrich, R., Graepel, T., Campbell, C.: Bayes point machines. J. Machine Learning Res. 1, 245–279 (2001)
Hervas-Martinez, C., Martinez-Estudillo, F.: Logistic Regression Using Covariates Obtained by Product-unit Neural Network Models. Pattern Recognition 40, 52–64 (2007)
Holland, J.H.: Properties of the bucket-brigade algorithm. In: Grefenstette, J.J. (ed.) Proc. 1st Int. Conf. on Genetic Algorithms, Pittsburgh, PA, July 1985, pp. 1–7 (1985)
Holland, J.H.: Genetic algorithms and classifier systems: foundations and future directions. In: Proc. 2nd Int. Conf. on Genetic Algorithms, pp. 82–89 (1987)
Holland, J.H., Holyoak, K.J., Nisbett, R.E., Thagard, P.R.: Classifier systems, Q-morphisms and induction. In: Davis, L. (ed.) Genetic algorithms and Simulated Annealing, ch. 9, pp. 116–128 (1987)
Jolliffe, I.: Principal Component Analysis. Springer, Heidelberg (1986)
King, R.D., Feng, C., Sutherland, A.: Statlog: Comparison of Classification Algorithms on Large Real-World Problems. Applied Artificial Intelligence 9(3), 256–287 (1995)
Li, Q., Li, T., Zhu, S., Kambhamettu, C.: Improving Medical/Biological Data Classification Performance by Wavelet Preprocessing. In: Proceedings of IEEE International Conference on Data mining (ICDM), pp. 657–660 (2002)
Łeski, J.M.: An ε− Margin Nonlinear Classifier Based on Fuzzy If-Then Rules. IEEE Transactions on Systems, Man and Cybernetics-Part B: Cybernetics 34(1), 68–76 (2004)
Luukka, P., Sampo, J.: Similarity Classifier Using Differential Evolution and Genetic Algorithm in Weight Optimization. Journal of Advanced Computational Intelligence and Intelligent Informatics 8(6), 591–598 (2004)
Martens, H., Naes, T.: Multivariate Calibration. John Wiley, Chichester (1989)
Newman, D.J., Hettich, S., Blake, C.L., Merz, C.J.: UCI Repository of machine learning databases. University of California, Department of Information and Computer Science, Irvine, CA, http://www.ics.uci.edu/~mlearn/MLRepository.html (Cited 30 November 2008)
Omran, M., Engelbrecht, A.P., Salman, A.: Differential Evolution Methods for Unsupervised Image Classification. In: Proceedings of the Seventh Congress on Evolutionary Computation (CEC 2005), Edinburgh, Scotland. IEEE Press, Los Alamitos (2005)
Pedreira, C.E.: Learning Vector Quantization with Training Data Selection. IEEE Transactions on Pattern Analysis and Machine Intelligence 28(1), 157–162 (2006)
Polat, K., Sahan, S., Günes, S.: Automatic detection of heart disease using an artificial immune recognition system (AIRS) with fuzzy resource allocation mechanism and k-nn (nearest neighbour) based weighting preprocessing. Expert Systems with Applications 32, 625–631 (2007)
Price, K.V.: New Ideas in Optimization. In: An Introduction to Differential Evolution, ch. 6, pp. 79–108. McGraw-Hill, London (1999)
Price, K., Storn, R., Lampinen, J.: Differential Evolution - A Practical Approach to Global Optimization. Springer, Heidelberg (2005)
Robertson, G.: Parallel implementation of genetic algorithms in a classifier system. In: Davis, L. (ed.) Genetic algorithms and Simulated Annealing, ch. 10, pp. 129–140 (1987)
Sirin, I., Güvenir, H.A.: An Algorithm for Classification by Feature Partitioning Technical Report CIS-9301, Bilkent University, Dept. of Computer Engineering and Information Science, Ankara (1993)
Storn, R., Price, K.V.: Differential Evolution - a Simple and Efficient Heuristic for Global Optimization over Continuous Space. Journal of Global Optimization 11(4), 341–359 (1997)
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, Heidelberg (1995)
Wilson, S.W.: Hierarchical credit allocation in a classifier system. In: Davis, L. (ed.) Genetic algorithms and Simulated Annealing, ch. 8, pp. 104–115 (1987)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Luukka, P., Lampinen, J. (2010). A Classification method based on principal component analysis and differential evolution algorithm applied for prediction diagnosis from clinical EMR heart data sets. In: Tenne, Y., Goh, CK. (eds) Computational Intelligence in Optimization. Adaptation, Learning, and Optimization, vol 7. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12775-5_11
Download citation
DOI: https://doi.org/10.1007/978-3-642-12775-5_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12774-8
Online ISBN: 978-3-642-12775-5
eBook Packages: EngineeringEngineering (R0)