ROC-Based Evolutionary Learning: Application to Medical Data Mining

Sebag, Michèle; Azé, Jérôme; Lucas, Noël

doi:10.1007/978-3-540-24621-3_31

ROC-Based Evolutionary Learning: Application to Medical Data Mining

Michèle Sebag⁹,
Jérôme Azé⁹ &
Noël Lucas⁹

Conference paper

741 Accesses
15 Citations

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2936))

Abstract

A novel way of comparing supervised learning algorithms has been introduced since the late 90’s, based on Receiver Operating Characteristics (ROC) curves.

From this approach is derived a NP complete optimization criterion for supervised learning, the area under the ROC curve.

This optimization criterion, tackled with evolution strategies, is experimentally compared to the structural risk criterion tackled by quadratic optimization in Support Vector Machines. Comparable results are obtained on a set of benchmark problems in the Irvine repository, within a fraction of the SVM computational cost.

Additionally, the variety of solutions provided by evolutionary computation can be exploited for visually inspecting the contributing factors of the phenomenon under study. The impact study and sensitivity analysis facilities offered by ROGER (ROC-based Genetic LearneR) are demonstrated on a medical application, the identification of Atherosclerosis Risk Factors.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bäck, T.: Evolutionary Algorithms in theory and practice. Oxford University Press, New York (1995)
Google Scholar
Bousquet, O., Elisseeff, A.: Stability and generalization. Journal of Machine Learning Research 2, 499–526 (2002)
Article MATH MathSciNet Google Scholar
Blake, C., Keogh, E., Merz, C.J.: UCI Repository of machine learning databases (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Bradley, A.P.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition (1997)
Google Scholar
Collobert, R., Bengio, S.: Svmtorch: Support vector machines for largescale regression problems. Journal of Machine Learning Research 1, 143–160 (2001)
Article MathSciNet Google Scholar
Card, S.K., Mackinlay, J.D., Shneiderman, B.: Information Visualization: Using vision to think. Morgan Kaufmann, San Francisco (1999)
Google Scholar
Cohen, W.W., Schapire, R.E., Singer, Y.: Learning to order things. In: Advances in Neural Information Processing Systems, vol. 10, The MIT Press, Cambridge (1998)
Google Scholar
Cristianini, N., Shawe-Taylor, J.: An introduction to Support Vector Machines. Cambridge University Press, Cambridge (2000)
Google Scholar
Deb, K.: Multi-Objective Optimization Using Evolutionary Algorithms. John Wiley, Chichester (2001)
MATH Google Scholar
Dietterich, T.G.: Approximate statistical tests for comparing supervised classification learning algorithms. Neural Computation (1998)
Google Scholar
Domingos, P.: Meta-cost: A general method for making classifiers cost sensitive. In: Knowledge Discovery from Databases, pp. 155–164. Morgan Kaufmann, San Francisco (1999)
Google Scholar
Furnkranz, J., Flach, P.: An analysis of rule evaluation metrics. In: Proc. of the 20th Int. Conf. on Machine Learning, Morgan Kaufmann, San Francisco (2003)
Google Scholar
Ferri, C., Flach, P.A., Hernandez-Orallo, J.: Learning decision trees using the area under the ROC curve. In: Morgan Kaufmann (ed.) Proceedings of the 19th International Conference on Machine Learning, pp. 179–186 (2002)
Google Scholar
Flach, P.: The geometry of ROC space: Understanding machine learning metrics through ROC isometrics. In: Proc. of the 20th Int. Conf. on Machine Learning, Morgan Kaufmann, San Francisco (2003)
Google Scholar
Fogel, D.B., Wasson, E.C., Boughton, E.M., Porto, V.W., Angeline, P.J.: Linear and neural models for classifying breast cancer. IEEE Trans. Medical Imaging 17(3), 485–488 (1998)
Article Google Scholar
Lucas, N., Azé, J., Sebag, M.: Atherosclerosis risk identification and visual analysis. In: Discovery Challenge ECML-PKDD 2002 (2002), http://lisp.vse.cz/challenge/ecmlpkdd2002/
Ling, C.X., Hunag, J., Zhang, H.: AUC: a better measure than accuracy in comparing learning algorithms. In: Proc. of 16th Canadian Conference on AI 2003 (2003) (to appear)
Google Scholar
Mozer, M.C., Dodier, R., Colagrosso, M.C., Guerra-Salcedo, C., Wolniewicz, R.: Prodding the ROC curve: Constrained optimization of classifier performance. In: Advances in Neural Information Processing Systems, vol. 13, The MIT Press, Cambridge (2001)
Google Scholar
Provost, F., Fawcett, T., Kohavi, R.: The case against accuracy estimation for comparing classifiers. In: Proc. of the 15th Int. Conf. on Machine Learning, pp. 445–553. Morgan Kaufmann, San Francisco (1998)
Google Scholar
Schölkopf, B., Burgess, C., Smola, A.: Advances in Kernel Methods. MIT Press, Cambridge (1998)
MATH Google Scholar
Schwefel, H.-P.: Numerical Optimization of Computer Models. John Wiley & Sons, New-York (1981), 2nd edn. (1995)
MATH Google Scholar
Shapire, R.E., Freund, Y., Bartlett, P., Lee, W.S.: Boosting the margin: a new explanation for the effectiveness of voting methods. In: Proc. of the 14th Int. Conf. on Machine Learning, pp. 322–330. Morgan Kaufmann, San Francisco (1997)
Google Scholar
Srinivasan, A., King, R.D., Bristol, D.W.: An assessment of submissions made to the Predictive Toxicology Evaluation Challenge. In: Proc. of Int. Joint Conf. on Artificial Intelligence, IJCAI 1999, pp. 270–275. Morgan Kaufmann, San Francisco (1999)
Google Scholar
Vapnik, V.N.: Statistical Learning Theory. Wiley, Chichester (1998)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

TAO : Thème Apprentissage et Optimisation, Laboratoire de Recherche en Informatique, CNRS UMR 8623, Université Paris-Sud Orsay, 91405, Orsay Cedex
Michèle Sebag, Jérôme Azé & Noël Lucas

Authors

Michèle Sebag
View author publications
You can also search for this author in PubMed Google Scholar
Jérôme Azé
View author publications
You can also search for this author in PubMed Google Scholar
Noël Lucas
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

UMR-CNRS 6632, Université de Provence, 39 rue F. Joliot-Curie, 13453, Marseille cedex 13, France
Pierre Liardet
Université Louis Pasteur, LSIIT, FDBT, Pôle API, F-67400, Illkirch, France
Pierre Collet
Laboratoire d’Informatique du Littoral, Maison de la Recherche Blaise Pascal, 50 rue Ferdinand Buisson, BP 719, 62228, Calais Cedex, France
Cyril Fonlupt
INRIA Saclay - Ile-de-France, Parc Orsay Université, 4, rue Jacques Monod, 91893, Orsay Cedex, France
Evelyne Lutton
TAO, INRIA Saclay & LRI (UMR CNRS 8623), Bât 490, Université Paris-Sud, 91405, Orsay Cedex, France
Marc Schoenauer

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sebag, M., Azé, J., Lucas, N. (2004). ROC-Based Evolutionary Learning: Application to Medical Data Mining. In: Liardet, P., Collet, P., Fonlupt, C., Lutton, E., Schoenauer, M. (eds) Artificial Evolution. EA 2003. Lecture Notes in Computer Science, vol 2936. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24621-3_31

Download citation

DOI: https://doi.org/10.1007/978-3-540-24621-3_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21523-3
Online ISBN: 978-3-540-24621-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics