Introducing ROC Curves as Error Measure Functions: A New Approach to Train ANN-Based Biomedical Data Classifiers

Ramos-Pollán, Raúl; Guevara-López, Miguel Ángel; Oliveira, Eugénio

doi:10.1007/978-3-642-16687-7_68

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 6419)

Included in the following conference series:

Iberoamerican Congress on Pattern Recognition

Abstract

This paper explores the use of the area (Az) under the Receiver Operating Characteristic (ROC) curve as an error measure to guide the training of ANN-based machine learning classifiers for biomedical data analysis. Error measures such as the root mean square error (RMS) guide training algorithms by measuring how far solutions are from the ideal classification, yet it is well known that optimal classification rates do not necessarily yield optimal Az values. Our hypothesis is that Az-based error measures can guide existing training algorithms to obtain better Az values than other error measures. This was tested by training 280 different configurations of ANN-based classifiers with simulated annealing, using five biomedical binary datasets from the UCI machine learning repository with different test/train data splits. Each ANN configuration was trained with both the Az-based and the RMS-based error measure. On average, Az improved by 7.98% on testing data (9.32% on training data) when 70% of each dataset's elements were used for training. Further analysis reveals interesting patterns (for instance, the Az improvement is greater when Az values are lower). These results encourage us to further explore the use of Az-based error measures in training methods for classifiers in a more generalized manner.
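The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the single sigmoid unit standing in for an ANN, the linear cooling schedule, the step size, and every function name (`az`, `az_error`, `rms_error`, `predict`, `anneal`) are assumptions made for illustration. The sketch computes Az by pairwise comparison of positive and negative scores, shows a case where the output with the lower RMS error has the lower Az, and passes either error measure to the same simulated annealing loop.

```python
import math
import random


def az(scores, labels):
    """Area under the ROC curve via pairwise comparison (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def az_error(scores, labels):
    """Error to minimise: 0 when every positive outranks every negative."""
    return 1.0 - az(scores, labels)


def rms_error(scores, labels):
    """Root mean square distance between outputs and 0/1 targets."""
    return math.sqrt(sum((s - y) ** 2 for s, y in zip(scores, labels)) / len(labels))


def predict(weights, rows):
    """Hypothetical stand-in for an ANN: one sigmoid unit, last weight is the bias."""
    outputs = []
    for row in rows:
        z = weights[-1] + sum(w * x for w, x in zip(weights, row))
        z = max(-60.0, min(60.0, z))  # clamp to keep math.exp in range
        outputs.append(1.0 / (1.0 + math.exp(-z)))
    return outputs


def anneal(rows, labels, error_fn, steps=500, t0=1.0, seed=0):
    """Simulated annealing over the weights; error_fn is the pluggable measure."""
    rng = random.Random(seed)
    current = [rng.uniform(-1.0, 1.0) for _ in range(len(rows[0]) + 1)]
    current_err = error_fn(predict(current, rows), labels)
    best, best_err = list(current), current_err
    for step in range(steps):
        temp = t0 * (1.0 - step / steps) + 1e-9  # assumed linear cooling
        cand = [w + rng.gauss(0.0, 0.5) for w in current]
        cand_err = error_fn(predict(cand, rows), labels)
        delta = cand_err - current_err
        # Always accept improvements; accept worse moves with Boltzmann probability.
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            current, current_err = cand, cand_err
            if current_err < best_err:
                best, best_err = list(current), current_err
    return best, best_err


# Two made-up output vectors for the same four labels.
labels = [0, 0, 1, 1]
a = [0.40, 0.45, 0.55, 0.60]  # perfect ranking, outputs far from 0/1
b = [0.05, 0.60, 0.95, 0.55]  # closer to targets on average, one pair misranked
print("a:", az(a, labels), rms_error(a, labels))
print("b:", az(b, labels), rms_error(b, labels))

# Train the same toy model under each error measure (toy data, not from the paper).
rows = [[0.1], [0.3], [0.4], [0.6], [0.7], [0.9]]
y = [0, 0, 1, 0, 1, 1]
for name, fn in (("Az", az_error), ("RMS", rms_error)):
    w, err = anneal(rows, y, fn)
    print(name, "error:", err, "Az:", az(predict(w, rows), y))
```

In the first comparison, vector `b` has the smaller RMS error but the smaller Az, which is the kind of disagreement between error measures that motivates the hypothesis. Because Az depends only on the ranking of the outputs, it is piecewise constant in the weights, so a search method that needs no gradients, such as simulated annealing, can use it directly.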

Author information

Authors and Affiliations

CETA-CIEMAT Centro Extremeño de Tecnologías Avanzadas, Calle Sola 1, 10200, Trujillo, Spain
Raúl Ramos-Pollán
INEGI Instituto de Engenharia, Mecanica e Gestão Industrial, Universidade do Porto, Campus da FEUP, Rua Roberto Frias 400, 4200-465, Porto, Portugal
Miguel Ángel Guevara-López
LIACC-DEI-Faculdade de Engenharia, Universidade do Porto, Rua Roberto Frias s/n, 4200-465, Porto, Portugal
Eugénio Oliveira

Editor information

Editors and Affiliations

Département Traitement du Signal et des Images, CNRS LTCI, Télécom ParisTech, 46 rue Barrault, 75634, Paris Cedex 13, France
Isabelle Bloch
Institute of Mathematics and Statistics - IME, Department of Computer Science, University of São Paulo - USP, Rua do Matão 1010, SP CEP 05508-090, São Paulo, Brazil
Roberto M. Cesar Jr.

About this paper

Cite this paper

Ramos-Pollán, R., Guevara-López, M.Á., Oliveira, E. (2010). Introducing ROC Curves as Error Measure Functions: A New Approach to Train ANN-Based Biomedical Data Classifiers. In: Bloch, I., Cesar, R.M. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2010. Lecture Notes in Computer Science, vol 6419. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16687-7_68

DOI: https://doi.org/10.1007/978-3-642-16687-7_68
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16686-0
Online ISBN: 978-3-642-16687-7
eBook Packages: Computer Science, Computer Science (R0)
