Which Resampling-Based Error Estimator for Benchmark Studies? A Power Analysis with Application to PLS-LDA

Boulesteix, Anne-Laure

doi:10.1007/978-3-319-40643-5_4

Anne-Laure Boulesteix

Part of the book series: Springer Proceedings in Mathematics & Statistics (PROMS, volume 173)

Included in the following conference series:

International Conference on Partial Least Squares and Related Methods

Abstract

Resampling-based methods such as k-fold cross-validation or repeated splitting into training and test sets are routinely used in supervised statistical learning to assess the performance of prediction methods on real data sets. In this paper, we consider methodological issues in comparison studies of prediction methods that involve several real data sets and use resampling-based error estimators as evaluation criteria. Papers in the literature often claim that, say, “Method 1 performs better than Method 2 on real data” without applying any proper statistical inference approach to support the claim and without clearly explaining what is meant by “performs better.” We recently proposed a statistical testing framework that provides a statistically correct formulation of such paired tests, which are often performed in the machine learning community to compare the performance of two methods on several real data sets. However, the behavior of the different available resampling-based error estimation procedures within this framework is unknown. In this paper, we empirically assess this behavior through an exemplary benchmark study based on 50 microarray data sets and formulate tentative recommendations regarding the choice of resampling-based error estimation procedure in light of the results.
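To make the setting concrete, the following minimal Python sketch shows the two resampling schemes named in the abstract (k-fold cross-validation and repeated splitting into training and test sets) as index generators, together with a paired t statistic computed on per-data-set error differences. It is only an illustration under stated assumptions: the function names, the 20% test fraction, and the five pairs of error values are hypothetical and are not taken from the chapter, and the paired t-test stands in for the general idea of a paired test rather than for the exact procedure studied here.

```python
import math
import random
import statistics


def kfold_indices(n, k, seed=0):
    """Partition range(n) into k disjoint test folds of near-equal size."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def repeated_splits(n, n_iter, test_fraction=0.2, seed=0):
    """Draw n_iter random (train, test) index splits of range(n)."""
    rng = random.Random(seed)
    n_test = max(1, round(n * test_fraction))
    splits = []
    for _ in range(n_iter):
        idx = list(range(n))
        rng.shuffle(idx)
        splits.append((idx[n_test:], idx[:n_test]))
    return splits


def paired_t_statistic(errors_a, errors_b):
    """Return (t, degrees of freedom) for per-data-set error differences."""
    diffs = [a - b for a, b in zip(errors_a, errors_b)]
    n = len(diffs)
    se = statistics.stdev(diffs) / math.sqrt(n)
    return statistics.mean(diffs) / se, n - 1


# Hypothetical error estimates on five data sets (illustrative values only).
errors_method1 = [0.10, 0.20, 0.15, 0.30, 0.25]
errors_method2 = [0.12, 0.25, 0.15, 0.33, 0.30]
t_stat, df = paired_t_statistic(errors_method1, errors_method2)
print(f"t = {t_stat:.3f} on {df} degrees of freedom")
```

In such a sketch each data set contributes one error estimate per method, so the data sets, not the individual observations, are the units of the test. The choice of resampling scheme affects the variability of each per-data-set estimate, which is why it can be expected to influence the power of the comparison.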

Acknowledgements

We thank Rory Wilson for helpful comments.

Author information

Authors and Affiliations

Department of Medical Informatics, Biometry and Epidemiology, University of Munich, Marchioninistr. 15, 81377, Munich, Germany
Anne-Laure Boulesteix

Authors

Anne-Laure Boulesteix

Corresponding author

Correspondence to Anne-Laure Boulesteix.

Editor information

Editors and Affiliations

School of Behavioral and Brain Sciences, The University of Texas at Dallas, Richardson, Texas, USA
Hervé Abdi
ESSEC Business School, Cergy Pontoise CX, France
Vincenzo Esposito Vinzi
CNAM, Paris, France
Giorgio Russolillo
CNAM, Paris Cedex 03, France
Gilbert Saporta
NEOMA Business School, Rouen, France
Laura Trinchera

About this paper

Cite this paper

Boulesteix, AL. (2016). Which Resampling-Based Error Estimator for Benchmark Studies? A Power Analysis with Application to PLS-LDA. In: Abdi, H., Esposito Vinzi, V., Russolillo, G., Saporta, G., Trinchera, L. (eds) The Multiple Facets of Partial Least Squares and Related Methods. PLS 2014. Springer Proceedings in Mathematics & Statistics, vol 173. Springer, Cham. https://doi.org/10.1007/978-3-319-40643-5_4

DOI: https://doi.org/10.1007/978-3-319-40643-5_4
Published: 14 October 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-40641-1
Online ISBN: 978-3-319-40643-5
eBook Packages: Mathematics and Statistics (R0)
