Abstract
Many interesting functional bases, such as piecewise polynomials or wavelets, are examples of localized bases. We investigate the optimality of V -fold cross-validation and a variant called V -fold penalization in the context of the selection of linear models generated by localized bases in a heteroscedastic framework. It appears that while V -fold cross-validation is not asymptotically optimal when V is fixed, the V -fold penalization procedure is optimal. Simulation studies are also presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Antoniadis, A., Gregoire, G., & McKeague, I. (1994). Wavelet methods for curve estimation. Journal of the American Statistical Association, 89(428), 1340–1353.
Arlot, S. (2008). V -fold cross-validation improved: V -fold penalization. ArXiv:0802.0566v2. http://hal.archives-ouvertes.fr/hal-00239182/en/
Arlot, S., & Célisse, A. (2010). A survey of cross-validation procedures for model selection. Statistical Surveys, 4, 40–79.
Arlot, S., & Massart, P. (2009) Data-driven calibration of penalties for least-squares regression. Journal of Machine Learning Research, 10, 245–279 (electronic).
Barron, A., Birgé, L., & Massart, P. (1999). Risk bounds for model selection via penalization. Probability Theory and Related Fields, 113(3), 301–413.
Birgé, L., & Massart, P. (1998). Minimum contrast estimators on sieves: exponential bounds and rates of convergence. Bernoulli, 4(3), 329–375.
Cai, T. (1999). Adaptive wavelet estimation: a block thresholding and oracle inequality approach. Annals of Statistics, 27(3), 898–924.
Cai, T., & Brown, L. (1998). Wavelet shrinkage for nonequispaced samples. Annals of Statistics, 26, 1783–1799.
Cai, T., & Brown, L. (1999). Wavelet estimation for samples with random uniform design. Statistics and Probability Letters, 42(3), 313–321.
Donoho D., Maleki, A., & Shahram, M. (2006). Wavelab 850. http://statweb.stanford.edu/~wavelab/
Hall, P., & Turlach, B. (1997). Interpolation methods for nonlinear wavelet regression with irregularly spaced design. Annals of Statistics, 25(5), 1912–1925.
Kulik, R., & Raimondo, M. (2009). Wavelet regression in random design with heteroscedastic dependent errors. Annals of Statistics, 37(6A), 3396–3430.
Mallat, S. (2008). A wavelet tour of signal processing: The sparse way. New York: Academic.
Marron, J., Adak, S., Johnstone, I., Neumann, M., & Patil, P. (1998). Exact risk analysis of wavelet regression. Journal of Computational and Graphical Statistics, 7(3), 278–309.
Nason, G. (1996). Wavelet shrinkage using cross-validation. Journal of the Royal Statistical Society, Series B, 58, 463–479.
Navarro, F., & Saumard, A. (2017). Slope heuristics and V-Fold model selection in heteroscedastic regression using strongly localized bases. ESAIM: Probability and Statistics, 21, 412–451.
Saumard, A. (2012). Optimal upper and lower bounds for the true and empirical excess risks in heteroscedastic least-squares regression. Electronic Journal of Statistics, 6(1–2):579–655.
Saumard, A. (2013). Optimal model selection in heteroscedastic regression using piecewise polynomial functions. Electronic Journal of Statistics, 7, 1184–1223.
Saumard, A. (2017). On optimality of empirical risk minimization in linear aggregation. Bernoulli (to appear). arXiv:1605.03433
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Navarro, F., Saumard, A. (2018). Efficiency of the V -Fold Model Selection for Localized Bases. In: Bertail, P., Blanke, D., Cornillon, PA., Matzner-Løber, E. (eds) Nonparametric Statistics. ISNPS 2016. Springer Proceedings in Mathematics & Statistics, vol 250. Springer, Cham. https://doi.org/10.1007/978-3-319-96941-1_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-96941-1_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-96940-4
Online ISBN: 978-3-319-96941-1
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)