
Variable Selection

Applied Multivariate Statistical Analysis

Abstract

Variable selection plays an important role in statistical modelling. We are frequently interested not only in using a model for prediction, but also in correctly identifying the relevant variables, that is, in recovering the correct model under the given assumptions. It is known that under certain conditions the ordinary least squares (OLS) method produces poor predictions and does not yield a parsimonious model, which leads to overfitting. The objective of variable selection methods is therefore to find the variables that are most relevant for prediction. Such methods are particularly important when the true underlying model has a sparse representation (many parameters close to zero). Identifying the relevant variables reduces the noise and thereby improves the prediction performance of the fitted model.
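
As a brief illustration of the idea (not taken from the chapter), the following Python sketch contrasts OLS with the lasso of Tibshirani (1996), one standard variable selection method, on a synthetic sparse model. The sample size, number of candidate variables, true coefficients and penalty value alpha = 0.1 are illustrative assumptions, and scikit-learn is used only for convenience; in practice the penalty would be chosen by cross-validation.

  # Minimal sketch: OLS vs. lasso on a sparse synthetic regression model.
  # All data-generating choices below are illustrative assumptions.
  import numpy as np
  from sklearn.linear_model import LinearRegression, Lasso

  rng = np.random.default_rng(0)
  n, p = 100, 20                      # observations, candidate variables
  beta = np.zeros(p)
  beta[:3] = [3.0, -2.0, 1.5]         # only the first 3 variables are relevant
  X = rng.normal(size=(n, p))
  y = X @ beta + rng.normal(scale=1.0, size=n)

  ols = LinearRegression().fit(X, y)
  lasso = Lasso(alpha=0.1).fit(X, y)  # alpha fixed ad hoc; use CV in practice

  print("nonzero OLS coefficients:  ", np.sum(np.abs(ols.coef_) > 1e-8))
  print("nonzero lasso coefficients:", np.sum(np.abs(lasso.coef_) > 1e-8))
  print("variables selected by lasso:", np.flatnonzero(np.abs(lasso.coef_) > 1e-8))

Run as an ordinary Python script: OLS generally keeps all 20 coefficients nonzero, whereas the lasso typically sets most of the irrelevant coefficients exactly to zero and so recovers (approximately) the sparse structure of the true model.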

Copyright information

© 2015 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Härdle, W.K., Simar, L. (2015). Variable Selection. In: Applied Multivariate Statistical Analysis. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45171-7_9
