Abstract
In this article, we considered to assign a single scaling parameter to LASSO estimators for investigating and improving a problem of excessive shrinkage at a sparse representation. This problem is important because it directly affects a quality of model selection in LASSO. We derived a prediction risk for LASSO with scaling and obtained an optimal scaling parameter value that minimizes the risk. We then showed the risk is improved by assigning the optimal scaling value. In a numerical example, we found that an estimate of the optimal scaling value is larger than one especially at a sparse representation; i.e. excessive shrinkage is relaxed by expansion via scaling. Additionally, we observed that a risk for LASSO is high at a sparse representation and it is minimized at a relatively large model while this is improved by the introduction of an estimate of the optimal scaling value. We here constructed a fully empirical risk estimate that approximates the actual risk well. We then observed that, by applying the risk estimate as a model selection criterion, LASSO with scaling tends to obtain a model with low risk and high sparsity compared to LASSO without scaling.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Carter, C.K., Eagleson, G.K.: A comparison of variance estimators in nonparametric regression. J. R. Statist. Soc. B 54, 773–780 (1992)
Efron, B., Hastie, T., Johnstone, I., Tibshirani, R.: Least angle regression. Ann. Statist. 32, 407–499 (2004)
Fan, J., Li, R.: Variable selection via nonconcave penalized likelihood and its oracle properties. J. Amer. Statist. Assoc. 96, 1348–1360 (2001)
Hagiwara, K.: On scaling of soft-thresholding estimator. Neurocomputing 194, 360–371 (2016)
Leng, C.L., Lin, Y., Wahba, G.: A note on the lasso and related procedures in model selection. Stat. Sin. 16, 1273–1284 (2006)
Stein, C.: Estimation of the mean of a multivariate normal distribution. Ann. Stat. 9, 1135–1151 (1981)
Tibshirani, R.: Regression shrinkage and selection via the lasso. J. R. Statist. Soc. Ser. B. 58, 267–288 (1996)
Zou, H.: The adaptive lasso and its oracle properties. J. Amer. Statist. Assoc. 101, 1418–1492 (2006)
Zou, H., Hastie, T., Tibshirani, R.: On the degree of freedom of the LASSO. Ann. Statist. 35, 2173–2192 (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Hagiwara, K. (2016). A Problem in Model Selection of LASSO and Introduction of Scaling. In: Hirose, A., Ozawa, S., Doya, K., Ikeda, K., Lee, M., Liu, D. (eds) Neural Information Processing. ICONIP 2016. Lecture Notes in Computer Science(), vol 9948. Springer, Cham. https://doi.org/10.1007/978-3-319-46672-9_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-46672-9_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46671-2
Online ISBN: 978-3-319-46672-9
eBook Packages: Computer ScienceComputer Science (R0)