Abstract
To regress one or more quantitative response variables on a set of predictor variables of different nature, it is necessary to transform non-quantitative predictors in such a way that they can be analyzed together with the other variables measured on an interval scale. Here, a new proposal to cope with this issue in Partial Least Squares (PLS) regression framework is presented. The approach consists in quantifying each non-quantitative predictor according to Hayashi’s first quantification method, using the dependent variable (or, in the multivariate case, a linear combination of the response variables) as an external criterion. The PLS weight of each variable which is quantified according to the proposed approach is coherent with the statistical relationship between its original non-quantitative variable and the response variable(s) as expressed in terms of Pearson’s correlation ratio. Firstly, the case where one variable depends on a set of both categorical and quantitative variables is discussed; then, a modified PLS algorithm, called PLS-CAP, is proposed to obtain the quantifications of the categorical predictors in the multi-response case. An application on real data is presented in order to enhance the properties of the quantification approach based on the PLS-CAP with respect to the classical approach based on the dummy code of the categorical variables.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Breiman, L., Friedman, J.: Estimating optimal transformations for multiple regression and correlation. J. Am. Stat. Assoc. 80, 580–598 (1985)
Hayashi, C.: On the prediction of phenomena from qualitative data and the quantification of qualitative data from the mathematicostatistical point of view. Ann. Inst. Statist. Math. 3, 69–98 (1952)
Tenenhaus, M.: La Regression PLS. Technip, Paris (1998)
Wold, S., Johansson, E. Cocchi, M.: PLS: partial least squares projections to latent structures. In: Kubinyi, H. (ed.), 3D QSAR in Drug Design: Theory, Methods and Applications, pp. 523–550. ESCOM Science Publishers, Leiden (1993)
Young F.W., Jan de Leeuw J., Takane Y.: Regression with qualitative and quantitative variables: an alternating least squares method with optimal scaling features. Psychometrika 41, 505–529 (1976)
Acknowledgments
The present paper is financially supported by National Interest Research Project (PRIN) 2006 (co-financed by Italian Ministry of University and Research) Multivariate statistical models for the ex-ante and the ex-post analysis of regulatory impact. National Coordinator: Prof. Carlo Natale Lauro (University of Naples “Federico II”)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Russolillo, G., Lauro, C.N. (2011). A Proposal for Handling Categorical Predictors in PLS Regression Framework. In: Fichet, B., Piccolo, D., Verde, R., Vichi, M. (eds) Classification and Multivariate Analysis for Complex Data Structures. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13312-1_36
Download citation
DOI: https://doi.org/10.1007/978-3-642-13312-1_36
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13311-4
Online ISBN: 978-3-642-13312-1
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)