Skip to main content

A Proposal for Handling Categorical Predictors in PLS Regression Framework

  • Conference paper
  • First Online:

Abstract

To regress one or more quantitative response variables on a set of predictor variables of different nature, it is necessary to transform non-quantitative predictors in such a way that they can be analyzed together with the other variables measured on an interval scale. Here, a new proposal to cope with this issue in Partial Least Squares (PLS) regression framework is presented. The approach consists in quantifying each non-quantitative predictor according to Hayashi’s first quantification method, using the dependent variable (or, in the multivariate case, a linear combination of the response variables) as an external criterion. The PLS weight of each variable which is quantified according to the proposed approach is coherent with the statistical relationship between its original non-quantitative variable and the response variable(s) as expressed in terms of Pearson’s correlation ratio. Firstly, the case where one variable depends on a set of both categorical and quantitative variables is discussed; then, a modified PLS algorithm, called PLS-CAP, is proposed to obtain the quantifications of the categorical predictors in the multi-response case. An application on real data is presented in order to enhance the properties of the quantification approach based on the PLS-CAP with respect to the classical approach based on the dummy code of the categorical variables.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Breiman, L., Friedman, J.: Estimating optimal transformations for multiple regression and correlation. J. Am. Stat. Assoc. 80, 580–598 (1985)

    Article  MathSciNet  MATH  Google Scholar 

  2. Hayashi, C.: On the prediction of phenomena from qualitative data and the quantification of qualitative data from the mathematicostatistical point of view. Ann. Inst. Statist. Math. 3, 69–98 (1952)

    Article  MATH  Google Scholar 

  3. Tenenhaus, M.: La Regression PLS. Technip, Paris (1998)

    MATH  Google Scholar 

  4. Wold, S., Johansson, E. Cocchi, M.: PLS: partial least squares projections to latent structures. In: Kubinyi, H. (ed.), 3D QSAR in Drug Design: Theory, Methods and Applications, pp. 523–550. ESCOM Science Publishers, Leiden (1993)

    Google Scholar 

  5. Young F.W., Jan de Leeuw J., Takane Y.: Regression with qualitative and quantitative variables: an alternating least squares method with optimal scaling features. Psychometrika 41, 505–529 (1976)

    Article  MATH  Google Scholar 

Download references

Acknowledgments

The present paper is financially supported by National Interest Research Project (PRIN) 2006 (co-financed by Italian Ministry of University and Research) Multivariate statistical models for the ex-ante and the ex-post analysis of regulatory impact. National Coordinator: Prof. Carlo Natale Lauro (University of Naples “Federico II”)

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Giorgio Russolillo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Russolillo, G., Lauro, C.N. (2011). A Proposal for Handling Categorical Predictors in PLS Regression Framework. In: Fichet, B., Piccolo, D., Verde, R., Vichi, M. (eds) Classification and Multivariate Analysis for Complex Data Structures. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13312-1_36

Download citation

Publish with us

Policies and ethics