Principal Component Analysis Based on a Subset of Variables for Qualitative Data

  • Yuichi Mori
  • Yutaka Tanaka
  • Tomoyuki Tarumi
Conference paper
Part of the Studies in Classification, Data Analysis, and Knowledge Organization book series (STUDIES CLASS)


A principal component analysis based on a subset of variables is proposed by Tanaka and Mori (1996) to derive principal components which are computed as linear combinations of a subset of quantitative variables but which can reproduce all the variables very well. The present paper discusses an extension of their modified principal component analysis so that it can deal with qualitative variables by using the idea of the alternating least squares method by Young et al.(1978). A backward elimination procedure is applied to find a suitable sequence of subsets of variables. A numerical example is shown to illustrate the performance of the proposed procedure.


Principal Component Analysis Variable Selection Instrumental Variable Variable Selection Method Optimal Scaling 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. De Leeuw, J. (1982): Nonlinear principal component analysis. COMPSTAT 1982, 77–86, Physica-Verlag, Vienna.Google Scholar
  2. De Leeuw, J. (1984): The Gifi system of nonlinear multivariate analysis. Data Analysis and Informatics III, Diday, E. et al. (eds.), 415–424, Elsevier Science Publishers, North-Holland.Google Scholar
  3. De Leeuw, J. and Van Rijckevorsel, J. (1980): HOMALS & PRINCALS: Some generalization of principal components analysis. Data Analysis and Informatics, Diday, E. et al. (eds.), 415–424, North-Holland Publishing Company.Google Scholar
  4. Israels, A. Z. (1984): Redundancy analysis for qualitative variables. Psychometrika, 49, 331–346.CrossRefGoogle Scholar
  5. Jolliffe, I T. (1972): Discarding variables in a principal component analysis. I Artificial data. Applied Statistics,21 160–173.Google Scholar
  6. Jolliffe, I. T. (1973): Discarding variables in a principal component analysis. II. Real data. Applied Statistics, 22, 21–31.CrossRefGoogle Scholar
  7. Jolliffe, I. T. (1986): Principal component analysis. Springer-Verlag, New York.CrossRefGoogle Scholar
  8. Krzanowski, W. J. (1987a): Selection of variables to preserve multivariate data structure, using principal components. Applied Statistics, 36, 22–33.CrossRefGoogle Scholar
  9. Krzanowski, W. J. (1987b): Cross—validation in principal component analysis. Biometrics, 43, 575–584.MathSciNetCrossRefGoogle Scholar
  10. McCabe, G. P. (1984): Principal Variables. Technometrics, 26, 137–144.MathSciNetMATHCrossRefGoogle Scholar
  11. Rao, C. R. (1964): The use and interpretation of principal component analysis in applied research. Sankhya, A, 26, 329–58.Google Scholar
  12. Robert, P. and Escoufier, Y. (1976): A unifying tool for linear multivariate statistical methods: the RV-coefficient. Applied Statistics, 25, 257–265.MathSciNetCrossRefGoogle Scholar
  13. Sano, K. et al. (1977): Statistical studies on evaluation of mind disturbance of consciousness: Abstraction of characteristic clinical pictures by cross-sectional investigation. Sinkei Kenkyu no Shinpo, 21, 1052–1065 (in Japanese).Google Scholar
  14. Sano, K. et al. (1983): Statistical studies on evaluation of mind disturbance of consciousness. Journal of Neurosurg, 58, 223–230.CrossRefGoogle Scholar
  15. Tanaka. Y. and Kodake, K. (1981): A method of variable selection in factor analysis and its numerical investigation. Behaviorraetrika, 10, 49–61.Google Scholar
  16. Tanaka, Y. (1983): Some criteria for variable selection in factor analysis. Behaviorrnetrika, 13. 31–45.CrossRefGoogle Scholar
  17. Tanaka, Y and Mori, Y. (1996): Principal component analysis based on a subset of variables: Variable selection and sensitivity analysis. American Journal of Mathematics and Management Sciences, Special Volume (to appear).Google Scholar
  18. Young, F. et al. (1978): The principal components of mixed measurement level multivariate data: An alternating least squares method with optimal scaling features. Psychometrika, 43. 279–281.MATHCrossRefGoogle Scholar

Copyright information

© Springer Japan 1998

Authors and Affiliations

  • Yuichi Mori
    • 1
  • Yutaka Tanaka
    • 2
  • Tomoyuki Tarumi
    • 2
  1. 1.Kurashiki City College160 Kojima Hieda-choKurashiki 711Japan
  2. 2.Department of Environmental and Mathematical SciencesOkayama UniversityOkayama 700Japan

Personalised recommendations