Abstract
In this work, a discrete model for clustering and a continuous factorial one for dimension reduction are simultaneously fitted to categorical data, with the aim of identifying the best partition of the objects, described by the best orthogonal linear combinations of the factors, according to the least-squares criterion. This new methodology named multiple correspondence k-means is a useful alternative to the Tandem Analysis in the case of categorical data. Then, this approach has a double objective: data reduction and synthesis, simultaneously in the direction of rows and columns of the data matrix.
References
Arabie, P. (1994). Cluster analysis in marketing research. Advanced Methods in Marketing Research, 160–189.
Benzécri, J. P. (1979). Sur le calcul des taux d’inertie dans l’analyse d’un questionnaire, addendum et erratum à [BIN. MULT.]. Les cahiers de l’analyse des données, 4(3), 377–378.
Bolton, R. J., Krzanowski, W. J. (2012). Projection pursuit clustering for exploratory data analysis. Journal of Computational and Graphical Statistics.
Desarbo, W., Jedidi, K., Cool, K., & Schendel, D. (1991). Simultaneous multidimensional unfolding and cluster analysis: An investigation of strategic groups. Marketing Letters, 2(2), 129–146.
De Soete, G., & Carroll, J. D. (1994). K-means clustering in a low-dimensional Euclidean space. New approaches in classification and data analysis (pp. 212–219). Berlin, Heidelberg: Springer.
De Soete, G., & Heiser, W. J. (1993). A latent class unfolding model for analyzing single stimulus preference ratings. Psychometrika, 58(4), 545–565.
Greenacre, M. J. (1984) Theory and applications of correspondence analysis.
Heiser, W. J. (1993). Clustering in low-dimensional space. Information and classification (pp. 162–173). Berlin, Heidelberg: Springer.
Hwang, H., Dillon, W. R., & Takane, Y. (2006). An extension of multiple correspondence analysis for identifying heterogeneous subgroups of respondents. Psychometrika, 71(1), 161–171.
MacQueen, J. (1967). Some methods for classification and analysis of multivariate observations. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, (Vol. 1, no. 14, pp. 281–297).
Timmerman, M. E., Ceulemans, E., Kiers, H. A., & Vichi, M. (2010). Factorial and reduced K-means reconsidered. Computational Statistics & Data Analysis, 54(7), 1858–1871.
Vichi, M., Kiers, H. A.: Factorial K-means analysis for two-way data. Computational Statistics & Data Analysis, 37(1), pp. 49–64 (2001). Bioinformatics, 17(9), 763–774 (2001).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Fordellone, M., Vichi, M. (2017). Multiple Correspondence K-Means: Simultaneous Versus Sequential Approach for Dimension Reduction and Clustering. In: Lauro, N., Amaturo, E., Grassia, M., Aragona, B., Marino, M. (eds) Data Science and Social Research. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-55477-8_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-55477-8_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-55476-1
Online ISBN: 978-3-319-55477-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)