Abstract
A continuous factorial model together with a discrete clustering one are fitted simultaneously to two-way data, with the aim to identify the best partition of the objects and the best partition of the variables, according to a least-squares loss function. In addition, the proposed methodology allows to detect simultaneously, factors describing classes of variables and centroids characterizing classes of objects.
The continuous and discrete models are fitted to two-way data by solving a least-squares modeling problem, mathematically restated as a quadratic constrained program with mixed variables.
An iterative alternating least-squares algorithm is proposed to give an efficient solution of the NP-hard minimization problem: starting from clusters centroids, in a number of reduced dimensions, a constrained orthogonal rotation allows to highlight classes of variables that better identify the classification of the objects; then new cluster centroids are computed and the partition of the objects is given by solving an assignment problem. At each step of the alternating least-squares algorithm the objective function is not increased, thus the algorithm converges to at least a local optimal solution of the problem.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Desarbo, W., S. and Howard, D., J. And Jedidi, K.(1991): MULTICLUS: A new method for simultaneously performing multidimensional scaling and clustering, Psychometrika, 56, 121–136.
De Soete G. and Carroll, J. D.(1994): K-means clustering in a lowdimensional Euclidean space. In: Diday et al. (eds): New approaches in classification and data analysis, Springer, Heidelberg, 212–219.
Gabriel, K. R.(1971): The biplot graphic display of matrices with application to principal component analysis, Biometrika, 58, 453–467.
Gower, J.(1987): Procrustes Analysis. In: B. Fichet and C. Lauro (eds.): Methods for Multidimensional Data Analysis, ECAS, 247–258.
Macqueen, J.(1967): Some Methods for Classification and Analysis of Multivariate Observations. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and probability, Volume 1 Statistics, L.M. Le Cam and J. Neyman (eds.), Berkeley CA: University of California Press, 281–297.
Rizzi, A. and Vichi, M.(1995): Three-way Data Set Analysis. In: A. Rizzi (ed.) Some relations between matrices and structures of multidimensional data analysis, 93–166.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vichi, M. (2002). Discrete and Continuous Models for Two-way Data. In: Gaul, W., Ritter, G. (eds) Classification, Automation, and New Media. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-55991-4_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-55991-4_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43233-3
Online ISBN: 978-3-642-55991-4
eBook Packages: Springer Book Archive