Comparison of Cluster Analysis Approaches for Binary Data
Cluster methods allow to partition observations into homogeneous groups. Standard cluster analysis approaches consider the variables used to partition observations as continuous. In this work, we deal with the particular case all variables are binary. We focused on two specific methods that can handle binary data: the monothetic analysis and the model-based co-clustering. The aim is to compare the outputs performing these two methods on a common dataset, and figure out how they differ. The dataset on which the two methods are performed is a UNESCO dataset made up of 58 binary variables concerning the ability of UNESCO management to use Internet to promote world heritage sites.
KeywordsCluster analysis Binary data Monothetic analysis cluster Model-based co-clustering UNESCO
- 7.Maechler, M., Rousseeuw, P., Struyf, A., Hubert, M., Hornik, K.: cluster: Cluster analysis basics and extensions. R package version 1(4) (2016)Google Scholar
- 10.World Heritage Committee: Operational guidelines for the implementation of the World heritage Convention. Unesco World Heritage Centre (2008)Google Scholar
- 11.Zhou, Q., DeSantis, R.: Usability issues in city tourism web site design: a content analysis. In: IPCC 2005. Proceedings. International Professional Communication Conference, 2005., 789–796. IEEE (2005)Google Scholar