Comparing Approaches for Clustering Mixed Mode Data: An Application in Marketing Research
Practical applications in marketing research often involve mixtures of categorical and continuous variables. For the purpose of clustering, a variety of algorithms has been proposed to deal with mixed mode data. In this paper we apply some of these techniques on two data sets regarding marketing problems. We also propose an approach based on the consensus between partitions obtained by considering separately each variable or subsets of variables having the same scale. This approach may be applied to data with many categorical variables and does not impose restrictive assumptions on the variable distribution. We finally suggest a summarizing fuzzy partition with membership degrees obtained as a function of the classes determined by the different methods.
KeywordsMembership Degree Optional Accessory Marketing Research Rand Index Fuzzy Partition
- Chiu, T., Fang, D., Chen, J., Wang, Y., & Jeris, C. (2001). A robust and scalable clustering algorithm for mixed type attributes in large database environment. In Proceedings of the seventh ACM SIGKDD international conference on knowledge discovery and data mining 263–268. San Francisco, CA.CrossRefGoogle Scholar
- Kaufman, L., & Rousseuw, P. J. (1990). Finding groups in data: An introduction to cluster analysis. New York: Wiley.Google Scholar
- Zhang, T., Ramakrishnan, R., & Livny, M. (1996). BIRCH: An efficient data clustering method for very large databases. In: Proceedings of the ACM SIGMOD Conference on Management of Data 103–114. Montreal, Canada.Google Scholar