Skip to main content

Cluster Analysis in Marketing Research

  • Living reference work entry
  • Latest version View entry history
  • First Online:
Handbook of Market Research

Abstract

Cluster analysis is an exploratory tool for compressing data into a smaller number of groups or representing points. The latter aims at sufficiently summarizing the underlying data structure and as such can serve the analyst for further consideration instead of dealing with the complete data set. Because of this data compression property, cluster analysis remains to be an essential part of the marketing analyst’s toolbox in today’s data rich business environment. This chapter gives an overview of the various approaches and methods for cluster analysis and links them with the most relevant marketing research contexts. We also provide pointers to the specific packages and functions for performing cluster analysis using the R ecosystem for statistical computing. A substantial part of this chapter is devoted to the illustration of applying different clustering procedures to a reference data set of shopping basket data. We briefly outline the general approach of the considered techniques, provide a walk-through for the corresponding R code required to perform the analyses, and offer some interpretation of the results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

References

  • Adams, R. A., & Fournier, J. J. (2003). Sobolev spaces (Pure and applied mathematics) (Vol. 140). Amsterdam: Elsevier.

    Google Scholar 

  • Aldenderfer, M. S., & Blashfield, R. K. (1984). Cluster analysis. Beverly Hills: Sage.

    Book  Google Scholar 

  • Anderberg, M. R. (1973). Cluster analysis for applications. New York: Academic.

    Google Scholar 

  • Arabie, P., & Lawrence, J. H. (1994). Cluster analysis in marketing research. In R. P. Bagozzi (Ed.), Advanced methods of marketing research (pp. 160–189). Cambridge, MA: Blackwell.

    Google Scholar 

  • Arabie, P., & Lawrence, J. H. (1996). An overview of combinatorial data analysis. Clustering and classification (pp. 5–63). Singapore: World Scientific.

    Google Scholar 

  • Arabie, P., Carroll, J. D., DeSarbo, W., & Wind, J. (1981). Overlapping clustering: A new method for product positioning. Journal of Marketing Research, 28(3), 310–317.

    Article  Google Scholar 

  • Bock, H. H. (1974). Automatische Klassifikation. Göttingen: Vandenhoeck & Ruprecht.

    Google Scholar 

  • Boztuğ, Y., & Reutterer, T. (2008). A combined approach for segment-specific market basket analysis. European Journal of Operational Research, 187(1), 294–312.

    Article  Google Scholar 

  • Breugelmans, E., Boztuğ, Y., & Reutterer, T. (2010). A multistep approach to derive targeted category promotions. Working paper series of the Marketing Science Institute, MSI report no. 10-118, Cambridge, MA.

    Google Scholar 

  • Büschken, J., & Allenby, G. M. (2016). Sentence-based text analysis for customer reviews. Marketing Science, 35(6), 953–975.

    Article  Google Scholar 

  • Cattell, R. B. (1943). The description of personality: Basic traits resolved into clusters. Journal of Abnormal and Social Psychology, 38(4), 476–506.

    Article  Google Scholar 

  • Chapman, C., & McDonnell Feit, E. (2019). Segmentation: Clustering and classification. R for marketing research and analytics (pp. 299–338). New York: Springer.

    Google Scholar 

  • Davies, D. L., & Bouldin, D. W. (1979). A cluster separation measure. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1(2), 224–227.

    Article  Google Scholar 

  • Decker, R. (2005). Market basket analysis by means of a growing neural network. The International Review of Retail, Distribution and Consumer Research, 15(2), 151–169.

    Article  Google Scholar 

  • DeSarbo, W. S., Ajay, K. M., & Lalita, A. M. (1993). Non-spatial tree models for the assessment of competitive market structure: An integrated review of the marketing and psychometric literature. In J. Eliashberg & G. L. Lilien (Eds.), Handbooks in operations research and management science (Vol. 5, pp. 193–257). Amsterdam: Elsevier.

    Google Scholar 

  • Dimitriadou, E., Dolničar, S., & Weingessel, A. (2002). An examination of indexes for determining the number of clusters in binary data sets. Psychometrika, 67(1), 137–159.

    Article  Google Scholar 

  • Dolnicar, S., & Leisch, F. (2003). Winter tourist segments in Austria: Identifying stable vacation styles using bagged clustering techniques. Journal of Travel Research, 41(3), 281–292.

    Article  Google Scholar 

  • Dolnicar, S., Grün, B., Leisch, F., & Schmidt, K. (2014). Required sample sizes for data-driven market segmentation analyses in tourism. Journal of Travel Research, 53(3), 296–306.

    Article  Google Scholar 

  • Dolnicar, S., Grün, B., & Leisch, F. (2018). Market segmentation analysis. Understanding it, doing it, and making it useful. Singapore: Springer.

    Book  Google Scholar 

  • Dréze, X., & Hoch, S. J. (1998). Exploiting the installed base using cross-merchandising and category destination programs. International Journal of Research in Marketing, 15(5), 459–471.

    Article  Google Scholar 

  • Everitt, B. S., Landau, S., Leese, M., & Stahl, D. (2011). Cluster analysis: Wiley series in probability and statistics. New York: Wiley.

    Book  Google Scholar 

  • Farris, J. S. (1969). On the cophenetic correlation coefficient. Systematic Zoology, 18(3), 279–285.

    Article  Google Scholar 

  • Fraley, C., & Raftery, A. E. (2002). Model-based clustering, discriminant analysis, and density estimation. Journal of the American Statistical Association, 97(458), 611–631.

    Article  Google Scholar 

  • Fraley, C., & Raftery, A. E. (2003). Enhanced model-based clustering, density estimation, and discriminant analysis software: MCLUST. Journal of Classification, 20(2), 263–286.

    Article  Google Scholar 

  • Frühwirth-Schnatter, S. (2006). Finite mixture and Markov switching models. New York: Springer Science & Business Media.

    Google Scholar 

  • Ghesmoune, M., Lebbah, M., & Azzag, H. (2016). State-of-the-art on clustering data streams. Big Data Analytics, 1(13), 1–27.

    Google Scholar 

  • Grover, R., & Srinivasan, V. (1987). A simultaneous approach to market segmentation and market structuring. Journal of Marketing Research, 24, 139–153.

    Article  Google Scholar 

  • Hartigan, J. A. (1975). Clustering algorithms. New York: Wiley.

    Google Scholar 

  • Hartigan, J. A., & Wong, M. A. (1979). Algorithm AS 136: A k-means clustering algorithm. Journal of the Royal Statistical Society: Series C: Applied Statistics, 28(1), 100–108.

    Google Scholar 

  • Hastie, T., Tibshirani, R., & Friedman, J. (2009). Unsupervised learning. In The elements of statistical learning (pp. 485–585). New York: Springer.

    Chapter  Google Scholar 

  • Hennig, C., Meila, M., Murtagh, F., & Rocci, R. (2015). Handbook of cluster analysis. Boca Raton/London/New York: CRC Press.

    Book  Google Scholar 

  • Hornik, K. (2004). Cluster ensembles. In C. Weihs, W. Gaul (Eds.), Classification – The ubiquitous challenge. Proceedings of the 28th annual conference of the Gesellschaft für Klassifikation E.V (pp. 65–72). Heidelberg: University of Dortmund/Springer.

    Google Scholar 

  • Hornik, K. (2005). A clue for cluster ensembles. Journal of Statistical Software, 14(12), 1–25.

    Article  Google Scholar 

  • Hruschka, H. (1986). Market definition and segmentation using fuzzy clustering methods. International Journal of Research in Marketing, 3(2), 117–134.

    Article  Google Scholar 

  • Hruschka, H., & Natter, M. (1986). Comparing performance of feedforward neural nets and K-means for cluster-based market segmentation. European Journal of Operational Research, 114(2), 346–353.

    Article  Google Scholar 

  • Jain, A. K., & Dubes, R. C. (1988). Algorithms for clustering data. Upper Saddle River: Prentice-Hall.

    Google Scholar 

  • Kaufman, L., & Rousseeuw, P. J. (1990). Finding groups in data: An introduction to cluster analysis. Hoboken: Wiley.

    Book  Google Scholar 

  • Leisch, F. (2006). A toolbox for k-centroids cluster analysis. Computational Statistics & Data Analysis, 51(2), 526–544.

    Article  Google Scholar 

  • MacQueen, J. (1967). Some methods for classification and analysis of multivariate observations. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 1(14), 281–297.

    Google Scholar 

  • Manchanda, P., Ansari, A., & Gupta, S. (1999). The “shopping basket”: A model for multicategory purchase incidence decisions. Marketing Science, 18(2), 95–114.

    Article  Google Scholar 

  • Mazanec, J. A. (1999). Simultaneous positioning and segmentation analysis with topologically ordered feature maps: A tour operator example. Journal of Retailing and Customer Services, 6(4), 219–235.

    Article  Google Scholar 

  • Mazanec, J. A., & Strasser, H. (2000). A nonparametric approach to perceptions-based market segmentation: Foundations (Vol. 1). Wien: Springer.

    Book  Google Scholar 

  • McLachlan, G. J., & Basford, K. E. (1988). Mixture models: Inference and applications to clustering. New York: Marcel Dekker.

    Google Scholar 

  • Mild, A., & Reutterer, T. (2003). An improved collaborative filtering approach for predicting cross-category purchases based on binary market basket data. Journal of Retailing and Consumer Services, 10(3), 123–133.

    Article  Google Scholar 

  • Milligan, G. W., & Cooper, M. C. (1985). An examination of procedures for determining the number of clusters in a data set. Psychometrika, 50(2), 159–179.

    Article  Google Scholar 

  • Mooi, E., Sarstedt, M., & Mooi-Reci, I. (2018). Data. In Market research (pp. 27–50). Singapore: Springer.

    Chapter  Google Scholar 

  • Netzer, O., Feldman, R., Goldenberg, J., & Fresko, M. (2012). Mine your own business: Market-structure surveillance through text mining. Marketing Science, 31(3), 521–543.

    Article  Google Scholar 

  • Ng, R. T., & Han, J. (2002). CLARANS: A method for clustering objects for spatial data mining. IEEE Transactions on Knowledge and Data Engineering, 14(5), 1003–1016.

    Article  Google Scholar 

  • Punj, G., & Stewart, D. W. (1983). Cluster analysis in marketing research: Review and suggestions for application. Journal of Marketing Research, 20(2), 134–148.

    Article  Google Scholar 

  • R Core Team. (2019). R: A language and environment for statistical computing. R Foundation for Statistical Computing. Vienna: R Development Core Team.

    Google Scholar 

  • Rand, W. M. (1971). Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association, 66, 846850.

    Article  Google Scholar 

  • Rao, V. R., & Sabavala, D. J. (1981). Inference of hierarchical choice processes from panel data. Journal of Consumer Research, 8(1), 85–96.

    Article  Google Scholar 

  • Reutterer, T. (1998). Competitive market structure and segmentation analysis with self-organizing feature maps. In P. Anderson (Ed.), Proceedings of the 27th EMAC conference. Track 5: Marketing research (pp. 85–105). Stockholm: EMAC.

    Google Scholar 

  • Reutterer, T. (2003). Bestandsaufnahme und aktuelle Entwicklungen bei der Segmentierungsanalyse von Produktmarkten. Journal für Betriebswirtschaft, 53(2), 52–74.

    Google Scholar 

  • Reutterer, T., & Natter, M. (2000). Segmentation-based competitive analysis with MULTICLUS and topology representing networks. Computers & Operations Research, 27(11–12), 1227–1247.

    Article  Google Scholar 

  • Reutterer, T., Mild, A., Natter, M., & Taudes, A. (2006). A dynamic segmentation approach for targeting and customizing direct marketing campaigns. Journal of Interactive Marketing, 20(3–4), 43–57.

    Article  Google Scholar 

  • Reutterer, T., Hahsler, M., & Hornik, K. (2007). Data mining und marketing am beispiel der explorativen warenkorbanalyse. Marketing ZFP, 29(3), 163–180.

    Article  Google Scholar 

  • Reutterer, T., Hornik, K., March, N., & Gruber, K. (2017). A data mining framework for targeted category promotions. Journal of Business Economics, 87(3), 337–358.

    Article  Google Scholar 

  • Rousseeuw, P. J. (1987). Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, 20, 53–65.

    Article  Google Scholar 

  • Russell, G. J., & Petersen, A. (2000). Analysis of cross category dependence in market basket selection. Journal of Retailing, 76(3), 367–392.

    Article  Google Scholar 

  • Russell, G. J., Ratneshwar, S., Schocker, A. D., Bell, D., Bodapat, A., Degeratu, A., Hildebrandt, L., Kim, N., Ramaswami, S., & Shankar, V. H. (1999). Multiple-category decision-making: Review and synthesis. Marketing Letters, 10(3), 319–332.

    Article  Google Scholar 

  • Saraçli, S., Doğan, N., & Doğan, I. (2013). Comparison of hierarchical cluster analysis methods by cophenetic correlation. Journal of Inequalities and Applications, 2013(1), 203.

    Article  Google Scholar 

  • Sneath, P. H. (1957). Some thoughts on bacterial classification. Journal of General Microbiology, 17, 184–200.

    Article  Google Scholar 

  • Sokal, R. R., & Sneath, P. H. A. (1963). Principles of numerical taxonomy (A series of books in biology). San Francisco: W.H. Freeman.

    Google Scholar 

  • Späth, H. (1977). Cluster-analyse – Algorithmen zur Objektklassifizierung und Datenreduktion (2nd ed.). München/Wien: Oldenbourg Wissenschaftsverlag.

    Google Scholar 

  • Srivastava, R. K., Leone, R. P., & Shocker, A. D. (1981). Market structure analysis: Hierarchical clustering of products based on substitution-in-use. Journal of Marketing, 45(3), 38–48.

    Article  Google Scholar 

  • Srivastava, R. K., Alpert, M. I., & Shocker, A. D. (1984). A customer-oriented approach for determining market structures. Journal of Marketing, 48(2), 32–45.

    Article  Google Scholar 

  • Strasser, H. (2000). Reduction of complexity. In J. Mazanec & H. Strasser (Eds.), A nonparametric approach to perceptions-based market segmentation: Foundations (pp. 99–140). Wien/New York: Springer.

    Google Scholar 

  • Strehl, A., & Ghosh, J. (2003). Relationship-based clustering and visualization for high-dimensional data mining. INFORMS Journal on Computing, 15(2), 208–230.

    Article  Google Scholar 

  • Struyf, A., Hubert, M., & Rousseeuw, P. (1996). Clustering in an object-oriented environment. Journal of Statistical Software, 1(4), 1.

    Article  Google Scholar 

  • Tirunillai, S., & Tellis, G. J. (2014). Mining marketing meaning from online chatter: Strategic brand analysis of big data using Latent Dirichlet allocation. Journal of Marketing Research, 51(4), 463–479.

    Article  Google Scholar 

  • Titterington, D. M., Smith, A. F. M., & Makov, U. E. (1985). Statistical analysis of finite mixture distributions. Chichester: Wiley.

    Google Scholar 

  • Ward, J. H. (1963). Hierarchical grouping to optimize an objective function. Journal of the American Statistical Association, 58, 236–244.

    Article  Google Scholar 

  • Wedel, M., & Kamakura, W. A. (2000). Market segmentation – Conceptual and methodological foundations. New York: Springer.

    Book  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Thomas Reutterer .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

Reutterer, T., Dan, D. (2020). Cluster Analysis in Marketing Research. In: Homburg, C., Klarmann, M., Vomberg, A. (eds) Handbook of Market Research. Springer, Cham. https://doi.org/10.1007/978-3-319-05542-8_11-2

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-05542-8_11-2

  • Received:

  • Accepted:

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-05542-8

  • Online ISBN: 978-3-319-05542-8

  • eBook Packages: Springer Reference Business and ManagementReference Module Humanities and Social SciencesReference Module Business, Economics and Social Sciences

Publish with us

Policies and ethics

Chapter history

  1. Latest

    Cluster Analysis in Marketing Research
    Published:
    26 March 2020

    DOI: https://doi.org/10.1007/978-3-319-05542-8_11-2

  2. Original

    Cluster Analysis in Marketing Research
    Published:
    29 March 2019

    DOI: https://doi.org/10.1007/978-3-319-05542-8_11-1