The Number of Clusters in Market Segmentation

Wagner, Ralf; Scholz, Sören W.; Decker, Reinhold

doi:10.1007/3-540-28397-8_19

Ralf Wagner²²,
Sören W. Scholz²² &
Reinhold Decker²²

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization ((STUDIES CLASS))

2372 Accesses
9 Citations

Abstract

Learning the ‘true’ number of clusters in a given data set is a fundamental and largely unsolved problem in data analysis, which seriously affects the identification of customer segments in marketing research.

In this paper, we discuss the properties of relevant criteria commonly used to estimate the number of clusters. Moreover, we outline two adaptive clustering algorithms, a growing k-means algorithm and a growing self-organizing neural network. In the empirical part of the paper, we find that the first algorithm stops growing with exactly the number of clusters that we get when determining the optimal number of clusters by means of the JUMP-criterion. This cluster solution proves to be rather similar to the one we obtain by applying the neural network approach. To evaluate the clusters, we use association rules. By testing these rules, we show the differences of patterns underlying particular market segments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

BAIER, D., GAUL, W. and SCHADER, M. (1997): Two-Mode Overlapping Clustering With Applications to Simultaneous Beneflt Segmentation and Market Structuring, in: Klar, R. and Opitz, O. (Eds.), Classification and Knowledge Organization. Springer, Heidelberg, 557–566.
Google Scholar
BLACKWELL, R.D., MINIARD, P.W., and ENGEL, J.F. (2001): Consumer Behavior, Harcourt, Fort Worth.
Google Scholar
BOCK, H.H. (1985): On Some Significance Tests in Cluster Analysis. Journal of Classification, 2,1, 77–108.
Article MATH MathSciNet Google Scholar
BOCK, H.H. (1996): Probability Models in Partitional Cluster Analysis. Computional Statistics and Data Analysis, 23,5, 5–28.
Article MATH Google Scholar
BOONE, D.S. and ROEHM, M. (2002): Evaluating the Appropriateness of Market Segmentation Solutions Using Artificial Neural Networks and the Membership Clustering Criterion. Marketing Letters, 13,4, 317–333.
Article Google Scholar
BRASSINGTON, F. and PETTITT, S. (2005): Essentials of Marketing. Prentice Hall, Harlow.
Google Scholar
BRA̅ZMA, A., JONASSEN, I., EIDHAMMER, I., and GILBERT, D. (1998): Approaches to the Automatic Discovery of Patterns in Biosequences. Journal of Computional Biology, 5,2, 277–304.
Google Scholar
BRIN, S., MOTWANI, R., ULLMAN, J.D., and TSUR, S. (1997): Dynamic Itemset Counting and Implication Rules for Market Basket Data. In: J. Peckham (Ed.): Proceedings ACM SIGMOD International Conference on Management of Data. ACM Press, New York, 255–264.
Google Scholar
CALINSKI, R. and HARABASZ, J. (1974): A Dendrite Method for Cluster Analysis. Communications in Statistics (Series A), 3,1, 1–27.
MathSciNet Google Scholar
DECKER, R. (2005): Market Basket Analysis by Means of a Growing Neural Network, The International Review of Retail, Distribution and Consumer Research, forthcoming.
Google Scholar
DIBB, S. and SIMKIN, L. (1994): Implementation Problems in Industrial Market Segmentation. Industrial Marketing Management, 23,1, 55–63.
Article Google Scholar
DIBB, S. and STERN, P. (1995): Questioning the Reliability of Market Segmentation Techniques. Omega — International Journal of Management, 3,6, 625–636.
Article Google Scholar
DUDOIT, S. and FRIDLYAND, J. (2002): A Prediction-Based Resampling Method of Estimating the Number of Clusters in a Dataset, Genome Biology, 3,7, 1–21.
Article Google Scholar
FENNELL, G., ALLENBY, G.M., YANG, S., and EDWARDS, Y. (2003): The Effectiveness of Demographic and Psychographic Variables for Explaining Brand and Product Category Use. Quantitative Marketing and Economics, 1,2, 223–244.
Article Google Scholar
GAUL, W. and L. SCHMIDT-THIEME (2002): Recommender Systems Based on User Navigational Behavior in the Internet, Behaviormetrika, 29,1, 1–22.
MathSciNet MATH Google Scholar
GREEN, RE. and KRIEGER, A.M. (1995): Alternative Approaches to Cluster-Based Market Segmentation. Journal of the Market Research Society, 3, 221–239.
Google Scholar
GRANZIN, K.L, OLSEN, J.E., and PAINTER, J.J. (1998): Marketing to Consumer Segments Using Health-Promoting Lifestyles. Journal of Retailing and Consumer Services, 5,3, 131–141.
Article Google Scholar
HAIR, J.F., ANDERSON, R.E., TATHAM, R.L., and BLACK, W.C. (1998): Multivariate Data Analysis. 5^th ed., Prentice Hall, Upper Saddle River.
Google Scholar
HAMERLY, G. and ELKAN, C. (2003): Learning the k in k-means. Advances in Neural Information Processing Systems, 17, http://www.citeseer.ist.psu.edu/hamerly031earning.html.
Google Scholar
HARTIGAN, J.A. (1985): Statistical Theory in Clustering. Journal of Classification, 2,1, 63–76.
Article MATH MathSciNet Google Scholar
HILDERMAN, R.J. and HAMILTON H.J. (2001): Evaluation of Interestingness Measures for Ranking Discovered Knowledge. In: D. Cheung, G.J. Williams, and Q. Li (Eds.): Advances in Knowledge Discovery and Data Mining. Springer, Berlin, 247–259.
Google Scholar
KRZANOWSKI, W. and LAI, Y. (1988): A Criterion for Determining the Number of Clusters in a Dataset Using Sum of Squares Clustering. Biometrics, 44,1, 23–34.
MathSciNet MATH Google Scholar
LIU, B., MA, Y., and LEE, R. (2001): Analyzing the Interestingness of Association Rules From the Temporal Dimension. IEEE International Conference on Data Mining (ICDM-2001), http://www.cs.uic.edu/liub/publications/ICDM-2001.ps.
Google Scholar
LU, C.S. (2003): Market Segment Evaluation and International Distribution Centers. Transportation Research Part E: Logistics and Transportation Review, 391, 49–60.
Article Google Scholar
MARDIA, K.V. (1974): Applications of Some Measures of Multivariate Skewness and Kurtosis for Testing Normality and Robustness Studies. Sankhya, 36, 115–128.
MATH MathSciNet Google Scholar
MARDIA, K.V., KENT, J.T., and BIBBY, J.M. (1979): Multivariate Analysis. Academic Press, London.
MATH Google Scholar
MECKLIN, C.J. and MUNDFROM, D.J. (2004): An Appraisal and Bibliography of Tests for Multivariate Normality. International Statistical Review, 72,1, 123–138.
Article MATH Google Scholar
MILLIGAN, G.W. and COOPER, M.C. (1985): An Examination of Procedures for Determining the Number of Clusters in a Data Set. Psychometrika, 50, 159–179.
Article Google Scholar
PALMER, R.A. and Millier, P. (2004): Segmentation: Identification, Intuition, and Implementation. Industrial Marketing Management, 33,8, 779–785.
Article Google Scholar
PAPASTEFANOU, G., SCHMIDT, P., BRSCH-SUPAN, A. and LDTKE, H. and OLTERSDORF, U. (1999): Social and Economic Research with Consumer Panel Data. GESIS, Mannheim.
Google Scholar
ROTH, V., LANGE, T., BRAUN, M., and BUHMANN, J. (2002): A Resampling Approach to Cluster Validation. in: W. Härdle and B. Rönz (Eds.): Proceedings in Computational Statistics. Physica, Heidelberg, 123–128.
Google Scholar
SUGAR, C.A. and JAMES, G.M. (2003): Finding the Number of Clusters in a Dataset: An Information-Theoretic Approach. Journal of the American Statistical Society, 98,463, 750–762.
MathSciNet MATH Google Scholar
TIBSHIRANI, R., WALTER, G., and HASTIE, T. (2001): Estimating the Number of Clusters in a Dataset via the Gap Statistic. Journal of the Royal Statistical Society (Series B), 63,3, 411–423.
Article MATH Google Scholar
WAGNER, R. (2005): Mining Promising Qualification Patterns. In: D. Baier and K.-D. Wernecke (Eds.): Innovations in Classification, Data Science, and Information Systems. Berlin, Springer, 249–256.
Google Scholar
WEDEL, M. and KAMAKURA, W.A. (2000): Market Segmentation: Conceptional and Methodological Foundations. 2^nd ed., Kluwer Academic Publishers, Dordrecht.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Business Administration and Economics, Bielefeld University, P.O. Box 100131, D-33615, Bielefeld, Germany
Ralf Wagner, Sören W. Scholz & Reinhold Decker

Authors

Ralf Wagner
View author publications
You can also search for this author in PubMed Google Scholar
Sören W. Scholz
View author publications
You can also search for this author in PubMed Google Scholar
Reinhold Decker
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Business Administration and Economics, Brandenburg University of Technology Cottbus, Konrad-Wachsmann-Allee 1, 03046, Cottbus, Germany
Daniel Baier (Chair of Marketing and Innovation Management) (Chair of Marketing and Innovation Management)
Department of Business Administration and Economics, Bielefeld University, Universitätsstr. 25, 33615, Bielefeld, Germany
Reinhold Decker (Chair of Marketing) (Chair of Marketing)
Computer Based New Media Group (CGNM), Institute for Computer Science, University of Freiburg, Georges-Köhler-Allee 51, 79110, Freiburg, Germany
Lars Schmidt-Thieme

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Wagner, R., Scholz, S.W., Decker, R. (2005). The Number of Clusters in Market Segmentation. In: Baier, D., Decker, R., Schmidt-Thieme, L. (eds) Data Analysis and Decision Support. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-28397-8_19

Download citation

DOI: https://doi.org/10.1007/3-540-28397-8_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26007-3
Online ISBN: 978-3-540-28397-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics