An Intergrated Data Mining and Survival Analysis Model for Customer Segmentation

  • Guozheng Zhang
  • Yun Chen
Part of the IFIP — The International Federation for Information Processing book series (IFIPAICT, volume 251)


More and more literatures have researched the application of data mining technology in customer segmentation, and achieved sound effects. One of the key purposes of customer segmentation is customer retention. But the application of single data mining technology mentioned in previous literatures is unable to identify customer churn trend for adopting different actions on customer retention. This paper focus on constructs a integrated data mining and survival analysis model to segment customers into heterogeneous group by their survival probability (churn trend) and help enterprises adopting appropriate actions to retain profitable customers according to each segment’s churn trend. This model contains two components. Firstly, using data mining clustering arithmetic cluster customers into heterogeneous clusters according to their survival characters. Secondly, using survival analysis predicting each cluster’s survival/hazard function to identify their churn trend and test the validity of clustering for getting the correct customer segmentation. This model proposed by this paper was applied in a dataset from one biggest china telecommunications company. This paper also suggests some propositions for further research.


Data Mining Customer Relationship Management Customer Segment Customer Retention Heterogeneous Cluster 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 26.
    1.A. Nairn, and P. Bottomley (2003), Cluster analysis procedures in the CRM era, International Journal of Market Research, Vol. 45 Quarter 2Google Scholar
  2. 27.
    2.A. Banasiewicz, Acquiring high value, retainable customers, Database Marketing & Customer Strategy Management, 2004, Vol. 12,1, 21–31CrossRefGoogle Scholar
  3. 28.
    3.E. H. Sub, K.C. Noh, C.K. Suh (1999), Customer list segmentation using the combined response model, Expert Systems with Applications, 17(2): 89–97CrossRefGoogle Scholar
  4. 29.
    4.C. Fraley, and A.E. Raftery(2002), Model-Based Clustering, Discriminant Analysis, and Density Estimation. Journal of the American Statistical Association 97:611–631.MATHCrossRefMathSciNetGoogle Scholar
  5. 30.
    5.G. Sunil, D.R. Lehmann, and A.S. Jennifer (2004): Valuing Customer Journal of Marketing Research, 41 (February), 7–18.Google Scholar
  6. 31.
    6.JR. DW. Hosmer, and S. Lemeshow(1999), Applied Survival Analysis: Regression Modeling of Time to Event Data, New York: John Wiley & Sons,.MATHGoogle Scholar
  7. 32.
    7.K. Jaesoo et al.(2003): Segmenting the market of West Australian senior tourists using an artificial neural network, Tourism Management, 24(1):25–34CrossRefGoogle Scholar
  8. 33.
    8.J. Kleinberg, C. Papadimitriou, P. Raghavan (2004): Segmentation Problems, Journal of the ACM, Vol. 51, No. 2, March, pp. 263–280.MathSciNetGoogle Scholar
  9. 34.
    9.L.D. Chen, K.S. Soliman, E. Mao, M. N. Frolick (2000): Measuring user satisfaction with data warehouse: an exploratory study, Information & Management, 37(3): 103–110.CrossRefGoogle Scholar
  10. 35.
    10.P.N. Tan, M. Steinbach, V. Kumar (2006). Introduction to Data Mining, Posts & Telecom Press, pp310–320, Beijing.Google Scholar
  11. 36.
    11.P.V. Freytag, et al(2001), Business to Business Market Segmentation Industrial Marketing Management, 30(6):4 73486Google Scholar
  12. 37.
    12.Y. Chen, G.Z. Zhang, D.F Hu and S.S. Wang (2006), Customer Segmentation in Customer Relationship Management Based on Data Mining, Knowledge Enterprise: Intelligent Strategies in Product Design, Manufacturing, and Management, Volume 207, pp.288–293CrossRefGoogle Scholar
  13. 38.
    13.P.C. Verhoef, P.N. Spring, J.C. Hoekstra, The commercial use of segmentation and predictive modeling techniques for database marketing in the Netherlands, Decision Support Systems, 2003, 34(4):471–481CrossRefGoogle Scholar
  14. 39.
    14.M. Wedel, W.A. Kamakura and U. Bockenholt (2000): Marketing data, models and decisions, International Journal of Research in Marketing 17(2–3) 203–208.CrossRefGoogle Scholar
  15. 40.
    15.W. Reinartz, M. Krafft, and W.D. Hoyer(2004), The Customer Relationship Management Process: Its Measurement and Impact on Performance, Journal of Marketing Research, 293 Vol. XLI (August), 293–305)CrossRefGoogle Scholar

Copyright information

© International Federation for Information Processing 2007

Authors and Affiliations

  • Guozheng Zhang
    • 1
  • Yun Chen
    • 2
  1. 1.College of BusinessHouzhou Dianzi UniversityP.R.China
  2. 2.School of Public Economy AdministrationShanghai University of finance & economicsP.R.China

Personalised recommendations