This paper studies the problem of estimating the number of clusters in the context of logistic regression clustering. The classi.cation likelihood approach is employed to tackle this problem. An information theoretic criterion for selecting the number of logistic curves is proposed in the sequel and then its asymptotic property is considered.
The paper is arranged as follows: In Section 2, some notations are given and an information theoretic criterion is proposed for estimating the number of clusters. In Section 3, the small sample performance of the proposed criterion is studied by Monte Carlo simulation. In Section 4, the asymptotic property of the criterion proposed in Section 2 is investigated. Some lemmas needed in Section 4 are given in the appendix.
Keywords
- Logistic Regression
- Binomial Distribution
- Maximum Likelihood Estimator
- Asymptotic Property
- Linear Predictor
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Chung KL (2001) A Course in Probability Theory (3rd edition). Academic Press
Farewell BT, Sprott D (1988) The use of a mixture model in the analysis of count data. Biometrics 44:1191-1194
Follmann DA, Lambert D (1989) Generalizing logistic regression by nonparametric mixing. Journal of the American Statistical Association 84:295-300
Follmann DA, Lambert D (1991) Identifiability for nonparametric mixtures of logistic regressions. Journal of Statistical Planning and Inference 27:375-381
McCullagh P, Nelder JA (1989) Generalized Linear Models (2nd edition). Chapman and Hall
Qian G, Field C (2002) Law of iterated logarithm and consistent model selection criterion in logistic regression. Statistics & Probability Letters 56:101-112
Shao Q, Wu Y (2005) A consistent procedure for determining the number of clusters in regression clustering. Journal of Statistical Planning and Inference 135:461-476
Wedel M, DeSarbo WS (1995) A mixture likelihood approach for generalized linear models. Journal of Classification 12:21-55
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2008 Physica-Verlag Heidelberg
About this chapter
Cite this chapter
Qian, G., Rao, C.R., Wu, Y., Shao, Q. (2008). Estimating the Number of Clusters in Logistic Regression Clustering by an Information Theoretic Criterion. In: Recent Advances in Linear Models and Related Areas. Physica-Verlag HD. https://doi.org/10.1007/978-3-7908-2064-5_2
Download citation
DOI: https://doi.org/10.1007/978-3-7908-2064-5_2
Publisher Name: Physica-Verlag HD
Print ISBN: 978-3-7908-2063-8
Online ISBN: 978-3-7908-2064-5
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)