Abstract
The conditional independence assumption is commonly used in multivariate mixture models in behavioral research. We propose an exponential tilt model to analyze data from a multivariate mixture distribution with conditionally independent components. In this model, the log ratio of the density functions of the components is modeled as a quadratic function in the observations. There are a number of advantages in this approach. First, except for the exponential tilt assumption, the marginal distributions of the observations can be completely arbitrary. Second, unlike some previous methods, which require the multivariate data to be discrete, modeling can be performed based on the original data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Allman, E.S., Matias, C., Rhodes, J.A.: Identifiability of parameters in latent class models with many observed variables. Ann. Stat. 37, 3099–3132 (2009)
Anderson, J.A.: Multivariate logistic compounds. Biometrika 66, 17–26 (1979)
Benaglia, T., Chauveau, D., Hunter, D.R.: An EM-like algorithm for semi- and non-parametric estimation in multivariate mixtures. J. Comput. Graph. Stat. 18, 505–526 (2009)
Chang, G.T., Walther, G.: Clustering with mixtures of log-concave distributions. Comput. Stat. Data Anal. 51, 6242–6251 (2007)
Chen, X., Fan, Y., Tsyrennikov, V.: Efficient estimation of semiparametric multivariate copula models. J. Am. Stat. Assoc. 101, 1228–1240 (2006)
Cruz-Medina, I.R., Hettmansperger, T.P., Thomas, H.: Semiparametric mixture models and repeated measures: the multinomial cut point model. Appl. Stat. 53, 463–474 (2004)
Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. Ser. B 39, 1–38 (1977)
Dey, D., Kuo, L., Sahu, S.: A bayesian predictive approach to determining the number of components in a mixture distribution. Stat. Comput. 5, 297–305 (1995)
Efron, B., Tibshirani, R.: Using specially designed exponential families for density estimation. Ann. Stat. 24, 2431–2461 (1996)
Elmore, R.T., Hall, P., Neeman, A.: An application of classical invariant theory to identifiability in nonparametric mixtures. Ann. I. Fourier 55, 1–28 (2005)
Hall, P., Neeman, A., Pakyari, R., Elmore, R.T.: Nonparametric inference in multivariate mixtures. Biometrika 92, 667–678 (2005)
Hall, P., Zhou, X.H.: Nonparametric estimation of component distributions in a multivariate mixture. Ann. Stat. 31, 201–224 (2003)
Hettmansperger, T.P., Thomas, H.: Almost nonparametric inference for repeated measure in mixture models. J. R. Stat. Soc. Ser. B 62, 811–825 (2000)
Kay, R., Little, S.: Assessing the fit of the logistic model: A case study of children with the haemolytic uraemic syndrome. Appl. Stat. 35, 16–30 (1986)
Khalili, A., Potter, D., Yan, P., Li, L., Gray, J., Huang, T., Lin, S.: Gamma-normal-gamma mixture model for detecting differentially methylated loci in three breast cancer cell lines. Cancer Informat. 3, 43–54 (2007)
Leung, D., Qin, J.: Semi-parametric inference in a bivariate (multivariate) mixture model. Stat. Sin. 16, 153–163 (2006)
Levine, M., Hunter, D.R., Chauveau, D.: Maximum smoothed likelihood for multivariate mixtures. Biometrika 98 403–416 (2011)
Lindsay, B.G.: Mixture Models: Theory, Geometry, and Applications. Institute of Mathematical Statistics, Hayward, CA (1995)
Liu, Q., Pierce, D.A.: A note on gauss-hermite quadrature. Biometrika 81, 624–629 (1994)
McLachlan, G., Peel, D.: Finite Mixture Models. Wiley, New York (2000)
Miller, C.A., Kail, R., Leonard, L.B., Tomblin, J.B.: Speed of processing in children with specific language impairment. J. Speech Lang. Hear. Res. 44, 416–433 (2001)
Nagelkerke, N.J.D., Borgdorff, M.W., Kim, S.J.: Logistic discrimination of mixtures of m. tuberculosis and non-specific tuberculin reactions. Stat. Med. 20, 1113–1124 (2001)
Owen, A.: Empirical likelihood ratio confidence intervals for a single functional. Biometrika 75, 237–49 (1988)
Qin, J., Berwick, M., Ashbolt, R., Dwyer, T.: Quantifying the change of melanoma incidence by breslow thickness. Biometrics 58, 665–670 (2002)
Qin, J., Zhang, B.: A goodness-of-fit test for logistic regression models based on case-control data. Biometrika 84, 609–618 (1997)
Qu, Y.S., Hadgu, A.: A model for evaluating sensitivity and specificity for correlated diagnostic tests in efficacy studies with an imperfect reference test. J. Am. Stat. Assoc. 93, 920–928 (1998)
Schwarz, G.: Estimating the dimension of a model. Ann. Stat. 5, 461–464 (1978)
Thomas, H., Lohaus, A.: Modeling growth and individual differences in spatial tasks. Monogr. Soc. Res. Child Dev. 58, 1–191 (1993)
Titterington, D.M., Smith, A.F.M., Makov, U.E.: Statistical Analysis of Finite Mixture Distributions. Wiley, Chichester (1985)
Walther, G.: Detecting the presence of mixing with multiscale maximum likelihood. J. Am. Stat. Assoc. 97, 508–513 (2002)
White, I.R., Thompson, S.G.: Choice of test for comparing two groups, with particular application to skewed outcomes. Stat. Med.21, 1205–1215 (2003)
Wiper, M., Rios Insua, D., Ruggeri, F.: Mixtures of gamma distributions with applications. J. Comput. Graph. Stat. 10, 440–454 (2001)
Young, D.S., Benaglia, T., Chauveau, D., Elmore, R.T., Hettmansperger, T.P., Hunter, D.R., Thomas, H., Xuan, F.: mixtools: Tools for mixture models. R package version 0.3.2 (2008)
Zhang, B.: An information matrix test for logistic regression models based on case-control data. Biometrika 88, 921–932 (2001)
Acknowledgements
Tracey Wrobel Hammel and Thomas Hettmansperger were partially supported by NSF Grant SES-0518772. Denis Leung was supported by SMU Research Center.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Hammel, T.W., Hettmansperger, T.P., Leung, D.H.Y., Qin, J. (2015). Semiparametric Analysis in Conditionally Independent Multivariate Mixture Models. In: Nordhausen, K., Taskinen, S. (eds) Modern Nonparametric, Robust and Multivariate Methods. Springer, Cham. https://doi.org/10.1007/978-3-319-22404-6_21
Download citation
DOI: https://doi.org/10.1007/978-3-319-22404-6_21
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-22403-9
Online ISBN: 978-3-319-22404-6
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)