Abstract
A latent-class mixture model is proposed for the unsupervised classification of incomplete multivariate data with mixed linear and circular components. The model allows for nonignorable missing values and integrates circular and normal densities to capture the association between toroidal clusters of circular observations and elliptical clusters of linear observations. Maximum likelihood estimation of the model is facilitated by an EM algorithm that treats unknown class membership and missing values as different sources of incomplete information. The model is exploited on incomplete time series of wind speed and direction and wave height and direction to identify a number of sea regimes.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Albert, P., Follmann, D.: Random effects and latent processes approaches for analyzing binary longitudinal data with missingness: a comparison of approaches using opiate clinical trial data. Stat. Methods Med. Res. 16(5), 417–439 (2007). DOI 10.1177/0962280206075308. URL http://smm.sagepub.com/content/16/5/417.abstract
Banfield, J., Raftery, A.: Model-based Gaussian and non-Gaussian clustering. Biometrics 49(3), 803–821 (1993). URL http://www.jstor.org/stable/2532201
Bertotti, L., Cavalieri, L.: Wind and wave predictions in the Adriatic sea. J. Marine Syst. 78, S227–S234 (2009)
Biernacki, C., Celeux, G., Govaert, G.: Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models. Comput. Stat. Data Anal. 41(3–4), 561–575 (2003). DOI 10.1016/S0167-9473(02)00163-9. URL http://www.sciencedirect.com/science/article/pii/S0167947302001639
Gilks, W.R., Wild, P.: Adaptive rejection sampling for Gibbs sampling. Appl. Stat. 41, 337–348 (1992)
Hagenaars, J., McCutcheon, A.: Applied Latent Class Analysis. Cambridge University Press, Cambridge (2002)
Hunt Land Jorgensen, M.: Mixture model clustering for mixed data with missing information. Comput. Stat. Data Anal. 41(3–4), 429–440 (2003). DOI DOI: 10.1016/S0167-9473(02)00190-1. URL http://www.sciencedirect.com/science/article/B6V8V-472JRC1-13/2/1564c6358976c518c96c320c38dd052e
Ibrahim, J.G., Lipsitz, S.R.: Missing covariates in generalized linear models when the missing data mechanism is non-ignorable. J. Roy. Stat. Soc. B 61, 173–190 (1999)
Ingrassia, S., Rocci, R.: Degeneracy of the EM algorithm for the MLE of multivariate Gaussian mixtures and dynamic constraints. Comput. Stat. Data Anal. 55, 1715–1725 (2011)
Lagona, F., Picone, M.: A latent-class model for clustering incomplete linear and circular data in marine studies. J. Data Sci. 9, 585–605 (2011)
Lagona, F., Picone, M.: Maximum likelihood estimation of bivariate circular hidden Markov models from incomplete data. J. Stat. Comput. Simul. 1–15 (2012) URL http://www.tandfonline.com/doi/abs/10.1080/00949655.2012.656642. DOI 10.1080/00949655.2012. 656642
Lagona, F., Picone, M.: Model-based clustering of multivariate skew data with circular components and missing values. J. Appl. Stat. 39, 927–945 (2012). DOI 10.1080/ 02664763.2011.626850. URL http://www.tandfonline.com/doi/abs/10.1080/02664763.2011.626850
Little, R.: Modeling the drop-out mechanism in repeated-measures studies. J. Am. Stat. Assoc. 90, 1112–1121 (1995)
Mardia, K., Taylor, C., Subramaniam, G.: Protein bioinformatics and mixtures of bivariate von Mises distributions for angular data. Biometrics 63, 505–512 (2007)
Mardia, K.V., Hughes, G., Taylor, C.C., Singh, H.: A multivariate von Mises distribution with applications to bioinformatics. Can. J. Stat. 36(1), 99–109 (2008). URL http://dx.doi.org/10.1002/cjs.5550360110
McLachlan, G., Peel, D.: Finite Mixture Models. Wiley, New York (2000)
Rubin, D.: Multiple Imputation for Nonresponse in Surveys. Wiley, New York (1987)
Shafer, J.: Analysis of incomplete multivariate data. Chapman and Hall, Boca Raton (1997)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Italia
About this chapter
Cite this chapter
Lagona, F., Picone, M. (2013). Classification of Multivariate Linear-Circular Data with Nonignorable Missing Values. In: Grigoletto, M., Lisi, F., Petrone, S. (eds) Complex Models and Computational Methods in Statistics. Contributions to Statistics. Springer, Milano. https://doi.org/10.1007/978-88-470-2871-5_13
Download citation
DOI: https://doi.org/10.1007/978-88-470-2871-5_13
Published:
Publisher Name: Springer, Milano
Print ISBN: 978-88-470-2870-8
Online ISBN: 978-88-470-2871-5
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)