
Classification of Multivariate Linear-Circular Data with Nonignorable Missing Values


Part of the book series: Contributions to Statistics (CONTRIB.STAT.)

Abstract

A latent-class mixture model is proposed for the unsupervised classification of incomplete multivariate data with mixed linear and circular components. The model allows for nonignorable missing values and combines circular and normal densities to capture the association between toroidal clusters of circular observations and elliptical clusters of linear observations. Maximum likelihood estimation is carried out with an EM algorithm that treats unknown class membership and missing values as two distinct sources of incomplete information. The model is applied to incomplete time series of wind speed and direction and of wave height and direction to identify a number of sea regimes.
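The kind of estimation described in the abstract can be illustrated with a minimal Python sketch. It is not the chapter's model: it assumes a single linear variable `x` and a single circular variable `theta` (in radians) that are conditionally independent within each class, a univariate normal and a univariate von Mises density in place of the multivariate densities, and missingness indicators that are Bernoulli with class-specific probabilities. That last assumption is one simple way to obtain a nonignorable mechanism, because missingness then depends on the unobserved class. All function and parameter names are hypothetical, and the von Mises concentration is updated with a standard closed-form approximation rather than an exact maximizer.

```python
import numpy as np

def normal_pdf(x, mean, sd):
    return np.exp(-0.5 * ((x - mean) / sd) ** 2) / (sd * np.sqrt(2 * np.pi))

def von_mises_pdf(theta, mean, kappa):
    return np.exp(kappa * np.cos(theta - mean)) / (2 * np.pi * np.i0(kappa))

def e_step(x, theta, params):
    """Posterior class probabilities given the observed values and the
    missingness pattern (NaN marks a missing value)."""
    pi, mu, sd, m, kappa, px, pt = params
    rx, rt = np.isnan(x), np.isnan(theta)
    x0 = np.where(rx, 0.0, x)[:, None]        # placeholder where missing
    t0 = np.where(rt, 0.0, theta)[:, None]
    # Missing entry contributes P(missing | class); an observed entry
    # contributes P(observed | class) times the class density.
    fx = np.where(rx[:, None], px, (1 - px) * normal_pdf(x0, mu, sd))
    ft = np.where(rt[:, None], pt, (1 - pt) * von_mises_pdf(t0, m, kappa))
    joint = pi * fx * ft                      # shape (n, K)
    total = joint.sum(axis=1, keepdims=True)  # no log-sum-exp: sketch only
    return joint / total, np.log(total).sum()

def m_step(x, theta, w):
    """Weighted updates; each density uses only its observed values."""
    rx, rt = np.isnan(x), np.isnan(theta)
    pi = w.mean(axis=0)
    px = np.clip((w * rx[:, None]).sum(0) / w.sum(0), 1e-6, 1 - 1e-6)
    pt = np.clip((w * rt[:, None]).sum(0) / w.sum(0), 1e-6, 1 - 1e-6)
    wx = w * ~rx[:, None]
    x0 = np.where(rx, 0.0, x)[:, None]
    mu = (wx * x0).sum(0) / wx.sum(0)
    sd = np.sqrt((wx * (x0 - mu) ** 2).sum(0) / wx.sum(0) + 1e-8)
    wt = w * ~rt[:, None]
    t0 = np.where(rt, 0.0, theta)[:, None]
    c, s = (wt * np.cos(t0)).sum(0), (wt * np.sin(t0)).sum(0)
    m = np.arctan2(s, c)                      # weighted mean direction
    r = np.clip(np.hypot(c, s) / wt.sum(0), 1e-6, 1 - 1e-6)
    # Approximate inverse of I1(kappa)/I0(kappa) = r (assumption: not exact).
    kappa = np.clip(r * (2 - r ** 2) / (1 - r ** 2), 1e-3, 500.0)
    return pi, mu, sd, m, kappa, px, pt

def fit(x, theta, n_classes=2, n_iter=50):
    """Run a fixed number of EM iterations from a crude deterministic start."""
    k = n_classes
    miss_x = np.clip(np.isnan(x).mean(), 1e-6, 1 - 1e-6)
    miss_t = np.clip(np.isnan(theta).mean(), 1e-6, 1 - 1e-6)
    params = (np.full(k, 1.0 / k),
              np.nanquantile(x, (np.arange(k) + 0.5) / k),
              np.full(k, np.nanstd(x)),
              2 * np.pi * np.arange(k) / k,
              np.ones(k),
              np.full(k, miss_x),
              np.full(k, miss_t))
    for _ in range(n_iter):
        w, loglik = e_step(x, theta, params)
        params = m_step(x, theta, w)
    w, loglik = e_step(x, theta, params)
    return params, w, loglik
```

Calling `fit(x, theta, n_classes=2)` on two NumPy arrays of equal length, with `np.nan` for missing entries, returns the parameter tuple, the matrix of posterior class probabilities, and the observed-data log-likelihood. Because missingness here depends only on the class, the missing values integrate out and the E-step reduces to computing class posteriors; a model in which missingness also depends on the unobserved values themselves would need a more elaborate E-step.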



Author information


Correspondence to Francesco Lagona.


Copyright information

© 2013 Springer-Verlag Italia

About this chapter

Cite this chapter

Lagona, F., Picone, M. (2013). Classification of Multivariate Linear-Circular Data with Nonignorable Missing Values. In: Grigoletto, M., Lisi, F., Petrone, S. (eds) Complex Models and Computational Methods in Statistics. Contributions to Statistics. Springer, Milano. https://doi.org/10.1007/978-88-470-2871-5_13
