Skip to main content

Modeling Zero Inflation and Overdispersion in the Length of Hospital Stay for Patients with Ischaemic Heart Disease

  • Chapter
  • First Online:
Advanced Statistical Methods in Data Science

Part of the book series: ICSA Book Series in Statistics ((ICSABSS))

Abstract

Ischaemic heart disease is the top one cause of death in the world; however, quantifying its burden in a population is a challenge. Hospitalization data provide a proxy for measuring the severity of ischaemic heart disease. Length of stay (LOS) in hospital is often used as an indicator of hospital efficiency and a proxy of resource consumption, which may be characterized as zero-inflated if there is an over-abundance of zeros, or zero-deflated if there are fewer zeros than expected under a standard count model. Such data may also have a highly right-skewed distribution for the nonzero values. Hurdle models and zero inflated models were developed to accommodate both the excess zeros and skewness of the data with various configuration of spatial random effects, as well as allowing for analysis of nonlinear effect of seasonality and other fixed effect covariates. We draw attention to considerable drawbacks with regards to model misspecifications. Modeling and inference use the fully Bayesian approach via Markov Chain Monte Carlo (MCMC) simulation techniques. Our results indicate that both hurdle and zero inflated models accounting for clustering at the residential neighborhood level outperforms the models without counterpart models, and modeling the count component as a negative binomial distribution is significantly superior to ones with a Poisson distribution. Additionally, hurdle models provide a better fit compared to the counterpart zero-inflated models in our application.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Agarwal DK, Gelfand A, Citron-Pousty S (2002) Zero-inflated models with application to spatial count data. Environ Ecol Stat 9:341–355

    Article  MathSciNet  Google Scholar 

  • Akaike H (1973) Information theory as an extension of the maximum likelihood principle. In: Petrov BV, Csaki BF (eds) Second international symposium on information theory. Academiai Kiado, Budapest

    Google Scholar 

  • Besag J, York J, Mollie A (1991) Bayesian image restoration with two applications in spatial statistics. Ann Inst Stat Math 43:1–21

    Article  MathSciNet  MATH  Google Scholar 

  • Bhatia S (2010) Biomaterials for clinical applications. Springer, New York

    Book  Google Scholar 

  • Bull GM, Morton J (1978) Environment, temperature and death rates. Age Ageing 7:210–230

    Article  Google Scholar 

  • Buu A, Li R, Tan X, Zucker R (2012) Statistical models for longitudinal zero-inflated count data with applications to the substance abuse field. Stat Med 31:4074–4086

    Article  MathSciNet  Google Scholar 

  • Chassot P, Delabays A, Spahn DR (2002) Preoperative evaluation of patients with, or at risk of, coronary artery disease undergoing non-cardiac surgery. Br J Anaesth 89:747–759

    Article  Google Scholar 

  • Gelfand AE, Vounatsou P (2003) Proper multivariate conditional autoregressive models for spatial data analysis. Biostat 4:11–25

    Article  MATH  Google Scholar 

  • Go A, Mozaffarian D, Roger V, Benjamin E, Berry J, Borden W, Bravata D, Dai S, Ford E, Fox C (2013) Heart disease and stroke statistics – 2013 update: a report from the American heart association. Circulation 127:e6–e245

    Article  Google Scholar 

  • Heilbron DC (1994) Zero-altered and other regression models for count data with added zeros. Biom J 36:531–547

    Article  MATH  Google Scholar 

  • Lambert D (1992) Zero-inflated Poisson regression with an application to defects in manufacturing. Technometrics 34:1–14

    Article  MATH  Google Scholar 

  • Mardia KV (1988) Multi-dimensional multivariate gaussian Markov random fields with application to image processing. J Multivar Anal 24:265–284

    Article  MathSciNet  MATH  Google Scholar 

  • Mathers C, Fat D, Boerma J (2008) The global burden Of disease: 2004 update. World Health Organization, Geneva

    Book  Google Scholar 

  • Mathers C, Loncar D (2006) Projections of global mortality and burden of disease from 2002 to 2030. PLoS Med 3:2011–2030

    Article  Google Scholar 

  • Min Y, Agresti A (2005) Random effect models for repeated measures of zero-inflated count data. Stat Modell 5:1–19

    Article  MathSciNet  MATH  Google Scholar 

  • Mullahy J (1986) Specification and testing of some modified count data models. J Econom 33:341–365

    Article  MathSciNet  Google Scholar 

  • Murray CJ, Lopez AD (1997) Alternative projections of mortality and disability by cause 1990–2020: global burden of disease study. Lancet 349:1498–1504

    Article  Google Scholar 

  • Musal M, Aktekin T (2013) Bayesian spatial modeling of HIV mortality via zero-inflated Poisson models. Stat Med 32:267–281

    Article  MathSciNet  Google Scholar 

  • Mwalili S, Lesaffre E, Declerck D (2008) The zero-inflated negative binomial regression model with correction for misclassification: an example in caries research. Stat Methods Med Res 17:123–139

    Article  MathSciNet  MATH  Google Scholar 

  • Neelon B, Ghosh P, Loebs P (2013) A spatial Poisson hurdle model for exploring geographic variation in emergency department visits. J R Stat Soc Ser A 176:389–413

    Article  MathSciNet  Google Scholar 

  • Neelon B, O’Malley A, Normand S (2010) A Bayesian model for repeated measures zero-inflated count data with application to outpatient psychiatric service use. Stat Modell 10:421–439

    Article  MathSciNet  Google Scholar 

  • Rathbun S, Fei SL (2006) A spatial zero-inflated poisson regression model for oak regeneration. Envrion Ecol Stat 13:409–426

    Article  MathSciNet  Google Scholar 

  • Rogot E, Blackwelder WC (1979) Associations of cardiovascular mortality with weather in Memphis, Tennessee. Public Health Rep 85:25–39

    Article  Google Scholar 

  • Rose G (1966) Cold weather and ischaemic heart disease. Br J Prev Soc Med 20:97–100

    Google Scholar 

  • Spiegelhalter D, Thomas A, Best N, Lunn D (2005) Winbugs user manual, version 1.4. MRC Biostatistics Unit, Institute of Public Health and Department of Epidemiology & Public Health, Imperial College School of Medicine. http://www.mrc-bsu.cam.ac.uk/bugs

  • Spiegelhalter DJ, Best NG, Carlin BP, Van der Linde A (2002) Bayesian measures of model complexity and fit (with discussion). J R Stat Soc Ser B 64:583–640

    Google Scholar 

  • Mortality, summary list of causes 2008. Stat Canada

    Google Scholar 

  • Ver Hoef JM, Jansen J (2007) Space-time zero-inflated count models of harbor seals. Environ 18:697–712

    Google Scholar 

  • Ver Hoef JM, Jansen JK (2007) Spacetime zero-inflated count models of Harbor seals. Envrion 18:697–712

    Google Scholar 

  • Vuong QH (1989) Likelihood ratio tests for model selection and non-nested hypotheses. Econom 57:307–333

    Article  MathSciNet  MATH  Google Scholar 

  • Woodhouse P, Khaw K, Plummer M (1993) Seasonal variation of blood pressure and its relationship to ambient temperature in an elderly population. J Hypertens 85:1267–1274

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Cindy Xin Feng .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer Science+Business Media Singapore

About this chapter

Cite this chapter

Feng, C.X., Li, L. (2016). Modeling Zero Inflation and Overdispersion in the Length of Hospital Stay for Patients with Ischaemic Heart Disease. In: Chen, DG., Chen, J., Lu, X., Yi, G., Yu, H. (eds) Advanced Statistical Methods in Data Science. ICSA Book Series in Statistics. Springer, Singapore. https://doi.org/10.1007/978-981-10-2594-5_3

Download citation

Publish with us

Policies and ethics