Semicontinuous Regression Models with Skew Distributions

Complex Models and Computational Methods in Statistics

Part of the book series: Contributions to Statistics (CONTRIB.STAT.)

Abstract

In applied studies, researchers often encounter semicontinuous response variables: continuous variables with a lower bound, here taken to be zero, for which a sizable fraction of the observations lie exactly on that bound. Models for such responses are common in pharmacovigilance, pharmacoepidemiological and pharmacoeconomic studies, where it can be useful to evaluate and monitor the doses of a given drug substance consumed by the general population. The large number of zero observations corresponds to the part of the population that is not consuming the medicine, either because they do not need it or because, although they need it, they are not taking it in the given time interval. Another field of application is the consumption of goods or drugs studied for economic or social purposes. Because the continuous part of the data distribution is skewed in most cases, we explore the use of several asymmetric distributions for it. As an illustration, the proposal is applied to model alcohol expenditure in Italian households.
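
The structure described in the abstract can be illustrated with a minimal sketch. It is not the chapter's model: it assumes a two-part specification without covariates, in which a response is zero with probability `p_zero` and otherwise follows a skew-normal distribution on the log scale. The function names, the log transformation, and the parameter values are all assumptions made for illustration; only `scipy.stats.skewnorm` is an existing library distribution.

```python
import numpy as np
from scipy import stats


def simulate_semicontinuous(n, p_zero, shape, loc, scale, seed=0):
    """Draw n responses: exactly zero with probability p_zero,
    otherwise exp(Z) with Z skew-normal (an assumed choice)."""
    rng = np.random.default_rng(seed)
    is_zero = rng.random(n) < p_zero
    log_positive = stats.skewnorm.rvs(
        shape, loc=loc, scale=scale, size=n, random_state=rng
    )
    return np.where(is_zero, 0.0, np.exp(log_positive))


def fit_two_part(y):
    """Fit the two parts separately: the share of zeros, then a
    skew-normal distribution for log(y) among the positive values."""
    y = np.asarray(y, dtype=float)
    positive = y[y > 0]
    p_zero_hat = 1.0 - positive.size / y.size
    shape_hat, loc_hat, scale_hat = stats.skewnorm.fit(np.log(positive))
    return {
        "p_zero": p_zero_hat,
        "shape": shape_hat,
        "loc": loc_hat,
        "scale": scale_hat,
    }


def log_likelihood(y, p_zero, shape, loc, scale):
    """Log-likelihood of the two-part model: a Bernoulli term for
    zero versus positive, plus the density of the positive values."""
    y = np.asarray(y, dtype=float)
    positive = y[y > 0]
    n_zero = y.size - positive.size
    ll_binary = n_zero * np.log(p_zero) + positive.size * np.log1p(-p_zero)
    # Change of variables: if log(Y) has density f, Y has density f(log y) / y.
    ll_positive = np.sum(
        stats.skewnorm.logpdf(np.log(positive), shape, loc=loc, scale=scale)
        - np.log(positive)
    )
    return ll_binary + ll_positive


if __name__ == "__main__":
    # Illustrative parameter values only, not taken from the chapter.
    y = simulate_semicontinuous(2000, p_zero=0.6, shape=3.0, loc=1.0, scale=0.5)
    estimates = fit_two_part(y)
    print(estimates)
    print(log_likelihood(y, **estimates))
```

Because the likelihood factorizes into a zero-versus-positive term and a term for the positive values, the two parts can be estimated separately in this simplified setting. A regression version would let `p_zero` and the location of the positive part depend on covariates.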




Corresponding author

Correspondence to Elena Stanghellini.


Copyright information

© 2013 Springer-Verlag Italia

About this chapter

Cite this chapter

Gottard, A., Stanghellini, E., Capobianco, R. (2013). Semicontinuous Regression Models with Skew Distributions. In: Grigoletto, M., Lisi, F., Petrone, S. (eds) Complex Models and Computational Methods in Statistics. Contributions to Statistics. Springer, Milano. https://doi.org/10.1007/978-88-470-2871-5_12
