Abstract
In applied studies, researchers are often confronted with semicontinuous response models. These are models with a semicontinuous response variable, i.e. a continuous variable that has a lower bound, that we here consider to be zero, and such that a sizable fraction of the observations takes value on this boundary. Semicontinuous response models are common in pharmacovigilance, pharmacoepidemiological and pharmacoeconomic studies, where it can be sometimes useful to evaluate and monitor the doses of a certain drug substance consumed by the general population. The preponderant number of observations taking value zero corresponds to the part of the population which is not actually consuming the medicine, either because they do not need it or because, even if they need it, they are not taking it in a given interval of time. Another interesting field of application concerns goods or drugs consumption to be studied for economic or social purposes. We here explore the use of several asymmetric distributions to address the fact that the continuous part of the data distribution shows skewness in most cases. As an illustration, the proposal is applied to model alcohol expenditure in Italian households.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aristei, D., Pieroni, L.: A double-hurdle approach to modelling tobacco consumption in Italy. Appl. Econ. 40, 2463–2476 (2008)
Azzalini, A.: The skew-normal distribution and related multivariate families (with discussion). Scand. J. Stat. 32, 159–188 (C/R 189–200) (2005)
Azzalini, A., Capitanio, A.: Statistical applications of the multivariate skew normal distribution. J. Roy. Stat. Soc.: Ser. B (Stat. Methodol.) 61, 579–602 (1999)
Blundell, R., Meghir, C.: Bivariate alternatives to the Tobit model. J. Econ. 34, 179–200 (1987)
Capobianco, R., Hutton, J., Stanghellini, E.: Modelling censored data with the skew-normal distribution. In: Proceedings of the 25th International Workshop on Statistical Modelling (IWSM2010), Glasgow, UK, 119–122 (2010)
Chai, H., Bailey, K.: Use of log-skew-normal distribution in analysis of continuous data with a discrete component at zero. Stat. Med. 27, 3643–3655 (2008)
Couturier, D., Victoria-Feser, M.: Zero-inflated truncated generalized Pareto distribution for the analysis of radio audience data. Ann. Appl. Stat. 4, 1824–1846 (2010)
Cragg, J.: Some statistical models for limited dependent variables with application to the demand for durable goods. Econometrica: J. Economet. Soc. 39, 829–844 (1971)
Duan, N., Manning, W., Morris, C., Newhouse, J.: A comparison of alternative models for the demand for medical care. J. Bus. Econ. Stat. 1, 115–126 (1983)
Healy, M.J.R.: Multivariate normal plotting. Appl. Stat. 17, 157–161 (1968)
Hutton, J., Stanghellini, E.: Modelling bounded health scores with censored skew-normal distributions. Stat. Med. 30, 368–376 (2011)
Jones, A.: A double-hurdle model of cigarette consumption. J. Appl. Economet. 4, 23–39 (1989)
Lambert, D.: Zero-inflated poisson regression, with an application to defects in manufacturing. Technometrics 34, 1–14 (1992)
Min, Y., Agresti, A.: Modeling nonnegative data with clumping at zero: a survey. J. Iranian Stat. Soc. 1, 7–33 (2002)
Moulton, L., Halsey, N.: A mixed gamma model for regression analyses of quantitative assay data. Vaccine 14, 1154–1158 (1996)
Mullahy, J.: Specification and testing of some modified count data models. J. Economet. 33, 341–365 (1986)
Olsen, M., Schafer, J.: A two-part random-effects model for semicontinuous longitudinal data. J. Am. Stat. Assoc. 96, 730–745 (2001)
Powell, J.: Estimation of semiparametric models. In: Engle, R.F., McFadden, D.L. (eds.) Handbook of Econometrics, vol. IV, pp. 2443–521. North Holland, Amsterdam (1994)
R Development Core Team: R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing (2011)
Su, L., Tom, B., Farewell, V.: Bias in 2-part mixed models for longitudinal semicontinuous data. Biostatistics 10, 374 (2009)
Tobin, J.: Estimation of relationships for limited dependent variables. Econometrica: J. Economet. Soc. 26, 24–36 (1958)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Italia
About this chapter
Cite this chapter
Gottard, A., Stanghellini, E., Capobianco, R. (2013). Semicontinuous Regression Models with Skew Distributions. In: Grigoletto, M., Lisi, F., Petrone, S. (eds) Complex Models and Computational Methods in Statistics. Contributions to Statistics. Springer, Milano. https://doi.org/10.1007/978-88-470-2871-5_12
Download citation
DOI: https://doi.org/10.1007/978-88-470-2871-5_12
Published:
Publisher Name: Springer, Milano
Print ISBN: 978-88-470-2870-8
Online ISBN: 978-88-470-2871-5
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)