
Modeling and forecasting daily maximum hourly ozone concentrations using the RegAR model with skewed and heavy-tailed innovations

Environmental and Ecological Statistics

Abstract

This paper considers the modeling and forecasting of daily maximum hourly ozone concentrations in Laranjeiras, Serra, Brazil, through dynamic regression models. To account for the natural skewness and heavy-tailedness of the data, a linear regression model with autoregressive errors was considered, with innovations following a member of the family of scale mixtures of skew-normal distributions. Pollutants and meteorological variables were used as predictors, along with some deterministic factors, namely weekdays and seasons. The Oceanic Niño Index was also considered as a predictor. The estimated model satisfactorily explained the correlation structure of the ozone time series. An out-of-sample forecast study was also performed. The skew-normal and skew-t models produced point forecasts that were competitive with those of the analogous model with Gaussian innovations. In terms of forecast intervals, however, the skewed models performed much better, yielding more accurate prediction intervals. These findings were empirically corroborated by a forecast Monte Carlo experiment.
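To make the model class described above concrete, the following is a minimal simulation sketch and not the authors' implementation. It assumes a single predictor, first-order autoregressive errors, and plain skew-normal innovations (one member of the scale-mixture family), generated through the standard representation e = δ|Z0| + sqrt(1 − δ²) Z1 with δ = λ / sqrt(1 + λ²). The function names `skew_normal` and `simulate_regar`, the zero start-up value for the error process, and the centering of the innovations at zero mean are all illustrative assumptions.

```python
import math
import random


def skew_normal(rng, shape):
    """Draw one standard skew-normal variate with shape parameter `shape`."""
    delta = shape / math.sqrt(1.0 + shape ** 2)
    z0 = abs(rng.gauss(0.0, 1.0))  # half-normal component (drives the skewness)
    z1 = rng.gauss(0.0, 1.0)       # symmetric normal component
    return delta * z0 + math.sqrt(1.0 - delta ** 2) * z1


def simulate_regar(x, beta, phi, shape, scale=1.0, seed=0):
    """Simulate y_t = beta[0] + beta[1] * x_t + u_t, where
    u_t = phi * u_{t-1} + e_t and e_t is a centered skew-normal innovation."""
    rng = random.Random(seed)
    delta = shape / math.sqrt(1.0 + shape ** 2)
    # Mean of a scaled skew-normal; subtracted so that E[e_t] = 0 (assumed convention).
    mean_shift = scale * delta * math.sqrt(2.0 / math.pi)
    y = []
    u_prev = 0.0  # assumed start-up value of the error process
    for x_t in x:
        e_t = scale * skew_normal(rng, shape) - mean_shift
        u_t = phi * u_prev + e_t
        y.append(beta[0] + beta[1] * x_t + u_t)
        u_prev = u_t
    return y


if __name__ == "__main__":
    predictor = [float(t % 7) for t in range(10)]  # hypothetical predictor values
    series = simulate_regar(predictor, beta=(30.0, 1.5), phi=0.6, shape=4.0, scale=5.0)
    print([round(v, 2) for v in series])
```

Setting `shape=0.0` recovers symmetric Gaussian innovations, which gives a simple way to compare skewed and Gaussian versions of the same simulated series.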

Acknowledgements

The authors thank the Instituto Estadual de Meio Ambiente e Recursos Hídricos of Espírito Santo state for making the data sets used in this paper available. The authors also thank the associate editor and reviewers for their extremely helpful comments, which have substantially improved the quality of the paper.

Author information

Corresponding author

Correspondence to Alessandro José Queiroz Sarnaglia.

Additional information

Handling Editor: Pierre Dutilleul.

About this article

Cite this article

Sarnaglia, A.J.Q., Monroy, N.A.J. & da Vitória, A.G. Modeling and forecasting daily maximum hourly ozone concentrations using the RegAR model with skewed and heavy-tailed innovations. Environ Ecol Stat 25, 443–469 (2018). https://doi.org/10.1007/s10651-018-0413-7

  • DOI: https://doi.org/10.1007/s10651-018-0413-7
