Abstract
This study explores a Bayesian regression analysis for count data in the presence of spatial and temporal correlations. The contribution is to develop a regression model for count data that provides flexibility in modeling the complexity of zero-inflation, overdispersion, as well as spatial patterns in a dynamic manner. More importantly, it improves the computational efficiency via dimension reduction while handling the high-dimensional correlation structure in the data. The proposed model is applied to the survey data by the Northeast Fisheries Sciences Center (NEFSC) for estimation and prediction of the Atlantic cod in the Gulf of Maine—Georges Bank region. Both zero-inflated Poisson and negative binomial models are fitted. Model comparison shows the improvement in model fitting with consideration in the spatial-temporal correlation as well as the overdispersion in the count data.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agarwal, D. K., Gelfand, A. E., & Citron-Pousty, S. (2002). Zero-inflated models with application to spatial count data. Environmental and Ecological Statistics, 9, 341–355.
Albert, J. H., & Chib, S. (1993). Bayesian analysis of binary and polychotomous response data. Journal of the American Statistical Association, 88, 669–679.
Alexander, N., Moyeed, R., & Stander, J. (2000). Spatial modelling of individual-level parasite counts using the negative binomial distribution. Biostatistics, 1(4), 453–463.
Banerjee, S. (2005). On geodetic distance computations in spatial modeling. Biometrics, 61(2), 617–625.
Banerjee, S., Carlin, B. P., & Gelfand, A. E. (2004). Hierarchical modeling and analysis for spatial data. Boca Raton, London, New York, Washington, DC: Chapman & Hall/CRC.
Banerjee, S., Gelfand, A., Finley, A., & Sang, H. (2008). Gaussian predictive process models for large spatial datasets. Journal of the Royal Statistical Society Series B, 70, 825–848.
Breslow, N. E., & Clayton, D. G. (1993). Approximate inference in generalized linear mixed models. Journal of the American Statistical Association, 88, 9–25.
Cressie, N. (1993). Statistics for spatial data (revised edn.) New York: Wiley.
da Silva, A. R., & Rodrigues, T. C. V. (2014). Geographically weighted negative binomial regression – incorporating overdispersion. Statistics and Computing, 24(5), 769–783.
De Oliveira, V. (2000). Bayesian prediction of clipped Gaussian random fields. Computational Statistics & Data Analysis, 34, 299–314.
Dey, D. K., Chen, M. H., & Chang, H. (1997). Bayesian approach for nonlinear random effects models. Biometrics, 53(4), 1239–1252.
Diggle, P. J., Tawn, J. A., & Moyeed, R. A. (1998). Model-based geostatistics (with discussion). Applied Statistics, 47, 299–350.
Fei, S., & Rathbun, S. L. (2006). A spatial zero-inflated Poisson model for oak regeneration. Environmental and Ecological Statistics, 13, 406–426.
Fernandes, M. V., Schmidt, A. M., & Migon, H. S. (2009). Modelling zero-inflated spatio-temporal processes. Statistical Modelling, 9(1), 3–25.
Finley, A. O., Sang, H., Banerjee, S., & Gelfand, A. E. (2009). Improving the performance of predictive process modeling for large datasets. Computational Statistics & Data Analysis, 53(8), 2873–2884.
Fu, Y. Z., Chu, P. X., & Lu, L. Y. (2015). A Baysian approach of joint models for clustered zero-inflated count data with skewness and measurement errors. Journal of Applied Statistics, 42(4), 745–761.
Gelfand, A. E., Dey, D. K., & Chang, H. (1992). Model determination using predictive distributions with implementation via sampling-based methods. In J. M. Bernardo, J. O. Berger, A. P. Dawid, & A. F. M. Smith (Eds.), Bayesian statistics (Vol. 4, pp. 147–167). Oxford: Oxford University Press.
Ghosh, S. K., Mukhopadhyay, P., & Lu, J. C. (2006). Bayesian analysis of zero-inflated regression models. Journal of Statistical Planning and Inference, 136, 1360–1375.
Hadfield, J. D. (2010). MCMC methods for multi-response generalized linear mixed models: The MCMCglmm R package. Journal of Statistical Software, 33(2), 1–22.
Jiang, X., Dey, D. K., Prunier, R., Wilson, A. M., & Holsinger, K. E. (2013). A new class of flexible link functions with application to species co-occurrence in Cape floristic region. The Annals of Applied Statistics, 7(4), 2180–2204.
Kassahun, W., Neyens, T., Molenberghs, G., Faes, C., & Verbeke, G. (2015). A joint model for hierarchical continuous and zero-inflated overdispersed count data. Journal of Statistical Computation and Simulation, 85(3), 552–571.
Lee, K. L., & Bell, D. R. (2009). A spatial negative binomial regression of individual-level count data with regional and person-specific covariates. Working Paper Series, The Wharton School, University of Philadelphia.
Li, H. (2008). Bayesian hierarchical models for spatial count data with application to fire frequency in British Columbia. Master’s thesis, Department of Mathmatics and Statistics, University of Victoria, Victoria, Canada.
Lopes, H. F., Gamerman, D., & Salazar, E. (2011). Generalized spatial dynamic factor models. Computational Statistics & Data Analysis, 55, 1319–1330.
Lopes, H. F., Salazar, E., & Gamerman, D. (2008). Spatial dynamic factor analysis. Bayesian Analysis, 3(4), 759–792.
Min, Y., & Agresti, A. (2005). Random effect models for repeated measures of zero-inflated count data. Statistical Modelling, 5, 1–19.
Mohebbi, M., Wolfe, R., & Forbes, A. (2014). Disease mapping and regression with count data in the presence of overdispersion and spatial autocorrelation: A Bayesian model averaging approach. International Journal of Environmental Research and Public Health, 11, 883–902.
Nelder, J. A., & Wedderburn, R. M. (1972). Generalized linear models. Journal of the Royal Statistical Society, Series A, 135, 370–384.
Recta, V., Haran, M., & Rosenberger, J. L. (2012). A two-stage model for incidence and prevalence in point-level spatial count data. Environmetrics, 23, 162–174.
Rumisha, S. F., Smith, T., Abdulla, S., Masanja, H., & Vounatsou, P. (2014). Modelling heterogeneity in malaria transmission using large sparse spatial-temporal entomological data. Global Health Action, 7, 22682. http://dx.doi.org/10.3402/gha.v7.22,682.
Salazar, E., Sansó, B., Finley, A., Hammerling, D., Steinsland, I., Wang, X. (2011). Comparing and blending regional climate model prediction for the American southwest. Journal of Agricultural, Biological, and Environmental Statistics, 16, 586–605.
Smith, B. J. (2007). boa: An R package for MCMC output convergence assessment and posterior inference. Journal of Statistical Software, 21(11), 1–37.
Spiegelhalter, D. J., Best, N. G., Carlin, B. P., & Van Der Linde, A. (2002). Bayesian measures of model complexity and fit (with discussion). Journal of the Royal Statistical Society, Series B, 64, 583–639.
Ver Hoef, J. M., & Jansen, J. K. (2007). Space-time zero-inflated count models of harbor seals. Environmetrics, 18, 697–712.
Wang, X., Chen, M. H., Kuo, R. C., & Dey, D. K. (2015). Bayesian spatial-temporal modeling of ecological zero-inflated count data. Statistica Sinica, 25, 189–204.
Wang, X., & Dey, D. K. (2010). Generalized extreme value regression for binary response data: An application to B2B electronic payments system adoption. Annals of Applied Statistics, 4(4), 2000–2023.
West, M., & Harrison, J. (1997). Bayesian forecasting and dynamic models (2nd ed.). Berlin: Springer.
Whittle, P. (1954). On stationary processes in the plane. Biometrika, 41(3/4), 434–449.
Wikle, C. K., & Anderson, C. J. (2003). Climatological analysis of tornado report counts using a hierarchical Bayesian spatiotemporal model. Journal of Geophysical Research (Atmospheres), 108, 9005–9019.
Acknowledgements
We thank two referees for their constructive comments and suggestions. Dr. Wang thanks the domestic and international conference travel support provided by the Charles Phelps Taft Center at the University of Cincinnati. Dr. Chen’s research was partially supported by NIH grants #GM 70335 and #P01 CA142538.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Wang, X., Chen, MH., Kuo, R.C., Dey, D.K. (2016). Dynamic Spatial Pattern Recognition in Count Data. In: Jin, Z., Liu, M., Luo, X. (eds) New Developments in Statistical Modeling, Inference and Application. ICSA Book Series in Statistics. Springer, Cham. https://doi.org/10.1007/978-3-319-42571-9_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-42571-9_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42570-2
Online ISBN: 978-3-319-42571-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)