Abstract
The need to count things is ubiquitous, so data in the form of counts arise often in practice. Examples include: the number of alpha particles emitted from a source of radiation in a given time; the number of cases of leukemia reported per year in a certain jurisdiction; the number of flaws per metre of electrical cable. This chapter is concerned with counts when the individual events being counted are independent, or nearly so, and where there is no clear upper limit for the number of events that can occur, or where the upper limit is very much greater than any of the actual counts. We first compile important information about the Poisson distribution, the distribution most often used with count data. Poisson regression, or models for count data described by covariates, has already been covered elsewhere. We then focus on describing models for rates and models for counts organized in tables. Overdispersion is then discussed, including a discussion negative binomial glms and quasi-Poisson models as alternative models.
Poor data and good reasoning give poor results.
Good data and poor reasoning give poor results.
Poor data and poor reasoning give rotten results.
E. C. Berkeley [ 4 , p. 20]
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agresti, A.: An Introduction to Categorical Data Analysis, second edn. Wiley-Interscience, New York (2007)
Andersen, E.B.: Multiplicative Poisson models with unequal cell rates. Scandinavian Journal of Statistics 4, 153–158 (1977)
Appleton, D.R., French, J.M., Vanderpump, M.P.J.: Ignoring a covariate: An example of Simpson’s paradox. The American Statistician 50, 340–341 (1996)
Berkeley, E.C.: Right answers—a short guide for obtaining them. Computers and Automation 18(10) (1969)
Brockmann, H.J.: Satellite male groups in horseshoe crabs, limulus polyphemus. Ethology 102, 1–21 (1996)
Brunswick, A.F.: Adolescent health, sex, and fertility. American Journal of Public Health 61(4), 711–729 (1971)
Canadian Cancer Society: Canadian cancer statistics 2000. Published on the internet: www.cancer.ca/stats2000/tables/tab5e.htm (2000). Accessed 19 September 2001
Charig, C.R., Webb, D.R., Payne, S.R., Wickham, J.E.A.: Comparison of treatment of renal calculi by open surgery, percutaneous nephrolithotomy, and extracorporeal shockwave lithotripsy. British Medical Journal 292, 879–882 (1986)
Christensen, R.: Log-Linear Models. Springer Texts in Statistics. Springer, New York (2013)
Davison, A.C.: Statistical Models. Cambridge University Press, UK (2003)
Dunn, P.K.: Contingency tables and log-linear models. In: K. Kempf-Leonard (ed.) Encyclopedia of Social Measurement, pp. 499–506. Elsevier (2005)
Dunn, P.K., Smyth, G.K.: Randomized quantile residuals. Journal of Computational and Graphical Statistics 5(3), 236–244 (1996)
Ellison, A.M.: Bayesian inference in ecology. Ecology Letters 7, 509–520 (2004)
Everitt, B.S., Hothorn, T.: A Handbook of Statistical Analyses using, second edn. Chapman & Hall/CRC, Boca Raton, FL (2010)
Everitt, B.S., Smith, A.M.R.: Interactions in contingency tables: A brief discussion of alternative definitions. Psychological Medicine 9, 581–583 (1979)
Fienberg, S.: The Analysis of Cross-Classified Categorical Data. Springer, New York (2007)
Giardiello, F.M., Hamilton, S.R., Krush, A.J., Piantadosi, S., Hylind, L.M., Celano, P., Booker, S.V., Robinson, C.R., Johan, G., Offerhaus, A.: Treatment of colonic and rectal adenomas with sulindac in famial adenomatous polyposis. New England Journal of Medicine 328(18), 1313–1316 (1993)
Gotelli, N.J., Ellison, A.M.: Biogeography at a regional scale: Determinants of ant species density in bogs and forests of New England. Ecology 83(6), 1604–1609 (2002)
Hand, D.J., Daly, F., Lunn, A.D., McConway, K.Y., Ostrowski, E.: A Handbook of Small Data Sets. Chapman and Hall, London (1996)
Health Department of Western Australia: Annual report 1997/1998—health of Western Australians—mortality and survival. Published on the internet: www.health.wa.gov.au/Publications/annualreport_9798/. Accessed 19 September 2001
Howes, A.L., Maron, M., McAlpine, C.A.: Bayesian networks and adaptive management of wildlife habitat. Conservation Biology 24(4), 974–983 (2010)
Hutchinson, M.K., Holtman, M.C.: Analysis of count data using Poisson regression. Research in Nursing and Health 28, 408–418 (2005)
Jorgensen, D.W.: Multiple regression analysis of a Poisson process. Journal of the American Statistical Association 56(294), 235–245 (1961)
Julious, S.A., Mullee, M.A.: Confounding and Simpson’s paradox. British Medical Journal 309(1480), 1480–1481 (1994)
King, G.: Statistical models for political science event counts: Bias in conventional procedures and evidence for the exponential Poisson regression model. American Journal of Political Science 32(3), 838–863 (1988)
Lindsey, J.K.: Modelling Frequency and Count Data. No. 15 in Oxford Statistical Science Series. Clarendon Press, Oxford (1995)
Lovett, A.A., Gatrell, A.C.: The geography of spina bifida in England and Wales. Transactions of the Institute of British Geographers (New Series) 13(3), 288–302 (1988)
Luo, D., Wood, G.R., Jones, G.: Visualising contingency table data. The Australian Mathematical Society Gazette 31(4), 258–262 (2004)
Maag, J.W., Behrens, J.T.: Epidemiologic data on seriously emotionally disturbed and learning disabled adolescents: Reporting extreme depressive symptomatology. Behavioral Disorders 15(1) (1989)
Maron, M.: Threshold effect of eucalypt density on an aggressive avian competitor. Biological Conservation 136, 100–107 (2007)
Norton, J., Lawrence, G., Wood, G.: The Australian public’s perception of genetically-engineered foods. Australasian Biotechnology pp. 172–181 (1998)
Pettifor, R.A.: Brood-manipulation experiments. I. The number of offspring surviving per nest in blue tits (Parus caeruleus). Journal of Animal Ecology 62, 131–144 (1993)
Piantadosi, S.: Clinical Trials: A Methodologic Perspective, second edn. John Wiley and Sons, New York (2005)
Siegel, R.L., Miller, K.D., Jemal, A.: Cancer statistics, 2015. CA: A Cancer Journal for Clinicians 65(1), 5–29 (2015)
Smith, P.T., Heitjan, D.F.: Testing and adjusting for departures from nominal dispersion in generalized linear models. Journal of the Royal Statistical Society, Series C 42(1), 31–41 (1993)
Smyth, G.K.: Australasian data and story library (Ozdasl) (2011). URL http://www.statsci.org/data
Venables, W.N., Ripley, B.D.: Modern Applied Statistics with S, fourth edn. Springer-Verlag, New York (2002). URL http://www.stats.ox.ac.uk/pub/MASS4
Whittemore, A.S., Gong, G.: Poisson regression with misclassified counts: Applications to cervical cancer mortality rates. Journal of the Royal Statistical Society, Series C 40(1), 81–93 (1991)
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this chapter
Cite this chapter
Dunn, P.K., Smyth, G.K. (2018). Chapter 10: Models for Counts: Poisson and Negative Binomial GLMs. In: Generalized Linear Models With Examples in R. Springer Texts in Statistics. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-0118-7_10
Download citation
DOI: https://doi.org/10.1007/978-1-4419-0118-7_10
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-0117-0
Online ISBN: 978-1-4419-0118-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)