Skip to main content

Chapter 10: Models for Counts: Poisson and Negative Binomial GLMs

  • Chapter
  • First Online:
Book cover Generalized Linear Models With Examples in R

Part of the book series: Springer Texts in Statistics ((STS))

Abstract

The need to count things is ubiquitous, so data in the form of counts arise often in practice. Examples include: the number of alpha particles emitted from a source of radiation in a given time; the number of cases of leukemia reported per year in a certain jurisdiction; the number of flaws per metre of electrical cable. This chapter is concerned with counts when the individual events being counted are independent, or nearly so, and where there is no clear upper limit for the number of events that can occur, or where the upper limit is very much greater than any of the actual counts. We first compile important information about the Poisson distribution, the distribution most often used with count data. Poisson regression, or models for count data described by covariates, has already been covered elsewhere. We then focus on describing models for rates and models for counts organized in tables. Overdispersion is then discussed, including a discussion negative binomial glms and quasi-Poisson models as alternative models.

Poor data and good reasoning give poor results.

Good data and poor reasoning give poor results.

Poor data and poor reasoning give rotten results.

E. C. Berkeley [ 4 , p. 20]

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 119.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Agresti, A.: An Introduction to Categorical Data Analysis, second edn. Wiley-Interscience, New York (2007)

    Google Scholar 

  2. Andersen, E.B.: Multiplicative Poisson models with unequal cell rates. Scandinavian Journal of Statistics 4, 153–158 (1977)

    MathSciNet  MATH  Google Scholar 

  3. Appleton, D.R., French, J.M., Vanderpump, M.P.J.: Ignoring a covariate: An example of Simpson’s paradox. The American Statistician 50, 340–341 (1996)

    Google Scholar 

  4. Berkeley, E.C.: Right answers—a short guide for obtaining them. Computers and Automation 18(10) (1969)

    Google Scholar 

  5. Brockmann, H.J.: Satellite male groups in horseshoe crabs, limulus polyphemus. Ethology 102, 1–21 (1996)

    Article  Google Scholar 

  6. Brunswick, A.F.: Adolescent health, sex, and fertility. American Journal of Public Health 61(4), 711–729 (1971)

    Article  Google Scholar 

  7. Canadian Cancer Society: Canadian cancer statistics 2000. Published on the internet: www.cancer.ca/stats2000/tables/tab5e.htm (2000). Accessed 19 September 2001

    Google Scholar 

  8. Charig, C.R., Webb, D.R., Payne, S.R., Wickham, J.E.A.: Comparison of treatment of renal calculi by open surgery, percutaneous nephrolithotomy, and extracorporeal shockwave lithotripsy. British Medical Journal 292, 879–882 (1986)

    Article  Google Scholar 

  9. Christensen, R.: Log-Linear Models. Springer Texts in Statistics. Springer, New York (2013)

    Google Scholar 

  10. Davison, A.C.: Statistical Models. Cambridge University Press, UK (2003)

    Google Scholar 

  11. Dunn, P.K.: Contingency tables and log-linear models. In: K. Kempf-Leonard (ed.) Encyclopedia of Social Measurement, pp. 499–506. Elsevier (2005)

    Google Scholar 

  12. Dunn, P.K., Smyth, G.K.: Randomized quantile residuals. Journal of Computational and Graphical Statistics 5(3), 236–244 (1996)

    Google Scholar 

  13. Ellison, A.M.: Bayesian inference in ecology. Ecology Letters 7, 509–520 (2004)

    Article  Google Scholar 

  14. Everitt, B.S., Hothorn, T.: A Handbook of Statistical Analyses using, second edn. Chapman & Hall/CRC, Boca Raton, FL (2010)

    Google Scholar 

  15. Everitt, B.S., Smith, A.M.R.: Interactions in contingency tables: A brief discussion of alternative definitions. Psychological Medicine 9, 581–583 (1979)

    Article  Google Scholar 

  16. Fienberg, S.: The Analysis of Cross-Classified Categorical Data. Springer, New York (2007)

    Book  Google Scholar 

  17. Giardiello, F.M., Hamilton, S.R., Krush, A.J., Piantadosi, S., Hylind, L.M., Celano, P., Booker, S.V., Robinson, C.R., Johan, G., Offerhaus, A.: Treatment of colonic and rectal adenomas with sulindac in famial adenomatous polyposis. New England Journal of Medicine 328(18), 1313–1316 (1993)

    Article  Google Scholar 

  18. Gotelli, N.J., Ellison, A.M.: Biogeography at a regional scale: Determinants of ant species density in bogs and forests of New England. Ecology 83(6), 1604–1609 (2002)

    Article  Google Scholar 

  19. Hand, D.J., Daly, F., Lunn, A.D., McConway, K.Y., Ostrowski, E.: A Handbook of Small Data Sets. Chapman and Hall, London (1996)

    MATH  Google Scholar 

  20. Health Department of Western Australia: Annual report 1997/1998—health of Western Australians—mortality and survival. Published on the internet: www.health.wa.gov.au/Publications/annualreport_9798/. Accessed 19 September 2001

    Google Scholar 

  21. Howes, A.L., Maron, M., McAlpine, C.A.: Bayesian networks and adaptive management of wildlife habitat. Conservation Biology 24(4), 974–983 (2010)

    Article  Google Scholar 

  22. Hutchinson, M.K., Holtman, M.C.: Analysis of count data using Poisson regression. Research in Nursing and Health 28, 408–418 (2005)

    Article  Google Scholar 

  23. Jorgensen, D.W.: Multiple regression analysis of a Poisson process. Journal of the American Statistical Association 56(294), 235–245 (1961)

    Article  MathSciNet  Google Scholar 

  24. Julious, S.A., Mullee, M.A.: Confounding and Simpson’s paradox. British Medical Journal 309(1480), 1480–1481 (1994)

    Article  Google Scholar 

  25. King, G.: Statistical models for political science event counts: Bias in conventional procedures and evidence for the exponential Poisson regression model. American Journal of Political Science 32(3), 838–863 (1988)

    Article  Google Scholar 

  26. Lindsey, J.K.: Modelling Frequency and Count Data. No. 15 in Oxford Statistical Science Series. Clarendon Press, Oxford (1995)

    Google Scholar 

  27. Lovett, A.A., Gatrell, A.C.: The geography of spina bifida in England and Wales. Transactions of the Institute of British Geographers (New Series) 13(3), 288–302 (1988)

    Article  Google Scholar 

  28. Luo, D., Wood, G.R., Jones, G.: Visualising contingency table data. The Australian Mathematical Society Gazette 31(4), 258–262 (2004)

    MathSciNet  MATH  Google Scholar 

  29. Maag, J.W., Behrens, J.T.: Epidemiologic data on seriously emotionally disturbed and learning disabled adolescents: Reporting extreme depressive symptomatology. Behavioral Disorders 15(1) (1989)

    Article  Google Scholar 

  30. Maron, M.: Threshold effect of eucalypt density on an aggressive avian competitor. Biological Conservation 136, 100–107 (2007)

    Article  Google Scholar 

  31. Norton, J., Lawrence, G., Wood, G.: The Australian public’s perception of genetically-engineered foods. Australasian Biotechnology pp. 172–181 (1998)

    Google Scholar 

  32. Pettifor, R.A.: Brood-manipulation experiments. I. The number of offspring surviving per nest in blue tits (Parus caeruleus). Journal of Animal Ecology 62, 131–144 (1993)

    Article  Google Scholar 

  33. Piantadosi, S.: Clinical Trials: A Methodologic Perspective, second edn. John Wiley and Sons, New York (2005)

    Book  Google Scholar 

  34. Siegel, R.L., Miller, K.D., Jemal, A.: Cancer statistics, 2015. CA: A Cancer Journal for Clinicians 65(1), 5–29 (2015)

    Google Scholar 

  35. Smith, P.T., Heitjan, D.F.: Testing and adjusting for departures from nominal dispersion in generalized linear models. Journal of the Royal Statistical Society, Series C 42(1), 31–41 (1993)

    Article  Google Scholar 

  36. Smyth, G.K.: Australasian data and story library (Ozdasl) (2011). URL http://www.statsci.org/data

  37. Venables, W.N., Ripley, B.D.: Modern Applied Statistics with S, fourth edn. Springer-Verlag, New York (2002). URL http://www.stats.ox.ac.uk/pub/MASS4

    Book  Google Scholar 

  38. Whittemore, A.S., Gong, G.: Poisson regression with misclassified counts: Applications to cervical cancer mortality rates. Journal of the Royal Statistical Society, Series C 40(1), 81–93 (1991)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Science+Business Media, LLC, part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Dunn, P.K., Smyth, G.K. (2018). Chapter 10: Models for Counts: Poisson and Negative Binomial GLMs. In: Generalized Linear Models With Examples in R. Springer Texts in Statistics. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-0118-7_10

Download citation

Publish with us

Policies and ethics