Advertisement

Chapter 10: Models for Counts: Poisson and Negative Binomial GLMs

  • Peter K. Dunn
  • Gordon K. Smyth
Chapter
Part of the Springer Texts in Statistics book series (STS)

Abstract

The need to count things is ubiquitous, so data in the form of counts arise often in practice. Examples include: the number of alpha particles emitted from a source of radiation in a given time; the number of cases of leukemia reported per year in a certain jurisdiction; the number of flaws per metre of electrical cable. This chapter is concerned with counts when the individual events being counted are independent, or nearly so, and where there is no clear upper limit for the number of events that can occur, or where the upper limit is very much greater than any of the actual counts. We first compile important information about the Poisson distribution, the distribution most often used with count data. Poisson regression, or models for count data described by covariates, has already been covered elsewhere. We then focus on describing models for rates and models for counts organized in tables. Overdispersion is then discussed, including a discussion negative binomial glms and quasi-Poisson models as alternative models.

References

  1. [1]
    Agresti, A.: An Introduction to Categorical Data Analysis, second edn. Wiley-Interscience, New York (2007)Google Scholar
  2. [2]
    Andersen, E.B.: Multiplicative Poisson models with unequal cell rates. Scandinavian Journal of Statistics 4, 153–158 (1977)MathSciNetzbMATHGoogle Scholar
  3. [3]
    Appleton, D.R., French, J.M., Vanderpump, M.P.J.: Ignoring a covariate: An example of Simpson’s paradox. The American Statistician 50, 340–341 (1996)Google Scholar
  4. [4]
    Berkeley, E.C.: Right answers—a short guide for obtaining them. Computers and Automation 18(10) (1969)Google Scholar
  5. [5]
    Brockmann, H.J.: Satellite male groups in horseshoe crabs, limulus polyphemus. Ethology 102, 1–21 (1996)CrossRefGoogle Scholar
  6. [6]
    Brunswick, A.F.: Adolescent health, sex, and fertility. American Journal of Public Health 61(4), 711–729 (1971)CrossRefGoogle Scholar
  7. [7]
    Canadian Cancer Society: Canadian cancer statistics 2000. Published on the internet: www.cancer.ca/stats2000/tables/tab5e.htm (2000). Accessed 19 September 2001Google Scholar
  8. [8]
    Charig, C.R., Webb, D.R., Payne, S.R., Wickham, J.E.A.: Comparison of treatment of renal calculi by open surgery, percutaneous nephrolithotomy, and extracorporeal shockwave lithotripsy. British Medical Journal 292, 879–882 (1986)CrossRefGoogle Scholar
  9. [9]
    Christensen, R.: Log-Linear Models. Springer Texts in Statistics. Springer, New York (2013)Google Scholar
  10. [10]
    Davison, A.C.: Statistical Models. Cambridge University Press, UK (2003)Google Scholar
  11. [11]
    Dunn, P.K.: Contingency tables and log-linear models. In: K. Kempf-Leonard (ed.) Encyclopedia of Social Measurement, pp. 499–506. Elsevier (2005)Google Scholar
  12. [12]
    Dunn, P.K., Smyth, G.K.: Randomized quantile residuals. Journal of Computational and Graphical Statistics 5(3), 236–244 (1996)Google Scholar
  13. [13]
    Ellison, A.M.: Bayesian inference in ecology. Ecology Letters 7, 509–520 (2004)CrossRefGoogle Scholar
  14. [14]
    Everitt, B.S., Hothorn, T.: A Handbook of Statistical Analyses using, second edn. Chapman & Hall/CRC, Boca Raton, FL (2010)Google Scholar
  15. [15]
    Everitt, B.S., Smith, A.M.R.: Interactions in contingency tables: A brief discussion of alternative definitions. Psychological Medicine 9, 581–583 (1979)CrossRefGoogle Scholar
  16. [16]
    Fienberg, S.: The Analysis of Cross-Classified Categorical Data. Springer, New York (2007)CrossRefGoogle Scholar
  17. [17]
    Giardiello, F.M., Hamilton, S.R., Krush, A.J., Piantadosi, S., Hylind, L.M., Celano, P., Booker, S.V., Robinson, C.R., Johan, G., Offerhaus, A.: Treatment of colonic and rectal adenomas with sulindac in famial adenomatous polyposis. New England Journal of Medicine 328(18), 1313–1316 (1993)CrossRefGoogle Scholar
  18. [18]
    Gotelli, N.J., Ellison, A.M.: Biogeography at a regional scale: Determinants of ant species density in bogs and forests of New England. Ecology 83(6), 1604–1609 (2002)CrossRefGoogle Scholar
  19. [19]
    Hand, D.J., Daly, F., Lunn, A.D., McConway, K.Y., Ostrowski, E.: A Handbook of Small Data Sets. Chapman and Hall, London (1996)zbMATHGoogle Scholar
  20. [20]
    Health Department of Western Australia: Annual report 1997/1998—health of Western Australians—mortality and survival. Published on the internet: www.health.wa.gov.au/Publications/annualreport_9798/. Accessed 19 September 2001Google Scholar
  21. [21]
    Howes, A.L., Maron, M., McAlpine, C.A.: Bayesian networks and adaptive management of wildlife habitat. Conservation Biology 24(4), 974–983 (2010)CrossRefGoogle Scholar
  22. [22]
    Hutchinson, M.K., Holtman, M.C.: Analysis of count data using Poisson regression. Research in Nursing and Health 28, 408–418 (2005)CrossRefGoogle Scholar
  23. [23]
    Jorgensen, D.W.: Multiple regression analysis of a Poisson process. Journal of the American Statistical Association 56(294), 235–245 (1961)MathSciNetCrossRefGoogle Scholar
  24. [24]
    Julious, S.A., Mullee, M.A.: Confounding and Simpson’s paradox. British Medical Journal 309(1480), 1480–1481 (1994)CrossRefGoogle Scholar
  25. [25]
    King, G.: Statistical models for political science event counts: Bias in conventional procedures and evidence for the exponential Poisson regression model. American Journal of Political Science 32(3), 838–863 (1988)CrossRefGoogle Scholar
  26. [26]
    Lindsey, J.K.: Modelling Frequency and Count Data. No. 15 in Oxford Statistical Science Series. Clarendon Press, Oxford (1995)Google Scholar
  27. [27]
    Lovett, A.A., Gatrell, A.C.: The geography of spina bifida in England and Wales. Transactions of the Institute of British Geographers (New Series) 13(3), 288–302 (1988)CrossRefGoogle Scholar
  28. [28]
    Luo, D., Wood, G.R., Jones, G.: Visualising contingency table data. The Australian Mathematical Society Gazette 31(4), 258–262 (2004)MathSciNetzbMATHGoogle Scholar
  29. [29]
    Maag, J.W., Behrens, J.T.: Epidemiologic data on seriously emotionally disturbed and learning disabled adolescents: Reporting extreme depressive symptomatology. Behavioral Disorders 15(1) (1989)CrossRefGoogle Scholar
  30. [30]
    Maron, M.: Threshold effect of eucalypt density on an aggressive avian competitor. Biological Conservation 136, 100–107 (2007)CrossRefGoogle Scholar
  31. [31]
    Norton, J., Lawrence, G., Wood, G.: The Australian public’s perception of genetically-engineered foods. Australasian Biotechnology pp. 172–181 (1998)Google Scholar
  32. [32]
    Pettifor, R.A.: Brood-manipulation experiments. I. The number of offspring surviving per nest in blue tits (Parus caeruleus). Journal of Animal Ecology 62, 131–144 (1993)CrossRefGoogle Scholar
  33. [33]
    Piantadosi, S.: Clinical Trials: A Methodologic Perspective, second edn. John Wiley and Sons, New York (2005)CrossRefGoogle Scholar
  34. [34]
    Siegel, R.L., Miller, K.D., Jemal, A.: Cancer statistics, 2015. CA: A Cancer Journal for Clinicians 65(1), 5–29 (2015)Google Scholar
  35. [35]
    Smith, P.T., Heitjan, D.F.: Testing and adjusting for departures from nominal dispersion in generalized linear models. Journal of the Royal Statistical Society, Series C 42(1), 31–41 (1993)CrossRefGoogle Scholar
  36. [36]
    Smyth, G.K.: Australasian data and story library (Ozdasl) (2011). URL http://www.statsci.org/data
  37. [37]
    Venables, W.N., Ripley, B.D.: Modern Applied Statistics with S, fourth edn. Springer-Verlag, New York (2002). URL http://www.stats.ox.ac.uk/pub/MASS4 CrossRefGoogle Scholar
  38. [38]
    Whittemore, A.S., Gong, G.: Poisson regression with misclassified counts: Applications to cervical cancer mortality rates. Journal of the Royal Statistical Society, Series C 40(1), 81–93 (1991)CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  • Peter K. Dunn
    • 1
  • Gordon K. Smyth
    • 2
  1. 1.Faculty of Science, Health, Education and EngineeringSchool of Health of Sport Science, University of the Sunshine CoastQueenslandAustralia
  2. 2.Bioinformatics DivisionWalter and Eliza Hall Institute of Medical ResearchParkvilleAustralia

Personalised recommendations