Skip to main content

Simple Statistical Tests and P Values

  • Chapter
  • First Online:
Evidence-Based Surgery

Abstract

Having a general understanding of statistical tests , and how to interpret their results, is an important part of conducting, consuming, and applying evidence from the surgical literature. Unfortunately, statistical guides can often be focused on mathematical formulas, which maybe difficult for the average surgical researcher to understand. The goal of this chapter is to describe common statistical principles in English, leaving the mathematical symbolism to a minimum. This chapter utilizes a clinical trial from the surgical literature to explain simple statistical tests that are often seen in the surgical literature. These statistical tests are then reproduced using readily available statistical software, and compared to those results reported by the original authors. This chapter will introduce readers to common statistical ideas including, but not limited to chi-square tests, Student’s t test, Fisher exact tests, and sensitivity analysis . Additionally, readers will be introduced to the Cochrane risk of bias tool, and how this tool can be used to appraise a study. This chapter also acts as a reference to connect readers with additional resources, some more mathematical than others, that will offer additional information and different perspectives than this chapter.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Barberan-Garcia A, Ubré M, Roca J, Lacy AM, Burgos F, Risco R, et al. Personalised prehabilitation in high-risk patients undergoing elective major abdominal surgery: a randomized blinded controlled trial. Ann Surg. 2018;267(1):50–6.

    Article  Google Scholar 

  2. Cadeddu M, Farrokhyar F, Levis C, Haines AT, Thoma A for the Evidence-Based Surgery Working Group. Users’ guide to the surgical literature; understanding confidence intervals. Can J Surg. 2012;55(3):207–11.

    Google Scholar 

  3. Higgins JPT, Altman DG, Gøtzsche PC, Jüni P, Moher D, Oxman AD, et al. The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials. BMJ. 2011;343:d5928.

    Google Scholar 

  4. Schunemann HJ, Guyatt GH. Commentary—goodbye M(C)ID! Hello MID, where do you come from? Health Serv Res. 2005;40:593–7.

    Article  Google Scholar 

  5. Minitab Inc. Minitab [Internet]. [cited 2018 Sep 6]. Available from: http://www.minitab.com/en-us/.

  6. van Buuren S. Flexible imputation of missing data, 2nd ed. Boca Raton, FL: Chapman & Hall/CRC; 2018.

    Google Scholar 

  7. Goodman SN. p values, hypothesis tests, and likelihood: implications for epidemiology of a neglected historical debate. Am J Epidemiol. 1993;137(5):485–96.

    Article  CAS  Google Scholar 

  8. Fraser DAS, Reid N. Crisis in science? Or crisis in statistics! Mixed messages in statistics with impact on science. J Statist Res. 2016;48–50(1):1–9.

    Google Scholar 

  9. Wasserstein RL, Lazar NA. The ASA’s statement on p-values: content, process, and purpose. Am Stat. 2016;70(2):129–33.

    Article  Google Scholar 

  10. Mark DB, Lee KL, Harrell FE Jr. Understanding the role of P values and hypothesis tests in clinical research. JAMA Cardiol. 2016;1(9):1048–54.

    Article  Google Scholar 

  11. Ioannidis JPA. The proposal to lower P values thresholds to .005. JAMA. 2018;319(14):1429–30.

    Google Scholar 

  12. Wild CJ, Pfannkuch M, Regan M. Towards more accessible conceptions of statistical inference. J Roy Statist Soc A. 2011;174(Part 2):1–23.

    Google Scholar 

  13. Hurlbert SH, Lombardi CM. Lopsided reasoning on lopsided tests and multiple comparisons. Aust NZ J Stat. 2012;54(1):23–42.

    Article  Google Scholar 

  14. Altman DG, Machin D, Bryant TN, Gardner MJ, editors. Statistics with confidence, 2nd ed. BMJ Books; 2011.

    Google Scholar 

  15. Streiner DL. One-tailed and two-tailed tests. Statistics commentary series; commentary #12. J Clin Psychopharmacol. 2015;35(6):628–9.

    Google Scholar 

  16. CRAN (Comprehensive R Archive Network). The Comprehensive R Archive Network [Internet]. [cited 2018 Sep 6]. Available from: http://CRAN.R-project.org.

  17. Wheatley M. 5 key considerations when choosing open source statistical software. Data informed. Big data and analytics in the enterprise. http://data-informed.com/author/nadaeum/ [It worked for CHG.].

  18. Edwards H. A review of probability and statistics apps for mobile devices. In: Maker K, de Sousa B, Gould R, editors. Sustainability in statistics education. Proceedings of the ninth international conference on teaching statistics (ICOTS July). Flagstaff AZ: Voorburg; 2014. p. 1–4.

    Google Scholar 

  19. McCullough BD, Yalata AT. Spreadsheets in the cloud—not ready yet. J Stat Softw. 2013;52(7). Available from: https://www.jstatsoft.org/article/view/v052i07/v52i07.pdf.

  20. Melard G. On the accuracy of statistical procedures in Microsoft Excel 2010. Comput Stat. 2014;29(5):1095–128.

    Article  Google Scholar 

  21. The R Foundation. The R project for statistical computing [Internet]. Available from: https://www.r-project.org. Accessed 24 Jul 2018.

  22. Pearson RK. Exploratory data analysis using R. Boca Raton FL: Chapman & Hall/CRC; 2018.

    Google Scholar 

  23. Thieme N. R generation. The story of a statistical programming language that became a subcultural phenomenon. Significance. 2018;15(4):14–9.

    Google Scholar 

  24. Little RJ, D’Agostino R, Cohen ML, Dickersin K, Emerson SS, Farrar JT, et al. The prevention and treatment of missing data in clinical trials. NEJM. 2012;367:1355–60.

    Article  CAS  Google Scholar 

  25. Mayo-Wilson E, Fusco N, Li T, Hong H, Canner JK, Dickersin K for the MUDS Investigators. Multiple outcomes and analyses in clinical trials create challenges for interpretation and research synthesis. J Clin Epidemiol. 2017;86:39–50.

    Google Scholar 

  26. Perneger TV. What’s wrong with Bonferroni adjustments? BMJ. 1998;316(7137):1236–8.

    Article  CAS  Google Scholar 

  27. Cook RJ, Farewell VT. Multiplicity considerations in the design and analysis of clinical trials. J Roy Statist Soc A. 1996;159(1):93–110.

    Article  Google Scholar 

  28. US Department of Health and Human Services, Food and Drug Administration, Center for Drug Evaluation and Research (CDER), Center for Biologics Evaluation and Research (CBER). Multiple endpoints in clinical trials [Internet]. [cited 2018 Sep 6]. Available from: https://www.fda.gov/downloads/drugs/guidancecomplianceregulatoryinformation/guidances/ucm536750.pdf.

  29. Bland JM, Altman DG. Multiple significance tests; the Bonferroni method. Statistics notes. BMJ. 1995;310:170.

    Article  CAS  Google Scholar 

  30. Livingston EH. Introducing the JAMA guide to statistics and methods. NEJM. 2014;312(1):35.

    CAS  Google Scholar 

  31. Aslan D, Sandberg S. Simple statistics in diagnostic tests. J Med Biochem. 2007;26:309–13.

    Article  Google Scholar 

  32. Lydersen S. Statistical review: frequently given comments. Ann Rheum Dis. 2015;74:323–5.

    Article  Google Scholar 

  33. Lang TA, Altman DG. Basic statistical reporting for articles published in biomedical journals: the “Statistical Analyses and Methods in the Published Literature” or “the SAMPL guidelines”. In: Smart P, Maisonneuve H, Polderman A, editors. Science editors’ handbook. European Association of Science Editors; 2013. p. 1–8.

    Google Scholar 

  34. Wan X, Wang W, Liu J, Tong T. Estimating the sample mean and standard deviations from the sample size, median, range and/or interquartile range. BMC Med Res Methodol. 2014;14:13.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Charles H. Goldsmith .

Editor information

Editors and Affiliations

Appendices

Appendix 1: Articles Identified in the Literature Search

  1. 1.

    Barberan-Garcia A, Ubré M, Roca J, Lacy AM, Burgos F, Risco R, Momblan D, Balust J, Blanco I, Martinez-Palli G. Personalized prehabilitation in high-risk patients undergoing elective major abdominal surgery: a randomized blinded controlled trial. Ann Surg. 2018;267(1):50–6. https://doi.org/10.1097/00000000000002293.

  2. 2.

    Berkel AEM, Bongers BC, vanKamp M-JS, Kotte H, Weltevreden P, deJongh FHC, Eijvogel MMM, Wymenga ANM, Bigirwamungu-Bargeman M, vander Palen J, van Det MC, van Meeteren NLU, Klasse JM. The effects of prehabilitation versus usual care to reduce postoperative complications in high-risk patients with colorectal cancer of dysplasia scheduled for elective colorectal resection: study protocol of a randomized controlled trial. BMC Gastroenterol. 2018;18:29. https://doi.org/10.1186/s12876-018-0754-6.

  3. 3.

    Mayo NE, Feldman L, Scott S, Zavorsky G, Kim DJ, Charlebois P, Stein B, Carli F. Impact of preoperative change in physical function on postoperative recovery: argument supporting prehabilitation for colorectal surgery. Surgery. 2011;150(3):505–14. https://doi.org/10.1016/j.surg.2011.07.045.

  4. 4.

    NCT02024776. Effectiveness of prehabilitation program for high-risk patients underwent abdominal surgery. [Registration]. Online Publication Date: 2018. Available from: http://clinicaltrials.gov/show/nct02024776.2013. [This is the registration for A1-1.]

  5. 5.

    NCT02934230. The prehabilitation study: exercise before surgery to improve patient function in people. [Registration]. Online Publication Date: 2018. Available from: http://clinicaltrials.gov/show/nct02934230.2016.

Appendix 2: Sensitivity Analysis for Table 27.3 Using Chi-Square Test and Minitab 18

  1. (a)

    Assume outcomes for missing patients were all NO, for both arms

Comps\Arm

I, %

C, %

Total, %

Δ

Pearson X2, LR X2

DF

P values

Report P2

Yes

19, 26

39, 55

58.40

29

12.499, 12.793

1

0.000, 0.000

< 0.001, < 0.001

No

54

32

86

     

Total

73

71

144

     

Comps = complications, Δ = %C − %I, LR = Likelihood Ratio , X2 = computed chi-square statistic , DF = degrees of freedom, P2 = two-tailed P value ; also for b, c, and d.

  1. (b)

    Assume outcomes for missing patients were all NO for I and Yes for C

    Comps\Arm

    I, %

    C, %

    Total, %

    Δ

    Pearson X2, LR X2

    DF

    P values

    Report P2

    Yes

    19, 26

    47, 66

    66.46

    40

    23.394, 24.077

    1

    0.000, 0.000

    < 0.001, < 0.001

    No

    54

    24

    78

         

    Total

    73

    71

    144

         
  2. (c)

    Assume outcomes for missing patients were all NO for C and Yes for I

    Comps\Arm

    I, %

    C, %

    Total, %

    Δ

    Pearson X2, LR X2

    DF

    P values

    Report P2

    Yes

    30, 41

    39, 55

    69.48

    14

    2.760, 2.769

    1

    0.097, 0.096

    No change

    No

    54

    32

    86

         

    Total

    73

    71

    144

         
  3. (d)

    Assume outcomes for missing patients were all Yes, for both arms

    Comps\Arm

    I, %

    C, %

    Total, %

    Δ

    Pearson X2, LR X2

    DF

    P values

    Report P2

    Yes

    30, 41

    47, 66

    77.53

    25

    9.115, 9.219

    1

    0.003, 0.002

    No change

    No

    43

    24

    67

         

    Total

    73

    71

    144

         
  4. (e)

    Sensitivity analysis summary of all four tables above: think checkerboard! Using a 3 × 3 board

 

I No

 

I Yes

C No

< 0.001, < 0.001, a

0.097, 0.096, c

 

C Yes

< 0.001, < 0.001, b

0.003, 0.002, d

Entries are P2 values, letters indicate the tables above, and dashes (–) indicate parts of the sensitivity that we did not compute. Notice that the bolded cells, a, b, and d are all less than α = 0.05, so are statistically significant, while c cell that is not bolded so is not statistically significant.

Appendix 3: Risk of Bias Scoring of the Barberan-Garcia et al. [1] Paper

Risk of Bias

A

Was the method of randomization adequate?

Yes

B

Was the treatment allocation concealed?

No

Was knowledge of the allocated interventions adequately prevented during the study?

C

1. Was the patient blinded to the intervention ?

No

D

2. Was the care provider blinded to the intervention?

No

E

3. Was the outcome assessor blinded to the intervention?

Unsure

Were incomplete outcome data adequately addressed?

F

1. Was the dropout rate described and acceptable?

No, No

G

2. Were all randomized participants analyzed in the group to which they were allocated?

No

H

Are reports of the study free of suggestion of selective outcome reporting?

Unsure

Other sources of potential bias :

I

1. Were the groups similar at baseline regarding the most important prognostic indicators?

Unsure

J

2. Were co-interventions avoided or similar?

Unsure

K

3. Was the compliance acceptable in all groups?

Unsure

L

4. Was the timing of the outcome assessment similar in all groups?

Unsure

M

Is there a serious and fatal flaw with this study?

(focus on the impact of selection bias, information bias, reporting errors, and confounding)

Flawed

 

Are other sources of potential bias unlikely?

- funding bias, conflict of interest statement

- outcome measure not valid

Unsure

The Cochrane risk of bias tool is available from Higgins et al. [3].

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Goldsmith, C.H., Duku, E.K., Thoma, A., Murphy, J. (2019). Simple Statistical Tests and P Values. In: Thoma, A., Sprague, S., Voineskos, S., Goldsmith, C. (eds) Evidence-Based Surgery. Springer, Cham. https://doi.org/10.1007/978-3-030-05120-4_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-05120-4_27

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-05119-8

  • Online ISBN: 978-3-030-05120-4

  • eBook Packages: MedicineMedicine (R0)

Publish with us

Policies and ethics