Simple Statistical Tests and P Values

Goldsmith, Charles H.; Duku, Eric K.; Thoma, Achilles; Murphy, Jessica

doi:10.1007/978-3-030-05120-4_27

Charles H. Goldsmith^5,6,7,
Eric K. Duku⁸,
Achilles Thoma^7,9 &
…
Jessica Murphy⁹

616 Accesses

Abstract

Having a general understanding of statistical tests , and how to interpret their results, is an important part of conducting, consuming, and applying evidence from the surgical literature. Unfortunately, statistical guides can often be focused on mathematical formulas, which maybe difficult for the average surgical researcher to understand. The goal of this chapter is to describe common statistical principles in English, leaving the mathematical symbolism to a minimum. This chapter utilizes a clinical trial from the surgical literature to explain simple statistical tests that are often seen in the surgical literature. These statistical tests are then reproduced using readily available statistical software, and compared to those results reported by the original authors. This chapter will introduce readers to common statistical ideas including, but not limited to chi-square tests, Student’s t test, Fisher exact tests, and sensitivity analysis . Additionally, readers will be introduced to the Cochrane risk of bias tool, and how this tool can be used to appraise a study. This chapter also acts as a reference to connect readers with additional resources, some more mathematical than others, that will offer additional information and different perspectives than this chapter.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Barberan-Garcia A, Ubré M, Roca J, Lacy AM, Burgos F, Risco R, et al. Personalised prehabilitation in high-risk patients undergoing elective major abdominal surgery: a randomized blinded controlled trial. Ann Surg. 2018;267(1):50–6.
Article Google Scholar
Cadeddu M, Farrokhyar F, Levis C, Haines AT, Thoma A for the Evidence-Based Surgery Working Group. Users’ guide to the surgical literature; understanding confidence intervals. Can J Surg. 2012;55(3):207–11.
Google Scholar
Higgins JPT, Altman DG, Gøtzsche PC, Jüni P, Moher D, Oxman AD, et al. The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials. BMJ. 2011;343:d5928.
Google Scholar
Schunemann HJ, Guyatt GH. Commentary—goodbye M(C)ID! Hello MID, where do you come from? Health Serv Res. 2005;40:593–7.
Article Google Scholar
Minitab Inc. Minitab [Internet]. [cited 2018 Sep 6]. Available from: http://www.minitab.com/en-us/.
van Buuren S. Flexible imputation of missing data, 2nd ed. Boca Raton, FL: Chapman & Hall/CRC; 2018.
Google Scholar
Goodman SN. p values, hypothesis tests, and likelihood: implications for epidemiology of a neglected historical debate. Am J Epidemiol. 1993;137(5):485–96.
Article CAS Google Scholar
Fraser DAS, Reid N. Crisis in science? Or crisis in statistics! Mixed messages in statistics with impact on science. J Statist Res. 2016;48–50(1):1–9.
Google Scholar
Wasserstein RL, Lazar NA. The ASA’s statement on p-values: content, process, and purpose. Am Stat. 2016;70(2):129–33.
Article Google Scholar
Mark DB, Lee KL, Harrell FE Jr. Understanding the role of P values and hypothesis tests in clinical research. JAMA Cardiol. 2016;1(9):1048–54.
Article Google Scholar
Ioannidis JPA. The proposal to lower P values thresholds to .005. JAMA. 2018;319(14):1429–30.
Google Scholar
Wild CJ, Pfannkuch M, Regan M. Towards more accessible conceptions of statistical inference. J Roy Statist Soc A. 2011;174(Part 2):1–23.
Google Scholar
Hurlbert SH, Lombardi CM. Lopsided reasoning on lopsided tests and multiple comparisons. Aust NZ J Stat. 2012;54(1):23–42.
Article Google Scholar
Altman DG, Machin D, Bryant TN, Gardner MJ, editors. Statistics with confidence, 2nd ed. BMJ Books; 2011.
Google Scholar
Streiner DL. One-tailed and two-tailed tests. Statistics commentary series; commentary #12. J Clin Psychopharmacol. 2015;35(6):628–9.
Google Scholar
CRAN (Comprehensive R Archive Network). The Comprehensive R Archive Network [Internet]. [cited 2018 Sep 6]. Available from: http://CRAN.R-project.org.
Wheatley M. 5 key considerations when choosing open source statistical software. Data informed. Big data and analytics in the enterprise. http://data-informed.com/author/nadaeum/ [It worked for CHG.].
Edwards H. A review of probability and statistics apps for mobile devices. In: Maker K, de Sousa B, Gould R, editors. Sustainability in statistics education. Proceedings of the ninth international conference on teaching statistics (ICOTS July). Flagstaff AZ: Voorburg; 2014. p. 1–4.
Google Scholar
McCullough BD, Yalata AT. Spreadsheets in the cloud—not ready yet. J Stat Softw. 2013;52(7). Available from: https://www.jstatsoft.org/article/view/v052i07/v52i07.pdf.
Melard G. On the accuracy of statistical procedures in Microsoft Excel 2010. Comput Stat. 2014;29(5):1095–128.
Article Google Scholar
The R Foundation. The R project for statistical computing [Internet]. Available from: https://www.r-project.org. Accessed 24 Jul 2018.
Pearson RK. Exploratory data analysis using R. Boca Raton FL: Chapman & Hall/CRC; 2018.
Google Scholar
Thieme N. R generation. The story of a statistical programming language that became a subcultural phenomenon. Significance. 2018;15(4):14–9.
Google Scholar
Little RJ, D’Agostino R, Cohen ML, Dickersin K, Emerson SS, Farrar JT, et al. The prevention and treatment of missing data in clinical trials. NEJM. 2012;367:1355–60.
Article CAS Google Scholar
Mayo-Wilson E, Fusco N, Li T, Hong H, Canner JK, Dickersin K for the MUDS Investigators. Multiple outcomes and analyses in clinical trials create challenges for interpretation and research synthesis. J Clin Epidemiol. 2017;86:39–50.
Google Scholar
Perneger TV. What’s wrong with Bonferroni adjustments? BMJ. 1998;316(7137):1236–8.
Article CAS Google Scholar
Cook RJ, Farewell VT. Multiplicity considerations in the design and analysis of clinical trials. J Roy Statist Soc A. 1996;159(1):93–110.
Article Google Scholar
US Department of Health and Human Services, Food and Drug Administration, Center for Drug Evaluation and Research (CDER), Center for Biologics Evaluation and Research (CBER). Multiple endpoints in clinical trials [Internet]. [cited 2018 Sep 6]. Available from: https://www.fda.gov/downloads/drugs/guidancecomplianceregulatoryinformation/guidances/ucm536750.pdf.
Bland JM, Altman DG. Multiple significance tests; the Bonferroni method. Statistics notes. BMJ. 1995;310:170.
Article CAS Google Scholar
Livingston EH. Introducing the JAMA guide to statistics and methods. NEJM. 2014;312(1):35.
CAS Google Scholar
Aslan D, Sandberg S. Simple statistics in diagnostic tests. J Med Biochem. 2007;26:309–13.
Article Google Scholar
Lydersen S. Statistical review: frequently given comments. Ann Rheum Dis. 2015;74:323–5.
Article Google Scholar
Lang TA, Altman DG. Basic statistical reporting for articles published in biomedical journals: the “Statistical Analyses and Methods in the Published Literature” or “the SAMPL guidelines”. In: Smart P, Maisonneuve H, Polderman A, editors. Science editors’ handbook. European Association of Science Editors; 2013. p. 1–8.
Google Scholar
Wan X, Wang W, Liu J, Tong T. Estimating the sample mean and standard deviations from the sample size, median, range and/or interquartile range. BMC Med Res Methodol. 2014;14:13.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
Charles H. Goldsmith
Department of Occupational Science and Occupational Therapy, Faculty of Medicine, The University of British Columbia, Vancouver, BC, Canada
Charles H. Goldsmith
Department of Health Research Methods, Evidence and Impact, McMaster University, Hamilton, ON, Canada
Charles H. Goldsmith & Achilles Thoma
Department of Psychiatry and Behavioral Neurosciences, McMaster University, Hamilton, ON, Canada
Eric K. Duku
Department of Surgery, Division of Plastic Surgery, McMaster University, Hamilton, ON, Canada
Achilles Thoma & Jessica Murphy

Authors

Charles H. Goldsmith
View author publications
You can also search for this author in PubMed Google Scholar
Eric K. Duku
View author publications
You can also search for this author in PubMed Google Scholar
Achilles Thoma
View author publications
You can also search for this author in PubMed Google Scholar
Jessica Murphy
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Charles H. Goldsmith .

Editor information

Editors and Affiliations

Department of Surgery, Division of Plastic Surgery, Department of Health Research Methods, Evidence and Impact (HEI), McMaster University, Hamilton, ON, Canada
Achilles Thoma
Department of Surgery, Division of Orthopaedic Surgery, Department of Health Research Methods, Evidence and Impact (HEI), McMaster University, Hamilton, ON, Canada
Sheila Sprague
Department of Surgery, Division of Plastic Surgery, McMaster University, Hamilton, ON, Canada
Sophocles H. Voineskos
Department of Health Research Methods, Evidence and Impact (HEI), McMaster University, Hamilton, ON, Canada
Charles H. Goldsmith

Appendices

Appendix 1: Articles Identified in the Literature Search

1.
Barberan-Garcia A, Ubré M, Roca J, Lacy AM, Burgos F, Risco R, Momblan D, Balust J, Blanco I, Martinez-Palli G. Personalized prehabilitation in high-risk patients undergoing elective major abdominal surgery: a randomized blinded controlled trial. Ann Surg. 2018;267(1):50–6. https://doi.org/10.1097/00000000000002293.
2.
Berkel AEM, Bongers BC, vanKamp M-JS, Kotte H, Weltevreden P, deJongh FHC, Eijvogel MMM, Wymenga ANM, Bigirwamungu-Bargeman M, vander Palen J, van Det MC, van Meeteren NLU, Klasse JM. The effects of prehabilitation versus usual care to reduce postoperative complications in high-risk patients with colorectal cancer of dysplasia scheduled for elective colorectal resection: study protocol of a randomized controlled trial. BMC Gastroenterol. 2018;18:29. https://doi.org/10.1186/s12876-018-0754-6.
3.
Mayo NE, Feldman L, Scott S, Zavorsky G, Kim DJ, Charlebois P, Stein B, Carli F. Impact of preoperative change in physical function on postoperative recovery: argument supporting prehabilitation for colorectal surgery. Surgery. 2011;150(3):505–14. https://doi.org/10.1016/j.surg.2011.07.045.
4.
NCT02024776. Effectiveness of prehabilitation program for high-risk patients underwent abdominal surgery. [Registration]. Online Publication Date: 2018. Available from: http://clinicaltrials.gov/show/nct02024776.2013. [This is the registration for A1-1.]
5.
NCT02934230. The prehabilitation study: exercise before surgery to improve patient function in people. [Registration]. Online Publication Date: 2018. Available from: http://clinicaltrials.gov/show/nct02934230.2016.

Appendix 2: Sensitivity Analysis for Table 27.3 Using Chi-Square Test and Minitab 18

(a)
Assume outcomes for missing patients were all NO, for both arms

Comps\Arm	I, %	C, %	Total, %	Δ	Pearson X², LR X²	DF	P values	Report P2
Yes	19, 26	39, 55	58.40	29	12.499, 12.793	1	0.000, 0.000	< 0.001, < 0.001
No	54	32	86
Total	73	71	144

Comps = complications, Δ = %C − %I, LR = Likelihood Ratio , X² = computed chi-square statistic , DF = degrees of freedom, P2 = two-tailed P value ; also for b, c, and d.

(b)
Assume outcomes for missing patients were all NO for I and Yes for C
Comps\Arm
I, %
C, %
Total, %
Δ
Pearson X², LR X²
DF
P values
Report P2
Yes
19, 26
47, 66
66.46
40
23.394, 24.077
1
0.000, 0.000
< 0.001, < 0.001
No
54
24
78

Total
73
71
144
(c)
Assume outcomes for missing patients were all NO for C and Yes for I
Comps\Arm
I, %
C, %
Total, %
Δ
Pearson X², LR X²
DF
P values
Report P2
Yes
30, 41
39, 55
69.48
14
2.760, 2.769
1
0.097, 0.096
No change
No
54
32
86

Total
73
71
144
(d)
Assume outcomes for missing patients were all Yes, for both arms
Comps\Arm
I, %
C, %
Total, %
Δ
Pearson X², LR X²
DF
P values
Report P2
Yes
30, 41
47, 66
77.53
25
9.115, 9.219
1
0.003, 0.002
No change
No
43
24
67

Total
73
71
144
(e)
Sensitivity analysis summary of all four tables above: think checkerboard! Using a 3 × 3 board

	I No		I Yes
C No	< 0.001, < 0.001, a	–	0.097, 0.096, c
	–	–	–
C Yes	< 0.001, < 0.001, b	–	0.003, 0.002, d

Entries are P2 values, letters indicate the tables above, and dashes (–) indicate parts of the sensitivity that we did not compute. Notice that the bolded cells, a, b, and d are all less than α = 0.05, so are statistically significant, while c cell that is not bolded so is not statistically significant.

Appendix 3: Risk of Bias Scoring of the Barberan-Garcia et al. [1] Paper

Risk of Bias

A	Was the method of randomization adequate?	Yes
B	Was the treatment allocation concealed?	No
Was knowledge of the allocated interventions adequately prevented during the study?
C	1. Was the patient blinded to the intervention ?	No
D	2. Was the care provider blinded to the intervention?	No
E	3. Was the outcome assessor blinded to the intervention?	Unsure
Were incomplete outcome data adequately addressed?
F	1. Was the dropout rate described and acceptable?	No, No
G	2. Were all randomized participants analyzed in the group to which they were allocated?	No
H	Are reports of the study free of suggestion of selective outcome reporting?	Unsure
Other sources of potential bias :
I	1. Were the groups similar at baseline regarding the most important prognostic indicators?	Unsure
J	2. Were co-interventions avoided or similar?	Unsure
K	3. Was the compliance acceptable in all groups?	Unsure
L	4. Was the timing of the outcome assessment similar in all groups?	Unsure
M	Is there a serious and fatal flaw with this study? (focus on the impact of selection bias, information bias, reporting errors, and confounding)	Flawed
	Are other sources of potential bias unlikely? - funding bias, conflict of interest statement - outcome measure not valid	Unsure

The Cochrane risk of bias tool is available from Higgins et al. [3].

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Goldsmith, C.H., Duku, E.K., Thoma, A., Murphy, J. (2019). Simple Statistical Tests and P Values. In: Thoma, A., Sprague, S., Voineskos, S., Goldsmith, C. (eds) Evidence-Based Surgery. Springer, Cham. https://doi.org/10.1007/978-3-030-05120-4_27

Download citation

DOI: https://doi.org/10.1007/978-3-030-05120-4_27
Published: 26 March 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05119-8
Online ISBN: 978-3-030-05120-4
eBook Packages: MedicineMedicine (R0)

Publish with us

Policies and ethics