Abstract
Having a general understanding of statistical tests , and how to interpret their results, is an important part of conducting, consuming, and applying evidence from the surgical literature. Unfortunately, statistical guides can often be focused on mathematical formulas, which maybe difficult for the average surgical researcher to understand. The goal of this chapter is to describe common statistical principles in English, leaving the mathematical symbolism to a minimum. This chapter utilizes a clinical trial from the surgical literature to explain simple statistical tests that are often seen in the surgical literature. These statistical tests are then reproduced using readily available statistical software, and compared to those results reported by the original authors. This chapter will introduce readers to common statistical ideas including, but not limited to chi-square tests, Student’s t test, Fisher exact tests, and sensitivity analysis . Additionally, readers will be introduced to the Cochrane risk of bias tool, and how this tool can be used to appraise a study. This chapter also acts as a reference to connect readers with additional resources, some more mathematical than others, that will offer additional information and different perspectives than this chapter.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Barberan-Garcia A, Ubré M, Roca J, Lacy AM, Burgos F, Risco R, et al. Personalised prehabilitation in high-risk patients undergoing elective major abdominal surgery: a randomized blinded controlled trial. Ann Surg. 2018;267(1):50–6.
Cadeddu M, Farrokhyar F, Levis C, Haines AT, Thoma A for the Evidence-Based Surgery Working Group. Users’ guide to the surgical literature; understanding confidence intervals. Can J Surg. 2012;55(3):207–11.
Higgins JPT, Altman DG, Gøtzsche PC, Jüni P, Moher D, Oxman AD, et al. The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials. BMJ. 2011;343:d5928.
Schunemann HJ, Guyatt GH. Commentary—goodbye M(C)ID! Hello MID, where do you come from? Health Serv Res. 2005;40:593–7.
Minitab Inc. Minitab [Internet]. [cited 2018 Sep 6]. Available from: http://www.minitab.com/en-us/.
van Buuren S. Flexible imputation of missing data, 2nd ed. Boca Raton, FL: Chapman & Hall/CRC; 2018.
Goodman SN. p values, hypothesis tests, and likelihood: implications for epidemiology of a neglected historical debate. Am J Epidemiol. 1993;137(5):485–96.
Fraser DAS, Reid N. Crisis in science? Or crisis in statistics! Mixed messages in statistics with impact on science. J Statist Res. 2016;48–50(1):1–9.
Wasserstein RL, Lazar NA. The ASA’s statement on p-values: content, process, and purpose. Am Stat. 2016;70(2):129–33.
Mark DB, Lee KL, Harrell FE Jr. Understanding the role of P values and hypothesis tests in clinical research. JAMA Cardiol. 2016;1(9):1048–54.
Ioannidis JPA. The proposal to lower P values thresholds to .005. JAMA. 2018;319(14):1429–30.
Wild CJ, Pfannkuch M, Regan M. Towards more accessible conceptions of statistical inference. J Roy Statist Soc A. 2011;174(Part 2):1–23.
Hurlbert SH, Lombardi CM. Lopsided reasoning on lopsided tests and multiple comparisons. Aust NZ J Stat. 2012;54(1):23–42.
Altman DG, Machin D, Bryant TN, Gardner MJ, editors. Statistics with confidence, 2nd ed. BMJ Books; 2011.
Streiner DL. One-tailed and two-tailed tests. Statistics commentary series; commentary #12. J Clin Psychopharmacol. 2015;35(6):628–9.
CRAN (Comprehensive R Archive Network). The Comprehensive R Archive Network [Internet]. [cited 2018 Sep 6]. Available from: http://CRAN.R-project.org.
Wheatley M. 5 key considerations when choosing open source statistical software. Data informed. Big data and analytics in the enterprise. http://data-informed.com/author/nadaeum/ [It worked for CHG.].
Edwards H. A review of probability and statistics apps for mobile devices. In: Maker K, de Sousa B, Gould R, editors. Sustainability in statistics education. Proceedings of the ninth international conference on teaching statistics (ICOTS July). Flagstaff AZ: Voorburg; 2014. p. 1–4.
McCullough BD, Yalata AT. Spreadsheets in the cloud—not ready yet. J Stat Softw. 2013;52(7). Available from: https://www.jstatsoft.org/article/view/v052i07/v52i07.pdf.
Melard G. On the accuracy of statistical procedures in Microsoft Excel 2010. Comput Stat. 2014;29(5):1095–128.
The R Foundation. The R project for statistical computing [Internet]. Available from: https://www.r-project.org. Accessed 24 Jul 2018.
Pearson RK. Exploratory data analysis using R. Boca Raton FL: Chapman & Hall/CRC; 2018.
Thieme N. R generation. The story of a statistical programming language that became a subcultural phenomenon. Significance. 2018;15(4):14–9.
Little RJ, D’Agostino R, Cohen ML, Dickersin K, Emerson SS, Farrar JT, et al. The prevention and treatment of missing data in clinical trials. NEJM. 2012;367:1355–60.
Mayo-Wilson E, Fusco N, Li T, Hong H, Canner JK, Dickersin K for the MUDS Investigators. Multiple outcomes and analyses in clinical trials create challenges for interpretation and research synthesis. J Clin Epidemiol. 2017;86:39–50.
Perneger TV. What’s wrong with Bonferroni adjustments? BMJ. 1998;316(7137):1236–8.
Cook RJ, Farewell VT. Multiplicity considerations in the design and analysis of clinical trials. J Roy Statist Soc A. 1996;159(1):93–110.
US Department of Health and Human Services, Food and Drug Administration, Center for Drug Evaluation and Research (CDER), Center for Biologics Evaluation and Research (CBER). Multiple endpoints in clinical trials [Internet]. [cited 2018 Sep 6]. Available from: https://www.fda.gov/downloads/drugs/guidancecomplianceregulatoryinformation/guidances/ucm536750.pdf.
Bland JM, Altman DG. Multiple significance tests; the Bonferroni method. Statistics notes. BMJ. 1995;310:170.
Livingston EH. Introducing the JAMA guide to statistics and methods. NEJM. 2014;312(1):35.
Aslan D, Sandberg S. Simple statistics in diagnostic tests. J Med Biochem. 2007;26:309–13.
Lydersen S. Statistical review: frequently given comments. Ann Rheum Dis. 2015;74:323–5.
Lang TA, Altman DG. Basic statistical reporting for articles published in biomedical journals: the “Statistical Analyses and Methods in the Published Literature” or “the SAMPL guidelines”. In: Smart P, Maisonneuve H, Polderman A, editors. Science editors’ handbook. European Association of Science Editors; 2013. p. 1–8.
Wan X, Wang W, Liu J, Tong T. Estimating the sample mean and standard deviations from the sample size, median, range and/or interquartile range. BMC Med Res Methodol. 2014;14:13.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendices
Appendix 1: Articles Identified in the Literature Search
-
1.
Barberan-Garcia A, Ubré M, Roca J, Lacy AM, Burgos F, Risco R, Momblan D, Balust J, Blanco I, Martinez-Palli G. Personalized prehabilitation in high-risk patients undergoing elective major abdominal surgery: a randomized blinded controlled trial. Ann Surg. 2018;267(1):50–6. https://doi.org/10.1097/00000000000002293.
-
2.
Berkel AEM, Bongers BC, vanKamp M-JS, Kotte H, Weltevreden P, deJongh FHC, Eijvogel MMM, Wymenga ANM, Bigirwamungu-Bargeman M, vander Palen J, van Det MC, van Meeteren NLU, Klasse JM. The effects of prehabilitation versus usual care to reduce postoperative complications in high-risk patients with colorectal cancer of dysplasia scheduled for elective colorectal resection: study protocol of a randomized controlled trial. BMC Gastroenterol. 2018;18:29. https://doi.org/10.1186/s12876-018-0754-6.
-
3.
Mayo NE, Feldman L, Scott S, Zavorsky G, Kim DJ, Charlebois P, Stein B, Carli F. Impact of preoperative change in physical function on postoperative recovery: argument supporting prehabilitation for colorectal surgery. Surgery. 2011;150(3):505–14. https://doi.org/10.1016/j.surg.2011.07.045.
-
4.
NCT02024776. Effectiveness of prehabilitation program for high-risk patients underwent abdominal surgery. [Registration]. Online Publication Date: 2018. Available from: http://clinicaltrials.gov/show/nct02024776.2013. [This is the registration for A1-1.]
-
5.
NCT02934230. The prehabilitation study: exercise before surgery to improve patient function in people. [Registration]. Online Publication Date: 2018. Available from: http://clinicaltrials.gov/show/nct02934230.2016.
Appendix 2: Sensitivity Analysis for Table 27.3 Using Chi-Square Test and Minitab 18
-
(a)
Assume outcomes for missing patients were all NO, for both arms
Comps\Arm | I, % | C, % | Total, % | Δ | Pearson X2, LR X2 | DF | P values | Report P2 |
---|---|---|---|---|---|---|---|---|
Yes | 19, 26 | 39, 55 | 58.40 | 29 | 12.499, 12.793 | 1 | 0.000, 0.000 | < 0.001, < 0.001 |
No | 54 | 32 | 86 | |||||
Total | 73 | 71 | 144 |
Comps = complications, Δ = %C − %I, LR = Likelihood Ratio , X2 = computed chi-square statistic , DF = degrees of freedom, P2 = two-tailed P value ; also for b, c, and d.
-
(b)
Assume outcomes for missing patients were all NO for I and Yes for C
Comps\Arm
I, %
C, %
Total, %
Δ
Pearson X2, LR X2
DF
P values
Report P2
Yes
19, 26
47, 66
66.46
40
23.394, 24.077
1
0.000, 0.000
< 0.001, < 0.001
No
54
24
78
Total
73
71
144
-
(c)
Assume outcomes for missing patients were all NO for C and Yes for I
Comps\Arm
I, %
C, %
Total, %
Δ
Pearson X2, LR X2
DF
P values
Report P2
Yes
30, 41
39, 55
69.48
14
2.760, 2.769
1
0.097, 0.096
No change
No
54
32
86
Total
73
71
144
-
(d)
Assume outcomes for missing patients were all Yes, for both arms
Comps\Arm
I, %
C, %
Total, %
Δ
Pearson X2, LR X2
DF
P values
Report P2
Yes
30, 41
47, 66
77.53
25
9.115, 9.219
1
0.003, 0.002
No change
No
43
24
67
Total
73
71
144
-
(e)
Sensitivity analysis summary of all four tables above: think checkerboard! Using a 3 × 3 board
I No | I Yes | ||
---|---|---|---|
C No | < 0.001, < 0.001, a | – | 0.097, 0.096, c |
– | – | – | |
C Yes | < 0.001, < 0.001, b | – | 0.003, 0.002, d |
Entries are P2 values, letters indicate the tables above, and dashes (–) indicate parts of the sensitivity that we did not compute. Notice that the bolded cells, a, b, and d are all less than α = 0.05, so are statistically significant, while c cell that is not bolded so is not statistically significant.
Appendix 3: Risk of Bias Scoring of the Barberan-Garcia et al. [1] Paper
Risk of Bias
A | Was the method of randomization adequate? | Yes |
B | Was the treatment allocation concealed? | No |
Was knowledge of the allocated interventions adequately prevented during the study? | ||
C | 1. Was the patient blinded to the intervention ? | No |
D | 2. Was the care provider blinded to the intervention? | No |
E | 3. Was the outcome assessor blinded to the intervention? | Unsure |
Were incomplete outcome data adequately addressed? | ||
F | 1. Was the dropout rate described and acceptable? | No, No |
G | 2. Were all randomized participants analyzed in the group to which they were allocated? | No |
H | Are reports of the study free of suggestion of selective outcome reporting? | Unsure |
Other sources of potential bias : | ||
I | 1. Were the groups similar at baseline regarding the most important prognostic indicators? | Unsure |
J | 2. Were co-interventions avoided or similar? | Unsure |
K | 3. Was the compliance acceptable in all groups? | Unsure |
L | 4. Was the timing of the outcome assessment similar in all groups? | Unsure |
M | Is there a serious and fatal flaw with this study? (focus on the impact of selection bias, information bias, reporting errors, and confounding) | Flawed |
Are other sources of potential bias unlikely? - funding bias, conflict of interest statement - outcome measure not valid | Unsure |
The Cochrane risk of bias tool is available from Higgins et al. [3].
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Goldsmith, C.H., Duku, E.K., Thoma, A., Murphy, J. (2019). Simple Statistical Tests and P Values. In: Thoma, A., Sprague, S., Voineskos, S., Goldsmith, C. (eds) Evidence-Based Surgery. Springer, Cham. https://doi.org/10.1007/978-3-030-05120-4_27
Download citation
DOI: https://doi.org/10.1007/978-3-030-05120-4_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05119-8
Online ISBN: 978-3-030-05120-4
eBook Packages: MedicineMedicine (R0)