Using evidence from different sources: an example using paracetamol 1000 mg plus codeine 60 mg
- 22k Downloads
Meta-analysis usually restricts the information pooled, for instance using only randomised, double-blind, placebo-controlled trials. This neglects other types of high quality information. This review explores using different information for the combination of paracetamol 1000 mg and codeine 60 mg in acute postoperative pain.
Randomised, double-blind, placebo-controlled trials of paracetamol 1000 mg and codeine 60 mg had an NNT of 2.2 (95% confidence interval 1.7 to 2.9) for at least 50% pain relief over four to six hours in three trials with 197 patients. Computer simulation of randomised trials demonstrated 92% confidence that the simulated NNT was within ± 0.5 of the underlying value of 2.2 with this number of patients. The result was supported a rational dose-response relationship for different doses of paracetamol and codeine in 17 additional trials with 1,195 patients. Three controlled trials lacking a placebo and with 117 patients treated with of paracetamol 1000 mg and codeine 60 mg had 73% (95%CI 56% to 81%) of patients with at least 50% pain relief, compared with 57% (48% to 66%) in placebo controlled trials. Six trials in acute pain were omitted because of design issues, like the use of different pain measures or multiple dosing regimens. In each paracetamol 1000 mg and codeine 60 mg was shown to be better than placebo or comparators for at least one measure.
Different designs of high quality trials can be used to support limited information used in meta-analysis without recourse to low quality trials that might be biased.
KeywordsPain Relief Paracetamol Acute Pain Acute Postoperative Pain Multiple Dose Regimen
The use of evidence-based approaches to therapeutic decision making can frequently raise the problem of how to make decisions when evidence is in limited supply. Often systematic reviews limit trial inclusion in an attempt to generate clinical homogeneity and allow sensible meta-analysis. The problem, though, is that other, useful, information is omitted. An example of this is the popular combination of paracetamol with codeine for treatment of acute and, more frequently, chronic pain. For the combination of 1000 mg paracetamol plus 60 mg codeine, for instance, there was information on only 127 patients in two placebo-controlled acute pain studies [1, 2].
Systematic reviews should seek unbiased evidence, which may limit the number of studies available for analysis. One approach to resolving the problem of apparently insufficient evidence may be to assess evidence of lower methodological quality. The amount of bias that could result from this approach would be a concern.
Schultz et al  demonstrated that lack of randomisation is the major source of bias in trials; studies which are not randomised can lead to overestimation of treatment effects by up to 40%. Restricting systematic reviews to include only randomised studies therefore makes sense for reviews of effectiveness. A classic example is a review of transcutaneous nerve stimulation for post-operative pain relief. Randomised studies overwhelmingly showed no benefit over placebo, while non-randomised studies did show benefit .
Non-blinded studies over-estimate treatment effects by about 17% . In a review of acupuncture for back pain , the inclusion of both blinded and non-blinded studies changed the overall conclusion. The blinded studies showed 57% of patients improved with acupuncture and 50% with control, a non-significant relative benefit of 1.2 (95% confidence interval 0.9 to 1.5). Five non-blinded studies showed a difference from control, with 67% improved with acupuncture and 38% with control. Here the relative benefit was significant at 1.8 (1.3 to 2.4).
Trials of poor reporting quality consistently overestimate the effect of treatment. Using a validated scoring system for methodological quality , studies of lower quality are likely to overestimate treatment effects [7, 8]. Other sources of bias may include small trials [9,10,11], covert duplication , and geography . Vickers and colleagues  showed that trials of acupuncture conducted in east Asia were universally positive, while those conducted in Australasia, north America or western Europe were positive only about half the time. Randomised trials of therapies other than acupuncture conducted in China, Taiwan, Japan or Russia/USSR were also overwhelmingly positive.
There is also the issue of the overall validity of a randomised trial. In some areas, like acute pain, valid methods for the conduct of clinical studies have been set out for many years, and are well understood . There are therefore many trials that are randomised and double blind, and conducted on patients with the same initial severity of pain under similar conditions and assessing identical or similar outcomes over the same time periods. Trials with low validity are more likely to have a positive result than those with higher validity , seen in acupuncture for head and neck pain.
It is obviously sensible to avoid bias where it is likely to occur. That means avoiding studies with designs or features where bias is possible. What other strategies remain when faced with apparently inadequate information from the literature? There are four that could be applied to the particular combination of paracetamol 1000 mg plus codeine 60 mg. These are:
1. The number of patients required for the number needed to treat (NNT) to be within ± 0.5 of the true value varies with the efficacy of an analgesic . We can therefore calculate how confident we can be in an NNT given the number of patients in the analysis, or calculate the number of patients needed to reach a required level of certainty.
2. We can assess evidence from other dose combinations of paracetamol and codeine and see whether there is a dose response relationship in placebo-controlled trials.
3. We can assess results for paracetamol 1000 mg plus codeine 60 mg against results for active comparators in high quality active controlled and placebo controlled studies. This can then be compared with systematic reviews done in similar clinical settings.
4. We can assess paracetamol 1000 mg plus codeine 60 mg in studies which, though randomised and double blind, had designs that did not allow their inclusion in a meta-analysis.
We applied these approaches to paracetamol 1000 mg plus codeine 60 mg through systematic review of the published literature.
Full reports of randomised controlled trials of oral paracetamol combined with codeine for acute postoperative pain were sought. Different search strategies were used to identify eligible reports in MEDLINE (1966 - March 2000), EMBASE (1980 - March 2000), the Cochrane Library (Issue 3, 2000) and the Oxford Pain Relief Database (1950 - 1994) . Reference lists of retrieved reports and reviews [1,2] were searched for additional trials. Abstracts, review articles and unpublished reports were not considered.
Criteria for inclusion for postoperative pain were: full journal publication, randomised controlled trials which included single dose treatment groups of oral paracetamol combined with codeine, double blind design, baseline postoperative pain of moderate to severe intensity, patients over 15 years of age, at least 10 patients per group, and the pain outcome measures of total pain relief (TOTPAR) or summed pain intensity difference (SPID) over 4-6 hours or sufficient data provided to allow their calculation. Pain measures allowed for the calculation of TOTPAR or SPID were a standard five point pain relief scale (none, slight, moderate, good, complete), a standard four point pain intensity scale (none, mild, moderate, severe) or a standard visual analogue scale (VAS) for pain relief or pain intensity. Each report was scored for quality using a three item, 1-5 score, quality scale .
For each trial, mean TOTPAR, SPID, VASTOTPAR or VASSPID values for each drug group were converted to %maxTOTPAR by division into the calculated maximum value . The proportion of patients in each treatment group who achieved at least 50%maxTOTPAR was calculated using valid equations [18,19,20]. The number of patients with >50%maxTOTPAR was then used to calculate relative benefit and NNT for paracetamol plus codeine versus placebo. Relative benefit and relative risk estimates were calculated with 95% confidence intervals (CI) using a fixed effects model . NNT with 95% confidence intervals was calculated by the method of Cook and Sackett . A statistically significant difference from control was assumed when the 95% confidence interval of the relative benefit did not include 1. Confidence intervals of proportions were calculated according to Morris and Gardner . Calculations were performed using Excel v 5.0 on a Power Macintosh G3. Simulations and calculations of probability were conducted as described in Moore et al, 1998 .
There were three placebo controlled [24,25,26] and three active controlled studies [27,28,29] involving paracetamol 1000 mg plus codeine 60 mg (Additional material: Table 3). Quality scores were 3 or better for all six. Information on other dose combinations is in Additional material: Table 4
Summary results for efficacy of paracetamol/codeine combinations from randomised, double-blind, placebo-controlled trials in acute pain
At least 50% pain relief
At least 50% pain
with paracetamol and
relief with placebo
NNT (95% CI)
1000 mg + 60 mg
4.8 (2.6 to 8.8)
2.2 (1.7 to 2.9)
600/650 mg + 60 mg
2.5 (2.0 to 3.1)
3.4 (2.8 to 4.3)
300 mg + 30 mg
3.2 (1.8 to 5.6)
5.6 (4.0 to 9.8)
Additional analyses undertaken to assess the reliability of this assessment of paracetamol 1000 mg plus codeine 60 mg included:
1 Assessment of Accuracy
How accurate is the NNT with currently observed group sizes and event rates? Using the values of the placebo event rate of 0.11 (9/83) and the paracetamol/codeine event rate of 0.57 (65/114), we simulated 10,000 trials with experimental group size of 114 and a control group size of 83 and counted the proportion in which the simulated NNT was within ± 0.5 of the true NNT of 1.9 . The results showed that 91.9% of simulated trials had an NNT of 1.8 to 2.6. The probability that the simulated NNT was within ± 0.5 of the underlying value of 2.2 was 0.92.
How much data would we require to be 95% sure that the observed NNT is within ± 0.5 of its true value? We answered this question in two ways, both of which consider the case of equal group sizes. Again the placebo rate of 0.11 and the paracetamol/codeine rate of 0.57 were used. The first method used the exact probability distribution . Using a group size of 130 we obtained a probability of 0.94, and with 140 we obtained 0.95. So the answer based on the exact distribution is between 130 and 140 per group. The second method used simulation. Using a group size of 130 the probability was 0.94, and with 140 it was 0.95. So the answer based on the simulation was again between 130 and 140 per group, a total of 260 to 280 patients.
2 Other dose Combinations of Paracetamol and Codeine
Information was available on 17 comparable studies [30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46] in which different dose combinations of paracetamol and codeine were available for at least two trials (Additional material: Table 4). Quality scores were 3 or better for all.
There were 13 trials comparing paracetamol 600/650 mg plus codeine 60 mg with placebo in acute pain. For paracetamol plus codeine 191/398 patients had at least 50% pain relief and 78/418 for placebo. For a single dose of paracetamol/codeine the proportion of patients with at least 50% pain relief was 48% (43% to 53%). The NNT for at least 50% pain relief over four to six hours was 3.4 (2.8 to 4.2).
There were four trials comparing paracetamol 300 mg plus codeine 30 mg with placebo (Table 1). For paracetamol plus codeine 56/215 patients had at least 50% pain relief and 14/164 for placebo. For a single dose of paracetamol/codeine the proportion of patients with at least 50% pain relief was 26% (20% to 32%). The NNT for at least 50% pain relief over four to six hours was 5.6 (4.0 to 9.8).
3 Active Controlled Trials
Summary results for efficacy of paracetamol/codeine combinations and active comparators in randomised, double-blind, active- and placebo-controlled trials in acute pain
Patients with at
least 50% pain
Paracetamol 300 mg + codeine 30 mg
25 (20 to 30)
Codeine 60 mg
33 (22 to 44)
Ibuprofen 400 mg
39 (28 to 50)
Paracetanol 600/650 mg
43 (38 to 49)
Paracetamol 1000 mg
42 (35 to 50)
Paracetamol 600 mg + codeine 60 mg
48 (43 to 53)
Diflusinal 500 mg
60 (50 to 70)
Ketorolac 10 mg
64 (55 to 73)
Diflusinal 1000 mg
66 (57 to 76)
Paracetamol 1000 mg + codeine 60 mg
66 (59 to 72)
Flurbiprofen 50 mg
70 (61 to 79)
Flurbiprofen 100 mg
76 (67 to 85)
Information on other paracetamol/codeine combinations and other active comparators from active and placebo controlled studies involving paracetamol and codeine combinations is shown in Table 2 and Figure 2. Again there was a rational dose response for paracetamol/codeine combinations, and paracetamol 1000 mg plus codeine 60 mg compared favourably with effective doses of non-steroidal anti-inflammatory drugs (NSAIDs) .
4 Trials Omitted from the Meta-Analysis
There were six trials in acute pain, which, though randomised and double blind, had design features that did not allow their inclusion in the meta-analysis ([47,48,49,50,51,52] Additional material: Table 5). The trials could not be included because they used different pain measures, over relatively short periods, or had multiple dosing regimens that obscured the efficacy of the first dose. Three had a placebo control. Quality scores were 3 or better for all six (Additional material: Table 5). The studies enrolled 907 patients, and there was information on 260 patients who took paracetamol 1000 mg plus codeine 60 mg with initial pain intensity of at least moderate intensity.
The outcomes of these six trials were not the same as those used for the meta-analysis. Despite this, the combination of paracetamol 1000 mg plus codeine 60 mg could be ascertained. In each of the studies paracetamol 1000 mg plus codeine 60 mg was shown to be better than placebo or comparators for at least one measure (Additional material: Table 5). Where effects of paracetamol 1000 mg plus codeine 60 mg on pain intensity was graphically displayed [49, 51, 52], large reductions in pain intensity were obtained within about one hour.
It can be the case that attempts to practice evidence based medicine come up against the problem that there is no evidence, or relatively little evidence. Reviews of paracetamol and codeine [1, 2] revealed that the combination of paracetamol 1000 mg plus codeine 60 mg was apparently highly effective, with an NNT of 1.9, but this was derived from only 127 patients in two trials. Using evidence from sources other than direct comparisons of paracetamol/codeine with placebo buttresses the conclusion of these earlier reviews.
Firstly we uncovered information about another trials that increased the number of patients to 197. The inclusion of the third trial barely changed the NNT from 1.9 to 2.2. Unpublished information from a fourth study is presently being sought.
We know that the amount of information required to be sure that an NNT is within ± 0.5 of the true value varies depending on the efficacy of treatment [11, 53]. Where efficacy is high, the requirement is for small numbers of patients. Where efficacy is low, the requirement is for large numbers of patients. For paracetamol 1000 mg plus codeine 60 mg, calculations show that with 197 patients we can be about 92% confident that the NNT obtained is within ± 0.5 of the true value, close to the conventionally accepted value of 95%. To be within the 95% limits we would need information from a further 60 to 80 patients.
Biological plausibility derives also from a substantial dose response relationship with the placebo controlled studies as the doses of paracetamol and of paracetamol and codeine increase. Seventeen additional studies of high quality (randomised, double blind, with clear entry criteria and outcomes measures) with 1195 additional patients demonstrated this (Table 1; Figure 1). Dose response was also shown when information from placebo and active controlled trials was pooled to examine the overall percentage of patients with at least 50% pain relief (Table 1).
This type of analysis additionally demonstrated that in trials examining paracetamol/codeine combinations, the paracetamol 1000 mg plus codeine 60 mg combination was much better than either paracetamol 1000 mg or codeine 60 mg alone (Table 2; Figure 2). There was clear separation between the lower confidence interval for paracetamol 1000 mg plus codeine 60 mg (59%) and the higher confidence interval for paracetamol 1000 mg (50%) and codeine 60 mg (44%). This confirms the added benefit seen with this combining codeine with paracetamol in a previous review .
Moreover, paracetamol 1000 mg plus codeine 60 mg demonstrated efficacy similar to effective doses of NSAIDs drugs tested in the selfsame trials (Table 2; Figure 2). There is now considerable and growing evidence on the effectiveness of NSAIDs as analgesics in acute pain [54, 55]. While ibuprofen 400 mg performed relatively poorly in two trials here, pooled information from large numbers of studies show it to be an effective analgesic with an NNT of 2.7 (2.5 to 3.0) . Ketorolac has also been shown to be effective orally; the 10 mg dose had an NNT of 2.6 (2.3 to 3.1) . In trials involving NSAIDs these effective analgesics are no better than paracetamol/codeine combinations.
Only six randomised comparisons of paracetamol 1000 mg plus codeine 60 mg with placebo or other analgesics without placebo were available for analysis using methods that have become traditional for analysis in acute pain . This number was doubled by examining studies that were randomised and double blind, but which used different pain intensity or relief outcomes, or other study designs. These additional six studies confirmed that paracetamol 1000 mg plus codeine 60 mg was an effective analgesic in acute pain.
Taken together, this approach shows two things. There is considerable supportive evidence for the conclusion that paracetamol 1000 mg plus codeine 60 mg is an effective analgesic with a low NNT. This support could be produced without resource to low quality studies, or those with known propensity for bias.
It will not always be possible to extend knowledge though high quality and essentially bias free trials in all circumstances: acute pain is particularly rich in trials, and especially those of high quality. Strategies like this will always be preferable. When placebo controlled trials are not available conclusions may have to be drawn from other trials. The techniques described here may be useful in circumstances where large numbers of high quality placebo controlled randomised trials are not available to us. Observational studies of good quality need not give results that are different from those of randomised trials, as testified by three recent studies [56,57,58]. But the potential for bias through lack of quality needs always to be addressed.
The study was supported with Pain Research funds and an unconditional educational grant from Schwarz Pharma. DG is the recipient of a Medical Research Council for Career Development Fellowship.
SPID Categorical scales use words to describe the magnitude of the pain. For analysis numbers are given to the verbal categories (for pain intensity, none=0, mild=1, moderate=2 and severe=3). Pain is assessed before the intervention is made and then on multiple occasions. The area under the time-analgesic effect curve for pain intensity, or the sum of pain intensity differences, is SPID. It is usually, but not always, measured over four to six hours.
TOTPAR Categorical scales also use words to describe the magnitude of pain relief. The commonest scale to measure pain relief is the five category scale (none=0, slight=1, moderate=2, good or lots=3 and complete=4). The initial pain relief is taken to be zero, and the pain relief is measured at intervals thereafter. Ideally the area under the time-analgesic effect curve for pain relief, or total pain relief, is TOTPAR.
VASSPID Visual analogue scales (VAS), 100 mm lines with left end labelled "least possible pain" and right end labelled "worst possible pain" can be used to measure pain intensity. Patients mark the line at the point which corresponds to their pain. The scores are obtained by measuring the distance between the least pain end and the patient's mark, usually in millimetres. The area under the time-analgesic effect curve for pain intensity, or the sum of pain intensity differences, is VASSPID.
VASTOTPAR Visual analogue scales (VAS), 100 mm lines with left end labelled "no relief of pain" and right end labelled "complete relief of pain", can be used to measure pain relief. The initial pain relief is taken to be zero, and thereafter patients mark the line at the point which corresponds to their pain relief. The scores are obtained by measuring the distance between the no relief end and the patient's mark, usually in millimetres. The area under the time-analgesic effect curve for pain relief, or the sum of total pain relief, is VASTOTPAR.
maxTOTPAR The maximum pain relief a patient can obtain is complete relief at the first measurement point, maintained for the whole period of observation. Thus using the categorical scale, the maximum relief would be 4, maintained for six hours, giving a maximum value of TOTPAR of 24. If a patient scored their pain relief in such a way as to give a TOTPAR of 15, this would be 15/24, or 63% of maxTOTPAR.
- 2.Moore A, Collins S, Carroll D, McQuay H, Edwards J: Single dose paracetamol (acetaminophen), with and without codeine, for postoperative pain (Cochrane Review). In: The Cochrane Library, Issue 4. 2000, . Oxford: Update Software.Google Scholar
- 9.Moore RA, Collins SL, McQuay HJ: Variation in the placebo response rate: implications for individual trials and consequences for meta-analysis. In Proceedings of the 8th World Congress on Pain, Progress in Pain Research & Management, Vol 8. Editors Jensen TS, Turner JA & Wiesenfeld-Hallin Z, IASP Press, Seattle,. 1997Google Scholar
- 14.McQuay HJ, Moore RA: An evidence-based resource for pain relief. Oxford: Oxford University Press,. 1998Google Scholar
- 17.Cooper SA: Single-dose analgesic studies: the upside and downside of assay sensitivity. In The design of analgesic clinical trials (Advances in Pain Research and Therapy Vol. 18). Edited by Max MB, Portenoy RK, and Laska EM. New York: Raven Press,. 1991, 117-124.Google Scholar
- 23.Morris JA, Gardner MJ: Calculating confidence intervals for relative risk, odds ratios and standardised ratios and rates. In Statistics with confidence -confidence intervals and statistical guidelines. Edited by Gardner, MJ and Altman DG. London: British Medical Journal,. 1995, 50-63.Google Scholar
- 53.Moore RA: Understanding clinical trials: what have we learned from systematic reviews?. In Progress in Pain Research & Management. Edited by Devor M, Rowbotham MC, Weisenfeld-Hallin Z. IASP Press, Seattle,. 2000, 757-770.Google Scholar
- 58.MacLehose RR, Reeves BC, Harvey IM, Sheldon TA, Russell IT, Black AMS: A systematic review of comparisons of effect sizes derived from randomised and non-randomised studies. Health Technology Assessment. 2000, 4: 34-Google Scholar
- The pre-publication history for this article can be accessed here:http://www.biomedcentral.com/content/backmatter/1471-2288-1-1-b1.pdf
This article is published under license to BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.