The aim of rectal cancer treatment is to improve survival and prevent local recurrence, while limiting treatment-related morbidity and preserving bowel, sexual, and genitourinary function.1,2 Consequently, patients with locally advanced rectal cancer (LARC) generally undergo neoadjuvant chemoradiation (CRT) followed by surgery.3,4 This combined modality approach decreases recurrence rates and improves survival compared with surgery only.4,5 The most frequently used neoadjuvant treatment strategy for LARC is a combination of radiotherapy (25 × 2 Gy or 28 × 1.8 Gy) and fluoropyrimidine-based chemotherapy [e.g., capecitabine or 5-fluorouracil (5FU)]. Hereby 15–20% of LARC patients achieve a pathological complete response (pCR) in which no tumor is found in the surgical resection specimen.6,7,8

Unfortunately, 30% of patients who receive this treatment will still die within 5 years due to local or distant recurrence.9 However, patients with pCR after neoadjuvant therapy are reported to have better survival, lower local recurrence, and less distant failure rates.10 The observation of pCR after surgery has led to a paradigm shift in rectal cancer management, in which organ preservation has become an increasingly important endpoint after neoadjuvant treatment in combination with reduction of local recurrence and survival rates.9 Organ-preserving treatment strategies can be considered when a complete response is detected clinically, radiographically, and/or endoscopically before surgery [i.e., clinical complete response (cCR)]. This strategy may protect patients from surgery-associated morbidity and the associated impairment in quality of life.11,12 As such, patients with cCR following neoadjuvant treatment are increasingly being offered watch-and-wait regimens or organ-sparing strategies, such as local excision.13,14 To further increase the number of eligible patients for such organ preservation strategies, physicians are searching for (new) neoadjuvant treatments with higher organ-sparing potential than the current standard of care.

Previous studies suggested that treatment intensification, i.e., adding chemotherapy or dose-escalated radiotherapy to standard chemoradiation, might enhance rectum preservation and/or improve oncological outcomes.15 Theoretically, intensified treatment would further downstage the tumor and any nodal disease prior to surgery and/or target potential micrometastatic disease.4 On the contrary, others prefer a short-course (radiation) schedule over long-course chemoradiation, based on its lower toxicity rates, better compliance, and lower cost.16,17,18,19

The present systematic review and meta-analysis gives an overview of available neoadjuvant treatment strategies for LARC and analyzes how these perform in achieving pCR (as a surrogate endpoint for cCR) compared with the current standard of care in patients with locally advanced rectal cancer based on available evidence from randomized trials.

Methods

The present study is registered in the PROSPERO database under number CRD 42017058674.

Search Strategy

Pubmed, Embase, and Cochrane Central bibliographic databases were searched (last update June 20, 2019) for randomized controlled trials on neoadjuvant treatment for locally advanced rectal cancer, restricted to full text and English language. The search strategy, search syntax, and characteristics of excluded studies are presented in Supplementary Tables 1 and 2 (available online). Cross-referencing was performed.

Eligibility Criteria

Phase II–III randomized controlled trials (RCTs), conducted after the introduction of total mesorectal excision (TME) surgery in the 1980s,20 in which patients received neoadjuvant treatment for magnetic resonance imaging (MRI)-staged nonmetastatic LARC were included. LARC was defined as stage II–III (cT3–4N0 or T1–4N1–2) rectal cancer. All neoadjuvant treatment modalities that entailed systemic therapy and/or radiotherapy were eligible. Radiotherapy, delivered in either a short course or a long course, was considered suitable, also optionally accompanied by radiation dose escalation. Inclusion was restricted to studies using an interval of at least 4 weeks between end of neoadjuvant therapy and surgery. The primary outcome was pCR, defined as ypT0N0. Studies that did not report ypTN stage were excluded. Secondary outcomes were ≥ grade 3 toxicity [according to the Common Terminology Criteria for Adverse Events (CTCAE) version 3.0 or 4.0], surgical outcomes (complication rate and R0 resection rate), local recurrence (LR), disease free survival (DFS), and overall survival (OS). Administration of postoperative systemic therapy was not an exclusion criterion since this could not influence our primary outcome. Study selection was solely based on the primary outcome.

Study Selection

Identified studies were listed in EndNote (1988–2012 Thomson Reuters). Two authors (S.H. and J.B.) independently screened on title and abstract. Full-text reports were retrieved and examined for eligibility criteria. Studies that only partially fulfilled the eligibility criteria were excluded. Disagreements were resolved by discussion between the two raters. Duplicates were removed, and multiple reports of the same study were linked together. Lastly, the corresponding author of each included study was contacted to obtain additional information or information at individual patient level.

Risk of Bias Assessment

Risk of bias was assessed by the first author using the Cochrane risk-of-bias tool,21 including random sequence generation, allocation concealment, blinding of participants and personnel, blinding of outcome assessment, incomplete outcome data, selective reporting, and other biases. All studies were included in the analyses, irrespective of their risk of bias.

Data Extraction

From each included trial, information about trial characteristics (study year/duration and year and country of publication), methodology (phase II or III RCT, number of arms, and sample size), characteristics of study participants [clinical tumor and nodal stage, involvement of the mesorectal fascia (MRF), and distance from the anus in cm], characteristics of intervention [agent(s), (radiotherapy) dose, duration, and interval to surgery in weeks], and outcomes [pCR (ypT0N0) rate, ≥ grade 3 toxicity (CTCAE), percentage of patients who received complete dose chemotherapy, percentage of patients that proceeded to surgery, surgical complications, R0 resection rate, and oncological outcomes (LR, DFS, and OS)] was collected. Survival data are reported as 3-year cumulative incidence rates. If available from the report, hazard ratios (HR) are also presented.

Data Analysis

Four subgroups were created based on neoadjuvant treatment: multiagent chemoradiation (n = 9), induction chemotherapy (n = 5), consolidation chemotherapy (n = 2), and short-course radiotherapy and delayed surgery (SCRT-delay, n = 1). A systematic review of all included studies was performed. A quantitative meta-analysis on the studies that compared an intervention with standard fluoropyrimidine-based chemoradiation (25–28 × 1.8–2 Gy + capecitabine/5FU) was conducted to investigate their effect size. The Mantel–Haenszel random-effects model (REM) was applied, assuming that heterogeneity among studies was not a result of chance alone. Heterogeneity was expressed with I2.22 The pooled effect size was calculated from per-protocol data and is expressed as the odds ratio (OR) and its 95% confidence interval (CI).

All analyses were performed using Review Manager (RevMan), version 5.3 (The Nordic Cochrane Centre, The Cochrane Collaboration, 2014, Copenhagen, Denmark). Results were reported according the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) guidelines.23

Results

Eligible Studies

The literature search obtained 586 records after removal of duplicates, of which 526 records were excluded at title/abstract screening (Fig. 1). After full-text review, 17 articles met the inclusion and exclusion criteria and were included in the systematic review. Of those, 11 papers were included in the quantitative (meta)analysis. Four studies were excluded from the meta-analysis because these did not include a fluoropyrimidine-based (standard) CRT control arm. Two trials were excluded from quantitative analysis because these were the only ones in their subgroups.19, 24

Fig. 1
figure 1

PRISMA flowchart of included studies. Reasons for exclusion provided as Supplementary Data (available online)

Risk of Bias

In general, random sequence generation and allocation concealment were well performed and described (Fig. 2). Participants and personnel were not blinded in most studies. However, this was considered as low risk of bias since the primary outcome pCR was unlikely to be influenced by this. On the contrary, most studies lacked a blinded assessment of pCR, which could have increased the risk of detection (observer) bias.

Fig. 2
figure 2

Review authors’ judgements about each risk of bias item presented as percentages across all included studies: a risk of bias graph and b risk of bias summary

Characteristics of Included Studies

Ten phase II and nine phase III trials were conducted between 2001 and 2018 (Table 1). Interval to surgery varied from 4 to 12 weeks after end of neoadjuvant therapy. Detailed patient and tumor characteristics as well as an overview of administered therapy doses are presented in Supplementary Tables 3 and 4 (online accessible). The majority of patients had cT3N + tumors (Supplementary Table 3). MRF involvement was reported in eight studies and varied from 0 to 94.7%. Tumors located < 5 cm from the anus were present in 4–69.6% of included patients. The outcomes of included randomized controlled trials stratified by neoadjuvant treatment regimen are presented in Table 2.

Table 1 Study characteristics of randomized controlled trials stratified by neoadjuvant treatment regimen
Table 2 Overview of outcomes of included randomized controlled trials stratified by neoadjuvant treatment regimen

Fluoropyrimidine-Based CRT Versus Multiagent CRT

Nine trials compared fluoropyrimidine-based CRT with multiagent CRT. Six trials (two phase II trials and four phase III trials), including 2502 participants, entered the quantitative analysis. Overall, the pooled OR for pCR after multiagent CRT (n = 1248) versus standard CRT (n = 1254) was statistically significant at 1.46 (95% CI 1.18–1.79, I2 0%). Subgroup analysis revealed that the pooled OR resulting from phase II trials was not significant (OR 1.19, 95% CI 0.56–2.52, I2 34%), and the pooled OR from phase III trials remained statistically significant in favor of multiagent CRT (OR 1.50, 95% CI 1.20–1.87, I2 0%, Fig. 3a).

Fig. 3
figure 3

Pooled OR of pCR rates following multiagent chemoradiation, consolidation chemotherapy, and induction chemotherapy compared with standard fluoropyrimidine-based CRT

In five trials, the experimental group received a combination of fluoropyrimidine-based chemotherapy and oxaliplatin.25,26,27,28,29 In patients who received fluoropyrimidine-based CRT, ≥ grade 3 toxicity occurred in 10.7–40%. In the oxaliplatin CRT group, ≥ grade 3 toxicity rates were significantly higher (21.4–49.1%), but this did not affect the number of patients that completed neoadjuvant therapy or the percentage of participants that proceeded to surgery. Neoadjuvant fluoropyrimidine-based CRT resulted in pCR in 13.2–28.3% of patients. When oxaliplatin was added to this regimen, pCR rates were 17.4–28.4%. This was statistically significant in two trials.25,29 No differences were seen in R0 resections or surgical complications. Two trials compared 5FU-based CRT with multiagent CRT containing irinotecan.30,31,32 One trial described significantly less complete dose administration in the experimental group.30 No differences in pCR nor in surgical and survival outcomes were seen. One trial evaluated the effect of targeted therapy (bevacizumab) added to capecitabine-based CRT in 44 patients.24 Compared with patients who received capecitabine-based CRT (n = 46), no differences were seen in toxicity or treatment compliance. All but one patient (investigational group) underwent surgery after a median interval of 7.3 weeks. pCR was achieved in 10.9% of patients in the capecitabine group and 16.3% of patients in the bevacizumab group. This difference was not statistically significant. Survival data were not available.

For all multiagent comparisons, survival and recurrence data were available from five studies.26,27,29,30,31 No significant differences were reported in LR or OS. Three-year cumulative incidence rates for LR and OS in the monotherapy group varied from 4.6–6.1% to 86.4–88.0%, respectively. For the multiagent group, these rates were 2.9–4.4% and 88.3–90.3%, respectively. One study reported a significant better 3-year DFS after fluoropyrimidine plus oxaliplatin-based CRT (71.2% vs. 75.9%, HR 0.79, 95% CI 0.64–0.98, Table 2).29

Induction Chemotherapy

Five trials investigated the effect of CRT on pCR when this was preceded by induction chemotherapy. Induction chemotherapy plus CRT was compared with standard CRT in three phase II trials.33,34,35 In these trials, induction therapy consisted of multiagent chemotherapy (i.e., CAPOX or FOLFOX). Toxicity was higher after induction chemotherapy and resulted in significantly lower compliance to CRT in one trial.34,36 There were no differences in surgical outcomes or survival. There was no significant difference for pCR after induction chemotherapy (n = 123) versus standard CRT (n = 118) with a pooled OR of 1.20 (95% CI 0.62–2.35, I2 0%, Fig. 3b).

Two trials (GRECCAR-4 and CAO/ARO/AIO-12) in this subgroup were not used for quantitative analysis. The GRECCAR-4 trial randomized patients based on their response to induction FOLFIRINOX.37 Good responders either received additional capecitabine-based CRT or underwent surgery. Poor responders were randomized to either capecitabine-based CRT or capecitabine-based CRT with dose-escalated radiotherapy (60 Gy). The trial was stopped prematurely due to low accrual rates in the good-responders arm. In the good-responder arm (n = 20), pCR was achieved in 1 of 11 (9.1%) patients after FOLFIRINOX alone and in 11 of 19 (57.9%) patients after induction chemotherapy with FOLFIRINOX and capecitabine-based CRT. In the poor-responder group (n = 103), CRT with dose-escalated radiotherapy resulted in pCR in 9 of 51 (17.6%) patients compared with 7 of 52 (13.5%) patients in the standard-CRT group. This was not a significant difference. The higher radiation dose in the poor responders arm increased R0 resection from 83 to 88%. The CAO/ARO/AIO-12 trial compared CRT and consolidation chemotherapy with CRT and induction therapy.38 Acute ≥ grade 3 toxicity occurred in 21.8% and 35.9% patients after induction chemotherapy alone and CRT after induction chemotherapy, respectively, compared with 27.3% in participants undergoing CRT before consolidation chemotherapy and 20% during consolidation therapy. There were no differences in number of R0 resections. pCR was significantly higher in the consolidation group. Long-term survival outcomes were not available.

Consolidation Chemotherapy

Two RCTs (one phase II and one phase III trial) compared standard CRT with CRT followed by consolidation chemotherapy with either CAPOX or 5FU.39,40 Acute ≥ grade 3 toxicity was reported in one trial and did not differ between groups.39 R0 resections were achieved in 91.7–100% of patients after standard CRT and 88.6–92% of patients after CRT with consolidation CAPOX. This was a nonsignificant difference. The quantitative analysis for pCR in standard CRT (n = 76) versus CRT with consolidation CAPOX (n = 69) resulted in a nonsignificant difference with pooled OR of 1.17 (95% CI 0.33–4.23, I2 54%). On the subgroup analysis, the phase II trial was in favor of CRT with consolidation therapy (OR 2.58, 95% CI 0.61–10.99),41 and the phase III trial was in favor of standard CRT (OR 0.57, 95% CI 0.14–2.35).40 None of the ORs were statistically significant (Fig. 3c). Survival data were not reported.

Short-Course Radiotherapy and Delayed Surgery

One trial compared SCRT-delay with capecitabine-based CRT,19,42 resulting in a nonsignificant different pCR rate (4.4% vs. 11.1%, respectively). There were no differences in radicality or surgical complications. Five-year DFS was significantly worse after SCRT-delay compared with CRT (59% vs. 75.1%, HR 1.93, Table 2).

Discussion

This systematic review evaluates whether pCR rates are higher following alternative neoadjuvant treatment strategies as compared with standard neoadjuvant fluoropyrimidine-based chemoradiation. All included trials fail to deliver high-level evidence to show an improvement in pathological outcomes or survival compared with standard fluoropyrimidine-based CRT. The addition of oxaliplatin to fluoropyrimidine-based CRT might result in significantly more pCR, but at the expense of more ≥ grade 3 toxicity. Furthermore, this benefit does not translate into lower rates of local recurrence or improved overall survival. Other neoadjuvant treatment strategies, including consolidation/induction chemotherapy and short-course radiotherapy with delayed surgery, were not associated with improved pCR rates. None of the included trials reported benefit in local recurrence or overall survival.

pCR following neoadjuvant therapy has been associated with improved survival7 and may reflect the organ-sparing potential of a treatment protocol. To increase clinical response rates after neoadjuvant treatment and herewith enable rectum preservation, different intensification strategies have been investigated in phase I–II trials, e.g., multiagent CRT, targeted therapy, radiotherapy dose-escalation, or additional chemotherapy before or after CRT [total neoadjuvant treatment (TNT)]. On multivariable metaregression, the addition of a second concurrent chemotherapy agent was not associated with improved pCR rates.43 In accordance with our findings, previous meta-analyses showed that the addition of oxaliplatin to preoperative chemoradiotherapy improves pCR rate, decreases LR rate, and improves DFS, but significantly worsens toxicity.44,45 Also, no significant difference was found in the R0 resection rate, sphincter preservation rate, permanent stoma rate, postoperative complication, mortality, or overall survival.45 Dose-escalated radiotherapy could be associated with higher pCR rates.43,46 However, this has not yet been confirmed by a randomized controlled trial and could therefore not be further investigated in the present study.6 TNT might manage micrometastases, increase tumor regression that enhances R0 resection rates, and increase probabilities for organ preservation.38 A recent meta-analysis showed that patients who received TNT followed by surgery more often achieved pCR (OR 1.39, 1.08–1.81) and better DFS (HR 0.75, 0.52–1.07) and OS (HR 0.73 (0.59–0.9) than those who received CRT only. However, this analysis was largely based on nonrandomized comparative studies, and in subgroup analyses (prospective and retrospective series), there were no statistically significant differences between TNT and CRT arms.15 Several trials are still ongoing,47,48 but to date, the superiority of TNT over standard CRT remains inconclusive.

Targeted therapy is the latest development in rectal cancer management. Translational research has led to better understanding of molecular pathways and increased the interest in targeted therapy; For example, cancer cells can express epidermal growth factor receptor (EGFR), which stimulates cell proliferation, as well as vascular endothelial growth factor receptor (VEGFR), enabling vessel formation for growth,49,50 and EGFR signaling might promote resistance to radiotherapy. Retrospective analyses demonstrated worse DFS and lower pCR rates in patients with rectal tumors expressing EGFR, and elevated VEGF expression in tumors has been associated with inferior survival.49 The addition of cetuximab, a monoclonal antibody that can sensitize cells with overexpression of EGFR to radiotherapy,49 has been shown not to affect the pCR rate but to significantly improve OS.51 Bevacizumab, an anti-VEGF antibody reducing tumor vascular density,49, 50 did not improve pCR rates.24 However, these translational results are still preliminary, and clinical trials are needed.

In specific patient populations (elderly or frail) or in some countries, SCRT-delay is preferred over CRT because of its lower costs, better compliance, and less demanding nature.52 However, the use of SCRT remains elusive outside of Europe.9 Unsurprisingly, pCR rates are lower with this regimen based on its lower biological effective radiation dose compared with long-course chemoradiation. The largest randomized trial that investigated the effect of SCRT-delay was the Swedish Stockholm III trial.53 pCR was found in 10.4% of patients after SCRT-delay, and the risk of postoperative complications was significantly lower after SCRT-delay compared with SCRT and immediate surgery.18,54 However, this trial could not be included in the present study due to the lack of baseline tumor characteristics. Additionally, a combination of (induction/consolidation) chemotherapy and SCRT-delay could increase pCR rates and improve survival.42, 55,56 The results of a large RCT on this topic are still awaited.56 Therefore, at this moment, SCRT-delay only seems appropriate for frail LARC patients who are unfit to undergo CRT.

This is the first systematic review to provide an overview of the most widely used and available neoadjuvant treatment modalities investigated in a randomized trial. The evaluation of pathological outcomes in relation to toxicity and surgical and survival data provides more insight in the overall effect of these regimens. Nonetheless, this meta-analysis also encountered several limitations. First, only RCTs were included, whereas a lot of new interventions are trialed in prospective single-arm phase II trials. However, these trials are prone to selection bias as well as optimism in the intervention effect and often fail to demonstrate superiority in subsequent phase III trials.43,57, 58 Nonetheless, randomized phase II trials may also overestimate the treatment effect.59 We showed these differences between phase II and phase III trials in the analyses for multiagent CRT and for CRT plus consolidation chemotherapy. In addition, the RCT-limited analysis might represent a relatively well-conditioned study population,60 resulting in an underestimation of compliance and toxicity rates. Second, the generalizability might be limited due to strict MRI criteria and pCR definitions. Although MRI is considered to be the most optimal staging method,2,61 this may not be as widely available and easy accessible in all countries. In addition, the primary outcome was restricted to ypT0N0 because the interobserver agreement of other methods for tumor regression grading is low.62 The tumor regression grade (TRG) definition of pCR varies among approaches, and the application of a TRG is not recommended in the present TNM classification.62, 63 Moreover, subgroups were small, and secondary outcomes could not be extracted from all included trials, which might reduce power. Third, despite strict inclusion criteria and the use of a random-effects model, uncorrected heterogeneity in study protocols might still influence the pooled effect estimates.64 This is for instance reflected in the different intervals between the end of neoadjuvant treatment and surgery. A prolonged interval may increase pCR rates and recurrence-free survival without compromising surgical morbidity.65,66 As such, higher pCR rates after consolidation therapy compared with induction therapy may be the result of an increased interval between surgery and CRT rather than the therapy itself. And lastly, only those treatments compared with a similar baseline, namely standard fluoropyrimidine-based CRT, could be used in a formal meta-analysis. The opportunity to perform an extended network meta-analysis was explored but was not reliable due to the large heterogeneity in study design and the small amount of available RCTs.

The currently available data show that there is a wide variety of neoadjuvant treatment strategies available but no high-level evidence to show an improvement in pathological outcomes and survival compared with standard of care in terms of pCR achievement and organ-sparing potential. This is probably caused by the large number of confounding factors resulting from differences in diagnosis and treatment but, more importantly, also from differences in patient and tumor characteristics. In the era of personalized treatment, more high-level evidence on tumor characteristics, (pre)treatment response prediction, long-term quality of life, and oncological outcomes after different treatment modalities is needed to support optimal and individualized rectal cancer management. This requires new, efficient, and innovative research infrastructures, such as large prospective cohorts in which trials can be conducted according to the “Trials within Cohorts” (TwiCs) design.67,68 This enables investigation of novel prognostic and predictive factors in large populations as well as in small subgroups of patients and simultaneously provides the platform to conduct (partly) overlapping randomized trials with robust and validated analysis methods that provide clinically relevant answers that can be directly translated into changes for routine care.69