Introduction

The TNM staging system is maintained by the Union for International Cancer Control (UICC) and the American Joint Committee on Cancer (AJCC) as a tool for physicians to stage different types of cancer based on the extent of the tumor (T), the involvement of lymph nodes (N), and the presence of metastases (M). Currently, the TNM staging system is globally used for evaluating most malignancies, including gastric cancer. Since it was published in 1976, the TNM staging system remains the most widely implemented resource for cancer staging and served as a baseline for the National Comprehensive Cancer Network (NCCN) guidelines [1]. The previous updates to the TNM staging of gastric cancer were primarily based on patient data collected from the Western populations. However, the release of the eighth edition in 2016 included 25,000 cases of gastric cancer from Eastern and Western countries, thanks to a collaboration with the International Gastric Cancer Association (IGCA) [2].

One major highlight of the eighth edition is the addition of a separate staging system for stomach patients receiving neoadjuvant therapy (e.g., ypTNM staging) [3]. The pre-treatment extent of the disease may be clinically (cTNM) or pathologically (pTNM) determined and when neoadjuvant therapy is given before surgical intervention, the “y” prefix is added to the TNM category. The concept of y-stage was first described in the third English edition of the Japanese classification of gastric carcinoma [4], yet there were limited data supporting the prognostic value of the ypTNM staging platform. However, the proposed ypTNM staging system was supported by data of 683 patients from the American National Cancer Database (NCDB) [3]. Despite this, further validation of the new ypTNM staging system is not yet available, nor has any data been published about ypTNM staging in the Asian population.

To this end, we performed a retrospective analysis in accordance with the new edition, in the hopes to evaluate the prognostic value of the eighth edition with a special focus on the value of the ypTNM staging system in the Chinese population.

Methods

Patients

After obtaining approval from the Peking University Cancer Hospital Ethics Committee, we conducted a retrospective selection of patients who were diagnosed with gastric cancer from January 1, 2007, to January 1, 2015, at the Peking University Cancer Hospital and Institute. The inclusion criteria included: (1) proven diagnosis of gastric adenocarcinoma by endoscopic biopsy prior to surgery; (2) complete clinical and pathological data registered; (3) no signs of distant metastasis prior to the use of chemotherapy; (4) patients that had undergone neoadjuvant chemotherapy prior to surgery; and (5) the D2 gastrectomy was performed in our center. The exclusion criteria included: (1) proven diagnosis with gastrointestinal stromal tumor, lymphoma, neuroendocrine tumor, carcinoid tumors, soft tissue tumors or other non-gastric adenocarcinomas by endoscopic biopsy before surgery; (2) perioperative death within 1 month; (3) patients receiving chemotherapy for other neoplasms within 6 months before the gastric cancer diagnosis; (4) patients receiving neoadjuvant radiotherapy or targeted therapy before surgery; (5) remnant stomach cancer; and (6) patients receiving intra-peritoneal chemotherapy or hyperthermic intraperitoneal chemotherapy.

Neoadjuvant chemotherapy

Our center began implementing neoadjuvant chemotherapy in patients with locally advanced gastric cancer in 2001. The primary regimen included the selective use of fluorouracil, oxaliplatin, paclitaxel, and other drugs when necessary. The efficacy of neoadjuvant chemotherapy was evaluated every two cycles of treatment using enhanced computed tomography (CT) and ultrasound endoscopy, and the therapy was prematurely terminated in cases of the disease progression. In August 2011, the laparoscopic exploration and rapid cytology of the peritoneal lavage fluid became mandatory. In all resectable cases, elective gastrectomies were scheduled for 4–6 weeks after chemotherapy.

Data collection

The entry of clinical information on the eligible patients was completed by two independent researchers (Y.K.W. and F.S.) between September 2015 and March 2016 using Epidata v3.1 (EpiData Association, Odense, Denmark). These two investigators discussed and determined patient eligibility and data inconsistencies, while a third researcher was used if an agreement could not be reached.

For this study, the following clinical and pathological data were gathered: age, gender, height, weight, the Eastern Cooperative Oncology Group (ECOG) score, the American Society of Anesthesiologists (ASA) score, family history of cancer, operation time, resection type, multi-organ resection, blood loss, tumor location, T stage, total number of resected lymph nodes, total number of metastatic lymph nodes, pathologic type, differentiation grade, tumor diameter (in long and short axis), and vascular involvement. All included patients were staged according to the eighth AJCC guidelines.

Survival-related data, including overall survival (OS) and disease-free survival (DFS), were also collected. The OS was calculated from the time of the first chemotherapy cycle since diagnosis to the date of death (for any reason) or the most recent follow-up. DFS was determined from the time of the first chemotherapy cycle after diagnosis to any cause of recurrence or metastasis, the date of death if there was no clear recurrence of metastasis, or the last follow-up. The OS and DFS values were recorded in months. The follow-ups were conducted quarterly via telephone, and if the patient (or contact person) could not be reached in three attempts, the patient was considered to be lost.

Statistical analysis

Data were reported as mean and standard deviation (SD) or frequency and proportion as indicated. Three- and 5-year OS and DFS rates were calculated using the Kaplan–Meier method. We performed log-rank tests to compare the OS and DFS rates across the ypTNM stages, ypT stages, and ypN stages. We conducted univariate Cox regressions, followed by stepwise multivariate Cox models to determine the prognostic value of ypT and ypN stages on the OS and DFS while adjusting for covariates. Variables with p values less than 0.10 in the univariate analyses were included in the multivariate stepwise procedure. A stepwise variable selection was based on backward likelihood ratio tests with an entry level of 0.05 and removal of 0.10.

The current ypTNM staging system does not include ypT0 patients and does not have sub-stages within the ypTNM stage (e.g., Ia or Ib). For this reason, we explored if the current system can be further divided and extended. We divided the patients into 20 groups according to their tumor/node status by combining the five ypT categories (1/2/3/4a/4b) and the four ypN categories (0/1/2/3), from ypT1N0, ypT1N1, and onward to ypT4bN3. We calculated the hazard ratio of each combination compared to the base group within each ypTNM stage (i.e., ypT1N0 for stage I, ypT3N0 for stage II, ypT4aN1 for stage III). From this cluster analysis, we proposed a modified ypTNM staging system. We then used Harrell’s c-statistic to assess and compare the prediction ability of the current and modified staging systems.

In the overall comparisons, a p value of less than 0.05 was considered as statistically significant. In the pairwise comparisons, the p values were adjusted using the Bonferroni Correction. All reported p values were two-sided. The statistical analysis was performed with SPSS v25 (IBM Corp., Chicago, IL, USA) and Stata v14 (StataCorp, College Station, TX, USA).

Results

Table 1 displays the clinical, pathological, and surgical characteristics of the eligible patients. A total of 473 patients were included in this study, which encompassed 361 males (76.3%) and 112 females (23.7%). The average age was 59 years (26–86 years). The median number of total resected lymph nodes was 30 (15–91), and that of metastatic lymph nodes was two (0–68). The median follow-up duration was 37 months (3–106 months), and 194 patients died at the time of the last follow-up. The 3- and 5-year OS rates were 64.0 and 54.5%, respectively. The 3- and 5-year DFS rates were 59.3 and 51.8%, respectively.

Table 1 Patient, tumor and treatment characteristics

Survival by ypTNM, ypT, and ypN stages

According to the ypTNM staging system, there were 65 stage I cases, 126 stage II cases, and 251 stage III cases. Figure 1 shows the Kaplan–Meier OS and DFS curves by ypTNM stages. The 3-year OS rates for stage I, II, and III patients were 92.2, 77.9, and 46.0%, respectively (Chi square = 20.7, p < 0.001). while the DFS rates were 89.0, 75.5, and 39.6%, respectively (Chi square = 21.9, p < 0.001). The 5-year OS rates of stage I, II, III patients were 89.6, 65.5, and 36.5%, respectively (Chi square = 58.0, p < 0.001), while the DFS rates were 84.8, 64.5 and 33.2%, respectively (Chi square = 65.1, p < 0.001).

Fig. 1
figure 1

Post-neoadjuvant chemotherapy stage (ypTNM) and survival, stratified by ypStage groupings. a Overall survival (OS). The 5-year OS of pCR, stage I, II and III patients were 88.5, 89.6, 65.5, and 36.5%, respectively. No significant difference was determined between pCR and stage I patients (p = 0.514). Log-rank analysis was carried out in stage I, II and III cases, and significant differences in OS were determined (p < 0.001, respectively). b Disease free survival (DFS). The 3-year DFS of pCR, stage I, II and III patients were 92.2, 89.0, 75.5, and 39.6%, respectively. No significant difference was determined between pCR and stage I patients (p = 0.505). Log-rank analysis was carried out in stage I, II and III cases, and significant differences in DFS were determined in each comparison (p < 0.001, respectively). Statistical significance was defined as p < 0.017 after Bonferroni adjustment

The current ypTNM staging system does not include the ypT0 stage. There were 31 (6.6%) ypT0 patients in the current study, of which three were in the ypT0N1 stage and 28 with a pathological complete response (pCR). The three ypT0N1 patients were followed for 42, 61 and 75 months. At the final follow-up, the patients were alive without disease recurrence or metastasis. Log-rank tests showed that neither the OS (p = 0.575) nor DFS (p = 0.775) rates of the pCR patients were significantly different from that of the ypTNM stage I patients.

We performed log-rank tests to identify the differences in OS and DFS between adjacent ypT and ypN stage patients. No significant differences were found in the respective pairs of ypT stages, except for ypT4a and ypT4b (Chi square = 42.0, p = 0.001 < 0.0125 after Bonferroni correction for OS; Chi square = 35.0, p = 0.001 < 0.0125 after Bonferroni correction for DFS). For adjacent ypN stages, we observed a significant difference on the OS and DFS rates between ypN1 and ypN2 patients (Chi square = 10.9, p = 0.001 < 0.017 after Bonferroni correction for OS; Chi square = 35.0, p = 0.001 < 0.0125 after Bonferroni correction for DFS), and between ypN2 and ypN3 patients (Chi square = 8.7, p = 0.003 < 0.017 after Bonferroni correction for OS; Chi square = 12.2, p = 0.001 < 0.017 after Bonferroni correction for DFS). In Fig. 2, the survival curves of the ypN stages appear to be more diverged than that of the ypT stages (Fig. 3).

Fig. 2
figure 2

Post-neoadjuvant chemotherapy stage (ypTNM) and survival, stratified by ypT or ypN groupings. Statistical analysis (log-rank comparison) was performed in each pair of the adjacent groups. a Overall survival (OS) of respective ypT stage patients. No statistical significant difference was found between the adjacent groups (ypT1 vs. ypT2, p = 0.205; ypT2 vs. ypT3, p = 0.073; ypT3 vs. ypT4, p = 0.303). b Disease free survival (DFS) of respective ypT stage patients. No statistical significant difference was found between the adjacent groups (ypT1 vs. ypT2, p = 0.271; ypT2 vs. ypT3, p = 0.077; ypT3 vs. ypT4, p = 0.361). c OS of respective ypNstage patients. No statistical significant difference was found between ypN0 and ypN1 (p = 0.080), significant differences were determined between ypN1 and ypN2 (p = 0.001), and between ypN2 and ypN3 (p = 0.004). d DFS of respective ypN stage patients. No statistical significant difference was found between ypN0 and ypN1 (p = 0.036*), significant differences were determined between ypN1 and ypN2 (p = 0.002), and between ypN2 and ypN3 (p = 0.001). *Statistical significance was defined as p < 0.017 after Bonferroni adjustment

Fig. 3
figure 3

Subgroups of stage III and survival. Stage III patients were stratified into two subgroups: IIIa group, including 121 patients with stages of ypT4aN1M0, ypT3N2M0, or ypT4aN2M0; IIIb group, including 130 patients with stages of ypT2N3M0, ypT4bN0M0, ypT4bN1M0, ypT3N3M0, ypT4bN2M0, ypT4bN3M0, or ypT4aN3M0. The 3-year overall survival (OS) and disease free survival (DFS) were 60.4 and 55.9% in the IIIa group, and 32.5 and 24.1% in the IIIb group; the 5-year OS and DFS were 53.6 and 50.2% in the IIIa group, and 19.8 and 16.8% in the IIIb group. Survival analysis found significant worse survival IIIb patients than the IIIa ones with regard to OS (a HR 2.615, 95% CI 1.862–3.673, p < 0.001) and DFS (b HR 2.616, 95% CI 1.888–3.624, p < 0.001), respectively

ypT and ypN as a significant prognostic factor

Factors significantly associated with OS and DFS through univariate analysis are shown in Table 2. The following variables remained in the OS prediction model after the stepwise procedure: age, BMI, short tumor diameter, pathological type, ypT stage, ypN stage, and gastrectomy type. Adjusting for the covariates, ypT and ypN stages were significant prognostic factors of OS (both p < 0.001; Table 3). A similar analysis was performed for DFS, and likewise identified that the ypT stage and ypN stage were significantly associated with DFS (both p < 0.001; Table 3).

Table 2 Univariate analysis of overall survival and disease free survival (N = 442)
Table 3 Multivariate analysis of overall survival and disease free survival (N = 442)

Cluster analysis and modified ypTNM grouping

We performed a cluster analysis and found that patients in stage ypT4b or ypN3 had worse prognostic outcomes than the rest of stage III patients (Supplementary Table 4). No significant hazard ratios were found between stage I and II cancers. Therefore, we divide the stage III patients into two subgroups: group IIIa, including ypT4aN1M0, ypT3N2M0, and ypT4aN2M0; and group IIIb, including ypT2N3M0, ypT4bN0M0, ypT4bN1M0, ypT3N3M0, ypT4bN2M0, ypT4bN3M0, and ypT4aN3M0. Univariate Cox regression found significantly decreased survival for patients in stage IIIb than those in IIIa, with regard to OS (HR 2.62, 95% CI 1.86–3.67, p < 0.001) and DFS (HR 2.62, 95% CI 1.89–3.62, p < 0.001), which was also confirmed in the multivariate analysis when adjusted for the covariates listed in Table 3 (HR 2.24, 95% CI 1.57–3.21, p < 0.001 for OS; HR 2.30, 95% CI 1.62–3.26, p < 0.001 for DFS).

Next, we assessed the discrimination ability of the current and modified ypTNM (i.e., stage III divided into IIIa and IIIb) staging systems. The c-index of the modified ypTNM staging system, calculated from a univariate Cox model of OS, was significantly higher than that of the current ypTNM system (c-index = 0.708 vs. 0.657, p < 0.001). Similar results were reached using the multivariate Cox models adjusting for variables in Table 3 (c-index = 0.758 vs. 0.737 p = 0.005). Considering pCR and ypT0N1 were not included in the ypTNM staging system, we included ypT0 into stage I. After combining pCR and stage I patients, the c-index of the ypTNM staging (pCR and stages I/II/III) system was 0.681 and that of the modified ypTNM staging (pCR and stages I/II/IIIa/IIIb) system was 0.727. The difference of 0.046 was likewise significant (p < 0.001).

Discussion

Neoadjuvant therapy has been widely employed in the treatment of patients with gastric cancer. The eighth edition of TNM Staging Manual describes the staging system of patients with neoadjuvant therapy for the first time, which is undoubtedly a huge step forward, and has set unified standards and a solid foundation for patient evaluation following neoadjuvant chemotherapy. Furthermore, the manual used data from both Eastern and Western countries to revise the pTNM staging system. These intercontinental collaborations may improve the generalizability and applicability of the tumor staging system to different regions and ethnicities, as there have been discrepancies between the East and West regarding the staging and treatment of gastric cancer [1, 3, 5]. However, the ypTNM staging system included in the eighth edition was solely based on data obtained from the American population. To the best of our knowledge, this is the first study assessing the prognostic value of the ypTNM system in the Asian population.

We found that both ypT and ypN were independent risk factors for disease prognosis. However, ypT could not differentiate the patient survival. One potential explanation for this phenomenon may be that ypT can represent post-neoadjuvant tumor status, but not the tumor response to neoadjuvant chemotherapy, which may not be adequate to account for patient prognosis. Pathologic regression grade is one indicator to evaluate the efficacy of neoadjuvant chemotherapy. However, patients with high ypT stages do not necessarily respond poorly to chemotherapy. For instance, studies have found that up to 50% of patients in stage ypT3 and ypT4 can have residual tumor cells of less than 10%, whereas the percentage of residual tumor cells was not necessarily smaller than 10% in those patients with stage ypT1 [12]. Therefore, a few studies have proposed that the ypN stage, combined with pathologic regression grade, may be helpful in stratifying the prognosis of esophageal cancer patients who undergo preoperative chemotherapy or chemoradiotherapy [6, 13]. This proposition may also hold true for gastric cancer. Despite the significance of chemotherapy-induced tumor response to prognosis prediction, it has not been included in any of current cancer staging systems. This is partly due to the limited accuracy in evaluating malignancies with the current radiological and endoscopic examination strategies, which is a shortcoming acknowledged in the latest AJCC guidelines [3]. Nonetheless, one should note that our explanation is mainly speculation and the underlying mechanism of this phenomenon is yet to be determined. Further analysis in this regard should be encouraged.

The eighth edition of the TNM Staging Manual did not provide any subgrouping standards for ypTNM [3]. Based on our cluster analysis, we divided stage III patients into two subgroups, placing ypT4b and ypN3 into stage IIIb and the others into stage IIIa. The modified system had a significant better c-index than the current system. Another possible modification on current ypTNM would be to categorize ypT4b and ypN3 patients as stage IV. A revision like this is not unprecedented, as a similar change was made to stage IV in the eighth edition manual due to the poor prognosis of cT4b patients. However, we did not include stage IV cases, so we are unable to address this question. Nonetheless, our results do suggest the possibility of further subgrouping the current ypTNM system, which may achieve improved prognosis discrimination, which will assist physicians in treatment decision-making.

The new ypTNM staging system does not address pCR and ypT0N1 patients. In our study, there are 31 patients with ypT0 stage and the 5.9% pCR rate was comparable to the data from previous studies (3–15%) [14,15,16,17,18,19,20]. We found that the DFS and OS of pCR patients were similar to the ypStage I patients. Furthermore, the discrimination ability of the current ypTNM staging system improved when pCR patients were incorporated as stage I. Hence, it may be reasonable to treat patients as stage I, though larger scale analyses are needed to further inform the proper strategy for treating pCR patients.

There are several limitations embedded in this study. First, our study only included the resectable cases and did not contain stage IV patients. This limited our ability to examine the prognostic implication of ypTNM in patients with stage IV disease. Second, though our sample size is close to that used in the eighth edition manual, more studies with larger sample sizes are needed to evaluate and validate the new ypTNM staging system. Lastly, ypTNM staging is just one of many prognostic staging systems, and the disease outcomes are influenced by various prognostic factors, including the treatment regimen, targeted therapy, radiotherapy, postoperative adjuvant treatment regimen, and patient tolerance and sensitivity to chemotherapy. In this study, we only focused on the neoadjuvant chemotherapy to explore the clinical applications of ypTNM staging. We are still in the process of exploring their values through on-going clinical trials focusing on the other prognostic factors [21,22,23,24,25].

Conclusions

The eighth edition of the AJCC staging guidelines for gastric cancer provides unified standards for patient evaluation after neoadjuvant chemotherapy. In this retrospective study, we analyzed our database in accordance with the latest staging system with a special focus on the ypTNM staging system. We found significant survival differences between patients of different stages, which suggested that the generalized ypTNM staging system is effective for staging gastric cancers in the Asian population. We also found a better prognostic value of ypN compared with ypT stage and significantly worsened survival outcomes in ypT4b or ypN3 cases. Our data suggest that further analyses may facilitate subgrouping of each stage to reach a more accurate evaluation of prognosis.