Introduction

Gastric mesenchymal tumors (GMTs) are incidentally discovered during upper endoscopy for an unrelated condition, especially during national gastric cancer screening programs in Korea, and appear as firm, protruding subepithelial tumors; however, larger tumors occasionally can cause bleeding and obstruction [1]. Histopathologically, most of these tumors are completely or partially composed of spindle cells and display smooth muscle or nerve sheath differentiation. Most GMTs are gastrointestinal stromal tumors (GISTs) derived from the interstitial cells of Cajal [1,2,3]. GISTs are at risk of metastatic relapse, especially in the peritoneum and liver, even after surgery for localized diseases [4, 5]. Therefore, all GISTs are considered potentially malignant and are recommended to be resected if they are ≥ 1 cm in size [5,6,7].

In clinical practice, differentiation of GISTs from benign mesenchymal tumors, such as leiomyomas or schwannomas, is essential for appropriate clinical management. Endoscopic ultrasonography (EUS) is a valuable imaging tool for the evaluation of GMTs because it enables the demonstration of hypoechoic lesions that are contiguous with the fourth hypoechoic (muscularis propria) layer of the normal gastric wall [8,9,10]. Although there have been several studies about differentiation between GISTs and benign mesenchymal tumors using EUS [10, 11], the results are not consistent. Furthermore, there are limitations in the analysis of the characteristic features of such tumors due to poor interobserver agreement as a result of subjective interpretation of EUS images [12, 13].

According to recent guidelines for subepithelial tumors, when GMTs smaller than 2 cm are detected, they can usually be followed by periodic endoscopy or EUS once or twice a year until the tumors increase in size or become symptomatic, even if they are diagnosed as GISTs later on [14,15,16]. When GMTs are 2–5 cm in size, the decision-making process can be shared with patients regarding whether to make a histopathological diagnosis (for example, by EUS-guided fine-needle aspiration/biopsy [EUS-FNA/B]) or whether the patient should undergo further treatment [7, 15]. It is recommended that GMTs > 5 cm should be surgically resected and subjected to pathological examinations [14, 15]. Therefore, if more detailed information on the malignant potential (i.e., the possibility of GIST) of GMTs measuring 2–5 cm can be obtained from patients’ data, such as age, sex, and EUS features, these tumors can be resected endoscopically or surgically, or followed up without further invasive procedures to obtain tissue specimens, such as EUS-FNA/B or deep biopsy via endoscopic submucosal dissection [17].

Digital images consist of pixels, which are the basic elements that compose a two-dimensional picture. In digital image analysis, the distribution and spatial variation of pixels is computed using texture analysis to extract useful data, thus the digital image analysis provides objective information on EUS images, which can lessen the poor inter- and intra-observer agreements in the interpretation of EUS images. Previously, we reported the usefulness of digital image analysis of EUS images in GMTs [18]. Therefore, this study aimed to analyze differences in the clinicopathological characteristics and digital image analysis of EUS images according to the histopathology in GMTs measuring 2–5 cm, and to develop a scoring system to predict GISTs.

Methods

Subjects

We retrospectively analyzed patients who underwent both EUS and surgical resection for GMTs such as GIST, leiomyoma, or schwannoma, at the Pusan National University Hospital (Busan, Korea) between January 2007 and June 2018. We identified 153 patients; of them, 40 patients were excluded according to the size criteria (≤ 2 cm in 18 patients and > 5 cm in 22 patients). Thus, we included 103 patients who underwent both EUS and surgical resection for GMTs measuring 2–5 cm. This study was reviewed and approved by the Pusan National University Hospital Institutional Review Board (IRB number, H-1808-016-070).

Endoscopic ultrasonography

EUS was performed using a radial-scanning echoendoscope (GF-UM2000; Olympus, Tokyo, Japan) at 7.5 MHz. All examinations were performed under intravenous conscious sedation (midazolam with or without propofol). The tumor was scanned after filling the stomach with 400–600 mL of deaerated water. At least ten still EUS images were obtained for each lesion, and these images were saved digitally in Windows bitmap format.

EUS images were reviewed by a single experienced endosonographer (GH Kim) who was blinded to the final diagnosis. Only one still EUS image of the highest quality was selected for each lesion for further digital image analysis, which was performed on a standard desktop computer.

Digital image analysis

EUS can have different brightness histograms depending on various contrasts during a real examination. As stated in our previous study [18], a standardization process was performed using the brightness values of the anechoic center and outer hyperechoic rim of the echoendoscope to minimize these differences. Image revision with histogram smoothing was also necessary to obtain better contrast because the original EUS image might be skewed by the brightness of the histogram and, therefore, might not be useful for analyzing the tumor area. Then we applied an edge linking method for all edge pixels repetitively to produce an edge. Edge linking connected and recorded all values, satisfying formula (1) in a 3 × 3 area to current pixel:

$$|\nabla P({x_i},{y_i}) - \nabla P({x_{i+1}},{y_{i+1}})| \leq {\text{Th}}$$
(1)

The threshold (Th) of formula (1) was set at 30, based on our preliminary study [18]. Then we applied binarization, labeling, and noise removal using morphologic information to extract the anechoic center of the echoendoscope. The outer hyperechoic rim of the echoendoscope was extracted as the area that was brighter than the neighboring pixels. Finally, formula (2) was applied to complete the standardization process:

$$\begin{aligned} & {\text{StandardGray}}={\text{(255}} - {\text{RimGray)}} \times \frac{{1+({\text{255}} - {\text{RimGray}})}}{{{\text{CenterGray}}}} \\ & {\text{If}}\;({\text{CenterGray}}<X<{\text{StandardGray}}),{\text{then}}\;X=\frac{{{\text{StandardGray}}}}{{{\text{StandardGray}} - {\text{CenterGray}}}} \times (X - {\text{StandardGray}}) \\ & {\text{Else}}\;{\text{if (StandardGray}}<X<{\text{EdgeGray}}),{\text{then}}\;X={\text{StandardGray}}+\frac{{{\text{255}} - {\text{StandardGray}}}}{{{\text{RimGray}} - {\text{StandardGray}}}} \times (X - {\text{StandardGray}}) \\ \end{aligned}$$
(2)

In formula (2), CenterGray and RimGray denote the brightness values of the anechoic center and outer hyperechoic rim of the echoendoscope, respectively; StandardGray denotes a brightness value to differentiate the anechoic center from the outer hyperechoic rim.

From the standardized image, a region of interest (ROI) was selected by an experienced endosonographer (GH Kim) for tumor analysis. The above method provides brightness information, including the minimum, maximum, mean (Tmean), standard deviation (TSD), median, and interquartile values (Fig. 1).

Fig. 1
figure 1

An example of digital EUS image analysis. From the standardized image, a region of interest (ROI) is selected by an experienced endosonographer for tumor analysis. The final results for the ROI are expressed in the bottom histogram. The mean (Tmean) and standard deviation (TSD) of the brightness values are 76 and 26.86, respectively

Histopathology

The tumors were immunohistochemically classified into leiomyoma, schwannoma, or GIST [3, 19]. Leiomyoma was defined as a desmin-positive and c-kit (CD117)-negative tumor, schwannoma as an S-100-positive and c-kit-negative tumor, and GIST as a c-kit, DOG-1, or CD34-positive tumor. GISTs were divided into four groups in accordance with the consensus meeting report at the National Institutes of Health [6]. According to the malignant potential of GMTs, tumors were classified largely into two groups: non-GIST (leiomyoma and schwannoma) and GIST groups.

Statistical analysis

Data are expressed as mean ± SD. To identify the variables that were significantly associated with GISTs, the clinicopathological and EUS data were initially evaluated with χ2 test, Fisher’s exact test, and Student’s t test. We subsequently entered the candidate variables into a logistic regression analysis. Because we attempted to make a simple and practical scoring system (GIST-predicting scoring system, GIST-PSS) to distinguish GISTs from non-GISTs, clinicopathological and EUS data that were independently and statistically associated with GISTs were selected as examination items for GIST-PSS. The relative weighting of each GIST-PSS variable was based on its β-coefficient in the logistic regression analysis. Multivariate comparisons were expressed as odds ratios (OR) with 95% confidence intervals (CI). A receiver operating characteristic (ROC) curve was applied to find the best sensitivity and specificity cut-off values of age, Tmean, TSD, and GIST-PSS for differentiating GISTs from non-GISTs. All statistical analyses were performed using IBM SPSS version 23.0 for Windows (IBM Co., Armonk, New York, USA). Differences were considered statistically significant when P < 0.05.

Results

Clinicopathological characteristics of patients with gastric mesenchymal tumors

The baseline clinicopathological characteristics of 103 patients who underwent EUS and surgical resection for GMTs are summarized in Table 1. Of the 103 patients, 42 were males and 61 were females, with a mean age of 57.6 years (range 28–87 years). Most tumors (61/103, 59.2%) were located at the upper third of the stomach; surface ulceration was found in 25 tumors. The mean tumor size was 31.6 mm (range 21–50 mm). The final pathological diagnoses of the tumors were leiomyoma in 15 cases, schwannoma in 14, and GIST in 74. When the GISTs were classified according to histologic risk classification, 48 tumors were grouped as low risk, 17 as intermediate risk, and 9 as high risk.

Table 1 Baseline clinicopathological characteristics of 103 patients with gastric mesenchymal tumors

Univariate analysis for factors predicting gastrointestinal stromal tumor

Patients in the GIST group were significantly older than those in the non-GIST group (50.6 ± 10.8 years vs. 60.3 ± 11.1 years, P < 0.001). Patient sex, tumor location, and presence of surface ulceration did not differ between the two groups (Table 2). In all EUS images, the Tmean and TSD were calculated successfully after post-standardized image analysis. The Tmean, which is indicative of echogenicity, was significantly higher in the GIST group than in the non-GIST group (88.1 ± 25.3 vs. 57.3 ± 21.9, P < 0.001). The TSD, which is indicative of heterogeneity, was also significantly higher in the GIST group than in the non-GIST group (28.1 ± 7.5 vs. 24.5 ± 4.7, P = 0.016). However, there was no significant difference in the Tmean or TSD between leiomyoma and schwannoma in the non-GIST group (Supplementary Table 1). When the GISTs were classified into benign or malignant groups according to histologic risk classification, 48 cases were grouped as benign GISTs (low risk, 48 cases) and 26 cases as malignant GISTs (intermediate risk, 17 cases; high risk, 9 cases). There was no difference in the Tmean or TSD between benign and malignant GISTs (88.1 ± 24.0 vs. 88.0 ± 28.0, P = 0.992; 27.1 ± 6.8 vs. 30.0 ± 8.4, P = 0.106, respectively).

Table 2 Comparison of clinicopathological and EUS characteristics between gastrointestinal stromal tumor (GIST) and non-GIST groups

An ROC curve was created to identify the best sensitivity and specificity cut-off values of age, Tmean, and TSD for differentiating GISTs from non-GISTs (Supplementary Fig. 1). The sensitivity and specificity were almost optimized when the critical values of age, Tmean, and TSD were 57.5 years, 67.0, and 25.6, respectively. The sensitivity, specificity, and accuracy of age ≥ 58 years for predicting GIST in a given tumor was 62.2% (95% CI 55.5–67.0%), 75.9% (95% CI 58.9–88.3%), and 66.0% (95% CI 56.5–73.0%), respectively (Supplementary Table 2). The sensitivity, specificity, and accuracy of Tmean, and TSD for predicting GIST were 81.1% (95% CI 74.8–85.5%), 79.3% (95% CI 63.2–90.5%), and 80.6% (95% CI 71.5–86.9%), and 55.4% (95% CI 48.7–60.9%), 69.0% (95% CI 51.8–83.0%), and 59.2% (95% CI 49.6–67.1%), respectively.

Multivariate analysis for factors predicting gastrointestinal stromal tumor

Multivariate logistic regression analysis revealed age ≥ 58 years and Tmean ≥ 67 were relevant independent factors predicting GIST (Table 3). The possibility of GIST in a given tumor significantly increased in patients ≥ 58 years old (OR 6.600, 95% CI 1.945–22.403; P = 0.002) and in tumors with Tmean ≥ 67 (OR 16.184, 95% CI 4.951–52.897; P < 0.001). TSD ≥ 26 was associated with a possibility of GIST, but did not reach statistical significance (OR 0.947, 95% CI 0.794–8.359; P = 0.115).

Table 3 Multivariate logistic regression analysis for predicting gastrointestinal stromal tumor in gastric mesenchymal tumors

Establishment of gastrointestinal stromal tumor-predicting scoring system

The GIST-PSS was created based on the above results. One point was given for TSD ≥ 26, 2 for age ≥ 58 years, and 3 for Tmean ≥ 67 based on the β-coefficient. Thus, the range of the resulting GIST-PSS was 0–6 points (Table 4). An ROC curve was created to identify the best sensitivity and specificity cut-off values of GIST-PSS for differentiating GISTs from non-GISTs (Fig. 2). The sensitivity and specificity were almost optimized when the critical values of GIST-PSS were 2.5. Therefore, a total of three points was defined as the cutoff value between GISTs and non-GISTs. The sensitivity, specificity, and accuracy of the GIST-PSS were 86.5% (95% CI 80.3–91.0%), 75.9% (95% CI 60.0–87.4%), and 83.5% (95% CI 74.6–90.0%), respectively (Table 5).

Table 4 Proposed gastrointestinal stromal tumor-predicting scoring system
Fig. 2
figure 2

Receiver operating characteristic curve used to determine cut-off values for gastrointestinal stromal tumor-predicting scoring system (GIST-PSS). The sensitivity and specificity were almost optimized when the critical values for GIST-PSS were 2.5; the respective area under the curve values was 0.861

Table 5 Sensitivity, specificity, positive and negative predictive values, and accuracy of the gastrointestinal stromal tumor (GIST)-predicting scoring system for differentiating GISTs from non-GISTs

Discussion

In the present study, we tried to analyze the differences in the clinicopathological characteristics and digital image analysis of EUS images according to the histopathology in GMTs measuring 2–5 cm. We found that age ≥ 58 years and Tmean ≥ 67 were independent factors predicting GIST on multivariate analysis. Additionally, the scoring system (GIST-PSS) based on age, Tmean, and TSD showed an acceptable sensitivity, specificity and accuracy for predicting GIST in a given GMT. To the best of our knowledge, this is the first study to report a scoring system that predicts GIST using digital EUS image analysis.

We previously reported characteristic EUS findings of GISTs, schwannomas, and leiomyomas [10, 20]; heterogeneity, hyperechogenic spots, and higher echogenicity in comparison with the surrounding muscle layer were helpful for discriminating GISTs from other benign GMTs. However, interpretation of EUS images is subjective and endoscopist dependent, which results in poor interobserver agreement [12, 13]. To overcome this limitation, we aimed to seek more objective information from EUS images.

Unlike color endoscopy images, EUS images are black and white images composed of pixels, and their echo density is expressed as brightness values from 0 to 255. As a result, analysis of the brightness values can be an appropriate method to evaluate the level of echogenicity (expressed as Tmean) and the degree of heterogeneity (expressed as TSD) [18]. Additionally, EUS images can display different characteristics in accordance with various contrasts used during EUS examination. Therefore, to minimize these differences, we selected the brightness of the anechoic center and outer hyperechoic rim of the echoendoscope, which have the least variability, and then standardized the EUS images. We have already reported the usefulness of this standardization of EUS images in GMTs [18].

After post-standardized image analysis, both Tmean and TSD were significantly higher in the GIST group than in the non-GIST group on univariate analysis, which is consistent with our previous study [18]. Additionally, these findings also correspond with results of previous studies that a higher echogenicity in comparison with the surrounding muscle layer and heterogeneity are useful in diagnosing GISTs [10, 11, 21]. However, on multivariate analysis, only Tmean was significantly higher in the GIST group than in the non-GIST group. In addition, older age (≥ 58 years) was another predictor of GIST; this result is consistent with the median age of patients with GIST being 63–69 years in population-based studies [14, 22, 23]. On the contrary, the median age of patients with leiomyoma and schwannoma is 52–54 years [10, 20, 22]. Therefore, an older age can be another objective factor for predicting GIST when a GMT is found during endoscopy.

When a subepithelial tumor is found during endoscopy, especially when one is incidentally found during routine medical health examinations, tumor size and presence or absence of clinically malignant features are the main determinants for further detailed examinations, such as EUS, computed tomography with contrast enhancement, and/or EUS-FNA/B [14, 23]. Clinically malignant features on endoscopy include irregular borders, ulceration, and/or growth during endoscopic follow-up. When there are no clinically malignant features, a gastric subepithelial tumor ≤ 2 cm is usually followed up by endoscopy or EUS once or twice a year. On the contrary, it is recommended that subepithelial tumors > 5 cm, except for definitely benign tumors, should be surgically resected and subjected to pathological examinations [14, 15]. In the case of gastric GMTs measuring 2–5 cm, more invasive and expensive procedures to achieve tissue diagnosis, such as EUS-FNA/B and deep biopsy via endoscopic submucosal dissection, are usually recommended to evaluate their malignant potential, mainly to differentiate GISTs from non-GISTs. Therefore, how to approach GMTs measuring 2–5 cm is important to both patients and physicians in real practice. We aimed to make a scoring system with an acceptable accuracy to discriminate GISTs from non-GISTs. For this, we used the factors that are not endoscopist dependent and can be expressed as objective values. In the present study, we found three objective factors (age, Tmean, and TSD) that were significantly associated with the possibility of GIST, and then could make GIST-PSS. When a GIST-PSS score was 3 points or more, the sensitivity, specificity, and accuracy for predicting GIST were 86.5%, 75.9%, and 83.5%, respectively. Considering the diagnostic yields of EUS-FNA/B are 60–85% in subepithelial tumors [24,25,26], our results are thought to be acceptable to use in real clinics. However, further large prospective studies are required to validate our results of GIST-PSS based on digital image analysis.

This study has several limitations. First, this was a retrospective study that compared EUS images between GISTs and non-GISTs using digital image analysis. Therefore, there might have been a potential bias when retrospectively reviewing the EUS images. During EUS examination, we obtained at least ten EUS images to determine the characteristics of GMTs and selected one EUS image with the highest quality for digital image analysis; this could compensate, to some degree, the limitation of a retrospective study. Second, patients were selected for surgery according to the clinical opinions and decisions of the physicians. Third, even though we analyzed only EUS images obtained using a radial-scanning mechanical echoendoscope (GF-UM2000; Olympus) at 7.5 MHz to reduce differences between the images due to use of different frequencies, the actual settings of the EUS, such as gain and contrast, and different echoendoscopes and EUS systems, were different in each case. Therefore, we did attempt to standardize the EUS images according to the brightness values of the anechoic center and outer hyperechoic rim of the echoendoscope. However, this attempt to standardize the EUS images cannot overcome the limitations of a retrospective study. Further prospective studies are needed that use the same settings, such as gain and contrast, during EUS examination.

In conclusion, digital EUS image analysis provides objective information to differentiate GISTs from non-GISTs in GMTs. The scoring system shows a high sensitivity and specificity in predicting GIST in GMTs measuring 2–5 cm. Further prospective studies about the validation of the scoring system using digital image analysis are needed in a larger number of GMTs.