Published online Jun 28, 2022. doi: 10.3748/wjg.v28.i24.2721
Peer-review started: October 30, 2021
First decision: March 11, 2022
Revised: March 25, 2022
Accepted: May 8, 2022
Article in press: May 8, 2022
Published online: June 28, 2022
Processing time: 237 Days and 2.6 Hours
Bleeding is one of the major complications after endoscopic submucosal dissection (ESD) in early gastric cancer (EGC) patients. There are limited studies on estimating the bleeding risk after ESD using an artificial intelligence system.
To derivate and verify the performance of the deep learning model and the clinical model for predicting bleeding risk after ESD in EGC patients.
Patients with EGC who underwent ESD between January 2010 and June 2020 at the Samsung Medical Center were enrolled, and post-ESD bleeding (PEB) was investigated retrospectively. We split the entire cohort into a development set (80%) and a validation set (20%). The deep learning and clinical model were built on the development set and tested in the validation set. The performance of the deep learning model and the clinical model were compared using the area under the curve and the stratification of bleeding risk after ESD.
A total of 5629 patients were included, and PEB occurred in 325 patients. The area under the curve for predicting PEB was 0.71 (95% confidence interval: 0.63-0.78) in the deep learning model and 0.70 (95% confidence interval: 0.62-0.77) in the clinical model, without significant difference (P = 0.730). The patients expected to the low- (< 5%), intermediate- (≥ 5%, < 9%), and high-risk (≥ 9%) categories were observed with actual bleeding rate of 2.2%, 3.9%, and 11.6%, respectively, in the deep learning model; 4.0%, 8.8%, and 18.2%, respectively, in the clinical model.
A deep learning model can predict and stratify the bleeding risk after ESD in patients with EGC.
Core Tip: Bleeding is one of the major complications after endoscopic submucosal dissection (ESD) in early gastric cancer patients and requires hospital-based intervention. We established a deep learning model to stratify the bleeding risk after ESD and demonstrated its performance compared with a clinical model. The deep learning model showed acceptable area under the curve and could stratify the post-ESD bleeding risk as low-, intermediate-, and high-risk categories, which correlated with actual bleeding rate comparatively. A deep learning model would be valuable in assessing the bleeding risk after ESD in early gastric cancer patients.
- Citation: Na JE, Lee YC, Kim TJ, Lee H, Won HH, Min YW, Min BH, Lee JH, Rhee PL, Kim JJ. Utility of a deep learning model and a clinical model for predicting bleeding after endoscopic submucosal dissection in patients with early gastric cancer. World J Gastroenterol 2022; 28(24): 2721-2732
- URL: https://www.wjgnet.com/1007-9327/full/v28/i24/2721.htm
- DOI: https://dx.doi.org/10.3748/wjg.v28.i24.2721
In South Korea, gastric cancer has a high incidence and is the second most common malignancy and the fourth most common cause of cancer-related mortality[1]. After the advent of screening programs for gastric cancer in South Korea and Japan, up to 50%–70% of cases with gastric cancers have been diagnosed at an early stage[2-4]. With the increasing rate of diagnosis at early stages, endoscopic submucosal dissection (ESD) is being actively applied for the minimally invasive treatment of early gastric cancer (EGC) without suspicion of regional lymph node metastasis[5,6].
In accordance with the current trend of active use of ESD, it is necessary to pay attention to the post-ESD complications. Bleeding is one of the significant complications, with an incidence of 3.6%–6.9%[7,8]. Because bleeding after ESD requires hospitalization and hemostatic interventions, there is a need to predict patients at a high risk of bleeding after ESD. Therefore, there have been reports on risk factors related to bleeding after ESD[9-12]. Recently, a predictive risk-scoring model for bleeding after ESD was proposed in Japan; this tool is expected to raise awareness regarding the potential bleeding sources and thus, help physicians manage patients with EGC who are treated with ESD[13].
Currently, artificial intelligence systems are being applied in various fields of gastroenterology[14]. The machine learning models showed good performance in the triage of necessity for intervention in patients with upper gastrointestinal bleeding and predicting recurrent ulcer bleeding[15,16]. Deep learning is advantageous over the machine learning model among artificial intelligence systems; its performance is optimized by automatic learning while experiencing various cases. It can integrate and interpret multiple factors simultaneously without external intervention. Hence, the automatically trained deep learning model can generalize well. There has been no study on the efficacy of deep learning for predicting post-ESD bleeding (PEB), and no study has compared these systems with a clinical model.
This study aimed to develop and compare the performance of the deep learning and clinical model for predicting PEB in EGC patients. We chose deep learning among the artificial intelligence systems as a sophisticated algorithm.
Patients who underwent ESD for EGC between January 2010 and June 2020 at the Samsung Medical Center, Seoul, South Korea, were screened retrospectively. We excluded cases with: Failure to complete ESD (n = 1); prior gastrectomy (n = 2); additional gastrectomy within 28 d after ESD (n = 497); no residual tumor in the ESD specimen (n = 48); multiple procedures, such as EMR for other benign lesions and ESD for EGC (n = 46); and missing values for important variables (n = 7) (Figure 1). A total of 5629 patients were included in the analysis, and they were randomly categorized into the development set (80%) and the validation set (20%). The Institutional Review Board of the Samsung Medical Center, Korea, approved this study, and the requirement for obtaining informed consent was waived owing to the study's retrospective nature.
The main outcome included the development of a deep learning model and a clinical model that predict the bleeding after ESD in patients with EGC and the comparison of performance between the deep learning model and the clinical model.
The variables used to build the deep learning and clinical models were collected from the medical records retrospectively based on the date of ESD. These variables included: Age; sex; comorbidities such as hypertension, diabetes mellitus, liver cirrhosis, and chronic kidney disease (estimated glomerular filtration rate < 60 mL/min per 1.73 m2); patient management with antithrombotic agents (ATs) [aspirin, P2Y12 receptor agonist (P2Y12RA), warfarin, direct-acting oral anticoagulants (DOAC), and cilostazol], non-steroidal anti-inflammatory drugs (NSAIDs), interruption of ATs, replacement of antiplatelet agents (APA), and heparin bridging; tumor characteristics (single or multiple lesions, location, pathologic size, type of differentiation); piecemeal resection; and laboratory data (albumin level and international normalized ratio).
Bleeding after ESD was defined as the presence of signs of bleeding (melena, hematemesis, or a decrease in the hemoglobin level by > 2 g/dL) along with endoscopic stigmata of recent bleeding, such as Forrest class Ia, Ib, IIa, and IIb, within 28 d after ESD. Interruption of ATs was defined as the discontinuation of these medications before the procedure, according to the recommended duration. Replacement of APA was described as when the procedure was performed with aspirin or cilostazol alone in patients who were receiving multiple APAs. Heparin bridging was defined as the administration of heparin during the period between the discontinuation and resumption of anticoagulants. A hemoglobin reduction of > 2 g/dL was evaluated by calculating the differences in the hemoglobin levels between the day before and after ESD.
We built a deep learning model and a clinical model based on the development set, which comprised 80% of the overall cohort. Subsequently, we validated the deep learning and clinical models in the validation set, which comprised 20% of the overall cohort. The categorical variables were converted using one-hot encoding, and the continuous variables were normalized, as preprocessing. We built the deep learning model as follows: First, we augmented the development set using the borderline synthetic minority over-sampling technique to overcome the imbalance of the dataset. Synthetic data were generated from 5%–100% of the majority class. Second, we constructed the deep learning model using automated machine learning, called Keras Tuner, to tune hyperparameters automatically. The initial architecture of the model was configured similarly to a transformer based on the attention mechanism[17]. Then, we set the number of neurons as a hyperparameter variable, ranging from 12 to 24, in four dense layers. The learning rate was also set to a range from 1e-2 − to 1e-4 −. The combination of hyperparameters was determined using Bayesian optimization. Finally, we evaluated the performance in the validation set using a model tuned with the 20% of synthetic data of the majority class. The optimal units of dense layers were selected to 24. The optimal number of attention head was chosen to 16. The architecture is depicted in Supplementary Figure 1. The optimal learning rate with Adam optimizer was 1e-3.
Multivariable logistic regression analysis was performed in the development set to build the clinical model. Then, the clinical model was constructed as a formula with the sum of the beta coefficient values of significant factors with a P value of < 0.05.
The calculated value from the deep learning and clinical models was multiplied by 1000 and converted as a score. The score that indicated the risk probability was divided by the decile in the development set. We selected cutoff to discriminate the risk categories as low-, intermediate-, and high-risk at a bleeding rate of < 5% and < 9% in the development set referred to in a previous report[13]. Decile 1st to 4th was allocated to low risk, 5th to 8th to intermediate risk, and 9th to 10th to high-risk category. Link to the deep learning and clinical models: https:// github.com/YeongChanLee/Predict-PEB.
Descriptive statistics for continuous and categorical variables are presented as means (standard deviation) and frequencies (%). The deep learning model and the clinical model for prediction of bleeding after ESD were evaluated using two methods. First, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and receiver operating characteristic area (ROC) curve along with the area under the curve (AUC) were analyzed. The performance with AUC was compared using the bootstrap test. Second, the risk stratification of PEB based on the development set was applied to the validation set and compared with the actual bleeding rate in the validation set. For example, if the score of calculated cases belongs to the high-risk category, we verified that the real bleeding rate was in the predicted range of 9% or higher. The predictors for PEB were identified with multivariable logistic regression analysis in the entire cohort and development set. Model development for deep learning was performed using Tensor Flow 2.4.0, and Python 3.8.5. statistical analyses were performed using the R software (version 3.5.1, Vienna, Austria).
Of the 5629 patients, 325 experienced post-ESD bleeding (PEB). The non-PEB and PEB groups were comparable in age, liver cirrhosis status, albumin level, international normalized ratio level, a proportion of aspirin or cilostazol use, undifferentiated tumor type, and piecemeal resection. The PEB group had a higher proportion of males and comorbidities (hypertension, diabetes mellitus, and chronic kidney disease) than the non-PEB group. P2Y12RA and anticoagulants (warfarin or DOAC) and the proportion of patients receiving replacement therapy or heparin bridging were higher in the PEB group than in the non-PEB group. The PEB group had a higher proportion of multiple tumors and middle location of tumors and larger size of tumors than the non-PEB group (Table 1). There was no difference in the baseline characteristics between the development and validation sets (Supplementary Table 1).
Variable | Non-PEB, n = 5304 | PEB, n = 325 | P value1 |
Age2 | 64 ± 10 | 63 ± 11 | 0.065 |
Sex | 0.001 | ||
Female | 1245 (23.5) | 49 (15.1) | |
Male | 4059 (76.5) | 276 (84.9) | |
Hypertension | 1413 (26.6) | 120 (36.9) | < 0.001 |
Diabetes mellitus | 936 (17.6) | 74 (22.8) | 0.024 |
Liver cirrhosis | 93 (1.8) | 4 (1.2) | 0.629 |
Chronic kidney disease | 299 (5.6) | 33 (10.2) | 0.001 |
Aspirin | 515 (9.7) | 42 (12.9) | 0.074 |
P2Y12RA | 181 (3.4) | 23 (7.1) | 0.001 |
Warfarin | 22 (0.4) | 7 (2.2) | < 0.001 |
DOAC | 31 (0.6) | 6 (1.8) | 0.017 |
Cilostazol | 47 (0.9) | 3 (0.9) | 1.000 |
NSAIDs | 28 (0.5) | 3 (0.9) | 0.583 |
Preprocedure management of AT | < 0.001 | ||
No indication | 4605 (86.8) | 264 (81.2) | |
Interruption | 676 (12.7) | 53 (16.3) | |
Replacement or heparin bridge | 23 (0.4) | 8 (2.5) | |
Tumor | |||
Multiple | 284 (5.4) | 28 (8.6) | 0.018 |
Location | < 0.001 | ||
Upper | 433 (8.2) | 22 (6.8) | |
Middle | 1728 (32.6) | 157 (48.3) | |
Lower | 3143 (59.3) | 146 (44.9) | |
Size2, mm | 17 ± 10 | 21 ± 13 | < 0.001 |
Undifferentiated type | 125 (2.4) | 7 (2.2) | 0.963 |
Piecemeal resection | 64 (1.2) | 4 (1.2) | 1.000 |
Laboratory data | |||
Albumin2, g/dL | 4.3 ± 0.3 | 4.4 ± 0.4 | 0.345 |
INR2 | 1.0 ± 0.1 | 1.0 ± 0.1 | 0.106 |
In the overall cohort, the independent predictors were identified as follows: Age [odds ratio (OR) = 0.98; 95% confidence interval (CI): 0.96–0.99; P value < 0.001], male (OR = 1.65; 95%CI: 1.19–2.28; P value = 0.003), hypertension (OR = 1.56; 95%CI: 1.19–2.03; p value = 0.001), chronic kidney disease (OR = 1.78; 95%CI: 1.18–2.70; P value = 0.006), P2Y12RA (OR = 2.40; 95%CI: 1.22–4.74; P value = 0.011), DOAC (OR = 4.31; 95%CI: 1.26–14.78; P value = 0.020), middle location (OR = 1.72; 95%CI: 1.07–2.74; P value = 0.024), and size (OR = 1.03; 95%CI: 1.02–1.04; P value < 0.001) (Supplementary Table 2).
In the development set, age (OR = 0.98; 95%CI: 0.96–0.99; P value = 0.001), male (OR = 1.54; 95%CI: 1.09–2.19; P value = 0.015), hypertension (OR = 1.35; 95%CI: 1.00–1.82; P value = 0.049), chronic kidney disease (OR = 1.78; 95%CI: 1.12–2.84; P value = 0.015), P2Y12RA (OR = 2.26; 95%CI: 1.05–4.88; P value = 0.037), middle location (OR = 1.97; 95%CI: 1.14–3.41; P value = 0.015), and size (OR = 1.04; 95%CI: 1.03–1.05; P value < 0.001) were identified as independent predictors. The clinical model was a formula described bottom of Table 2.
Variables | Multivariable | ||||
OR | 95%CI | P value | ß regression coefficient | ||
Age | 0.98 | 0.96–0.99 | 0.001 | -0.024 | |
Sex | Female/male | 1.54 | 1.09–2.19 | 0.015 | 0.435 |
Hypertension | No/yes | 1.35 | 1.00–1.82 | 0.049 | 0.299 |
Diabetes mellitus | No/yes | 1.27 | 0.92–1.75 | 0.145 | 0.238 |
Liver cirrhosis | No/yes | 0.59 | 0.18–1.95 | 0.385 | -0.532 |
Chronic kidney disease | No/yes | 1.78 | 1.12–2.84 | 0.015 | 0.578 |
Aspirin | No/yes | 1.51 | 0.62–3.69 | 0.363 | 0.414 |
P2Y12RA | No/yes | 2.26 | 1.05–4.88 | 0.037 | 0.818 |
Warfarin | No/yes | 1.51 | 0.28–8.07 | 0.629 | 0.413 |
DOAC | No/yes | 3.24 | 0.76–13.82 | 0.113 | 1.174 |
Cilostazol | No/yes | 1.35 | 0.35–5.18 | 0.662 | 0.300 |
NSAIDs | No/yes | 2.65 | 0.77–9.14 | 0.124 | 0.973 |
Preprocedure management of AT | No indication | 1 | |||
Interruption | 0.63 | 0.24–1.67 | 0.353 | -0.464 | |
Replacement orHeparin bridge | 3.32 | 0.47–23.60 | 0.231 | 1.199 | |
Multiple | No/yes | 1.48 | 0.92–2.38 | 0.104 | 0.393 |
Location | Upper | 1 | |||
Middle | 1.97 | 1.14–3.41 | 0.015 | 0.680 | |
Lower | 1.11 | 0.64–1.91 | 0.711 | 0.103 | |
Size | 1.04 | 1.03–1.05 | < 0.001 | 0.036 | |
Undifferentiated type | No/yes | 0.56 | 0.20–1.57 | 0.271 | -0.579 |
Piecemeal | No/yes | 0.98 | 0.30–3.22 | 0.976 | -0.019 |
Albumin, g/dL | 1.33 | 0.89–2.00 | 0.168 | 0.286 | |
INR | 2.04 | 0.37–11.08 | 0.410 | 0.711 |
The deep learning model was found to have a sensitivity of 64.3%, specificity of 74.0%, PPV of 11.4%, NPV of 97.5%, and AUC of 0.71 (95%CI: 0.63–0.78). The clinical model had a sensitivity of 69.6%, specificity of 71.0%, PPV of 11.1%, NPV of 97.8%, and AUC of 0.70 (95%CI: 0.62–0.77) (Table 3 and Figure 2). There were no significant differences in the AUCs between the deep learning and clinical models (Table 3).
Deep learning model | Clinical model | P value | |
Sensitivity (%) | 64.3 (45.8–84.1) | 69.6 (54.2–80.8) | |
Specificity (%) | 74.0 (50.6–89.2) | 71.0 (68.5–79.5) | |
PPV (%) | 11.4 (7.4–18.1) | 11.1 (8.0–15.4) | |
NPV (%) | 97.5 (96.4–98.7) | 97.8 (96.6–98.7) | |
AUC (95%CI) | 0.71 (0.63–0.78) | 0.70 (0.62–0.77) | 0.730 |
The score multiplied by 1000 to the derived value based on the deep learning and clinical models reflects the risk probability and was divided into deciles. The maximum cutoff was 35.9 in low risk, 57.5 in intermediate risk, and over the 57.5 was assigned to a high-risk category of the deep learning model based on development set (Table 4). In the clinical model, the maximum cutoff was 12.7 in low risk, 24.6 in intermediate risk, and over 24.6 was considered a high-risk category based on development set (Table 4). In the validated set, the deep learning model showed an actual bleeding rate in low-, intermediate-, high-risk categories, respectively, of 2.2%, 3.9%, and 11.6%; the clinical model showed an actual bleeding rate of 4.0%, 8.8%, and 18.2%, respectively, in low-, intermediate-, high-risk categories (Table 4).
Decile | Risk categories | Deep learning | Clinical model | ||||||
Score1 | Patients | Bleeding | Rate (%) | Score1 | Patients | Bleeding | Rate (%) | ||
Development set | |||||||||
1 | Low | 25.7 | 451 | 11 | 2.4 | 8.4 | 451 | 12 | 2.6 |
2 | 29.1 | 451 | 15 | 3.3 | 12.2 | 451 | 15 | 3.3 | |
3 | 32.5 | 451 | 6 | 1.3 | 12.4 | 451 | 12 | 2.6 | |
4 | 35.9 | 450 | 17 | 3.8 | 12.7 | 450 | 20 | 4.4 | |
5 | Intermediate | 40.2 | 450 | 27 | 6.0 | 14.8 | 450 | 34 | 7.6 |
6 | 45.3 | 450 | 29 | 6.4 | 16.6 | 450 | 15 | 3.3 | |
7 | 50.8 | 450 | 36 | 8.0 | 23.3 | 450 | 24 | 5.3 | |
8 | 57.5 | 450 | 24 | 5.3 | 24.6 | 450 | 32 | 7.1 | |
9 | High | 67.2 | 450 | 41 | 9.1 | 31.0 | 450 | 54 | 12.0 |
10 | 197.0 | 450 | 63 | 14.0 | 122.0 | 450 | 51 | 11.3 | |
Validation set | |||||||||
Low | 35.9 | 411 | 9 | 2.2 | 12.7 | 956 | 38 | 4.0 | |
Intermediate | 57.5 | 466 | 18 | 3.9 | 24.5 | 137 | 12 | 8.8 | |
High | 147.0 | 249 | 29 | 11.6 | 155.0 | 33 | 6 | 18.2 |
The deep learning and clinical models for predicting bleeding after ESD in patients with EGC showed good performance. We demonstrated that deep learning and clinical models could stratify the PEB risk, which correlated with actual bleeding rates. Hence, we suggest that the deep learning model can aid in the prediction of bleeding after ESD, in addition to the clinical model.
This study was the first to establish a deep learning model for predicting bleeding after ESD and demonstrate its performance compared to that of a clinical model. The strengths of this study were its large sample size and the relatively recent data from a single institution. In addition, we included all essential variables and sought the advantages of the deep learning model that can deal with extensive data and complex problems and improve its performance incrementally by automated learning. We included all types of ATs separately and clarified the distinction between patients without an indication for ATs, patients who received an interruption before the procedure, and patients who received replacement or heparin bridging.
Our study identified younger age, male sex, hypertension, chronic kidney disease, P2Y12RA use, DOAC use, middle tumor location, and tumor size as the predictors of PEB. Previous studies also reported that younger age was associated with PEB[18-20]. It is unclear why younger age was associated with PEB. Several reports proposed that atrophic change along with aging might relate to decreasing the vascularity on the mucosal and submucosal layers[18,20-23]. Although aging and changes in intestinal vasculature have not been clearly elucidated, a decrease in the volume of vasculature with aging was observed in animals[24]. Aspirin did not increase the PEB risk after discontinuation about 1 wk[25]. Although some reported that maintaining aspirin did not increase the PEB risk[25-28], a meta-analysis showed that aspirin was associated with increased bleeding risk, requiring clinical caution[29]. There is still controversial due to limited evidence for P2Y12R[9,13,20,30]. In comparison, an increased bleeding risk after ESD has been reported consistently in patients receiving dual antiplatelets. In addition, there were reports that warfarin or DOAC are related to bleeding risk[13]; rather, some reported heparin bridging was associated with PEB risk[9,26]. The irony is that most of the patients who experience heparin bridging take warfarin or DOAC, but the results about each factor were inconsistent in previous retrospective studies. It is assumed that the duration of discontinuation and other individual factors might influence these results. In addition, it has been suggested that large size[8,19,20], CKD with hemodialysis[13,26,31], and long procedure time[20] were associated with bleeding after ESD. The upper location showed increased PEB risk[18,32]; in contrast, some others reported lower location related to increased PEB risk[18,32]; a recent meta-analysis did not prove significance according to the location[8].
Recently, a predictive risk-scoring model for PEB in Japan showed that CKD with hemodialysis, usage of aspirin, P2Y12RA, cilostazol, warfarin, DOAC, lower third tumor location, tumor size > 30 mm, and the presence of multiple tumors were the predictors of PEB, whereas interruption was a protective factor against PEB[13]. Another recent model proposed a simple algorithm including significant factors with continuous use of ATs, size ≥ 49 mm, and age < 62 years. We also found an association between P2Y12RA or DOAC usage and PEB; however, other ATs were not associated with PEB, and interruption and heparin bridging or replacement of APA were not identified as the protective factors. In our institution, ESD is classified as a high-risk procedure based on the national practice guidelines, and experts are consulted before ESD in patients receiving ATs. The expert assesses the thromboembolic risk depending on the underlying disease and recommends the possibility of interruption, duration of interruption, and the need for heparin bridging or replacement of APA[33-36]. Recently, a guideline published in South Korea also categorized ESD as an ultra-high-risk procedure and recommended interruption of ATs with heparin bridging or replacement of APA according to the thromboembolic risk[37].
The deep learning model in our study showed an AUC of 0.71, which was comparable to the AUC of 0.72 for a risk-scoring model in Japan[13] and the AUC of 0.70 for the clinical model in our study. In the validation set, predicted low-, intermediate-, and high-risk categories showed an actual bleeding rate of 2.2%, 3.9%, and 11.6%, respectively in the deep learning; 4.0%, 8.8%, and 18.2%, respectively, in the clinical model. Our study demonstrated that the deep learning and clinical models can stratify the bleeding risk after ESD. The predicted risk categories correlated with actual bleeding rate; even considering the actual bleeding rate was slightly lower than predicted range of ≥ 5% and < 9% (intermediate risk) in the deep learning and was close to upper range in the clinical model. Our findings support the clinical potential of the deep learning model for predicting PEB risk based on its comparable performance. Because bleeding after ESD requires intervention and hospitalization, physicians are concerned about the occurrence of PEB as a major complication. Based on the risk-prediction model, physicians could carefully assess the bleeding risk and perform preventive hemostasis during the procedure. Suppose additional management like the shielding method for preventing PEB in the selected high-risk group is attempted; in that case, it is anticipated that the deep learning model could support risk stratification.
Our study has several limitations. Due to its retrospective design, information such as the timing of the resumption of ATs, endoscopist’s experience, defect size, and procedure duration was missing. Furthermore, our study was designed as a single-center study; hence, hospital-based validation in other hospitals was not performed, and further proof is warranted. However, the deep learning model might be generalizable because it automatically identifies the risk or probability of bleeding without the external intervention of known relevant factors. Both the deep learning and clinical models showed a low PPV, which may be related to the low incidence of bleeding after ESD, even though bleeding is one of the major complications. In our cohort, the number of patients who received anticoagulants (warfarin or DOAC) was small; therefore, it is possible that the statistical significance of these variables was insufficient for establishing a clinical model in the development set. In this regard, despite the fact that our study focused on the development of a deep learning model and a clinical model, as well as the utility of the deep learning model, further accumulation of data and additional analysis will be required before the commencement of the clinical application of artificial intelligence systems.
In conclusion, we introduced a deep learning model to predict the risk of bleeding after ESD in patients with EGC. The model demonstrated its performance as comparable to the clinical model. The deep learning model could help physicians raise caution to the PEB and would be a desirable tool for supporting ESD application.
With the increasing rate of diagnosis at early stages of gastric cancer, endoscopic submucosal dissection (ESD) is being actively applied as the minimally invasive treatment. Bleeding is one of the significant complications, with an incidence of 3.6%–6.9%. Because bleeding after ESD requires hospitalization and hemostatic interventions, there is a need to predict patients at a high risk of bleeding after ESD.
Currently, artificial intelligence systems are being applied in various fields of gastroenterology. Deep learning among artificial intelligence systems was automatically trained so that it could be generalized well. There has been no study on the efficacy of deep learning for predicting post-ESD bleeding (PEB), and no study has compared these systems with a clinical model.
This study aimed to develop and compare the performance of the deep learning and clinical model for predicting PEB in early gastric cancer (EGC) patients.
Patients who underwent ESD for EGC between January 2010 and June 2020 at the Samsung Medical Center, Seoul, South Korea, were screened retrospectively. We built a deep learning model and a clinical model based on the development set, which comprised 80% of the overall cohort. Subsequently, we validated the deep learning and clinical models in the validation set, which comprised 20% of the overall cohort. The deep learning model and the clinical model for prediction of bleeding after ESD were evaluated using two methods. First, sensitivity, specificity, positive predictive value, negative predictive value, and receiver operating characteristic area curve along with the area under the curve (AUC) were analyzed. The performance with AUC was compared using the bootstrap test. Second, the risk stratification of PEB based on the development set was applied to the validation set and compared with the actual bleeding rate. The authors selected cutoff to discriminate the risk categories as low-, intermediate-, and high-risk at a bleeding rate of < 5% and < 9% in the development set referred to in a previous report.
Of the 5629 patients, 325 experienced PEB. The AUC for predicting PEB was 0.71 (95% confidence interval: 0.63-0.78) in the deep learning model and 0.70 (95% confidence interval: 0.62-0.77) in the clinical model, without significant difference (P = 0.730). In the validated set, the deep learning model showed an actual bleeding rate of 2.2%, 3.9%, and 11.6% in low-, intermediate-, high-risk categories, respectively; the clinical model showed an actual bleeding rate of 4.0%, 8.8%, and 18.2% in low-, intermediate-, high-risk categories, respectively.
In conclusion, we introduced a deep learning model to predict the risk of bleeding after ESD in patients with EGC. The model demonstrated its performance as comparable to the clinical model.
Based on the risk-prediction model, physicians could carefully assess the bleeding risk and perform preventive hemostasis during the procedure. Suppose additional management like the shielding method for preventing PEB in the selected high-risk group is attempted; in that case, it is anticipated that the deep learning model could support risk stratification.
Provenance and peer review: Unsolicited article; Externally peer reviewed.
Peer-review model: Single blind
Specialty type: Gastroenterology and hepatology
Country/Territory of origin: South Korea
Peer-review report’s scientific quality classification
Grade A (Excellent): 0
Grade B (Very good): B, B
Grade C (Good): C
Grade D (Fair): 0
Grade E (Poor): 0
P-Reviewer: Pattarajierapan S, Thailand; Singh A, India; Sulbaran MN, Brazil S-Editor: Ma YJ L-Editor: Filipodia P-Editor: Ma YJ
1. | Hong S, Won YJ, Park YR, Jung KW, Kong HJ, Lee ES; Community of Population-Based Regional Cancer Registries. Cancer Statistics in Korea: Incidence, Mortality, Survival, and Prevalence in 2017. Cancer Res Treat. 2020;52:335-350. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 244] [Cited by in F6Publishing: 302] [Article Influence: 75.5] [Reference Citation Analysis (0)] |
2. | Kim YG, Kong SH, Oh SY, Lee KG, Suh YS, Yang JY, Choi J, Kim SG, Kim JS, Kim WH, Lee HJ, Yang HK. Effects of screening on gastric cancer management: comparative analysis of the results in 2006 and in 2011. J Gastric Cancer. 2014;14:129-134. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 45] [Cited by in F6Publishing: 59] [Article Influence: 5.9] [Reference Citation Analysis (0)] |
3. | Hamashima C; Systematic Review Group and Guideline Development Group for Gastric Cancer Screening Guidelines. Update version of the Japanese Guidelines for Gastric Cancer Screening. Jpn J Clin Oncol. 2018;48:673-683. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 120] [Cited by in F6Publishing: 229] [Article Influence: 38.2] [Reference Citation Analysis (0)] |
4. | Japanese Gastric Cancer Association Registration Committee; Maruyama K, Kaminishi M, Hayashi K, Isobe Y, Honda I, Katai H, Arai K, Kodera Y, Nashimoto A. Gastric cancer treated in 1991 in Japan: data analysis of nationwide registry. Gastric Cancer. 2006;9:51-66. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 186] [Cited by in F6Publishing: 189] [Article Influence: 10.5] [Reference Citation Analysis (0)] |
5. | Park CH, Yang DH, Kim JW, Kim JH, Min YW, Lee SH, Bae JH, Chung H, Choi KD, Park JC, Lee H, Kwak MS, Kim B, Lee HJ, Lee HS, Choi M, Park DA, Lee JY, Byeon JS, Park CG, Cho JY, Lee ST, Chun HJ. Clinical Practice Guideline for Endoscopic Resection of Early Gastrointestinal Cancer. Clin Endosc. 2020;53:142-166. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 41] [Cited by in F6Publishing: 86] [Article Influence: 21.5] [Reference Citation Analysis (0)] |
6. | Hatta W, Gotoda T, Koike T, Masamune A. History and future perspectives in Japanese guidelines for endoscopic resection of early gastric cancer. Dig Endosc. 2020;32:180-190. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 52] [Cited by in F6Publishing: 57] [Article Influence: 14.3] [Reference Citation Analysis (0)] |
7. | Saito I, Tsuji Y, Sakaguchi Y, Niimi K, Ono S, Kodashima S, Yamamichi N, Fujishiro M, Koike K. Complications related to gastric endoscopic submucosal dissection and their managements. Clin Endosc. 2014;47:398-403. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 50] [Cited by in F6Publishing: 71] [Article Influence: 7.1] [Reference Citation Analysis (0)] |
8. | Libânio D, Costa MN, Pimentel-Nunes P, Dinis-Ribeiro M. Risk factors for bleeding after gastric endoscopic submucosal dissection: a systematic review and meta-analysis. Gastrointest Endosc. 2016;84:572-586. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 71] [Cited by in F6Publishing: 90] [Article Influence: 11.3] [Reference Citation Analysis (0)] |
9. | Sato C, Hirasawa K, Koh R, Ikeda R, Fukuchi T, Kobayashi R, Kaneko H, Makazu M, Maeda S. Postoperative bleeding in patients on antithrombotic therapy after gastric endoscopic submucosal dissection. World J Gastroenterol. 2017;23:5557-5566. [PubMed] [DOI] [Cited in This Article: ] [Cited by in CrossRef: 42] [Cited by in F6Publishing: 45] [Article Influence: 6.4] [Reference Citation Analysis (0)] |
10. | Yano T, Tanabe S, Ishido K, Suzuki M, Kawanishi N, Yamane S, Watanabe A, Wada T, Azuma M, Katada C, Koizumi W. Different clinical characteristics associated with acute bleeding and delayed bleeding after endoscopic submucosal dissection in patients with early gastric cancer. Surg Endosc. 2017;31:4542-4550. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 26] [Cited by in F6Publishing: 36] [Article Influence: 5.1] [Reference Citation Analysis (0)] |
11. | Hashimoto M, Hatta W, Tsuji Y, Yoshio T, Yabuuchi Y, Hoteya S, Doyama H, Nagami Y, Hikichi T, Kobayashi M, Morita Y, Sumiyoshi T, Iguchi M, Tomida H, Inoue T, Mikami T, Hasatani K, Nishikawa J, Matsumura T, Nebiki H, Nakamatsu D, Ohnita K, Suzuki H, Ueyama H, Hayashi Y, Sugimoto M, Fujishiro M, Masamune A, Ohira H; Collaborators. Rebleeding in patients with delayed bleeding after endoscopic submucosal dissection for early gastric cancer. Dig Endosc. 2021;33:1120-1130. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 4] [Cited by in F6Publishing: 4] [Article Influence: 1.3] [Reference Citation Analysis (0)] |
12. | Kataoka Y, Tsuji Y, Sakaguchi Y, Minatsuki C, Asada-Hirayama I, Niimi K, Ono S, Kodashima S, Yamamichi N, Fujishiro M, Koike K. Bleeding after endoscopic submucosal dissection: Risk factors and preventive methods. World J Gastroenterol. 2016;22:5927-5935. [PubMed] [DOI] [Cited in This Article: ] [Cited by in CrossRef: 56] [Cited by in F6Publishing: 54] [Article Influence: 6.8] [Reference Citation Analysis (0)] |
13. | Hatta W, Tsuji Y, Yoshio T, Kakushima N, Hoteya S, Doyama H, Nagami Y, Hikichi T, Kobayashi M, Morita Y, Sumiyoshi T, Iguchi M, Tomida H, Inoue T, Koike T, Mikami T, Hasatani K, Nishikawa J, Matsumura T, Nebiki H, Nakamatsu D, Ohnita K, Suzuki H, Ueyama H, Hayashi Y, Sugimoto M, Yamaguchi S, Michida T, Yada T, Asahina Y, Narasaka T, Kuribasyashi S, Kiyotoki S, Mabe K, Nakamura T, Nakaya N, Fujishiro M, Masamune A. Prediction model of bleeding after endoscopic submucosal dissection for early gastric cancer: BEST-J score. Gut. 2021;70:476-484. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 37] [Cited by in F6Publishing: 72] [Article Influence: 24.0] [Reference Citation Analysis (0)] |
14. | Glissen Brown JR, Waljee AK, Mori Y, Sharma P, Berzin TM. Charting a path forward for clinical research in artificial intelligence and gastroenterology. Dig Endosc. 2022;34:4-12. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 1] [Cited by in F6Publishing: 1] [Article Influence: 0.5] [Reference Citation Analysis (0)] |
15. | Shung DL, Au B, Taylor RA, Tay JK, Laursen SB, Stanley AJ, Dalton HR, Ngu J, Schultz M, Laine L. Validation of a Machine Learning Model That Outperforms Clinical Risk Scoring Systems for Upper Gastrointestinal Bleeding. Gastroenterology. 2020;158:160-167. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 84] [Cited by in F6Publishing: 122] [Article Influence: 30.5] [Reference Citation Analysis (0)] |
16. | Wong GL, Ma AJ, Deng H, Ching JY, Wong VW, Tse YK, Yip TC, Lau LH, Liu HH, Leung CM, Tsang SW, Chan CW, Lau JY, Yuen PC, Chan FK. Machine learning model to predict recurrent ulcer bleeding in patients with history of idiopathic gastroduodenal ulcer bleeding. Aliment Pharmacol Ther. 2019;49:912-918. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 28] [Cited by in F6Publishing: 29] [Article Influence: 5.8] [Reference Citation Analysis (0)] |
17. | Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I. Attention is all you need. 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA. [cited April 1, 2021] Available from: https://arxiv.org/pdf/1706.03762.pdf. [Cited in This Article: ] |
18. | Jeon SW, Jung MK, Cho CM, Tak WY, Kweon YO, Kim SK, Choi YH. Predictors of immediate bleeding during endoscopic submucosal dissection in gastric lesions. Surg Endosc. 2009;23:1974-1979. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 62] [Cited by in F6Publishing: 79] [Article Influence: 4.9] [Reference Citation Analysis (0)] |
19. | Choe YH, Jung DH, Park JC, Kim HY, Shin SK, Lee SK, Lee YC. Prediction model for bleeding after endoscopic submucosal dissection of gastric neoplasms from a high-volume center. J Gastroenterol Hepatol. 2021;36:2217-2223. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 5] [Cited by in F6Publishing: 4] [Article Influence: 1.3] [Reference Citation Analysis (0)] |
20. | Nam HS, Choi CW, Kim SJ, Kim HW, Kang DH, Park SB, Ryu DG. Risk factors for delayed bleeding by onset time after endoscopic submucosal dissection for gastric neoplasm. Sci Rep. 2019;9:2674. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 28] [Cited by in F6Publishing: 36] [Article Influence: 7.2] [Reference Citation Analysis (0)] |
21. | Kim JW, Kim HS, Park DH, Park YS, Jee MG, Baik SK, Kwon SO, Lee DK. Risk factors for delayed postendoscopic mucosal resection hemorrhage in patients with gastric tumor. Eur J Gastroenterol Hepatol. 2007;19:409-415. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 17] [Cited by in F6Publishing: 14] [Article Influence: 0.8] [Reference Citation Analysis (0)] |
22. | Kim N, Park YS, Cho SI, Lee HS, Choe G, Kim IW, Won YD, Park JH, Kim JS, Jung HC, Song IS. Prevalence and risk factors of atrophic gastritis and intestinal metaplasia in a Korean population without significant gastroduodenal disease. Helicobacter. 2008;13:245-255. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 98] [Cited by in F6Publishing: 92] [Article Influence: 5.8] [Reference Citation Analysis (0)] |
23. | Joo YE, Park HK, Myung DS, Baik GH, Shin JE, Seo GS, Kim GH, Kim HU, Kim HY, Cho SI, Kim N. Prevalence and risk factors of atrophic gastritis and intestinal metaplasia: a nationwide multicenter prospective study in Korea. Gut Liver. 2013;7:303-310. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 60] [Cited by in F6Publishing: 61] [Article Influence: 5.5] [Reference Citation Analysis (0)] |
24. | Chen YM, Zhang JS, Duan XL. Changes of microvascular architecture, ultrastructure and permeability of rat jejunal villi at different ages. World J Gastroenterol. 2003;9:795-799. [PubMed] [DOI] [Cited in This Article: ] [Cited by in CrossRef: 22] [Cited by in F6Publishing: 22] [Article Influence: 1.0] [Reference Citation Analysis (0)] |
25. | Lim JH, Kim SG, Kim JW, Choi YJ, Kwon J, Kim JY, Lee YB, Choi J, Im JP, Kim JS, Jung HC, Song IS. Do antiplatelets increase the risk of bleeding after endoscopic submucosal dissection of gastric neoplasms? Gastrointest Endosc. 2012;75:719-727. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 69] [Cited by in F6Publishing: 76] [Article Influence: 6.3] [Reference Citation Analysis (0)] |
26. | Matsumura T, Arai M, Maruoka D, Okimoto K, Minemura S, Ishigami H, Saito K, Nakagawa T, Katsuno T, Yokosuka O. Risk factors for early and delayed post-operative bleeding after endoscopic submucosal dissection of gastric neoplasms, including patients with continued use of antithrombotic agents. BMC Gastroenterol. 2014;14:172. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 42] [Cited by in F6Publishing: 51] [Article Influence: 5.1] [Reference Citation Analysis (0)] |
27. | Harada H, Suehiro S, Murakami D, Nakahara R, Nagasaka T, Ujihara T, Sagami R, Katsuyama Y, Hayasaka K, Amano Y. Feasibility of gastric endoscopic submucosal dissection with continuous low-dose aspirin for patients receiving dual antiplatelet therapy. World J Gastroenterol. 2019;25:457-468. [PubMed] [DOI] [Cited in This Article: ] [Cited by in CrossRef: 12] [Cited by in F6Publishing: 11] [Article Influence: 2.2] [Reference Citation Analysis (0)] |
28. | Sanomura Y, Oka S, Tanaka S, Numata N, Higashiyama M, Kanao H, Yoshida S, Ueno Y, Chayama K. Continued use of low-dose aspirin does not increase the risk of bleeding during or after endoscopic submucosal dissection for early gastric cancer. Gastric Cancer. 2014;17:489-496. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 49] [Cited by in F6Publishing: 50] [Article Influence: 4.5] [Reference Citation Analysis (0)] |
29. | Wu W, Chen J, Ding Q, Yang D, Yu H, Lin J. Continued use of low-dose aspirin may increase risk of bleeding after gastrointestinal endoscopic submucosal dissection: A meta-analysis. Turk J Gastroenterol. 2017;28:329-336. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 3] [Cited by in F6Publishing: 5] [Article Influence: 0.7] [Reference Citation Analysis (0)] |
30. | Oh S, Kim SG, Kim J, Choi JM, Lim JH, Yang HJ, Park JY, Han SJ, Kim JL, Chung H, Jung HC. Continuous Use of Thienopyridine May Be as Safe as Low-Dose Aspirin in Endoscopic Resection of Gastric Tumors. Gut Liver. 2018;12:393-401. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 13] [Cited by in F6Publishing: 13] [Article Influence: 2.2] [Reference Citation Analysis (0)] |
31. | Numata N, Oka S, Tanaka S, Higashiyama M, Sanomura Y, Yoshida S, Arihiro K, Chayama K. Clinical outcomes of endoscopic submucosal dissection for early gastric cancer in patients with chronic kidney disease. J Gastroenterol Hepatol. 2013;28:1632-1637. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 22] [Cited by in F6Publishing: 21] [Article Influence: 1.9] [Reference Citation Analysis (0)] |
32. | Chung IK, Lee JH, Lee SH, Kim SJ, Cho JY, Cho WY, Hwangbo Y, Keum BR, Park JJ, Chun HJ, Kim HJ, Kim JJ, Ji SR, Seol SY. Therapeutic outcomes in 1000 cases of endoscopic submucosal dissection for early gastric neoplasms: Korean ESD Study Group multicenter study. Gastrointest Endosc. 2009;69:1228-1235. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 429] [Cited by in F6Publishing: 447] [Article Influence: 29.8] [Reference Citation Analysis (0)] |
33. | ASGE Standards of Practice Committee; Acosta RD, Abraham NS, Chandrasekhara V, Chathadi KV, Early DS, Eloubeidi MA, Evans JA, Faulx AL, Fisher DA, Fonkalsrud L, Hwang JH, Khashab MA, Lightdale JR, Muthusamy VR, Pasha SF, Saltzman JR, Shaukat A, Shergill AK, Wang A, Cash BD, DeWitt JM. The management of antithrombotic agents for patients undergoing GI endoscopy. Gastrointest Endosc. 2016;83:3-16. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 417] [Cited by in F6Publishing: 409] [Article Influence: 51.1] [Reference Citation Analysis (1)] |
34. | Chan FKL, Goh KL, Reddy N, Fujimoto K, Ho KY, Hokimoto S, Jeong YH, Kitazono T, Lee HS, Mahachai V, Tsoi KKF, Wu MS, Yan BP, Sugano K. Management of patients on antithrombotic agents undergoing emergency and elective endoscopy: joint Asian Pacific Association of Gastroenterology (APAGE) and Asian Pacific Society for Digestive Endoscopy (APSDE) practice guidelines. Gut. 2018;67:405-417. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 106] [Cited by in F6Publishing: 104] [Article Influence: 17.3] [Reference Citation Analysis (0)] |
35. | Fujimoto K, Fujishiro M, Kato M, Higuchi K, Iwakiri R, Sakamoto C, Uchiyama S, Kashiwagi A, Ogawa H, Murakami K, Mine T, Yoshino J, Kinoshita Y, Ichinose M, Matsui T; Japan Gastroenterological Endoscopy Society. Guidelines for gastroenterological endoscopy in patients undergoing antithrombotic treatment. Dig Endosc. 2014;26:1-14. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 262] [Cited by in F6Publishing: 325] [Article Influence: 32.5] [Reference Citation Analysis (0)] |
36. | Veitch AM, Vanbiervliet G, Gershlick AH, Boustiere C, Baglin TP, Smith LA, Radaelli F, Knight E, Gralnek IM, Hassan C, Dumonceau JM. Endoscopy in patients on antiplatelet or anticoagulant therapy, including direct oral anticoagulants: British Society of Gastroenterology (BSG) and European Society of Gastrointestinal Endoscopy (ESGE) guidelines. Gut. 2016;65:374-389. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 176] [Cited by in F6Publishing: 177] [Article Influence: 22.1] [Reference Citation Analysis (0)] |
37. | Lim H, Gong EJ, Min BH, Kang SJ, Shin CM, Byeon JS, Choi M, Park CG, Cho JY, Lee ST, Kim HG, Chun HJ. Clinical Practice Guideline for the Management of Antithrombotic Agents in Patients Undergoing Gastrointestinal Endoscopy. Clin Endosc. 2020;53:663-677. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 2] [Cited by in F6Publishing: 7] [Article Influence: 1.8] [Reference Citation Analysis (0)] |