Published online May 16, 2022. doi: 10.4253/wjge.v14.i5.311
Peer-review started: May 13, 2021
First decision: July 4, 2021
Revised: July 15, 2021
Accepted: April 27, 2022
Article in press: April 27, 2022
Published online: May 16, 2022
Processing time: 368 Days and 9.4 Hours
Esophagitis is an inflammatory and damaging process of the esophageal mucosa, which is confirmed by endoscopic visualization and may, in extreme cases, result in stenosis, fistulization and esophageal perforation. The use of deep learning (a field of artificial intelligence) techniques can be considered to determine the presence of esophageal lesions compatible with esophagitis.
To develop, using transfer learning, a deep neural network model to recognize the presence of esophagitis in endoscopic images.
Endoscopic images of 1932 patients with a diagnosis of esophagitis and 1663 patients without any pathological diagnosis provenient from the KSAVIR and HyperKSAVIR datasets were splitted in training (80%) and test (20%) and used to develop and evaluate a binary deep learning classifier built using the DenseNet-201 architecture, a densely connected convolutional network, with weights pretrained on the ImageNet image set and fine-tuned during training. The classifier model performance was evaluated in the test set according to accuracy, sensitivity, specificity and area under the receiver operating characteristic curve (AUC).
The model was trained using Adam optimizer with a learning rate of 0.0001 and applying binary cross entropy loss function. In the test set (n = 719), the classifier achieved 93.32% accuracy, 93.18% sensitivity, 93.46% specificity and a 0.96 AUC. Heatmaps for spatial predictive relevance in esophagitis endoscopic images from the test set were also plotted. In face of the obtained results, the use of dense convolutional neural networks with pretrained and fine-tuned weights proves to be a good strategy for predictive modeling for esophagitis recognition in endoscopic images. In addition, adopting the classification approach combined with the subsequent plotting of heat maps associated with the classificatory decision gives greater explainability to the model.
It is opportune to raise new studies involving transfer learning for the analysis of endoscopic images, aiming to improve, validate and disseminate its use for clinical practice.
Core Tip: Considering the clinical relevance of esophagitis, we proposed a deep learning model for its diagnosis from endoscopic images of the Z-line, via binary classification of the images according to the presence or absence of esophageal inflammation signs. The excellent accuracy and area under the receiver operating characteristic curve achieved demonstrate the potential of the adopted strategy, consisting of the conjunction of densely connected neural networks and transfer learning. With this, we contribute to the improvement and methodological advancement in the development of automated diagnostic tools for the disease, which reveal great potential in optimizing the management of these patients.