Minireviews
Copyright ©The Author(s) 2021.
Artif Intell Gastrointest Endosc. Apr 28, 2021; 2(2): 12-24
Published online Apr 28, 2021. doi: 10.37126/aige.v2.i2.12
Table 1 Commonly used databases in image recognition of gastric cancer
Database
Time collected
Number of samples
Resolution
Training set
Test set
GR-AIDS[31]20191036496512 × 512829197103650
Jang Hyung Lee[32]2019787224 × 22471770
Toshiaki Hirasawa[33]201813584512 × 512135842496
Bum-Joo Cho[34]20195017512 × 5124205812
Hiroya Ueyama[35] 20207874512 × 51255742300
Lan Li[36] 20202088512 × 5121747341
Mads Sylvest Bergholt[37] 20111063512 × 512850213
Table 2 Specific concepts of the main evaluation indicators
Index
Description
Usage
Unit
DICERepeat rate between the segmentation results and markersCommonly%
RMSDThe root mean square of the symmetrical position surface distance between the segmentation results and the markersCommonlymm
VOEThe degree of overlap between the segmentation results and the actual segmentation results represents the error rateCommonly%
RVDThe difference in volume between the segmentation results and the markersRarely%
Table 3 Comparison of recognition performance of convolutional neural network, full convolutional neural network, and ensemble convolutional neural network models
Methods
DICE/%
VOE/%
RMSD/mm
Toshiaki Hirasawa (CNN)0.57380.59776.491
Hiroya Ueyama (CNN)0.63270.53737.257
Jang Hyung Lee (FCN)0.81020.3192.468
Bum-Joo Cho (FCN)0.93500.1221-
Dat Tien Nguyen (ECNN)0.89470.113-
Table 4 Comparison of convolutional neural network, full convolutional neural network, and generative adversarial network models
Model features
Contributions
Advantages
Disadvantages
Scope of application
CNNThe topology can be extracted from a two-dimensional image, and the backpropagation algorithm is used to optimize the network structure and solve the unknown parameters in the networkShared convolution kernel, processing high-dimensional data without pressure; Feature extraction can be done automaticallyWhen the network layer is too deep, the parameters near the input layer will be changed slowly by using BP propagation to modify parameters. A gradient descent algorithm is used to make the training results converge to the local minimum rather than the global minimum. The pooling layer will lose a lot of valuable informationSuitable for data scenarios with similar network structures
FCNThe end-to-end convolutional network is extended to semantic segmentation. The deconvolution layer is used for up-sampling; A skip connection is proposed to improve the roughness of the upper samplingCan accept any size; Input image; Jump junction; The structure combines fine layers and coarse; Rough layers, generating precise segmentationThe receptive field is too small to obtain the global information;Small storage overheadApplicable to large sample data
GANWith adversarial learning criteria, there are two No's: The same network, not a single networkCan produce a clearer, more realistic sample; any generated network can be trainedTraining is unstable and difficult to train; GAN is not suitable for processing data in discrete formSuitable for data generation (e.g., there are not many data sets with labels), image style transfer; Image denoising and restoration; Used to counter attacks