T2-weighted imaging-based radiomic-clinical machine learning model for predicting the differentiation of colorectal adenocarcinoma

doi:10.4251/wjgo.v16.i3.819

Journal Information of This Article

Publication Name: World Journal of Gastrointestinal Oncology
ISSN: 1948-5204
Publisher: Baishideng Publishing Group Inc, 7041 Koll Center Parkway, Suite 160, Pleasanton, CA 94566, USA

Retrospective Study

World J Gastrointest Oncol. Mar 15, 2024; 16(3): 819-832
Published online Mar 15, 2024. doi: 10.4251/wjgo.v16.i3.819

Figure 1 Workflow of the study. The workflow for constructing a machine learning model based on T2-weighted images to predict the differentiation degree of colorectal cancer comprised segmentation, feature extraction, feature selection, model construction and validation. ROI: Region of interest; CV: Cross validation; MSE: Mean square error; ROC: Receiver operating characteristic; DCA: Decision curve analysis; GLCM: Grayscale co-occurrence matrix; GLRLM: Grayscale run length matrix; GLSZM: Grayscale size region matrix; NGTDM: Neighbourhood grayscale difference matrix; GLDM: Grayscale dependency matrix.

Figure 2 Examples of labelled lesions. A: Primary lesion of colorectal cancer (CRC) on an oblique axial T2-weighted image; B: The same level with the primary tumour delineated; the red contour marks the extent of the primary tumour at this level.

Figure 3 Distribution of radiomic features. A: The number and proportion of extracted radiomic features; B: Violin plots of all features with the corresponding P values, illustrating the central tendency and dispersion of the data. GLCM: Grayscale co-occurrence matrix; GLRLM: Grayscale run length matrix; GLSZM: Grayscale size region matrix; NGTDM: Neighbourhood grayscale difference matrix; GLDM: Grayscale dependency matrix.

Figure 4 The selection process of the least absolute shrinkage and selection operator method. A: Selection of the tuning parameter (lambda) in the least absolute shrinkage and selection operator model by 10-fold cross-validation with the minimum criterion; B: Eight radiomic features with nonzero coefficients were selected at the optimal lambda (lambda = 0.0146). MSE: Mean square error.
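
The tuning step described in the Figure 4 caption (choosing lambda by 10-fold cross-validation on the mean square error, then keeping the features with nonzero coefficients) can be sketched as follows. This is an illustrative pure-Python coordinate-descent implementation, not the authors' code; the function names (`lasso_fit`, `cv_mse`, `select_lambda`) and the default iteration count are assumptions made for the example. In practice a library routine such as scikit-learn's `LassoCV` would normally be used.

```python
import random

def soft_threshold(z, g):
    """Soft-thresholding operator used by the LASSO coordinate update."""
    if z > g:
        return z - g
    if z < -g:
        return z + g
    return 0.0

def lasso_fit(X, y, lam, n_iter=100):
    """Minimise (1/2n)*||y - b - Xw||^2 + lam*||w||_1 by coordinate descent."""
    n, p = len(X), len(X[0])
    cols = list(zip(*X))                 # feature columns
    w = [0.0] * p
    b = sum(y) / n
    resid = [yi - b for yi in y]         # residuals for w = 0
    for _ in range(n_iter):
        for j, col in enumerate(cols):
            norm = sum(c * c for c in col) / n
            if norm == 0.0:
                continue
            # Covariance of feature j with the residual that excludes feature j.
            rho = sum(c * (r + c * w[j]) for c, r in zip(col, resid)) / n
            new_w = soft_threshold(rho, lam) / norm
            delta = new_w - w[j]
            if delta:
                resid = [r - c * delta for c, r in zip(col, resid)]
                w[j] = new_w
        shift = sum(resid) / n           # re-centre the intercept
        b += shift
        resid = [r - shift for r in resid]
    return w, b

def cv_mse(X, y, lam, k=10, seed=0):
    """Mean square error of the LASSO fit under k-fold cross-validation."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    total = 0.0
    for fold in (idx[i::k] for i in range(k)):
        held = set(fold)
        train = [i for i in idx if i not in held]
        w, b = lasso_fit([X[i] for i in train], [y[i] for i in train], lam)
        for i in fold:
            pred = b + sum(wj * xj for wj, xj in zip(w, X[i]))
            total += (y[i] - pred) ** 2
    return total / len(X)

def select_lambda(X, y, lambdas, k=10):
    """Return the candidate lambda with the smallest cross-validated MSE."""
    return min(lambdas, key=lambda lam: cv_mse(X, y, lam, k))
```

After `select_lambda` picks a value, refitting with `lasso_fit` on the full training set and keeping the indices where the coefficient is nonzero gives the selected feature subset. Features are assumed to be standardised beforehand, which the sketch does not do.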

Figure 5 Histogram of the Rad-score based on the selected features. GLCM: Grayscale co-occurrence matrix; GLRLM: Grayscale run length matrix; NGTDM: Neighbourhood grayscale difference matrix.

Figure 6 Receiver operating characteristic curves of logistic regression, support vector machine, K-nearest neighbour, random forest, extra trees, extreme gradient boosting, light gradient boosting machine, and multilayer perceptron. A: In the training cohort, the area under the curve (AUC) values were 0.737, 0.986, 0.880, 1.000, 1.000, 1.000, 0.972, and 0.796, respectively; B: In the validation cohort, the AUC values were 0.728, 0.684, 0.629, 0.597, 0.620, 0.594, 0.601 and 0.735, respectively. Except for LR and MLP, the machine learning algorithms exhibited overfitting (training AUC well above validation AUC), and the AUC of MLP was greater than that of LR. LR: Logistic regression; SVM: Support vector machine; KNN: K-nearest neighbour; RF: Random forest; ET: Extra trees; XGBoost: Extreme gradient boosting; LightGBM: Light gradient boosting machine; MLP: Multilayer perceptron; ROC: Receiver operating characteristic; AUC: Area under the curve.
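
The overfitting remark in the Figure 6 caption can be checked directly from the reported values. The sketch below computes an AUC from labels and scores using the pairwise (Mann-Whitney) definition, then flags models whose training AUC exceeds the validation AUC by more than a chosen gap. The AUC pairs are copied from the caption; the 0.10 gap threshold and the function names are assumptions for illustration, not a criterion stated by the authors.

```python
def roc_auc(y_true, scores):
    """AUC as the probability that a positive case outscores a negative one."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# (training AUC, validation AUC) as reported in the Figure 6 caption.
REPORTED_AUC = {
    "LR": (0.737, 0.728),
    "SVM": (0.986, 0.684),
    "KNN": (0.880, 0.629),
    "RF": (1.000, 0.597),
    "ET": (1.000, 0.620),
    "XGBoost": (1.000, 0.594),
    "LightGBM": (0.972, 0.601),
    "MLP": (0.796, 0.735),
}

def overfit_models(auc_table, max_gap=0.10):
    """Names of models whose train-validation AUC gap exceeds max_gap.

    max_gap is a hypothetical cut-off chosen for this example only.
    """
    return [name for name, (train, val) in auc_table.items()
            if train - val > max_gap]

print(overfit_models(REPORTED_AUC))
# ['SVM', 'KNN', 'RF', 'ET', 'XGBoost', 'LightGBM']
```

With this cut-off, only LR (gap 0.009) and MLP (gap 0.061) are retained, which matches the caption's statement.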

Figure 7 The nomogram integrating clinical and radiomic features. For "nerve invasion" and "vascular invasion", "0" represents absent and "1" represents present; for "circumference", "0" represents ≤ 1/2 and "1" represents > 1/2.

Figure 8 Receiver operating characteristic curves of the radiomic model, clinical model and radiomic-clinical model. A: The area under the curve (AUC) values of the three models (clinical, radiomic, and radiomic-clinical) in the training cohort were 0.751 (95%CI: 0.661-0.842), 0.796 (95%CI: 0.723-0.869), and 0.862 (95%CI: 0.796-0.927), respectively; B: The AUC values of the three models (clinical, radiomic, and radiomic-clinical) in the validation cohort were 0.676 (95%CI: 0.525-0.827), 0.735 (95%CI: 0.604-0.866), and 0.761 (95%CI: 0.635-0.887), respectively. ROC: Receiver operating characteristic; AUC: Area under the curve.

Figure 9 Calibration curves of the three models (clinical, radiomic, and combined) for predicting colorectal cancer differentiation in the training cohort and the validation cohort. A: Calibration curves for the training cohort; B: Calibration curves for the validation cohort. The 45° line represents perfect agreement between the actual (y-axis) and nomogram-predicted (x-axis) probabilities of the differentiation grade. In both the training cohort and the validation cohort, the predicted probabilities of the clinical model and the radiomic model closely corresponded to the actual probabilities. Rad: Radiomics.
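
A calibration curve of the kind shown in Figure 9 compares predicted probabilities with observed event rates. The sketch below shows one common way to obtain the plotted points, by grouping predictions into equal-width probability bins; the binning scheme, the bin count, and the function name `calibration_points` are assumptions for illustration, and the caption does not state how the published curves were computed.

```python
def calibration_points(y_true, y_prob, n_bins=5):
    """Return (mean predicted probability, observed event rate) per bin.

    Predictions are grouped into n_bins equal-width bins on [0, 1];
    empty bins are skipped. Points on the 45-degree line indicate
    perfect calibration.
    """
    bins = [[] for _ in range(n_bins)]
    for y, p in zip(y_true, y_prob):
        k = min(int(p * n_bins), n_bins - 1)   # p == 1.0 goes in the last bin
        bins[k].append((p, y))
    points = []
    for members in bins:
        if members:
            mean_pred = sum(p for p, _ in members) / len(members)
            observed = sum(y for _, y in members) / len(members)
            points.append((mean_pred, observed))
    return points
```

Plotting the returned pairs with predicted probability on the x-axis and observed rate on the y-axis reproduces the layout described in the caption.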

Figure 10 Decision curve analysis of the prediction models. A: The training cohort; B: The validation cohort. The three models (clinical, radiomic, and radiomic-clinical) showed good clinical applicability over a certain range of threshold probabilities. DCA: Decision curve analysis.
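
Decision curve analysis plots net benefit against the threshold probability. As a minimal sketch, the standard net-benefit definition (true positives per patient minus false positives per patient weighted by the odds of the threshold) can be written as below, together with the "treat all" reference curve; "treat none" has a net benefit of zero at every threshold. The function names are illustrative, and this is not the authors' code.

```python
def net_benefit(y_true, y_prob, threshold):
    """Net benefit of acting on every case with predicted risk >= threshold."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    n = len(y_true)
    tp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 1)
    fp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 0)
    return tp / n - (fp / n) * threshold / (1.0 - threshold)

def net_benefit_treat_all(y_true, threshold):
    """Net benefit of the reference strategy that treats every case as positive."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1.0 - prevalence) * threshold / (1.0 - threshold)
```

A model is useful at a given threshold when its net benefit exceeds both the "treat all" value and zero; evaluating these functions over a grid of thresholds yields the curves in each panel.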

Citation: Zheng HD, Huang QY, Huang QM, Ke XT, Ye K, Lin S, Xu JH. T2-weighted imaging-based radiomic-clinical machine learning model for predicting the differentiation of colorectal adenocarcinoma. World J Gastrointest Oncol 2024; 16(3): 819-832
URL: https://www.wjgnet.com/1948-5204/full/v16/i3/819.htm
DOI: https://dx.doi.org/10.4251/wjgo.v16.i3.819