Retrospective Study Open Access
Copyright ©The Author(s) 2025. Published by Baishideng Publishing Group Inc. All rights reserved.
World J Radiol. Jun 28, 2025; 17(6): 106682
Published online Jun 28, 2025. doi: 10.4329/wjr.v17.i6.106682
Non-contrast computed tomography radiomics model to predict benign and malignant thyroid nodules with lobe segmentation: A dual-center study
Hao Wang, Yu-Sheng Du, You Wang, Zhuo-Jie Bai, Department of Radiology, The Fourth Affiliated Hospital of Nanjing Medical University, Nanjing 210031, Jiangsu Province, China
Xuan Wang, Di Wu, Wu-Liang Tang, Department of Radiology, Zhongda Hospital Southeast University (Jiangbei), Nanjing 210048, Jiangsu Province, China
Han-Ling Zeng, Jing Tao, Department of General Surgery, The Fourth Affiliated Hospital of Nanjing Medical University, Nanjing 210031, Jiangsu Province, China
Jian He, Department of Nuclear Medicine, Nanjing Drum Tower Hospital, The Affiliated Hospital of Medicine School, Nanjing University, Nanjing 210008, Jiangsu Province, China
ORCID number: Hao Wang (0000-0003-2659-7231); Jian He (0000-0001-8140-4610).
Co-first authors: Hao Wang and Xuan Wang.
Co-corresponding authors: Jing Tao and Jian He.
Author contributions: Wang H wrote the original manuscript, contributed software, and resources to the manuscript; Wang X reviewed and edited the manuscript; Wang H and Wang X wrote the manuscript, they contributed equally to this article, they are the co-first authors of this manuscript; Wang H, Du YS, and Tang WL organized the data; Wang Y and Wu D performed the validation; Wang H, Bai ZJ, and Zeng HL performed the methodology; Tao J supervised; He J performed project management; Tao J and He J contributed equally to this article, they are the co-corresponding authors of this manuscript; and all authors thoroughly reviewed and endorsed the final manuscript.
Supported by the Science and Technology Development Fund of Nanjing Medical University, No. NMUB20230037; and the Youth Scientific Research Nurturing Fund of Jiangbei Campus of Zhongda Hospital Affiliated with Southeast University, No. JB2024Q01.
Institutional review board statement: This study was approved by the Medical Ethics Committee of The Fourth Affiliated Hospital of Nanjing Medical University, approval No. 20240628-K077.
Informed consent statement: Due to the retrospective nature of the study, requirements for informed consent were waived.
Conflict-of-interest statement: All the authors report no relevant conflicts of interest for this article.
Data sharing statement: The data utilized and/or analyzed during the present study are available from the corresponding author upon reasonable request.
Open Access: This article is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution NonCommercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: https://creativecommons.org/Licenses/by-nc/4.0/
Corresponding author: Jian He, MD, PhD, Associate Professor, Chief Physician, Department of Nuclear Medicine, Nanjing Drum Tower Hospital, The Affiliated Hospital of Medicine School, Nanjing University, No. 321 Zhongshan Road, Nanjing 210008, Jiangsu Province, China. hjxueren@126.com
Received: March 12, 2025
Revised: April 17, 2025
Accepted: May 21, 2025
Published online: June 28, 2025
Processing time: 106 Days and 20.8 Hours

Abstract
BACKGROUND

Accurate preoperative differentiation of benign and malignant thyroid nodules is critical for optimal patient management. However, conventional imaging modalities present inherent diagnostic limitations.

AIM

To develop a non-contrast computed tomography-based machine learning model integrating radiomics and clinical features for preoperative thyroid nodule classification.

METHODS

This multicenter retrospective study enrolled 272 patients with thyroid nodules (376 thyroid lobes) from center A (May 2021-April 2024), using histopathological findings as the reference standard. The dataset was stratified into a training cohort (264 lobes) and an internal validation cohort (112 lobes). Additional prospective temporal (97 lobes, May-August 2024, center A) and external multicenter (81 lobes, center B) test cohorts were incorporated to enhance generalizability. Thyroid lobes were segmented along the isthmus midline, with segmentation reliability confirmed by an intraclass correlation coefficient (≥ 0.80). Radiomics feature extraction was performed using Pearson correlation analysis followed by least absolute shrinkage and selection operator regression with 10-fold cross-validation. Seven machine learning algorithms were systematically evaluated, with model performance quantified through the area under the receiver operating characteristic curve (AUC), Brier score, decision curve analysis, and DeLong test for comparison with radiologists interpretations. Model interpretability was elucidated using SHapley Additive exPlanations (SHAP).

RESULTS

The extreme gradient boosting model demonstrated robust diagnostic performance across all datasets, achieving AUCs of 0.899 [95% confidence interval (CI): 0.845-0.932] in the training cohort, 0.803 (95%CI: 0.715-0.890) in internal validation, 0.855 (95%CI: 0.775-0.935) in temporal testing, and 0.802 (95%CI: 0.664-0.939) in external testing. These results were significantly superior to radiologists assessments (AUCs: 0.596, 0.529, 0.558, and 0.538, respectively; P < 0.001 by DeLong test). SHAP analysis identified radiomic score, age, tumor size stratification, calcification status, and cystic components as key predictive features. The model exhibited excellent calibration (Brier scores: 0.125-0.144) and provided significant clinical net benefit at decision thresholds exceeding 20%, as evidenced by decision curve analysis.

CONCLUSION

The non-contrast computed tomography-based radiomics-clinical fusion model enables robust preoperative thyroid nodule classification, with SHAP-driven interpretability enhancing its clinical applicability for personalized decision-making.

Key Words: Papillary thyroid carcinoma; Thyroid nodules; Radiomics; Machine learning; Non-contrast computed tomography

Core Tip: This study introduces a novel non-contrast computed tomography-based machine learning model integrating radiomics and clinical features with lobe segmentation for preoperative differentiation of benign and malignant thyroid nodules. Leveraging dual-center data and thyroid lobe segmentation, the extreme gradient boosting model demonstrated superior diagnostic accuracy and stability across diverse cohorts, outperforming traditional radiologist assessments. Key predictors, including radiomic score, age, and tumor size group, calcify and cystic, were showed through SHAP analysis, enhancing model interpretability. The approach offers a robust, non-invasive tool for personalized preoperative decision-making, with the potential to improve clinical management of thyroid nodules.



INTRODUCTION

Thyroid nodules, the most prevalent endocrine tumors, have been ranked as the third most common malignancy in thyroid cancer according to the 2022 China Cancer Burden Report[1]. Among these, papillary thyroid carcinoma (PTC) represents the predominant malignant subtype, accounting for 85%-90% of all thyroid malignancies[2]. Although PTC generally exhibits a favorable prognosis, regional lymph node metastasis occurs in 30%-80% of patients, complicating diagnosis and increasing the risk of adverse outcomes[3,4]. Clinical management strategies vary significantly depending on malignancy status: Benign nodules are typically monitored conservatively, whereas malignant cases necessitate radical surgery, often supplemented by radioactive iodine therapy and targeted treatments[5]. This highlights the critical need for early and accurate differentiation between benign and malignant nodules to optimize therapeutic strategies and improve patient prognosis[6].

The National Comprehensive Cancer Network guidelines emphasize the essential role of imaging in thyroid nodule evaluation[7]. Ultrasound (US) is the primary diagnostic modality due to its noninvasive nature, absence of ionizing radiation, and ease of operation[8]. However, its diagnostic accuracy remains suboptimal (60%-70%) due to spatial resolution limitations and operator-dependent variability, particularly in detecting microcalcifications and occult lesions[9]. While US-guided fine-needle aspiration is the diagnostic gold standard, its invasive nature poses risks of bleeding and infection. Additionally, the American Thyroid Association discourages fine-needle aspiration for nodules < 1 cm unless local infiltration or lymph node metastasis is suspected[10]. Contrast-enhanced computed tomography (CECT) offers superior spatial resolution and 3D reconstruction for preoperative assessment of tumor invasiveness, but its clinical utility is constrained by radiation exposure, iodine contrast risks, and limited quantitative analytical capacity[11]. These limitations underscore the need for a more accessible, low-risk diagnostic approach with improved efficacy.

Recent advancements in artificial intelligence have revolutionized medical imaging analysis, with machine learning algorithms demonstrating significant potential in complex image classification by leveraging large-scale data training, thereby enhancing diagnostic accuracy and consistency[12]. The field of radiomics, first introduced in 2012, enables noninvasive tumor characterization through high-throughput extraction of imaging features, transforming visual data into quantifiable metrics[13-15]. Current radiomics research on thyroid nodules has predominantly focused on US and CECT, while non-contrast computed tomography (NCCT) remains underutilized[16,17]. NCCT presents several advantages for thyroid nodule assessment. Unlike CECT, it eliminates the need for iodine-based contrast agents while delivering lower radiation exposure. Additionally, NCCT provides sufficient morphological detail of thyroid nodules, particularly in characterizing calcification patterns - a critical feature for differentiating benign from malignant lesions. Moreover, many studies rely on single-nodule segmentation, which may introduce bias due to manual lesion localization and margin delineation in multinodular cases[18]. This study aims to develop an NCCT-based radiomics machine learning model utilizing thyroid lobe segmentation and to evaluate its clinical utility in thyroid nodule diagnosis, providing a low-risk, efficient diagnostic tool for early detection.

MATERIALS AND METHODS
Patient’s selection

This retrospective study was conducted in accordance with the Declaration of Helsinki and received formal approval from the Institutional Review Board of Nanjing Medical University's Fourth Affiliated Hospital, approval No. 20240628-K077). The ethics committee granted an exemption from informed consent requirements, as per institutional policies for retrospective data analysis. The study population comprised patients who underwent thyroid resection between May 2021 and April 2024. Eligible patients met the following inclusion criteria: (1) Histopathological confirmation of thyroid nodules, with malignant cases classified as papillary carcinoma and benign cases as adenoma/nodular goiter; (2) Preoperative neck computed tomography (CT) performed within 15 days prior to surgery; and (3) Thyroid nodules with a maximum diameter of ≥ 2 mm. Exclusion criteria included: (1) Diffuse thyroid pathology (Hashimoto’s or granulomatous thyroiditis; n = 8); (2) Nodules < 2 mm (n = 21); (3) Non-diagnostic image quality (n = 19); (4) Nodules located in the isthmus or pyramidal lobe (n = 4); (5) History of malignancy or prior radiotherapy (n = 12); and (6) Age < 18 years (n = 1). After excluding 65 ineligible patients, 376 thyroid lobes from 272 patients (104 bilateral cases) were included and stratified into training and validation cohorts (7:3 ratio) using the thyroid isthmus midline as the anatomical boundary. Additionally, the study incorporated a temporal test cohort (May-August 2024; 97 lobes from 75 patients) and an external test cohort (Zhongda Hospital; January-June 2024; 81 lobes from 75 patients) for independent evaluation. The study adhered to the CLEAR checklist[19], with patient selection workflows illustrated in Figures 1 and 2.

Figure 1
Figure 1 Study inclusion flowchart. CT: Computed tomography; PTC: Papillary thyroid carcinoma; TA: Thyroid adenoma; NG: Nodular goiter.
Figure 2
Figure 2 Workflow of model development. CT: Computed tomography; PTC: Papillary thyroid carcinoma; TA: Thyroid adenoma; NG: Nodular goiter.
Imaging acquisition

Patients enrolled in the training, validation, and temporal test cohorts received both NCCT and CECT examinations through the IQon Spectral CT (Philips Medical Systems, Netherlands). The imaging protocol encompassed anatomical regions extending from the skull base through the superior mediastinal compartment. Subjects were positioned in a standardized supine orientation with cervical extension and shoulder depression to reduction of cervical sclerotic bundle artifacts while maintaining normal steadily breathing throughout image acquisition. Acquisition parameters included a 120 kVp tube voltage, DoseRight auto-adjusting tube current (mean 145 mAs, Index 23), 64 × 0.625 mm collimation, 250 mm field of view, pitch 0.969, matrix 512 × 512, iDose4 iterative reconstruction, and 1 mm isotropic resolution (350/60 window settings). CECT involved dual-phase imaging at 25 seconds (arterial) and 60 seconds (venous) post-iohexol injection (320 mgI/mL, 3 mL/s), with Digital Imaging and Communications in Medicine data archived in the Picture Achieving and Communication System. The external test cohort underwent imaging on a Revolution CT scanner (GE Healthcare, United states) using distinct acquisition parameters: 120 kVp tube voltage, automated tube current modulation, 256 mm × 0.625 mm collimation, 250 mm field of view, pitch 0.992, matrix 512 × 512, Adaptive Statistical Iterative Reconstruction-VEO (50%), and 0.625 mm slice thickness (350/50 window settings). Notably, contrast-phase data were excluded from all analyses to maintain NCCT specificity.

Pathological results, imaging features, and clinical variable collection

Histopathological assessment was performed on formalin-fixed, paraffin-embedded surgical specimens obtained via total or hemithyroidectomy. Systematic sampling was conducted, with representative sections harvested from each nodule - particularly targeting regions exhibiting suspicious calcifications or cytological atypia. All specimens underwent blinded evaluation by two fellowship-trained thyroid pathologists (with 15 years and 10 years of subspecialty experience, respectively). Malignant lesions were histologically confirmed as PTC, while benign pathologies were classified as follicular adenomas or nodular goiters. radiologist A (junior, 15 years’ experience) and radiologist B (senior, 25 years’ experience) independently evaluated CT imaging features through Picture Achieving and Communication System workstations, including tumor size group [small (≤ 5 mm), intermediate (5-10 mm), large (> 10 mm)], multiple nodules, calcify, and cystic. Interobserver discrepancies (18 cases, 3.25%) primarily involved sub-centimeter nodule measurements (6 cases) and microcalcification detection (12 cases) and were adjudicated by a third radiologist (25 years of experience). Calcify and cystic exhibited perfect inter-rater agreement due to standardized diagnostic criteria. Demographic and biochemical parameters - including age, sex, body mass index, free triiodothyronine, free thyroxine, thyroid-stimulating hormone, thyroglobulin antibody, and thyroid peroxidase antibody - were extracted from the hospital information system.

For human-machine comparative analysis, both radiologists performed blinded NCCT evaluations on the training and validation cohorts following a one-month washout period and standardized diagnostic training. The assessment protocol restricted access to clinical data, allowing only laterality information while concealing patient identifiers, cohort allocation, and prior interpretations. Malignancy classification was based on the following criteria: (1) Irregular nodule morphology; (2) Extracapsular invasion or infiltration into adjacent tissues; (3) Lower nodule density relative to surrounding thyroid parenchyma; (4) Gravelly microcalcifications within the lesion; and (5) Presence of metastatic cervical lymph nodes[20].

Lobe thyroid tissue segmentation

Segmentation was conducted using 3D Slicer 5.7.0 (https://www.slicer.org) with isotropic voxel resampling (0.5 mm × 0.5 mm × 1 mm) to enhance spatial resolution for small thyroid structures. A radiographer with 14 years of CT imaging expertise performed segmentation of the thyroid lobe region of interest on axial CT slices, guided by pathological diagnosis results. The segmentation adhered to anatomical boundaries defined by the midline of the thyroid isthmus. Radiologist A subsequently conducted a meticulous layer-by-layer review to verify delineation accuracy.

Radiomics feature extraction and consistency validation

The radiomics extraction process was conducted utilizing the PyRadiomics toolkit (https://github.com/Radiomics/pyradiomics), which resulted in the generation of 1130 features. These features encompassed 14 shape descriptors, 18 first-order histogram-based indices, and 75 textural characteristics derived through grey-level matrix transformations (including GLCM, GLDM, GLRLM, GLSZM, and NGTDM analyses). In addition, 279 Log-filtered features and 744 features obtained through wavelet decomposition were identified. To assess segmentation reproducibility, 40 randomly selected thyroid lobes underwent duplicate manual segmentation by radiologist A, followed by an independent segmentation by radiographer A one week later. The intraclass correlation coefficient (ICC) quantified feature consistency across intra- and inter-observer assessments, with features demonstrating ICC ≥ 0.8 retained to ensure robustness against segmentation variability.

Feature engineering and establishment of radiomics score

All radiomic features were standardized using Z-score normalization [Z = (X - μ)/σ, where X represents the feature value, μ the mean, and σ the standard deviation] to mitigate scale disparities. To counteract class imbalance in the training cohort, where benign samples were underrepresented, synthetic minority oversampling was applied at a 1:1.3 ratio to equalize benign and malignant cases. Feature selection involved Pearson correlation analysis, retaining a single representative feature per cluster exhibiting r > 0.7 to minimize multicollinearity. Least absolute shrinkage and selection operator (LASSO) regression with 10-fold cross-validation determined the optimal regularization parameter (λ min) and identified non-zero coefficient features, culminating in the construction of the radiomic score (RadScore) through linear combination: RadScore = Σ (feature_weight × feature_value) + intercept.

Machine learning model construction and evaluation

Seven machine learning models were constructed based on differential analysis of clinical features in the training cohort, integrating statistically significant variables alongside RadScore. These models included decision trees, random forests (RF), logistic regression, support vector machines (SVM), extreme gradient boosting (XGB), K-nearest neighbors, and light gradient boosting machines. Model performance was evaluated by comparing the area under the receiver operating characteristic curve (AUC), with the optimal algorithm determined by the highest AUC values and further visualized through confusion matrices. Calibration curves assessed predictive accuracy, while decision curve analysis quantified clinical net benefit across probability thresholds. Feature importance was interpreted using SHapley Additive exPlanations (SHAP) values to determine variable contributions to model predictions.

Statistical analysis

All analyses were conducted in R software (v4.2.1), utilizing the glmnet package for LASSO regression and the rms package for nomogram construction and calibration curve generation. The normality of data distribution was assessed via the Kolmogorov-Smirnov test, with non-normally distributed data expressed as median (interquartile range) and compared using the Mann-Whitney U test. Categorical variables were presented as frequencies (percentages) and analyzed via the χ2 test. The performance metrics of the model included AUC, accuracy, sensitivity, specificity, positive and negative predictive values (positive predictive value/negative predictive value), precision, recall, F1 score, and Brier score. AUC comparisons across models were conducted using DeLong's test, with statistical significance defined as P < 0.05.

RESULTS
Baseline characteristics

The retrospective cohort comprised 272 patients [60 males (22.1%), 212 females (77.9%); mean age, 48.2 ± 13.7 years] with 376 thyroid lobes, including 104 bilateral cases. Histopathological evaluation identified 270 PTC and 106 benign lobes. Patients were randomly allocated into a training cohort (271 lobes) and a validation cohort (105 lobes), with no significant baseline differences between groups (all P > 0.05). The temporal test cohort included 75 patients (97 lobes, 22 bilateral), while the external test cohort consisted of 75 patients (81 lobes, 6 bilateral) from an independent institution (Table 1). Univariate analysis identified four malignancy-associated variables: Patient age (P = 0.016), nodule size category (P < 0.001), calcify patterns (P = 0.003), and cystic (P = 0.012). Comparative statistics between benign and malignant characteristics are detailed in Table 2.

Table 1 Multi-cohort comparison of thyroid lobes characteristics, n (%).
Characteristics
Training cohort
Validation cohort
Temporal test cohort
External test cohort
P valuea
P valueb
P valuec
Groupedn = 264n = 112n = 97n = 810.2030.6050.053
Benign80 (30)26 (23)26 (27)15 (19)
Malignant184 (70)86 (77)71 (73)66 (81)
Gender0.3550.4150.463
Female211 (80)84 (75)73 (75)61 (75)
Male53 (20)28 (25)24 (25)20 (25)
Age (years), median (P25-P75)51.00 (38.75-60.00)48.50 (35.75-58.00)50.00 (41.00-57.00)52.00 (41.00-60.00)0.1000.9440.413
BMI, median (P25-P75)24.46 (22.04-26.30)24.73 (22.18-27.05)24.77 (23.11-26.9)25.01 (24.29-25.78)0.2880.1610.042
Multiple nodules0.5170.5190.036
No148 (56)58 (52)50 (52)34 (42)
Yes116 (44)54 (48)47 (48)47 (58)
Tumor size group0.0950.0010.007
Small (≤ 5 mm)43 (16)29 (26)21 (22)24 (30)
Medium (5-10 mm)99 (38)38 (34)52 (54)33 (41)
Large (> 10 mm)122 (46)45 (40)24 (25)24 (30)
Calcify0.6290.344< 0.001
No192 (73)78 (70)76 (78)42 (52)
Yes72 (27)34 (30)21 (22)39 (48)
Cystic0.7410.399< 0.001
No229 (87)95 (85)88 (91)38 (47)
Yes35 (13)17 (15)9 (9)43 (53)
FT3 (pmol/L), median (P25-P75)4.76 (4.39-5.09)4.76 (4.44-4.99)4.71 (4.46-5.03)4.79 (4.32-5.45)0.8910.9180.299
FT4 (pmol/L), median (P25-P75)16.20 (14.88-17.90)16.20 (14.57-18.30)17.40 (15.70-19.00)16.80 (14.50-19.60)0.9760.0020.204
TSH (mU/L), median (P25-P75)1.73 (1.37-2.44)1.62 (1.13-2.59)2.16 (1.38-2.67)1.68 (1.04-3.01)0.0980.1530.665
TGAb (IU/mL), median (P25-P75)16.70 (15.4-21.45)16.70 (15.3-17.72)17.00 (15.81-17.90)17.80 (17.30-18.10)0.1730.7890.001
TPOAb (IU/mL), median (P25-P75)13.00 (12.35-13.22)13.00 (11.35-15.10)15.00 (14.20-15.74)12.60 (12.10-13.50)0.707< 0.001< 0.001
Table 2 Comparison between the benign and malignant thyroid lobes in the training cohort, n (%).
Characteristics
Benign (n = 80)
Malignant (n = 184)
P value
Gender0.234
Female68 (85)143 (78)
Male12 (15)41 (22)
Age (years), median (P25-P75)54.50 (49.00-64.25)49.00 (35.00-57.25)< 0.001
BMI, median (P25-P75)24.16 (21.93-25.98)24.80 (22.19-26.35)0.296
Multiple nodules0.716
No43 (54)105 (57)
Yes37 (46)79 (43)
Tumor size group< 0.001
Small (≤ 5 mm)4 (5)39 (21)
Medium (5-10 mm)21 (26)78 (42)
Large (> 10 mm)55 (69)67 (36)
Calcify0.028
No66 (82)126 (68)
Yes14 (18)58 (32)
Cystic< 0.001
No57 (71)172 (93)
Yes23 (29)12 (7)
FT3 (pmol/L), median (P25-P75)4.76 (4.43-5.01)4.76 (4.39-5.14)0.421
FT4 (pmol/L), median (P25-P75)16.20 (14.88-16.92)16.20 (14.95-18.05)0.142
TSH (mU/L), median (P25-P75)1.73 (1.40-2.03)1.73 (1.37-2.63)0.227
TGAb (IU/mL), median (P25-P75)16.70 (15.62-18.05)16.70 (15.4-25.85)0.164
TPOAb (IU/mL), median (P25-P75)13.00 (12.80-13.82)13.00 (12.35-13.00)0.671
Feature screening and radscore calculation

Radiomic feature extraction using PyRadiomics yielded 1130 features per thyroid lobe, subjected to a multi-stage refinement process. Inter- and intra-observer reliability analysis (ICC ≥ 0.8) retained 612 stable features, followed by Pearson correlation filtering (r ≥ 0.7), which removed 51 redundant parameters. LASSO regression with ten-fold cross-validation (optimal λ = 0.01236625) identified 27 non-zero discriminative predictors. These features were linearly combined using LASSO-derived coefficients to generate the RadScore, which demonstrated significant diagnostic stratification between benign and malignant lesions, as detailed in Table 3 and illustrated through feature selection dynamics in Figures 3 and 4.

Figure 3
Figure 3 Least absolute shrinkage and selection operator feature selection and comparison. A: Least absolute shrinkage and selection operator feature selection plot, where the horizontal axis represents the logarithm of the regularization parameter (λ), and the vertical axis denotes the corresponding feature weights. As λ increases, feature weights progressively decrease until exclusion at zero; B: 10-fold cross-validation curve, where the vertical axis represents the cross-validated mean squared error. The λ-value at the lowest point (λ min) is identified as the optimal regularization parameter, balancing model complexity and prediction error; C: 27 radiomic features and their corresponding weight coefficients retained after least absolute shrinkage and selection operator regression; D: Correlation heatmap of the 27 selected radiomic features, with darker colors indicating stronger correlations.
Figure 4
Figure 4 Radiomic score distribution across cohorts. A: Comparison of radiomic score (Radscore) between benign and malignant lesions in the training; B: Comparison of Radscore between benign and malignant lesions in the validation; C: Comparison of Radscore between benign and malignant lesions in the temporal test; D: Comparison of Radscore between benign and malignant lesions in the external test cohorts. aP < 0.05; bP < 0.001.
Table 3 Weights of least absolute shrinkage and selection operator selected features and training set Z-score parameters.
Feature names
Weight
Average
Variance
Intercept0.250252--
Original_shape_Elongation0.3110960.5240.104
Original_firstorder_10Percentile0.21334950.02915.014
Original_firstorder_interquartilerange0.15709229.21610.044
Original_glcm_maximumprobability-0.103440.3040.092
Original_gldm_dependenceentropy0.1921436.1330.351
Original_ngtdm_Contrast-0.034290.0090.006
Original_ngtdm_Strength-0.009310.4721.371
Log-sigma-1-mm-3D_glcm_InverseVariance-0.444710.3100.026
Log-sigma-1-mm-3D_glszm_LargeAreaHighGrayLevelEmphasis0.32620568429444.000388518307.356
log-sigma-1-mm-3D_glszm_LargeAreaLowGrayLevelEmphasis-0.0243810850.30046169.980
Log -sigma-2-mm-3D_firstorder_Maximum0.17341656.16427.585
Log -sigma-2-mm-3D_glrlm_RunLengthNonUniformity-0.116351356.938450.535
Log -sigma-2-mm-3D_glszm_ZoneEntropy-0.187665.2560.371
Log -sigma-3-mm-3D_firstorder_TotalEnergy-0.2676857352644.00020352814.000
Log -sigma-3-mm-3D_glcm_ClusterProminence0.1238951173.183544.414
Wavelet-LLH_firstorder_Mean0.151319.8502.798
Wavelet-LLH_firstorder_Skewness0.088985-1.8691.851
Wavelet-LHH_glcm_InverseVariance-0.388860.5030.005
Wavelet-LHH_gldm_LargeDependenceHighGrayLevelEmphasis-0.250053423.8153331.008
wavelet-HLL_glszm_GrayLevelNonUniformityNormalized0.1042640.1590.094
Wavelet-HLL_glszm_LargeAreaLowGrayLevelEmphasis-0.0597810742.59083666.610
Wavelet-HLH_glcm_InverseVariance-0.145260.5010.005
Wavelet-HHL_glszm_GrayLevelNonUniformity0.08703918.16612.923
Wavelet-HHH_firstorder_InterquartileRange0.2849186.4971.234
Wavelet-LLL_glcm_InverseVariance-0.453740.4480.023
Wavelet-LLL_gldm_DependenceEntropy-0.273257.0390.328
Wavelet-LLL_glszm_GrayLevelNonUniformity-0.2356939.39414.681
Model construction and evaluation

Five clinically significant predictors - RadScore, age, tumor size group, calcify pattern, and cystic - were integrated into seven machine learning algorithms. Comprehensive evaluation across multiple validation metrics identified the XGB model as the optimal classifier, demonstrating superior discriminative performance. The XGB model achieved an AUC of 0.889 [95% confidence interval (CI): 0.845-0.932] in the training cohort, 0.803 (95%CI: 0.715-0.890) in the validation cohort, 0.855 (95%CI: 0.775-0.935) in the temporal test, and 0.802 (95%CI: 0.664-0.939) in the external test. Robust accuracy (0.696-0.845) and F1 scores (0.790-0.852) were maintained across all datasets, model performance across cohorts is summarized in Table 4. The model exhibited high calibration fidelity (Brier scores: 0.121-0.144) and substantial clinical utility, as indicated by decision curve analysis with net benefit thresholds exceeding 20% probability (Figure 5). These results collectively validate the XGB model’s generalizability and reliability in thyroid nodule characterization.

Figure 5
Figure 5 Performance evaluation of seven machine learning models. A: Receiver operating characteristic curves of different models across the training, validation, temporal test, and external test cohorts (from left to right). The area under the receiver operating characteristic curve quantifies classification performance, with higher area under the receiver operating characteristic curve values indicating superior discrimination; B: Calibration curves for different models in the training, validation, temporal test, and external test cohorts (from left to right). These curves assess the agreement between predicted probabilities and actual outcomes, with lower Brier scores indicating better calibration, as reflected by proximity to the 45° diagonal line; C: Decision curve analysis in the training, validation, temporal test, and external test cohorts (from left to right), evaluating the clinical net benefit of different models across probability thresholds. Overall, the extreme gradient boosting model exhibits consistent and superior performance across multiple cohorts. DT: Decision trees; KNM: K-nearest neighbors; LR: Logistic regression; RF: Random forests; LGBM: Light gradient boosting machines; SVM: Support vector machines; XGB: Extreme gradient boosting; AUC: Area under the receiver operating characteristic curve.
Table 4 Multi-cohort predictive performance of various models.
Model
AUC (95%CI)
Accuracy
Sensitivity
Specificity
PPV
NPV
Precision
Recall
F1
Brier
Training cohort (benign = 80, malignant = 184), SMOTE (benign = 104)
LR0.845 (0.794-0.896)0.7990.8320.7250.8740.6520.8740.8320.8520.140
DT0.806 (0.745-0.866)0.8260.8290.8150.9460.5500.9460.8290.8830.136
RF1.000 (1.000-1.000)1.0001.0001.0001.0001.0001.0001.0001.0000.027
XGB0.899 (0.845-0.932)0.8450.7830.8750.9350.6360.9350.7830.8520.125
SVM0.844 (0.791-0.896)0.8140.8640.7000.8690.6910.8690.8640.8660.141
KNN1.000 (1.000-1.000)1.0001.0001.0001.0001.0001.0001.0001.0000.031
LGBM0.692 (0.631-0.752)0.5610.7550.6000.8130.5160.8130.7550.7830.208
Senior radiologist0.596 (0.505-0.633)0.5420.5000.6380.7600.3570.7600.5000.6030.209
Junior radiologist0.529 (0.464-0.594)0.4960.4460.6130.7360.3240.7260.4460.5520.211
Validation cohort (benign = 26, malignant = 86)
LR0.834 (0.750-0.917)0.7680.8370.5380.8570.5000.8570.8370.8470.128
DT0.646 (0.529-0.762)0.6960.8020.3460.8020.3460.8020.8020.8020.218
RF0.729 (0.620-0.838)0.759 0.8600.4230.8310.4780.8310.8600.8460.166
XGB0.803 (0.715-0.890)0.6960.7440.5380.8420.3890.8420.7440.7900.144
SVM0.820 (0.728-0.912)0.7950.8600.5770.8710.5560.8710.8600.8650.130
KNN0.793 (0.697-0.890)0.7410.7790.6150.8700.4570.8700.7790.8220.180
LGBM0.724 (0.628-0.820)0.5450.4300.9230.9490.3290.9490.4300.5920.182
Senior radiologist0.558 (0.449-0.667)0.5270.5000.6150.8110.2710.8110.5000.6190.182
Junior radiologist0.538 (0.428-0.649)0.5180.5000.5770.7960.2590.7960.5000.6140.182
Temporal test cohort (benign = 26, malignant = 71)
LR0.814 (0.717-0.912)0.8250.9300.5380.8460.7370.8460.9300.8860.137
DT0.757 (0.652-0.851)0.7420.7800.5330.9010.3080.9010.7800.8370.178
RF0.795 (0.696-0.894)0.7630.9010.3850.8000.5880.8000.9010.8480.152
XGB0.855 (0.775-0.935)0.7730.8310.6150.8550.5710.8550.8310.8430.139
SVM0.816 (0.719-0.913)0.8450.9720.5000.8410.8670.8410.9720.9020.141
KNN0.719 (0.607-0.832)0.7110.8030.4620.8030.4620.8030.8030.8030.208
LGBM0.800 (0.697-0.904)0.8350.8730.7310.8990.6790.8990.8730.8860.193
External test cohort (benign = 15, malignant = 66)
LR0.782 (0.607-0.907)0.7650.8480.4000.8620.3750.8620.8480.8550.127
DT0.715 (0.566-0.863)0.7780.8870.4210.8330.5330.8330.8870.8590.166
RF0.751 (0.589-0.913)0.8150.8790.5330.8920.5000.8920.8790.8850.143
XGB0.802 (0.644-0.939)0.7280.7270.7330.9230.3790.9230.7270.8140.121
SVM0.800 (0.674-0.926)0.8150.9090.4000.8700.5000.8700.9090.8890.120
KNN0.813 (0.697-0.928)0.8030.8480.6000.9030.4740.9030.8480.8750.158
LGBM0.728 (0.668-0.789)0.7530.8480.3330.8480.3330.8480.8480.8480.163
Model performance and interpretability

In the validation cohort, the XGB model outperformed both human radiologists (P < 0.001) and other machine learning models, as confirmed by the DeLong test (decision trees: P < 0.05; RF: P < 0.01), as detailed in Table 5. SHAP analysis ranked RadScore as the most influential predictor, followed by age, tumor size group, calcify, and cystic. This interpretability framework was further corroborated through representative case illustrations (Figure 6), highlighting the distinct predictive contributions of each feature in one benign and one malignant case.

Figure 6
Figure 6 SHapley Additive exPlanations analysis and clinical application. A: Honeycomb and bar plot ranking SHapley Additive exPlanations values by feature importance; B: Case illustration of a 67-year-old female with an 8 mm right thyroid lobe nodule, pathologically confirmed as nodular goiter; C: Case illustration of a 43-year-old female with a 4 mm right thyroid lobe nodule, pathologically confirmed as papillary thyroid carcinoma. SHAP: SHapley Additive exPlanations.
Table 5 DeLong test for different models and human radiologists in the validation cohort.
Models
DT
LR
RF
XGB
SVM
KNN
LGBM
Senior radiologist
Junior radiologist
DT-0.0110.118< 0.001< 0.0010.0220.3090.2350.179
LR0.011-0.1370.6150.8330.5370.093< 0.001< 0.001
RF0.1180.137-0.0380.0160.1000.9500.0310.017
XGB< 0.0010.6150.038-0.4690.8250.239< 0.001< 0.001
SVM< 0.0010.8330.0160.469-0.5350.159< 0.001< 0.001
KNN0.0220.5370.1000.8250.535-0.3210.002< 0.001
LGBM0.3090.0930.9500.2390.1590.321-0.0170.008
Senior radiologist0.235< 0.0010.031< 0.001< 0.0010.0020.017-0.810
Junior radiologist0.179< 0.0010.017< 0.001< 0.001< 0.0010.0080.810-
DISCUSSION

This retrospective study developed a machine learning model integrating NCCT radiomics and clinical features for thyroid nodule malignancy prediction. The XGB algorithm exhibited superior stability and generalizability across all datasets. Leveraging NCCT-derived radiomic signatures, the model enables noninvasive differentiation of thyroid nodules, presenting a novel framework for precision diagnostics. SHAP-based interpretability further enhances its clinical applicability by providing transparent decision-support insights.

PTC, the most prevalent thyroid malignancy, is characterized by BRAFV600E mutations and rearranged during transfection/PTC rearrangements, both of which drive tumor progression via sustained mitogen activated protein kinase/extracellular signal regulated kinase pathway activation[21]. In contrast, benign nodules typically maintain follicular architecture, exhibit homogeneous colloid distribution, and lack infiltrative margins[22]. Conventional imaging modalities, including US, which detects microcalcifications with 91% specificity, and CT, which assesses capsular integrity with 85% sensitivity, remain limited in their capacity to quantify tumor microenvironment heterogeneity or molecular features[23]. Radiomics, a high-throughput analytical approach, addresses these limitations by facilitating machine learning-driven differentiation of malignant phenotypes[24]. Feature analysis revealed multidimensional distinctions: Shape irregularity correlated with invasive growth patterns[25], first-order statistics reflected grayscale dispersion associated with cellular density heterogeneity[26], and texture features captured microstructural disorganization characteristic of malignancy[27]. Additionally, Log and wavelet transformations enhanced sensitivity to microcalcifications and angiogenic patterns[28]. Beyond diagnostic applications, these imaging biomarkers provide mechanistic insights into thyroid carcinogenesis and may inform targeted therapeutic strategies[29].

Our XGB model demonstrated robust and consistent diagnostic performance across all evaluation cohorts, achieving AUC values of 0.899 (95%CI: 0.845-0.932) in training, 0.803 (95%CI: 0.715-0.890) in validation, 0.855 (95%CI: 0.775-0.935) in temporal testing, and 0.802 (95%CI: 0.664-0.939) in external testing, all significantly superior to radiologists assessments (AUC range: 0.529-0.596; P < 0.001 by DeLong test). These results corroborate previous CT-based radiomics studies including Kong et al[30] (AUC = 0.84 using arterial-phase CT) and Lin et al[31] (AUC = 0.92 with multiphase-enhanced CT), while introducing the novel application of NCCT-derived RadScore that leverages high spatial resolution to capture comprehensive 3D morphologic and textural features without requiring hemodynamic data[32]. Compared to CECT, our NCCT-based approach provides distinct clinical advantages including significantly reduced radiation exposure and elimination of iodinated contrast-associated risks such as allergic reactions and nephrotoxicity[33,34]. The implementation of semilobar segmentation further enhances model robustness by facilitating complete volumetric assessment of thyroid lobes while minimizing potential biases from multinodular localization and segmentation variability[35]. We observed that certain machine learning algorithms (e.g., RF and K-nearest neighbors) were prone to overfitting, as evidenced by near-perfect training performance (AUC = 1.000) but poor generalization, which we mitigated through a rigorous multi-tiered validation strategy incorporating independent validation, prospective temporal testing, and multicenter external validation cohorts to ensure the selected XGB model maintains strong performance across diverse clinical settings - a critical prerequisite for successful clinical translation.

Several key clinical predictors emerged from the analysis. Malignant nodules were associated with younger patients (median age: 49.0 vs 54.5 years, P < 0.001), consistent with established thyroid cancer epidemiology[36]. The predominance of malignancy in nodules ≤ 10 mm (63% vs 31% > 10 mm in the benign group) may reflect the high prevalence of BRAFV600E mutations in microcarcinomas[37]. Calcify were significantly more frequent in malignant lesions (32% vs 18%, P = 0.028), aligning with the presence of psammoma bodies in PTC[38], while cystic was more common in benign nodules (29% vs 7%, P < 0.001), a pattern indicative of follicular adenoma degeneration[39]. These findings corroborate established sonographic malignancy markers, emphasizing the value of multimodal characterization.

Several important limitations should be acknowledged in this study. Despite employing standardized pre-processing procedures including image resampling, inherent biases persist in the external test set due to variations in CT scanner manufacturers and acquisition protocols across institutions. While synthetic minority oversampling was implemented to address class imbalance in the training cohort, its effectiveness in improving benign nodule prediction may be constrained by the original limited sample size of benign cases, necessitating future validation with expanded benign cohorts. Although we comprehensively evaluated multiple machine learning algorithms, the maximum achieved AUC below 0.9 indicates potential for improvement through larger, more diverse datasets. The manual segmentation approach, while ensuring consistency through rigorous inter-reader ICC verification, remains subject to inherent observer variability and represents a time-intensive process that may hinder clinical scalability. Future research directions will prioritize the development of automated segmentation pipelines leveraging U-Net architectures, coupled with the integration of advanced deep learning models (ResNet, DenseNet, Vision Transformer) within multicenter validation studies to enhance both predictive accuracy and clinical applicability while maintaining rigorous methodological standards.

CONCLUSION

In conclusion, this study developed an NCCT-based XGB model leveraging thyroid lobe segmentation for noninvasive differentiation of benign and malignant thyroid nodules. When coupled with SHAP interpretability, this model holds promise for clinical decision support in preoperative evaluation.

Footnotes

Provenance and peer review: Unsolicited article; Externally peer reviewed.

Peer-review model: Single blind

Specialty type: Radiology, nuclear medicine and medical imaging

Country of origin: China

Peer-review report’s classification

Scientific Quality: Grade A, Grade A, Grade B, Grade B, Grade C

Novelty: Grade A, Grade B, Grade B, Grade B, Grade C

Creativity or Innovation: Grade B, Grade B, Grade B, Grade B, Grade C

Scientific Significance: Grade A, Grade B, Grade B, Grade B, Grade C

P-Reviewer: Liu Y; Tang W; Zeng JQ S-Editor: Bai Y L-Editor: A P-Editor: Wang WB

References
1.  Han B, Zheng R, Zeng H, Wang S, Sun K, Chen R, Li L, Wei W, He J. Cancer incidence and mortality in China, 2022. J Natl Cancer Cent. 2024;4:47-53.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Full Text (PDF)]  [Cited by in Crossref: 61]  [Cited by in RCA: 598]  [Article Influence: 598.0]  [Reference Citation Analysis (0)]
2.  Li X, Jian J, Zhang A, Xiang JM, Huang J, Chen Y. The role of immune cells and immune related genes in the tumor microenvironment of papillary thyroid cancer and their significance for immunotherapy. Sci Rep. 2024;14:18125.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in RCA: 5]  [Reference Citation Analysis (0)]
3.  Kim H, Kwon H, Moon BI. Association of Multifocality With Prognosis of Papillary Thyroid Carcinoma: A Systematic Review and Meta-analysis. JAMA Otolaryngol Head Neck Surg. 2021;147:847-854.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 9]  [Cited by in RCA: 49]  [Article Influence: 12.3]  [Reference Citation Analysis (0)]
4.  Shao L, Wang Z, Dong W, Sun W, Zhang H. Risk factors associated with preferential lateral lymph node metastasis in papillary thyroid carcinoma. Cancer Med. 2023;12:20670-20676.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in RCA: 4]  [Reference Citation Analysis (0)]
5.  Alexander EK, Cibas ES. Diagnosis of thyroid nodules. Lancet Diabetes Endocrinol. 2022;10:533-539.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 10]  [Cited by in RCA: 94]  [Article Influence: 31.3]  [Reference Citation Analysis (0)]
6.  Miao S, Jing M, Sheng R, Cui D, Lu S, Zhang X, Jing S, Zhang X, Shan T, Shan H, Xu T, Wang B, Wang Z, Liu Y. The analysis of differential diagnosis of benign and malignant thyroid nodules based on ultrasound reports. Gland Surg. 2020;9:653-660.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 9]  [Cited by in RCA: 13]  [Article Influence: 2.6]  [Reference Citation Analysis (0)]
7.  Haddad RI, Bischoff L, Ball D, Bernet V, Blomain E, Busaidy NL, Campbell M, Dickson P, Duh QY, Ehya H, Goldner WS, Guo T, Haymart M, Holt S, Hunt JP, Iagaru A, Kandeel F, Lamonica DM, Mandel S, Markovina S, McIver B, Raeburn CD, Rezaee R, Ridge JA, Roth MY, Scheri RP, Shah JP, Sipos JA, Sippel R, Sturgeon C, Wang TN, Wirth LJ, Wong RJ, Yeh M, Cassara CJ, Darlow S. Thyroid Carcinoma, Version 2.2022, NCCN Clinical Practice Guidelines in Oncology. J Natl Compr Canc Netw. 2022;20:925-951.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 2]  [Cited by in RCA: 215]  [Article Influence: 71.7]  [Reference Citation Analysis (0)]
8.  Boers T, Braak SJ, Rikken NET, Versluis M, Manohar S. Ultrasound imaging in thyroid nodule diagnosis, therapy, and follow-up: Current status and future trends. J Clin Ultrasound. 2023;51:1087-1100.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in RCA: 7]  [Reference Citation Analysis (0)]
9.  Chen Z, Wang JJ, Guo DM, Zhai YX, Dai ZZ, Su HH. Combined fine-needle aspiration with core needle biopsy for assessing thyroid nodules: a more valuable diagnostic method? Ultrasonography. 2023;42:314-322.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Full Text (PDF)]  [Cited by in RCA: 3]  [Reference Citation Analysis (0)]
10.  Haugen BR, Alexander EK, Bible KC, Doherty GM, Mandel SJ, Nikiforov YE, Pacini F, Randolph GW, Sawka AM, Schlumberger M, Schuff KG, Sherman SI, Sosa JA, Steward DL, Tuttle RM, Wartofsky L. 2015 American Thyroid Association Management Guidelines for Adult Patients with Thyroid Nodules and Differentiated Thyroid Cancer: The American Thyroid Association Guidelines Task Force on Thyroid Nodules and Differentiated Thyroid Cancer. Thyroid. 2016;26:1-133.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 10769]  [Cited by in RCA: 9516]  [Article Influence: 1057.3]  [Reference Citation Analysis (1)]
11.  Bervini S, Trelle S, Kopp P, Stettler C, Trepp R. Prevalence of Iodine-Induced Hyperthyroidism After Administration of Iodinated Contrast During Radiographic Procedures: A Systematic Review and Meta-Analysis of the Literature. Thyroid. 2021;31:1020-1029.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 6]  [Cited by in RCA: 21]  [Article Influence: 5.3]  [Reference Citation Analysis (0)]
12.  Eloranta S, Boman M. Predictive models for clinical decision making: Deep dives in practical machine learning. J Intern Med. 2022;292:278-295.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Full Text (PDF)]  [Cited by in Crossref: 2]  [Cited by in RCA: 34]  [Article Influence: 11.3]  [Reference Citation Analysis (0)]
13.  Wu G, Jochems A, Refaee T, Ibrahim A, Yan C, Sanduleanu S, Woodruff HC, Lambin P. Structural and functional radiomics for lung cancer. Eur J Nucl Med Mol Imaging. 2021;48:3961-3974.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Full Text (PDF)]  [Cited by in Crossref: 11]  [Cited by in RCA: 72]  [Article Influence: 18.0]  [Reference Citation Analysis (0)]
14.  Ibrahim A, Primakov S, Beuque M, Woodruff HC, Halilaj I, Wu G, Refaee T, Granzier R, Widaatalla Y, Hustinx R, Mottaghy FM, Lambin P. Radiomics for precision medicine: Current challenges, future prospects, and the proposal of a new framework. Methods. 2021;188:20-29.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 62]  [Cited by in RCA: 143]  [Article Influence: 28.6]  [Reference Citation Analysis (0)]
15.  Frix AN, Cousin F, Refaee T, Bottari F, Vaidyanathan A, Desir C, Vos W, Walsh S, Occhipinti M, Lovinfosse P, Leijenaar RTH, Hustinx R, Meunier P, Louis R, Lambin P, Guiot J. Radiomics in Lung Diseases Imaging: State-of-the-Art for Clinicians. J Pers Med. 2021;11:602.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Full Text (PDF)]  [Cited by in Crossref: 10]  [Cited by in RCA: 53]  [Article Influence: 13.3]  [Reference Citation Analysis (0)]
16.  Lee JH, Ha EJ, Kim JH. Application of deep learning to the diagnosis of cervical lymph node metastasis from thyroid cancer with CT. Eur Radiol. 2019;29:5452-5457.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 54]  [Cited by in RCA: 68]  [Article Influence: 11.3]  [Reference Citation Analysis (0)]
17.  Park VY, Lee E, Lee HS, Kim HJ, Yoon J, Son J, Song K, Moon HJ, Yoon JH, Kim GR, Kwak JY. Combining radiomics with ultrasound-based risk stratification systems for thyroid nodules: an approach for improving performance. Eur Radiol. 2021;31:2405-2413.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 9]  [Cited by in RCA: 29]  [Article Influence: 5.8]  [Reference Citation Analysis (0)]
18.  Rehman AU, Ehsan M, Javed H, Ameer MZ, Mohsin A, Aemaz Ur Rehman M, Nawaz A, Amjad Z, Ameer F. Solitary and multiple thyroid nodules as predictors of malignancy: a systematic review and meta-analysis. Thyroid Res. 2022;15:22.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in RCA: 8]  [Reference Citation Analysis (0)]
19.  Kocak B, Baessler B, Bakas S, Cuocolo R, Fedorov A, Maier-Hein L, Mercaldo N, Müller H, Orlhac F, Pinto Dos Santos D, Stanzione A, Ugga L, Zwanenburg A. CheckList for EvaluAtion of Radiomics research (CLEAR): a step-by-step reporting guideline for authors and reviewers endorsed by ESR and EuSoMII. Insights Imaging. 2023;14:75.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 130]  [Cited by in RCA: 212]  [Article Influence: 106.0]  [Reference Citation Analysis (0)]
20.  Tessler FN, Middleton WD, Grant EG. Thyroid Imaging Reporting and Data System (TI-RADS): A User's Guide. Radiology. 2018;287:1082.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 18]  [Cited by in RCA: 19]  [Article Influence: 2.7]  [Reference Citation Analysis (0)]
21.  Xing M, Alzahrani AS, Carson KA, Viola D, Elisei R, Bendlova B, Yip L, Mian C, Vianello F, Tuttle RM, Robenshtok E, Fagin JA, Puxeddu E, Fugazzola L, Czarniecka A, Jarzab B, O'Neill CJ, Sywak MS, Lam AK, Riesco-Eizaguirre G, Santisteban P, Nakayama H, Tufano RP, Pai SI, Zeiger MA, Westra WH, Clark DP, Clifton-Bligh R, Sidransky D, Ladenson PW, Sykorova V. Association between BRAF V600E mutation and mortality in patients with papillary thyroid cancer. JAMA. 2013;309:1493-1501.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 619]  [Cited by in RCA: 703]  [Article Influence: 58.6]  [Reference Citation Analysis (0)]
22.  Rossi ED, Pantanowitz L, Hornick JL. A worldwide journey of thyroid cancer incidence centred on tumour histology. Lancet Diabetes Endocrinol. 2021;9:193-194.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 35]  [Cited by in RCA: 68]  [Article Influence: 17.0]  [Reference Citation Analysis (0)]
23.  Ha EJ, Chung SR, Na DG, Ahn HS, Chung J, Lee JY, Park JS, Yoo RE, Baek JH, Baek SM, Cho SW, Choi YJ, Hahn SY, Jung SL, Kim JH, Kim SK, Kim SJ, Lee CY, Lee HK, Lee JH, Lee YH, Lim HK, Shin JH, Sim JS, Sung JY, Yoon JH, Choi M. 2021 Korean Thyroid Imaging Reporting and Data System and Imaging-Based Management of Thyroid Nodules: Korean Society of Thyroid Radiology Consensus Statement and Recommendations. Korean J Radiol. 2021;22:2094-2123.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Full Text (PDF)]  [Cited by in Crossref: 14]  [Cited by in RCA: 154]  [Article Influence: 38.5]  [Reference Citation Analysis (0)]
24.  Wentland AL, Yamashita R, Kino A, Pandit P, Shen L, Brooke Jeffrey R, Rubin D, Kamaya A. Differentiation of benign from malignant solid renal lesions using CT-based radiomics and machine learning: comparison with radiologist interpretation. Abdom Radiol (NY). 2023;48:642-648.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in RCA: 13]  [Reference Citation Analysis (0)]
25.  Tong Y, Zhang J, Wei Y, Yu J, Zhan W, Xia H, Zhou S, Wang Y, Chang C. Ultrasound-based radiomics analysis for preoperative prediction of central and lateral cervical lymph node metastasis in papillary thyroid carcinoma: a multi-institutional study. BMC Med Imaging. 2022;22:82.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Full Text (PDF)]  [Cited by in Crossref: 1]  [Cited by in RCA: 20]  [Article Influence: 6.7]  [Reference Citation Analysis (0)]
26.  She Y, Zhang L, Zhu H, Dai C, Xie D, Xie H, Zhang W, Zhao L, Zou L, Fei K, Sun X, Chen C. The predictive value of CT-based radiomics in differentiating indolent from invasive lung adenocarcinoma in patients with pulmonary nodules. Eur Radiol. 2018;28:5121-5128.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 62]  [Cited by in RCA: 87]  [Article Influence: 12.4]  [Reference Citation Analysis (0)]
27.  Park VY, Han K, Seong YK, Park MH, Kim EK, Moon HJ, Yoon JH, Kwak JY. Diagnosis of Thyroid Nodules: Performance of a Deep Learning Convolutional Neural Network Model vs. Radiologists. Sci Rep. 2019;9:17843.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Full Text (PDF)]  [Cited by in Crossref: 54]  [Cited by in RCA: 48]  [Article Influence: 8.0]  [Reference Citation Analysis (1)]
28.  Eloyan A, Yue MS, Khachatryan D. Tumor heterogeneity estimation for radiomics in cancer. Stat Med. 2020;39:4704-4723.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 3]  [Cited by in RCA: 26]  [Article Influence: 5.2]  [Reference Citation Analysis (0)]
29.  Dercle L, Henry T, Carré A, Paragios N, Deutsch E, Robert C. Reinventing radiation therapy with machine learning and imaging bio-markers (radiomics): State-of-the-art, challenges and perspectives. Methods. 2021;188:44-60.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 12]  [Cited by in RCA: 27]  [Article Influence: 5.4]  [Reference Citation Analysis (0)]
30.  Kong D, Zhang J, Shan W, Duan S, Guo L. Evaluation of Radiomics Models Based on Computed Tomography for Distinguishing Between Benign and Malignant Thyroid Nodules. J Comput Assist Tomogr. 2022;46:978-985.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in RCA: 8]  [Reference Citation Analysis (0)]
31.  Lin S, Gao M, Yang Z, Yu R, Dai Z, Jiang C, Yao Y, Xu T, Chen J, Huang K, Lin D. CT-Based Radiomics Models for Differentiation of Benign and Malignant Thyroid Nodules: A Multicenter Development and Validation Study. AJR Am J Roentgenol. 2024;223:e2431077.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 2]  [Cited by in RCA: 5]  [Article Influence: 5.0]  [Reference Citation Analysis (0)]
32.  Heo J, Sim Y, Kim BM, Kim DJ, Kim YD, Nam HS, Choi YS, Lee SK, Kim EY, Sohn B. Radiomics using non-contrast CT to predict hemorrhagic transformation risk in stroke patients undergoing revascularization. Eur Radiol. 2024;34:6005-6015.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 1]  [Cited by in RCA: 11]  [Article Influence: 11.0]  [Reference Citation Analysis (0)]
33.  Hamatani K, Eguchi H, Ito R, Mukai M, Takahashi K, Taga M, Imai K, Cologne J, Soda M, Arihiro K, Fujihara M, Abe K, Hayashi T, Nakashima M, Sekine I, Yasui W, Hayashi Y, Nakachi K. RET/PTC rearrangements preferentially occurred in papillary thyroid cancer among atomic bomb survivors exposed to high radiation dose. Cancer Res. 2008;68:7176-7182.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 112]  [Cited by in RCA: 117]  [Article Influence: 6.9]  [Reference Citation Analysis (0)]
34.  Lee SY, Rhee CM, Leung AM, Braverman LE, Brent GA, Pearce EN. A review: Radiographic iodinated contrast media-induced thyroid dysfunction. J Clin Endocrinol Metab. 2015;100:376-383.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 147]  [Cited by in RCA: 130]  [Article Influence: 13.0]  [Reference Citation Analysis (0)]
35.  Lu S, Ren Y, Lu C, Qian X, Liu Y, Zhang J, Shan X, Sun E. Radiomics features from whole thyroid gland tissue for prediction of cervical lymph node metastasis in the patients with papillary thyroid carcinoma. J Cancer Res Clin Oncol. 2023;149:13005-13016.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in RCA: 8]  [Reference Citation Analysis (0)]
36.  Li W, Liang H, Wang W, Liu J, Liu X, Lao S, Liang W, He J. Global cancer statistics for adolescents and young adults: population based study. J Hematol Oncol. 2024;17:99.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Full Text (PDF)]  [Cited by in Crossref: 19]  [Cited by in RCA: 16]  [Article Influence: 16.0]  [Reference Citation Analysis (0)]
37.  Lai HF, Hang JF, Kuo PC, Kuo CS, Yao SF, Chen JY, Lee CH. BRAF V600E Mutation Lacks Association with Poorer Clinical Prognosis in Papillary Thyroid Carcinoma. Ann Surg Oncol. 2024;31:3495-3501.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in Crossref: 3]  [Cited by in RCA: 9]  [Article Influence: 9.0]  [Reference Citation Analysis (0)]
38.  Ye M, Wu S, Zhou Q, Wang F, Chen X, Gong X, Wu W. Association between macrocalcification and papillary thyroid carcinoma and corresponding valuable diagnostic tool: retrospective study. World J Surg Oncol. 2023;21:149.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in RCA: 2]  [Reference Citation Analysis (0)]
39.  Lam H, Saoud C, Shi Q, Wong KS, Cibas ES, Rooper LM, Baloch Z, Ali SZ. Degenerative atypia in benign thyroid nodules: a potential diagnostic pitfall on fine-needle aspiration. J Am Soc Cytopathol. 2023;12:341-350.  [RCA]  [PubMed]  [DOI]  [Full Text]  [Cited by in RCA: 1]  [Reference Citation Analysis (0)]