Published online Mar 7, 2025. doi: 10.3748/wjg.v31.i9.101383
Revised: December 2, 2024
Accepted: January 8, 2025
Published online: March 7, 2025
Processing time: 158 Days and 22.6 Hours
The global prevalence of non-alcoholic steatohepatitis (NASH) and its associated risk of adverse outcomes, particularly in patients with advanced liver fibrosis, underscores the importance of early and accurate diagnosis.
To develop a machine learning-based diagnostic model for advanced liver fibrosis in NASH patients.
A total of 749 patients who underwent liver biopsy at Beijing Ditan Hospital, Capital Medical University, between January 2010 and January 2020 were in
The Extreme Gradient Boosting (XGBoost) model outperformed all other machine learning models, achieving an AUROC of 0.934 (95%CI: 0.914-0.955) in the training cohort and 0.917 (95%CI: 0.880-0.953) in the validation cohort (P < 0.001). Incorporating liver stiffness measurement into the model further improved its performance, with an AUROC of 0.977 (95%CI: 0.966-0.980) in the training cohort and 0.970 (95%CI: 0.950-0.990) in the validation cohort, significantly surpassing APRI and FIB-4 scores (P < 0.001). The XGBoost model also demonstrated superior clinical utility, as evidenced by DCA and calibration curve analysis in both cohorts.
The XGBoost model provides a highly accurate, non-invasive diagnosis of advanced liver fibrosis in NASH patients, outperforming traditional methods. An online tool based on this model has been developed to assist clinicians in evaluating the risk of advanced liver fibrosis.
Core Tip: This study employed Shapley Additive Explanations (SHAP) to select key features for diagnosing advanced liver fibrosis in non-alcoholic steatohepatitis patients. Among five machine learning models, the Extreme Gradient Boosting model achieved the best performance and was further developed into an online diagnostic tool. SHAP was also used to provide local explanations, clarifying its applicability across clinical populations.