Published online Feb 7, 2024. doi: 10.3748/wjg.v30.i5.424
Peer-review started: November 21, 2023
First decision: December 5, 2023
Revised: December 19, 2023
Accepted: January 12, 2024
Article in press: January 12, 2024
Published online: February 7, 2024
Processing time: 71 Days and 2.6 Hours
The high rate of early recurrence in hepatocellular carcinoma (HCC) post curative surgical intervention poses a substantial clinical hurdle, impacting patient outcomes and complicating postoperative management. The advent of machine learning provides a unique opportunity to harness vast datasets, identifying subtle patterns and factors that elude conventional prognostic methods. Machine learning models, equipped with the ability to analyse intricate relationships within datasets, have shown promise in predicting outcomes in various medical disciplines. In the context of HCC, the application of machine learning to predict early recurrence holds potential for personalized postoperative care strategies. This editorial comments on the study carried out exploring the merits and efficacy of random survival forests (RSF) in identifying significant risk factors for recurrence, stratifying patients at low and high risk of HCC recurrence and comparing this to traditional COX proportional hazard models (CPH). In doing so, the study demonstrated that the RSF models are superior to traditional CPH models in predicting recurrence of HCC and represent a giant leap towards precision medicine.
Core Tip: This study addresses the crucial issue of early recurrence in hepatocellular carcinoma, emphasizing the significance of aggressive tumour characteristics. random survival forests, a machine learning model, surpasses conventional COX proportional hazard models, offering improved prediction, clinical usefulness, and overall performance. The model's ability to stratify risk facilitates targeted postoperative strategies, showcasing its potential as a guide for personalized patient care.
- Citation: Ravikulan A, Rostami K. Leveraging machine learning for early recurrence prediction in hepatocellular carcinoma: A step towards precision medicine. World J Gastroenterol 2024; 30(5): 424-428
- URL: https://www.wjgnet.com/1007-9327/full/v30/i5/424.htm
- DOI: https://dx.doi.org/10.3748/wjg.v30.i5.424
Developing a reliable pre-operative prediction model for postoperative recurrence of hepatocellular carcinoma (HCC) is essential in guiding individualized treatment and prognostication process of HCC.
In this issue of World Journal of Gastroenterology, Zeng et al[1] endeavour to identify key variables in pre-operative clinical and imaging data using machine learning algorithms to construct multiple risk prediction models for early postoperative recurrence of HCC.
HCC remains a significant challenge in the realm of oncology, particularly due to its propensity for early recurrence following curative resection[2,3]. It is the sixth most common cancer worldwide[4]. Though surgical resection remains the mainstay of curative therapy for HCC, early recurrence of HCC (within 1 year) stands as a substantial barrier to positive patient outcomes[3,5]. Survival rates in early recurrence of HCC can be as low as 25% at 3-5 years post resection[5]. There are no current approved therapeutic regimens available for the recurrence of HCC[6].
This raises a significant need to improve models for the early detection of patients at risk of recurrence. Many factors have been identified in predicting risk of recurrence of HCC[2] and this editorial explores the promising avenue in the quest for precision medicine[1] the development of a machine learning model as highlighted by the authors aimed at predicting early recurrence after surgical intervention[7]. The advent of machine learning provides a unique opportunity to harness vast datasets, identifying subtle patterns and factors that elude conventional prognostic methods[8,9].
Machine learning models, equipped with the ability to analyse intricate relationships within datasets, have shown promise in predicting outcomes in various medical disciplines[8]. In the context of HCC, the application of machine learning to predict early recurrence holds potential for personalized postoperative care strategies[10].
Traditionally, predictive models, such as COX proportional hazard (CPH) models, have been employed, but their limitations have spurred the exploration of innovative methodologies[9-11]. This study undertakes a critical examination, comparing the efficacy of random survival forests (RSF) with CPH models in forecasting early recurrence for HCC patients following curative resection.
Drawing from a comprehensive cohort of 4758 patients across two medical centres, this study utilized 15 key features to construct the RSF model. Features encompassed demographic, clinical, and tumour-specific factors. The RSF model was rigorously evaluated for discrimination, calibration, clinical utility, and overall performance, benchmarked against traditional models.
Out of 5686 patients with HCC undergoing definitive surgical therapy at Eastern Hepatobiliary Surgery Hospital (January 2008 to December 2015), 4376 met inclusion criteria. The study included patients with Child-Pugh A cirrhosis or B7 Liver function, without extrahepatic metastases, and complete resection of macroscopic tumour with histological evidence of tumour free margins. Exclusions (n = 1310) were due to preoperative anticancer treatment, history of other malignancies, palliative surgery, loss to follow up within 2 months of surgery, and perioperative death. The training cohort comprised 3370 patients (January 2008 to December 2013), internal validation cohort 1006 patients (January 2014 to December 2015), and external validation cohort 382 patients from Mengchao Hepatobiliary Hospital of Fujian Medical University.
The RSF model was constructed and used as a regression algorithm with faster training and lower estimation bias. This was achieved by using techniques of random forests such as feature and sample bragging. The model was constructed using fifteen factors including age, gender, aetiology, platelet count, albumin, total bilirubin, alpha-fetoprotein (AFP), tumour size, tumour number, microvascular invasion, macrovascular invasion, Edmondson-Steiner grade, tumour capsular, satellite nodules and liver cirrhosis. As 200 survival trees were built, the prediction error was significantly low and at 500 trees constructed, the variable importance for all 15 features was also generated. Utilizing cut-off values (50th and 85th centiles) from the training cohort's risk index, RSF classified patients into low-risk, intermediate-risk, and high-risk groups, providing valuable insights for postoperative follow-up and adjuvant therapy. Kaplan-Meier analysis validated the stratification in all cohorts (P < 0.0001) (Figure 1).
Model performance was assessed across several methods including model discrimination, model calibration, clinical usefulness and overall performance. In training, internal, and external validation cohorts, RSF outperformed existing models with C-index values of 0.725, 0.762, and 0.747, respectively. Overall performance time-dependent Brier (2 years) were 0.147, 0.129, and 0.156. RSF excelled against five other models that follow CPH. Decision curve analysis affirmed RSF’s superior net benefit over other models (Table 1).
Performance | Cohort | RSF | ERASL | Korean | AJCC TNM | BCLC | Chinese |
Model Discrimination: (Harrell’s C-Index) | Training | 0.725 | 0.706 | 0.658 | 0.674 | 0.635 | 0.684 |
Internal | 0.762 | 0.726 | 0.672 | 0.711 | 0.646 | 0.709 | |
External | 0.747 | 0.727 | 0.722 | 0.711 | 0.658 | 0.696 | |
Overall Performance: Time dependent Brier (2 years) | Training | 0.147 | 0.156 | 0.174 | 0.160 | 0.167 | 0.161 |
Internal | 0.129 | 0.143 | 0.159 | 0.144 | 0.154 | 0.146 | |
External | 0.156 | 0.162 | 0.161 | 0.169 | 0.180 | 0.176 | |
Clinical Usefulness: Net benefit at threshold 50% | Training | 0.166 | 0.154 | 0.093 | 0.139 | 0.137 | 0.137 |
Internal | 0.121 | 0.092 | 0.041 | 0.095 | 0.073 | 0.073 | |
External | 0.206 | 0.190 | 0.222 | 0.185 | 0.154 | 0.154 |
The RSF model, employing 500 survival trees, showcased superior predictive power. Key factors influencing recurrence were tumour size (which was the most significant risk factor for early recurrence), followed by macrovascular invasion, microvascular invasion, tumour number, and AFP levels.
The limitations to this study include selection bias as the cohort of patients largely had liver disease secondary to hepatitis B, which leaves a large space to question the applicability of these outcomes to other aetiologies of liver disease, and indeed, the RSF model did not consider aetiology and liver cirrhosis as important predictors of recurrence. Further studies will need to be conducted with the RSF model using patients of different aetiologies of liver disease to validate its use across different demographics in predicting HCC recurrence and reduce selection bias.
The user-friendly aspect of the web-tool developed, encompasses multiple complex aspects of the predictive model to increase its application in clinical practice.
In conclusion, the RSF model emerges as a beacon in the quest for precision postoperative care in HCC. Its demonstrated superiority over traditional models, coupled with its ability to stratify risk, ushers in a new era of individualized treatment strategies. The future role of artificial intelligence (AI) in evaluating hepatic diseases holds tremendous promise for revolutionizing diagnostic and treatment approaches. AI technologies, particularly machine learning algorithms, can analyse vast amounts of medical data, including imaging studies, laboratory results, and patient histories, to identify patterns and subtle anomalies that may escape the human eye. In hepatic diseases, AI can play a crucial role in early detection, risk assessment, and personalized treatment planning. Advanced imaging techniques, such as magnetic resonance imaging (MRI) and computed tomography (CT) scans, can be enhanced by AI algorithms to provide more accurate and timely diagnoses of liver conditions. Machine learning models can also predict disease progression, helping healthcare professionals tailor interventions based on individual patient profiles.
As we navigate this machine learning odyssey, the RSF model stands poised to redefine the landscape of HCC prognostication and guide clinicians toward more informed and personalized patient care.
In conclusion, the RSF model emerges as a beacon in the quest for precision postoperative care in HCC. Its demonstrated superiority over traditional models, coupled with its ability to stratify risk, ushers in a new era of individualized treatment strategies. The future role of AI in evaluating hepatic diseases holds tremendous promise for revolutionizing diagnostic and treatment approaches. AI technologies, particularly machine learning algorithms, can analyse vast amounts of medical data, including imaging studies, laboratory results, and patient histories, to identify patterns and subtle anomalies that may escape the human eye. In hepatic diseases, AI can play a crucial role in early detection, risk assessment, and personalized treatment planning. Advanced imaging techniques, such as MRI and CT scans, can be enhanced by AI algorithms to provide more accurate and timely diagnoses of liver conditions. Machine learning models can also predict disease progression, helping healthcare professionals tailor interventions based on individual patient profiles.
As we navigate this machine learning odyssey, the RSF model stands poised to redefine the landscape of HCC prognostication and guide clinicians toward more informed and personalized patient care.
Provenance and peer review: Invited article; Externally peer reviewed.
Peer-review model: Single blind
Specialty type: Gastroenterology and hepatology
Country/Territory of origin: New Zealand
Peer-review report’s scientific quality classification
Grade A (Excellent): 0
Grade B (Very good): B, B
Grade C (Good): 0
Grade D (Fair): 0
Grade E (Poor): 0
P-Reviewer: Lu GR, China; Morya AK, India S-Editor: Qu XL L-Editor: A P-Editor: Yu HG
1. | Zeng J, Zeng J, Lin K, Lin H, Wu Q, Guo P, Zhou W, Liu J. Development of a machine learning model to predict early recurrence for hepatocellular carcinoma after curative resection. Hepatobiliary Surg Nutr. 2022;11:176-187. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 8] [Cited by in F6Publishing: 16] [Article Influence: 8.0] [Reference Citation Analysis (0)] |
2. | Nevola R, Ruocco R, Criscuolo L, Villani A, Alfano M, Beccia D, Imbriani S, Claar E, Cozzolino D, Sasso FC, Marrone A, Adinolfi LE, Rinaldi L. Predictors of early and late hepatocellular carcinoma recurrence. World J Gastroenterol. 2023;29:1243-1260. [PubMed] [DOI] [Cited in This Article: ] [Cited by in CrossRef: 3] [Cited by in F6Publishing: 51] [Article Influence: 51.0] [Reference Citation Analysis (3)] |
3. | Poon RT, Fan ST, Lo CM, Liu CL, Wong J. Long-term survival and pattern of recurrence after resection of small hepatocellular carcinoma in patients with preserved liver function: implications for a strategy of salvage transplantation. Ann Surg. 2002;235:373-382. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 647] [Cited by in F6Publishing: 666] [Article Influence: 30.3] [Reference Citation Analysis (0)] |
4. | Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, Bray F. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J Clin. 2021;71:209-249. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 50630] [Cited by in F6Publishing: 53420] [Article Influence: 17806.7] [Reference Citation Analysis (123)] |
5. | Portolani N, Coniglio A, Ghidoni S, Giovanelli M, Benetti A, Tiberio GA, Giulini SM. Early and late recurrence after liver resection for hepatocellular carcinoma: prognostic and therapeutic implications. Ann Surg. 2006;243:229-235. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 558] [Cited by in F6Publishing: 690] [Article Influence: 38.3] [Reference Citation Analysis (0)] |
6. | Gao YX, Ning QQ, Yang PX, Guan YY, Liu PX, Liu ML, Qiao LX, Guo XH, Yang TW, Chen DX. Recent advances in recurrent hepatocellular carcinoma therapy. World J Hepatol. 2023;15:460-476. [PubMed] [DOI] [Cited in This Article: ] [Cited by in F6Publishing: 3] [Reference Citation Analysis (0)] |
7. | Wang D, Xiao M, Wan ZM, Lin X, Li QY, Zheng SS. Surgical treatment for recurrent hepatocellular carcinoma: Current status and challenges. World J Gastrointest Surg. 2023;15:544-552. [PubMed] [DOI] [Cited in This Article: ] [Cited by in CrossRef: 1] [Cited by in F6Publishing: 1] [Article Influence: 1.0] [Reference Citation Analysis (0)] |
8. | Ngiam KY, Khor IW. Big data and machine learning algorithms for health-care delivery. Lancet Oncol. 2019;20:e262-e273. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 371] [Cited by in F6Publishing: 515] [Article Influence: 128.8] [Reference Citation Analysis (0)] |
9. | Christou CD, Tsoulfas G. Challenges and opportunities in the application of artificial intelligence in gastroenterology and hepatology. World J Gastroenterol. 2021;27:6191-6223. [PubMed] [DOI] [Cited in This Article: ] [Cited by in CrossRef: 17] [Cited by in F6Publishing: 17] [Article Influence: 5.7] [Reference Citation Analysis (7)] |
10. | Singal AG, Mukherjee A, Elmunzer BJ, Higgins PD, Lok AS, Zhu J, Marrero JA, Waljee AK. Machine learning algorithms outperform conventional regression models in predicting development of hepatocellular carcinoma. Am J Gastroenterol. 2013;108:1723-1730. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 205] [Cited by in F6Publishing: 177] [Article Influence: 16.1] [Reference Citation Analysis (1)] |
11. | Pickett KL, Suresh K, Campbell KR, Davis S, Juarez-Colunga E. Random survival forests for dynamic predictions of a time-to-event outcome using a longitudinal biomarker. BMC Med Res Methodol. 2021;21:216. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 21] [Cited by in F6Publishing: 11] [Article Influence: 3.7] [Reference Citation Analysis (0)] |