Published online Feb 15, 2025. doi: 10.4251/wjgo.v17.i2.101379
Revised: November 4, 2024
Accepted: November 22, 2024
Published online: February 15, 2025
Processing time: 127 Days and 22.8 Hours
In this article, we comment on the article published by Yu et al. By employing LASSO regression and Cox proportional hazard models, the article identified nine significant variables affecting survival, including body mass index, Karnofsky performance status, and tumor-node-metastasis staging. We firmly concur with Yu et al regarding the vital significance of clinical prediction models (CPMs), including logistic regression and Cox regression for assessment in esophageal carcinoma (EC). However, the nomogram's limitations and the complexities of integrating genetic factors pose challenges. The integration of immunological data with advanced statistics offers new research directions. High-throughput sequencing and big data, facilitated by machine learning, have revolutionized cancer research but require substantial computational resources. The future of CPMs in EC depends on leveraging these technologies to improve predictive accuracy and clinical application, addressing the need for larger datasets, patient-reported outcomes, and regular updates for clinical relevance.
Core Tip: Clinical prediction model has great development space and practical value in the medical field. Despite significant efforts to explore the prognosis of esophageal carcinoma, current prognostic models remain imperfect. Traditional predictive models, such as Cox proportional hazards regression and logistic regression, are widely used but often lack effective evaluation mechanisms to determine their optimal performance. Moreover, due to limitations in sample size and predictive factors, the reproducibility of these models is poor, which severely restricts their broad application in clinical practice. Therefore, it is necessary to further explore and select more appropriate analytical methods to construct more accurate and reliable predictive models, thereby better serving clinical needs.
- Citation: Chen J, Xing QC. Advancements and challenges in esophageal carcinoma prognostic models: A comprehensive review and future directions. World J Gastrointest Oncol 2025; 17(2): 101379
- URL: https://www.wjgnet.com/1948-5204/full/v17/i2/101379.htm
- DOI: https://dx.doi.org/10.4251/wjgo.v17.i2.101379
Esophageal carcinoma (EC) is one of the predominant malignant tumors in the world[1]. While surgical techniques have improved and multimodal therapies have been incorporated, the prognosis for EC remains unsatisfactory[2]. Clinical prediction models (CPMs), also known as clinical prediction rules, risk prediction models, predictive models, or risk scores, are statistical tools that utilize mathematical formulas to predict the probability of a subject having a certain disease or experiencing a specific outcome in the future[3,4]. These models encompass parametric, semi-parametric, or non-parametric approaches. Currently, the primary models that are widely used in clinical settings are logistic regression and Cox regression, although machine-learning algorithms are also being increasingly applied in the construction of CPMs[5]. These models play a crucial role in clinical decision-making by providing quantified risk assessment, which helps strategize disease prevention, diagnosis, and treatment strategies[6]. This letter delves into the current landscape of prognostic models for EC, highlighting key insights and pinpointing promising avenues for future investigative efforts.
Several studies are available on prognostic prediction models for EC both domestically and internationally, and they can mainly categorized into four types: Clinical pathological factor models, tissue immunology models, gene molecular models and databases, and machine algorithm models[7]. The tumor staging system of the American Joint Committee on Cancer and the Union for International Cancer Control commonly used in clinical practice, as well as the improved staging scheme based on them, belong to the clinical pathological factor models from among the abovementioned models. The tumor-node-metastasis staging system falls short of providing an accurate survival prognosis for EC patients owing to its omission of clinical traits, treatment options, and laboratory data.
Based on the complex interaction between cancer and the host immune response, immune molecules may be the key factor in determining the prognosis. In 2018, Jiang et al[8] pioneered the development of a gastric cancer prognosis prediction model that leverages tissue immunology indicators. This groundbreaking study utilized data from 879 patients and, notably, employed immunohistochemistry to assess the expression and survival data of 27 immune characteristics across 251 specimens. The resulting prediction model was expertly crafted using a LASSO-Cox regression model, with a focus on five pivotal immune markers. This approach not only expanded the horizons of CPMs but also underscored the potential of integrating advanced statistical algorithms with immunological data to enhance the prognostic accuracy in cancer research. Although there are only a few studies on the prediction of the prognosis of EC by tissue immunology indexes, it provides a new direction for future research.
With the advent of the post-genomic era, the new generation of high-throughput sequencing technologies in transcriptomics, such as gene chips, RNA sequencing, and other technologies, have been widely applied in the field of medical research with some progress[9]. Recent studies[10,11] have shown that the gene expression profiles can be applied in the identification, staging and guiding of treatment and intervention approaches for a variety of cancers. The prognosis of EC is also related to gene molecules and results in differential prognosis. Despite the significant strides made in genetic research, the integration of factors such as CDK8, ARID1A, and certain autophagy-related long non-coding RNAs into prognostic models is challenged by the high costs and complexity associated with genetic testing[10].
With the increasing prevalence of computer technology and the internet, we have entered the big data era, which is distinguished from the traditional clinical case collection based on its sheer volume, diversity, velocity, and value. In recent years, extensive genomic sequencing initiatives like The Cancer Genome Atlas and the International Cancer Genome Consortium have facilitated the analysis of vast datasets, leading to the discovery of the underlying processes involved in the tumor formation, progression, and prognosis[12]. This aspect has provided valuable insights into the development of personalized diagnostic and therapeutic strategies for cancer patients. However, the vast amount of available data demands substantial processing; this challenge has been effectively addressed by the advent of machine-learning algorithms[13]. In a study by Abuhelwa et al[14], a machine-learning algorithm was created to forecast survival rates in urothelial cancer patients undergoing atezolizumab therapy. The research indicated that the Gradient Boosting Machine model demonstrated superior predictive accuracy compared to alternative models, including CoxBoost, random forests, and generalized linear models, in estimating patient survival outcomes. This study employed LASSO regression to select variables, which is a powerful variable selection tool. However, the criteria for variable selection, including the selection process and reasons for penalty coefficient λ, warrants further elaboration. In addition, it is necessary to consider comparing the results of the LASSO Cox model with other variable selection methods (such as forward selection, backward elimination or AIC/BIC standard) to evaluate the robustness of the model. Although the training set and validation set were used to evaluate the performance of the model, the further utilization of k-fold cross-validation is expected to enhance the reliability of model evaluation.
The development and application of these models face challenges such as the need for larger and more diverse datasets, the incorporation of patient-reported outcomes, and the requirement for continuous updating to maintain their accuracy and relevance in clinical practice. The future of CPMs lies in leveraging big data, advanced analytics, and multidisciplinary approaches to enhance their predictive capabilities and clinical utility.
1. | Abnet CC, Arnold M, Wei WQ. Epidemiology of Esophageal Squamous Cell Carcinoma. Gastroenterology. 2018;154:360-373. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 935] [Cited by in F6Publishing: 1060] [Article Influence: 151.4] [Reference Citation Analysis (1)] |
2. | Baba Y, Yoshida N, Kinoshita K, Iwatsuki M, Yamashita YI, Chikamoto A, Watanabe M, Baba H. Clinical and Prognostic Features of Patients With Esophageal Cancer and Multiple Primary Cancers: A Retrospective Single-institution Study. Ann Surg. 2018;267:478-483. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 33] [Cited by in F6Publishing: 74] [Article Influence: 12.3] [Reference Citation Analysis (1)] |
3. | Ranstam J, Cook JA, Collins GS. Clinical prediction models. Br J Surg. 2016;103:1886. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 9] [Cited by in F6Publishing: 24] [Article Influence: 3.0] [Reference Citation Analysis (0)] |
4. | Yu SS, Zheng X, Li XS, Xu QJ, Zhang W, Liao ZL, Lei HK. Development of a nomogram for overall survival in patients with esophageal carcinoma: A prospective cohort study in China. World J Gastrointest Oncol. 2025;17:96686. [PubMed] [DOI] [Cited in This Article: ] [Reference Citation Analysis (4)] |
5. | Bi WL, Hosny A, Schabath MB, Giger ML, Birkbak NJ, Mehrtash A, Allison T, Arnaout O, Abbosh C, Dunn IF, Mak RH, Tamimi RM, Tempany CM, Swanton C, Hoffmann U, Schwartz LH, Gillies RJ, Huang RY, Aerts HJWL. Artificial intelligence in cancer imaging: Clinical challenges and applications. CA Cancer J Clin. 2019;69:127-157. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 848] [Cited by in F6Publishing: 706] [Article Influence: 117.7] [Reference Citation Analysis (3)] |
6. | Min BH, Yang JW, Min YW, Baek SY, Kim S, Kim HK, Choi YS, Shim YM, Choi YL, Zo JI. Nomogram for prediction of lymph node metastasis in patients with superficial esophageal squamous cell carcinoma. J Gastroenterol Hepatol. 2020;35:1009-1015. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 16] [Cited by in F6Publishing: 25] [Article Influence: 5.0] [Reference Citation Analysis (0)] |
7. | van den Boorn HG, Engelhardt EG, van Kleef J, Sprangers MAG, van Oijen MGH, Abu-Hanna A, Zwinderman AH, Coupé VMH, van Laarhoven HWM. Prediction models for patients with esophageal or gastric cancer: A systematic review and meta-analysis. PLoS One. 2018;13:e0192310. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 22] [Cited by in F6Publishing: 39] [Article Influence: 5.6] [Reference Citation Analysis (0)] |
8. | Jiang Y, Zhang Q, Hu Y, Li T, Yu J, Zhao L, Ye G, Deng H, Mou T, Cai S, Zhou Z, Liu H, Chen G, Li G, Qi X. ImmunoScore Signature: A Prognostic and Predictive Tool in Gastric Cancer. Ann Surg. 2018;267:504-513. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 236] [Cited by in F6Publishing: 333] [Article Influence: 55.5] [Reference Citation Analysis (0)] |
9. | Bawa P, Zackaria S, Verma M, Gupta S, Srivatsan R, Chaudhary B, Srinivasan S. Integrative Analysis of Normal Long Intergenic Non-Coding RNAs in Prostate Cancer. PLoS One. 2015;10:e0122143. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 25] [Cited by in F6Publishing: 26] [Article Influence: 2.6] [Reference Citation Analysis (0)] |
10. | Deng J, Weng X, Chen W, Zhang J, Ma L, Zhao K. A nomogram and risk classification model predicts prognosis in Chinese esophageal squamous cell carcinoma patients. Transl Cancer Res. 2022;11:3128-3140. [PubMed] [DOI] [Cited in This Article: ] [Reference Citation Analysis (1)] |
11. | Zhao X, Wang Y, Meng F, Liu Z, Xu B. Risk Stratification and Validation of Eleven Autophagy-Related lncRNAs for Esophageal Squamous Cell Carcinoma. Front Genet. 2022;13:894990. [PubMed] [DOI] [Cited in This Article: ] [Reference Citation Analysis (1)] |
12. | Wu WT, Li YJ, Feng AZ, Li L, Huang T, Xu AD, Lyu J. Data mining in clinical big data: the frequently used databases, steps, and methodological models. Mil Med Res. 2021;8:44. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 80] [Cited by in F6Publishing: 198] [Article Influence: 49.5] [Reference Citation Analysis (0)] |
13. | Li MX, Sun XM, Cheng WG, Ruan HJ, Liu K, Chen P, Xu HJ, Gao SG, Feng XS, Qi YJ. Using a machine learning approach to identify key prognostic molecules for esophageal squamous cell carcinoma. BMC Cancer. 2021;21:906. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 4] [Cited by in F6Publishing: 4] [Article Influence: 1.0] [Reference Citation Analysis (0)] |
14. | Abuhelwa AY, Kichenadasse G, McKinnon RA, Rowland A, Hopkins AM, Sorich MJ. Machine Learning for Prediction of Survival Outcomes with Immune-Checkpoint Inhibitors in Urothelial Cancer. Cancers (Basel). 2021;13:2001. [PubMed] [DOI] [Cited in This Article: ] [Cited by in Crossref: 6] [Cited by in F6Publishing: 7] [Article Influence: 1.8] [Reference Citation Analysis (0)] |