Published online Feb 7, 2024. doi: 10.3748/wjg.v30.i5.450
Peer-review started: October 16, 2023
First decision: December 6, 2023
Revised: December 19, 2023
Accepted: January 12, 2024
Article in press: January 12, 2024
Published online: February 7, 2024
Processing time: 106 Days and 20.3 Hours
Colorectal cancer (CRC) is a serious threat worldwide. Although early screening is suggested to be the most effective method to prevent and control CRC, the current situation of early screening for CRC is still not optimistic. In China, the incidence of CRC in the Yangtze River Delta region is increasing dramatically, but few studies have been conducted. Therefore, it is necessary to develop a simple and efficient early screening model for CRC.
To develop and validate an early-screening nomogram model to identify individuals at high risk of CRC.
Data of 64448 participants obtained from Ningbo Hospital, China between 2014 and 2017 were retrospectively analyzed. The cohort comprised 64448 individuals, of which, 530 were excluded due to missing or incorrect data. Of 63918, 7607 (11.9%) individuals were considered to be high risk for CRC, and 56311 (88.1%) were not. The participants were randomly allocated to a training set (44743) or validation set (19175). The discriminatory ability, predictive accuracy, and clinical utility of the model were evaluated by constructing and analyzing receiver operating characteristic (ROC) curves and calibration curves and by decision curve analysis. Finally, the model was validated internally using a bootstrap resampling technique.
Seven variables, including demographic, lifestyle, and family history information, were examined. Multifactorial logistic regression analysis revealed that age [odds ratio (OR): 1.03, 95% confidence interval (CI): 1.02-1.03, P < 0.001], body mass index (BMI) (OR: 1.07, 95%CI: 1.06-1.08, P < 0.001), waist circumference (WC) (OR: 1.03, 95%CI: 1.02-1.03 P < 0.001), lifestyle (OR: 0.45, 95%CI: 0.42-0.48, P < 0.001), and family history (OR: 4.28, 95%CI: 4.04-4.54, P < 0.001) were the most significant predictors of high-risk CRC. Healthy lifestyle was a protective factor, whereas family history was the most significant risk factor. The area under the curve was 0.734 (95%CI: 0.723-0.745) for the final validation set ROC curve and 0.735 (95%CI: 0.728-0.742) for the training set ROC curve. The calibration curve demonstrated a high correlation between the CRC high-risk population predicted by the nomogram model and the actual CRC high-risk population.
The early-screening nomogram model for CRC prediction in high-risk populations developed in this study based on age, BMI, WC, lifestyle, and family history exhibited high accuracy.
Core Tip: This was the first large-scale study to investigate early screening for detection of colorectal cancer (CRC) in Ningbo, China, which was part of the national early screening CRC program. The study focused on collecting information on the general population who attended annual health checks. Our findings showed that the area under the curve was 0.734 for the final validation set receiver operating characteristic (ROC) curve and 0.735 for the training set ROC curve. Therefore, we developed an early screening model with high accuracy for CRC.