
Establishment and validation of a clinical prediction model for colorectal adenoma risk factors
- Authors:
- Published online on: May 2, 2025 https://doi.org/10.3892/ol.2025.15068
- Article Number: 322
-
Copyright: © Li et al. This is an open access article distributed under the terms of Creative Commons Attribution License.
Abstract
Introduction
According to a global cancer study performed by the International Agency for Research on Cancer in 2018, colorectal cancer (CRC) ranks fourth and fifth in terms of new cases and fatalities, respectively, among all malignant tumors (1). Furthermore, China has one of the highest annual rates of new cases and deaths from CRC and there are ~376,000 new cases and ~191,000 deaths each year (2). The adenoma-adenocarcinoma pathway serves a notable role in the pathophysiology of CRC, with adenoma malignancy accounting for 50–80% of CRC cases (3). Although timely resection of early colorectal lesions can improve survival rate, most patients are diagnosed in the middle or late stages. Even after treatment, the 5-year survival rate remains very low at <50% (4,5). Early detection and treatment are critical for colorectal adenoma, the most common precancerous lesion in CRC.
The gold standard for diagnosing colorectal adenoma is an electronic colonoscopy (e-colonoscopy) examination. However, patients find this procedure less acceptable than others as it is invasive and requires intestinal clearance beforehand. As a result, e-colonoscopy is not part of the general medical examination routine (6,7). In China, early CRC screening primarily involves a fecal occult blood test, tumor marker detection and a related symptom questionnaire. Following this, individuals identified as high-risk are referred for a colonoscopy for further examination (8). Several studies have been performed in China and other counties to identify risk factors for colorectal adenoma, and these studies reported that several factors, including genetics, age, metabolic syndrome and non-alcoholic fatty liver disease, influence the incidence of CRC (9,10). However, the findings from these several studies are incongruent. Therefore, the present study aimed to gather relevant clinical data, create a prediction model through statistical analyses and test the efficacy of the model to provide data support and an evidence-based medical foundation for future studies. The findings of the present could increase the diagnosis rate of colorectal adenoma and achieve positive secondary prevention of CRC by facilitating earlier screening of high-risk populations for colorectal adenoma.
Materials and methods
Patients and study design
The present study was a retrospective case-control study, approved by the Ethics Committee of the Affiliated Hospital of Shandong University of Chinese Medicine [Jinan, China; approval no. (2024) Ethics Review No. (017)-YJS]. The study only involved the collection of case data using anonymized or de-identified information that cannot be traced back to individuals and poses no risk to patients. Therefore, the need for informed patient consent was waived by the Ethics Committee. Data recorded from patients receiving e-colonoscopy at the Department of Proctology of The Affiliated Hospital of Shandong University of Traditional Chinese Medicine from January 2013 to December 2023 were included. The inclusion criteria were as follows: i) Age of >18 years old; ii) presence of polyps identified using colonoscopy and precise pathological report results; iii) detailed clinical and laboratory data; and iv) no colorectal resection performed. The exclusion criteria were as follows: i) Diagnosis of inflammatory bowel disease, intestinal tuberculosis, CRC or schistosomiasis; ii) a history of gastrointestinal surgery; iii) diagnosis of an autoimmune disease or serious underlying disease, such as systemic lupus erythematosus and severe heart disease; and iv) insufficient basic information. A total of 730 patients were included in analysis and randomly divided into training and validation cohorts in a 7:3 ratio.
Data collection
The following data was collected: i) Background information such sex, age, body mass index, the daily number of bowel movements, stool texture, sleep status, as well as the medical history of encephalopathy, lung disease, hypertension, diabetes, liver disease, smoking and drinking and family history of polyps or tumors of the digestive system; ii) results from blood routine examinations, such as the level of red blood cells, white blood cells and neutrophils, platelet count (PLT), hemoglobin levels and thrombin time (TT); iii) biochemical indexes, such as alanine aminotransferase (ALT), aspartate aminotransferase (AST), γ-glutamyl transpeptidase, AST/ALT ratio, albumin/globulin ratio, total bilirubin, total bile acid, creatinine (Cr), uric acid (UA), glucose (GLU), total cholesterol, triglycerides, high-density lipoprotein (HDL), low density lipoprotein, amylase, series of tumor carcinoembryonic antigen tests, α-fetoprotein, ferroprotein; and iv) the number and pathological type of polyps.
Pathological diagnosis
A total of two experienced pathologists diagnosed the patients according to the World Health Organization classification of gastrointestinal neoplasms (diagnostic criteria of the 2019 edition) (11,12). If agreement on the diagnosis could not be achieved, a third pathologist assisted in making a final decision.
Basis of grouping
The present study divided the patients into two groups: i) The adenoma group, which included patients with histological examinations identified as adenomatous polyps; and ii) the control group, which included patients with non-adenomatous polyps, such as inflammatory or hyperplastic polyps, or those with no apparent abnormalities on endoscopic examination.
Statistical analyses
Establishment of the prediction modelA collinearity test was applied to the data. If multicollinearity was confirmed, the Least Absolute Shrinkage and Selection Operator cross-validation method was applied to filter the variables (13). In the absence of multicollinearity, data were screened using a single-factor analysis. Before performing the single-factor analysis, the Kolmogorov-Smirnov test was applied to continuous variables to determine the normality of the data distribution. Continuous variables with normal distributions are represented as mean ± standard deviation and were analyzed using the unpaired Student's t-test or a nonparametric Mann-Whitney U test. The χ2 test was used to analyze categorical variables. Finally, the selected variables were incorporated in the multivariate logistic regression analysis and the prediction model was built based on the results by plotting a nomogram.
Validation of the model
The C-index, equivalent to the area under the receiver operating characteristic (ROC) curve, estimated the probability of uniformity between the predicted and actual results (14). The model was considered to have no discrimination ability when the C-index was 0.5, whereas it possessed full discrimination ability when the C-index was 1. The calibration diagram visually depicts the relationship between the predicted and actual probabilities. The closer the prediction probability is to the 45° diagonal, the higher the calibration degree. The sensitivity, specificity and efficacy of the predictive models were comprehensively evaluated by plotting calibration curves, ROC curves and decision curve analysis (DCA) graphs. The present analysis consisted of two parts: Internal and external validations. For the internal validation, the Bootstrap method was used to validate the model and draw calibration curves. For the external validation, the logistic regression formula obtained in the training cohort was applied to patients in the validation cohort for logistic regression analysis. The C-index of the validation set was then calculated and the calibration curve was plotted. The calibration degree was then confirmed using the Hosmer-Lemeshow test. ROC and DCA curves were employed for both internal and external validation to assess the accuracy and clinical utility of the model. R studio (version 4.1.2; http://www.r-project.org/) and IBM SPSS Statistics (version 26.0: IBM Corp.) were used for statistical analysis. The statistical significance levels reported are two-sided and P<0.05 was considered to indicate a statistically significant difference.
Reporting standard
The present study was reported in strict accordance with the requirements in the STROBE Guidelines checklist (15).
Results
Essential characteristics of the training and validation cohorts
A total of 730 cases were included based on the inclusion and exclusion criteria, with 286 assigned to the case group and 444 assigned to the control group. In a 7:3 ratio, patients were randomly divided into the training set (n=511) and the validation cohort (n=219). Table I presents the 40 basic characteristics (39 independent variables and 1 dependent variable) of the two sets.
Selection of variables and the establishment of prediction model
Single-factor analysisThe Variance Inflation Factor (VIF) is a measure of the severity of multicollinearity in a multiple linear regression model. The absence of covariance is indicated when the VIF value is <10. As presented in Table II, the VIF values of all the variables included in the analysis were <10, indicating that there was no multicollinearity in the data. Therefore, a single-factor analysis was used to screen the variables. Continuous variables did not conform to a normal distribution, so the Mann-Whitney U test was applied, and categorical variables were analyzed using the χ2 test. The analysis revealed a significant difference (P<0.05) between the adenoma and control groups for 12/39 variables, including for sex, age, the daily number of bowel movements, history of hypertension, history of diabetes, PLT, TT, Cr, UA, GLU, HDL and the number of polyps (Table III).
Single- and multi-factor logistic regression analyses
The univariate logistic regression analysis demonstrated that sex, age, the daily number of bowel movements, history of hypertension, history of diabetes, PLT, TT, Cr, UA, HDL and the number of polyps were significantly associated with the likelihood of colorectal adenomas (P<0.05). Colorectal adenomas were more common in men than in women and in those who had an abnormal daily number of bowel movements, a history of hypertension and diabetes and in those with ≥3 polyps. Furthermore, Cr, UA, TT and age were positively associated with the likelihood of colorectal adenomas, whereas PLT and HDL were negatively associated (Table IV).
![]() | Table IV.Results of the univariate and multifactor logistic regression models for the probability of colorectal adenoma. |
The results of the multifactorial logistic regression analysis demonstrated a significant association between age, the daily number of bowel movements, TT and the number of polyps, with the likelihood of colorectal adenomas (P<0.05; Table IV). This suggests that the four aforementioned factors are independent risk factors for colorectal adenoma when >1 factor is evaluated at the same time.
After several rounds of cascading statistical methods, the range of variables initially included was progressively narrowed. Eventually only age, the daily number of bowel movements, TT and the number of polyps were included in the nomogram to build the prediction model (Fig. 1). Different positions on each line segment represent specific ranges of values, whilst the length reflects the extent of their influence on the results. The nomogram consists of the variable name, score and probability. Depending on the location of a variable, the score for that criterion is determined by the value at the top of the chart. The scores of all metrics are summed to give the total score, and finally, the probability of occurrence of each outcome event is further derived through the function transformation relationship (16).
Validation of predictive models
Internal validationThe C-index in the training cohort was 0.7054 [95% confidence interval (CI), 0.6596–0.7512]. The calibration curve was plotted using the Bootstrap method with n=1,000 (Fig. 2). The prediction probability curve was very close to the 45° diagonal, indicating that the model had a high level of predictability. The ROC curve is a tool used to evaluate the performance of a binary classification model by plotting the relationship between True Positive Rate and False Positive Rate, to reflect the performance of the classifier. The closer the ROC curve is to the point (1,0), the more effective the generalization performance of the model is. Fig. 3 displays the ROC curve for the training set, indicating that the model demonstrated a strong performance. By contrast, DCA is more focused on the practical application of clinical decision-making, integrating patient or decision-maker preferences into the analysis and considering the clinical utility of a particular model. It quantifies the balance between the benefits of using the model and the potential drawbacks. As such, the core significance of the DCA curve is to translate the statistical performance of a predictive model into the actual value of clinical decision-making, and it is an important tool for assessing the diagnostic accuracy and clinical utility of predictive models. Traditional metrics (such as area under the curve, sensitivity and specificity) do not directly reflect the clinical value of the model, and DCA fills this gap by providing a more intuitive decision support tool (17–19). In DCA, the horizontal coordinate is the threshold probability and the vertical coordinate is the net benefit (NB). The greater the NB value of the DCA curve is above the extreme curve in the range of the horizontal coordinates, the more improved the clinical application. The DCA curve of the training cohort in the present study is presented in Fig. 4, which reveals that the model had a good clinical application value.
External validation
The logistic regression formula derived from the training cohort was applied to the validation cohort, yielding a C-index of 0.6306 (95% CI, 0.5560–0.7053). Furthermore, the calibration curves also demonstrated good predictive power (Fig. 5). The Hosmer-Lemeshow test demonstrated that P=0.28 and χ2=9.7893 with P<0.05, indicating that the model fit was good. The ROC and DCA curves of the validation cohort are presented in Figs. 6 and 7, demonstrating that this model also showed a good performance and clinical application value in the validation cohort.
Discussion
The present study applied appropriate statistical methods to analyze relevant clinical data and develop a predictive model of risk factors for colorectal adenoma. The line graph, also called nomogram, is widely used in clinical practice to assess the risk of disease occurrence and prognosis (20,21). The core principle is to integrate multiple predictors based on the results of regression analysis, use the regression model to measure the degree of influence of different variables on the outcome events and transform this information into intuitive and easy-to-understand lines with scale markings. Nomograms transform complex regression equations into visual graphs, making regression analysis results and predictive model profiles more readable to facilitate fast and efficient assessment of clinical data. It is due to this intuitive and efficient advantage that it has gradually gained more attention and application in medical research and clinical practice as well (21). To date, several clinical researchers have used the method of predictive modeling with nomograms to present their data analysis results, including certain high-level studies published in high-impact factor journals, including studies by Gafita et al (22), Liu et al (23) and Pietrantonio et al (24). These high-level nomogram studies similarly construct a model through multifactorial logistic regression analysis and then assess model performance through ROC curves, calibration curves, as well as clinical applicability through DCA curves. However, they tend to use large samples or multi-center clinical data to enhance the accuracy of the results. By contrast to the aforementioned studies, a broader range of variables was included in the present study. In addition to the conventional variables reported in previous research, additional factors that may be related to colorectal adenomas were comprehensively considered. For the first time to the best our knowledge, three new variables, sleep status, history of encephalopathy and lung disease, were introduced.
The link between sleep status and colorectal adenomas has been previously investigated. In a clinical case-control study, a marked association was reported between patients with colorectal adenomas and their corresponding sleep duration, with an average sleep duration of <6 h per night increasing the risk of adenomas by ~50% (25). Sleep deprivation increases the expression of TNF-α, triggering a localized inflammatory response. Melatonin, which is closely related to sleep, it increases the expression of oncogene P53 and promotes the repair of DNA. At the same time, melatonin can inhibit the cell cycle and reduce cell proliferation, contributing to tumor suppression. Poor sleep conditions will lead to a large reduction in the secretion of melatonin, which may increase the risk of developing intestinal tumors. In addition, circadian rhythm disorders can also trigger mutations in circadian rhythm protein 2, which can lead to a series of diseases (26,27). Moreover, the concept of the brain-gut axis was proposed as early as the 1960s. Further research on microorganisms has gradually recognized that the gut microbiota serves a key regulatory role in the interaction between the brain and the gastrointestinal tract; therefore, the term ‘microbial-brain-gut axis’ was coined (28). The microbial-brain-gut axis is involved in the development of several intestinal diseases. It also influences tumor cell proliferation, apoptosis, invasion and other processes in intestinal tumors. Meanwhile the brain can regulate gastrointestinal tumors through anatomical neural and neuroendocrine pathways (29). Several studies have reported that CRC and precancerous lesions are associated with cognitive function and mental health. The microbiota-gut-brain axis may serve a notable role in this relationship (30–32). Additionally, dysregulation of the microbiota composition and abnormalities in intestinal barrier function may be key factors in the underlying pathological mechanisms (33). Dysregulation of the intestinal flora has been reported in a number of patients with colorectal adenomas, inflammatory bowel disease and cancer. Intestinal flora can form a cancer-promoting intestinal microenvironment through chronic inflammation thus affecting the development of the disease (34). Therefore, in the prevention and treatment of colorectal tumors and precancerous lesions, the concept of ‘brain and intestines together’ could represent a breakthrough.
Furthermore, the present study incorporated lung disease history into the model as Chinese medical theory posits that the lung and large intestine are interconnected and influence each other (35). In previous years, several researchers have assessed this theory by performing experimental studies, reporting that there is a molecular biology basis for the link between the lung and the large intestine (36–38). First, histoembryological studies have reported that the differentiation and development of the lungs, trachea and intestines originate from the endoderm and share a common embryonic origin (39–42). The mucosal immunity has a crucial role in connecting the lungs and the large intestine through both anatomical and physiological activities. The mucous membranes found in the gastrointestinal and respiratory tracts are integral parts of the overall mucosal immune system, which allows for interactions and communication between these two systems. The molecular basis of this mucosal immunity is the secretory IgA (SIgA), which can be secreted in large amounts by both the respiratory and gastrointestinal tract, the latter being the primary site of the SIgA immune response. Through the ‘homing’ and the common immune system, the mucosal immunity of the intestinal tract activates the lymphocytes and reaches the respiratory tract and other mucosal lymphoid tissues, where it exerts the immune response against the same antigen, forming the common mucosal immune system (43). Intestinal trefoil factor 3 (TFF3) is secreted by the cup cells of the intestinal mucosa, where it exerts an important protective effect on the mucosa of the digestive tract. Notably, TFF3 was reported to have a higher level of expression in the respiratory tract than in the colonic tissues and is closely associated with lung function, which further suggests that an inextricable link exists between the lungs and the intestines (44,45). In addition, it was reported that the increase in inflammatory cells (neutrophils, eosinophils and B-lymphocytes) in the lungs, stimulated by lung pathologies such as pneumonia, was accompanied by marked changes in the microbial community in the gut (38,46). These suggest that there is a notable interaction between the functions of the lungs and the intestines, and this inspired the present study.
According to the prediction model in the present study, abnormal bowel frequency is an independent risk factor for developing colorectal adenoma. Changes in bowel frequency, habits and stool characteristics are strongly associated with the emergence of colorectal adenoma (47). These were also early symptoms of CRC, which may be caused by long-term diarrhea and constipation, resulting in intestinal dysfunction and intestinal microecological changes, resulting in chronic inflammation (48). Inflammation is inextricably linked to tumorigenesis. Inflammation can cause DNA changes in intestinal cells and abnormal microbial metabolism, resulting in a series of lesions and, eventually, tumorigenesis (49). This is in line with findings that indicate that constipation could cause the accumulation of harmful substances in the intestinal lumen and damage the intestinal mucosa. This results in repeated repair and healing, leading to DNA changes in the epithelial cells of the mucosal surface, which could lead to adenomas (50,51). The aforementioned findings are consistent with the results of the present study, indicating that an abnormal number of bowel movements was a significant risk factor for colorectal adenomas.
The prediction model revealed that the TT was positively associated with a risk of colorectal adenoma. The TT is the time it takes for blood to clot after standardized thrombin is added to the plasma. TT is considered prolonged only when it is >3 sec from the standard control, suggesting hyperfibrinolysis, increased heparin and heparin-like substances (52). Only 3/730 patients in the present study had abnormal TT values, with the rest within the normal range, implying no association between abnormal TT values and the risk of colorectal adenoma. However, this could indicate that even within the normal range, the greater the TT value, the more likely a colorectal adenoma is to occur. TT is one of the indicators commonly used in clinical practice to reflect the coagulation function and its relationship with malignant tumors has received increasing attention in recent years (53,54). Several researchers have assessed the concept of malignancy-associated coagulation abnormality. Most studies reported that alterations in coagulation and fibrinolytic processes are associated with the occurrence and progression of malignant tumors and 95% of patients are accompanied by abnormalities in coagulation function, such as pancreatic, nasopharyngeal, lung and gastric cancers (55–62). In addition, it is generally considered that activation of the coagulation system is closely associated with the angiogenesis process occurring in human malignant tumors (63). Moreover, the relationship between CRC and coagulation function has been previously assessed, and certain studies have reported that the risk of coagulation abnormality in CRC is higher than that in other tumors (64). Certain studies have also reported that there is an association between the development of CRC and the hypercoagulable state of the blood in several malignant tumors (65,66). Abnormal coagulation can affect the proliferation and migration of tumor cells and even promote tumor angiogenesis, but its specific mechanism needs to be further elucidated (67). The significance of one of the coagulation function indicators, TT, has also been increasingly recognized. Several researchers have included TT in similar diagnostic or prognostic modeling studies. For instance, a predictive modeling study by Junsheng et al (68) reported that TT is an independent risk factor for the progression of colorectal adenoma to CRC and this finding was integrated into the predictive modeling. The clinical study by Fu et al (69) also reported that the prothrombin time, prothrombin time activity, activated partial thromboplastin time, TT and fasting blood glucose levels were higher in patients with CRC than in control patients, suggesting that patients with CRC have notable coagulation abnormalities. Zihong et al (70) reported that there was a marked difference in TT between patients with early-stage CRC and the controls. These findings suggest that coagulation indicators such as TT should be considered in the early diagnosis, treatment and prevention of colorectal adenoma and CRC.
The number of polyps was associated with the risk of colorectal adenoma in the present study. Individuals with ≥3 polyps were more likely to develop colorectal adenoma than those with one to two polyps. According to one study, >50% of colorectal adenomas were multiple polyps (71). Another study reported that most patients with adenomas had ≥3 polyps (72). These findings are consistent with the results of the present study.
The relationship between age and the risk of colorectal adenoma has been assessed by several studies, resulting in generally consistent conclusions. Both colorectal adenoma and CRC have been reported to be were strongly associated with age, and the prevalence increases with advanced age (73–77). Karsenti et al (78) analyzed 6,027 colonoscopy results and reported that the detection rate of adenomas and the rate of advanced tumor formation in individuals aged ≥45 years increased 2-fold compared with those <45 years old. Lin et al (79), in a CRC screening of 2 million elderly individuals, reported that the positive predictive value of CRC and advanced adenomas increased with age. The results of an epidemiologic survey of adenomas based on the Chinese population reported that the prevalence of adenomas increased from 4.6% at age 39 years to a peak of 27.3% at age ≥65 years (80). The detection rate of adenomas and the risk of cancerous transformation in individuals >60 years of age increased dramatically (81,82). The reason for this may be that with age, the function of all organs of the human body gradually declines and the ability of the intestinal mucosa to repair damage is greatly weakened, making it prone to lesions. The older the age, the greater the chance of developing metabolic syndromes such as abnormalities in blood glucose, lipids and blood, pressure which are closely related to the risk of early-onset CRC (83). Certain studies have also suggested that the main reason for this phenomenon is the decreasing stability of colonic pluripotent stem cells with age. This leads to the overproliferation of cells when exposed to adverse factors, ultimately resulting in the formation of adenomas (84). Multiple reasons make age one of the non-negligible risk factors associated with colorectal precancerous lesions.
Numerous previous studies have reported that smoking and drinking history, hypertension, diabetes, uric acid, lipids and triglycerides levels, non-alcoholic fatty liver disease and a family history of polyps are associated with an increased risk of colorectal adenoma (85–90). However, the aforementioned variables were not among the independent risk factors identified in the present study. This discrepancy may be caused by variations in the patient population of different studies, including differences in regions, ethnic groups and sample sizes.
By contrast to conventional epidemiologic studies, the present study introduces the method of nomogram and provides a comprehensive assessment of their performance and clinical utility. The statistical methods used in the present study are more complex and rigorous, and the research process is completely different. In addition to incorporating potential association variables identified in previous studies, the current study included further variables, such as history of encephalopathy and lung disease for the first time, to the best of our knowledge. The results also identified new potential risk factors, such as TT, daily number of bowel movements, increased age and the number of polyps. Using this new research method, further possible influences on diseases could be identified with greater confidence. The clinical prediction model enables the identification of high-risk individuals for colorectal adenomas, thereby facilitating timely e-colonoscopic surveillance, thus increasing the detection rate of adenoma whilst reducing medical costs, preventing the occurrence of colorectal precancerous lesions and interrupting the progression of colorectal adenoma to CRC. This could effectively lower the incidence and mortality rates from CRC.
However, there are several limitations to the present research. First, the C-index suggested that the constructed model had only a moderate predictive performance and there may be a certain degree of selection bias as the study was retrospective. Second, the data for both the training and validation cohorts came from the same source, which may lead to model overfitting. Third, most of the patients included in the study were from Shandong Province in China and the surrounding areas; therefore, it is unclear whether the model applies to individuals from other regions. Subsequently, multicenter studies employing larger samples are needed to validate the reliability and applicability of the present model. Moreover, the aim of the study was to derive the overall risk factors for developing adenomas by comparing the data of adenomatous and non-adenomatous populations, so the study did not take adenoma grading into consideration. However, the grade of colorectal adenoma is an important indicator; therefore, assessing the association between risk factors and certain grades of colorectal cancer is a future research direction.
In conclusion, the present study analyzed a large amount of clinical data and constructed a predictive model of colorectal adenoma risk factors with good efficacy and stability, which provide new ideas for timely screening of high-risk groups and improve the early detection rate of colorectal adenoma. It is expected that the model in the present study will contribute to the development of clinical medicine.
Acknowledgements
Not applicable.
Funding
The present research was supported by the Qilu Traditional Chinese Medicine Superiority Specialty Cluster Project (grant no. YWC2022KJQ0003).
Availability of data and materials
The data generated in the present study may be requested from the corresponding author.
Authors' contributions
DLL and ZAG confirm the authenticity of all the raw data. DLL designed the study framework, acquired the research data, conducted the statistical analyses for the validation of the model and drafted the initial manuscript. LLM finalized the study design details, performed data cleaning and organization, compiled the datasets and conducted data analysis for the establishment of the prediction model. ZAG evaluated and optimized the final study design, applied for and obtained ethical approvals to ensure that the research data could be accessed, and agreed to be accountable for the work in ensuring that questions related to the integrity of any part of the work are appropriately investigated and resolved. YXZ and CJ assisted with data acquisition, conducted data cross-checking and verification, checked all statistical methods and medical terminology in the manuscript and revised it critically for important intellectual content. All authors read and approved the final version of the manuscript.
Ethics approval and consent to participate
The present study was approved by the Ethics Committee of the Affiliated Hospital of Shandong University of Chinese Medicine [approval no. (2024) Ethics Review No. (017)-YJS]. The study only involved the collection of case data using anonymized or de-identified information that cannot be traced back to individuals and poses no risk to patients; therefore, the need for informed patient consent was waived by the Ethics Committee.
Patient consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
References
Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA and Jemal A: Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 68:394–424. 2018. View Article : Google Scholar : PubMed/NCBI | |
The Chinese standard of care for colorectal cancer (2020 edition). Chin J Surg. 58:561–585. 2020.PubMed/NCBI | |
Hui YF, Li C and Guo TH: Colorectal adenomas in TCM syndrome distribution laws and its clinical intervention study. China's Basic Med J Traditional Chin Med. 29:1866–1870. 2023.(In Chinese). | |
Yu M, Liu XB, Zhou M, Zhu ZD, Pan JG and WQ: Change trend of colorectal malignant tumors detected by electronic colonoscopy from 2015 to 2020. J Clin Gastroenterol. 34:361–364. 2022. | |
Courtney RJ, Paul CL, Carey ML, Sanson-Fisher RW, Macrae FA, D'Este C, Hill D, Barker D and Simmons J: A population-based cross-sectional study of colorectal cancer screening practices of first-degree relatives of colorectal cancer patients. BMC Cancer. 13:132013. View Article : Google Scholar : PubMed/NCBI | |
Rex DK, Boland CR, Dominitz JA, Giardiello FM, Johnson DA, Kaltenbach T, Levin TR, Lieberman D and Robertson DJ: Colorectal cancer screening: Recommendations for physicians and patients from the U.S. Multi-society task force on colorectal cancer. Gastroenterology. 153:307–323. 2017. View Article : Google Scholar : PubMed/NCBI | |
Bretthauer M, Kaminski MF, Løberg M, Zauber AG, Regula J, Kuipers EJ, Hernán MA, McFadden E, Sunde A, Kalager M, et al: Population-based colonoscopy screening for colorectal cancer: A randomized clinical trial. JAMA Intern Med. 176:894–902. 2016. View Article : Google Scholar : PubMed/NCBI | |
Li P, Wang CJ and Chen GY: Consensus on screening and diagnosis of early colorectal cancer and precancerous lesions in China. Chin J Practical Internal Med. 35:211–227. 2015.(In Chinese). View Article : Google Scholar | |
Wu XL, Wang LK and Huang XT: A retrospective study on the epidemiological characteristics of colorectal cancer. China Med Herald. 16:60–63. 2019.(In Chinese). | |
Li QQ, Wang J and Zhao Y: Current status of research on risk factors associated with the development of colorectal polyps. Med Rev. 26:3196–3200. 2020. | |
Huang D, Zhu XZ and Sheng WQ: Interpretation of the gastrointestinal epithelial tumors of 2019 edition of <WHO classification of tumors of the digestive system>. Chin J Pathol. 49:209–213. 2020. | |
Fang SG, Wei JG and Chen ZW: WHO classification of tumours of the digestive system (2019). J Diagn Pathol. 26:865–870. 2019.(In Chinese). | |
Tibshirani R: Regression shrinkage and selection via the lasso: A ret-rospective. J Royal Statistical Soc. 58:273–282. 2011. View Article : Google Scholar | |
Sninsky JA, Shore BM, Lupu GV and Crockett SD: Risk factors for colorectal polyps and cancer. Gastrointest Endosc Clin N Am. 32:195–213. 2022. View Article : Google Scholar : PubMed/NCBI | |
von Elm E, Altman DG, Egger M, Pocock SJ, Gøtzsche PC and Vandenbroucke JP; STROBE Initiative, : The strengthening the reporting of observational studies in epidemiology (STROBE) statement: Guidelines for reporting observational studies. Lancet. 370:1453–1457. 2007. View Article : Google Scholar : PubMed/NCBI | |
Fang R: Construction of a mult-icenter sleep database and a risk prediction modeling of moderate to severe obstructive sleep apnea in the population with sleep disorders (unpublished PhD thesis). Southern Medical University; 2024, (In Chinese). | |
Zhao L, Leng Y, Hu Y, Xiao J, Li Q, Liu C and Mao Y: Understanding decision curve analysis in clinical prediction model research. Postgrad Med J. 100:512–515. 2007. View Article : Google Scholar : PubMed/NCBI | |
Vickers AJ and Holland F: Decision curve analysis to evaluate the clinical benefit of prediction models. Spine J. 21:1643–1648. 2021. View Article : Google Scholar : PubMed/NCBI | |
Kerr KF, Brown MD, Zhu K and Janes H: Assessing the clinical impact of risk prediction models with decision curves: Guidance for correct interpretation and appropriate use. J Clin Oncol. 34:2534–2540. 2016. View Article : Google Scholar : PubMed/NCBI | |
Iasonos A, Schrag D, Raj GV and Panageas KS: How to build and interpret a nomogram for cancer prognosis. J Clin Oncol. 26:1364–1370. 2008. View Article : Google Scholar : PubMed/NCBI | |
Balachandran VP, Gonen M, Smith JJ and DeMatteo RP: Nomograms in oncology: More than meets the eye. Lancet Oncol. 16:e173–e180. 2015. View Article : Google Scholar : PubMed/NCBI | |
Gafita A, Calais J, Grogan TR, Hadaschik B, Wang H, Weber M, Sandhu S, Kratochwil C, Esfandiari R, Tauber R, et al: Nomograms to predict outcomes after 177Lu-PSMA therapy in men with metastatic castration-resistant prostate cancer: An international, multicentre, retrospective study. Lancet Oncol. 22:1115–1125. 2021. View Article : Google Scholar : PubMed/NCBI | |
Liu H, Li J, Guo J, Shi Y and Wang L: A prediction nomogram for neonatal acute respiratory distress syndrome in late-preterm infants and full-term infants: A retrospective study. EClinicalMedicine. 50:1015232022. View Article : Google Scholar : PubMed/NCBI | |
Pietrantonio F, Lonardi S, Corti F, Infante G, Elez ME, Fakih M, Jayachandran P, Shah AT, Salati M, Fenocchio E, et al: Nomogram to predict the outcomes of patients with microsatellite instability-high metastatic colorectal cancer receiving immune checkpoint inhibitors. J Immunother Cancer. 9:e0033702021. View Article : Google Scholar : PubMed/NCBI | |
Viswanathan AN, Hankinson SE and Schernhammer ES: Night shift work and the risk of endometrial cancer. Cancer Res. 67:10618–10622. 2007. View Article : Google Scholar : PubMed/NCBI | |
Thompson CL, Larkin EK, Patel S, Berger NA, Redline S and Li L: Short duration of sleep increases risk of colorectal adenoma. Cancer. 117:841–8747. 2011. View Article : Google Scholar : PubMed/NCBI | |
Um K, Park CS, Yoo C, Ahn YS, Kim M and Jeong KS: Risk factors including night shift work of colorectal polyp. Ann Occup Environ Med. 32:e262020. View Article : Google Scholar : PubMed/NCBI | |
Alpert O, Begun L, Issac T and Solhkhah R: The brain-gut axis in gastrointestinal cancers. J Gastrointest Oncol. 12 (Suppl 2):S301–S331. 2021. View Article : Google Scholar : PubMed/NCBI | |
Wang X, Wang S and Cao K: Prevention and treatment of colorectal cancer by ‘brain-intestinal axis’ of traditional Chinese medicine. World TCM. 18:3085–3089. 2023. | |
Cryan JF, O'Riordan KJ, Cowan CSM, Sandhu KV, Bastiaanssen TFS, Boehme M, Codagnone MG, Cussotto S, Fulling C, Golubeva AV, et al: The microbiota-gut-brain axis. Physiol Rev. 99:1877–2013. 2019. View Article : Google Scholar : PubMed/NCBI | |
Xiao W, Su J, Gao X, Yang H, Weng R, Ni W and Gu Y: The microbiota-gut-brain axis participates in chronic cerebral hypoperfusion by disrupting the metabolism of short-chain fatty acids. Microbiome. 10:622022. View Article : Google Scholar : PubMed/NCBI | |
Wu S, Liu X, Jiang R, Yan X and Ling Z: Roles and mechanisms of gut microbiota in patients With Alzheimer's disease. Front Aging Neurosci. 13:6500472021. View Article : Google Scholar : PubMed/NCBI | |
Ma C, Li Y, Mei Z, Yuan C, Kang JH, Grodstein F, Ascherio A, Willett WC, Chan AT and Huttenhower C: Association between bowel movement pattern and cognitive function: Prospective cohort study and a metagenomic analysis of the gut microbiome. Neurology. 101:e2014–e2025. 2023. View Article : Google Scholar : PubMed/NCBI | |
Polimeno L, Barone M, Mosca A, Viggiani MT, Joukar F, Mansour-Ghanaei F, Mavaddati S, Daniele A, Debellis L, Bilancia M, et al: Soy Metabolism by gut microbiota from patients with precancerous intestinal lesions. Microorganisms. 8:4692020. View Article : Google Scholar : PubMed/NCBI | |
Wei H: Theoretical connotation and biological mechanism of ‘The Lung is Connected with the Large Intestine, and the Large Intestine Corresponds to the Skin’. J Beijing Univ Traditional Chin Med. 46:1350–1356. 2023. | |
Liu B, Yu Y, Zhao M, Xiao K, Yan P, Duan Z, Wang K, Zhao N, Cao J, Wang J and Xie L: Correlation analysis of the microbiome and immune function in the lung-gut axis of critically Ill patients in the ICU. Front Med (Lausanne). 9:8083022022. View Article : Google Scholar : PubMed/NCBI | |
Zhao M, Shao F, Yu D, Zhang J, Liu Z, Ma J, Xia P and Wang S: Maturation and specialization of group 2 innate lymphoid cells through the lung-gut axis. Nat Commun. 13:76002022. View Article : Google Scholar : PubMed/NCBI | |
Mazumder MHH, Gandhi J, Majumder N, Wang L, Cumming RI, Stradtman S, Velayutham M, Hathaway QA, Shannahan J, Hu G, et al: Lung-gut axis of microbiome alterations following co-exposure to ultrafine carbon black and ozone. Part Fibre Toxicol. 20:152023. View Article : Google Scholar : PubMed/NCBI | |
Zorn AM and Wells JM: Vertebrate endoderm development and organ formation. Annu Rev Cell Dev Biol. 25:221–251. 2009. View Article : Google Scholar : PubMed/NCBI | |
Rankin SA, Han L, McCracken KW, Kenny AP, Anglin CT, Grigg EA, Crawford CM, Wells JM, Shannon JM and Zorn AM: A retinoic acid-hedgehog cascade coordinates mesoderm-inducing signals and endoderm competence during lung specification. Cell Rep. 16:66–78. 2016. View Article : Google Scholar : PubMed/NCBI | |
Liu S, Liu XY, Li LH, Wang X, Cui YF, Sun GZ and Guo XZ: Histology and cytology basic research on'Lung and Large Intestine being interior-exteriorly Related'. Chin J Trad Chin Med Pharm. 27:1167–1170. 2012.(In Chinese). | |
Li LH: ‘Biological mechanism of the relationship between the lung and the large intestine: A study on the physiological mechanism of the correlation between lung and intestinal tissues in a rat]. Beijing: Beijing University of Chinese Medicine; pp. 1–4. 2012 | |
Tulic MK, Piche T and Verhasselt V: Lung-gut cross-talk: Evidence, mechanisms and implications for the mucosal inflammatory diseases. Clin Exp Allergy. 46:519–528. 2016. View Article : Google Scholar : PubMed/NCBI | |
Tan XD, Chen YH, Liu QP, Gonzalez-Crussi F and Liu XL: Prostanoids mediate the protective effect of trefoil factor 3 in oxidant-induced intestinal epithelial cell injury: Role of cyclooxygenase-2. J Cell Sci. 113:2149–2155. 2000. View Article : Google Scholar : PubMed/NCBI | |
Ganguly K and Schulz H: Association studies of lung function in mice. Dtsch Tierarztl Wochenschr. 115:276–284. 2008.PubMed/NCBI | |
Pietrantonio F, Lonardi S, Corti F, Infante G, Elez ME, Fakih M, Jayachandran P, Shah AT, Salati M, Fenocchio E, et al: Nomogram to predict the outcomes of patients with microsatellite instability-high metastatic colorectal cancer receiving immune checkpoint inhibitors. J Immunother Cancer. 9:e0033702021. View Article : Google Scholar : PubMed/NCBI | |
Mao WX, Zhong ZS, Huang SP, Zhang W and Wang J: Analysis of risk factors and TCM mechanism of adenomatous polyps. Chin J Integr Tradit West Med Dig. 27:726–729,734. 2019. | |
Hao Y, Wang Y, Qi M, He X, Zhu Y and Hong J: Risk factors for recurrent colorectal polyps. Gut Liver. 14:399–411. 2020. View Article : Google Scholar : PubMed/NCBI | |
Lucas C, Barnich N and Nguyen HTT: Microbiota, inflammation and colorectal cancer. Int J Mol Sci. 18:13102017. View Article : Google Scholar : PubMed/NCBI | |
He X, Wu K, Ogino S, Giovannucci EL, Chan AT and Song M: Association between risk factors for colorectal cancer and risk of serrated polyps and conventional adenomas. Gastroenterology. 155:355–373.e18. 2018. View Article : Google Scholar : PubMed/NCBI | |
Yun GY, Moon HS, Kwon IS, Kim JS, Kang SH, Lee ES, Kim SH, Sung JK, Lee BS and Jeong HY: Left-sided colectomy: One of the important risk factors of metachronous colorectal adenoma after colectomy for colon cancer. Dig Dis Sci. 63:1052–1061. 2018. View Article : Google Scholar : PubMed/NCBI | |
International Society on Thrombosis and Haemostasis (ISTH), . ISTH guidelines for coagulation testing. 2021.https://www.isth.org | |
Ren Y, Liang H, Huang Y, Miao Y, Li R, Qiang J, Wu L, Qi J, Li Y, Xia Y, et al: Key candidate genes and pathways in T lymphoblastic leukemia/lymphoma identified by bioinformatics and serological analyses. Front Immunol. 15:13412552024. View Article : Google Scholar : PubMed/NCBI | |
Wu LL, Lin WK, Qian JY, Ma SS, Li MJ, Li K, Li ZX, Lan G and Xie D: Prognostic assessment of lung adenocarcinoma patients with early-staging diseases: A nomogram based on coagulation-related factors. Eur J Cardiothorac Surg. 64:ezad3132023. View Article : Google Scholar : PubMed/NCBI | |
Wang HY, Xiu DR, Li ZF and Wang G: Coagulation function in patients with pancreatic carcinoma. Chin Med J (Engl). 122:697–700. 2009.PubMed/NCBI | |
Ward MP, E Kane L, Norris L, Mohamed BM, Kelly T, Bates M, Clarke A, Brady N, Martin CM, Brooks RD, et al: Platelets, immune cells and the coagulation cascade; friend or foe of the circulating tumour cell? Mol Cancer. 20:592021. View Article : Google Scholar : PubMed/NCBI | |
Yin J and Zhu SS: Routine coagulation molecules predict nasopharyngeal carcinoma and associated metastases. Br J Biomed Sci. 76:178–183. 2019. View Article : Google Scholar : PubMed/NCBI | |
Peng PH, Wu CC, Liu SC, Chang KP, Chen CD, Chang YT, Hsu CW, Chang YS and Yu JS: Quantitative plasma proteome analysis reveals aberrant level of blood Coagulation-related proteins in nasopharyngeal carcinoma. J Proteomics. 74:744–757. 2011. View Article : Google Scholar : PubMed/NCBI | |
Marinho FC and Takagaki TY: Hypercoagulability and lung cancer. J Bras Pneumol. 34:312–322. 2008.(In English, Portuguese). View Article : Google Scholar : PubMed/NCBI | |
Hammouda A, Souilah S, Ferhat-Hamida MY, Amir ZC, Aouichat-Bouguerra S and Hariti G: Activation of coagulation in patients with lung cancer. Ann Biol Clin (Paris). 77:272–280. 2019.(In French). PubMed/NCBI | |
Repetto O and De Re V: Coagulation and fibrinolysis in gastric cancer. Ann N Y Acad Sci. 1404:27–48. 2017. View Article : Google Scholar : PubMed/NCBI | |
Wang B, Zou D, Wang N, Wang H, Zhang T, Gao L, Ma C, Zheng P, Gu B, Li X, et al: Construction and validation of a novel coagulation-related 7-gene prognostic signature for gastric cancer. Front Genet. 13:9576552022. View Article : Google Scholar : PubMed/NCBI | |
Chuang IW, Cho YH, Ahn MJ, Lee MJ, Kim GM, Chung CS and Bang OY: Association of cancer cell type and extracellular vesicles with coagulopathy in patients with lung cancer and stroke. Stroke. 49:1282–1285. 2018. View Article : Google Scholar | |
Kessler CM: The link between cancer and venous thromboembolism: A review. Am J Clin Oncol. 32 (4 Suppl):S2–S7. 2009. View Article : Google Scholar | |
Guo Q, Zhang B, Dong X, Xie Q, Guo E, Huang H and Wu Y: Elevated levels of plasma fibrinogen in patients with pancreatic cancer: Possible role of a distant metastasis predictor. Pancreas. 38:e75–e79. 2009. View Article : Google Scholar : PubMed/NCBI | |
Tang L, Liu K, Wang J, Wang C, Zhao P and Liu J: High preoperative plasma fibrinogen levels are associated with distant metastases and impaired prognosis after curative resection in patients with colorectal cancer. J Surg Oncol. 102:428–432. 2010. View Article : Google Scholar : PubMed/NCBI | |
Sahni A, Simpson-Haidaris PJ, Sahni SK, Vaday GG and Francis CW: Fibrinogen synthesized by cancer cells augments the prolifrative effect of fibroblast growth factor-2. J Thromb Haemost. 6:176–183. 2008. View Article : Google Scholar : PubMed/NCBI | |
Junsheng L, Ziling Y and Zhengyuan H: Construction of a prediction model to distinguish colorectal adenoma from colorectal cancer based on conventional test indicators. Tumor Control Res. 51:353–360. 2024.(In Chinese). | |
Liu ZH, Zhang N, Li FX, Li SX and Liu JT: Correlation between homocysteine, coagulation function indicators and the risk of colorectal cancer incidence: a case-control study. Chin J Clin Oncol. 50:654–660. 2023.(In Chinese). | |
Chen ZH, Sun SB, Wang XM, Yu SS, Wu XF, Chen XS, Geng T, Liu H, Liu XM and Nan Q: Abnormal Coagulation Parameters in Patients with Early Colorectal Cancer and Its Clinical Significance. Journal of Kunming Medical University. 42:96–100. 2021.(In Chinese). | |
Negro S, Bao QR, Scarpa M, Scognamiglio F, Pucciarelli S, Remo A, Agostini M, D'Angelo E, Mammi I, Schiavi F, et al: Multiple colorectal adenomas syndrome: The role of MUTYH mutation and the polyps' number in clinical management and colorectal cancer risk. Dig Liver Dis. 56:1087–1094. 2024. View Article : Google Scholar : PubMed/NCBI | |
Wang W and Wang XR: Exploring Risk Factors for Colorectal Polyp Malignant Transformation in Middle-Aged and Elderly Patients Based on Multivariate Logistic Regression Analysis. Modern Medicine and Health Research Electronic Journal. 9:122–124. 2025.(In Chinese). | |
Kim SE, Shim KN, Jung SA, Yoo K and Moon IH: An association be-tween obesity and the prevalence of colonic adenoma according to age and gender. J Gastroenterol. 42:616–623. 2007. View Article : Google Scholar : PubMed/NCBI | |
Meester RGS, Mannalithara A, Lansdorp-Vogelaar I and Ladabaum U: Trends in incidence and stage at diagnosis of colorectal cancer in adults aged 40 through 49 years, 1975–2015. JAMA. 321:1933–1934. 2019. View Article : Google Scholar : PubMed/NCBI | |
Chen XY and Kong LB: Progress of research on factors and mechanisms associated with carcinogenesis of colorectal adenomatous polyps. Chin J Cancer Prevention Treatment. 26:354–358. 2019. | |
Wu H, Zhang J and Zhou B: Metabolic syndrome and colorectal adenoma risk: A systematic review and metaanalysis. Clin Res Hepatol Gastroenterol. 45:1017492021. View Article : Google Scholar : PubMed/NCBI | |
Han X, Qian W, Liu Y, Zheng T, Su XJ, Zhang PP, Chen Y, Hu LH and Li ZS: Effects of age, sex and pathological type on the risk of multiple polyps: A Chinese teaching hospital study. J Dig Dis. 21:505–511. 2020. View Article : Google Scholar : PubMed/NCBI | |
Karsenti D, Tharsis G, Burtin P, Venezia F, Tordjman G, Gillet A, Samama J, Nahon-Uzan K, Cattan P and Cavicchi M: Adenoma and advanced neoplasia detection rates increase from 45 years of age. World J Gastroen. 25:447–456. 2019. View Article : Google Scholar | |
Lin G, Feng Z, Liu H, Li Y, Nie Y, Liang Y and Li K: Mass screening for colorectal cancer in a population of two million older adults in Guangzhou, China. Sci Rep. 9:104242019. View Article : Google Scholar : PubMed/NCBI | |
Hong W, Dong L, Stock S, Basharat Z, Zippi M and Zhou M: Prevalence and characteristics of colonic adenoma in mainland China. Cancer Manag Res. 10:2743–2755. 2018. View Article : Google Scholar : PubMed/NCBI | |
Sun DH: Clinical characteristics of 543 cases of tubular adenomatous polyps and their carcinogenesis of the large intestine. Gansu Med. 36:705–707. 2017. | |
McCashland TM, Brand R, Lyden E and de Garmo P; CORI Research Project, : Gender differences in colorectal polyps and tumors. Am J Gastroenterol. 96:882–886. 2001. View Article : Google Scholar : PubMed/NCBI | |
Jin EH, Han K, Lee DH, Shin CM, Lim JH, Choi YJ and Yoon K: Association between metabolic syndrome and the risk of colorectal cancer diagnosed before age 50 years according to tumor location. Gastroenterology. 163:637–648.e2. 2022. View Article : Google Scholar : PubMed/NCBI | |
Chen H, Li N, Ren J, Feng X, Lyu Z, Wei L, Li X, Guo L, Zheng Z, Zou S, et al: Participation and yield of a population-based colorectal cancer screening programme in China. Gut. 68:1450–1457. 2016. View Article : Google Scholar : PubMed/NCBI | |
Soltani G, Poursheikhani A, Yassi M, Hayatbakhsh A, Kerachian M and Kerachian MA: Obesity, diabetes and the risk of colorectal adenoma and cancer. BMC Endocr Disord. 19:1132019. View Article : Google Scholar : PubMed/NCBI | |
Lin CC, Huang KW, Luo JC, Wang YW, Hou MC, Lin HC, Lee FY and Chan WL: Hypertension is an important predictor of recurrent colorectal adenoma after screening colonoscopy with adenoma polypectomy. J Chin Med Assoc. 77:508–512. 2014. View Article : Google Scholar : PubMed/NCBI | |
Xing J, Ren J and Zhang Q: Analysis of risk factors for the development of colorectal adenomatous polyps. J Capital Med Univ. 42:601–608. 2021. | |
Liu ZH, Zhang GX, Zhang H, Jiang L, Deng Y, Chan FSY and Fan JKM: Association of body fat distribution and metabolic syndrome with the occurrence of colorectal adenoma: A Case-control study. J Dig Dis. 22:222–229. 2021. View Article : Google Scholar : PubMed/NCBI | |
Blackett JW, Verna EC and Lebwohl B: Increased prevalence of colorectal adenomas in patients with nonalcoholic fatty liver disease: A Cross-sectional study. Dig Dis. 38:222–230. 2020. View Article : Google Scholar : PubMed/NCBI | |
Han X, Qian W, Liu Y, Zheng T, Su XJ, Zhang PP, Chen Y, Hu LH and Li ZS: Effects of age, sex and pathological type on the risk of multiple polyps: A Chinese teaching hospital study. J Dig Dis. 21:505–511. 2020. View Article : Google Scholar : PubMed/NCBI |