Figures
Abstract
Background
The unique anatomical characteristics and blood supply of the rectosigmoid junction confer particular significance to its physiological functions and clinical surgeries. However, research on the prognosis of rectosigmoid junction cancer (RSC) is scarce, and reliable clinical prediction models are lacking.
Methods
This retrospective study included 524 patients diagnosed with RSC who were admitted to the Department of Gastrointestinal and Colorectal Surgery at the First Hospital of Jilin University between January 1, 2017, and June 1, 2019. Univariate and multivariate Cox regression analyses were conducted in this study to identify independent risk factors impacting the survival of RSC patients. Subsequently, models were constructed using six different machine learning algorithms. Finally, the discrimination, calibration, and clinical applicability of each model were evaluated to determine the optimal model.
Results
Through univariate and multivariate Cox regression analyses, we identified seven independent risk factors associated with the survival of RSC patients: age (HR = 1.9, 95% CI: 1.3-2.8, P = 0.001), gender (HR = 0.6, 95% CI: 0.4-0.9, P = 0.013), diabetes (HR = 2.0, 95% CI: 1.3-3.1, P = 0.002), tumor differentiation (HR = 2.1, 95% CI: 1.4-3.1, P < 0.001), tumor N stage (HR = 2.02, 95% CI: 1.2-3.4, P = 0.009), distant metastasis (HR = 4.2, 95% CI: 2.7-6.7, P < 0.001), and anastomotic leakage (HR = 2.4, 95% CI: 1.1-5.3, P = 0.034). After evaluating each model, the prediction model based on XGBoost was determined to be the optimal model, with AUC of 0.7856, 0.8484, and 0.796 at 1, 3, and 5 years. It also had the lowest Brier scores at all time points, and decision curve analysis (DCA) demonstrated the best clinical decision benefits compared to other models.
Citation: Wang Y, Chen B, Yu J (2025) A machine learning-based model for predicting survival in patients with Rectosigmoid Cancer. PLoS ONE 20(3): e0319248. https://doi.org/10.1371/journal.pone.0319248
Editor: Mert Tanal, Acibadem Maslak Hospital: Acibadem Maslak Hastanesi, TÜRKIYE
Received: October 14, 2024; Accepted: January 30, 2025; Published: March 25, 2025
Copyright: © 2025 Wang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Abbreviations: RSC, rectosigmoid cancer; AAMR, age-adjusted mortality rate; CEA, carcinoembryonic Antigen; XGBoost, extreme gradient boosting; Rpart, recursive partitioning and regression trees; GBM, gradient boosting machine; SVM, support vector machine; RSF, random survival forest; Lasso-Cox, lasso-cox model; AUROC, area under the receiver operating characteristic curve; AUC, area under the curve; DCA, decision curve analysis; IQR, interquartile range; HR, hazard ratio; CI, confidence interval; OS, overall survival; NK, natural killer (cells)
Introduction
As a disease prevalent worldwide, colorectal cancer is the second most diagnosed cancer in women and the third in men [1]. Despite advances and the widespread adoption of screening technologies, colorectal cancer continues to be a major public health issue [2,3]. Recent statistics indicate an increase in colorectal cancer incidence among individuals younger than 55 years. During the late 1990s, colorectal cancer was the fourth leading cause of cancer deaths among individuals under 50, but it has recently emerged as the top cause of cancer mortality in men and the second highest in women [4].
Among the various tumor locations within the colorectal region, tumors located at the rectosigmoid junction hold a unique position, encompassing parts of both the rectum and the colon. Anatomically, the rectosigmoid junction is situated at the distal end of the sigmoid colon and the upper segment of the rectum. At this junction, the diameter of the intestinal lumen gradually narrows, aiding in intestinal peristalsis and fecal excretion, thus playing a crucial role in bowel function [5,6]. In terms of blood supply, the rectosigmoid junction stands in stark contrast to the sigmoid colon and rectum due to its unique vascular network. The blood supply to the rectosigmoid junction relies on the convergence of terminal branches of the inferior mesenteric artery and the superior rectal artery, forming a complex arterial anastomosis [7]. In contrast, arteries that solely supply the rectum or sigmoid colon rarely form such a dense vascular network. This distinctive anatomical feature and blood supply make the rectosigmoid junction particularly important in both physiological function and clinical surgery, setting it apart from the rectum or sigmoid colon alone.
From 1999 to 2020, the age-adjusted mortality rate (AAMR) for patients with rectosigmoid junction cancer (RSC) in the United States significantly increased [8]. Previous studies have shown that colorectal cancer exhibits considerable inter-patient heterogeneity at both histopathological and molecular levels, leading to varying disease outcomes even for patients with identical tumor stages [9]. Therefore, developing clinical prediction models capable of predicting individual prognoses for colorectal cancer is both important and meaningful. While several studies have attempted to establish clinical prediction models or other methods to predict the prognosis or subsequent treatment outcomes for colorectal cancer, there is a paucity of research focused on the prognosis of tumors at the rectosigmoid junction, and well-reported clinical prediction models are lacking [10–12].
This study aims to explore the clinical factors influencing the survival of patients with RSC and to develop an accurate survival prediction model using machine learning algorithms. By integrating clinical, pathological, and demographic data, machine learning overcomes the limitations of traditional statistical methods and offers greater precision in survival analysis. The proposed model is designed to provide data-driven support for clinicians, optimize treatment decisions, and facilitate personalized therapeutic strategies. Ultimately, it seeks to improve survival outcomes for RSC patients, highlighting the potential of machine learning in advancing cancer care.
Patients and methods
This retrospective study collected and followed up on 575 patients diagnosed with RSC who were admitted to the Department of Gastrointestinal and Colorectal Surgery at the First Hospital of Jilin University between January 1, 2017, and June 1, 2019. For the definition of the rectosigmoid junction, we selected the distance of 15 cm from the anal verge using a rigid endoscope [13–16]. The inclusion criteria for our cases were as follows: (1) admission between January 1, 2017, and June 1, 2019; (2) age ≥ 18 years; (3) preoperative rigid colonoscopy with a corresponding report; (4) colonoscopy indicating the lower edge of the tumor was 15-18 cm from the anal verge; or, despite the colonoscopy indicating the lower edge of the tumor was not 15-18 cm from the anal verge, the tumor volume covered this area; or colonoscopy clearly indicating the tumor was located at the rectosigmoid junction; (5) primary RSC; (6) undergoing radical tumor resection during hospitalization; (7) surgical pathology reporting adenocarcinoma; (8) patients having complete surgical pathology information, including tumor volume, tumor stage, pericolic lymph node metastasis, perineural invasion and vascular invasion. Patients who did not meet these criteria or were missing any inclusion variables were excluded from the analysis. Additionally, we excluded 51 patients due to loss of follow-up. Based on strict adherence to the inclusion and exclusion criteria, a total of 524 patients were ultimately included in this study. Data access occurred from July 1, 2024, to July 31, 2024, with data sourced from the medical records management system of the First Hospital of Jilin University. Data retrieval was conducted after obtaining the corresponding ethics approval. The data does not contain personally identifiable information. The study was approved by the Ethics Committee of the First Hospital of Jilin University, which granted a waiver for informed consent, as this retrospective research does not involve patient privacy or any potential health risks. The study adhered to the Helsinki Declaration. Detailed patient screening information is shown in Fig 1.
We collected clinical information for patients diagnosed with RSC who were admitted to the First Hospital of Jilin University between January 1, 2017, and June 1, 2019, using the medical record collection system. The data were subsequently processed accordingly. The pathological staging used in this study followed the 8th edition of the AJCC Cancer Staging Manual. Categorical variables were encoded in S1 Table.
Model construction and evaluation
In this study, we used six machine learning algorithms, including Extreme Gradient Boosting (XGBoost), Recursive Partitioning and Regression Trees (Rpart), Gradient Boosting Machine (GBM), Support Vector Machine (SVM), Random Survival Forest (RSF), and Lasso-Cox Model (Lasso-cox). XGBoost is a gradient boosting framework that leverages tree-based learning algorithms, has demonstrated efficacy in constructing prognostic and diagnostic medical models in recent years. Rpart is an intuitive decision tree algorithm that builds predictive models by recursively partitioning the dataset, excelling in handling both classification and regression problems. GBM is an ensemble learning method that improves model predictive performance by sequentially building multiple weak learners, with each new learner attempting to correct the errors of the previous model, thereby gradually enhancing the overall model accuracy. SVM is a widely implemented supervised learning algorithm, extensively utilized for both classification and regression tasks. It constructs a maximum margin hyperplane in high-dimensional space to effectively separate data points, possessing strong capabilities in handling nonlinear and high-dimensional data. RSF is an extension of the random forest for survival analysis. It reduces training variance and enhances ensemble and generalization by constructing multiple survival decision trees, offering high predictive accuracy and stability. Lasso-cox is a model that combines dimensionality reduction with the traditional Cox model, effectively fitting data related to survival analysis. The data were randomly split into training cohort and testing cohort with an 8:2 ratio. After identifying independent risk factors related to the survival of RSC patients through univariate and multivariate Cox regression analyses, we trained the training set data using six different machine learning algorithms. To determine the best model among these six, we conducted a comprehensive evaluation using multiple performance metrics. Subsequently, we applied these trained models to the internal validation set to assess their generalization ability and extrapolation performance. By comparing the performance of each model, we identified the final model. The evaluation of the models covered three dimensions: discrimination, calibration, and clinical applicability. Discrimination is typically assessed using the area under the receiver operating characteristic curve (AUC). The AUC values reflect the predictive ability of the model: an AUC < 0.50 indicates no predictive ability; 0.50 < AUC ≤ 0.70 indicates moderate accuracy; 0.70 < AUC ≤ 0.90 indicates moderate to high accuracy; and AUC > 0.90 indicates high accuracy. To evaluate the predictive accuracy of the models, we introduced the Brier score. The Brier score quantifies the mean squared error between predicted probabilities and actual outcomes; lower scores indicate better predictive performance. The unique advantage of the Brier score is that it not only assesses the accuracy of the model but also evaluates whether the predicted probabilities are consistent with the actual outcomes. For the clinical applicability of the models, we used decision curve analysis. The DCA curve integrates the preferences of patients and decision-makers into the analysis, offering a better assessment of the clinical utility of predictive models compared to the ROC curve.
Survival analysis
Survival analysis in this study focused on overall survival (OS). The Kaplan-Meier method was used to estimate OS, and differences between groups were compared using the log-rank test. Cox proportional hazards regression was employed to identify factors independently associated with OS, with results reported as hazard ratio (HRs) and 95% confidence intervals (CIs). Variables with P-values < 0.05 in univariate Cox regression analysis were included in a multivariate Cox model to adjust for potential confounders.
Statistical analysis
All analyses in this study were performed using R software (version 4.2.3). We have made the code used in this study available in S1 Data. The Shapiro-Wilk test was used to assess the normality of continuous variables. Continuous variables with a normal distribution were presented as mean ± standard deviation (mean ± SD) and compared using the independent samples t-test. Non-normally distributed continuous variables were reported as median and interquartile range (IQR) and analyzed using the Kruskal-Wallis test. Categorical variables were shown as counts (percentages) and compared using the chi-square test or Fisher’s exact test. Fisher’s exact test was used when T < 1; the corrected chi-square test was used when 1 ≤ T ≤ 5; and Pearson’s chi-square test was used when T ≥ 5. A two-sided P-value < 0.05 was considered statistically significant.
Results
Clinical characteristics of patients
A total of 524 patients were included in this study, randomly divided into a training cohort (n = 415) and an internal testing cohort (n = 109) (Table 1). The majority of RSC patients were elderly males over 60 years old (57.3%) (69.5%). A small portion of the patients had hypertension (21.2%) and/or diabetes (11.5%). Most patients underwent laparoscopic surgery during hospitalization (97.7%), while a few underwent open surgery (2.3%). The tumors were predominantly moderately to poorly differentiated (84.9%), with a small portion being well-differentiated (15.1%). The maximum diameter of most tumors was ≤ 5 cm (72.9%). Among the patients, 17.2% were in T stage I-II, while 82.8% were in T stage III-IV. 48.1% of the patients were in N stage 0, while 51.9% were in N stage 1-2. 31.1% of the patients had perineural invasion, and 48.0% had vascular invasion. The vast majority of patients had no distant metastasis at diagnosis (80.8%). Very few patients experienced anastomotic leakage (1.9%) or underwent reoperation (1.1%) postoperatively. Most patients had no history of surgery before hospitalization (73.7%). A small number of patients underwent protective stoma surgery (5.3%), while the vast majority did not (94.7%). Only a few RSC patients received radiotherapy postoperatively (4.6%), while more patients opted for chemotherapy (57.1%). The mean survival was 1664.60 days, with a standard deviation of 624.92 days.
The included patients were randomly divided into training and validation sets in an 8:2 ratio (Table 2). To verify the balance between the training and test sets, we performed Kaplan-Meier survival curve analysis on the survival probabilities of the two groups of patients (Fig 2). The results showed no significant difference in survival curves between the test set and the training set (P = 0.41), indicating that the survival outcomes of patients in the training and test sets were statistically similar. This similarity suggests that our dataset is balanced, ensuring that the model to be constructed will have consistent predictive performance across both datasets. Through this analysis, we can ensure that the training and testing data are comparable.
Univariate and multivariate cox regression analysis of survival in RSC patients
We applied both univariate and multivariate Cox regression analyses to pinpoint the risk factors linked to the survival of patients with rectosigmoid junction cancer (RSC) in the training set (Table 3). The results of the univariate Cox regression showed that age, gender, diabetes, tumor differentiation grade, tumor T stage, tumor N stage, perineural invasion, vascular invasion, distant metastasis, anastomotic leakage, protective stoma, CA199 level, and CEA level were all associated with cancer survival in RSC patients (P < 0.05). After multivariate regression analysis, we identified seven independent risk factors associated with the survival of RSC patients: age (HR = 1.9, 95% CI: 1.3-2.8, P = 0.001), gender (HR = 0.6, 95% CI: 0.4-0.9, P = 0.013), diabetes (HR = 2.0, 95% CI: 1.3-3.1, P = 0.002), tumor differentiation grade (HR = 2.1, 95% CI: 1.4-3.1, P < 0.001), tumor N stage (HR = 2.02, 95% CI: 1.2-3.4, P = 0.009), distant metastasis (HR = 4.2, 95% CI: 2.7-6.7, P < 0.001), and anastomotic leakage (HR = 2.4, 95% CI: 1.1-5.3, P = 0.034).
Evaluation of the predictive performance of six machine learning models for RSC patient survival
We used ROC curves to evaluate the predictive performance of six machine learning algorithms for overall survival at 1 year, 3 years, and 5 years (Fig 3). For the 1-year survival prediction, XGBoost (AUC = 0.786), Rpart (AUC = 0.747), RSF (AUC = 0.739), Lasso-Cox (AUC = 0.730), GBM (AUC = 0.705), and SVM (AUC = 0.556). For the 3-year survival prediction, XGBoost (AUC = 0.848), Lasso-Cox (AUC = 0.840), GBM (AUC = 0.839), RSF (AUC = 0.819), Rpart (AUC = 0.815), and SVM (AUC = 0.685). For the 5-year survival prediction, XGBoost (AUC = 0.796), GBM (AUC = 0.786), Lasso-Cox (AUC = 0.782), Rpart (AUC = 0.775), RSF (AUC = 0.748), and SVM (AUC = 0.588). These results indicate that XGBoost model consistently exhibited the strongest predictive capability across short-term, medium-term, and long-term survival predictions. It outperformed the other models in terms of predictive accuracy, demonstrating its stability and reliability for predicting the survival of RSC patients.
Abbreviations: AUC, Area under the Curve; XGBoost, Extreme Gradient Boosting; Rpart,Recursive Partitioning and Regression Trees; GBM, Gradient Boosting Machine; SVM, Support Vector Machine; RSF, Random Survival Forest; Lasso-Cox, Lasso-Cox Model.
To visually demonstrate the predictive performance of each model at different time points, we plotted the continuous AUC comparisons of the models at 1 year, 3 years, and 5 years (Fig 4). The results showed that the AUC values of the XGBoost model were higher than other models at all time points, indicating its superior performance in predicting short-term, medium-term, and long-term survival. We also displayed the Brier score curves of the six machine learning models at different time points (1 year, 3 years, and 5 years). It indicated that the Brier scores of the XGBoost model were the lowest at all time points, indicating that its predictive performance was superior to that of the other models (Fig 5).
Abbreviations:AUC, Area under the Curve; XGBoost, Extreme Gradient Boosting; Rpart,Recursive Partitioning and Regression Trees; GBM, Gradient Boosting Machine; SVM, Support Vector Machine; RSF, Random Survival Forest; Lasso-Cox, Lasso-Cox Model.
Abbreviations: XGBoost, Extreme Gradient Boosting; Rpart,Recursive Partitioning and Regression Trees; GBM, Gradient Boosting Machine; SVM, Support Vector Machine; RSF, Random Survival Forest; Lasso-Cox, Lasso-Cox Model.
To evaluate the clinical applicability of the models, we plotted the decision curves for each model at 1 year, 3 years, and 5 years (Fig 6). DCA revealed that the XGBoost model demonstrated the highest net benefit across most thresholds for 1-year, 3-year, and 5-year survival predictions, indicating its superior practical value in clinical decision-making. Although other models also performed well at certain specific thresholds, they were generally less stable and accurate compared to the XGBoost model.
Abbreviations: XGBoost, Extreme Gradient Boosting; Rpart,Recursive Partitioning and Regression Trees; GBM, Gradient Boosting Machine; SVM, Support Vector Machine; RSF, Random Survival Forest; Lasso-Cox, Lasso-Cox Model.
Overall, the XGBoost model demonstrated the best performance in predicting the survival of RSC patients. It significantly outperformed the other models in terms of discrimination ability (AUC values), predictive accuracy and calibration (Brier scores), and net benefit in clinical decision-making (decision curve analysis). These results indicate that the XGBoost model not only has high predictive performance but also strong clinical utility and reliability, making it the optimal choice for predicting the survival of RSC patients.
Discussion
In this study, we analyzed independent risk factors associated with the survival of RSC patients. Subsequently, we constructed and evaluated survival prediction models based on six advanced machine learning algorithms. Our results showed that the Extreme Gradient Boosting (XGBoost) model performed the best across all evaluation metrics, demonstrating its great potential in clinical prediction models. Although several studies have previously focused on constructing prognostic models for colorectal cancer, these studies typically categorized cancers at the rectosigmoid junction as either rectal or sigmoid colon cancers, without separate investigation. In contrast, we have innovatively applied machine learning algorithms to construct a prognostic model specifically for RSC patients. Compared to traditional nomogram models, our machine learning-based predictive model demonstrates significant improvements in both performance and applicability. To our knowledge, this is the first machine learning-based prediction model specifically constructed for the survival of RSC patients.
Through univariate and multivariate regression analyses, we identified seven independent risk factors affecting the survival of RSC patients: age, gender, diabetes, tumor differentiation grade, tumor N stage, distant metastasis, and anastomotic leakage. Age plays a significant role in predicting the survival of RSC patients. Our study found that patients over 60 years old had poorer prognoses. Elderly patients often exhibit higher mortality rates and shorter survival periods when diagnosed with colorectal cancer due to decreased immune function, multiple comorbidities, and poorer tolerance to treatment [17]. Additionally, older patients tend to have a lower tolerance for treatments such as chemotherapy and radiotherapy, increasing the likelihood of complications, which can adversely affect survival [18]. Therefore, when formulating treatment plans for elderly RSC patients, it is crucial to pay closer attention to their overall health status. It may be necessary to consider reducing the invasiveness of surgery and subsequent treatments.
Among the 524 patients included in the study, female patients accounted for only 30.5% of RSC cases. From a physiological perspective, women may have a relatively lower risk of developing colorectal cancer due to the protective effects of hormones such as estrogen. Estrogen is believed to have anti-inflammatory and anti-tumor properties, inhibiting tumor occurrence and progression. Estrogen and its receptors play crucial roles in regulating autoimmune diseases and anti-tumor immunity by influencing the interactions between tumor cells, immune cells, and the tumor microenvironment, thereby modulating tumor immunity [19]. However, our study indicates that once diagnosed with RSC, the prognosis for women is not significantly better than that for men. This may be related to the biological characteristics of the tumor and the response of female patients to treatment [20]. Differences between males and females in cancer progression and treatment response need to be considered when developing personalized treatment plans. For example, adjustments in chemotherapy dosage or the selection of different drug combinations may be necessary to account for varying biological responses in male and female patients.
Diabetes is considered an important prognostic factor for RSC patients. Previous studies have shown that a hyperglycemic environment promotes tumor cell growth and spread through various mechanisms. For example, hyperglycemia can enhance tumor cell proliferation and survival by activating insulin receptors and insulin-like growth factor receptors (IGF-IR) [21]. Hyperglycemia may also accelerate tumor progression by inhibiting the activity of key immune cells, such as natural killer (NK) cells, thereby allowing tumor cells to evade immune surveillance [22]. Diabetic patients often have other metabolic disorders and chronic diseases, which can exacerbate their condition and increase the risk of complications during treatment, leading to poor prognosis and affecting survival [23]. For RSC patients with diabetes, it is crucial to closely monitor and manage their blood glucose levels throughout the cancer treatment process to minimize the adverse effects of diabetes on the effectiveness of cancer therapy.
In our study, the tumor differentiation grade significantly influenced the survival of RSC patients. It is a crucial indicator of malignancy. Highly differentiated tumor cells resemble normal cells, proliferate and spread more slowly, and generally have a better prognosis. In contrast, moderately to poorly differentiated tumor cells have higher proliferative capacity and invasiveness, leading to poorer outcomes [24]. Tumor N stage reflects the extent of lymph node involvement by the tumor, which is a critical parameter for prognosis evaluation. A higher N stage indicates a broader range of tumor invasion and worse prognosis [25]. Lymph node metastasis is an early sign of tumor spread, often indicating a higher aggressiveness and risk of dissemination [26]. Distant metastasis is one of the key factors affecting the prognosis of RSC patients. It typically signifies that the tumor has spread beyond its primary site through the blood or lymphatic systems to other organs, significantly increasing the complexity and difficulty of treatment [27]. Our study shows that patients with distant metastasis have markedly poorer prognoses compared to those without. This is likely because patients with distant metastasis usually require more comprehensive and individualized treatment strategies, involving combinations of surgery, chemotherapy, and radiotherapy. Therefore, for RSC patients with low differentiation, high N staging, and distant metastases, more aggressive postoperative treatments may be required, including high-dose chemotherapy or combination therapy, to improve survival rates.
Anastomotic leakage (AL) is a common postoperative complication that significantly impacts the prognosis of RSC patients. It substantially increases the incidence of postoperative complications, prolongs hospitalization, and is associated with higher mortality rates. Additionally, AL increases the risk of infection, further affecting patient recovery and long-term survival [28]. Preventing and promptly managing anastomotic leakage is crucial for improving patient outcomes. In the surgical treatment of RSC, meticulous anastomotic techniques and strict postoperative monitoring measures are essential.
Notably, in our multivariate analysis, the P-value for perineural invasion was 0.05, indicating its marginal statistical significance. Perineural invasion, a pathological feature, has been shown to correlate with poor prognosis in various cancers [29,30]. Its presence typically signifies higher tumor aggressiveness and greater potential for spread, leading to higher recurrence rates and poorer survival outcomes. Additionally, perineural invasion may increase the risk of local recurrence and distant metastasis, further affecting overall prognosis. Although perineural invasion was not included in our model, its potential clinical significance and supporting literature suggest that future research should explore its impact on RSC patient survival and validate its value as a prognostic indicator. This could assist in more comprehensive assessment and management of RSC treatment plans, enhancing long-term survival rates.
Finally, although our study demonstrated that the XGBoost model performed the best in predicting the survival of RSC patients, there are some limitations that need to be discussed. Due to the constraints of a retrospective study, we were unable to conduct uniform genetic sequencing examinations on the patients, resulting in the absence of genetic variables in our clinical data. Additionally, our study data were derived from a single institution, which may have regional and sample size limitations. Future research could incorporate multicenter data to further optimize the XGBoost model or combine it with other techniques to enhance its predictive performance and application value. Although our internal validation has demonstrated high accuracy, external validation remains crucial. We plan to continue refining and upgrading our predictive model in subsequent studies to ensure its broader applicability and reliability.
Conclusion
We developed a prediction model based on the optimal machine learning algorithm, XGBoost, which can assist clinical decision-making and potentially extend the survival of patients with rectosigmoid junction cancer.
Supporting information
S1 Table. De-identified raw clinical dataset of RSC patients.
https://doi.org/10.1371/journal.pone.0319248.s001
(XLSX)
Acknowledgments
We sincerely thank all the patients who participated in the follow-up and all the researchers involved in this study.
References
- 1. Keum N, Giovannucci E. Global burden of colorectal cancer: emerging trends, risk factors and prevention strategies. Nat Rev Gastroenterol Hepatol. 2019;16(12):713–32. pmid:31455888
- 2. Dekker E, Tanis PJ, Vleugels JLA, Kasi PM, Wallace MB. Colorectal cancer. Lancet. 2019;394(10207):1467–80. pmid:31631858
- 3. Wadhwa V, Patel N, Grover D, Ali FS, Thosani N. Interventional gastroenterology in oncology. CA Cancer J Clin. 2023;73(3):286–319. pmid:36495087
- 4. Siegel RL, Giaquinto AN, Jemal A. Cancer statistics, 2024. CA Cancer J Clin. 2024;74(1):12–49. pmid:38230766
- 5. Bampton PA, Dinning PG, Kennedy ML, Lubowski DZ, deCarle D, Cook IJ. Spatial and temporal organization of pressure patterns throughout the unprepared colon during spontaneous defecation. Am J Gastroenterol. 2000;95(4):1027–35. pmid:10763955
- 6. Kamm MA, van der Sijp JR, Lennard-Jones JE. Colorectal and anal motility during defaecation. Lancet. 1992;339(8796):820. pmid:1347853
- 7. Massalou D, Moszkowicz D, Mariage D, Baqué P, Camuzard O, Bronsard N. Is it possible to give a single definition of the rectosigmoid junction? Surg Radiol Anat. 2018;40(4):431–8. pmid:29218384
- 8. Tan JY, Yeo Y-H, Ng WL, Fong ZV, Brady JT. How have US colorectal cancer mortality trends changed in the past 20 years?. Int J Cancer. 2024;155(3):493–500. pmid:38525799
- 9. Mattesen TB, Rasmussen MH, Sandoval J, Ongen H, Árnadóttir SS, Gladov J, et al. MethCORR modelling of methylomes from formalin-fixed paraffin-embedded tissue enables characterization and prognostication of colorectal cancer. Nat Commun. 2020;11(1):2025. pmid:32332866
- 10. Feng L, Liu Z, Li C, Li Z, Lou X, Shao L, et al. Development and validation of a radiopathomics model to predict pathological complete response to neoadjuvant chemoradiotherapy in locally advanced rectal cancer: a multicentre observational study. Lancet Digit Health. 2022;4(1):e8–17. pmid:34952679
- 11. Kinugasa T, Nagasu S, Murotani K, Mizobe T, Ochi T, Isobe T, et al. Analysis of risk factors for anastomotic leakage after lower rectal Cancer resection, including drain type: a retrospective single-center study. BMC Gastroenterol. 2020;20(1):315. pmid:32977772
- 12. Zhao B, Gabriel RA, Vaida F, Eisenstein S, Schnickel GT, Sicklick JK, et al. Using machine learning to construct nomograms for patients with metastatic colon cancer. Colorectal Dis. 2020;22(8):914–22. pmid:31991031
- 13. Farrugia G. Interstitial cells of Cajal in health and disease. Neurogastroenterol Motil. 2008;20(Suppl 1):54–63. pmid:18402642
- 14. Kapiteijn E, Marijnen CA, Nagtegaal ID, Putter H, Steup WH, Wiggers T, et al. Preoperative radiotherapy combined with total mesorectal excision for resectable rectal cancer. N Engl J Med. 2001;345(9):638–46. pmid:11547717
- 15. Sebag-Montefiore D, Stephens RJ, Steele R, Monson J, Grieve R, Khanna S, et al. Preoperative radiotherapy versus selective postoperative chemoradiotherapy in patients with rectal cancer (MRC CR07 and NCIC-CTG C016): a multicentre, randomised trial. Lancet. 2009;373(9666):811–20. pmid:19269519
- 16. DʼSouza N, de Neree Tot Babberich MPM, d’Hoore A, Tiret E, Xynos E, Beets-Tan RGH, et al. Definition of the Rectum: An International, Expert-based Delphi Consensus. Ann Surg. 2019;270(6):955–9. pmid:30973385
- 17. Podda M, Sylla P, Baiocchi G, Adamina M, Agnoletti V, Agresta F, et al. Multidisciplinary management of elderly patients with rectal cancer: recommendations from the SICG (Italian Society of Geriatric Surgery), SIFIPAC (Italian Society of Surgical Pathophysiology), SICE (Italian Society of Endoscopic Surgery and new technologies), and the WSES (World Society of Emergency Surgery) International Consensus Project. World J Emerg Surg. 2021;16(1):35.
- 18. Kawamura H, Morishima T, Sato A, Honda M, Miyashiro I. Effect of adjuvant chemotherapy on survival benefit in stage III colon cancer patients stratified by age: a Japanese real-world cohort study. BMC Cancer. 2020;20(1):19. pmid:31906959
- 19. Wang T, Jin J, Qian C, Lou J, Lin J, Xu A, et al. Estrogen/ER in anti-tumor immunity regulation to tumor cell and tumor microenvironment. Cancer Cell Int. 2021;21(1):295. pmid:34098945
- 20. Tuca A, Gallego R, Ghanem I, Gil-Raga M, Feliu J. Chemotherapy and Targeted Agents in the Treatment of Elderly Patients with Metastatic Colorectal Cancer. J Clin Med. 2020;9(12):4015. pmid:33322567
- 21. Olatunde A, Nigam M, Singh RK, Panwar AS, Lasisi A, Alhumaydhi FA, et al. Cancer and diabetes: the interlinking metabolic pathways and repurposing actions of antidiabetic drugs. Cancer Cell Int. 2021;21(1):499. pmid:34535145
- 22. Duan Q, Li H, Gao C, Zhao H, Wu S, Wu H, et al. High glucose promotes pancreatic cancer cells to escape from immune surveillance via AMPK-Bmi1-GATA2-MICA/B pathway. J Exp Clin Cancer Res. 2019;38(1):192. pmid:31088566
- 23. Ahmad E, Lim S, Lamptey R, Webb DR, Davies MJ. Type 2 diabetes. Lancet. 2022;400(10365):1803–20. pmid:36332637
- 24. Xu X-F, Xing H, Han J, Li Z-L, Lau W-Y, Zhou Y-H, et al. Risk Factors, Patterns, and Outcomes of Late Recurrence After Liver Resection for Hepatocellular Carcinoma: A Multicenter Study From China. JAMA Surg. 2019;154(3):209–17. pmid:30422241
- 25. Kuipers EJ, Grady WM, Lieberman D, Seufferlein T, Sung JJ, Boelens PG, et al. Colorectal cancer. Nat Rev Dis Primers. 2015;1:15065. pmid:27189416
- 26. Audisio RA, Papamichael D. Treatment of colorectal cancer in older patients. Nat Rev Gastroenterol Hepatol. 2012;9(12):716–25. pmid:23045000
- 27. Yachida S, Jones S, Bozic I, Antal T, Leary R, Fu B, et al. Distant metastasis occurs late during the genetic evolution of pancreatic cancer. Nature. 2010;467(7319):1114–7. pmid:20981102
- 28. Qi X, Liu M, Xu K, Gao P, Tan F, Yao Z, et al. Risk factors of symptomatic anastomotic leakage and its impacts on a long-term survival after laparoscopic low anterior resection for rectal cancer: a retrospective single-center study. World J Surg Oncol. 2021;19(1):187. pmid:34172053
- 29. Yang Y, Huang X, Sun J, Gao P, Song Y, Chen X, et al. Prognostic value of perineural invasion in colorectal cancer: a meta-analysis. J Gastrointest Surg. 2015;19(6):1113–22. pmid:25663635
- 30. Liebig C, Ayala G, Wilks JA, Berger DH, Albo D. Perineural invasion in cancer: a review of the literature. Cancer. 2009;115(15):3379–91. pmid:19484787