Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

AI-based prediction of recurrence after carbon ion radiotherapy for early stage non-small cell lung cancer

  • Yuhei Miyasaka,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft

    Affiliations Gunma University Heavy Ion Medical Center, Maebashi, Japan, Department of Radiation Oncology, Gunma University Graduate School of Medicine, Maebashi, Japan

  • Hanae Yoshida,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft

    Affiliations Gunma University Advanced Particle Beam Medical Science Joint Research Course (Hitachi, Ltd.), Maebashi, Japan, Healthcare Innovation Center, Hitachi, Ltd., Tokyo, Japan

  • Naoko Okano ,

    Roles Conceptualization, Methodology, Project administration, Resources, Supervision, Validation, Writing – review & editing

    okano.n@gunma-u.ac.jp

    Affiliations Gunma University Heavy Ion Medical Center, Maebashi, Japan, Department of Radiation Oncology, Gunma University Graduate School of Medicine, Maebashi, Japan, Gunma University Advanced Particle Beam Medical Science Joint Research Course (Hitachi, Ltd.), Maebashi, Japan

  • Hirofumi Shimada,

    Roles Data curation, Resources

    Affiliation Gunma University Heavy Ion Medical Center, Maebashi, Japan

  • Nobuteru Kubo,

    Roles Resources, Writing – review & editing

    Affiliations Gunma University Heavy Ion Medical Center, Maebashi, Japan, Department of Radiation Oncology, Gunma University Graduate School of Medicine, Maebashi, Japan

  • Hidemasa Kawamura,

    Roles Resources, Writing – review & editing

    Affiliations Gunma University Heavy Ion Medical Center, Maebashi, Japan, Department of Radiation Oncology, Gunma University Graduate School of Medicine, Maebashi, Japan

  • Tatsuya Ohno

    Roles Funding acquisition, Project administration, Supervision

    Affiliations Gunma University Heavy Ion Medical Center, Maebashi, Japan, Department of Radiation Oncology, Gunma University Graduate School of Medicine, Maebashi, Japan

Abstract

Lung cancer is a leading cause of cancer-related deaths. Carbon ion radiotherapy (CIRT) is a treatment modality for patients with inoperable conditions or who decline surgery, but there is room for research to identify patients at high risk of recurrence. The use of artificial intelligence (AI)-based predictive models in healthcare is growing, yet their application in predicting outcomes after CIRT in NSCLC remains unexplored. This study developed an AI prediction model using clinical and imaging data to identify patients at high risk of recurrence after CIRT for early stage NSCLC. Patients with untreated early stage peripheral NSCLC undergoing CIRT between June 2010 and December 2020 were included. Simulated computed tomography (CT) images and clinical data were used to develop a model to predict recurrence within 2 years of CIRT. The model was tested using 5-fold cross-validation and evaluated using receiver operating characteristic (ROC) analysis. The study involved 124 patients. Two-year overall survival, local control, and progression-free survival rates stood at 90.8%, 91.0%, and 69.4%, respectively. The three-axis plane method for CT image input was more predictive than the three-transverse plane or 3D methods. Our AI-based model using CT images and clinical data predicted recurrence within 2 years of CIRT with a median area under the ROC curve of 0.762. Gradient-weighted class activation mapping enhanced model interpretability. The multimodal AI-based model identifies early stage NSCLC patients at high recurrence risk after CIRT although external validation is required for its generalizability and robustness.

Introduction

Lung cancer is a leading cause of cancer-related deaths worldwide, accounting for 21% of the estimated cancer deaths in the US by 2022 [1]. Approximately 80% of lung cancer cases are non-small cell lung cancer (NSCLC). The standard treatment for early stage NSCLC is lobectomy with mediastinal lymph node resection or segmentectomy. Radiotherapy, such as stereotactic body radiotherapy (SBRT), is indicated for patients with inoperable disease or those who refuse surgery [2]. Carbon ion radiotherapy (CIRT), characterised by a steeper dose distribution and greater biological effectiveness than X-ray radiotherapy [3], is another treatment option for early stage NSCLC. Several clinical studies have shown favourable outcomes [410], even in patients with interstitial lung disease [1113]. Furthermore, although there was no randomized clinical trial comparing CIRT and other modalities, a previous study suggested that CIRT may offer a more favourable prognosis than SBRT in a historical cohort setting [14]. However, evidence shows that the progression-free survival (PFS) rate at 3 years after CIRT is approximately 60% [8,14], underscoring the need to identify high-risk populations.

Recently, the use of artificial intelligence (AI)-based prediction models in healthcare has rapidly increased [15]. This trend extends to radiotherapy for NSCLC [1619], with studies showing a favourable predictive performance for SBRT [1719]. Nemoto et al. reported a neural network model using clinical variables to predict cancer progression within 5 years after SBRT [18]. Furthermore, Zheng et al. developed an AI-based model utilising pretreatment CT images and clinical variables to predict overall survival within 2 years of radiotherapy [19]. To date, no AI-based predictions incorporating CT images have been reported for patients with early stage NSCLC treated with CIRT. Multimodal models can provide more accurate predictions of patient prognosis. Thus, we developed and tested a multimodal AI-based prediction model that can be helpful in identifying high-risk populations.

Materials and methods

Study design

The present study was designed and conducted according to the principles outlined in the Declaration of Helsinki and the Ethical Guidelines for Medical and Health Research Involving Human Subjects. The Gunma University Hospital Clinical Research Review Board approved this study [IRB2021-056(1947)]. The need for written informed consent was waived because of the historical and observational nature of this study, but all the participants or their relatives had the opportunity to opt-out.

We identified a series of patients who met the following criteria: (i) had pathologically or clinically diagnosed and untreated NSCLC, (ii) had clinical T1a–2bN0M0 (Union for International Cancer Control [UICC], 8th edition), (iii) had peripheral disease, (iv) were treated with CIRT at Gunma University Heavy Ion Medical Center between June 2010 and December 2020, and (v) were followed up for more than 2 years since the initiation of CIRT. Patients with multifocal lung cancer or lung cancer treated within 3 years were excluded. Information of individual participants were identified. Data were accessed between 9 December 2022–1 July 2023.

Simulation CT images with two body positions and clinical data were obtained from all participants and used to develop and test AI-based prediction models for any recurrence or metastasis within 2 years after CIRT.

Treatment procedures

The following examinations were performed before CIRT: blood cell and biochemical analyses, pulmonary function tests, CT of the thorax and abdomen, whole-brain magnetic resonance imaging or CT, and, in some cases, 18F‐fluorodeoxyglucose positron emission tomography (18F-FDG-PET). Based on these examinations, the appropriateness of CIRT was determined by the Institutional Cancer Board, which included pulmonologists, respiratory surgeons, radiation oncologists, and diagnostic radiologists. Details of the CIRT procedures have been reported elsewhere [8] and are shown in S1 File.

After treatment, the patients were followed up with blood tests, including measurement of tumour markers, and CT scans or chest radiographs. In principle, examinations were performed every 1–3 months for the first 3 years, and every 3–6 months thereafter. For patients who refused regular follow-ups at our hospital, their clinical course was obtained via telephone, letters, and patient referral documents.

Development and testing of predictive models

An outline of the predictive model is shown in Fig 1. A machine learning framework, PyTorch, was used to develop and test the prediction models [20].

thumbnail
Fig 1. Illustration of development of AI-based models.

Image features were obtained from simulation CT images (a). The image and clinical features were used to develop an integrated model with XGBoost (b).

https://doi.org/10.1371/journal.pone.0342481.g001

The prediction models included CT images represented in three transverse planes, three axis planes, and three-dimensional (3D) images. To decide which model to include in the integrated model, we performed 5-fold cross-validation. The cohort was randomly divided into five equal groups; each group consisted of 24 or 25 patients, in which three groups were used as the training set, one group as the validation set, and the remaining group as the test set. To avoid obtaining contingent results, we repeated the 5-fold cross-validation five times and evaluated the median values. In addition, focus maps showing where the deep learning model was focused were generated using gradient-weighted class activation mapping (Grad-CAM) [21].

The integrated predictive model included the following clinical factors: age at the initiation of CIRT, sex, Eastern Cooperative Oncology Group performance status score, operability, Brinkman index, T stage (UICC, 8th edition), maximum SUV of 18F-FDG-PET binarized with a threshold of 2.5, and the prescribed dose. These factors and the aforementioned image features were used to develop a prediction model with extreme gradient boosting (XGBoost) [22]. XGBoost incorporates regularization, tree depth control, subsampling, and learning-rate shrinkage, which jointly mitigate overfitting and stabilize performance. Additionally, XGBoost can construct a decision tree by first generating a decision tree (learner) using only data with no missing values, and then assigning the data with missing values to the side with the best score for each segmentation. This feature enables the generation of a prediction model without the imputation of missing values, even for data containing missing values [22]. Hyperparameter tuning for model development was performed using Optuna (S1 Table) [23]. A calibration plot was generated to assess the agreement between predicted probabilities and observed outcomes.

Statistical analyses

Overall survival (OS), PFS, and local control (LC) rates were estimated from the first day of irradiation using the Kaplan–Meier method. Receiver operating characteristic (ROC) analysis was performed to evaluate the performance of the prediction models and areas under the ROC curves (AUCs). The AUC and 95% confidence interval (CI) were calculated using the DeLong method. All analyses and visualisations were performed using R 3.6.2 (R Core Team, Vienna, Austria) [24].

Results

Clinical outcomes

A total of 124 patients met the inclusion criteria and were included in the study. The patient characteristics are summarised in Table 1. The median follow-up period was 42.3 months (range, 3.5–137). The OS rates were 90.8% (95% CI, 85.7–96.1) at 2 years and 70.6% (95% CI, 61.8–80.6) at 5 years. LC was 91.0% (95% CI, 85.8–96.5) at 2 years and 89.7% (95% CI, 84.0–95.7) at 5 years. PFS was 69.4% (95% CI, 61.7–78.0) at 2 years and 54.6% (95% CI, 45.7–65.2) at 5 years (Fig 2). There were 34 disease progression events within 2 years of CIRT, including 10 local recurrences, 17 regional lymph node metastases, and 23 distant metastases. Grades 2 and 3 radiation pneumonitis were observed in two (1.6%) patients. No patient developed grade 4 or higher adverse events.

thumbnail
Fig 2. Kaplan-Meir curves for the entire cohort (n = 124).

Red, green, and blue lines show LC, OS, and PFS, respectively. Abbreviations: LC, local control; OS, overall survival; PFS, progression-free survival.

https://doi.org/10.1371/journal.pone.0342481.g002

Development of the prediction model

The details of the convolution neural networks are shown in S1 Fig and S2 Table. In the development of models using CT images, we tested the following three models: the three-transverse plane model, three-axis plane model, and the 3D image model. The 5-fold cross-validation, repeated five times, demonstrated that the median AUC values in the test sets were 0.714 in the three-transverse plane model, 0.746 in the three-axis plane model, and 0.706 in the 3D image model (S2 Fig).

Consequently, we incorporated a model based on three-axis planes into the integrated model. The integrated model achieved a median AUC of 0.762 (95% CI, 0.487–1) (Fig 3). The calibration curve showed a moderate deviation from the ideal line, with a tendency to underestimate risk at lower predicted values and overestimate risk at higher predicted values (S3 Fig). The focus maps were obtained using Grad-CAM (Fig 4).

thumbnail
Fig 3. ROC curves of the integrated model using clinical data and CT images by three axes planes.

CI, confidence interval.

https://doi.org/10.1371/journal.pone.0342481.g003

thumbnail
Fig 4. Input and Grad-CAM images.

Input images in (a) RGB and (b) greyscale. (c) Grad-CAM image and (d) the fusional image of (b) with (c). Lines indicate contours of gross target volume in each plane.

https://doi.org/10.1371/journal.pone.0342481.g004

Discussion

In this study, we developed and tested an AI-based recurrence prediction model in patients with newly diagnosed early stage peripheral NSCLC treated with CIRT at 52.8 or 60.0 Gy (RBE) in four fractions. With respect to the CT input methods, the three-axis plane model was slightly better than the three-transverse plane and 3D input models. Furthermore, the integrated model using CT images and clinical data predicted recurrence within 2 years, with a median AUC of 0.762. To our knowledge, this is the first study to report AI-based recurrence prediction models for prognosis after CIRT in patients with early stage NSCLC.

Several recent studies have reported AI-based prediction models for outcomes after SBRT [16,18,19]. Nemoto et al. developed and tested a prediction model for cancer progression in the first 5 years after SBRT for Tis–4N0M0 NSCLC using clinical data and achieved an AUC of 0.72 in the internal validation cohort and 0.70 in the external validation cohort [18]. Zheng et al. reported an AI-based model for predicting 2-year OS after SBRT in patients with stage I-IIIA NSCLC. Their model using pretreatment CT images and clinical features, including age, sex, T stage, clinical stage, and BED10, achieved a median AUC of 0.76 in the internal validation cohort and 0.64 in the external cohort.

Our AI-based models aimed to predict any recurrence within two years after CIRT, intentionally excluding the impact of other treatment modalities, such as systemic therapy for recurrent disease. This approach may be beneficial for patient decision-making. Our model’s performance, with a median AUC of 0.762, was comparable despite having a smaller and more selective population. In addition, the calibration curve confirmed the validity of our model. We believe that, in the future, it would be beneficial to develop a predictive model that can recommend the most appropriate treatment option.

Employing a focus map to highlight the significant features is useful for understanding AI-based prediction models. We applied the widely used Grad-CAM technique to create a focus map. Grad-CAM generates a localisation map by leveraging the gradients associated with a target concept as they propagate back to the last convolutional layer, highlighting the image areas critical for predicting the concept [21]. Fig 4 shows that the focus map identified hotspots within a solid nodule. Although there is not quantitative analysis, it were observed in the most of cases we evaluated, supporting the rationality of the model. Interpretability becomes crucial, especially when an AI-based model is developed from a cohort of a limited size.

Limitations

This study had several limitations. First, our cohort was single-institutional and relatively small, and the robustness of the performance of our model was not sufficiently evaluated because this study did not include an external cohort. It should be noted that model performance generally tends to be lower in an external cohort. Additionally, the relatively small dataset would partially cause the miscalibration. Future studies are warranted to externally test our model. Second, as we included patients over a decade, advances in medication and supportive care would have influence on outcomes. Third, there may be room for improvement in model performance because our dataset did not include information of driver mutations and programmed cell death ligand 1(PD-L1) expression. Fourth, our study included approximately 39% of patients without a pathological diagnosis. While it means our predictive model is applicable for these patients, the impact of pathological diagnosis on recurrence could not be sufficiently considered. Fifth, as our model predicts any recurrence, factors specifically associated with LC, lymph node metastasis, or distant metastasis are unknown. Finally, it is unknown whether our model can be utilised for patients with centrally located, multifocal, advanced, and previously treated NSCLC, as our study did not include such patients.

Conclusion

We developed and tested a multimodal AI-based model that predicted recurrence within 2 years of carbon ion radiotherapy in newly diagnosed, early stage, peripheral NSCLC patients in a single institutional cohort. Although external validation is required, the predictive performance of our model is feasible.

Supporting information

S1 Fig. Schema of convolutional neural network.

https://doi.org/10.1371/journal.pone.0342481.s001

(PDF)

S2 Fig. ROC curves of CNN models using CT images in the form of (A) three transverse planes, (B) three axes planes, and (C) three-dimensional images.

https://doi.org/10.1371/journal.pone.0342481.s002

(PDF)

S3 Fig. The calibration curves of the prediction model which had the median AUC.

https://doi.org/10.1371/journal.pone.0342481.s003

(PDF)

S1 Table. Details of hyperparameter tuning with optuna.

https://doi.org/10.1371/journal.pone.0342481.s004

(PDF)

S2 Table. Main parameters of convolutional neural network.

https://doi.org/10.1371/journal.pone.0342481.s005

(PDF)

References

  1. 1. Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer statistics, 2022. CA Cancer J Clin. 2022;72(1):7–33. pmid:35020204
  2. 2. National Cancer Comprehensive Network. NCCN Clinical Practice Guidelines in Oncology Non-Small Cell Lung Cancer Version 3. 2023. [cited 10 Jul 2023]. Available from: https://www.nccn.org/professionals/physician_gls/pdf/nscl.pdf
  3. 3. Kanai T, Endo M, Minohara S, Miyahara N, Koyama-ito H, Tomura H, et al. Biophysical characteristics of HIMAC clinical irradiation system for heavy-ion radiation therapy. Int J Radiat Oncol Biol Phys. 1999;44(1):201–10. pmid:10219815
  4. 4. Koto M, Miyamoto T, Yamamoto N, Nishimura H, Yamada S, Tsujii H. Local control and recurrence of stage I non-small cell lung cancer after carbon ion radiotherapy. Radiother Oncol. 2004;71(2):147–56. pmid:15110447
  5. 5. Miyamoto T, Baba M, Sugane T, Nakajima M, Yashiro T, Kagei K, et al. Carbon ion radiotherapy for stage I non-small cell lung cancer using a regimen of four fractions during 1 week. J Thorac Oncol. 2007;2(10):916–26. pmid:17909354
  6. 6. Miyamoto T, Baba M, Yamamoto N, Koto M, Sugawara T, Yashiro T, et al. Curative treatment of Stage I non-small-cell lung cancer with carbon ion beams using a hypofractionated regimen. Int J Radiat Oncol Biol Phys. 2007;67(3):750–8. pmid:17293232
  7. 7. Yamamoto N, Miyamoto T, Nakajima M, Karube M, Hayashi K, Tsuji H, et al. A dose escalation clinical trial of single-fraction carbon ion radiotherapy for peripheral stage I non–small cell lung cancer. J Thorac Oncol. 2017;12(4):673–80.
  8. 8. Saitoh J-I, Shirai K, Mizukami T, Abe T, Ebara T, Ohno T, et al. Hypofractionated carbon-ion radiotherapy for stage I peripheral nonsmall cell lung cancer (GUNMA0701): Prospective phase II study. Cancer Med. 2019;8(15):6644–50. pmid:31532584
  9. 9. Kubo N, Suefuji H, Nakajima M, Tokumaru S, Okano N, Yoshida D, et al. Clinical results of carbon ion radiotherapy for inoperable stage I non-small cell lung cancer: A Japanese national registry study (J-CROS-LUNG). Radiother Oncol. 2023;183:109640. pmid:36990390
  10. 10. Kubo N, Suefuji H, Nakajima M, Tokumaru S, Okano N, Yoshida D, et al. Five-year survival outcomes after carbon-ion radiotherapy for operable stage I NSCLC: A Japanese National Registry Study (J-CROS-LUNG). J Thorac Oncol. 2024;19(3):491–9. pmid:37924974
  11. 11. Nakajima M, Yamamoto N, Hayashi K, Karube M, Ebner DK, Takahashi W, et al. Carbon-ion radiotherapy for non-small cell lung cancer with interstitial lung disease: a retrospective analysis. Radiat Oncol. 2017;12:1–9.
  12. 12. Okano N, Kubo N, Yamaguchi K, Kouno S, Miyasaka Y, Mizukami T, et al. Efficacy and safety of carbon-ion radiotherapy for stage I non-small cell lung cancer with coexisting interstitial lung disease. Cancers (Basel). 2021;13(16):4204. pmid:34439358
  13. 13. Okano N, Suefuji H, Nakajima M, Tokumaru S, Kubo N, Yoshida D, et al. Clinical results of carbon-ion radiotherapy for stage I non-small cell lung cancer with concomitant interstitial lung disease: a Japanese national registry study (J-CROS-LUNG). J Radiat Res. 2023;64(Supplement_1):i2–7. pmid:37036751
  14. 14. Miyasaka Y, Komatsu S, Abe T, Kubo N, Okano N, Shibuya K, et al. Comparison of oncologic outcomes between carbon ion radiotherapy and stereotactic body radiotherapy for early-stage non-small cell lung cancer. Cancers (Basel). 2021;13(2):176. pmid:33419147
  15. 15. Rajpurkar P, Chen E, Banerjee O, Topol EJ. AI in health and medicine. Nat Med. 2022;28(1):31–8. pmid:35058619
  16. 16. Xu Y, Hosny A, Zeleznik R, Parmar C, Coroller T, Franco I, et al. Deep learning predicts lung cancer treatment response from serial medical imaging. Clin Cancer Res. 2019;25(11):3266–75. pmid:31010833
  17. 17. Lou B, Doken S, Zhuang T, Wingerter D, Gidwani M, Mistry N, et al. An image-based deep learning framework for individualizing radiotherapy dose. Lancet Digit Health. 2019;1(3):e136–47. pmid:31448366
  18. 18. Nemoto T, Takeda A, Matsuo Y, Kishi N, Eriguchi T, Kunieda E, et al. Applying artificial neural networks to develop a decision support tool for Tis-4N0M0 non-small-cell lung cancer treated with stereotactic body radiotherapy. JCO Clin Cancer Inform. 2022;6:e2100176. pmid:35749675
  19. 19. Zheng S, Guo J, Langendijk JA, Both S, Veldhuis RNJ, Oudkerk M, et al. Survival prediction for stage I-IIIA non-small cell lung cancer using deep learning. Radiother Oncol. 2023;180:109483. pmid:36690302
  20. 20. PyTorch. [cited 25 Aug 2023]. Available from: https://pytorch.org/
  21. 21. Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D. Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. Int J Comput Vis. 2019;128(2):336–59.
  22. 22. Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, NY, USA: Association for Computing Machinery; 2016. pp. 785–94.
  23. 23. Akiba T, Sano S, Yanase T, Ohta T, Koyama M. Optuna: A Next-generation Hyperparameter Optimization Framework. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. New York, NY, USA: Association for Computing Machinery; 2019. pp. 2623–31.
  24. 24. RCore Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. 2019. Available from: https://www.r-project.org/