The aim of this study was to develop a multivariate logistic regression model with least absolute shrinkage and selection operator (LASSO) to make valid predictions about the incidence of moderate-to-severe patient-rated xerostomia among head and neck cancer (HNC) patients treated with IMRT.
Methods and Materials
Quality of life questionnaire datasets from 206 patients with HNC were analyzed. The European Organization for Research and Treatment of Cancer QLQ-H&N35 and QLQ-C30 questionnaires were used as the endpoint evaluation. The primary endpoint (grade 3+ xerostomia) was defined as moderate-to-severe xerostomia at 3 (XER3m) and 12 months (XER12m) after the completion of IMRT. Normal tissue complication probability (NTCP) models were developed. The optimal and suboptimal numbers of prognostic factors for a multivariate logistic regression model were determined using the LASSO with bootstrapping technique. Statistical analysis was performed using the scaled Brier score, Nagelkerke R2, chi-squared test, Omnibus, Hosmer-Lemeshow test, and the AUC.
Eight prognostic factors were selected by LASSO for the 3-month time point: Dmean-c, Dmean-i, age, financial status, T stage, AJCC stage, smoking, and education. Nine prognostic factors were selected for the 12-month time point: Dmean-i, education, Dmean-c, smoking, T stage, baseline xerostomia, alcohol abuse, family history, and node classification. In the selection of the suboptimal number of prognostic factors by LASSO, three suboptimal prognostic factors were fine-tuned by Hosmer-Lemeshow test and AUC, i.e., Dmean-c, Dmean-i, and age for the 3-month time point. Five suboptimal prognostic factors were also selected for the 12-month time point, i.e., Dmean-i, education, Dmean-c, smoking, and T stage. The overall performance for both time points of the NTCP model in terms of scaled Brier score, Omnibus, and Nagelkerke R2 was satisfactory and corresponded well with the expected values.
Citation: Lee T-F, Chao P-J, Ting H-M, Chang L, Huang Y-J, Wu J-M, et al. (2014) Using Multivariate Regression Model with Least Absolute Shrinkage and Selection Operator (LASSO) to Predict the Incidence of Xerostomia after Intensity-Modulated Radiotherapy for Head and Neck Cancer. PLoS ONE 9(2): e89700. https://doi.org/10.1371/journal.pone.0089700
Editor: Nils Cordes, Dresden University of Technology, Germany
Received: July 31, 2013; Accepted: January 22, 2014; Published: February 28, 2014
Copyright: © 2014 Lee et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This study was supported financially, in part, by grants from the National Science Council (NSC) of the Executive Yuan of the Republic of China, (NSC-101-2221-E-151-007-MY3, and NSC-102-2221-E-182A-002). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Head and neck cancers (HNC) are a leading cause of cancer mortality in Taiwan. Radiotherapy (RT) plays an important role in the treatment of HNC. However, xerostomia is a common complication in patients with HNC after radiotherapy. According to the Late Effects of Normal Tissues – Subjective, Objective, Management, Analytic (LENT-SOMA) criteria, severe xerostomia is defined as long-term salivary function <25% of the pre-RT baseline value , . Based on the quantitative analysis of normal tissue effects in the clinic (QUANTEC) guideline to limit the incidence of severe xerostomia to below 20%, at least one parotid gland should receive a mean dose of ≤20 Gy, or both parotid glands should receive a mean dose of ≤25 Gy , . The patient-rated quality of life (QoL) questionnaire (QLQ-C30) and the xerostomia-specific QoL questionnaire (QLQ-H&N35) have been shown to allow a reliable assessment of the relationships between QoL and salivary function and xerostomia in patients receiving radiotherapy –. The normal tissue complication probability (NTCP) model was developed using either a univariate or multivariate logistic regression model to predict the incidence of xerostomia. However, the development of xerostomia as reported by patients most likely depends on a variety of prognostic factors –, , . Some variables such as clinical and treatment-related factors that may have important effects on the risk of radiation-induced complications need to be taken into consideration. Therefore, one goal of this study was to develop predictive models for patient-rated xerostomia, taking into account dose distributions in parotid glands as well as other potential clinical and treatment-related prognostic factors.
Developing a multivariate logistic regression model requires knowing the optimal number of prognostic factors to include. Many investigators used the log likelihood (LL), average likelihood, Akaike information criterion (AIC), and Bayesian information criterion (BIC) to deal with this problem . Xu et al. – introduced the least absolute shrinkage and selection operator (LASSO) and Bayesian model averaging (BMA) to build NTCP models of xerostomia after three-dimensional conformal radiation therapy (3D-CRT) for HNC. However, the BMA needs a priori information to build a fair predictive model. The LASSO is based on shrinkage estimation and has been widely used in the statistical field –. The advantages of LASSO include: 1) a smaller mean squared error (MSE) than conventional methods; 2) it handles the multicollinearity problem; 3) overall variable selection; and 4) coefficients shrink –. Being easy to implement is another of the merits that attract users. Xu et al. recommended the LASSO method for multivariate logistic regression NTCP modeling .
Beetz et al. showed that 3D-CRT-based models for patient-rated xerostomia among HNC patients treated with primary RT turned out to be less valid for patients treated with intensity-modulated radiotherapy (IMRT), and that the 3D-CRT NTCP models cannot be used for IMRT cohorts . The main message was that models developed in a population treated with a specific technique cannot be generalized and extrapolated to a population treated with another technique without external validation. Therefore, the purpose of this study was to develop a multivariate logistic regression model with LASSO to make valid predictions about the incidence of patient-rated xerostomia among HNC patients treated with IMRT at 3 and 12 months after treatment.
Materials and Methods
We aimed to develop a multivariate logistic regression NTCP model with LASSO to make valid predictions about the risk of moderate-to-severe patient-rated xerostomia using QoL datasets. In total, 206 patients with HNC were enrolled. All participants were treated with IMRT at the Kaohsiung Chang Gung Memorial Hospital between May 2007 and October 2010. The characteristics of the 206 patients with HNC are presented in Table 1. QoL questionnaires completed by patients prior to treatment and at 3 and 12 months after treatment were analyzed and used for multivariate logistic regression NTCP model response fitting. This study was approved by the Chang Gung medical foundation institutional review board (99-1420B, 96-1231B) and all participants gave written informed consent.
Each patient was immobilized using a commercially available thermoplastic mask and/or an individually customized bite block. Computed tomographic images (2.5-mm slice thickness, 512×512 pixels/slice) were acquired from the top of the vertex to the level of the carina. Both parotid glands were delineated by a radiation oncologist. Dose distributions were calculated and dose–volume histograms (DVHs) were generated separately for each parotid gland, enabling separate analysis. Two IMRT techniques were used: simultaneous integrated boost (SIB) and sequential mode (SQM). The prescribed total dose ranged from 54.0 to 77.4 Gy (median, 70.0 Gy).
IMRT was delivered by the computer-controlled auto-sequencing segment or the dynamic multileaf collimator of a linear accelerator [Varian Clinac 21 EX (Varian Medical Systems, Palo Alto, CA) or Elekta Precise (Elekta, Crawley, UK)] as described previously , aiming to spare the parotid glands (predominantly contralateral side) while treating the primary targets and lymph nodes at risk. The prescribed doses were 54.0–77.4 Gy (median, 70.0 Gy) to the macroscopic tumor planning target volume (PTV1), 48.0–72.0 Gy (median, 61.2 Gy) to the resected tumor bed planning target volume (PTV2), and 41.4 Gy to the subclinical disease planning target volume (PTV3), at 1.6–2.12 Gy per fraction for SIB and 1.8–2.0 Gy per fraction for SQM and five fractions per week.
According to the Radiation Therapy Oncology Group 0225 and 0615 trials, the planning objectives for PTVs were a minimum dose >95%, and no more than 5% of any PTV1 received ≥110% of the prescribed dose. The structural constraints for the parotid gland were a mean dose ≤26 Gy or V30Gy≤50%; a mean dose ≤40 Gy was used in the oral cavity excluding the PTV. Mean DVH values were calculated for the parotid glands for every patient. All data were based on the DVHs obtained using Pinnacle3® (Philips, Fitchburg, WI) with a bin size resolution of 0.01 Gy. The resolution of dose calculation was 2.5 mm for all IMRT plans.
A prospective survey of QoL using the European Organization for Research and Treatment of Cancer (EORTC) C30 and H&N35 QoL questionnaires (QLQ-C30 and QLQ-H&N35) was performed on 206 survivors of HNC. The patients were asked to complete the questionnaire prior to treatment and 3 months, 6 months, 1 year, and 2 years after IMRT. For the purposes of this analysis, the 3-month (n = 206) and 12-month (n = 128) follow-up time points were used. Chinese versions of the EORTC QLQ-C30 and QLQ-H&N35 questionnaires were obtained from the Quality of Life Unit, EORTC Data Center, Brussels, Belgium . For each item on the EORTC QLQ-C30 and QLQ-H&N35 questionnaires, the following four-point Likert scale was used: none (0), a little (33), quite a lot (66), and a lot (100). All QoL scores are given in the text. A high score on the functional or global QoL scale represents a relatively high/healthy level of functioning or global QoL, whereas a high score on the symptom scale represents the presence of a symptom or problem. The EORTC QLQ-H&N35 questionnaire was used to evaluate xerostomia (i.e., the analytical endpoint). Grade 3+ xerostomia was defined as moderate (66) to severe (100) xerostomia 3 and 12 months after the completion of RT; this corresponds to the two highest scores on the four-point Likert scale. As we were primarily interested in grade 3+ xerostomia induced by RT itself, patients with moderate–to-severe xerostomia at baseline were excluded from further analysis –, . The primary endpoint (grade 3+ xerostomia) was defined as moderate-to-severe xerostomia at 3 (XER3m) and 12 months (XER12m) after the completion of IMRT, and excluded patients with grade 3+ xerostomia at baseline.
NTCP models for moderate-to-severe patient-rated xerostomia were developed using a multivariate logistic regression analysis with an extended bootstrapping technique as described by El Naqa et al.  and Beetz et al. –. For each patient, predictive values were calculated for each set of prognostic factors based on the multivariate logistic regression coefficients according to the formula:(1)in which n is the number of prognostic factors in the built model; variables xi represent different prognostic factors; and βi are the corresponding regression coefficients.
For each patient, 16 candidate prognostic factors were initially included in the variable selection procedure. The candidates included 14 clinical and two dosimetric factors. The clinical candidate factors were chemotherapy (C/T), treatment mode (SIB or SQM), gender, age, AJCC stage, baseline xerostomia, T stage, node classification, education, family history, financial status, marriage, smoking, and alcohol abuse.
The dosimetric candidate factors were the mean dose given to the contralateral parotid gland (Dmean-c) and the ipsilateral parotid gland (Dmean-i) (Gy). We excluded Vx values, which were previously found to be highly correlated with each other ; Dmean-c and Dmean-i were the only two DVH-parameters in this study. The candidate prognostic factors are listed in Table 2. We used the LASSO process along with the Hosmer-Lemeshow test to select the optimal and suboptimal numbers of potential prognostic factors for the predictive model.
The LASSO was first proposed by Tibshirani in 1996, the details of which can be found in . It uses the following equation to shrink the coefficients and select the prognostic factors:(2)where d is the number of variables selected, and t are tuning parameters that control the degree of penalty, which can be determined by cross-validation , . Based on the nature of the constraint, LASSO tends to produce some coefficients as zero, and it improves the overall prediction accuracy by allowing a small amount of bias to reduce the variance of the predicted values. The selection of the optimal number of prognostic factors was performed by cross-validation in this study.
To account for the overfitting problem in our study, two datasets were used, i.e., a training set and a test set; a model was built based on a training set and fitted to the training set itself and also tested with a test set. We used nested 10-fold cross-validation to obtain the best prognostic factor subsets.
To generalize the use of the models, a compact model can be generated with a small suboptimal number of prognostic factors for more user-friendly handling; however, accuracy has to remain at a certain level. For the selection of a suboptimal number of prognostic factors, we first processed the LASSO to rank the correlations for the potential prognostic factors, i.e., the DVH parameters and patients' clinical data. In order to reduce the number of prognostic factors in the model, the suboptimal number of prognostic factors for a multivariate logistic regression model was determined using a bootstrapping method with the Hosmer-Lemeshow test. We set the Hosmer-Lemeshow test to show a significant agreement between predicted risk and observed outcome when the Hosmer-Lemeshow test value was ≥0.05. The first number needed for the model was recorded when the value was ≥0.05, after which we put the selected prognostic factors into the model to calculate the area under the receiver operating characteristic curve (AUC). We carried on increasing the number of prognostic factors with a higher Hosmer-Lemeshow test value and stopped when the system AUC did not increase significantly (<5%). The Hosmer-Lemeshow test revealed the accuracy of the model. The AUC was also used as a measure to evaluate prediction performance and it described the discriminatory ability of the model. The prognostic factors selected were used for the definitive NTCP model for moderate-to-severe patient-rated xerostomia.
After selecting the prognostic factors with optimal or suboptimal performance, odds ratios (OR) and 95% confidence intervals (95% CI) were calculated for these factors. Model performance was described using different validation tools –. The system's performance was evaluated using the AUC, scaled Brier score, Nagelkerke R2, Omnibus, and Hosmer-Lemeshow test –. Statistical analyses were performed using SPSS 19.0 (SPSS, Chicago, IL, USA).
Beetz et al. used the forward selection method to develop NTCP models for moderate-to-severe patient-rated xerostomia using a multivariate logistic regression analysis with an extended bootstrapping technique and forward variable selection , . We compared their method with the LASSO mentioned above. For every model order, the likelihood of predictions was calculated and the number of selected variables with the highest likelihood was selected for the definitive predictive model for patient-rated xerostomia. Two thousand bootstraps were used for each analysis. The details can be found in , .
All patients completed QoL questionnaires at three time points (before RT, during RT, and at 3 months after RT). In addition, 128 patients completed a QoL questionnaire 12 months after IMRT. Of the 206 patients assessed at the 3-month time point, 21 who were already suffering from moderate-to-severe xerostomia at baseline were excluded, leaving 185 patients for analysis. Of the 128 patients assessed at the 12-month time point, 11 who were already suffering from moderate-to-severe xerostomia at baseline were excluded from further analysis, leaving 117 patients for analysis.
At 3 months after treatment, 42.2% of the patients reported moderate- to-severe xerostomia. After 12 months, 33.6% reported moderate-to-severe xerostomia (Table 1).
The following steps were used to select the optimal and suboptimal numbers of prognostic factors. First, LASSO ranked how strongly the factors correlated, then chose the suboptimal number of prognostic factors using the Hosmer-Lemeshow test and AUC. LASSO of bootstrap prediction in the multivariate logistic regression analysis ranked the prognostic factors in descending order, as shown in Table 3 (the LASSO shrinking path diagrams are shown in Figs. 1a and b) for both the 3- and 12-month time points.
Abbreviation- LASSO: least absolute shrinkage and selection operator; Definition of prognostic factors: Same as Table 2.
Eight prognostic factors were selected by LASSO for the 3-month time point: Dmean-c, Dmean-i, age, financial status, T stage, AJCC stage, smoking, and education. For the 12-month time point, nine prognostic factors were selected: Dmean-i, education, Dmean-c, smoking, T stage, baseline xerostomia (none vs. a little), alcohol abuse, family history, and node classification.
For the 3-month time point, three suboptimal prognostic factors were selected by the Hosmer-Lemeshow test and AUC. We set the Hosmer-Lemeshow test to show a significant agreement between predicted risk and observed outcome when the Hosmer-Lemeshow test value was ≥0.05. Increasing the number of variables to four or eight did not further increase the AUC of the model compared with the three-factor model, i.e., Dmean-c, Dmean-i, and age.
For the 12-month time point, five suboptimal prognostic factors were selected by the Hosmer-Lemeshow test and AUC, i.e., Dmean-i, education, Dmean-c, smoking and T stage. Increasing the number of variables to five or seven did not further increase the AUC of the model compared with the five-factor model.
All corresponding coefficients of the multivariate logistic regression NTCP models are shown in Tables 4 and 5. The NTCP value for each individual patient can be calculated using the following logistic regression formula:
For the 3-month time point, the optimal model was where S = −22.283+(Dmean-c*0.102)+(Dmean-i*0.102)+(Age*0.292)+(financial status*corresponding coefficient)+(T stage*corresponding coefficient)+(AJCC stage*corresponding coefficient)+(smoking*0.187)+(education*corresponding coefficient). The suboptimal model was where S = −21.298+(Dmean-c*0.097)+(Dmean-i*0.101)+(Age*0.285).
For the 12-month time point, the NTCP value for each individual patient can be calculated by the above logistic regression formula, and where S = −12.045+(Dmean-i*0.176)+(education*corresponding coefficient)+(Dmean-c*0.121)+(smoking*0.993)+(T stage*corresponding coefficient)+(baseline xerostomia*0.898)+(alcohol abuse*0.913)+(family history*−0.871)+(node classification*1.034). The suboptimal model was where S = −8.028+(Dmean-i*0.155)+(education*corresponding coefficient)+(Dmean-c*0.067)+(smoking*1.143)+(T stage*corresponding coefficient).
The overall performance for both time points of the NTCP model for patient-rated xerostomia in terms of scaled Brier score, Omnibus, and Nagelkerke R2 was satisfactory and corresponded well with the expected values (Table 6). The AUC for the suboptimal model was 0.84 (95% CI 0.80–0.86) and 0.84 (95% CI 0.81–0.86) for 3 and 12 months, respectively. The AUC for the optimal model was only slightly improved (less than 5%) compared with the suboptimal model; this was 0.86 (95% CI 0.83–0.88) and 0.87 (95% CI 0.84–0.89) for 3 and 12 months, respectively. Finally, the Hosmer-Lemeshow test showed significant agreement between predicted risk and observed outcome for both LASSO optimal and suboptimal models (Table 6). In our population, the DVH diagrams for the mean doses delivered to the ipsilateral and contralateral parotids are shown in Fig. 2 for both the 3- and 12-month time points.
Abbreviation- DVH: dose–volume histogram; XER3m: xerostomia at 3-month time point; XER12m: xerostomia at 12-month time point.
For the likelihood forward selection, the numbers of prognostic factors selected were 9 and 11 for the 3- and 12-month time points, respectively. The AUC of the forward selection model was similar to that achieved by LASSO: 0.85 (95% CI 0.83–0.88) and 0.86 (95% CI 0.83–0.88) for 3 and 12 months, respectively.
Early NTCP models, like the Lyman-Kutcher-Burman and the univariate logistic regression model , are based on information derived from DVHs generated from dose distributions in the target volumes and the surrounding organs at risk. For example, the mean dose received in the parotid glands is the only prognostic factor for xerostomia in univariate models. In this multivariate model study, not only the mean doses to the contralateral/ipsilateral parotid glands were the principal components causing xerostomia; but also age, education, and smoking were also to be the effected components.
For the IMRT planning goal, the mean dose to each parotid gland should be kept as low as possible, consistent with the desired clinical target volume coverage. Sparing at least one parotid gland appears to eliminate complications , . Severe xerostomia can usually be avoided if at least one parotid gland has been given a mean dose ≤20 Gy or if both glands have been given a mean dose ≤25 Gy. A lower parotid mean dose usually results in better function with respect to the effects on patients' QoL , , .
In our population, the average mean doses to the ipsilateral and contralateral parotid were 36.3 Gy vs. 30.6 Gy, respectively. These doses were more or less similar to those observed in previous IMRT treatment reports, such as Beetz et al. , who found that they were 35.2 Gy and 28.0 Gy to the ipsilateral and contralateral parotid gland, respectively; Nutting et al.  reported that the average mean doses to the ipsilateral and contralateral parotid glands were 47.6 Gy and 25.4 Gy, respectively. In our cohort, these average dose levels were much higher in the subset of nasopharyngeal cancer patients: 41.6 Gy and 38.7 Gy to the ipsilateral and contralateral gland, respectively. A possible explanation for this is that the overlap of the PTV with parts of the parotid glands in nasopharyngeal cancer patients makes it more difficult to spare the parotid glands with IMRT . This implies that more effective new technologies that could spare the parotid glands without compromising target volume coverage need to be investigated.
The current analysis, for the 3-month time point, showed that the mean dose to the contralateral parotid gland was the most important variable determining acute xerostomia. The three-factor model containing the mean dose to the contralateral gland and the ipsilateral gland performed significantly better than the single variable model. The dose to the contralateral parotid gland was more important than the dose to the ipsilateral parotid gland when treated with IMRT; similar reports can be found in a number of papers on patient-rated xerostomia , , –.
For the 12-month time point, the dose to the contralateral parotid gland was also significantly correlated with late xerostomia. However, the mean dose to the ipsilateral parotid gland was the most important variable. As the mean dose to the contralateral parotid gland was significantly lower in patients with/without xerostomia when treated with IMRT, it is not surprising that recovery can be expected during this period. Indeed, some investigators showed that the contralateral parotid recovered after 9 to 24 months in different recovery levels. Eisbruch et al.  reported the correlation between mean parotid dose and parotid salivary flow recovery in patients with non-nasopharyngeal head-and-neck cancer. In the Kwong et al.  study, 60% of patients recovered at least 25% of their baseline stimulated parotid salivary flow at 12 months post-IMRT. In Hsiung et al. , 64.5% of 31 parotid glands recovered at least 25% of their baseline maximal excretion ratio at 9 months post-IMRT. This was consistent with the results of another study , which suggested that reducing radiotherapy doses to both parotid glands to <26 Gy can reduce xerostomia significantly.
Although parotid gland dysfunction plays an important role in the development of patient-rated xerostomia , it is not the only prognostic factor. Beetz et al. recently showed that age and baseline xerostomia were independent prognostic factors for patient-rated xerostomia, in addition to the mean dose to the parotid glands , . In the current study, we likewise found that elderly patients have a higher probability of suffering from xerostomia than younger patients. Beetz et al. claimed that older patients are more likely to use medication and to have co-morbidities that may influence and reduce saliva production at rest , . Fortin et al.  showed that in a univariate and multivariate analysis, total parotid mean dose and age were strongly associated with a lower incidence of grade ≥2 xerostomia at 6, 12 and 24 months.
In this study, those who had a higher financial status or a higher level of education tended to avoid the inconvenience of xerostomia. Similarly, Ramsey et al.  showed that lower financial status in colorectal cancer patients was associated with a worse outcome for reported pain. Fang et al.  found that NPC survivors with a higher annual family income and level of education presented a significantly better outcome on QoL scores. These findings suggest that the patient's individual abilities and the resources available to cope with the threat of treatment complications are powerful variables that affect their future quality of life. Financial status and education remain two of the most significant variables correlated with xerostomia in our cohort. In addition, smoking, a recognized carcinogenic factor in the overwhelming majority for head and neck cancers , , was shown to be a contributory cause of xerostomia after IMRT in our cohort. Rad et al.  also indicated that long-term smoking would significantly reduce whole-mouth salivary flow rate.
That the risk of complications may depend on more factors than simply the dose to a single organ seems to be true. Clinical datasets on normal tissue complications often include a large number of variables, many of which need to be investigated and incorporated in a model because they may be related to a given complication. As reported by El Naqa et al., the prediction of endpoints can be improved by mixing clinical and dose-volume factors, while bootstrap-based variable selection analysis increases the reliability of the predictive models . Indeed, our results showed better performance of the multivariate model compared with the univariate relationships between dose–volume prognostic factors and XER3mor XER12m.
Recently, we reported the results of a retrospective study that was conducted to develop a univariate NTCP model for patient-rated moderate-to-severe xerostomia among HNC patients treated with IMRT . The AUC values for the model were 0.68 (95% CI 0.61–0.74) and 0.72 (95% CI 0.64–0.80) for the 3- and 12-month time points, respectively. The only prognostic factor was the mean dose to parotid glands. In this study, for the selection of suboptimal prognostic factors, the system performance AUC values improved from 0.68 to 0.84 for the 3-month time point, and from 0.72 to 0.84 for the 12-month time point when the number of prognostic factors used increased from one to three or four, respectively. The multivariate approach allowed the integration of different prognostic factors for estimating the risk on XER3m and XER12m in individual patients. In this regard, it should be emphasized that dose–effect relationships for this endpoint should be described by multiple NTCP curves rather than by a single NTCP curve. Moreover, whether the gain is worth the increased complexity requires further investigation. The problem of increased complexity is the potential limitation of this study. After all, a large number of selected prognostic factors may lead to instability in the models.
In the literature, a similar report by Xu et al. stated that the NTCP model obtained by LASSO was statistically significant . Their dataset contained 185 patients who were all treated with primary 3D-CRT for HNC at 6 months after RT; 106 patients were assessed as having xerostomia. The primary endpoint was defined as Radiation Therapy Oncology Group (RTOG) grade 2+ xerostomia using the RTOG late radiation morbidity scoring system. The prognostic factors used for the NTCP modeling included five clinical and 16 dosimetric factors. The five clinical variables were chemotherapy, gender, age, treatment center, and baseline xerostomia score. The dosimetric factors were the mean dose given to the organs at risk and their volume. These organs were the lower lip, the soft palate, the contralateral and ipsilateral parotid gland, the contralateral and ipsilateral sublingual gland, and the contralateral and ipsilateral submandibular gland. The differences in comparison with our models included treatment technique (IMRT vs. 3D-CRT), time point (3- and 12-month vs. 6-month), endpoint (G3+ vs. G2+), and candidate prognostic factors (14 clinical and two dosimetric factors vs. five clinical and 16 dosimetric factors). However, the system performance AUC values were similar with both regression models (approximately 0.85). Therefore, LASSO was recommended for multivariate logistic regression NTCP modeling.
The LL forward selection of bootstrap predictions in the multivariate logistic regression analysis was optimal with a model consisting of 9 and 11 variables for the 3- and 12-month time points, respectively. The system performance AUC values were no better than the results of LASSO. However, the selection of too many prognostic factors may result in overfitting.
There are a number of potential limitations of this study. Arjen van der Schaaf et al.  claimed that approximately 200 patients are required to obtain a model with high predictive power. In the current study, the number of patients evaluable at 12 months was far below the recommended 200 patients. The predictive power can be improved by increasing the sample size. Data from 6 months were not evaluated on the basis of previously published data –, in which the dose dependency of salivary function and recovery with time after RT was found up to 12 months post-RT. In addition, in the Kwong et al.  study, 60% of patients recovered at least 25% of their baseline stimulated parotid salivary flow at 12 months post-IMRT. Eisbruch et al. ,  showed that a parotid mean dose <26 Gy decreases the severity of late complications and gives a better chance of functional parotid recovery at 12 months. The stability for evaluating late toxicity was more reliable at 12 months than 6 months.
That chemotherapy, a non-dosimetric patient factor, may affect the risk of xerostomia is an issue of special concern. Deasy et al.  and Moiseenko et al.  stated that the use of chemotherapy was not typically correlated with xerostomia risk. This is consistent with our results, as chemotherapy was not significant among the 16 candidate prognostic factors and there was no association between chemotherapy and risk of xerostomia.
LASSO is recommended for multivariate NTCP modeling. However, in practice, the suboptimal LASSO method tends to incorporate fewer prognostic factors than the optimal approach. The suboptimal LASSO method is useful in practice. The prognostic factors included in the model are useful to further optimize current IMRT treatment with regard to patient-rated xerostomia and to indicate which prognostic factors are the most important, thus helping to spare the glands as much as possible, and to optimize current treatment with IMRT.
Conceived and designed the experiments: TL PC. Performed the experiments: CC HT YH HW MH JL YH JW LC. Analyzed the data: PC TL. Contributed reagents/materials/analysis tools: FF SL PC TL. Wrote the paper: TL PC.
- 1. Lim C, Tassone C, Liu X, Thometz J, Lyon R (2012) Correlation of idiopathic scoliosis assessments between newly developed milwaukee topographic scanner and Quantec. Stud health technol inform 176: 255.
- 2. Moiseenko V, Wu J, Hovan A, Saleh Z, Apte A, et al. (2012) Treatment planning constraints to avoid xerostomia in head-and-neck radiotherapy: an independent test of QUANTEC criteria using a prospectively collected dataset. Int J Radiat Oncol Biol Phys 82: 1108–1114.
- 3. Beetz I, Schilstra C, Burlage FR, Koken PW, Doornaert P, et al. (2012) Development of NTCP models for head and neck cancer patients treated with three-dimensional conformal radiotherapy for xerostomia and sticky saliva: The role of dosimetric and clinical factors. Radiother Oncol 105: 86–93.
- 4. Beetz I, Schilstra C, van der Schaaf A, van den Heuvel ER, Doornaert P, et al. (2012) NTCP models for patient-rated xerostomia and sticky saliva after treatment with intensity modulated radiotherapy for head and neck cancer: The role of dosimetric and clinical factors. Radiother Oncol 105: 101–106.
- 5. Beetz I, Schilstra C, van Luijk P, Christianen ME, Doornaert P, et al. (2012) External validation of three dimensional conformal radiotherapy based NTCP models for patient-rated xerostomia and sticky saliva among patients treated with intensity modulated radiotherapy. Radiother Oncol 105: 94–100.
- 6. Lee T-F, Fang F-M (2013) Quantitative analysis of normal tissue effects in the clinic (QUANTEC) guideline validation using quality of life questionnaire datasets for parotid gland constraints to avoid causing xerostomia during head-and-neck radiotherapy. Radiother Oncol 106: 352–358.
- 7. El Naqa I, Bradley J, Blanco AI, Lindsay PE, Vicic M, et al. (2006) Multivariable modeling of radiotherapy outcomes, including dose–volume and clinical factors. Int J Radiat Oncol Biol Phys 64: 1275–1286.
- 8. van de Water TA, Bijl HP, Westerlaan HE, Langendijk JA (2009) Delineation guidelines for organs at risk involved in radiation-induced salivary dysfunction and xerostomia. Radiother Oncol 93: 545–552.
- 9. Xu C-J, van der Schaaf A, Schilstra C, Langendijk JA, van't Veld AA (2012) Impact of statistical learning methods on the predictive power of multivariate normal tissue complication probability models. Int J Radiat Oncol Biol Phys 82: e677–e684.
- 10. Xu C-J, van der Schaaf A, Van't Veld AA, Langendijk JA, Schilstra C (2012) Statistical validation of normal tissue complication probability models. Int J Radiat Oncol Biol Phys 84: e123–e129.
- 11. Xu J, Yin J (2013) Kernel least absolute shrinkage and selection operator regression classifier for pattern classification. Comput Vis, IET 7: 48–55.
- 12. Colombani C, Legarra A, Fritz S, Guillaume F, Croiseau P, et al. (2012) Application of Bayesian least absolute shrinkage and selection operator (LASSO) and BayesCπ methods for genomic selection in French Holstein and Montbéliarde breeds. J Dairy Sci 96: 575–591.
- 13. Ogutu JO, Schulz-Streeck T, Piepho H-P (2012) Genomic selection using regularized linear regression models: ridge regression, lasso, elastic net and their extensions. BMC proceedings: BioMed Central Ltd. pp. S10.
- 14. Rasmussen MA, Bro R (2012) A tutorial on the Lasso approach to sparse modeling. Chemometr Intell Lab
- 15. Teppola P, Taavitsainen V-M (2013) Parsimonious and robust multivariate calibration with rational function Least Absolute Shrinkage and Selection Operator and rational function Elastic Net. Analytica Chimica Acta 768: 57–68.
- 16. Stock JH, Watson MW (2012) Generalized shrinkage methods for forecasting using many predictors. J Bus Econ Stat 30: 481–493.
- 17. Chen W-C, Hwang T-Z, Wang W-H, Lu C-H, Chen C-C, et al. (2009) Comparison between conventional and intensity-modulated post-operative radiotherapy for stage III and IV oral cavity cancer in terms of treatment results and toxicity. Oral oncol 45: 505–510.
- 18. Lee T-F, Ting H-M, Chao P-J, Fang F-M (2012) Dual Arc volumetric-modulated Arc radiotherapy (VMAT) of nasopharyngeal carcinomas: a simultaneous integrated boost treatment plan comparison with intensity-modulated radiotherapies and single arc VMAT. Clini Oncol 24: 196–207.
- 19. Leung SW, Lee T-F (2013) Treatment of nasopharyngeal carcinoma by tomotherapy: five-year experience. BMC Cancer 5: 9, 1–9.
- 20. Bjordal K, Hammerlid E, Ahlner-Elmqvist M, de Graeff A, Boysen M, et al. (1999) Quality of life in head and neck cancer patients: validation of the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire-H&N35. J Clini Oncol 17: 1008–1008.
- 21. Fang F-M, Tsai W-L, Lee T-F, Liao K-C, Chen H-C, et al. (2010) Multivariate analysis of quality of life outcome for nasopharyngeal carcinoma patients after treatment. Radiother Oncol 97: 263–269.
- 22. Tibshirani R (1996) Regression shrinkage and selection via the lasso. J Roy Stat Soc B 267–288.
- 23. Hastie T, Tibshirani R, Friedman J, Franklin J (2005) The elements of statistical learning: data mining, inference and prediction. The Math Intellg 27: 83–85.
- 24. Skwarchuk MW, Jackson A, Zelefsky MJ, Venkatraman ES, Cowen DM, et al. (2000) Late rectal toxicity after conformal radiotherapy of prostate cancer (I): multivariate analysis and dose–response. Int J Radiat Oncol Biol Phys 47: 103–113.
- 25. Deasy JO, Moiseenko V, Marks L, Chao K, Nam J, et al. (2010) Radiotherapy dose–volume effects on salivary gland function. Int J Radiat Oncol Biol Phys 76: S58–S63.
- 26. Beetz I, Steenbakkers RJ, Chouvalova O, Leemans CR, Doornaert P, et al. (2013) The QUANTEC criteria for parotid gland dose and their efficacy to prevent moderate to severe patient-rated xerostomia. Acta Oncologica 1–8.
- 27. Nutting CM, Morden JP, Harrington KJ, Urbano TG, Bhide SA, et al. (2011) Parotid-sparing intensity modulated versus conventional radiotherapy in head and neck cancer (PARSPORT): a phase 3 multicentre randomised controlled trial. Lancet Oncol 12: 127–136.
- 28. Eisbruch A, Kim HM, Terrell JE, Marsh LH, Dawson LA, et al. (2001) Xerostomia and its predictors following parotid-sparing irradiation of head-and-neck cancer. Int J Radiat Oncol Biol Phys 50: 695–704.
- 29. Eisbruch A, Ten Haken RK, Kim HM, Marsh LH, Ship JA (1999) Dose, volume, and function relationships in parotid salivary glands following conformal and intensity-modulated irradiation of head and neck cancer. Int J Radiat Oncol Biol Phys 45: 577–587.
- 30. Graff P, Lapeyre M, Desandes E, Ortholan C, Bensadoun R-J, et al. (2007) Impact of intensity-modulated radiotherapy on health-related quality of life for head and neck cancer patients: matched-pair comparison with conventional radiotherapy. Int J Radiat Oncol Biol Phys 67: 1309–1317.
- 31. Kwong DL, Pow EH, Sham JS, McMillan AS, Leung LH, et al. (2004) Intensity-modulated radiotherapy for early-stage nasopharyngeal carcinoma. Cancer 101: 1584–1593.
- 32. Hsiung C-Y, Ting H-M, Huang H-Y, Lee C-H, Huang E-Y, et al. (2006) Parotid-sparing intensity-modulated radiotherapy (IMRT) for nasopharyngeal carcinoma: preserved parotid function after IMRT on quantitative salivary scintigraphy, and comparison with historical data after conventional radiotherapy. Int J Radiat Oncol Biol Phys 66: 454–461.
- 33. Tribius S, Sommer J, Prosch C, Bajrovic A, Muenscher A, et al. (2013) Xerostomia after radiotherapy: What matters-mean total dose or dose to each parotid gland? Strahlenther Onkol 189: 216–222.
- 34. Leal SC, Bittar J, Portugal A, Falcao DP, Faber J, et al. (2010) Medication in elderly people: its influence on salivary pattern, signs and symptoms of dry mouth. Gerodontology 27: 129–133.
- 35. Fortin I, Fortin B, Lambert L, Clavel S, Alizadeh M, et al. (2013) Xerostomia in patients treated for oropharyngeal carcinoma: Comparing LINAC-based intensity-modulated radiation therapy to helical tomotherapy. Head Neck-J Sci Spec
- 36. Sturgis EM, Cinciripini PM (2007) Trends in head and neck cancer incidence in relation to smoking prevalence. Cancer 110: 1429–1435.
- 37. Sturgis EM, Wei Q, Spitz MR (2004) Descriptive epidemiology and risk factors for head and neck cancer. Semi Oncol 31: 726–733.
- 38. Rad M, Chamani G, Shahravan A, Hedayati N, Alizadeh F (2013) Survey of prevalence of xerostomia in a population of Kerman, Iran, attending medical and dental clinics. j oral health & oral epidemiology 2: 28–34.
- 39. van der Schaaf A, Xu C-J, van Luijk P, van't Veld AA, Langendijk JA, et al. (2012) Multivariate modeling of complications with data driven variable selection: Guarding against overfitting and effects of data set size. Radiother Oncol 105: 115–121.
- 40. Henson B, Inglehart M, Eisbruch A, Ship J (2001) Preserved salivary output and xerostomia-related quality of life in head and neck cancer patients receiving parotid-sparing radiotherapy. Oral oncol 37: 84–93.
- 41. Kaneko M, Shirato H, Nishioka T, Ohmori K, Takinami S, et al. (1998) Scintigraphic evaluation of long-term salivary function after bilateral whole parotid gland irradiation in radiotherapy for head and neck tumour. Oral oncol 34: 140–146.
- 42. Franzén L, Funegård U, Ericson T, Henriksson R (1992) Parotid gland function during and following radiotherapy of malignancies in the head and neck: a consecutive study of salivary flow and patient discomfort. Europ J Cancer 28: 457–462.