Using machine learning for the personalised prediction of revision endoscopic sinus surgery

Mikko Nuutinen; Jari Haukka; Paula Virkkula; Paulus Torkki; Sanna Toppila-Salmi

doi:10.1371/journal.pone.0267146

Abstract

Background

Revision endoscopic sinus surgery (ESS) is often considered for chronic rhinosinusitis (CRS) if maximal conservative treatment and baseline ESS prove insufficient. Emerging research outlines the risk factors of revision ESS. However, accurately predicting revision ESS at the individual level remains uncertain. This study aims to examine the prediction accuracy of revision ESS and to identify the effects of risk factors at the individual level.

Methods

We collected demographic and clinical variables from the electronic health records of 767 surgical CRS patients ≥16 years of age. Revision ESS was performed on 111 (14.5%) patients. The prediction accuracy of revision ESS was examined by training and validating different machine learning models, while the effects of variables were analysed using the Shapley values and partial dependence plots.

Results

The logistic regression, gradient boosting and random forest classifiers performed similarly in predicting revision ESS. Area under the receiver operating characteristic curve (AUROC) values were 0.744, 0.741 and 0.730, respectively, using data collected from the baseline visit until six months after baseline ESS. The length of time during which data were collected improved the prediction performance. For data collection times of 0, 3, 6 and 12 months after baseline ESS, AUROC values for the logistic regression were 0.682, 0.715, 0.744 and 0.784, respectively. The number of visits before or after baseline ESS, the number of days from the baseline visit to the baseline ESS, patient age, CRS with nasal polyps (CRSwNP), asthma, non-steroidal anti-inflammatory drug exacerbated respiratory disease and immunodeficiency or suspicion of it were all associated with revision ESS. Patient age and number of visits before baseline ESS carried non-linear effects for predictions.

Conclusions

Intelligent data analysis identified important predictors of revision ESS at the individual level, such as the frequency of clinical visits, patient age, Type 2 high diseases and immunodeficiency or a suspicion of it.

Citation: Nuutinen M, Haukka J, Virkkula P, Torkki P, Toppila-Salmi S (2022) Using machine learning for the personalised prediction of revision endoscopic sinus surgery. PLoS ONE 17(4): e0267146. https://doi.org/10.1371/journal.pone.0267146

Editor: Usman Qamar, National University of Sciences and Technology (NUST), PAKISTAN

Received: September 14, 2021; Accepted: April 3, 2022; Published: April 29, 2022

Copyright: © 2022 Nuutinen et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The datasets generated and analysed during the current study are not publicly available because they contain information that could compromise research participant privacy. The national law (Act on Secondary Use of Health and Social Data) prevents the sharing of material containing personal information at the individual level. Restrictions on data sharing were also imposed by the ethics committee of the Hospital District of Helsinki and Uusimaa (eettiset.toimikunnat@hus.fi). Data access requests may be directed to Anne Pitkäranta, Research Director of the Hospital District of Helsinki and Uusimaa (HUS) (contact via anne.pitkaranta@hus.fi), PO BOX 100 (Stenbäckinkatu), 00029 HUS, +35840 643 0511.

Funding: This work was supported in part by research grants from the Paulo Foundation, State funding for university-level health research (TYH2019322), the Finnish Anti-Tuberculosis Association Foundation, the Tampere Tuberculosis Foundation and the Väinö and Laina Kivi Foundation, all in Finland. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: STS reports a grant from GSK and consultancies for AstraZeneca, ERT, Novartis, Sanofi Pharma and Roche, all outside the submitted work. All other authors declare no conflicts of interest. This does not alter our adherence to PLOS ONE policies on sharing data and materials.

Introduction

Chronic rhinosinusitis (CRS) is a symptomatic inflammatory disease of the nasal and paranasal mucosa lasting more than 12 weeks [1]. With a prevalence of about 11%, CRS diminishes patient quality of life and productivity and increases healthcare costs [1]. The main phenotypes are CRS with nasal polyps (CRSwNP) and without (CRSsNP) [1–3]. The majority of CRS cases occurring in Western countries are characterised by Type 2 high inflammation with elevated levels of eosinophils, interleukin-4 (IL-4), IL-5 and IL-13 [1]. Nonsteroidal anti-inflammatory drug (NSAID) exacerbated respiratory disease is a Type 2 high chronic inflammatory syndrome with a partially unknown pathobiology associated with CRSwNP and asthma and with an increased morbidity [4–6].

Endoscopic sinus surgery (ESS) represents a cost-effective treatment [7] if conservative therapy (such as intranasal corticosteroids and nasal saline irrigation) is insufficient [1]. The success rates for initial ESS range from 76% to 98% [8, 9]. The early identification of CRS recurrence risk following ESS is cost-effective [10, 11], helping to correctly target treatment [12] and prevent permanent tissue changes [1].

A substantial number of studies have identified the risk factors of revision ESS [13–21]. These studies vary in sample size (n = 66 [21] vs. n = 61 000 [15]), data collection methods (large retrospective database [15] vs. prospective questionnaires [14]) and geographic location (USA [15], Australia [22] and Finland [13]). Commonly recognised risk factors include nasal polyps, asthma, allergy, non-steroidal anti-inflammatory drug (NSAID) exacerbated respiratory disease (NERD) and a previous ESS. In a meta-analysis [19], the strongest predictors of revision ESS were allergic fungal rhinosinusitis, NERD, asthma, prior polypectomy and operations prior to 2008. However, no prior research has analysed the prediction accuracy of revision ESS at the individual level or for variables with a nonlinear association. In this study, we examined the accuracy of the personalised prediction of revision ESS and attempted to identify the effects of important predictor variables via modern machine-learning algorithms and methods.

Materials and methods

Patients

This study consisted of rhinitis or rhinosinusitis patients presenting at the Department of Otorhinolaryngology at the Hospital District of Helsinki and Uusimaa (HUS), Finland. The HUS ethics committee approved the study protocol (no. 31/13/03/00/2015), thereby precluding the need to obtain written informed consent from patients for this retrospective follow-up study.

The inclusion criterion for the initial patient population (n = 5080) was an ICD-10 diagnosis of J30, J31, J32, J33 or J01 registered during outpatient visits in 2005, 2007, 2009, 2011 or 2013. Longitudinal data for a random patient sample were collected from the electronic health records (EHRs), such that the sample size was the same for each sampling year and each month of the sampling year. The last data collection day for follow-up was 31 September 2019. CRS was defined as diagnostic codes J33 and/or J32. ESS was defined based on the surgical codes (Table A in S1 File). In total, we excluded 27 CRS patients <16 years of age. The baseline visit was defined as the first clinic visit, and baseline ESS was defined as the first ESS identified in EHRs at the specific sampling time. Revision ESS was defined as an ESS performed following the baseline ESS during the follow-up period.

A total of 111 of 767 (14.5%) CRS patients underwent revision ESS (mean±stdev) 30.3±31.0 months following the baseline ESS (Fig 1a and Table 1). Among revised patients, 88 underwent one revision ESS and 23 patients underwent two or more revisions (Fig 1b).

Fig 1. Histograms of (a) time to revision ESS and (b) the number of revision ESS surgeries.

ESS, endoscopic sinus surgery.

https://doi.org/10.1371/journal.pone.0267146.g001

Table 1. Average follow-up time and number of ESS surgeries among patients with and without revision ESS.

https://doi.org/10.1371/journal.pone.0267146.t001

Variables

Table 2 summarises the patient characteristics that were analysed both from the structured EHR data (visits, procedure codes and patient diagnoses) and free clinical texts (diagnoses and comorbidities). Comorbidity-related variables were obtained from the ICD-10 codes (Table B in S1 File) and using validated keyword-based information extraction from free clinical texts (see S2 File). For asthma, we used ICD-10 code J45, doctor-diagnosed lung function test–confirmed asthma. A NERD diagnosis was obtained from EHR text and was based on a typical history of airway symptoms following the ingestion of NSAID with/without challenge test confirmation of NERD.

Table 2. Characteristics of patients without and with the revision ESS.

P values calculated using the Fisher’s exact or Mann-Whitney U test.

https://doi.org/10.1371/journal.pone.0267146.t002

Machine learning algorithms

In this study, we conducted four analyses: univariate models, a machine learning classifier comparison, an analysis of the effect of the data collection time and a model interpretability analysis. The univariate models examined the prediction accuracy of individual variables using univariate logistic regression classifiers. The machine learning classifier comparison examined the predictive performance of three classifiers: random forest, logistic regression and gradient boosting. The random forest and gradient boosting classifiers were chosen because they are widely used and have demonstrated good performance [23]. Logistic regression was chosen because it is simple and still performs relatively well [24]; it remains important to determine whether simpler algorithms perform comparably well. To understand the effect of the data collection time, the performance of the classifier was calculated when the variables were collected from the baseline visit to the baseline ESS or to 3, 6 or 12 months following baseline ESS. Fig 2 illustrates the timeline of the data collection for the 0-, 3-, 6- and 12-month models. For example, the model for 3 months was trained and validated using patient data collected from patient EHRs between patients’ baseline visits and 3 months following baseline ESS. The logistic regression classifier was selected for the analysis of the data collection time period because it is simple and because the machine learning classifier comparison demonstrated that its performance was higher than or similar to that of the other classifiers.

Fig 2. Data collection for the analysis of the effect of the data collection time period.

The classifiers’ performance was calculated based on when the data were collected from the timeline of the baseline visit to the baseline ESS or to 3, 6 or 12 months following the baseline ESS. The baseline visit represents the initial clinic visit, and baseline ESS represents the first ESS identified in EHRs. Revision ESS represents the ESS performed following baseline ESS during the follow-up time period. The models were trained to predict the revision ESS. ESS, endoscopic sinus surgery; EHR, electronic health record.

https://doi.org/10.1371/journal.pone.0267146.g002
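As an illustration of the data collection windows in Fig 2, the following minimal sketch counts a patient's visits before baseline ESS and within 0, 3, 6 or 12 months after it. It is not the study's extraction code: the function name, the example dates and the approximation of one month as 30 days are all assumptions made here for illustration.

```python
from datetime import date, timedelta


def visit_counts(visit_dates, baseline_visit, baseline_ess, months_after):
    """Count visits before baseline ESS and within the post-ESS window."""
    # Assumption: one month is approximated as 30 days.
    window_end = baseline_ess + timedelta(days=30 * months_after)
    before = sum(baseline_visit <= d < baseline_ess for d in visit_dates)
    after = sum(baseline_ess <= d <= window_end for d in visit_dates)
    return {"visits_before_ess": before, "visits_after_ess": after}


# Hypothetical patient (all dates invented for the example).
baseline_visit = date(2011, 1, 10)
baseline_ess = date(2011, 4, 1)
visits = [date(2011, 1, 10), date(2011, 2, 20), date(2011, 4, 15),
          date(2011, 8, 1), date(2012, 2, 1)]

# One feature set per model in Fig 2 (0, 3, 6 and 12 months).
features = {m: visit_counts(visits, baseline_visit, baseline_ess, m)
            for m in (0, 3, 6, 12)}
for months, counts in features.items():
    print(months, counts)
```

A longer window can only add post-ESS visits, so the later models see at least as much information as the earlier ones.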

However, the logistic regression classifier is linear and thus unable to model possible nonmonotonic relationships between predictors and outcomes. The random forest and gradient boosting classifiers can model complex, nonmonotonic relationships, but are so-called black box models, or uninterpretable classifiers: the relationships between inputs and outputs are difficult to understand directly from the parameters or the structure of the trained model. For the model interpretability analysis, we chose to use gradient boosting classifiers. The analysis was conducted using Shapley values (SHAP) and partial dependence plots (PDPs) [25, 26], which we used to assess the importance and the possible nonmonotonic effects of the variables.
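The idea behind a partial dependence plot can be sketched in a few lines: one variable is forced to each value on a grid for every patient, and the model's predictions are averaged. The example below is a hand-written illustration, not the pdpbox implementation used in the study; the toy risk function, the column layout and the grid values are hypothetical.

```python
import numpy as np


def partial_dependence(predict, X, feature, grid):
    """Average prediction when column `feature` is set to each grid value."""
    curve = []
    for value in grid:
        X_mod = X.copy()
        X_mod[:, feature] = value          # force the variable for all rows
        curve.append(predict(X_mod).mean())
    return np.array(curve)


def toy_risk(X):
    """Hypothetical risk model: nonmonotonic in age, increasing in visits."""
    age, visits = X[:, 0], X[:, 1]
    return 0.10 + 0.06 * np.exp(-((age - 48.0) / 15.0) ** 2) + 0.02 * visits


rng = np.random.default_rng(0)
# Column 0: age in years; column 1: number of visits (both simulated).
X = np.column_stack([rng.uniform(16, 90, 200), rng.integers(0, 8, 200)])

age_grid = np.array([20.0, 48.0, 85.0])
pdp_age = partial_dependence(toy_risk, X, feature=0, grid=age_grid)
print(pdp_age)
```

In this toy model the averaged risk is highest at the middle grid value, which is the kind of nonmonotonic shape a linear classifier cannot represent.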

Model training

Fig 3 shows the data flows for training and testing the classifiers. Original data were first divided into two distinct data folds: the training fold (70% of the data) and the test fold (30% of the data). We used the training fold to select the variables and hyperparameters and to train the final models. The test fold served as a held-out dataset, external to model development, which we used to measure the performance of the final models. During splitting, folds were stratified to preserve the proportion of patients in both target classes (no revision vs. revision).
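A stratified 70/30 split of this kind might look as follows with scikit-learn's `train_test_split`. This is a sketch under stated assumptions rather than the study's pipeline: the feature matrix is random placeholder data, and only the cohort size (767) and the number of revision patients (111) are taken from the text.

```python
import numpy as np
from sklearn.model_selection import train_test_split

n_patients, n_revision = 767, 111

# Labels: 1 = revision ESS, 0 = no revision.
y = np.zeros(n_patients, dtype=int)
y[:n_revision] = 1

# Placeholder features; the real variables are those listed in Table 2.
rng = np.random.default_rng(0)
X = rng.normal(size=(n_patients, 5))

# stratify=y keeps the revision proportion similar in both folds.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.30, stratify=y, random_state=0
)

print(len(y_train), len(y_test))
print(round(y_train.mean(), 3), round(y_test.mean(), 3))
```

Repeating the split with different `random_state` values would give a set of distinct training and test folds over which results can be averaged.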

Fig 3. Data flows for the prediction model learning pipelines.

Variable selection (step 1), search for hyperparameters (step 2), model training (step 3) and performance evaluation (step 4). Data flow (a) is used to compare machine learning classifiers and the effect of data collection time period analyses. Data flow (b) is used for the univariate model and model interpretability analyses.

https://doi.org/10.1371/journal.pone.0267146.g003

The data flow shown in Fig 3a was used to compare the machine learning classifiers and for the analysis of the data collection time period. Its steps were variable selection (step 1), searching for model hyperparameters (step 2), model training (step 3) and performance evaluation (step 4). The data flow in Fig 3b was used for the univariate model and the model interpretability analyses. Its steps were searching for model hyperparameters (step 2), model training (step 3) and performance evaluation (step 4). The primary difference between the two data flows is that Fig 3a uses k-fold cross-validation for variable selection (SFS, sequential forward selection), whereas the data flow in Fig 3b relies on predefined variables for training models.

Sequential forward variable selection

The data flow in Fig 3a contains the method of sequential forward variable selection (SFS, step 1) [27]. SFS begins with an empty set and adds one variable at a time from the original variable set S_all (Table 2) for classifier F(⋅) by maximising the performance measure. We used the area under the receiver operating characteristic (AUROC) curve as the performance metric. Because our data are unbalanced, we used class weight balanced loss functions. The output of the SFS was the 15 most important variables S_k,sel for classifier F(⋅). The (average) importance of each variable a was measured using the following rank metric:

$$R(a) = \sum_{k} \left( \#F - r(k, a) + 1 \right) \tag{1}$$

where r(k, a) is the rank of variable a based on the data set k and #F is the size of the largest subset resulting from SFS [28–30] (a variable not selected for data set k receives no points from it). In this study, #F = 15. A higher R(a) (rank score) indicates that variable a is more important according to SFS, because it was selected in the smaller variable subsets. That is, the variable has a higher predictive capability for revision ESS according to SFS.

The optimal hyperparameters for classifier F(⋅) with variables S_m,sel were identified using the grid-search method (step 2). The hyperparameter values for different classifiers and the summary statistics for the selected hyperparameter values appear in S5 File. Following the identification of the optimal hyperparameters for classifier F(⋅) using variables S_m,sel, the model was trained (step 3) and the performance was calculated using dataset X_test (step 4).
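The rank score can be illustrated with a short sketch. It reads the score as a sum over SFS runs of (#F − r + 1) points, an interpretation consistent with the Table 6 caption (15 points for the first-selected variable, 14 for the next, and so on). The variable names and the selection orders below are hypothetical, and #F is set to 3 only to keep the example small.

```python
def rank_scores(sfs_runs, max_subset_size):
    """Sum (#F - rank + 1) points per variable over all SFS runs."""
    scores = {}
    for selected in sfs_runs:  # one ordered list of variables per run
        for rank, variable in enumerate(selected[:max_subset_size], start=1):
            points = max_subset_size - rank + 1
            scores[variable] = scores.get(variable, 0) + points
    # Variables never selected simply do not appear (zero points).
    return dict(sorted(scores.items(), key=lambda kv: kv[1], reverse=True))


# Hypothetical selection orders from three runs, with #F = 3.
runs = [
    ["visits_6m", "crswnp", "asthma"],
    ["visits_6m", "asthma", "nerd"],
    ["crswnp", "visits_6m", "age"],
]

scores = rank_scores(runs, max_subset_size=3)
print(scores)
```

With ten runs and #F = 15, the same function would produce scores in the 0 to 150 range.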

Performance analysis

We used the following standard performance metrics: AUROC, the area under the precision recall curve (AUPRC), precision, sensitivity, specificity and the F1 score. AUROC is one of the most commonly used evaluation metrics for measuring the performance of a classification model. An AUROC of 0.5 indicates no discrimination above chance, while an AUROC of 1.0 indicates a perfect classification. As a rough guide to the classification ability of a model, AUROC = 0.9–1.0 indicates an excellent performance, AUROC = 0.8–0.9 indicates a good performance, AUROC = 0.7–0.8 indicates a fair performance and AUROC = 0.6–0.7 indicates a poor performance [31, 32]. AUPRC is an evaluation metric often used for imbalanced data sets. The baseline of AUPRC (no discrimination above chance) is equal to the fraction of positives. The baseline AUPRC of our study is 0.145, because 14.5% of the patients underwent revision ESS. The baseline values for the AUROC and AUPRC metrics were confirmed for our data by training and testing the models using randomised label data (Table A in S3 File).
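To make the two baselines concrete, the sketch below computes AUROC from its pairwise definition (the probability that a randomly chosen positive patient receives a higher score than a randomly chosen negative one, with ties counted as one half) and the chance-level AUPRC as the fraction of positives. The labels and scores are invented for illustration; only the counts 111 and 767 come from the text.

```python
def auroc(labels, scores):
    """AUROC as the fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    # A tie between a positive and a negative counts as half a win.
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical predicted risks for six patients (1 = revision ESS).
labels = [1, 0, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.3, 0.2, 0.1]
example_auroc = auroc(labels, scores)

# A model that gives every patient the same score cannot discriminate.
chance_auroc = auroc(labels, [0.5] * len(labels))

# Chance-level AUPRC equals the fraction of positives in the cohort.
auprc_baseline = 111 / 767

print(example_auroc, chance_auroc, round(auprc_baseline, 3))
```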

Precision refers to the number of true positive results divided by the number of all positive results, including those not identified correctly. In this study, precision refers to the ability of a model to identify only revision patients. Sensitivity, by comparison, indicates the number of true positive results divided by the number of all samples that should have been identified as positive. In this study, sensitivity refers to the ability of the model to identify all of the revision patients. Specificity is the number of true negative results divided by the number of all samples that should have been identified as negative. In this study, specificity refers to the ability of the model to identify all patients not needing revision. Finally, the F1 score represents the harmonic mean of the precision and sensitivity. Precision, sensitivity, specificity and F1 score are calculated using the following equations:

$$\text{Precision} = \frac{TP}{TP + FP} \tag{2}$$

$$\text{Sensitivity} = \frac{TP}{TP + FN} \tag{3}$$

$$\text{Specificity} = \frac{TN}{TN + FP} \tag{4}$$

$$F_1 = \frac{2 \cdot \text{Precision} \cdot \text{Sensitivity}}{\text{Precision} + \text{Sensitivity}} \tag{5}$$

where TP is the number of true positives predicted by the classifier, FP is the number of false positives, FN is the number of false negatives and TN is the number of true negatives.
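As a worked example of these four definitions, the sketch below computes them from confusion-matrix counts. The counts are hypothetical and chosen only to give round numbers; they are not results from this study.

```python
def classification_metrics(tp, fp, fn, tn):
    """Precision, sensitivity, specificity and F1 from confusion-matrix counts."""
    precision = tp / (tp + fp)      # share of predicted revisions that were real
    sensitivity = tp / (tp + fn)    # share of real revisions that were found
    specificity = tn / (tn + fp)    # share of non-revisions correctly cleared
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    return {
        "precision": precision,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "f1": f1,
    }


# Hypothetical counts: 40 revision and 160 non-revision patients.
metrics = classification_metrics(tp=30, fp=20, fn=10, tn=140)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Note that specificity is computed over patients who truly did not need revision, not over patients the model classified as negative.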

Software

We used seven Python packages (sklearn [33], xgboost [34], mlxtend [35], numpy [36], pandas [37], shap [25, 38] and pdpbox [39]) to implement the classifiers and compute the performance values and model interpretations. SFS was computed using the ‘SequentialFeatureSelector’ function in the mlxtend package. The random forest, logistic regression and gradient boosting classifiers were implemented using functions from the sklearn.ensemble, sklearn.linear_model and xgboost packages, respectively. The grid search for the hyperparameters was conducted using the ‘GridSearchCV’ function in the sklearn.model_selection package. We computed the Shapley values using the ‘TreeExplainer’ function in the shap package. Partial dependence plots (PDPs) were created using the ‘pdp_isolate’ function in the pdpbox package. The numpy and pandas packages were used for data reading and processing.

Results

The CRS patient population that underwent baseline ESS (n = 767) included 448 (58%) females and ranged in age from 16 to 90 years. Table 2 summarises the patient characteristics and the proportion who did and did not undergo revision ESS. The following comorbidities were significantly associated with revision ESS during follow-up: doctor-diagnosed lung function test-confirmed asthma, CRSwNP, allergies, chronic respiratory disease, EHR text-based NERD and immunodeficiency or a suspicion of immunodeficiency. The following continuous variables were significantly associated with revision ESS: an older age, a shorter time from the baseline visit to baseline ESS, a higher frequency of visits between the baseline visit and baseline ESS and a higher number of visits from the baseline ESS to 3, 6 and 12 months postoperatively.

Univariate analyses

Table 3 presents the results of the univariate logistic regression models to predict revision ESS following baseline ESS. Results represent the average of 10 reformulations from the training and test folds (Fig 3b). Among continuous variables, the highest AUROC values were for the number of visits 12, 6, and 3 months following baseline ESS (AUROC = 0.77, 0.70, 0.66, respectively). The next highest AUROC values were for the time between the baseline visit and baseline ESS (AUROC = 0.59) and for the frequency of visits between the baseline visit and baseline ESS (AUROC = 0.58). Among categorical variables, the highest AUROC values were for asthma (AUROC = 0.65), CRSwNP (AUROC = 0.64), immunodeficiency or a suspicion of it (AUROC = 0.61), allergies (AUROC = 0.60), chronic respiratory diseases (AUROC = 0.59) and NERD (AUROC = 0.59). We also found that the odds ratios (ORs) for continuous variables all exceeded 1.0 with one exception, indicating that a higher number of visits and greater frequency of visits and a shorter time between baseline visit and baseline ESS increased the probability of a revision ESS.

Table 3. Odds ratios (ORs) and performance values (AUROC, sensitivity, specificity and F1 score) for predicting a revision ESS using different variables in univariate logistic regression models.

https://doi.org/10.1371/journal.pone.0267146.t003

Machine learning classifier comparison

The plots in Fig 4 show the AUROC values for the random forest, logistic regression and gradient boosting classifiers as a function of the number of variables. We applied the SFS method to select variables collected from the baseline visit until six months after the baseline ESS. Results are reported as the averages from 10 reformulations from the training and test folds (see Fig 3a). The AUROC values first increased rapidly and then reached a plateau as a function of the number of variables. For the logistic regression classifier, the highest average AUROC (0.744) was achieved using six variables. For the gradient boosting classifier, the highest AUROC (0.741) was with eight variables. For the random forest classifier, the highest AUROC (0.737) was with 11 variables. The AUPRC values for the same number of variables were 0.354, 0.348 and 0.378, respectively. The baseline AUROC and AUPRC values (representing no discrimination above chance) calculated using the random classifier were 0.499, 0.472 and 0.489 and 0.149, 0.151 and 0.149, respectively (Table A in S3 File). The performance of the ensemble model in which the logistic regression, random forest and gradient boosting classifiers were combined was comparable with the performance of each individual classifier (Table A in S4 File). Tables 4 and 5 summarise the AUROC, AUPRC, sensitivity and specificity values as a function of the number of variables.

Table 4. AUROC and AUPRC values as a function of the number of variables for predicting revision ESS.

We selected variables using the sequential forward selection (SFS) method. Three models were used and the classifiers in these models were the logistic regression (LR), gradient boosting (GB) and random forest (RF) for predicting revision ESS. Table 4 (AUROC values) is related to Fig 4.

https://doi.org/10.1371/journal.pone.0267146.t004

Table 5. Sensitivity and specificity values as a function of the number of variables for predicting revision ESS.

We selected variables using the sequential forward selection (SFS) method. Three models were used and the classifiers in these models were the logistic regression (LR), gradient boosting (GB) and random forest (RF) for predicting revision ESS.

https://doi.org/10.1371/journal.pone.0267146.t005

Fig 4. AUROC values as a function of the number of variables used to predict revision ESS.

We used three models and the classifiers in these models were logistic regression, gradient boosting and random forest for predicting revision ESS. AUROC, area under the receiver operating characteristics curve; ESS, endoscopic sinus surgery.

https://doi.org/10.1371/journal.pone.0267146.g004

Table 6 presents the variables selected using SFS in order of the rank scores calculated using Eq 1. When using any of the three classifiers, the following variables resulted in high rank scores, indicating their importance as predictors of revision ESS: the number of visits six months after baseline ESS, CRSwNP, asthma and NERD. In addition, the frequency of visits from the baseline visit to baseline ESS and the number of visits before the baseline ESS emerged as important predictors.

Table 6. The top ten variables in prediction capacity of revision ESS.

Three models were used and the classifiers of the three models were the logistic regression, gradient boosting and random forest for predicting revision ESS. We used the sequential forward selection (SFS) method to select the top performing variables. In each SFS run, the best variable was awarded 15 points, the next best variable 14 points and so on. Ten runs were performed using each of the three classifiers. The rank score represents the sum of points (range, 0–150 points).

https://doi.org/10.1371/journal.pone.0267146.t006

Effect of the data collection time

The effect of the length of time for data collection on the model’s ability to predict the risk of revision ESS was evaluated using the logistic regression classifier. Fig 5 presents the AUROC values when the data collection time period was from the baseline visit to the baseline ESS or until 3, 6 or 12 months after baseline ESS. Table 7 summarises the AUROC, AUPRC, sensitivity, specificity and F1 score values. The highest performance (AUROC = 0.784) was, as expected, found in the model that included a 12-month follow-up period, because more information was available in that model compared with models using 3- or 6-month follow-up periods or no follow-up period at all. The sensitivity for the 12-month model reached 0.61, indicating that the model identified 61% of patients needing revision ESS. The specificity for the 12-month model reached 0.79, indicating that the model correctly classified as negative 79% of the patients who did not need revision ESS.

Fig 5. Receiver operating characteristics curves (ROC) to predict revision ESS.

We used the logistic regression classifier in the four models, which used four different data collection time periods. The data collection time periods (0, 3, 6 or 12 months) indicate the times during which data were collected after baseline ESS. ESS, endoscopic sinus surgery.

https://doi.org/10.1371/journal.pone.0267146.g005

Table 7. Average performance values using all of the variables to predict revision ESS by using different data collection time periods in the logistic regression models.

The data collection time periods (0, 3, 6 or 12 months) indicate the time during which data were collected after baseline ESS. See also Fig 5.

https://doi.org/10.1371/journal.pone.0267146.t007

Interpretability analysis

For the model interpretability analysis, we trained the gradient boosting classifier using the variables collected from the baseline visit until 6 months following baseline ESS, employing the data flow from Fig 3b. Fig 6 illustrates the variables sorted by the sum of the absolute Shapley values across all patients. The distributions of the data points on the plots show the impact of each variable on the classifier output. We found that a high number of visits after baseline ESS and a short time interval between the baseline visit and baseline ESS both increased the revision ESS risk. In addition, CRSwNP, asthma and allergies increased the revision ESS risk. The Shapley values revealed that patient age and the frequency of clinical visits from baseline visit to baseline ESS (that is, the time period from the baseline visit to the baseline ESS and the number of visits before baseline ESS) affected the revision ESS risk in a nonmonotonic manner. That is, the red points (values higher than the average) of these variables are dispersed on both sides of the scale.

Fig 6. Shapley values (SHapley Additive exPlanations, SHAP) for the ten most important variables predicting revision ESS.

The gradient boosting algorithm was used as the classifier. The definition of the ten most important variables is based on the sum of the absolute Shapley values. The red points indicate patient-specific variable values higher than the average value of the variable, and the blue points indicate patient-specific variable values lower than the average value of the variable. A longer distance between a red and blue point indicates a better capacity of the variable to predict revision ESS. ESS, endoscopic sinus surgery; CRSwNP, chronic rhinosinusitis with nasal polyps.

https://doi.org/10.1371/journal.pone.0267146.g006

Fig 7 shows the PDPs for the ten variables with the highest Shapley values. The PDP for the number of visits 6 months following baseline ESS revealed a wide risk score range, from a value of 0.1 for patients with fewer than two visits following baseline ESS up to a value of about 0.26 for patients with more than seven visits (Fig 7b). Similarly, if the patient had two or more postoperative visits within 3 months, the risk score for revision ESS increased (Fig 7g). The plot for the time between the baseline visit and baseline ESS revealed a sharp drop in the risk score after about 100 days (Fig 7f). When the time between the baseline visit and ESS was less than 100 days, the risk score was about 0.15. When the time increased to >500 days, the risk score decreased to <0.13. The PDP curve for age was nonmonotonic and the risk scores varied from 0.1 for patients aged 16 to 30 years to about 0.14 for patients aged 70 to 90 years (Fig 7e). The risk score was 0.16 for patients aged 30 to 65 years. Furthermore, the effect of the number of visits between the baseline visit and baseline ESS was nonmonotonic. Patients with 10 to 25 visits between the baseline visit and baseline ESS exhibited a smaller risk for revision ESS than patients with fewer than 10 visits or patients with more than 25 visits (Fig 7j).

Fig 7. Partial dependence plots (PDP) for predicting revision ESS using the gradient boosting classifier of the ten variables with the highest SHAP values (Shapley or SHapley Additive exPlanations).

ESS, endoscopic sinus surgery; CRSwNP, chronic rhinosinusitis with nasal polyps. See also Fig 6.

https://doi.org/10.1371/journal.pone.0267146.g007

We also detected a moderate correlation between the number of days from the baseline visit to the baseline ESS and the number of visits (p < 0.001, correlation coefficient r = 0.51 from the Pearson’s linear correlation test). Yet, the correlation was weak between the number of days from the baseline visit to the baseline ESS and the following variables: age (r = 0.14), CRSwNP (r = 0.06), asthma (r = 0.19) and immunodeficiency (r = −0.00).

Discussion

This study aimed to identify individual-level risk factors associated with revision ESS among CRS patients through the use of machine-learning algorithms. Personalised risk assessment is a process whereby an individual’s level of risk is calculated using multiple predictors [40]. Personalised risk communication represents a process through which the results of an individual’s risk assessment are tailored to their preferences and for specific uses [40]. In part, we identified previously unpublished important predictors of revision ESS, such as a high number of visits before and after baseline ESS as well as a short time interval between the baseline visit and baseline ESS. Our data also demonstrated that demographic variables such as age, Type 2 high diseases (CRSwNP, asthma and NERD) and immunodeficiency or a suspicion of it were important predictors of revision ESS at the individual level. These findings agree with previous observations at the population level [41]. In addition, our findings reinforce the importance of diagnostics and the management of NERD, nasal polyps, asthma and other comorbidities in preventing uncontrolled CRS.

In terms of clinical implications, our findings may prove relevant to patient counselling, follow-up and treatment planning, such as planning biological therapy [12]. However, validation studies for these results remain necessary. Personalised risk communication has previously proven effective in clinical decision-making, such as in COVID-19 diagnostics [42], patient selection for cardiac resynchronisation therapy [43] and in organising follow-up for patients receiving adjuvant endocrine therapy [44].

To our knowledge, machine learning models have not been previously used to predict revision ESS among CRS patients. Machine learning, however, has previously been used in allergology and related research [45], including in the prediction of persistent early childhood asthma [46], eosinophilic esophagitis [47], eosinophilic CRS [48] or osteomeatal complex inflammation [49]. In addition, machine learning has found applications in predicting postoperative outcomes for degenerative cervical myelopathy [50], revision surgery following knee replacement [51], prolonged opioid prescription following surgery for lumbar disc herniation [52], blood transfusion following adult spinal deformity surgery [53], surgical infections [54] and olfactory recovery after ESS [55]. None of these previous studies, however, have presented models designed to predict revision ESS at the individual level. Revision ESS risk has previously been studied at the population level relying instead on traditional statistical models such as Cox’s proportional hazard models [13, 15, 16] or logistic regression models [14, 15, 18, 20]. Such studies have assumed associations are linear and that an alpha error <5% indicates the importance of a predictor.

We found that a greater number of visits, a higher frequency of visits and a shorter time period between the baseline visit and baseline ESS all associated with revision ESS. This might reflect that patients with a high number of visits exhibit more uncontrolled CRS and thus may ultimately undergo revision ESS. In other words, our findings suggest that an increasing number of visits before ESS might signal more severe disease, which affects not just the physician’s but also the patient’s decision regarding ESS at baseline as well as the revision ESS during follow-up. These results indicate that patients who achieved control of disease following baseline ESS did not require further follow-up visits through tertiary care centres and were discharged from hospital follow-up. Patients with ongoing problems, however, tend to visit the clinic more frequently and exhibit a higher probability of ultimately undergoing revision ESS. We found little evidence in the literature on the predictive potential of visits at the individual level. A retrospective cohort study from the USA (n = 6985) revealed that the number of postoperative outpatient visits associated with revision surgery for anterior cruciate ligament reconstructions [56], a finding similar to ours, albeit for a different type of surgery and at the population level. Our finding that patients with a higher frequency of visits at baseline exhibit a higher risk, which is only partially controlled by surgery, might prove helpful when counselling patients.

Our study also showed that CRSwNP, asthma and NERD represent important predictors of revision ESS at the individual level. In accordance with this, previous studies demonstrated at the hospital population level that several factors associate with CRS recurrence and/or revision ESS, including CRSwNP, asthma, allergic rhinitis, NERD, eosinophilia and smoking [1, 13, 57, 58]. CRSwNP patients with a comorbidity of asthma and/or NERD carry an increased risk for recurrence and revision ESS, although these patients appear to benefit from an initial ESS [19, 41, 59–61]. This finding may reflect more severe disease, typically presenting with comorbidities for NERD, anosmia, Type 2 high eosinophilic inflammation and a higher likelihood of polyp regrowth [5, 57, 62–70]. In SFS, immunodeficiency or a suspicion of it also emerged as one of the top ten predictors in all three classifiers. Immunodeficiency increases the risk of infectious exacerbations and uncontrolled CRS, thereby also increasing the risk of revision ESS. This agrees with a previous study that demonstrated that at the hospital population level immunodeficiency and granulomatosis with polyangiitis increase the revision ESS risk [71]. While the variable ‘suspicion of immunodeficiency’ is not the same as a diagnosed immunodeficiency, it might indirectly reflect a similar situation regarding poor CRS control, leaving a physician to suspect a rare comorbidity or allowing consideration for the need of revision ESS.

We also demonstrated that a longer EHR data collection period increased the predictive accuracy of the models. The time period for data collection from the baseline visit until 12 months following the baseline ESS carried the highest predictive accuracy in our models. Choosing the data collection interval for the model therefore means balancing the time that must elapse after baseline ESS against model accuracy.
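
The predictive accuracy compared across data collection periods was reported in this study as AUROC. As a hedged illustration of what that number measures, the sketch below computes AUROC directly from its pairwise definition: the probability that a randomly chosen patient with revision ESS receives a higher risk score than a randomly chosen patient without it, with ties counted as one half. The function name `auroc` and all labels and scores are invented for illustration and are not study results.

```python
def auroc(labels, scores):
    """AUROC via pairwise comparison of positive and negative scores."""
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive case ranked above negative case
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


# Invented outcome labels (1 = revision ESS) and risk scores from two
# hypothetical models trained on shorter and longer collection periods.
revised = [0, 0, 0, 1, 1]
scores_0m = [0.10, 0.30, 0.20, 0.25, 0.15]
scores_12m = [0.10, 0.30, 0.20, 0.35, 0.25]

print(auroc(revised, scores_0m), auroc(revised, scores_12m))
```

The pairwise loop is quadratic in the number of patients, which is acceptable for a cohort of a few hundred but would normally be replaced by a rank-based computation in a library implementation.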

We validated the predictive accuracy using three classifiers. To do so, we chose logistic regression, gradient boosting and random forest classifiers since they possess different properties and have generally been used in predicting surgical outcomes [50, 72] or persistent asthma [46]. The logistic regression classifier is linear and thus incapable of modelling possible nonmonotonic and nonlinear relationships between predictors and outcomes [73]. The random forest and gradient boosting classifiers can model complex relationships, but they represent so-called black box models, meaning that they are uninterpretable classifiers in which the relationships between inputs and results are difficult to interpret directly from the parameters or the structure of the trained model [73]. Since the predictive accuracy of the variables was similar across the three classifiers in our study, we used logistic regression primarily to validate the variable collection time period. Overall, our findings indicate the importance of validating outcome prediction using different classifiers and evaluating the effect of the data collection time period, as suggested in previous studies [74, 75]. By evaluating different classifiers, we found that a simple and interpretable logistic regression model may prove adequate for clinical application. However, if modelling requires nonlinear relationships, then random forest or gradient boosting models can be used. Classification performance proved comparable across all classifiers.
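
The limitation of logistic regression noted above can be made explicit with a short derivation. It assumes that each predictor enters the model as a raw term, without polynomial, spline or interaction terms; the symbols p, x_j and β_j are generic notation introduced here, not taken from the study.

```latex
\log\frac{p(x)}{1 - p(x)} = \beta_0 + \sum_{j} \beta_j x_j
\quad\Longrightarrow\quad
\frac{\partial p(x)}{\partial x_j} = \beta_j \, p(x)\,\bigl(1 - p(x)\bigr)
```

Because p(x)(1 − p(x)) is always positive, the sign of the derivative equals the sign of β_j for every patient. Under these assumptions the predicted risk can only rise or only fall as a predictor such as age increases, so a pattern that rises and then falls cannot be represented.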

Revision ESS risk was previously studied at the population level using Cox’s proportional hazard [13, 15, 16] or logistic regression [14, 15, 18, 20] models, which usually assume associations are linear and that an alpha error <5% indicates the importance of a predictor. Using these assumptions, previous studies have demonstrated that a younger age associated with revision ESS [13, 66]. We found that age actually affects revision ESS risk in a nonmonotonic manner, thus indicating that machine learning improves the prediction potential of age in revision ESS risk. Similarly, nonlinear approaches have significantly improved the prediction of stroke risk [76].

Both our own and previous study groups have examined populations of CRSwNP [66] or CRS [13] patients. In our study, we found that age actually affects revision ESS risk in a nonmonotonic manner. Thus, logistic regression models appear less than ideal for examining the impact of the individual patient’s age on revision ESS risk. By performing partial dependence plot analyses, we showed that the revision ESS risk was highest for patients aged 30 to 70 years, and medium high for patients older than 70 years, whereas the risk was lowest among patients aged 16 to 30 years. One possible explanation is that younger patients experience less CRSwNP, or that CRSwNP among such patients often comprises antrochoanal polyps, which carry a smaller revision surgery risk [1].

Furthermore, the number of visits before baseline ESS carried nonlinear effects as a predictor in our study. Patients logging 10 to 25 clinical visits between the baseline visit and baseline ESS exhibited a lower risk for revision ESS than patients with fewer than 10 or more than 25 clinic visits. Those patients visiting the clinic 10 to 25 times before baseline ESS may have CRSsNP with acute recurrent exacerbations. However, this subgroup warrants further study in order to confirm this assumption, since the number of subjects in our study was rather small. We can speculate that some physicians may schedule more frequent follow-up visits even with sufficient disease control. That said, consistent practices have been employed in our hospital: clinical visit frequencies are closely monitored and routine follow-up visits are not scheduled. Thus, we argue that a visit frequency ≥2 per year reflects relatively poor disease control. Previous studies found that CRSwNP patients with recurrent acute rhinosinusitis episodes benefit from an initial ESS [1]. Previous studies on other conditions and on other predictors revealed a U-shaped association between the predictor variable and outcome, including associations between intraoperative net fluid balance and early atrial tachyarrhythmia recurrence [77] as well as between body mass index and asthma in Japanese children [78]. These findings highlight the importance of evaluating the linearity of associations to improve their personalised predictive value.

The strengths of this study include the random sample of hospital patients, the long follow-up time period we captured and the discovery of nonlinear associations between certain variables and outcomes. In addition, the novelty of this study lies in the validation of models employing several classifiers, which were also tested at the individual level.

We should also mention several limitations to our study, which include changes which occurred in ESS and CRS care during the sampling time period. To minimise the impact of any possible chronological or seasonal bias, we spread the sampling time over several baseline years (2005, 2007, 2009, 2011 and 2013) and each month during the baseline year. Patients with recurrence may have sought treatment elsewhere, although this potential bias was minimal since over 90% of ESS are performed in public healthcare settings [79]. In this study, we were authorised to extract data from a relatively small number of patients. However, this limitation was addressed by using cross-validation methods. Unfortunately, we did not process time series variables. Thus, recurrent neural network type models such as long short-term memory (LSTM) or bidirectional LSTM could not be used to predict revision ESS risk. EHR data have been available in our hospital since 2005. We acknowledge that the baseline ESS does not always indicate the first ESS. As such, we lacked data for possible earlier ESS, which we have previously shown to affect the revision ESS risk on a population level [13]. Furthermore, data were lacking for some other important factors, such as postoperative treatment, validated symptoms, endoscopic nasal polyp score, medication, the Lund Mackay score for sinus computed tomography scans, smoking status, eosinophils and the extent of baseline ESS. Yet, some of these variables, such as smoking [13] or total ethmoidectomy, have not emerged as strong predictors of revision ESS compared with Type 2 high diseases [57] in our previous studies. That said, we acknowledge that the inclusion of more variables and additional cases would most likely improve our estimates. Therefore, before extrapolating our results to clinical practice, replication studies in other populations and with additional variables are needed.

Conclusions

Our results indicate that Type 2-high conditions (CRSwNP, asthma and NERD), a high clinical visit frequency, a short time interval between the baseline clinic visit and ESS and immunodeficiency or a suspicion of it increase the likelihood of revision ESS at the individual level. Moreover, age and the number of preoperative clinical visits predict a nonlinear revision ESS risk. Although these findings require validation in other populations, our results reinforce the importance of diagnostics and the management of NERD, CRSwNP, asthma and other comorbidities to prevent uncontrolled CRS, and carry relevancy for patient counselling specifically.

Supporting information

S1 File. Procedure and ICD10-codes.

List of procedure and ICD10 codes that were used for identifying ESS patients and chronic diseases.

https://doi.org/10.1371/journal.pone.0267146.s001

(PDF)

S2 File. Key words mining from clinical texts.

A keyword-based information extraction method was used for processing variables from free clinical texts.

https://doi.org/10.1371/journal.pone.0267146.s002

(PDF)

S3 File. Performance of baseline machine learning classifier.

Baseline performance values for machine learning classifiers are presented. The values were calculated by training and testing the classifiers when the labels of data were randomized.

https://doi.org/10.1371/journal.pone.0267146.s003

(PDF)

S4 File. Performance of ensemble classifier.

Performance values for ensemble machine learning classifiers are presented. The values were calculated by training and testing the ensemble classifier of logistic regression, random forest and gradient boosting.

https://doi.org/10.1371/journal.pone.0267146.s004

(PDF)

S5 File. Hyperparameter values for machine learning models.

The hyperparameter values identified by grid search in the machine learning model comparison study are presented.

https://doi.org/10.1371/journal.pone.0267146.s005

(PDF)

Acknowledgments

Open access funded by Helsinki University Library.

References

1. Fokkens W, Lund V, Hopkins C, Hellings P, Kern R, Reitsma S, et al. European Position Paper on Rhinosinusitis and Nasal Polyps 2020. Rhinology journal. 2020;58:1–464. pmid:32077450
2. Liao B, Liu JX, Li ZY, Zhen Z, Cao PP, Yao Y, et al. Multidimensional endotypes of chronic rhinosinusitis and their association with treatment outcomes. Allergy. 2018;73(7):1459–1469. pmid:29331025
3. Wei B, Liu F, Zhang J, Liu Y, Du J, Liu S, et al. Multivariate analysis of inflammatory endotypes in recurrent nasal polyposis in a Chinese population. Rhinology. 2018;56(3):216–226. pmid:29785413
4. Kowalski ML, Agache I, Bavbek S, Bakirtas A, Blanca M, Bochenek G, et al. Diagnosis and management of NSAID-Exacerbated Respiratory Disease (N-ERD)–a EAACI position paper. Allergy. 2019;74(1):28–39. pmid:30216468
5. Lyly A, Laulajainen-Hongisto A, Turpeinen H, Vento SI, Myller J, Numminen J, et al. Factors affecting upper airway control of NSAID-exacerbated respiratory disease: A real-world study of 167 patients. Immunity, Inflammation and Disease. 2021;9(1):80–89. pmid:33400396
6. Toppila-Salmi S, Lemmetyinen R, Chanoine S, Karjalainen J, Pekkanen J, Bousquet J, et al. Risk Factors of Severe Adult-onset Asthma: A Multi-factor Approach; 2021. pmid:34238263
7. Rudmik L, Soler ZM, Mace JC, Schlosser RJ, Smith TL. Economic evaluation of endoscopic sinus surgery versus continued medical therapy for refractory chronic rhinosinusitis. The Laryngoscope. 2015;125(1):25–32. pmid:25186499
8. Chang CC, Tai CJ, Ng TY, Tsou YA, Tsai MH. Can FESS Combined with Submucosal Resection (SMR)/Septoplasty Reduce Revision Rate? Otolaryngology–Head and Neck Surgery. 2014;151(4):700–705. pmid:25146305
9. Hopkins C, Browne JP, Slack R, Lund V, Topham J, Reeves B, et al. The national comparative audit of surgery for nasal polyposis and chronic rhinosinusitis. Clinical Otolaryngology. 2006;31(5):390–398. pmid:17014448
10. Anderson WC, Szefler SJ. Cost-effectiveness and comparative effectiveness of biologic therapy for asthma: To biologic or not to biologic. Annals of allergy, asthma and immunology. 2019;122(4):367–372. pmid:30703438
11. Guo C, Liao B, Liu J, Pan L, Z L. Predicting difficult-to-treat chronic rhinosinusitis by noninvasive biological markers. Rhinology. 2021;59(1):81–90. pmid:32974623
12. Lyly A, Laulajainen-Hongisto A, Gevaert P, Kauppi P, Toppila-Salmi S. Monoclonal Antibodies and Airway Diseases. International Journal of Molecular Sciences. 2020;21(24). pmid:33322143
13. Koskinen A, Salo R, Huhtala H, Myller J, Rautiainen M, Kääriäinen J, et al. Factors affecting revision rate of chronic rhinosinusitis. Laryngoscope Investigative Otolaryngology. 2016;1(4):96–105. pmid:28894807
14. Rudmik L, Soler ZM, Hopkins C. Using postoperative SNOT-22 to help predict the probability of revision sinus surgery. Rhinology. 2016;54(2):111–116. pmid:26800771
15. Stein NR, Jafari A, DeConde AS. Revision rates and time to revision following endoscopic sinus surgery: A large database analysis. The Laryngoscope. 2018;128(1):31–36. pmid:28688189
16. Smith KA, Orlandi RR, Oakley G, Meeks H, Curtin K, Alt JA. Long-term revision rates for endoscopic sinus surgery. International Forum of Allergy & Rhinology. 2019;9(4):402–408. pmid:30570840
17. Younis RT, Ahmed J. Predicting revision sinus surgery in allergic fungal and eosinophilic mucin chronic rhinosinusitis. The Laryngoscope. 2017;127(1):59–63. pmid:27577717
18. Loftus CA, Soler ZM, Desiato VM, Koochakzadeh S, Yoo F, Storck KA, et al. Factors impacting revision surgery in patients with chronic rhinosinusitis with nasal polyposis. International Forum of Allergy & Rhinology. 2020;10(3):289–302. pmid:31943850
19. Loftus CA, Soler ZM, Koochakzadeh S, Desiato VM, Yoo F, Nguyen SA, et al. Revision surgery rates in chronic rhinosinusitis with nasal polyps: meta-analysis of risk factors. International Forum of Allergy & Rhinology. 2020;10(2):199–207. pmid:31752051
20. Veloso-Teles R, Cerejeira R. Endoscopic Sinus Surgery for Chronic Rhinosinusitis with Nasal Polyps: Clinical Outcome and Predictive Factors of Recurrence. American Journal of Rhinology & Allergy. 2017;31(1):56–62. pmid:28234156
21. Mueller S, Wendler O, Nocera A, Grundtner P, Schlegel P, Agaimy A, et al. Escalation in mucus cystatin 2, pappalysin‐A, and periostin levels over time predict need for recurrent surgery in chronic rhinosinusitis with nasal polyps. International Forum of Allergy & Rhinology. 2019;9:1212–1219. pmid:31430426
22. Morrissey DK, Bassiouni A, Psaltis AJ, Naidoo Y, Wormald PJ. Outcomes of modified endoscopic Lothrop in aspirin-exacerbated respiratory disease with nasal polyposis. International Forum of Allergy & Rhinology. 2016;6(8):820–825. pmid:26992029
23. Olson RS, Cava WL, Mustahsan Z, Varik A, Moore JH. In: Data-driven advice for applying machine learning to bioinformatics problems;. p. 192–203. Available from: https://www.worldscientific.com/doi/abs/10.1142/9789813235533_0018.
24. Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, Van Calster B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. Journal of Clinical Epidemiology. 2019;110:12–22. pmid:30763612
25. Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, et al. From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence. 2020;2(1):2522–5839. pmid:32607472
26. Friedman JH. Greedy function approximation: A gradient boosting machine. The Annals of Statistics. 2001;29(5):1189–1232.
27. Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning. New York, NY, USA: Springer-Verlag; 2009.
28. Xin L, Zhu M. Stochastic Stepwise Ensembles for Variable Selection. Journal of Computational and Graphical Statistics. 2012;21(2):275–294.
29. Cheng L, Zhu M, Poss JW, Hirdes JP, Glenny C, Stolee P. Opinion versus practice regarding the use of rehabilitation services in home care: an investigation using machine learning algorithms. BMC Medical Informatics and Decision Making. 2015;15(1):1–11. pmid:26453354
30. Liu N, Koh ZX, Goh J, Lin Z, Haaland B, Ting BP, et al. Prediction of adverse cardiac events in emergency department patients with chest pain using machine learning for variable selection. BMC Medical Informatics and Decision Making. 2014;14(1):75. pmid:25150702
31. Roelen C, Bültmann U, Rhenen W, Klink J, Twisk J, Heymans M. External validation of two prediction models identifying employees at risk of high sickness absence: Cohort study with 1-year follow-up. BMC public health. 2013;13:105. pmid:23379546
32. Fischer J, Bachmann L, Jaeschke R. A readers’ guide to the interpretation of diagnostic test properties: clinical example of sepsis. Intensive Care Medicine. 2003;29:1043–1051. pmid:12734652
33. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research. 2011;12:2825–2830.
34. Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD’16. New York, NY, USA: ACM; 2016. p. 785–794. Available from: http://doi.acm.org/10.1145/2939672.2939785.
35. Raschka S. Mlxtend; 2016. Available from: http://dx.doi.org/10.5281/zenodo.49235.
36. van der Walt S, Colbert SC, Varoquaux G. The NumPy Array: A Structure for Efficient Numerical Computation. Computing in Science Engineering. 2011;13(2):22–30.
37. McKinney W. Data Structures for Statistical Computing in Python. In: van der Walt S, Millman J, editors. Proceedings of the 9th Python in Science Conference; 2010. p. 56–61.
38. Lundberg SM, Lee SI. A Unified Approach to Interpreting Model Predictions. In: Guyon I, Luxburg UV, Bengio S, Wallach H, Fergus R, Vishwanathan S, et al., editors. Advances in Neural Information Processing Systems 30. Curran Associates, Inc.; 2017. p. 4765–4774. Available from: http://papers.nips.cc/paper/7062-a-unified-approach-to-interpreting-model-predictions.pdf.
39. Jiangchun L. PDPbox: python partial dependence plot toolbox; 2018. Available from: https://github.com/SauceCat/PDPbox.
40. Manuel DG, Abdulaziz KE, Perez R, Beach S, Bennett C. Personalized risk communication for personalized risk assessment: Real world assessment of knowledge and motivation for six mortality risk measures from an online life expectancy calculator. Informatics for Health and Social Care. 2018;43(1):42–55. pmid:28068156
41. Mendelsohn D, Jeremic G, Wright ED, Rotenberg BW. Revision Rates after Endoscopic Sinus Surgery: A Recurrence Analysis. Annals of Otology, Rhinology & Laryngology. 2011;120(3):162–166. pmid:21510141
42. Delafiori J, Navarro LC, Siciliano RF, de Melo GC, Busanello ENB, Nicolau JC, et al. Covid-19 Automated Diagnosis and Risk Assessment through Metabolomics and Machine Learning. Analytical Chemistry. 2021;93(4):2471–2479. pmid:33471512
43. Hu SY, Santus E, Forsyth AW, Malhotra D, Haimson J, Chatterjee NA, et al. Can machine learning improve patient selection for cardiac resynchronization therapy? PLOS ONE. 2019;14(10):1–13. pmid:31581234
44. Harrell M, Levy M, Fabbri D. Supervised Machine Learning to Predict Follow-Up Among Adjuvant Endocrine Therapy Patients. In: 2017 IEEE International Conference on Healthcare Informatics (ICHI); 2017. p. 490–495.
45. Juhn Y, Liu H. Natural language processing to advance EHR-based clinical research in Allergy, Asthma, and Immunology. Journal of Allergy and Clinical Immunology. 2019;145.
46. Bose S, Kenyon CC, Masino AJ. Personalized prediction of early childhood asthma persistence: A machine learning approach. PLOS ONE. 2021;16(3):1–17. pmid:33647071
47. Sallis BF, Erkert L, Moñino-Romero S, Acar U, Wu R, Konnikova L, et al. An algorithm for the classification of mRNA patterns in eosinophilic esophagitis: Integration of machine learning. Journal of Allergy and Clinical Immunology. 2018;141(4):1354–1364.e9. pmid:29273402
48. Thorwarth R, Scott D, Lal D, MJ MM. Machine learning of biomarkers and clinical observation to predict eosinophilic chronic rhinosinusitis: a pilot study. Int Forum Allergy Rhinol. 2021;1:8–15. pmid:32472743
49. Chowdhury NI, Smith TL, Chandra RK, Turner JH. Automated classification of osteomeatal complex inflammation on computed tomography using convolutional neural networks. International Forum of Allergy & Rhinology. 2019;9(1):46–52. pmid:30098123
50. Merali ZG, Witiw CD, Badhiwala JH, Wilson JR, Fehlings MG. Using a machine learning approach to predict outcome after surgery for degenerative cervical myelopathy. PLOS ONE. 2019;14(4):1–12. pmid:30947300
51. Aram P, Trela-Larsen L, Sayers A, Hills A, Blom A, McCloskey E, et al. Estimating an Individual’s Probability of Revision Surgery After Knee Replacement: A Comparison of Modeling Approaches Using a National Data Set. American journal of epidemiology. 2018;187. pmid:29893799
52. Karhade AV, Ogink PT, Thio QCBS, Cha TD, Gormley WB, Hershman SH, et al. Development of machine learning algorithms for prediction of prolonged opioid prescription after surgery for lumbar disc herniation. The Spine Journal. 2019;19(11):1764–1771. pmid:31185292
53. Durand WM, DePasse JM, Daniels AH. Predictive Modeling for Blood Transfusion After Adult Spinal Deformity Surgery. Spine. 2018;43(15):1058–1066. pmid:29215501
54. da Silva DA, ten Caten CS, dos Santos RP, Fogliatto FS, Hsuan J. Predicting the occurrence of surgical site infections using text mining and machine learning. PLOS ONE. 2019;14(12):1–17. pmid:31834905
55. Katotomichelakis M, Gouveris H, Tripsianis G, Simopoulou M, Papathanassiou J, Danielides V. Biometric predictive models for the evaluation of olfactory recovery after endoscopic sinus surgery in patients with nasal polyposis. American journal of rhinology and allergy. 2010;24(4):276–280. pmid:20819466
56. Inacio MCS, Cafri G, Funahashi TT, Maletis GB, Paxton EW. Type and frequency of healthcare encounters can predict poor surgical outcomes in anterior cruciate ligament reconstruction patients. International Journal of Medical Informatics. 2016;90:32–39. pmid:27103195
57. Lilja MJ, Koskinen A, Virkkula P, Vento SI, Myller J, Hammaren-Malmi S, et al. Factors Affecting the Control of Chronic Rhinosinusitis With Nasal Polyps: A Comparison in Patients With or Without NERD. Allergy & Rhinology. 2021;12:21526567211003844.
58. Virkkula P, Penttila E, Vento S, Myller J, Koskinen A, Hammaren-Malmi S, et al. Assessing Cut-off Points of Eosinophils, Nasal Polyp, and Lund-Mackay Scores to Predict Surgery in Nasal Polyposis: A Real-World Study. Allergy and Rhinology. 2020;11. pmid:35141001
59. Zhang L, Zhang Y, Gao Y, Wang K, Lou H, Meng Y, et al. Long-term outcomes of different endoscopic sinus surgery in recurrent chronic rhinosinusitis with nasal polyps and asthma. Rhinology. 2020;58(2). pmid:31904028
60. van der Veen J, Seys SF, Timmermans M, Levie P, Jorissen M, Fokkens WJ, et al. Real-life study showing uncontrolled rhinosinusitis after sinus surgery in a tertiary referral centre. Allergy. 2017;72(2):282–290. pmid:27392210
61. Laulajainen-Hongisto A, Turpeinen H, Vento SI, Numminen J, Sahlman J, Kauppi P, et al. High Discontinuation Rates of Peroral ASA Treatment for CRSwNP: A Real-World Multicenter Study of 171 N-ERD Patients. The Journal of Allergy and Clinical Immunology: In Practice. 2020;8(10):3565–3574. pmid:32693216
62. Steinke JW, Payne SC, Borish L. Eosinophils and Mast Cells in Aspirin-Exacerbated Respiratory Disease. Immunology and Allergy Clinics of North America. 2016;36(4):719–734. pmid:27712766
63. Brescia G, Barion U, Zanotti C, Giacomelli L, Martini A, Marioni G. The prognostic role of serum eosinophil and basophil levels in sinonasal polyposis. International Forum of Allergy & Rhinology. 2017;7(3):261–267. pmid:27992119
64. Lou H, Meng Y, Piao Y, Wang C, Zhang L, Bachert C. Predictive Significance of Tissue Eosinophilia for Nasal Polyp Recurrence in the Chinese Population. American Journal of Rhinology & Allergy. 2015;29(5):350–356. pmid:26219765
65. Brescia G, Pedruzzi B, Barion U, Cinetto F, Giacomelli L, Martini A, et al. Are neutrophil-, eosinophil-, and basophil-to-lymphocyte ratios useful markers for pinpointing patients at higher risk of recurrent sinonasal polyps? American Journal of Otolaryngology. 2016;37(4):339–345. pmid:27045767
66. Brescia G, Marioni G, Franchella S, Ramacciotti G, Giacomelli L, Marino F, et al. A prospective investigation of predictive parameters for post-surgical recurrences in sinonasal polyposis. Archiv für Klinische und Experimentelle Ohren- Nasen- und Kehlkopfheilkunde. 2016;273:655–660. pmid:25763571
67. Tao X, Chen F, Sun Y, Wu S, Hong H, Shi J, et al. Prediction models for postoperative uncontrolled chronic rhinosinusitis in daily practice. The Laryngoscope. 2018;128(12):2673–2680. pmid:30295929
68. Vlaminck S, Vauterin T, Hellings PW, Jorissen M, Acke F, Cauwenberge PV, et al. The Importance of Local Eosinophilia in the Surgical Outcome of Chronic Rhinosinusitis: A 3-Year Prospective Observational Study. American Journal of Rhinology & Allergy. 2014;28(3):260–264. pmid:24980239
69. Nakayama T, Yoshikawa M, Asaka D, Okushi T, Matsuwaki Y, Otori N, et al. Mucosal eosinophilia and recurrence of nasal polyps–new classification of chronic rhinosinusitis. Rhinology. 2011;49(4):392–396. pmid:21991563
70. Ikeda K, Shiozawa A, Ono N, Kusunoki T, Hirotsu M, Homma H, et al. Subclassification of chronic rhinosinusitis with nasal polyp based on eosinophil and neutrophil. The Laryngoscope. 2013;123(11):E1–E9. pmid:23670893
71. Miglani A, Divekar RD, Azar A, Rank MA, Lal D. Revision endoscopic sinus surgery rates by chronic rhinosinusitis subtype. International Forum of Allergy & Rhinology. 2018;8(9):1047–1051. pmid:29851243
72. Jalali A, Lonsdale H, Do N, Peck J, Gupta M, Kutty S, et al. Deep Learning for Improved Risk Prediction in Surgical Outcomes. Scientific Reports. 2020;10:9289. pmid:32518246
73. Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning. Springer Series in Statistics. New York, NY, USA: Springer New York Inc.; 2008.
74. Nuutinen M, Leskelä RL, Suojalehto E, Tirronen A, Komssi V. Development and validation of classifiers and variable subsets for predicting nursing home admission. BMC Medical Informatics and Decision Making. 2017;39.
75. Hamerla G, Meyer HJ, Schob S, Ginat DT, Altman A, Lim T, et al. Comparison of machine learning classifiers for differentiation of grade 1 from higher gradings in meningioma: A multicenter radiomics study. Magnetic Resonance Imaging. 2019;63:244–249. pmid:31425811
76. Orfanoudaki A, Chesley E, Cadisch C, Stein B, Nouh A, Alberts MJ, et al. Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score. PLOS ONE. 2020;15(5):1–20.
77. Minami K, Kabata D, Kakuta T, Fukushima S, Fujita T, Shintani A, et al. U-Shaped Association Between Intraoperative Net Fluid Balance and Risk of Postoperative Recurrent Atrial Tachyarrhythmia Among Patients Undergoing the Cryo-Maze Procedure: An Observational Study. Journal of Cardiothoracic and Vascular Anesthesia. 2020. pmid:33158709
78. Tanaka K, Miyake Y, Arakawa M, Sasaki S, Ohya Y. U-Shaped Association between Body Mass Index and the Prevalence of Wheeze and Asthma, but not Eczema or Rhinoconjunctivitis: The Ryukyus Child Health Study. The Journal of asthma: official journal of the Association for the Care of Asthma. 2011;48:804–10.
79. Toppila-Salmi S, Rihkanen H, Arffman M, Manderbacka K, Keskimaki I, Hytönen M. Regional differences in endoscopic sinus surgery in Finland: A nationwide register-based study. BMJ Open. 2018;8:e022173. pmid:30341122
