Figures
Abstract
Background
Generalized regression neural network (GRNN) and logistic regression (LR) are extensively used in the medical field; however, the better model for predicting stroke outcome has not been established. The primary goal of this study was to compare the accuracies of GRNN and LR models to identify the most optimal model for the prediction of acute stroke outcome, as well as explore useful biomarkers for predicting the prognosis of acute stroke patients.
Method
In a single-center study, 216 (80% for the training set and 20% for the test set) acute stroke patients admitted to the Shenzhen Second People’s Hospital between December 2019 to June 2021 were retrospectively recruited. The functional outcomes of the patients were measured using Barthel Index (BI) on discharge. A training set was used to optimize the GRNN and LR models. The test set was utilized to validate and compare the performances of GRNN and LR in predicting acute stroke outcome based on the area under the receiver operating characteristic curve (AUROC), accuracy, sensitivity, and the Kappa value.
Result
The LR analysis showed that age, the National Institute Health Stroke Scale score, BI index, hemoglobin, and albumin were independently associated with stroke outcome. After validating in test set using these variables, we found that the GRNN model showed a better performance based on AUROC (0.931 vs 0.702), sensitivity (0.933 vs 0.700), specificity (0.889 vs 0.722), accuracy (0.896 vs 0.729), and the Kappa value (0.775 vs 0.416) than the LR model.
Conclusion
Overall, the GRNN model demonstrated superior performance to the LR model in predicting the prognosis of acute stroke patients. In addition to its advantage in not affected by implicit interactions and complex relationship in the data. Thus, we suggested that GRNN could be served as the optimal statistical model for acute stroke outcome prediction. Simultaneously, prospective validation based on more variables of the GRNN model for the prediction is required in future studies.
Citation: Qu S, Zhou M, Jiao S, Zhang Z, Xue K, Long J, et al. (2022) Optimizing acute stroke outcome prediction models: Comparison of generalized regression neural networks and logistic regressions. PLoS ONE 17(5): e0267747. https://doi.org/10.1371/journal.pone.0267747
Editor: M. Shamim Kaiser, Jahangirnagar University, BANGLADESH
Received: August 15, 2021; Accepted: April 16, 2022; Published: May 11, 2022
Copyright: © 2022 Qu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: This work was supported by the Guangdong Basic and Applied Basic Research Foundation (Grant No. 2020A1515111062, http://gdstc.gd.gov.cn) and Shenzhen Second People’s Hospital Clinical Research Foundation (Grant No. 20203357021, 20203357019, 20203357022, http://www.szrch.com) to MCZ. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Stroke is among the leading causes of morbidity and mortality globally [1]. Up to 70% of patients have a persistent disability, and more than 40% have a severe disability, which places tremendous economic burdens on the families of patients and society [2]. To mitigate pressure on the healthcare system, assist medical professionals in making optimal clinical decisions [3], assist therapists in setting realistic therapeutic goals, improve the quality of life and life expectancy of acute stroke patients [4], as well as inform educated decisions on posthospital service needs for patients [5], accurate prognostic prediction is imperative. Besides, accurate outcome predictions would also assist policymakers and care providers in setting appropriate healthcare goals and optimizing the allocation of medical resources [6].
Studies on the clinical outcomes of patients have been at the forefront of healthcare research. Logistic regression (LR) and artificial neural networks (ANNs) have been used to evaluate prognosis for patients with stroke [7, 8]. The application of predictive models in clinical practice requires high standards: they must feature simple calculations and obvious structure and be tested with an independent dataset for simplicity, applicability, and reliability [9]. However, these models have advantages and disadvantages. Some are too complex for use in the clinical context, and others are susceptible to bias [10]. Consequently, establishing a reliable and feasible model for stroke prognosis prediction is an important challenge for rehabilitation physicians [6].
LR is commonly used for developing predictive models for dichotomous outcomes in medicine [11]. Its popularity may be attributed to the determination of the relationship between dependent variables and one or more nominal, ordered, interval, or ratio-level predictors and understanding the effects of predictors on outcome variables [8, 12]. However, it has several limitations: it is limited to linear relationships and it is prone to similarity and variance errors [9]. Therefore, the application of LR to the prediction of prognosis in patients with acute stroke has limitations [11].
ANN is a relatively advanced computational model used in bio-medicine and medical fields, and it is a mathematical model inspired by biological neural networks [13]. ANN simulates the problem-solving process of the human brain by applying knowledge gained from experience to build a system of "neurons" that can make new decisions, classifications, and predictions [9]. At present, four main neural network models are used in clinical practice: the back-propagation neural network (BPNN), generalized regression neural network (GRNN), fuzzy neural network, and probabilistic neural network [14]. The BPNN has been widely used [15] but suffers from some limitations, such as slow convergence speed, a habit of falling into a local optimal solution, and difficulties with setting an optimal configuration [16]. More efficient architecture through the GRNN has been proposed, with better approximation capabilities, non-linear fitting, easy parameter setting, and accelerated learning, as compared with the BPNN [17]. Based on these characteristics, the GRNN is suitable for solving problems of non-linear function and is used for data regression and classification [16].
Reports on the application of GRNNs to the prognostic prediction of diseases have been increasing. One study investigating antimalarial activities of selected therapies against Plasmodium falciparum found the prediction accuracy of a GRNN model for test sets to be 88.9% and that of a support vector machine to be 87.3%, which were both higher than that of a traditional LR model [18]. However, there are only a few studies on the value of a GRNN model in predicting acute stroke outcomes. Therefore, this study aimed to compare the prediction accuracy of acute stroke outcome using the GRNN and LR models and to identify the variables predictive of the prognosis of acute stroke patients from clinical data.
Materials and methods
Ethics statement
This was a single-center retrospective study that used the data of a previous study on the value of grip strength on the unaffected side in patients with stroke [19]. In our previous study, we not only explored related indicators of cardiopulmonary function in stroke patients, but also assessed Barthel index (BI) scores on discharge. In the context, this data was further utilized to construct LR and GRNN models. The medical records on admission and the functional outcome data on discharge from stroke patients at the Rehabilitation Medicine Department of Shenzhen Second People’s Hospital between December 2019 and June 2021 were collected. This study was approved by the Ethics Committee of Shenzhen Second People’s Hospital (Ethics No. KS20191119005) and registered in the Chinese Clinical Trial Registry (Registration No. ChiCTR1900028228).
Study population
A total of 216 acute stroke patients who met the inclusion criteria were included in this study. The inclusion criteria were as follows: (a) age ≥18 years old; (b) onset within 90 days; (c) diagnosis of stroke based on history and physical examination and computed tomography (CT) findings; (d) ischemic (infarction of the central nervous system) or hemorrhagic (spontaneous, non-traumatic bleeding) stroke. The exclusion criteria were as follows: missing data, such as incomplete biochemical tests after admission, and loss to BI scores at discharge. All patients (or their family members) were provided complete written and oral information about all study procedures.
Study variables, outcomes on discharge, and missing data
The variables were screened based on previously published articles and clinical experience [6, 11, 20]. Accordingly, 25 clinical variables were used in this study, and the data of the 216 patients were randomly divided into training (n = 168, 80%) and test (n = 48, 20%) sets using the Kennard-Stone algorithm. These included demographic data: age (years), gender, body mass index (BMI) (kg/m2), National Institute Health Stroke Scale (NIHSS), Baseline Barthel Index (BI), and etiology; medical history of coronary heart disease, hypertension, diabetes and atrial fibrillation; social history (drinking and smoking); physical examination findings (systolic blood pressure, diastolic blood pressure, pulse); biochemical indicators: D-dimer (μg/mL), total cholesterol (mmol/L), triglyceride (mmol/L), high-density lipoprotein cholesterol (mmol/L), low-density lipoprotein cholesterol (mmol/L), hemoglobin (Hb, g/L), serum Na+ (mmol/L), serum K+ (mmol/L), creatinine (μmol/L), and albumin (g/L).
The baseline function and functional outcomes of acute stroke patients were measured using the BI scale on admission and discharge (28–30 days of hospitalization) in this study. BI is widely used in clinical prognosis assessment, and has been called the most common ADL assessment scale for adult rehabilitation [21, 22]. According to previous reports [23, 24], patients with a BI ≥60 on discharge were allocated to the good outcome group, while patients with a BI <60 were allocated to the poor outcome group. The BI scores of the patients on admission and discharge were evaluated by the same professional physiotherapists.
Data cleaning and preprocessing were performed before analysis. Discarding and forcibly replacing large amounts of missing data are unreasonable. Continuous covariates with less than 1% of missing digital data could be imputed with the mean value, but covariates with more than 1% of missing data were excluded. Missing categorical covariates were recorded as negative.
Development of the LR model
The LR model is used for classification based on certain predictors [12]. The dependent variable of LR is usually a binary variable, such as the outcomes (good or bad) of acute stroke patients [6]. For this model, the probability of a stroke outcome, given as, in relation to n prognosis-related covariates, X1, X2, X3, ⋯, is given as follows:
Here, βi is the intercept; β1, β2, and βi are the regression coefficients for the predictive variables X1, X2 and Xi. The regression coefficients indicate the contributions of the predictive variables to the results. The regression coefficients and their standard deviations are generally based on the least-squares fitting method. The LR model usually includes only important predictors, and variables are usually selected through some form of backward or forward stepwise logistic regression analyses [8]. The statistical significance of a variable was determined based on its coefficient, using p-values of <0.05. Then, the probability of an outcome was calculated using the logistic equation, and this was used to identify cases with poor prognoses in the test set.
Development of the GRNN model
The GRNN algorithm was first introduced by Specht [25] and considered to be used for classification problems where the target variable is continuous [26]. The algorithm does not reach the local minimum easily, and it has fewer adjustment parameters to be optimized. Therefore, it can be used to analyze large and unstable datasets [18]. The GRNN algorithm consists of four layers, as shown in Fig 1: input, pattern, summation, and output layers.
The variable definitions and interpretation are as follows:
: input vector of GRNN;
: output vector of GRNN;
n: the dimension of the vector variable X;
k: the dimension of vector variable Y;
In a GRNN model, the number of neurons in the input layer is equal to the dimension of the input vector [18].The input layer does not process the data but merely passes the input vector to the pattern layer. The number of neurons in the pattern layer is equal to that of the sample observations. Each neuron can be considered as a radial basis function and used to process a sample observation. The transfer function of the pattern layer, pi, serves as a Gaussian function as follows:
where Di represents the distance between the point of prediction X and the training sample Xi; σ is the width coefficient of the Gaussian function, a smoothness parameter; and m is the number of sample observations.
If the distance, Di, is large, becomes small. Therefore, the contribution of the training samples to the prediction is relatively small. On the other hand, if the distance, Di, between the point of prediction X and the training sample Xi is small,
will become large. Therefore, the contribution of the training samples to the prediction is relatively large.
The summation layer uses two separate types of neurons for summation. The first type of calculation is:
The second type of calculation is:
For the output layer, the number of neurons in the output layer is equal to the dimension of the output vector, and the output value of j is:
Statistics
The continuous and discrete variables are presented as mean ± standard deviation and median (interquartile range, IQR), respectively, and the categorical variables are presented as percentages. T-test of two independent samples and Mann-Whitney U test were used to compare the continuous and discrete variables, respectively. The categorical variables were analyzed using the chi-squared test. Significant variables from the univariate analysis were incorporated into the LR model, the non-significant variables were eliminated by backward selection, and the logistic regression model was finalized. The 95% confidence interval and the odds ratio were used to describe the relationships between variables. Hosmer–Lemeshow goodness of fit (χ2) was used to test the fit of the model.
A sensitivity analysis was used to evaluate the relative importance of input variables using the GRNN model and to rank the importance of variables. The global sensitivity of the input variables against the output variable was expressed as the ratio of the network error (sum of squares of residuals) with a given input omitted to the network error with the input included. The input variable with the highest global sensitivity was the variable with the greatest influence on the output variable.
The performance of the predictive model developed in our study was evaluated for discrimination ability using the test set; the discrimination ability was quantified using area under the curve (AUC) based on the actual BI scores of the acute stroke patients on discharge. Likewise, the accuracy, sensitivity, specificity, and the Kappa value were used to evaluate and compare the predictive performances of the LR and GRNN models.
We used SPSS version 26 (IBM, Armonk, NY, USA) to implement statistical analysis and construct the LR model. The GRNN model was generated using MATLAB 7.0. P<0.05 denoted statistical significance.
Results
Patient characteristics and univariate analysis
A total of 358 patients were screened; of these 11 patients were excluded due to missing data, 113 patients were excluded due to their non-acute stroke phase, and 18 patients were excluded due to lost the BI scores on discharge. In the end, 216 patients were included (168 patients in the training set and 48 patients in the test set) (Fig 2).
In the training set, 62.5% were for male patients with the mean age of 60.79±14.31 years; for the test set, 68.75% of the data were for male patients with a mean age of 58.44±14.90 years. (Table 1) In the training set, univariate analysis showed that gender, age, NIHSS, baseline BI, Hb, D-dimer, and albumin were significantly different in the good and poor outcome group of acute stroke patients (p<0.05). Compared with the patients with poor outcomes, the patients with good outcomes were much younger (58.01±14.40) and had greater BI scores on admission (58.97%), higher Hb (128.59±16.75) and albumin concentrations (39.83±3.74), and lower NIHSS (1.50 [0–5]) and D-dimer concentration (0.49 [0.31–0.87]) (Table 2).
Multiple logistic regression model
Multiple logistic regression was performed to analyze the independent associations among the indicators for the acute stroke outcomes. Based on the results of the univariate analysis, the dominant factors correlated with stroke outcomes, including age, NIHSS, Baseline BI, Hb, and albumin, were incorporated in the multiple logistic regression analysis. As shown in Table 3, age (OR = 1.039, 95% CI = [1.003, 1.076], p = 0.035), NIHSS (OR = 1.111, 95% CI = [1.038, 1.190], P = 0.002), Baseline BI (OR = 0.027, 95% CI = [0.007, 0.114], p = 0.000), Hb (OR = 0.972, 95% CI = [0.946, 0.997], p = 0.032), and albumin (OR = 1.154, 95% CI = [1.013, 1.314], p = 0.031) were independently associated with stroke outcome. The Hosmer-Lemeshow goodness of fit test result was 5.448 (P = 0.709) with 8 degrees of freedom. Furthermore, we derived the predictive equation using the training set with the following LR parameters (Pi is the probability of an acute stroke outcome):
Pi = 1+1/ (1 + e–X) = where X = −3.483 + 0.038 [age (years)] +0.106[NIHSS scores]– 3.061 [Baseline BI scores]– 0.029[Hemoglobin (g/L) + 0.143[Albumin (g/L)].
Comparison of the two models
The test set was used to compare the two models (Fig 3): the AUCs for the LR and GRNN models were 0.702 (95% CI: 0.553, 0.825) and 0.931 (95% CI: 0.820, 0.984), respectively. The LR and GRNN models had sensitivities of 0.700 (95% CI: 0.506, 0.853) and 0.933 (95% CI: 0.779, 0.992), specificities of 0.722 (95% CI: 0.465, 0.903) and 0.889 (95% CI: 0.653, 0.986), and accuracies of 0.729 and 0.896, respectively. Furthermore, the Kappa values for the LR and the GRNN models were 0.416 (95% CI: 0.149, 0.682) and 0.775 (95% CI: 0.589, 0.961), respectively (Table 4 and Table 5).
Global sensitivity analysis
The training data was used to calculate the variable sensitivity ratios (VSRs) for the GRNN model. The results indicated that the prognosis of patients with acute stroke is not determined by a single input variable but by a combination of multiple factors, among which baseline BI was the most influential or sensitive parameter affecting acute stroke outcome, followed by albumin and NIHSS (Table 6).
Discussion
Stroke is a fatal and debilitating chronic disease associated with high mortality and morbidity rates and economic costs. Therefore, accurate, and reliable predictive models for prognostic predictions have clinical relevance. This is the first study to create a GRNN-based prediction model for acute stroke patients. GRNN model was a better model for predicting the short-term outcome of patients with acute stroke than the LR model. Secondly, we also found that Hb and albumin are important biochemical indicators of prognosis in patients with acute stroke by using an LR model.
Receiver operating characteristic curve analysis was used to compare the predictive performance of the LR and GRNN models, and a higher area under the receiver operating characteristic curve indicated a better predictive value. In our study, the GRNN model showed better performance based on the AUC (0.931vs 0.702), sensitivity (0.933 vs 0.700), specificity (0.880 vs 0.720), accuracy (0.896 vs 0.729), and Kappa value (0.775 vs 0.416) than the LR model. Therefore, we speculate that the GRNN model has better performance than the LR model for the prognostic prediction for acute stroke. Similar results have been reported by Liu et al [14], who reported that GRNN models may have better predictive ability than traditional multivariate LR models for dental caries based on the outcomes of an oral health epidemiology survey. Currently, GRNN models have shown use in predicting obstructive sleep apnea [27], quantitative structure–pharmacokinetic relationship properties of chemical agents [28], and so on. There are no previous studies that have constructed predictive GRNN models for patients with acute stroke. Yaru et al [11] compared the performances of BPNN models and LR models in predicting prognosis and found that the BPNN models had better performance indicated by higher AUC, accuracy, and specificity for acute ischemic stroke patients. The BPNN and GRNN are both types of ANN. A GRNN is a probabilistic neural network based on the knowledge generated by a Bayesian strategy for pattern classification, which does not need an iterative training procedure like a BPNN [29, 30]. GRNNs are reported to be superior to BPNNs in solving neutron spectrum problems [30]. Therefore, a GRNN was selected for the first time in this study to construct a prognostic model for acute stroke patients.
We considered that the GRNN model is superior to the LR model in predicting the prognosis of acute stroke patients for the following reasons. (1) GRNN is not affected by the interaction of factors and can include several dependent variables, which may have advantages in analyzing the complex variables involved in stroke [31]. (2) GRNN can "learn" without supervision, and those who have little experience in mathematical model construction can also build a model using software [7]. (3) GRNN is similar in principle to neurons of the human brain; it does not have difficulties related to the use of mathematics, and it is likely to make accurate predictions [32]. However, GRNN is considered a “black box”, which makes it difficult to determine its processes and how it arrives at a prediction or explain the disease-related variables in the predictive model [14]. LR, which is a commonly used medical statistical tool, also has its irreplaceable advantages, especially in the interpretability and simplicity of finding independent parameters associated with the prognosis of stroke patients [8, 33]. LR analysis is not the best at analyzing complex nonlinear relationships among variables, which has been proven in biology and complex epidemiological analysis [31].
In general, each model has its strengths and limitations, and the selection of a model should be based on its advantages and the intended purpose of the research. GRNN should be useful when the interest is in prediction, particularly when there are implicit interactions and complex relationships in the data, and it provides a better model fit than conventional logistic regression [7, 8]. For clinical interpretations, the LR model remains the better choice when statistical inferences must be drawn from the causal relationships between covariates and outcomes, and the estimated model regression coefficients provide odds ratios for easy identification of significant predictive indicators [34]. Thus, combining the best characteristics of LR and GRNN models may facilitate a more ideal predictive model for acute stroke patients.
Logistic regression analysis revealed that age, NIHSS, BI scores, Hb and albumin concentrations were independently associated with the prognosis of acute stroke. It is well-known that the prognosis of acute stroke is worse with increasing age due to reduced immunity and physiological function. Aged stroke patients may have higher mortality, longer hospital stay, and poorer quality of life than their younger counterparts [35]. Bielewicz et al. [6] reported that the BI scores assessment on day 10 was predictive of outcomes 3 months after ischemic stroke onset, indicating that the preexisting functional status was positively correlated with the prognosis of stroke patients. Besides, the NIHSS score has been shown to reflect neurological deficits and is considered valuable for predicting the outcome of stroke patients [6]. Based on global sensitivity analysis, BI scores and NIHSS scores influenced the prognosis of acute stroke patients, and so the evaluation results of these two scales should be paid attention to in the early stage.
Among the biochemical biomarkers, our study found that low albumin concentration was predictive of poor prognosis of acute stroke patients based on the univariate analysis; however, albumin was a risk factor based on the multivariate analysis, which may be related to the influence of the protective factors, BI and hemoglobin, on the predictiveness of albumin for the prognosis of acute stroke patients. Global sensitivity analysis indicated that albumin plays an important role in the prognosis of acute stroke. Hashem et al. [36], Aquilani et al. [37], and Xu et al. [38] reported that the high dose of albumin positively affects the prognosis of ischemic stroke patients, which were consistent with the results of this study. Albumin is often regarded as a marker of nutritional status and a negative phase protein that decreases in concentration during injury [39]. In animal models of ischemic stroke, a high concentration of albumin was found to have a significant neuroprotective effect on focal cerebral ischemia [40]. The neuroprotective effect of albumin is mainly related to scavenging free radicals, preventing lipid peroxidation, or binding metal ions and free fatty acids [41]. In other respects, high doses of albumin can reduce the risk of sarcopenia [42]. Therefore, we can conclude that early correction of hypoalbuminemia may help reduce the risk of poor outcomes of stroke.
Otherwise, a high concentration of Hb was a predictor of good outcome of acute stroke patients in our study. Only a few studies on the relationship between hemoglobin and the prognosis of acute stroke patients are available. As Bellwald et al. [43] demonstrated, a decrease in Hb had a linear relationship with larger infarct sizes in ischemic stroke patients, indicating a poorer prognosis. Any cause of the decrease in Hb during acute stroke had a negative impact on prognosis [43]. However, other studies showed that a high baseline concentration of Hb was associated with a poor prognosis of ischemic stroke [44]. High concentrations of hemoglobin increase blood viscosity, thereby increasing blood pressure and worsening cardiovascular function [45]. Thus, the effect of Hb on stroke outcomes is a controversial issue that needs to be further clarified in larger prospective studies.
Our study had several limitations. First, the patient’s follow-up after discharge were not evaluated, and predictors of long-term outcome of the acute strokes may be different from on discharged outcome. Second, the evaluation of model performance in this study was based on available hospital data at the time, and researchers should consider other data for validation before using them on a large scale. Third, this was a retrospective study, and selection bias was inevitable; besides, the sample was small, and this may have limited the generalization of the research results. Fourth, we only registered the stroke patients within 3 months of onset; Hence, it is unclear if this model would be as optimal for patients with subacute and chronic stages of the disease. Fifth, GRNN algorithm needs sophisticated software, and the complexity and unfamiliarity with the networks have been major drawbacks. With future advancements in computer technology and rapid training algorithms, this limitation of neural computing should be overcome. However, the results presented in our study are relevant, and they can guide and inspire further research on clinical stroke prognosis models. Furthermore, future research should be placed on developing prospective studies, collect more variables and complete the long-term follow-up data in subacute and chronic stroke population on a large scale, to validate the performance of the GRNN model.
Conclusion
The study showed that acute stroke outcome can be readily predictable by using the GRNN model combining age, BI, NIHSS, hemoglobin, and albumin concentrations on admission and that this model could outperform LR model. The GRNN model is an accurate and meaningful tool, which can optimize existing models to facilitate more accurate predictions will improve the efficiency and accuracy of clinicians in predicting acute stroke outcomes at discharge and managing it. To make the GRNN model more representative, further researches should consider to design the prospective study collecting more variables of acute stroke patients on a large scale.
Acknowledgments
The authors of this study thanks all the staffs who support this research from rehabilitation department of Shenzhen Second People’s Hospital.
References
- 1. Wu F, Liu Y, Ye G, Zhang Y. Respiratory Muscle Training Improves Strength and Decreases the Risk of Respiratory Complications in Stroke Survivors: A Systematic Review and Meta-analysis. Arch Phys Med Rehabil. 2020;101(11):1991–2001. Epub 2020/05/24. pmid:32445847.
- 2. Xu G, Ma M, Liu X, Hankey GJ. Is there a stroke belt in China and why? Stroke. 2013;44(7):1775–83. Epub 2013/05/16. pmid:23674531.
- 3. Kasner SE. Clinical interpretation and use of stroke scales. Lancet Neurol. 2006;5(7):603–12. Epub 2006/06/20. pmid:16781990.
- 4. Muscari A, Puddu GM, Santoro N, Zoli M. A simple scoring system for outcome prediction of ischemic stroke. Acta Neurol Scand. 2011;124(5):334–42. Epub 2011/01/19. pmid:21241255.
- 5. Lahoud N, Salameh P, Hosseini H, Saleh N, Rizk E, Hallit S, et al. Care and Discharge Outcome of Acute Stroke in Lebanon: A Hospital-Based Study. J Nerv Ment Dis. 2018;206(8):637–43. Epub 2018/07/19. pmid:30020204.
- 6. Bielewicz JE, Kurzepa J, Kamieniak P, Daniluk B, Szczepańska-Szerej A, Rejdak K. Clinical and biochemical predictors of late-outcome in patients after ischemic stroke. Annals of agricultural and environmental medicine: AAEM. 2020;27(2):290–4. Epub 2020/06/27. pmid:32588608.
- 7. Tang ZH, Liu J, Zeng F, Li Z, Yu X, Zhou L. Comparison of prediction model for cardiovascular autonomic dysfunction using artificial neural network and logistic regression analysis. PloS one. 2013;8(8):e70571. Epub 2013/08/14. pmid:23940593; PubMed Central PMCID: PMC3734274.
- 8. Tu JV. Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. Journal of clinical epidemiology. 1996;49(11):1225–31. Epub 1996/11/01. pmid:8892489.
- 9. Eftekhar B, Mohammad K, Ardebili HE, Ghodsi M, Ketabchi E. Comparison of artificial neural network and logistic regression models for prediction of mortality in head trauma based on initial clinical data. BMC Med Inform Decis Mak. 2005;5:3. Epub 2005/02/17. pmid:15713231; PubMed Central PMCID: PMC551612.
- 10. Olai L, Omne-Pontén M, Borgquist L, Svärdsudd K. Prognosis assessment in stroke patients at discharge from hospital. Age Ageing. 2007;36(2):184–9. Epub 2006/12/21. pmid:17178766.
- 11. Liang Y, Li Q, Chen P, Xu L, Li J. Comparative Study of Back Propagation Artificial Neural Networks and Logistic Regression Model in Predicting Poor Prognosis after Acute Ischemic Stroke. Open Med (Wars). 2019;14:324–30. Epub 2019/04/19. pmid:30997395; PubMed Central PMCID: PMC6463818.
- 12. Erguzel TT, Noyan CO, Eryilmaz G, Ünsalver B, Cebi M, Tas C, et al. Binomial Logistic Regression and Artificial Neural Network Methods to Classify Opioid-Dependent Subjects and Control Group Using Quantitative EEG Power Measures. Clin EEG Neurosci. 2019;50(5):303–10. Epub 2019/01/16. pmid:30642219.
- 13. Ramos LA, Kappelhof M, van Os HJA, Chalos V, Van Kranendonk K, Kruyt ND, et al. Predicting Poor Outcome Before Endovascular Treatment in Patients With Acute Ischemic Stroke. Frontiers in neurology. 2020;11:580957. Epub 2020/11/13. pmid:33178123; PubMed Central PMCID: PMC7593486.
- 14. Liu L, Wu W, Zhang SY, Zhang KQ, Li J, Liu Y, et al. Dental Caries Prediction Based on a Survey of the Oral Health Epidemiology among the Geriatric Residents of Liaoning, China. Biomed Res Int. 2020;2020:5348730. Epub 2020/12/31. pmid:33376726; PubMed Central PMCID: PMC7739046.
- 15. Panaye A, Fan BT, Doucet JP, Yao XJ, Zhang RS, Liu MC, et al. Quantitative structure-toxicity relationships (QSTRs): a comparative study of various non linear methods. General regression neural network, radial basis function neural network and support vector machine in predicting toxicity of nitro- and cyano- aromatics to Tetrahymena pyriformis. SAR and QSAR in environmental research. 2006;17(1):75–91. Epub 2006/03/04. pmid:16513553.
- 16. Wu J, Peng D, Li Z, Zhao L, Ling H. Network intrusion detection based on a general regression neural network optimized by an improved artificial immune algorithm. PloS one. 2015;10(3):e0120976. Epub 2015/03/26. pmid:25807466; PubMed Central PMCID: PMC4373783.
- 17. Wang YW, Shen ZZ, Jiang Y. Comparison of autoregressive integrated moving average model and generalised regression neural network model for prediction of haemorrhagic fever with renal syndrome in China: a time-series study. BMJ open. 2019;9(6):e025773. Epub 2019/06/19. pmid:31209084; PubMed Central PMCID: PMC6589045.
- 18. Liu Q, Deng J, Liu M. Classification models for predicting the antimalarial activity against Plasmodium falciparum. SAR and QSAR in environmental research. 2020;31(4):313–24. Epub 2020/03/20. pmid:32191533.
- 19. Zhou M, Zha F, Chen Y, Liu F, Zhou J, Long J, et al. Handgrip Strength-Related Factors Affecting Health Outcomes in Young Adults: Association with Cardiorespiratory Fitness. Biomed Res Int. 2021;2021:6645252. Epub 2021/05/11. pmid:33969122; PubMed Central PMCID: PMC8084643.
- 20. Chung CC, Chan L, Bamodu OA, Hong CT, Chiu HW. Artificial neural network based prediction of postthrombolysis intracerebral hemorrhage and death. Scientific reports. 2020;10(1):20501. Epub 2020/11/27. pmid:33239681; PubMed Central PMCID: PMC7689530.
- 21. MacIsaac RL, Ali M, Taylor-Rowan M, Rodgers H, Lees KR, Quinn TJ. Use of a 3-Item Short-Form Version of the Barthel Index for Use in Stroke: Systematic Review and External Validation. Stroke. 2017;48(3):618–23. Epub 2017/02/06. pmid:28154094.
- 22. Quinn TJ, Dawson J, Walters MR, Lees KR. Functional outcome measures in contemporary stroke trials. Int J Stroke. 2009;4(3):200–5. Epub 2009/08/08. pmid:19659822.
- 23. Nakibuuka J, Sajatovic M, Nankabirwa J, Ssendikadiwa C, Furlan AJ, Katabira E, et al. Early mortality and functional outcome after acute stroke in Uganda: prospective study with 30 day follow-up. Springerplus. 2015;4:450. Epub 2015/09/01. pmid:26322256; PubMed Central PMCID: PMC4547979.
- 24. Xin X, Chang J, Gao Y, Shi Y. Correlation Between the Revised Brain Symmetry Index, an EEG Feature Index, and Short-term Prognosis in Acute Ischemic Stroke. J Clin Neurophysiol. 2017;34(2):162–7. Epub 2016/09/02. pmid:27584547.
- 25. Specht DF. A general regression neural network. IEEE Trans Neural Netw. 1991;2(6):568–76. Epub 1991/01/01. pmid:18282872.
- 26. Hayat M, Khan A. Predicting membrane protein types by fusing composite protein sequence features into pseudo amino acid composition. Journal of theoretical biology. 2011;271(1):10–7. Epub 2010/11/30. pmid:21110985.
- 27. Kirby SD, Eng P, Danter W, George CF, Francovic T, Ruby RR, et al. Neural network prediction of obstructive sleep apnea from clinical criteria. Chest. 1999;116(2):409–15. Epub 1999/08/24. pmid:10453870.
- 28. Yap CW, Chen YZ. Quantitative Structure-Pharmacokinetic Relationships for drug distribution properties by using general regression neural network. Journal of pharmaceutical sciences. 2005;94(1):153–68. Epub 2005/03/12. pmid:15761939.
- 29. Abut F, Akay MF. Machine learning and statistical methods for the prediction of maximal oxygen uptake: recent advances. Medical devices (Auckland, NZ). 2015;8:369–79. Epub 2015/09/09. pmid:26346869; PubMed Central PMCID: PMC4556298.
- 30. Martínez-Blanco MD, Ornelas-Vargas G, Solís-Sánchez LO, Castañeda-Miranada R, Vega-Carrillo HR, Celaya-Padilla JM, et al. A comparison of back propagation and Generalized Regression Neural Networks performance in neutron spectrometry. Applied radiation and isotopes: including data, instrumentation and methods for use in agriculture, industry and medicine. 2016;117:20–6. Epub 2016/05/03. pmid:27133196.
- 31. Hassanipour S, Ghaem H, Arab-Zozani M, Seif M, Fararouei M, Abdzadeh E, et al. Comparison of artificial neural network and logistic regression models for prediction of outcomes in trauma patients: A systematic review and meta-analysis. Injury. 2019;50(2):244–50. Epub 2019/01/21. pmid:30660332.
- 32. Ayer T, Chhatwal J, Alagoz O, Kahn CE Jr., Woods RW, Burnside ES. Informatics in radiology: comparison of logistic regression and artificial neural network models in breast cancer risk estimation. Radiographics. 2010;30(1):13–22. Epub 2009/11/11. pmid:19901087; PubMed Central PMCID: PMC3709515.
- 33. Dreiseitl S, Ohno-Machado L. Logistic regression and artificial neural network classification models: a methodology review. J Biomed Inform. 2002;35(5–6):352–9. Epub 2003/09/13. pmid:12968784.
- 34. Sutradhar R, Barbera L. Comparing an Artificial Neural Network to Logistic Regression for Predicting ED Visit Risk Among Patients With Cancer: A Population-Based Cohort Study. J Pain Symptom Manage. 2020;60(1):1–9. Epub 2020/02/24. pmid:32088358.
- 35. Arboix A, García-Eroles L, Massons J, Oliveres M, Targa C. Acute stroke in very old people: clinical features and predictors of in-hospital mortality. J Am Geriatr Soc. 2000;48(1):36–41. Epub 2000/01/21. pmid:10642019.
- 36. Hashem SS, Helmy SM, El-Fayomy NM, Oraby MI, Menshawy M, Dawood NA, et al. Predictors of stroke outcome: the role of hemorheology, natural anticoagulants, and serum albumin. Egypt J Neurol Psychiatr Neurosurg. 2018;54(1):18. Epub 2018/07/27. pmid:30046235; PubMed Central PMCID: PMC6028876.
- 37. Aquilani R, Maestri R, Boselli M, Achilli MP, Arrigoni N, Bruni M, et al. The relationship between plasma amino acids and circulating albumin and haemoglobin in postabsorptive stroke patients. PloS one. 2019;14(8):e0219756. Epub 2019/08/15. pmid:31412042; PubMed Central PMCID: PMC6693779 (Milano, Italy). This company had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. This does not alter the authors’ adherence to PLOS ONE policies on sharing data and materials. The other authors declare no conflict of interest.
- 38. Xu WH, Dong C, Rundek T, Elkind MS, Sacco RL. Serum albumin levels are associated with cardioembolic and cryptogenic ischemic strokes: Northern Manhattan Study. Stroke. 2014;45(4):973–8. Epub 2014/02/20. pmid:24549868; PubMed Central PMCID: PMC3966953.
- 39. Seet RC, Lim EC, Chan BP, Ong BK. Serum albumin level as a predictor of ischemic stroke outcome. Stroke. 2004;35(11):2435; author reply -6. Epub 2004/10/09. pmid:15472082.
- 40. Belayev L, Zhao W, Pattany PM, Weaver RG, Huh PW, Lin B, et al. Diffusion-weighted magnetic resonance imaging confirms marked neuroprotective efficacy of albumin therapy in focal cerebral ischemia. Stroke. 1998;29(12):2587–99. Epub 1998/12/04. pmid:9836772.
- 41. Remmers M, Schmidt-Kastner R, Belayev L, Lin B, Busto R, Ginsberg MD. Protein extravasation and cellular uptake after high-dose human-albumin treatment of transient focal cerebral ischemia in rats. Brain Res. 1999;827(1–2):237–42. Epub 1999/05/13. pmid:10320717.
- 42. Aquilani R, Boselli M, Paola B, Pasini E, Iadarola P, Verri M, et al. Is stroke rehabilitation a metabolic problem? Brain Inj. 2014;28(2):161–73. Epub 2014/01/25. pmid:24456056.
- 43. Bellwald S, Balasubramaniam R, Nagler M, Burri MS, Fischer SDA, Hakim A, et al. Association of anemia and hemoglobin decrease during acute stroke treatment with infarct growth and clinical outcome. PloS one. 2018;13(9):e0203535. Epub 2018/09/27. pmid:30256814; PubMed Central PMCID: PMC6157859.
- 44. Guo D, Zhu Z, Zhong C, Peng H, Xu T, Wang A, et al. Hemoglobin level and three-month clinical outcomes among ischemic stroke patients with elevated systolic blood pressure. J Neurol Sci. 2019;396:256–61. Epub 2018/12/12. pmid:30530284.
- 45. Lowe GD, Lee AJ, Rumley A, Price JF, Fowkes FG. Blood viscosity and risk of cardiovascular events: the Edinburgh Artery Study. Br J Haematol. 1997;96(1):168–73. Epub 1997/01/01. pmid:9012704