Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Use of machine learning to identify risk factors for coronary artery disease

  • Alexander A. Huang ,

    Contributed equally to this work with: Alexander A. Huang, Samuel Y. Huang

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    alexander.huang@northwestern.edu

    Affiliations Department of Statistics and Data Science, Cornell University, Ithaca, New York, United States of America, Department of MD Education, Northwestern University Feinberg School of Medicine, Chicago, Illinois, United States of America

  • Samuel Y. Huang

    Contributed equally to this work with: Alexander A. Huang, Samuel Y. Huang

    Roles Conceptualization, Data curation, Writing – original draft, Writing – review & editing

    Affiliations Department of Statistics and Data Science, Cornell University, Ithaca, New York, United States of America, Department of Internal Medicine, Virginia Commonwealth University School of Medicine, Richmond, Virginia, United States of America

Abstract

Coronary artery disease (CAD) is the leading cause of death in both developed and developing nations. The objective of this study was to identify risk factors for coronary artery disease through machine-learning and assess this methodology. A retrospective, cross-sectional cohort study using the publicly available National Health and Nutrition Examination Survey (NHANES) was conducted in patients who completed the demographic, dietary, exercise, and mental health questionnaire and had laboratory and physical exam data. Univariate logistic models, with CAD as the outcome, were used to identify covariates that were associated with CAD. Covariates that had a p<0.0001 on univariate analysis were included within the final machine-learning model. The machine learning model XGBoost was used due to its prevalence within the literature as well as its increased predictive accuracy in healthcare prediction. Model covariates were ranked according to the Cover statistic to identify risk factors for CAD. Shapely Additive Explanations (SHAP) explanations were utilized to visualize the relationship between these potential risk factors and CAD. Of the 7,929 patients that met the inclusion criteria in this study, 4,055 (51%) were female, 2,874 (49%) were male. The mean age was 49.2 (SD = 18.4), with 2,885 (36%) White patients, 2,144 (27%) Black patients, 1,639 (21%) Hispanic patients, and 1,261 (16%) patients of other race. A total of 338 (4.5%) of patients had coronary artery disease. These were fitted into the XGBoost model and an AUROC = 0.89, Sensitivity = 0.85, Specificity = 0.87 were observed (Fig 1). The top four highest ranked features by cover, a measure of the percentage contribution of the covariate to the overall model prediction, were age (Cover = 21.1%), Platelet count (Cover = 5.1%), family history of heart disease (Cover = 4.8%), and Total Cholesterol (Cover = 4.1%). Machine learning models can effectively predict coronary artery disease using demographic, laboratory, physical exam, and lifestyle covariates and identify key risk factors.

Introduction

Coronary artery disease (CAD) is the leading cause of death in both developed and developing nations [1]. CAD is an atherosclerotic disease that is associated with major complications, including angina, myocardial infarction, and sudden cardiac death [25]. Due to the high prevalence, morbidity, and mortality of CAD, identification of risk factors is a public health priority [6]. Genome-wide association studies have identified several genetic variants linked to CAD [710]. Additionally, epidemiological studies have identified significant socioeconomic, race, and sex disparities in CAD prevalence, quality measures, and outcomes [1, 1113]. Further work has found that a combination of genetic, demographic, and environmental factors contributes to the severity of CAD and other cardiovascular diseases [1, 1417]. Furthermore, lifestyle factors, such as diet and exercise, have been found to play an important role in the risk for CAD and other cardiovascular diseases [6, 1820]. These studies have been combined to develop joint risk scores, factoring in both physiological covariates (blood pressure, cholesterol) as well as demographic covariates (age, race, gender) [5, 8, 9, 21]. Despite the strong literature studying the risk factors for CAD, most studies focus upon hypothesis testing or epidemiology focusing upon specific risk factors of interest [2224]. While CAD is recognized as being of “multifactorial” cause, little is known regarding the relative predictive power of different risk factors (lifestyle vs genetic vs chronic disease comorbidities). Given these limitations in the literature, we will leverage transparent machine-learning methods including Shapely Additive Explanations (SHAP model explanations) and model gain statistics to identify pertinent risk-factors for CAD and compute their relative contribution to model prediction of CAD risk; the NHANES 2017–2020 cohort, a large, nationally representative sample of US adults, will be used within this study.

Methods

A retrospective, cross-sectional cohort study using the publicly available National Health and Nutrition Examination Survey (NHANES) was conducted in patients who completed the demographic, dietary, exercise, and mental health questionnaire and had laboratory and physical exam data.

Ethics approval and consent to participate

The acquisition and analysis of the data within this study was approved by the National Center for Health Statistics Ethics Review Board.

Dataset and cohort selection

The National Health and Nutrition Examination Survey (NHANES 2017–2020) is a program designed by the National Center for Health Statistics (NCHS), which has been leveraged to assess the health and nutritional status of the United States population [25]. The NHANES dataset is a series of cross-sectional, complex, multi-stage surveys conducted by the Centers for Disease Control and Prevention (CDC) on a nationally representative cohort of the United States population to provide health, nutritional, and physical activity data. In the present study, we analyzed adult (≥18 years old) patients in the NHANES dataset who completed the demographic, dietary, exercise, and mental health questionnaire and had laboratory and physical exam data.

Assessment of coronary artery disease

The medical conditions file was used to define coronary artery disease. Participants were asked: “Has a doctor or other health professional ever told you that you have coronary heart disease?” Participants who answered “Yes” to this question were considered as having CAD within this study.

Independent variable

Potential model covariates were identified within the demographics, dietary, physical examination, laboratory, and medical questionnaire datasets in NHANES. All covariates were extracted and merged with the CAD indicator.

Model construction and statistical analysis

Univariate logistic models, with CAD as the outcome, were used to identify covariates that were associated with CAD. Covariates that had a p<0.0001 on univariate analysis were included within the final machine-learning model. The machine learning model XGBoost was used due to its prevalence within the literature as well as its increased predictive accuracy in healthcare prediction. XGBoost models were fit with a train:test (80:20), and model accuracy statistics (AUROC, Sensitivity, Specificity, F1, Balanced Accuracy) were computed. Model covariates were ranked according to the Gain, Cover, and Frequency (representations of the relative contribution (“model importance”) of each of the covariates) to identify risk factors for CAD. The Gain statistic represents the overall proportion of the model prediction is attributed to a given statistic. The Cover and Frequency are representations of the proportion of trees that each of the covariates appear within the machine-learning model. SHAP explanations were utilized to visualize the relationship between these potential risk factors and CAD.

Results

Table 1 shows that f the 7,929 patients that met the inclusion criteria in this study, 4,055 (51%) were female, 2,874 (49%) were male. The mean age was 49.2 (SD = 18.4), with 2,885 (36%) White patients, 2,144 (27%) Black patients, 1,639 (21%) Hispanic patients, and 1,261 (16%) patients of another race. A total of 338 (4.5%) of patients had coronary artery disease.

The machine learning model had 58 out of a total 684 features that were found to be significant on univariate analysis (P<0.0001 used). These were fitted into the XGBoost model and an AUROC = 0.89, Sensitivity = 0.85, Specificity = 0.87 were observed Fig 1.

thumbnail
Fig 1. Restricted operator characteristic curve and model statistics.

The ROC Curve for the machine-learning model predicting coronary artery disease. AUROC = 0.89.

https://doi.org/10.1371/journal.pone.0284103.g001

Table 2 shows the top five highest ranked features by cover, a measure of the percentage contribution of the covariate to the overall model prediction, were age (Cover = 21.1%), Platelet count (Cover = 5.1%), family history of heart disease (Cover = 4.8%), and Total Cholesterol (Cover = 4.1%).

thumbnail
Table 2. Model feature importance statistics.

https://doi.org/10.1371/journal.pone.0284103.t002

In Fig 2, on SHAP visualization, we observed that: interpret the top four covariates age had a sigmoidal relationship with risk for coronary artery disease.

thumbnail
Fig 2. Overall SHAP explanations.

SHAP explanations, purple color representing higher values of the covariate while yellow representing lower values of the covariate. X-axis is the change in log-odds for CAD.

https://doi.org/10.1371/journal.pone.0284103.g002

Figs 3, 4a and 4b shows the SHAP Explanations for various SHAP features. We observed that at ages between 20 and 35, there was no significant change in risk for CAD with increasing age, with age increasing between 35 and 70, there was a significant increase in risk for CAD with increasing age, and above 70 years of age, there was no significant increase in CAD with increasing age. Additionally, a curvilinear relationship was observed analyzing the relationship with total-cholesterol and risk for CAD. Patients with significantly decreased total cholesterol were observed to have increased risk for heart disease, and patients with increased cholesterol were observed to also be at increased risk, with a minimum risk around 200 mg/dL of cholesterol. A curvilinear relationship was also observed for the relationship between platelet count and risk for CAD, with significantly decreased platelet counts linked with CAD and significantly increased platelet counts also linked with CAD, a minimum observed around 300,000 cells/uL. Family history was also a significant predictor for CAD. Patients with close relatives having a heart attack in the past had significant increased risk for CAD.

thumbnail
Fig 3. SHAP explanations for the top 4 covariates.

SHAP explanations, covariate value on the x-axis, change in log-odds on the y-axis, red line represents the relationship between the covariate and log-odds for CAD, each black dot represents an observation. Covariates (top left—Age, top right—Total Cholesterol, bottom left—platelets, bottom right—Close relative with a heart attack (Yes = 1, no = 2)).

https://doi.org/10.1371/journal.pone.0284103.g003

thumbnail
Fig 4.

a: Covariates of interest to evaluate sensibility of the model. SHAP explanations for the relationship between HDL-Cholesterol and odds of CAD. Covariate value on the x-axis, change in log-odds on the y-axis, red line represents the relationship between the covariate and log-odds for CAD, each black dot represents an observation. b: SHAP explanations for the relationship between Systolic blood pressure and odds of CAD. Covariate value on the x-axis, change in log-odds on the y-axis, red line represents the relationship between the covariate and log-odds for CAD, each black dot represents an observation.

https://doi.org/10.1371/journal.pone.0284103.g004

Discussion

In this retrospective, cross sectional cohort of United States adults, a machine learning model utilizing demographic, laboratory, physical examination, and lifestyle questionnaire data had strong predictive accuracy (AUROC = 0.89). The greatest predictors for coronary artery disease included age, total cholesterol, total platelets, and family history of a heart attack.

The visualizations completed for the top four covariates were concordant with current literature around the relationship between these covariates and coronary artery disease: there is strong epidemiological and physiological evidence for the link between increased age and cholesterol as major risk factors for coronary artery disease [2628]. The non-linear relationship between cholesterol and coronary artery disease matches survival-modeling and restricted cubic spline analysis from other studies [2836]. Furthermore, multiple genetic and sociological studies have found that family history is a significant risk factor for coronary artery disease [3740]. Additionally, low-platelets being associated with coronary artery disease is associated with pathology such as thrombocytopenia [4143]. In addition to the top four covariates within our model, we also wanted to explore if the machine-learning model was able to generate predictions for HDL-Cholesterol and Systolic Blood pressure, two major risk factors that have been widely studied within the cardiovascular literature. In these visualizations, we observed a strong negative relationship between HDL-cholesterol and risk for coronary artery disease (Fig 4a). We observe a curvilinear relationship between systolic blood pressure and coronary heart disease, with blood pressures lower than 120 being associated with increased risk for coronary heart disease and blood pressures above being strongly associated with coronary heart disease as well.

Since visualizations for risk factors match literature relationships, we have increased confidence that the machine learning model is able to capture the actual physiological relationships of these covariates [4446]. These transparent machine-learning tools allow for increased confidence that these algorithms are picking up true signal within these covariates to predict coronary artery disease rather than just replicating potential biases stemming from systemic data- = quality errors that are present within the dataset. Additionally, these SHAP visualizations allow us to interpret that the increase predictive power of these machine-learning methods is associated with the ability for these non-parametric methods to more accurately capture the non-linear interactive relationship between the covariates, rather than just over-fitting the model to get increased accuracy.

The greatest strength of this algorithmic method for identification of the covariates is the ability to search through hundreds of covariates systematically without relying upon judgment form the researcher, which may be muddled by potential personal biases. This method also allows for the ranking of the relative importance of each of these covariates through the cover statistic, which allows us to obtain the relative contribution to the prediction each covariate has and thus infer from there an estimate for the relative contribution to true risk for coronary artery disease that each patient has. Another strength is that after these covariates are selected and the model built, SHAP visualizations can be used to make sure that each of the covariate either matches current literature understandings of the covariate’s association with coronary heart disease or in the case of a discrepancy, allow researchers to validate the plausibility of this feature and then evaluate for potential errors in data-quality.

Some potential weaknesses to this machine-learning analysis is the necessity of the retrospective nature of this cohort. The covariates that were selected within this study will be better at predicting coronary heart disease risk for this cohort than for other cohorts. However, this was limited by the use of training: testing sets to be able to minimize the errors that come with overfitting. Furthermore, visualizations of SHAP allow researchers to test for physiologic plausibility of each of these covariates and allows for effective analysis by researchers of whether these effects are due to true signal or if they are just noise that may be contributing to a type-1 error.

Given the analysis of the strengths and weaknesses of these methods, we argue that use of machine-learning methods can be an effective first step in the identification of risk-factors that can then be further selected by clinicians based upon the specific clinical presentation.

Limitations

This study has several strengths and weaknesses. We utilized the NHANES dataset, which is a retrospective cohort, carrying the limitations of retrospective studies. However, this study allows for the selection of a large cohort, evaluation of data quality, and due to the publicly available nature of the cohort, allows for increased replication and follow-up studies based upon the same cohort. Furthermore, the cohort relied on surveys to obtain the outcome of interest (CAD) as well as the dietary and lifestyle information. More accurate measurements may have been achieved with prospective studies with automated measurement of foods. However, self-reported survey information allows for the volume of participants to be included within this study. Another weakness was the voluntary nature of this cohort, with participants choosing to opt into the study instead of being randomly selected. This may artificially select a different cohort that may significantly differ from the population. However, our analysis found a demographically diverse population, so these results may still be generalizable to other cohorts.

Conclusion

Machine learning models can effectively predict coronary artery disease using demographic, laboratory, physical exam, and lifestyle covariates. Age, total cholesterol, total platelets, and family history of heart attack are the strongest predictors of coronary artery disease.

Acknowledgments

The authors acknowledge the NHANES dataset, which was used in this study.

References

  1. 1. Albar HM, Alahmdi RA, Almedimigh AA, et al. Prevalence of coronary artery disease and its risk factors in Majmaah City, Kingdom of Saudi Arabia. Front Cardiovasc Med. 2022;9:943611. pmid:36158800
  2. 2. AlOthman AF, Sait ARW, Alhussain TA. Detecting Coronary Artery Disease from Computed Tomography Images Using a Deep Learning Technique. Diagnostics (Basel). Aug 26 2022;12(9) pmid:36140475
  3. 3. Bhattad PB, Sherif AA, Mishra AK, Roumia M. Left Main Coronary Artery Disease: The Forgotten Lead of Electrocardiogram Is Predictive. Cureus. Aug 2022;14(8):e28391. pmid:36168367
  4. 4. Luu JM, Wei J, Shufelt CL, et al. Clinical Practice Variations in the Management of Ischemia With No Obstructive Coronary Artery Disease. J Am Heart Assoc. Sep 29 2022:e022573. pmid:36172938
  5. 5. Maamari DJ, Brockman DG, Aragam K, et al. Clinical Implementation of Combined Monogenic and Polygenic Risk Disclosure for Coronary Artery Disease. JACC Adv. Aug 2022;1(3) pmid:36147540
  6. 6. de Souza ESCG, Buginga GC, de Souza ESEA, et al. Prediction of Mortality in Coronary Artery Disease: Role of Machine Learning and Maximal Exercise Capacity. Mayo Clin Proc. Aug 2022;97(8):1472–1482. pmid:35431026
  7. 7. Eurlings C, Bektas S, Sanders-van Wijk S, et al. Use of artificial intelligence to assess the risk of coronary artery disease without additional (non-invasive) testing: validation in a low-risk to intermediate-risk outpatient clinic cohort. BMJ Open. Sep 26 2022;12(9):e055170. pmid:36167368
  8. 8. Gola D, Erdmann J, Muller-Myhsok B, Schunkert H, Konig IR. Polygenic risk scores outperform machine learning methods in predicting coronary artery disease status. Genet Epidemiol. Mar 2020;44(2):125–138. pmid:31922285
  9. 9. Goodman MO, Cade BE, Shah N, et al. Pathway-Specific Polygenic Risk Scores Identify Obstructive Sleep Apnea-Related Pathways Differentially Moderating Genetic Susceptibility to Coronary Artery Disease. Circ Genom Precis Med. Sep 28 2022:101161CIRCGEN121003535. pmid:36170352
  10. 10. Gulati M, Khan N, George M, et al. Ischemia with no obstructive coronary artery disease (INOCA): A patient self-report quality of life survey from INOCA international. Int J Cardiol. Sep 23 2022; pmid:36162521
  11. 11. Ullah M, Wahab A, Khan SU, et al. Stent as a Novel Technology for Coronary Artery Disease and their Clinical Manifestation. Curr Probl Cardiol. Sep 23 2022:101415. pmid:36155199
  12. 12. Yang R, Zhang W, Wang X, et al. Nonlinear association of 1,5-anhydroglucitol with the prevalence and severity of coronary artery disease in chinese patients undergoing coronary angiography. Front Endocrinol (Lausanne). 2022;13:978520. pmid:36133308
  13. 13. Zhu H, Yin C, Schoepf UJ, et al. Machine Learning for the Prevalence and Severity of Coronary Artery Calcification in Nondialysis Chronic Kidney Disease Patients: A Chinese Large Cohort Study. J Thorac Imaging. May 3 2022; pmid:35576551
  14. 14. Agrawal S, Klarqvist MDR, Emdin C, et al. Selection of 51 predictors from 13,782 candidate multimodal features using machine learning improves coronary artery disease prediction. Patterns (N Y). Dec 10 2021;2(12):100364. pmid:34950898
  15. 15. Akella A, Akella S. Machine learning algorithms for predicting coronary artery disease: efforts toward an open source solution. Future Sci OA. Mar 29 2021;7(6):FSO698. pmid:34046201
  16. 16. Al’Aref SJ, Maliakal G, Singh G, et al. Machine learning of clinical variables and coronary artery calcium scoring for the prediction of obstructive coronary artery disease on coronary computed tomography angiography: analysis from the CONFIRM registry. Eur Heart J. Jan 14 2020;41(3):359–367. pmid:31513271
  17. 17. Alizadehsani R, Abdar M, Roshanzamir M, et al. Machine learning-based coronary artery disease diagnosis: A comprehensive review. Comput Biol Med. Aug 2019;111:103346. pmid:31288140
  18. 18. Deka P, Blesa J, Pathak D, et al. Combined Dietary Education and High-Intensity Interval Resistance Training Improve Health Outcomes in Patients with Coronary Artery Disease. Int J Environ Res Public Health. Sep 10 2022;19(18) pmid:36141673
  19. 19. Lee YH, Tsai TH, Chen JH, et al. Machine learning of treadmill exercise test to improve selection for testing for coronary artery disease. Atherosclerosis. Jan 2022;340:23–27. pmid:34871817
  20. 20. Vandeloo B, Andreini D, Brouwers S, et al. Diagnostic performance of exercise stress tests for detection of epicardial and microvascular coronary artery disease: the UZ Clear study. EuroIntervention. Sep 23 2022; pmid:36147027
  21. 21. Zaccardi F, Timmins IR, Goldney J, et al. Self-reported walking pace, polygenic risk scores and risk of coronary artery disease in UK biobank. Nutr Metab Cardiovasc Dis. Sep 2 2022; pmid:36163213
  22. 22. Mekhael M, Marrouche N, Hajjar AHE, Donnellan E. The Relationship between Atrial Fibrillation and Coronary Artery Disease: Understanding common denominators. Trends Cardiovasc Med. Sep 28 2022; pmid:36182022
  23. 23. Peksa JW, Pawlik A, Dziewierz A, Zawislak B. Unexpected severe coronary artery disease in a young patient with only one modifiable risk factor. Kardiol Pol. Sep 23 2022;
  24. 24. Peng Y, Cheng Z, Yi Q. A practical nomogram for predicting coronary thrombosis for Kawasaki disease patients with medium or large coronary artery aneurysm. Clin Exp Med. Sep 23 2022; pmid:36151486
  25. 25. Chen W, Shi S, Jiang Y, et al. Association of sarcopenia with ideal cardiovascular health metrics among US adults: a cross-sectional study of NHANES data from 2011 to 2018. BMJ Open. Sep 23 2022;12(9):e061789. pmid:36153025
  26. 26. QuickStats: Percentage* of Adults Aged ≥18 Years with Diagnosed Heart Disease,(dagger) by Urbanization Level (section sign) and Age Group—National Health Interview Survey, United States, 2020 (paragraph sign). MMWR Morb Mortal Wkly Rep. Jun 10 2022;71(23):778. pmid:35679171
  27. 27. Al-Shoaibi AAA, Li Y, Song Z, et al. Association of Low-Density Lipoprotein Cholesterol with Risk of Coronary Heart Disease and Stroke among Middle-Aged Japanese Workers: An Analysis using Inverse Probability Weighting. J Atheroscler Thromb. Jul 13 2022; pmid:35831131
  28. 28. Caselli C, De Caterina R, Smit JM, et al. Triglycerides and low HDL cholesterol predict coronary heart disease risk in patients with stable angina. Sci Rep. Oct 20 2021;11(1):20714. pmid:34671067
  29. 29. Chen BW, Liu JJ, Xing JH, et al. Analysis of the Correlation Between the Ratio of Monocytes to High-Density Lipoprotein Cholesterol and in-Stent Restenosis in Patients with Premature Coronary Heart Disease. Clin Appl Thromb Hemost. Jan-Dec 2022;28:10760296221079334. pmid:35187964
  30. 30. Cheng Q, Li Z, Wang X, et al. Relation Between New York Heart Association Functional Class and Remnant Cholesterol, and Non-high Density Lipoprotein Cholesterol in Coronary Heart Disease Patients With Type 2 Diabetes Mellitus. Angiology. Apr 24 2022:33197221091315. pmid:35466706
  31. 31. Doi T, Langsted A, Nordestgaard BG. Elevated Remnant Cholesterol Reclassifies Risk of Ischemic Heart Disease and Myocardial Infarction. J Am Coll Cardiol. Jun 21 2022;79(24):2383–2397. pmid:35710189
  32. 32. Ghanavati M, Alipour Parsa S, Nasrollahzadeh J. A calorie-restricted diet with nuts favourably raises plasma high-density lipoprotein-cholesterol in overweight and obese patients with stable coronary heart disease: A randomised controlled trial. Int J Clin Pract. Sep 2021;75(9):e14431. pmid:34080258
  33. 33. Guan J, Wu L, Xiao Q, Pan L. Levels and clinical significance of serum homocysteine (Hcy), high-density lipoprotein cholesterol (HDL-C), vaspin, and visfatin in elderly patients with different types of coronary heart disease. Ann Palliat Med. May 2021;10(5):5679–5686. pmid:34107717
  34. 34. Hayajneh AA, Alhusban IM, Rababa M. The Role of Traditional Obesity Parameters in Predicting Frailty among Coronary Artery Disease Patients Undergoing Cardiac Catheterization. Int J Clin Pract. 2022;2022:8676274. pmid:36160288
  35. 35. Hu Z, Cui J, Li X, Zhou Y, Cai L, Zhang S. High-Density Lipoprotein Cholesterol in Young Nondiabetic Coronary Heart Disease Patients. Cardiol Res Pract. 2021;2021:2970568. pmid:34336270
  36. 36. Jaishankar T, Shivasekar M, Vinodhini VM. Assessment of Remnant Lipoprotein Cholesterol and Oxidized Low density Lipoprotein Associated with Low-grade Inflammation in Coronary Heart Disease Subjects of Young South Indian Population. J Assoc Physicians India. Jun 2022;70(6):11–12. pmid:35702834
  37. 37. Kuusisto S, Karjalainen MK, Tillin T, et al. Genetic and observational evidence: No independent role for cholesterol efflux over static high-density lipoprotein concentration measures in coronary heart disease risk assessment. J Intern Med. Jul 2022;292(1):146–153. pmid:35289444
  38. 38. Liu B, Fang L, Xiong Y, et al. A Machine Learning Model Based on Genetic and Traditional Cardiovascular Risk Factors to Predict Premature Coronary Artery Disease. Front Biosci (Landmark Ed). Jul 4 2022;27(7):211. pmid:35866398
  39. 39. Manduchi E, Le TT, Fu W, Moore JH. Genetic Analysis of Coronary Artery Disease Using Tree-Based Automated Machine Learning Informed By Biology-Based Feature Selection. IEEE/ACM Trans Comput Biol Bioinform. May-Jun 2022;19(3):1379–1386. pmid:34310318
  40. 40. Orlenko A, Kofink D, Lyytikainen LP, et al. Model selection for metabolomics: predicting diagnosis of coronary artery disease using automated machine learning. Bioinformatics. Mar 1 2020;36(6):1772–1778. pmid:31702773
  41. 41. Dogdus M, Dindas F, Yenercag M, et al. The Role of Systemic Immune Inflammation Index for Predicting Saphenous Vein Graft Disease in Patients with Coronary Artery Bypass Grafting. Angiology. Sep 25 2022:33197221129356. pmid:36154493
  42. 42. Han M, Sun Y, Li N. Relationship between platelet-to-lymphocyte ratio and Coronary Artery Lesion in non-diabetic patients with coronary heart disease. J Pak Med Assoc. Jul 2022;72(7):1426–1428. pmid:36156573
  43. 43. Saint Croix G, Lacy SC, Gazzhal A, et al. Dual Antiplatelet Therapy in Patients Aged 75 Years and Older with Coronary Artery Disease: A Meta-Analysis and Systematic Review. J Interv Cardiol. 2022;2022:3111840. pmid:36176329
  44. 44. Cheang I, Zhu X, Zhu Q, et al. Inverse association between blood ethylene oxide levels and obesity in the general population: NHANES 2013–2016. Front Endocrinol (Lausanne). 2022;13:926971. pmid:36171904
  45. 45. Guo X, Li N, Wang H, et al. Exploratory analysis of the association between pyrethroid exposure and rheumatoid arthritis among US adults: 2007–2014 data analysis from the National Health and Nutrition Examination Survey (NHANES). Environ Sci Pollut Res Int. Sep 24 2022; pmid:36151437
  46. 46. Wang X, Xiao P, Wang R, et al. Relationships between urinary metals concentrations and cognitive performance among U.S. older people in NHANES 2011–2014. Front Public Health. 2022;10:985127. pmid:36148349