
Applications of machine learning to undifferentiated chest pain in the emergency department: A systematic review

  • Jonathon Stewart,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Visualization, Writing – original draft, Writing – review & editing

    Jonathon.Stewart@research.uwa.edu.au

    Affiliations School of Medicine, The University of Western Australia, Crawley, Western Australia, Australia, Harry Perkins Institute of Medical Research, Murdoch, Western Australia, Australia

  • Juan Lu,

    Roles Data curation, Formal analysis, Methodology, Validation, Writing – original draft, Writing – review & editing

    Affiliations Harry Perkins Institute of Medical Research, Murdoch, Western Australia, Australia, School of Physics, Mathematics and Computing, University of Western Australia, Crawley, Western Australia, Australia

  • Adrian Goudie,

    Roles Investigation, Methodology, Supervision, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Emergency Medicine, Fiona Stanley Hospital, Murdoch, Western Australia, Australia

  • Mohammed Bennamoun,

    Roles Conceptualization, Funding acquisition, Methodology, Supervision, Writing – review & editing

    Affiliation School of Physics, Mathematics and Computing, University of Western Australia, Crawley, Western Australia, Australia

  • Peter Sprivulis,

    Roles Conceptualization, Funding acquisition, Methodology, Supervision, Writing – review & editing

    Affiliations School of Medicine, The University of Western Australia, Crawley, Western Australia, Australia, Department of Health Western Australia, East Perth, Western Australia, Australia

  • Frank Sanfilippo,

    Roles Funding acquisition, Methodology, Supervision, Writing – review & editing

    Affiliation School of Population and Global Health, University of Western Australia, Crawley, Western Australia, Australia

  • Girish Dwivedi

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Writing – review & editing

    Affiliations School of Medicine, The University of Western Australia, Crawley, Western Australia, Australia, Harry Perkins Institute of Medical Research, Murdoch, Western Australia, Australia, Department of Cardiology, Fiona Stanley Hospital, Murdoch, Western Australia, Australia

Abstract

Background

Chest pain is amongst the most common reasons for presentation to the emergency department (ED). There are many causes of chest pain, and it is important for the emergency physician to quickly and accurately diagnose life-threatening causes such as acute myocardial infarction (AMI). Multiple clinical decision tools have been developed to assist clinicians in risk stratifying patients with chest pain. There is growing recognition that machine learning (ML) will have a significant impact on the practice of medicine in the near future and may assist with diagnosis and risk stratification. This systematic review aims to evaluate how ML has been applied to adults presenting to the ED with undifferentiated chest pain and to assess whether ML models show improved performance when compared to physicians or current risk stratification techniques.

Methods and findings

We conducted a systematic review of journal articles that applied an ML technique to adult patients presenting to an emergency department with undifferentiated chest pain. Multiple databases were searched from inception through to November 2020. In total, 3361 articles were screened, and 23 articles were included. We did not conduct a meta-analysis due to a high level of heterogeneity between studies in both their methods and reporting. The most common primary outcomes assessed were diagnosis of acute myocardial infarction (AMI) (12 studies) and prognosis of major adverse cardiovascular events (MACE) (6 studies). There were 14 retrospective studies and 5 prospective studies. Four studies reported developing a machine learning model retrospectively and then testing it prospectively. The most common machine learning methods used were artificial neural networks (14 studies), random forest (6 studies), support vector machines (5 studies), and gradient boosting (2 studies). Multiple studies achieved high accuracy both in the diagnosis of AMI in the ED setting and in predicting mortality and composite outcomes over various timeframes. ML outperformed existing risk stratification scores in all cases, and physicians in three out of four cases. The majority of studies were single centre, retrospective, and without prospective or external validation. Only 3 studies were considered to have a low risk of bias and low applicability concerns. Two studies reported integrating the ML model into clinical practice.

Conclusions

Research on applications of ML for undifferentiated chest pain in the ED has been ongoing for decades. ML has been reported to outperform emergency physicians and current risk stratification tools to diagnose AMI and predict MACE but has rarely been integrated into practice. Many studies assessing the use of ML in undifferentiated chest pain in the ED have a high risk of bias. It is important that future studies make use of recently developed standardised ML reporting guidelines, register their protocols, and share their datasets and code. Future work is required to assess the impact of ML model implementation on clinical decision making, patient-orientated outcomes, and patient and physician acceptability.

Trial registration

International Prospective Register of Systematic Reviews registration number: CRD42020184977.

Introduction

Complex decision-making under uncertainty is at the core of emergency medicine [1]. Emergency physicians must manage parallel and competing demands in an often chaotic and unpredictable environment. There is an ongoing challenge in distinguishing patients with potentially life-threatening conditions from those with more common benign diagnoses. Chest pain exemplifies this diagnostic challenge.

Chest pain is amongst the most common reasons for presentation to the emergency department (ED) [2]. There are many causes of chest pain, and it is important for the emergency physician to quickly and accurately assess, investigate, and diagnose life-threatening causes such as acute coronary syndrome (ACS). ACS encompasses a range of important diagnoses related to cardiac ischemia, including unstable angina (UA), non-ST elevation myocardial infarction (NSTEMI), and ST elevation myocardial infarction (STEMI) [3]. ACS causes significant mortality and morbidity, and outcomes are improved with early recognition and treatment [4].

The majority of patients who present to an ED with chest pain will not have ACS [5]. Risk stratification is an integral part of the evaluation of chest pain [6]. History and physical examination alone are unreliable in evaluating patients with chest pain [7]. This has led to the development of multiple clinical decision tools, such as the TIMI score and the HEART score, to assist clinicians in determining which patients with chest pain are at high risk of acute coronary syndrome [8, 9]. Many of these decision tools have been validated internationally in multiple prospective trials, and the HEART score has achieved good results [10]. Despite these decision tools, a small number of cases of ACS are still missed [11]. There is growing recognition that emerging artificial intelligence (AI) technologies will have a significant impact on the practice of medicine in the near future [12, 13]. There has been longstanding interest in the application of AI-based techniques to chest pain [14].

The field of artificial intelligence can be broadly and pragmatically defined as “the theory and development of computer systems able to perform tasks normally requiring human intelligence” [15]. Over the last decade a combination of exponential increases in computing power, the digitalisation of data, and advances in AI algorithms has led to a renaissance in AI research [16]. Machine learning (ML) is a subfield of AI that uses various methods to automatically detect patterns in data, then uses these patterns to make predictions or decisions [17]. By repeatedly comparing predictions with results, machine learning models iteratively adjust their internal parameters (a process called “training”) to improve their performance. A trained model’s predictions can then be tested on unseen data to ensure that the model generalises to new data and has not become ‘overfit’ to the data that was used to train it. Deep learning (DL) is a type of ML that uses a large number of interconnected non-linear processing units to obtain increasingly abstract representations of data, giving it the capability to learn to model very complex functions [18]. DL algorithms have been used to achieve impressive results in multiple diverse fields such as image recognition, speech recognition, and natural language processing [19–22].
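
As a minimal sketch of the train/test workflow described above, the following Python (scikit-learn) snippet fits a model on synthetic tabular data and checks generalisation on a held-out set. The dataset, model choice, and all parameters are illustrative assumptions, not drawn from any of the included studies.

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for a tabular clinical dataset
# (2,000 "patients", 23 candidate predictors, binary outcome).
X, y = make_classification(n_samples=2000, n_features=23, random_state=0)

# Hold out unseen data so generalisation can be checked after training.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0)

model = RandomForestClassifier(n_estimators=200, random_state=0)
model.fit(X_train, y_train)  # "training": fitting the internal parameters

# A large gap between training and test AUC suggests overfitting.
print("train AUC:", roc_auc_score(y_train, model.predict_proba(X_train)[:, 1]))
print("test AUC: ", roc_auc_score(y_test, model.predict_proba(X_test)[:, 1]))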

State of the art ML technologies are overwhelmingly narrow rather than general in their current applications but have still achieved great successes, including on some problems previously thought to be intractable [23]. There are ongoing efforts to create more generalisable models; however, the application of existing narrow ML technologies could still fundamentally change many industries, including healthcare [24]. AI techniques have demonstrated the capability to predict patient outcomes and risk stratify patients based on clinical and physiological data [25, 26]. AI techniques have recently been applied with success to the diagnosis of myocardial infarction [27]. The implementation of artificial intelligence techniques into clinical practice remains a challenge.

This systematic review aims to evaluate the applications of machine learning in undifferentiated chest pain in the ED by answering the following questions.

  1. How has ML been applied to adults presenting to the ED with undifferentiated chest pain?
  2. Do ML models show improved performance compared to physicians or current risk stratification techniques?

Methods

A systematic review protocol was prepared in accordance with PRISMA-P guidelines and registered with the International Prospective Register of Systematic Reviews (PROSPERO) on 08/09/2020 (registration number CRD42020184977) [28]. We conducted and report this review in line with the PRISMA Reporting Guidelines for Systematic Reviews [29].

We included all journal articles that applied an ML technique to adult patients presenting to an ED with undifferentiated chest pain. As this study aims to broadly assess the capability of ML applied to undifferentiated chest pain in the ED, all outcomes and comparators were included. Studies that did not use a comparator were still included in our review. We excluded conference abstracts, studies that did not use ML techniques, studies that did not assess undifferentiated chest pain, studies not based in an ED setting, and studies that focused solely on using ML for imaging or investigation interpretation.

The search strategy for this systematic review was developed with input from study authors and a health sciences librarian with expertise in systematic review searching. We searched PubMed (MEDLINE), Cochrane Library, Web of Science, Embase, and Scopus for English language articles published from database inception to 11/08/2020. Electronic databases were first searched on 11/08/2020 and last searched on 15/11/2020. We searched for medical subject headings (MeSH) terms and text keywords related to chest pain, artificial intelligence, machine learning, deep learning, and emergency medicine. The MEDLINE search strategy is provided in S1 Appendix. The MEDLINE strategy was adapted to the other databases. Reference lists of all included studies and the authors' personal archives were also reviewed for further relevant literature to ensure literature saturation was achieved.

Citations and abstracts were screened by two reviewers (JS and JL) against predefined inclusion and exclusion criteria. Both of the review authors were blind to the journal titles, study authors, and institutions. Full text articles were obtained for any articles identified by one reviewer to meet criteria. Two reviewers (JS and JL) then screened the full text reports against inclusion and exclusion criteria. Data were extracted by JS and JL using a standardised form. The form was piloted, and calibration exercises were conducted prior to formal data extraction to ensure consistency between reviewers. In all cases of conflict or discrepancy, additional study authors were involved until a decision was reached. Study authors were contacted by email to resolve any significant uncertainties.

Data extracted included study type, outcomes, population, input data used, ML methodology used, number of input variables in the ML model, comparisons, results, public availability of the dataset, and public availability of model code. Risk of bias in studies was assessed by two authors (JS and JL) using the Prediction model Risk of Bias Assessment Tool (PROBAST) [30].

We did not conduct a meta-analysis due to a high level of heterogeneity between studies in both their methods and reporting. We conducted a narrative analysis of the included studies to provide further commentary and exploration of the trends and findings.

Results

Study selection

We identified 3590 records following database searching and a further 42 records through other sources, including the authors' personal libraries. Following removal of duplicates, 3361 records remained and underwent title and abstract screening. Of these, 3279 records were excluded. The remaining 82 full-text articles were assessed for eligibility.

In total, 59 articles were excluded for the following reasons:

  1. 13 articles were excluded as they were an abstract only.
  2. 11 articles were excluded as they were a commentary or review.
  3. 1 article was excluded as it focused only on CTCA result interpretation.
  4. 15 articles were excluded as they did not use ML.
  5. 17 articles were excluded as they focused only on ECG interpretation.
  6. 2 articles were excluded as they did not focus on undifferentiated chest pain in the ED.

Following these exclusions, 23 studies remained for inclusion in our qualitative synthesis. There was no disagreement between the two reviewers as to study inclusion or results of data extraction. This process is summarised in a PRISMA Flow Diagram (Fig 1).

Study characteristics

A summary of the included studies is shown in Table 1. There were 14 retrospective studies and 5 prospective studies. Four studies reported the development of a machine learning model retrospectively then tested it prospectively. The most common machine learning methods used were artificial neural networks (ANN) (14 studies), random forest (RF) (6 studies), support vector machine (SVM) (5 studies), and gradient boosting (2 studies).

The most common primary outcomes assessed were diagnosis of acute myocardial infarction (AMI) (12 studies) and prognosis of major adverse cardiovascular events (MACE) (6 studies). Three studies looked at diagnosis of ACS (including unstable angina). The NSTEMI/STEMI paradigm formally replaced the Q-wave/non-Q-wave MI paradigm in 2000 [52]. This review identified 17 studies published since 2000, of which 4 studies excluded STEMI patients and 13 did not.

Sixteen studies used data from a single site and 7 studies used data from multiple sites. The largest number of sites was used by Than et al., an international collaboration using 12 cohorts of patients [27]. They developed their model using training data from 2 cohorts and then validated it using prospectively collected data from 7 cohorts.

The population size assessed varied greatly. The largest population was 85,254 patients, by Zhang et al., who used data collected from chest pain presentations to three hospitals between 2009 and 2018 [31]. The smallest population assessed was 228 patients, by Berikol et al. [36]. Fourteen studies used a population of under 1000, seven studies had a population of between 1001 and 10,000, and two studies used a population of over 10,000.

The most frequently used prediction variables were demographics (21 studies), past medical history including smoking status and family history (18 studies), and ECG result (17 studies). Troponin was used in 12 studies. Only one study (Than et al.) used serial troponins [27].

Laboratory tests other than troponin were used in 10 studies. Patient symptoms were used in 12 studies and examination findings were used in 8 studies.

The number of predictor variables used in the ML models varied. The median number of input variables used in the ML models was 23. Overall, 5 studies used models with 10 or fewer input variables, 8 studies used models with 11–30, 7 studies used models with 31–50, and 2 studies used models with more than 50 input variables. The number of variables was unknown in one study, and attempts to contact the study authors to clarify this were unsuccessful. The largest number of input variables was 95, by Chazaro et al., and the smallest was by Than et al., who used only 4 variables (age, gender, paired high-sensitivity troponins, and rate of change of high-sensitivity troponin) [27, 48]. Liu et al. found that their 3-variable model produced better results than their complete 23-variable model in predicting 3-day MACE (AUC of 0.812 vs AUC of 0.736), concluding that “more predictors do not necessarily guarantee better prediction result” [37].
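
The following sketch illustrates the kind of reduced-versus-full model comparison Liu et al. describe. The synthetic data, feature counts, classifier, and the use of univariate selection (a simple stand-in for their ML-based variable selection) are all assumptions for illustration, not their published pipeline.

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline

# Synthetic data in which only a few of 23 candidate predictors carry signal.
X, y = make_classification(n_samples=1000, n_features=23, n_informative=3,
                           random_state=0)

full = RandomForestClassifier(random_state=0)
reduced = make_pipeline(SelectKBest(f_classif, k=3),
                        RandomForestClassifier(random_state=0))

# Cross-validated AUC; a small model can match or beat the full model
# when most variables add noise rather than information.
for name, model in [("23-variable model", full), ("3-variable model", reduced)]:
    auc = cross_val_score(model, X, y, cv=5, scoring="roc_auc").mean()
    print(name, "mean AUC:", round(auc, 3))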

Diagnosis of AMI or ACS

Sixteen studies used ML algorithms to diagnose AMI or ACS in patients presenting to the ED with chest pain. Tsien et al. in 1998 and Harrison et al. in 2005 were the only authors to report that ML techniques did not outperform logistic regression; both also suggested that models appropriate for use in clinical practice may be developed with relatively few data items [43, 47]. Than et al. used ML to develop their “MI3 clinical support tool”, which achieved a high AUC (0.963) in diagnosing type 1 myocardial infarction in the index admission when prospectively validated, and achieved similar performance in early and late presenters [27]. Their algorithm incorporated paired high-sensitivity troponins collected at presentation and at another early, yet flexible, time point. Their MI3 clinical support tool was designed to be used as a continuous measure but could also be adapted to work in the current paradigm of low/high risk chest pain. Using an example low-risk threshold (69.5% of patients in their test set) they achieved a negative predictive value of 99.7% and sensitivity of 97.8%. At a high-risk threshold (10.6% of patients in their test set) they achieved a positive predictive value of 71.8% and specificity of 96.7%. At these thresholds their algorithm outperformed the European Society of Cardiology 0/3-hour pathway.
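
As an illustration of how a continuous ML output can be adapted to the low/high-risk paradigm, the sketch below computes sensitivity, specificity, PPV, and NPV at two example cut-offs. The synthetic scores and the threshold values (0.2 and 0.8) are assumptions, not the values used by Than et al.

import numpy as np

def operating_point(y_true, scores, threshold):
    # Sensitivity, specificity, PPV, and NPV when scores >= threshold
    # are called positive.
    pred = scores >= threshold
    tp = np.sum(pred & (y_true == 1))
    fp = np.sum(pred & (y_true == 0))
    fn = np.sum(~pred & (y_true == 1))
    tn = np.sum(~pred & (y_true == 0))
    return {"sens": tp / (tp + fn), "spec": tn / (tn + fp),
            "ppv": tp / (tp + fp), "npv": tn / (tn + fn)}

# Synthetic outcomes and a continuous model output correlated with them.
rng = np.random.default_rng(0)
y = rng.integers(0, 2, 1000)
scores = np.clip(0.4 * y + rng.normal(0.3, 0.15, 1000), 0, 1)

# A low rule-out cut-off prioritises sensitivity/NPV...
print("low-risk threshold:", operating_point(y, scores, 0.2))
# ...while a high rule-in cut-off prioritises specificity/PPV.
print("high-risk threshold:", operating_point(y, scores, 0.8))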

Prognosis (prediction of MACE and mortality)

In total, 7 studies used ML algorithms to predict the prognosis of patients presenting to the ED with chest pain. Six studies looked at composite prognostic outcomes (MACE) and 1 study (Zhang et al. 2020) looked separately at 30-day all-cause mortality and 30-day AMI following ED presentation [31]. Prognostication studies varied in the timeframes considered. The longest timeframe assessed was 90-day MACE, by Wu et al. [32]. Wu et al. used ML to select features for their risk stratification model, developing a full model that contained invasive (blood test) variables and a reduced model that contained only non-invasive variables. They also identified that, in their data, QTc prolongation was a potentially novel predictor of MACE. Their full model achieved an AUC of 0.853 and their reduced model achieved an AUC of 0.808.

The shortest timeframe considered was by Liu et al., who applied ML to select variables from 8 vital signs and 15 heart rate variability parameters to build a model to predict 3-day MACE [37]. Their top performing model contained only 3 variables and achieved an AUC of 0.812, outperforming the TIMI score (AUC 0.637) and the modified early warning score (AUC 0.622). Applying an arbitrary low/high risk cut-off score gave a sensitivity of 82.8% and specificity of 63.4%. The variables required for the model could be obtained quickly and non-invasively through collection of routine vital signs and a 5-minute ECG. In a subsequent paper, Liu et al. developed an ML score that again incorporated vital signs and ECG heart rate variability data to predict 30-day MACE. Their ML score achieved an AUC of 0.81, again outperforming the TIMI score (AUC 0.71) [38].

Zhang et al. used an ML algorithm based on demographic information, past medical history (PMHx), and laboratory tests to predict AMI and all-cause mortality within one month [31]. In prospective validation their RF model achieved an AUC of 0.907 for AMI within 1 month and an AUC of 0.888 for all-cause mortality within 1 month.

Than et al. conducted a pre-planned secondary analysis of their MI3 algorithm to assess its ability to predict patients who suffered an MI in the 30 days following discharge [27]. Their MI3 algorithm achieved an AUC of 0.957, and setting arbitrary low- and high-risk threshold values gave a sensitivity of 96.6% and a PPV of 71.9%, respectively.

McCullough et al. used an ANN to predict 30-day MACE based on demographics, PMHx, estrogen status (women only), patient symptoms, and the subjective physician initial assessment of the chest pain (assessed as either “Typical cardiac pain,” “Atypical cardiac pain,” or “Probable non-cardiac pain”) [40]. They developed prediction models for male and female patients. They found that adding the subjective physician assessment to their model improved the performance of the model more for male patients (average improvement of 5%) than for female patients (average improvement of 1.48%). When their model used all available features and was trained on all available data (male and female), it achieved an AUC of 0.9037 for females and 0.8552 for males. Training the model on only male data improved the AUC for males to 0.87.

Comparisons

The most frequently used comparator was logistic regression (6 studies). The HEART score was used as a comparator in 4 studies, and the TIMI score in 3 studies. Other comparators used included the European Society of Cardiology (ESC) 1-hour and 3-hour algorithms, the GRACE score, and the modified early warning score (MEWS). All existing chest pain risk stratification scores were outperformed by various ML models in all studies in which they were compared. Two studies compared the performance of various ML algorithms to one another. Zhang et al. compared RF, SVM, and KNN for AMI and mortality prognosis [31]. Ha et al. compared decision tree, SVM, and ANN to diagnose MI [39]. In both cases decision trees (including RF) outperformed the other ML algorithms. LR was used as a comparator in 6 studies; ML models outperformed LR in four of them. Tsien et al. and Harrison et al. reported that their ML models did not show increased performance when compared to logistic regression [43, 47]. Four studies compared ML to physicians. All four studies compared an ANN to a physician in the diagnosis of AMI. Chazaro et al. found the ED physician achieved greater sensitivity (87%) than the ANN (85%), however lower specificity (78% vs 91%) [48]. In the three other studies, the physician was outperformed by the ANN in all metrics [49–51]. Six studies did not include a comparator.
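
A minimal sketch of this kind of head-to-head comparison is shown below. The model families mirror those named above (RF, SVM, KNN, LR), but the synthetic data, default settings, and cross-validated AUC evaluation are illustrative assumptions rather than the published configurations.

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC

X, y = make_classification(n_samples=1500, n_features=14, random_state=0)

# Candidate models with default settings; real studies tune each of these.
models = {
    "RF": RandomForestClassifier(random_state=0),
    "SVM": SVC(random_state=0),
    "KNN": KNeighborsClassifier(),
    "LR": LogisticRegression(max_iter=1000),
}
for name, model in models.items():
    auc = cross_val_score(model, X, y, cv=5, scoring="roc_auc").mean()
    print(f"{name}: mean AUC = {auc:.3f}")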

Integration into practice

Only 2 studies reported integrating the ML model into clinical practice. In 2003 Hollander et al. evaluated consecutive ED patients with chest pain before and after the implementation of an ANN [44]. The treating emergency physicians were provided with real-time outputs of the neural network, which had previously achieved a sensitivity of 95% and specificity of 96% in diagnosing acute myocardial infarction. The implementation of the neural network did not significantly change admission decisions. There were only 2 patients (<1%) for whom the neural network output altered the physician disposition decision during real-time use. In a follow-up survey, 70% of physicians believed the neural network to be correct, and 52% had confidence in the network output. However, only 7% stated they used the network score in their decision making. The main reasons given for not using the neural network score in their decision making were that the data were “presented too late” and that the results “confirmed clinical suspicion but did not alter it”.

In 2020 Zhang et al. retrospectively developed an ML model for predicting MACE in 85,254 patients with chest pain in the EDs of three hospitals [31]. They used 14 clinical variables previously suggested to predict MACE, including demographics, PMHx (defined as diagnoses before the index visit), and high-sensitivity troponins. They found that a RF model using an oversampling approach outperformed SVM, KNN, and LR. After one month of testing and validation, the ML model was launched in their Hospital Information System to assist ED physicians with decision-making in real time. Prospective validation of the AI prediction model on new patients showed AUCs of 0.907 for AMI within 1 month and 0.888 for all-cause mortality within 1 month. Their model was able to automatically and rapidly capture the necessary variables (including high-sensitivity troponin) from their EHR when the physician requested the ML prediction. The authors acknowledge that they did not assess the impact of the ML prediction model on clinical practice, and that the impact on emergency physician decision making, change in clinical practice, and patient outcomes may need to be evaluated in future work.
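
A minimal sketch of oversampling a rare outcome class before training, in the spirit of (but not reproducing) Zhang et al.'s approach, is shown below. The class balance, feature count, and classifier settings are assumptions.

import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.utils import resample

# Imbalanced synthetic data: roughly 5% positives, mimicking a rare
# outcome such as 30-day mortality.
X, y = make_classification(n_samples=5000, n_features=14, weights=[0.95],
                           random_state=0)

# Oversample the minority class with replacement up to the majority size.
# In practice this should be applied to the training split only, never to
# the data used to evaluate the model.
X_pos, X_neg = X[y == 1], X[y == 0]
X_pos_up = resample(X_pos, replace=True, n_samples=len(X_neg), random_state=0)

X_bal = np.vstack([X_neg, X_pos_up])
y_bal = np.concatenate([np.zeros(len(X_neg)), np.ones(len(X_pos_up))])

clf = RandomForestClassifier(random_state=0).fit(X_bal, y_bal)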

Availability of code and dataset

Only 2 studies shared their datasets (Table 2). Conforti et al. provided a publicly available link to their dataset; however, this link no longer works [42]. Wu et al. stated that their dataset was available on reasonable request [32]. The code used for the ML models was not publicly available for any study. The ML model used by Than et al. is proprietary but is available from the authors for research purposes on request [27]. Chazaro et al. did not share their algorithm; however, they did provide the numeric values of the hyperparameters for the ANN that achieved their best results [48].

Table 2. Availability of dataset and code for included studies.

https://doi.org/10.1371/journal.pone.0252612.t002

Study quality—Risk of bias within and across studies

A summary of the PROBAST assessment is provided in Table 3. Overall, 16 studies were considered to have a high risk of bias and 4 a low risk of bias. Five studies had high applicability concerns and 7 studies had low applicability concerns. Only 3 studies were considered to have a low risk of bias and low applicability concerns. Only 4 studies externally validated their ML models. Only one study (Than et al.) refers to a previously published or registered protocol [27]. All but two studies report positive results for machine learning algorithms, which raises the question of whether reporting bias may be present.

Discussion

Interest and early work

This systematic review has found that there has been long-standing interest in the applications of ML to undifferentiated chest pain in the ED, and that ML techniques have achieved impressive results in both diagnostic and prognostic applications. These results could potentially relieve emergency physicians of diagnostic burden, deliver improved care to patients, and assist health systems to provide care with greater efficiency. Over the last decade there has been rapid growth in technological capability, digitalisation of information, and dataset size. ML has become increasingly powerful while also becoming more accessible. Models described by Baxt in 1990 that took up to 48 hours to train would train in seconds today [14].

Compared to physicians and current standard of care

Pioneering work by Baxt in the 1990s found that “the non-linear artificial neural network performs more accurately than either physicians or other computer-based paradigms” [53]. Despite this, relatively few studies compared ML to physicians, and no study since 1998 has directly compared ML to physicians for the diagnosis or prognosis of undifferentiated chest pain in the ED. More recent studies that included comparisons have instead compared ML to current risk stratification tools such as the TIMI and HEART scores. Though the HEART score is routinely used in clinical practice, there is emerging evidence that it may not perform better than clinical gestalt in certain clinical scenarios [54]. As ML tools become integrated into practice, it will remain important to compare ML tools to physicians.

Small datasets

ML model performance tends to improve as dataset and model size increase [55]. Large high-quality clinical datasets are difficult to obtain, and their size is limited by the number of patient presentations. There is a trend to supplement real datasets with realistic, synthetically generated data. This allows for arbitrarily large datasets with correspondingly improved model performance. Class imbalance is also a common problem, with some data classes being abundant and other classes, such as mortality, expectedly being rare. New DL techniques have been developed to address this problem [56].

Reported ML architectures used by the studies in this review remain small compared to state-of-the-art architectures used in other fields, and the vast majority of datasets used were very small by modern machine learning standards. For perspective, state-of-the-art computer vision models are often trained on a dataset containing more than 14 million images [20]. A recently developed natural language processing algorithm (GPT-3) was trained on 499 billion tokens [22]. Rajkomar et al. predicted mortality by training on a dataset containing over 216,000 patients and over 46 billion data points [25]. At these scales, training cost becomes a significant consideration and is prohibitively expensive for many researchers. Though training large models may be slow and expensive, after training, predictions can be delivered rapidly using much less computational power, such as that found in standard computers or mobile telephones. Zhang et al. reported that the time taken to generate prediction results after the ED physician clicked the appropriate button was <1 second [31]. Large models could be developed and trained by researchers with the appropriate resources and then, if made publicly available, adapted to and validated on local data, reducing training time and cost. This may be especially important in low-resource settings.

Omitted data categories

Multiple studies achieved impressive results despite not including some data routinely used by emergency physicians in the evaluation of undifferentiated chest pain. Almost half (11/23) of the studies assessed did not take into account patients' symptoms. Incorporating unstructured data into datasets remains a challenge. Interpretations of echocardiogram and ECG data were used in all datasets that included them. No studies used deep learning to incorporate unstructured image or ECG data, and no studies applied natural language processing to incorporate free-text clinical notes. Interestingly, no studies incorporated chest x-rays, though they are routinely used in the work-up of undifferentiated chest pain in the ED.

McCullough et al. conducted the only study that included emergency physician impression as an input in an ML algorithm [40]. It is perhaps reassuring that including the emergency physician impression improved their model's results; however, interestingly, the results were improved more for male than for female patients. Previous work has suggested male and female patients with chest pain may be treated differently [57]. It is unknown if their result is a reflection of this disparity. Their model achieved strong results for female patients without the inclusion of the emergency physician assessment. It is interesting to consider where the emergency physician is left if future studies find they are outperformed by an ML model and the inclusion of their subjective assessment is not found to improve the model. The future role of the emergency physician may move from diagnosis of undifferentiated cases to interpreting and communicating results to patients and participating in shared decision making. It is unlikely that ML models will be able to encroach on the emergency physician's many other roles, including resuscitation, practical skills, and team management.

Predictor variables

As is common in ML research, multiple studies experimented with different numbers of input variables and found that more variables did not necessarily improve results, or that the addition of more variables only marginally improved performance. Liu et al. astutely suggested that a simple model using non-invasive variables could play a role in patient triage [37]. ML also showed potential to identify and incorporate novel risk variables such as heart rate variability parameters and the corrected QT interval on ECG [34]. Troponin is an important component of the universal definition of MI [3]. The study cohorts were patients presenting with a symptom of myocardial ischemia (chest pain), and so all those with a rise and/or fall of troponin values (with at least 1 value above the 99th percentile) will meet current definitions for MI. Including a variable used in the definition of MI as an input in an ML model to predict MI is problematic and will likely lead to optimistic estimates of model performance [30]. In many cases, initial troponin measurements are likely to have formed part of the information used to determine the outcome.
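
To illustrate this leakage problem, the following sketch deliberately constructs a synthetic "troponin" that defines the outcome and then feeds it to the model, inflating the apparent AUC relative to a model without it. All variables, distributions, and the 95th-percentile cut-off are illustrative assumptions (the other "clinical" variables are pure noise, so the contrast is exaggerated).

import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n = 4000
other = rng.normal(size=(n, 5))        # other clinical variables (noise here)
troponin = rng.lognormal(mean=0.0, sigma=1.0, size=n)

# The outcome is defined directly from the troponin value, echoing how
# MI is partly defined by troponin elevation (cut-off arbitrary here).
y = (troponin > np.percentile(troponin, 95)).astype(int)

for label, X in [("with troponin", np.column_stack([other, troponin])),
                 ("without troponin", other)]:
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)
    clf = RandomForestClassifier(random_state=0).fit(X_tr, y_tr)
    auc = roc_auc_score(y_te, clf.predict_proba(X_te)[:, 1])
    print(label, "AUC:", round(auc, 3))  # leaked variable -> near-perfect AUC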

Human interpretability

Though there are differing opinions, it is generally accepted that ML model output will need to be interpretable to be accepted and used in the health care setting [58]. There is now considerable research focusing on developing “explainable AI” [59]. No studies provided a human-interpretable output of the diagnostic reasoning of their algorithms alongside their output. Than et al. developed an app mock-up that makes the results human-understandable [27]. This is an important step in the communication of results but does not provide a glimpse into the ‘black box’ that is the algorithm. Given the size, complexity, and level of abstraction of the underlying models, interpretation is generally infeasible [24]. It may not be possible to achieve any more than an illusion of understanding. However, emergency physicians routinely prescribe medications with unclear mechanisms of action but for which there is robust safety and efficacy data [60]. If an ML model consistently demonstrates predictable accuracy and safety in a wide variety of circumstances, it may be accepted despite remaining a ‘black box’.

Human factors affecting model implementation

Few studies considered the human factors that are involved in the implementation of ML algorithms into practice. Hollander et al. provided an important singular example of a study that evaluated the effect of algorithm implementation on clinical decision making, showing that despite implementing an ANN that was previously reported to outperform clinicians, few physicians used it and it did not change clinical practice. New ML-based diagnostic and prognostic technologies may be rejected by emergency physicians, especially if the results are not timely and do not change management [44]. Physicians are likely to remain skeptical of an unexplainable black box. There has also been no evaluation of ED patients' attitudes and opinions on the use of ML in their care. Achieving physician and patient acceptance of ML technologies will likely require deep consideration of the human factors involved.

Ethical and legal issues

Zhang et al. point out that the implementation of ML prediction models in healthcare raises ethical and legal issues, including malpractice liability for both technology manufacturers and emergency physicians [31]. There is justified concern that important decisions could be based on the output of an algorithm that is not, or fundamentally cannot be, understood by a human [61]. Current legal doctrine is likely to be inadequate to address ML-related medical malpractice [62].

Sensitivity and specificity

Physician, patient, and institutional risk tolerances differ. Achieving higher sensitivity at the expense of lower specificity will lead to more false positives, and the resulting over-investigation of these cases can paradoxically cause more harm than if the test had not been conducted. The concept of a ‘test threshold’ identifies the point at which the risks of harm from false positive tests equal the risks of not testing [63]. Patients with risk below the test threshold do not benefit from further testing. This leads to a mathematically optimal miss rate. Kline et al. estimated that attempting to achieve a miss rate of under 2% for investigating patients with suspected cardiac chest pain may cause more harm through over-investigation [64]. This may not be the miss rate that clinicians are comfortable adopting, and physicians may be doing more harm than good by adopting unrealistically low miss rates for low-risk patients presenting with chest pain [65]. It remains to be seen if ML can solve this dilemma.
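
The underlying arithmetic can be sketched with a simplified threshold expression (after Pauker and Kassirer [63]; the full test threshold also accounts for the test's sensitivity, specificity, and direct risks, so this form is illustrative rather than complete):

p* = H / (H + B)

where H is the expected net harm of acting on a false positive (for example, over-investigation) and B is the expected net benefit of acting on a true positive. Patients whose pretest probability of disease falls below p* are, on average, harmed more than helped by further action.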

Implementation

Despite over 30 years of promising results, integration of ML algorithms into widespread clinical practice is yet to occur. Heterogeneity amongst healthcare systems is likely a significant barrier. Zhang et al. were able to deploy their model into practice but note that, while providing a proof of concept, the model may not be generalisable to other hospitals [31]. They suggest that re-training and testing in other hospitals could overcome this issue. A mock-up app developed by Than et al. shows thoughtful consideration of how a centralised ML algorithm could be used in a low-resource setting, and how the results may be presented through a phone application to both physicians (diagnostic metrics) and patients (graphical format) [27]. Implementation of ML algorithms will require health system monitoring, oversight, and the development of algorithm stewardship frameworks to ensure that algorithms are used safely, effectively, and fairly in diverse patient populations [66].

ML reproducibility crisis, algorithm ownership

Reproducibility is a foundation of the scientific method. There is growing recognition that ML research is suffering from a reproducibility crisis [67]. This review found that few studies publicly shared their code or dataset. Furthermore, in many studies methodological details were insufficiently documented to allow for replication. Recent medical ML studies have been criticised for lacking sufficiently detailed methods and for not sharing data, algorithm code, or details of the computational environment that generated the published findings [68]. Sharing of data and code is widely viewed as important, and the lack of such sharing undermines the scientific value of the research [68]. Previously identified barriers to transparent and reproducible ML research include the privacy and ethical implications of sharing patient data, and the economic disincentives of sharing proprietary models [69].

Despite facing similar privacy challenges, the biomedical literature has shown some improvements in certain key indicators of reproducibility and transparency, and clear, detailed, and enforced guidelines have allowed genomics researchers to share complex computational pipelines and sensitive datasets [68–71]. Solutions may involve creating a research culture that favours openness and replication, demonstration of models on public datasets, or enabling independent investigators to access the data and verify the analysis prior to publication [68]. No studies identified by this review were replication studies. Ongoing effort is required to manage the tension between patient privacy, open science, and private enterprise.

Future direction

There have not been any randomised clinical trials comparing an ML algorithm to physicians or current risk scoring tools for the risk stratification of chest pain. No studies have evaluated for a change in patient-orientated outcomes following the implementation of an ML algorithm into clinical practice. It remains essential to assess the impact these tools have on clinical decision making. ML algorithms have the potential to both decrease and increase bias, and any future implementation must be conscious of this and develop appropriate algorithm stewardship frameworks [66]. There is significant scope to incorporate further input variables into machine learning models, including physician assessment, free-text clinical notes, raw ECG data, point-of-care echocardiogram, and chest x-ray. There will likely be an increasing emphasis on model explainability, though it should be remembered that this may only give the illusion of understanding through abstraction of the underlying complexity. Despite using broad search terms such as “Chest Pain”, all studies included in this review focused only on MI/ACS and MACE. No studies attempted to diagnose other life-threatening causes of undifferentiated chest pain (such as pulmonary embolism or aortic dissection). Future research may attempt to broaden the scope of ML in undifferentiated chest pain.

Patients with acute coronary artery occlusion benefit from emergent reperfusion therapy [72]. Currently these patients are mainly identified by the presence of ST-elevation on ECG. There is a subset of patients with acute coronary artery occlusion who are not identified by the STEMI/NSTEMI paradigm [72]. While some studies used angiogram results as part of their outcome definition, no studies have attempted to identify patients with acute coronary artery occlusion. Future studies may use ML to attempt to identify patients who have acute coronary artery occlusion but who do not meet current STEMI criteria.

Limitations

Limitations—Study level

There are a number of limitations to this review. The majority (87%) of included studies were assessed to have either a high risk of bias or high applicability concerns, and their results may not be generalisable to other settings. The majority of studies were also single centre, retrospective, and without prospective or external validation. The definition of MI and the biomarkers used to define MI have also changed over time. The extended timeframe of this review means that many studies were conducted before the introduction of high-sensitivity troponins, and so results from earlier studies may not be applicable to the modern setting. Since STEMI was introduced into the definition of MI in 2000, few studies (4/17) have excluded patients with STEMI. The clinical usefulness and applicability of ML scores to patients with STEMI is likely very low, as these patients are often quickly identified on the basis of the ECG alone, and there are well-established existing treatment pathways for them (emergency reperfusion). There was inconsistent reporting of methods and results among studies. ML reporting guidelines are not well established or adhered to, though efforts are ongoing to change this [73–75].

Limitations—Review level

Publication bias is known to be widespread in the medical literature. While there is no empirical evidence that it is present in ML research, it is likely to be present, as in other fields of research. All but two studies reported positive results for machine learning. Despite significant effort to develop broad and relevant search terms, some relevant research may be published under terms not included in the search. The search strategy also excluded abstracts and non-English articles. Quantitative synthesis was not performed due to a high level of study heterogeneity. Although this was expected and outlined in the research protocol, it means that this review does not provide a high level of evidence for the use of ML in undifferentiated chest pain. Machine learning is an evolving concept without a precise and universally accepted definition. Some definitions of ML include logistic regression; however, following common usage, this review did not consider it an ML technique.

Conclusion

Research on applications of ML for undifferentiated chest pain in the ED has been ongoing for decades. ML has been reported to outperform emergency physicians and current risk stratification tools to diagnose AMI and predict MACE but has rarely been integrated into practice. Many studies assessing the use of ML in undifferentiated chest pain in the ED have a high risk of bias. It is important that future studies make use of recently developed standardised ML reporting guidelines, register their protocols, and share their datasets and code. Future work is required to assess the impact of ML model implementation on clinical decision making, patient-orientated outcomes, and patient and physician acceptability.

Acknowledgments

We thank University of Western Australia librarian Ms. Karen Jones for her assistance in the development of our search strategy.

References

  1. Croskerry P. The Cognitive Imperative: Thinking about How We Think. Academic Emergency Medicine. 2000 Nov;7(11):1223–31. pmid:11073470
  2. Australian Institute of Health and Welfare. Emergency department care 2017–18: Australian hospital statistics [Internet]. Canberra, ACT: Australian Government; 2018. https://www.aihw.gov.au/reports/hospitals/emergency-department-care-2017-18/contents/table-of-contents
  3. Thygesen K, Alpert JS, Jaffe AS, Chaitman BR, Bax JJ, Morrow DA, et al. Fourth Universal Definition of Myocardial Infarction (2018). Circulation. 2018 Nov 13;138(20):e618–51. pmid:30571511
  4. Kumar A, Cannon CP. Acute Coronary Syndromes: Diagnosis and Management, Part I. Mayo Clinic Proceedings. 2009 Oct;84(10):917–38. pmid:19797781
  5. Kohn MA, Kwan E, Gupta M, Tabas JA. Prevalence of acute myocardial infarction and other serious diagnoses in patients presenting to an urban emergency department with chest pain. J Emerg Med. 2005 Nov;29(4):383–90. pmid:16243193
  6. Anderson JL, Adams CD, Antman EM, Bridges CR, Califf RM, Casey DE Jr, et al; American College of Cardiology; American Heart Association Task Force on Practice Guidelines (Writing Committee to Revise the 2002 Guidelines for the Management of Patients With Unstable Angina/Non-ST-Elevation Myocardial Infarction); American College of Emergency Physicians; Society for Cardiovascular Angiography and Interventions; Society of Thoracic Surgeons; American Association of Cardiovascular and Pulmonary Rehabilitation; Society for Academic Emergency
  7. Swap CJ, Nagurney JT. Value and limitations of chest pain history in the evaluation of patients with suspected acute coronary syndromes [published correction appears in JAMA. 2006 May 17;295(19):2250]. JAMA. 2005;294(20):2623–2629. pmid:16304077
  8. Antman EM, Cohen M, Bernink PJLM, McCabe CH, Hoacek T, Papuchis G, et al. The TIMI risk score for unstable angina/non-ST elevation MI: a method for prognostication and therapeutic decision making. JAMA. 2000;284(7):835–42. pmid:10938172
  9. Six AJ, Backus BE, Kelder JC. Chest pain in the emergency room: value of the HEART score. Neth Heart J. 2008;16(6):191–6. pmid:18665203
  10. Fernando SM, Tran A, Cheng W, Rochwerg B, Taljaard M, Thiruganasambandamoorthy V, et al. Prognostic Accuracy of the HEART Score for Prediction of Major Adverse Cardiac Events in Patients Presenting With Chest Pain: A Systematic Review and Meta-analysis. Acad Emerg Med. 2019 Feb;26(2):140–151. Epub 2018 Nov 29. pmid:30375097
  11. Moy E, Barrett M, Coffey R, et al. Missed diagnoses of acute myocardial infarction in the emergency department: variation by patient and facility characteristics. AHRQ Patient Safety Network. https://psnet.ahrq.gov/resources/resource/28747/missed-diagnoses-of-acute-myocardial-infarction-in-the-emergency-department-variation-by-patient-and-facility-characteristics. Accessed June 19, 2016. pmid:29540019
  12. The Lancet. Artificial intelligence in health care: within touching distance. Lancet. 2017 Dec 23;390(10114):2739. pmid:29303711
  13. Stewart J, Sprivulis P, Dwivedi G. Artificial intelligence and machine learning in emergency medicine. Emerg Med Australas. 2018 Dec;30(6):870–874. Epub 2018 Jul 16. pmid:30014578
  14. Baxt WG. Use of an Artificial Neural Network for Data Analysis in Clinical Decision-Making: The Diagnosis of Acute Coronary Occlusion. Neural Computation. 1990 Dec;2(4):480–9.
  15. Oxford. Artificial Intelligence. English Oxford Living Dictionaries. 2018. [Cited 16 May 2018.] https://en.oxforddictionaries.com/definition/artificial_intelligence
  16. Jiang F, Jiang Y, Zhi H, Dong Y, Li H, Ma S, et al. Artificial intelligence in healthcare: past, present and future. Stroke and Vascular Neurology. 2017 Dec;2(4):230–43. pmid:29507784
  17. Murphy KP. Machine Learning: A Probabilistic Perspective, Vol. 25. Cambridge, MA: MIT Press; 2012.
  18. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015 May 28;521(7553):436–44. pmid:26017442
  19. Khan S, Rahmani H, Shah SAA, Bennamoun M. A Guide to Convolutional Neural Networks for Computer Vision. Synthesis Lectures on Computer Vision. 2018 Feb 13;8(1):1–207.
  20. Russakovsky O, Deng J, Su H, et al. ImageNet large scale visual recognition challenge. Int J Comput Vis. 2015;115:211–52.
  21. Xiong W, Droppo J, Huang X, Seide F, Seltzer M, Stolcke A, et al. Achieving Human Parity in Conversational Speech Recognition. arXiv:1610.05256 [cs, eess]. 2017 Feb 17 [cited 2020 Dec 8]. http://arxiv.org/abs/1610.05256
  22. Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J, Dhariwal P, et al. Language Models are Few-Shot Learners. arXiv:2005.14165 [cs]. 2020 Jul 22. http://arxiv.org/abs/2005.14165
  23. Sejnowski TJ. The unreasonable effectiveness of deep learning in artificial intelligence. Proc Natl Acad Sci USA. 2020 Dec;117(48):30033–30038. pmid:31992643
  24. Hinton G. Deep Learning—A Technology With the Potential to Transform Health Care. JAMA. 2018 Sep 18;320(11):1101–1102. pmid:30178065
  25. Rajkomar A, Oren E, Chen K, et al. Scalable and accurate deep learning with electronic health records. npj Digital Medicine. 2018;1:18. pmid:31304302
  26. Shashikumar SP, Stanley MD, Sadiq I, et al. Early sepsis detection in critical care patients using multiscale blood pressure and heart rate dynamics. J Electrocardiol. 2017;50:739–43. pmid:28916175
  27. Than MP, Pickering JW, Sandoval Y, Shah ASV, Tsanas A, Apple FS, et al. Machine Learning to Predict the Likelihood of Acute Myocardial Infarction. Circulation. 2019 Aug 16;140(11):899–909. pmid:31416346
  28. Moher D, Shamseer L, Clarke M, Ghersi D, Liberati A, Petticrew M, Shekelle P, Stewart LA. Preferred Reporting Items for Systematic Review and Meta-Analysis Protocols (PRISMA-P) 2015 statement. Syst Rev. 2015;4(1):1. pmid:25554246
  29. Moher D, Liberati A, Tetzlaff J, Altman DG; PRISMA Group. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med. 2009 Jul 21;6(7):e1000097. pmid:19621072
  30. Wolff RF, Moons KGM, Riley RD, Whiting PF, Westwood M, Collins GS, et al. PROBAST: A Tool to Assess the Risk of Bias and Applicability of Prediction Model Studies. Ann Intern Med. 2019 Jan 1;170(1):51–58. pmid:30596875
  31. Zhang PI, Hsu CC, Kao Y, Chen CJ, Kuo YW, Hsu SL, et al. Real-time AI prediction for major adverse cardiac events in emergency department patients with chest pain. Scand J Trauma Resusc Emerg Med. 2020 Sep 11;28(1):93. pmid:32917261
  32. Wu CC, Hsu WD, Wang YC, Kung WM, Tzeng IS, Huang CW, et al. An Innovative Scoring System for Predicting Major Adverse Cardiac Events in Patients With Chest Pain Based on Machine Learning. IEEE Access. 2020;8:124076–83.
  33. Mao HF, Chen XH, Li YM, Zhang SY, Mo JR, Li M, et al. A new risk stratification score for patients with suspected cardiac chest pain in emergency departments, based on machine learning. Chin Med J (Engl). 2020 Apr 5;133(7):879–880. pmid:32097209
  34. Wu CC, Hsu WD, Islam MM, Poly TN, Yang HC, Nguyen PA, et al. An artificial intelligence approach to early predict non-ST-elevation myocardial infarction patients with chest pain. Comput Methods Programs Biomed. 2019 May;173:109–117. Epub 2019 Jan 31. pmid:31046985
  35. Liu N, Sakamoto JT, Cao J, Koh ZX, Ho AFW, Lin Z, et al. Ensemble-based risk scoring with extreme learning machine for prediction of adverse cardiac events. Cogn Comput. 2017;9(4):545–554.
  36. Berikol GB, Yildiz O, Özcan IT. Diagnosis of Acute Coronary Syndrome with a Support Vector Machine. J Med Syst. 2016 Apr;40(4):84. Epub 2016 Jan 27. pmid:26815338
  37. Liu N, Koh ZX, Goh J, Lin Z, Haaland B, Ting BP, et al. Prediction of adverse cardiac events in emergency department patients with chest pain using machine learning for variable selection. BMC Med Inform Decis Mak. 2014 Aug 23;14:75. pmid:25150702
  38. Liu N, Lee MA, Ho AF, Haaland B, Fook-Chong S, Koh ZX, et al. Risk stratification for prediction of adverse coronary events in emergency department chest pain patients with a machine learning score compared with the TIMI score. Int J Cardiol. 2014 Dec 20;177(3):1095–7. Epub 2014 Oct 7. pmid:25449521
  39. Ha SH, Joo SH. A Hybrid Data Mining Method for the Medical Classification of Chest Pain. International Journal of Computer, Electrical, Automation, Control and Information Engineering. 2010;4(1):6.
  40. McCullough CL, Novobilski AJ, Fesmire FM. Use of Neural Networks to Predict Adverse Outcomes from Acute Coronary Syndrome for Male and Female Patients. In: Sixth International Conference on Machine Learning and Applications (ICMLA 2007). Cincinnati, OH, USA: IEEE; 2007. p. 512–7.
  41. Green M, Björk J, Forberg J, Ekelund U, Edenbrandt L, Ohlsson M. Comparison between neural networks and multiple logistic regression to predict acute coronary syndrome in the emergency room. Artif Intell Med. 2006 Nov;38(3):305–18. Epub 2006 Sep 7. pmid:16962295
  42. Conforti D, Guido R. Kernel-based support vector machine classifiers for early detection of myocardial infarction. Optimizat Methods Softw. 2005;20:401–13.
  43. Harrison RF, Kennedy RL. Artificial neural network models for prediction of acute coronary syndromes using clinical data from the time of presentation. Ann Emerg Med. 2005 Nov;46(5):431–9. pmid:16271675
  44. Hollander JE, Sease KL, Sparano DM, Sites FD, Shofer FS, Baxt WG. Effects of neural network feedback to physicians on admit/discharge decision for emergency department patients with chest pain. Ann Emerg Med. 2004 Sep;44(3):199–205. pmid:15332058
  45. Baxt WG, Shofer FS, Sites FD, Hollander JE. A neural network aid for the early diagnosis of cardiac ischemia in patients presenting to the emergency department with chest pain. Ann Emerg Med. 2002 Dec;40(6):575–83. pmid:12447333
  46. Baxt WG, Shofer FS, Sites FD, Hollander JE. A neural computational aid to the diagnosis of acute myocardial infarction. Ann Emerg Med. 2002 Apr;39(4):366–73. pmid:11919522
  47. Tsien CL, Fraser HS, Long WJ, Kennedy RL. Using classification tree and logistic regression methods to diagnose myocardial infarction. Stud Health Technol Inform. 1998;52 Pt 1:493–7. pmid:10384505
  48. Chazaro A, Cravens G, Eberhart R. Myocardial infarction diagnosis by a neural network. In: Proceedings of the 20th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. 1998 Nov;20:1121–4.
  49. Kennedy RL, Harrison RF, Burton AM, Fraser HS, Hamer WG, MacArthur D, et al. An artificial neural network system for diagnosis of acute myocardial infarction (AMI) in the accident and emergency department: evaluation and comparison with serum myoglobin measurements. Comput Methods Programs Biomed. 1997 Feb;52(2):93–103. pmid:9034674
  50. Baxt WG, Skora J. Prospective validation of artificial neural network trained to identify acute myocardial infarction. Lancet. 1996 Jan 6;347(8993):12–5. pmid:8531540
  51. Baxt WG. Use of an artificial neural network for the diagnosis of myocardial infarction. Ann Intern Med. 1991 Dec 1;115(11):843–8. Erratum in: Ann Intern Med 1992 Jan 1;116(1):94. pmid:1952470
  52. Alpert JS, Thygesen K, Antman E, Bassand JP. Myocardial infarction redefined—a consensus document of The Joint European Society of Cardiology/American College of Cardiology Committee for the redefinition of myocardial infarction. J Am Coll Cardiol. 2000 Sep;36(3):959–69. Erratum in: J Am Coll Cardiol 2001 Mar 1;37(3):973. pmid:10987628
  53. Baxt WG. Complexity, chaos and human physiology: the justification for non-linear neural computational analysis. Cancer Lett. 1994 Mar 15;77(2–3):85–93. pmid:8168070
  54. Wang G, Zheng W, Wu S, Ma J, Zhang H, Zheng J, et al. Comparison of usual care and the HEART score for effectively and safely discharging patients with low-risk chest pain in the emergency department: would the score always help? Clin Cardiol. 2020 Apr;43(4):371–378. Epub 2019 Dec 23. pmid:31867780
  55. Hestness J, Narang S, Ardalani N, Diamos G, Jun H, Kianinejad H, et al. Deep Learning Scaling is Predictable, Empirically. arXiv:1712.00409 [cs, stat]. 2017 Dec 1 [cited 2020 Dec 7]. http://arxiv.org/abs/1712.00409
  56. Khan SH, Hayat M, Bennamoun M, Sohel FA, Togneri R. Cost-Sensitive Learning of Deep Feature Representations From Imbalanced Data. IEEE Trans Neural Netw Learning Syst. 2018 Aug;29(8):3573–87. pmid:28829320
  57. Langabeer JR 2nd, Champagne-Langabeer T, Fowler R, Henry T. Gender-based outcome differences for emergency department presentation of non-STEMI acute coronary syndrome. Am J Emerg Med. 2019 Feb;37(2):179–182. Epub 2018 May 7. pmid:29754965
  58. Ahmad MA, Eckert C, Teredesai A. Interpretable Machine Learning in Healthcare. In: Proceedings of the 2018 ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics. New York, NY, USA: ACM; 2018. p. 559–560. https://doi.org/10.1145/3233547.3233667
  59. Adadi A, Berrada M. Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI). IEEE Access. 2018;6:52138–60.
  60. Przybyła GW, Szychowski KA, Gmiński J. Paracetamol—An old drug with new mechanisms of action. Clin Exp Pharmacol Physiol. 2020 Aug 7. Epub ahead of print. pmid:32767405
  61. Shortliffe EH, Sepúlveda MJ. Clinical Decision Support in the Era of Artificial Intelligence. JAMA. 2018 Dec 4;320(21):2199. pmid:30398550
  62. Sullivan HR, Schweikart SJ. Are Current Tort Liability Doctrines Adequate for Addressing Injury Caused by AI? AMA J Ethics. 2019 Feb 1;21(2):E160–166. pmid:30794126
  63. Pauker SG, Kassirer JP. The threshold approach to clinical decision making. N Engl J Med. 1980;302:1109–17. pmid:7366635
  64. Kline JA, Johnson CL, Pollack CV Jr, et al. Pretest probability assessment derived from attribute matching. BMC Med Inform Decis Mak. 2005;5:26. pmid:16095534
  65. Than M, Herbert M, Flaws D, Cullen L, Hess E, Hollander JE, et al. What is an acceptable risk of major adverse cardiac event in chest pain patients soon after discharge from the Emergency Department?: a clinical survey. Int J Cardiol. 2013 Jul 1;166(3):752–4. Epub 2012 Oct 22. pmid:23084108
  66. Eaneff S, Obermeyer Z, Butte AJ. The Case for Algorithmic Stewardship for Artificial Intelligence and Machine Learning Technologies. JAMA. 2020 Oct 13;324(14):1397. pmid:32926087
  67. Hutson M. Artificial intelligence faces reproducibility crisis. Science. 2018 Feb 16;359(6377):725–6. pmid:29449469
  68. Haibe-Kains B, Adam GA, Hosny A, Khodakarami F; Massive Analysis Quality Control (MAQC) Society Board of Directors, Waldron L, Wang B, et al. Transparency and reproducibility in artificial intelligence. Nature. 2020 Oct;586(7829):E14–E16. Epub 2020 Oct 14. pmid:33057217
  69. Stodden V, McNutt M, Bailey DH, Deelman E, Gil Y, Hanson B, et al. Enhancing reproducibility for computational methods. Science. 2016 Dec 9;354(6317):1240–1. pmid:27940837
  70. Wallach JD, Boyack KW, Ioannidis JPA. Reproducible research practices, transparency, and open access data in the biomedical literature, 2015–2017. PLoS Biol. 2018;16:e2006930. pmid:30457984
  71. Amann RI, et al. Toward unrestricted use of public genomic data. Science. 2019;363:350–352. pmid:30679363
  72. Khan AR, Golwala H, Tripathi A, Bin Abdulhak AA, Bavishi C, Riaz H, et al. Impact of total occlusion of culprit artery in acute non-ST elevation myocardial infarction: a systematic review and meta-analysis. Eur Heart J. 2017 Nov 1;38(41):3082–3089. pmid:29020244
  73. Moons KG, Altman DG, Reitsma JB, Ioannidis JP, Macaskill P, Steyerberg EW, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015 Jan 6;162(1):W1–73. pmid:25560730
  74. Luo W, Phung D, Tran T, Gupta S, Rana S, Karmakar C, et al. Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View. J Med Internet Res. 2016 Dec 16;18(12):e323. pmid:27986644
  75. Yusuf M, Atal I, Li J, et al. Reporting quality of studies using machine learning models for medical diagnosis: a systematic review. BMJ Open. 2020;10:e034568. pmid:32205374