Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Transitions in intensive care: Investigating critical slowing down post extubation

  • Lucinda Khalil ,

    Contributed equally to this work with: Lucinda Khalil, Sandip V. George

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Visualization, Writing – original draft

    Affiliation Department of Mathematics, Imperial College London, London, United Kingdom

  • Sandip V. George ,

    Contributed equally to this work with: Lucinda Khalil, Sandip V. George

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Supervision, Writing – original draft, Writing – review & editing

    sandip.george@ucl.ac.uk

    Affiliations Department of Computer Science, University College London, London, United Kingdom, Department of Physics, University of Aberdeen, Aberdeen, United Kingdom

  • Katherine L. Brown,

    Roles Methodology, Supervision, Writing – review & editing

    Affiliation Cardiac Intensive Care Unit, Great Ormond Street Hospital For Children NHS Foundation Trust, London, United Kingdom

  • Samiran Ray,

    Roles Data curation, Investigation, Methodology, Supervision, Writing – review & editing

    Affiliation Paediatric Intensive Care Unit, Great Ormond Street Hospital For Children NHS Foundation Trust, London, United Kingdom

  • Simon Arridge

    Roles Conceptualization, Methodology, Supervision, Writing – review & editing

    Affiliation Department of Computer Science, University College London, London, United Kingdom

Abstract

Complex biological systems undergo sudden transitions in their state, which are often preceded by a critical slowing down of dynamics. This results in longer recovery times as systems approach transitions, quantified as an increase in measures such as the autocorrelation and variance. In this study, we analysed paediatric patients in intensive care for whom mechanical ventilation was discontinued through removal of the endotracheal tube (extubation). Some patients failed extubation, and required a re-intubation within 48 hours. We investigated whether critical slowing down could be observed post failed extubations, prior to re-intubation. We tested for significant increases (p <.05) between extubation and re-intubation, in the variance and autocorrelation, over the time series data of heart rate, respiratory rate and mean blood pressure. The autocorrelation of the heart rate showed a significantly higher proportion of increases in the group that failed extubation, compared who those who did not. It also showed a significantly higher magnitude of increase for the failed extubation group in a t-test. Moreover, incorporating these magnitudes significantly improved the fit of a logistic regression model when compared to a model that solely used the mean and standard deviation of the vital signs. While immediate clinical utility is limited, the work marks an important first step towards using dynamical systems theory to understand the dynamics of signals measured at the bedside during intensive care.

1 Introduction

Dynamical systems undergo a variety of state changes due to external perturbations or internal changes. Many of these are difficult to predict from the mean response of the system. Prior to certain types of state changes in closed continuous systems, called critical transitions, a phenomenon called critical slowing down (CSD) is observed [1, 2]. CSD is characterised by an increased relaxation time in the dynamics of the system, and can be measured as increases in quantifiers such as the variance and autocorrelation over time [3, 4]. These metrics which characterize this increased relaxation time due to CSD are called early warning signals (EWS).

CSD has been retrospectively observed prior to transitions in ecology [57], astronomy [8], finance [9], neuroscience [10] and medicine [11, 12], amongst others. In the context of medicine, slowing down of recovery has been suggested as a sign of proximity to a tipping point [13]. This has been observed in contexts such as blood pressure regulation, where slow recovery from changes in blood pressure has been linked to increased risk of mortality or of ischemic stroke [1316] and in seizure prediction, where CSD has been reported in computational, in-vitro [17, 18], as well as in real data [19]. Critical slowing down is also thought to precede transitions in psychiatric disorders, including bipolar disorder and depression [2022].

An area where predicting transitions is of paramount importance is in healthcare, where decisions based on these predictions can have consequences on patient well-being. This is most evident in the data rich environment of critical care, where patients are monitored using multiple-modalities continuously, and where significant deterioration events are more frequent. The response of the cardio-pulmonary system and the effects of ventilatory support are often studied using the framework of nonlinear dynamical systems [23, 24], and hence events such as sudden deterioration in health or onsets of arrythmia can be studied as tipping points [12, 25]. These could possibly be anticipated due to CSD, using EWS such as increases in autocorrelation or variance [1].

About 40% of patients in the Intensive Care Unit (ICU) require mechanical ventilation at any point in time [26], with a slightly higher number reported (50%) in the UK (https://www.picanet.org.uk/annual-reporting-and-publications/annual-report-archive/). Determining the ideal point to discontinue mechanical ventilation is an important question, since both extending and prematurely withdrawing ventilatory support can lead to adverse outcomes [27]. Prolonged mechanical ventilation is associated with a large number of complications, including risk of ventilator-associated pneumonia, muscle atrophy, as well as other physiological and psychological complications [28], while premature extubation is associated with failure as described below [29].

Patient readiness for extubation is often tested at the bedside using spontaneous breathing trials, during which ventilator support is reduced or removed for a short space of time, and patient breathing effort is observed without removal of the breathing tube. However, these trials are not fail-safe, and there is a risk that the breathing tube may need to be reintroduced (re-intubation) following extubation. Reducing the need for an emergency re-intubation is important, since the procedure carries risk of adverse events, even in a critical care environment. When there is uncertainty around the success of extubation in adults, a tracheostomy is commonly performed, whereby the breathing tube is placed through the front of the neck, and the patient is given longer (periods of days to weeks) before this is reversed following rehabilitation. In paediatric populations such as those considered in this study, tracheostomies are performed infrequently because of anatomical constraints and higher risk of complications. As a result, extubation may be undertaken even in the face of clinical uncertainty. In 5–15% of cases, extubation can be unsuccessful and the breathing tube may need to be reintroduced, resulting in what is called an ‘extubation failure’ [30, 31]. This can be relatively immediate, or may become apparent over a period of hours. Extubation failure is associated with generally poorer outcomes including higher mortality and increased disease severity [32]. Detection of this ‘extubation failure’ may be useful to prepare for re-intubation, reducing the risk from an emergency procedure. Even post extubation, it is important to predict failure quickly, since a longer time to reintubation is associated with higher mortality [33].

A number of attempts have been made to predict extubation failure and re-intubation using statistical and machine learning methods. These include logistic regression, support vector machine, random forest, gradient boosting methods and neural networks. These methods showed varying degrees of success (AUC between 0.7 and 0.85) depending on the specifics of the algorithm and the dataset that was used for prediction [3438]. Most of these methods used a feature set consisting of measurements such as the length of ICU stay, pre-existing comorbidities, Glasgow Coma Scale score and averages of different vital signs [39].

A smaller number of studies have investigated extubation failure by studying the dynamics of vital signs time series. White et. al. demonstrated the association between lower interbreath interval complexity and extubation failure [40]. Seely et. al. showed that the heart and respiratory rate variability recorded prior to extubation was significantly associated with extubation outcome [41]. Keim-Malpass et. al. investigated how adding variables based on the dynamics of vital signs resulted in a significant increase in the predictive capacity of a standard model based solely on static measures [42]. A number of recent studies have also attempted to predict extubation outcome using various recurrent neural networks architectures, with varying degrees of success [4345]. It has also been shown previously that slow recovery to baseline levels of minute ventilation following successful spontaneous breathing trials is a risk factor for failure of extubation [46, 47]. Such slow recovery to baseline levels aligns with what would be observed as a result of CSD prior to critical transitions in nonlinear dynamical systems [13].

In this study, which was conducted on a sample of paediatric patients, we hypothesized that patients who failed extubations were approaching a tipping point which was preceded by CSD, which can be measured by increased autocorrelation and variance. This implies that prior to reintubation, an increased similarity between consecutive measurements and greater variability can be observed in the time series of physiological variables. Hence as patients approached re-intubation, the presence of CSD would result in their vital signs becoming more correlated and fluctuating more over time. We explored whether the presence of CSD could be established using EWS which are observed in patients in intensive care units (ICUs) post extubation and prior to re-intubation, where the measured patient responses were not masked by ventilator dynamics. Patients who did not undergo an extubation failure were also analysed for comparison, since no CSD was expected in these cases.

Prior to the study, the analysis plan was pre-registered based on analysis conducted on a single patient (ID PC0192.1). This patient failed the extubation and required re-intubation around 20 hours later. The variance and autocorrelation were calculated for three physiological variables (heart rate, respiration rate and mean blood pressure). In this case, 5 out of the 6 tests conducted produced significant results and provided evidence against the null hypothesis that CSD is not an early warning sign of critical transitions. Complete details of this investigation can be found in the pre-registration document [48]. All three variables used in the analysis have been linked to extubation outcome in the past [4952].

2 Methods

2.1 Ethics statement

All data used were previously collected during the period from 01/01/2016 to 31/12/2018 as part of standard clinical care, in the Great Ormond Street Hospital, London. No additional data were collected for the purposes of this work, or any wider research work. All data were fully de-identified. Therefore, the need for individual patient consent was waived under GAfREC 2018 and as a result is exempt from NHS REC approval. This was approved by the institutional joint research office (Great Ormond Street Hospital ref 19HL03) and the NHS Health Research Authority (IRAS 259636 received on 24/01/2019). The dataset was accessed for the present study between 16/08/2022 and 27/02/2023.

2.2 Background

CSD is a phenomenon which occurs in any closed continuous model which is approaching certain types of bifurcations, such as fold bifurcations [1]. We show this in an arbitrary dynamical system approaching a fold bifurcation. Its linearized dynamics is given by (1) where x is the perturbation from equilibrium, λ is the eigenvalue, σ is the noise amplitude and ϵ is a white noise process. This represents an Ornstein-Uhlenbeck process, the variance and autocorrelation of which are defined as (2)

For a fold bifurcation, λ → 0, which implies that the variance approaches ∞ and ACF approaches 1 [4, 53].

2.3 Data

The dataset consisted of ICU visits of all children admitted to the Great Ormond Street Hospital (GOSH) between 2016 and 2018. In the current work, time series of the heart rate, respiratory rate and mean blood pressure sampled every 5 seconds were used for analysis. The data for each patient also consisted of the sex, age, timestamp of extubation attempts, a timestamp for time of re-intubation, a timestamp for time of death, flags for failed extubation and death, and the ward they were admitted to [54]. The times of extubation and re-intubation were generally determined from gaps in the end tidal CO2 measurements. Missing data in the end tidal CO2, closest to the recorded times of extubation and re-intubation were chosen as the actual times of extubation/re-intubation. When large discrepancies were observed between the two, a clinician examined the patient records for determining a likely time of extubation, failing which the data was discarded.

The intensive care units at Great Ormond Street Hospital are divided into three distinct areas: the neonatal ICU (NICU), where premature children and those with congenital conditions are admitted; the cardiac ICU (CICU) where children with heart disease, primarily post-cardiac surgery are admitted, and the general ICU (PICU), where all other children from ages 0–18 are admitted. A flow chart describing the separation of data from the ICU into cohorts, prior to the stratification and cleaning described in the following sections, is shown in Fig 1. The characteristics of this dataset are listed in Table 1. The dataset was prepared and used for a prior data study group with the Alan Turing Institute, UK conducted in 2021. During the data study group attempts were made at predicting extubation failure using three machine learning methods, namely time series forest, random interval spectral ensemble and random forest. None of the methods used performed better than a naive baseline classifier [54]. All data were fully de-identified.

thumbnail
Fig 1. Flowchart depicting the steps involved in the partitioning of patient records into cohorts.

High frequency monitor data is extracted between extubation and re-intubation times for cohorts 1 and 3 and between extubation and pseudo-reintubation times for cohorts 2 and 4. The numbers mentioned for each of the cohorts is before age based stratification and before accounting for data losses described in S1 File. SpO2: Oxygen saturation; Et CO2: End-tidal CO2; CVP: Central Venous Pressure; Temp: Temperature.

https://doi.org/10.1371/journal.pone.0317211.g001

thumbnail
Table 1. The characteristics of the dataset prior to pre-processing.

https://doi.org/10.1371/journal.pone.0317211.t001

2.4 Pre-processing of the data

The given data set was first split into four different cohorts depending on whether the patient underwent re-intubation and/or died during a particular visit. The criteria for each cohort is as follows:

  1. Cohort 1: Failed extubation and were re-intubated and did not die
  2. Cohort 2: Successful extubation and were not re-intubated and did not die
  3. Cohort 3: Extubated and were not re-intubated and died
  4. Cohort 4: Extubated and were re-intubated and died after re-intubation (This group represented those in whom intensive care support was being withdrawn)

The main analysis was carried out between cohort 1 who were hypothesised to show signs of CSD and cohort 2 who were acting as a control group since it is assumed they do not reach a critical tipping point post-extubation. The same process was carried out on cohort 3 and 4 however in these cases the results are considered separately since there may be other significant biological factors affecting the dynamics of these patients. As seen in Fig 1, cohorts 1, 2, 3 and 4 have sizes of 184, 2479, 333 and 38 respectively, prior to pre-processing. For patients who did not die in care, this corresponds to a failure rate of about 6.9%.

In order to be able to make an unbiased comparison between cohort 2, the covariates of age, sex (M/F/Unknown) and ICU ward type (paediatric-PICU/neonatal-NICU/cardiac-CICU) were studied first. Fig 2 shows the distribution of both cohorts across these covariates. It is clear that the proportion of male and female instances are almost identical so this did not require stratification. Age shows more variation and since it is a key factor in determining the dynamics of the physiological data, it was stratified over. ICU ward type also shows a significant amount of variation, and therefore the testing was conducted on the entire cohorts as well the cohorts split by ICU ward type and the differences are discussed in the results section. Stratification was not carried out for cohort 3 or 4, since these cohorts were much smaller, and any stratification would have resulted in significant loss of data.

thumbnail
Fig 2. Comparison of age, sex and ICU ward type of cohort 1 and cohort 2.

https://doi.org/10.1371/journal.pone.0317211.g002

Moreover, since the calculation of the quantifiers use windows of 60 minutes of data (see section 2.5), any ICU visit where the duration between the failed extubation and critical point (either re-intubation or death) is less than 120 minutes was removed. This is so that the window size is not greater than half of the length of the time series, which would fail to pick up nuances in the data and provide inaccurate estimations [55].

In order to test the hypothesis, each instance of the control group (cohort 2) was randomly assigned a pseudo re-intubation time which was decided such that the distributions of the duration of extubated intervals are the same over cohort 1 and cohort 2. The mean and standard deviations of the measures used for analysis after pre-processing are presented in S1 File. Significant differences are seen between Cohorts 1 and 2 for all three measures, in line with previous literature [4952].

Missing data was handled through a two step process. When the gap in observations were longer than 10 time steps, the time series was concatenated. For gaps smaller than this, the missing part was replaced with the mean of the end points of the gap. Subsequently the long term trend was identified using a Gaussian filter and subtracted from the original time series to generate the residual time series.

The number of records available for analysis was reduced significantly, since for many records the time difference between extubation and re-intubation was less than 120 minutes. More records were lost after preprocessing and during the data analysis. Different numbers of patient records were available for the different physiological variables considered. The number of records lost and the causes for the same are detailed in S1 File.

Subsequently, the time series were detrended by applying Gaussian smoothing and removing the smoothed data from the original data. Such detrending is shown to reduce false positives in detecting early warning signals from data [56]. Outliers outside μ ± 3σ were discarded and replaced with μ, where μ is the mean and σ is the standard deviation.

2.5 Hypotheses

This analysis aims to investigate whether CSD can be observed post-extubation and prior to a patient being re-intubated. For the reasons discussed in Section 2.1, the following quantifiers will be studied:

  1. Variance, σ2—The presence of a period of CSD would cause an increase in variance as time approached the critical transition point. This would be due to a reduced resilience to external perturbations [57]. For a patient who did not require re-intubation, a significant increase in the variance is not expected (Note that there are instances of patients who experience a critical transition in their health but in the end did not require re-intubation, see Section 4 for further details.).
  2. Autocorrelation, r1—The presence of a period of CSD would also cause an increase in the lag-1 autocorrelation of the time series data. For a patient who did not require re-intubation, a significant increase in the autocorrelation is not expected.

Formally, the null and alternative hypotheses can be expressed as follows:

  1. H0: There is no period of CSD leading up to a critical tipping point
  2. H1: There is a period of CSD leading up to a critical tipping point

2.6 Statistical model

The method consisted of analysing the variance and autocorrelation of the heart rate (beats per minute/BPM), respiration rate (breaths per minute/BrPM) and mean blood pressure (millimeters of Mercury/mmHg).

For each of the heart rate, respiration rate and blood pressure time series, the variance and autocorrelation were calculated on sequential windows of data, slid forward by 1 minute. For a time series X = {x1, x2, …, xn}, a window of size m and starting time t, would be given as X{t,m} = {xt, xt+1, …, xt+m}. The variance and autocorrelation of each window would then be given by (3) (4) The resulting σ2,t and form a secondary time series, as the start point of the window, t, is slid forward, which shows how these measures change over time. There is a trade-off for the choice of window size in this study, since a window that is too large will result in significant loss in data and a window that is too small will be dominated by short-term fluctuations in the time series [58, 59]. As a trade-off, primary analysis was conducted using a window size of 60 minutes and variations observed with smaller window sizes of 30 and 15 minutes were considered in an exploratory analysis and reported in S3 File.

For each of these, the modified Mann-Kendall [60] test was then carried out which calculates the Kendall-Tau correlation coefficient while also applying a correction to correct for the dependence between overlapping windows [61, 62]. This resulted in a correlation coefficient and test statistic for both the variance and autocorrelation of each physiological measure (6 combinations in total). A Holm-Bonferroni hypothesis test was conducted to correct for multiple tests at a significance of α = 0.05. Since the application of CSD to detect the need for reintubation is a novel approach, it is important to minimize the risk of missing clinically relevant changes. Hence the most commonly used α level of.05 was chosen over stricter threshold, while also correcting for multiple testing using the Holm-Bonferroni approach.

The proportions of instances which gave significant results was then calculated for each cohort, and a z-test was conducted to check whether these proportions were significantly different between the cohorts. The predictive capacity of the model was calculated using the performance metrics, positive predictive value (PPV), negative predictive value (NPV), sensitivity and specificity.

Welch’s t-tests were also conducted on the corresponding distributions of Kendall-τ correlation coefficients for cohort 1 and 2 in order to explore the likelihood of these samples have the same mean.

Both the z test and the t-test were conducted to establish if Kendall-τ correlation coefficients could be used to distinguish the cohorts from each other. These tests were conducted on the results of the cohorts as a whole, as well as separating them by ICU ward type (PICU/CICU/NICU). All tests were conducted on Python 3.5, using the packages numpy, scipy and pymannkendall [60, 63, 64].

3 Results

In the following sections, we present the results of the statistical analyses conducted, including z-tests for significant increases in EWS, t-tests for comparing the magnitudes of increase, and a series of exploratory analyses to further investigate the trends in the data. The final number of files analyzed per ICU ward, cohort, and variable is shown in Table 2. This results in failure rates ranging from 5.4% to 6.4%, depending on the variable used without subdivision into wards, and from 3.9% to 14.8% when considering wards separately.

thumbnail
Table 2. Final number of files used for analysis each variable, cohort and ward type.

https://doi.org/10.1371/journal.pone.0317211.t002

3.1 Proportions of significant results

After conducting the Mann-Kendall hypothesis test as outlined in Section 2.5, the proportion of significant results (instances which showed an increase in variance or autocorrelation over time) for each measure were calculated and compared between cohort 1 and the control group, cohort 2. Fig 3 and Table 3 show these results and how they are split among the three ICU ward types.

thumbnail
Fig 3. Proportion of instances which showed significant increases in variance and autocorrelation respectively over the extubated period after conducting the Mann-Kendall hypothesis tests using the Holm-Bonferroni method.

Each measure of each cohort is also split by ICU ward type.

https://doi.org/10.1371/journal.pone.0317211.g003

thumbnail
Table 3. The proportions of significant Mann-Kendall hypothesis tests conducted on cohorts 1 and 2.

https://doi.org/10.1371/journal.pone.0317211.t003

When the entire cohorts are considered as a whole rather than split by ICU ward type, cohort 1 has a statistically significant higher proportion of instances showing an increase in autocorrelation of the heart rate. This provides evidence to reject the null hypothesis and suggests that CSD is observed before a critical tipping point (which is taken to be the point of re-intubation) for this measure. In no other measure was there sufficient evidence to reject the null hypothesis at the α level considered. However, in 5 of 6 cases, a higher proportion of trends were observed in cohort 1 than in cohort 2.

3.2 Distributions of correlation coefficients

The Kendall-τ correlation coefficients of each instance signal the strength of trends in variance and autocorrelation over time. Hence, a greater positive value of τ suggests a stronger presence of CSD. Fig 4 shows the distributions of these correlation coefficients for each combination of measures. We see that these distributions are roughly Gaussian, especially for cohort 2 where sample sizes are around 1000, and much larger than those of cohorts 1 which vary between 59 and 101.

thumbnail
Fig 4. Histograms showing distributions of the Kendall-Tau correlation coefficients of each time series, grouped by physiological measure (heart rate, respiration rate, mean blood pressure) and quantifier (variance, autocorrelation).

This is a comparison of the correlation coefficients of cohort 1 and 2.

https://doi.org/10.1371/journal.pone.0317211.g004

For each of these sets of τ values, a Welch’s t-test was conducted between cohort 1 and cohort 2 to determine whether the means of their distributions were significantly different, i.e, assuming that cohort 1 and 2 follow distributions of and respectively, the following hypotheses were tested:

The means, standard errors and p-values corresponding to each of these tests are shown in Table 4 with the significant results highlighted. This analyses were conducted on the entire cohorts as well as after splitting by ICU ward type. We see that only 1 out of 6 tests on the whole cohorts show significant results, and 3 of 18 when split over the various ICU ward types.

thumbnail
Table 4. Means and standard errors of Kendall-Tau correlation coefficients of cohorts 1 and 2, as well as t-statistics and p-values when Welch’s T-Test is conducted.

https://doi.org/10.1371/journal.pone.0317211.t004

The only significant test was again associated with the autocorrelation of the heart rate. This was the only measure that had a significant result in the z-test conducted in Section 3.1. Therefore, within this dataset, we see that the autocorrelation of the heart rate is the best indicator of CSD.

Overall, these results do not provide evidence to suggest that CSD may be observed in patients who are approaching a transition in their physiology, except in the autocorrelation of the heart rate.

3.3 Exploratory analyses

In this section, we describe the exploratory analyses that were not pre-registered as part of our main analyses. We list the broad results of our exploratory analyses here, and the details are given in supplementary material.

The first of these was to analyse the results of cohort 3 with cohort 2. Cohort 3 consisted of individuals who were extubated and subsequently died in care. The analysis was conducted for the whole time series from the point of extubation until death. None of the 6 measures showed significant differences in the proportion of significant trends between the two cohorts when they were considered as a whole, as well as when subdivided according to ICU ward.

A similar analysis was also conducted with Cohort 4, which consisted of a very small number of individuals who were re-intubated and still died in care. In this case, 1 out of 6 variables showed significant results, namely the autocorrelations of the mean blood pressures. However, due to the small size of the cohort, these results should be viewed with caution.

We also repeated the analysis described in Section 2 with smaller window sizes of 15 and 30 minutes. We see an increase in the number of false positives in almost all quantifiers. While no significant results were seen for the 30 minute windows, the proportion of the autocorrelation of the heart rate for cohort 1 was significantly higher than for cohort 2, when using 15 minute windows. This higher proportion was also observed when data from the CICU alone was considered.

Finally, we also explored the oxygen saturation (SpO2) time series for significant increases in autocorrelation and variance using hour long windows, for cohorts 1 and 2. While we see no significant differences between the two cohorts when taken together or split by ICU type, we do see that a Welch t-test showed significant differences between the distribution of correlation coefficients between the two groups, for both autocorrelation and variance of the SpO2. This difference also persisted when the PICU cases were considered alone. The autocorrelation of SpO2 was also significantly different between the two groups when the CICU cases were considered alone.

3.4 Model performance

The confusion values and various performance metrics of the measured EWS are shown in Table 5. The metrics used to evaluate the predictive capabilities of the model are Positive Predictive Value (PPV), Negative Predictive Value (NPV), Sensitivity, Specificity, Balanced Accurancy (AccB) and Area Under the Receiver Operator Characteristics (ROC) Curve (AUC). All quantifiers have values that vary between 0 and 1, with 1 indicating ideal classification performance. The PPV, NPV, Sensitivity and Specificity are affected by class imbalance, whereas AccB and AUC are robust to this. Values close to 0.5 for the latter indicate classification no better than chance.

thumbnail
Table 5. Confusion values, PPV, NPV, sensitivity and specificity of the model for cohorts 1 and 2.

https://doi.org/10.1371/journal.pone.0317211.t005

In this study, since we are interested in only predictions due to positive trends, the ROC curve was generated by varying the threshold of the Kendall τ coefficient, while setting negative τ to a minimum negative value (effectively excluding them).

The values close to 0.5 of AccB and AUC indicate that the model has limited predictive power. The low sensitivity values suggests that increasing variance and autocorrelation are not effective signals for predicting the need for re-intubation for patients in ICUs. The low values of PPV and high values for NPV in the study partially arise from the differences in sizes between cohorts 1 and 2. Since a significant proportion of the much larger Cohort 2 is incorrectly flagged as a positive, the chances of a positively flagged test actually being a true positive are incredibly small, as seen in the PPV.

Finally, we also show the results of logistic regression models when using multiple variables as predictors. In these models, the extubation outcome is the dependent variable, and the values of the Kendall correlation coefficients are used as the independent variables. A weight factor of 10:1 was taken to consider the relative prevalence of successful extubations relative to extubation failures. An additional model incorporating the mean values of the heart rate, respiratory rate, and blood pressure was also used for comparison. The logistic regression models used in the analysis are defined as follows.

  1. ExtOutcome
  2. ExtOutcome
  3. ExtOutcome
  4. ExtOutcome
  5. ExtOutcome ∼ HRmean + RRmean + ABPmean
  6. ExtOutcome ∼ All Variables

Here and represents the magnitude of the Kendall correlation coefficient for the autocorrelation and variance for each variable (HR, RR and ABP). The full results of the logistic regression models, including the logs of the odds (estimate), standard errors, z-values, p-values, degrees of freedom (N), Akaike information criterion (AIC) and R2 values are summarized in Table 6. The model incorporating all variables showed a higher R2 and lower AIC as compared to each of the individual models. An ANOVA showed that model 6 significantly differed from model 4 and 5 (model 4 and 6: Deviance: 162.64 p-value:<0.001; model 5 and 6: Deviance: 29.894; p-value<0.001). Since model 4 that combines the HR, RR and ABP variables performed better than models 1,2 and 3 individually, a one-sided t-test was conducted to check if Cohort 1 had a higher strength of positive transitions across variables (defined as the sum of positive τ for all variables) as compared to Cohort 2, yielding significant results (t-statistic = 2.056, p-value = 0.022). Moreover, the AUC of the combined strength of positive transitions was 0.597, higher than the AUC for the individual variables. The logistic regression was conducted using the glm function in R version 4.3.1.

thumbnail
Table 6. Logistic regression models using the Kendall -τ coefficients as predictors.

https://doi.org/10.1371/journal.pone.0317211.t006

4 Discussion

4.1 Summary and significance

In this paper, we explored whether paediatric patients who fail extubation in the ICU are approaching a critical transition in their physiology. We studied this by checking for signs of CSD in the dynamics of their heart rate, respiratory rate, and blood pressure. We tested these variables for increases in variance and autocorrelation over time. The proportion of significant increases was calculated and compared to the corresponding proportion in a stratified sample of individuals who did not fail extubation. We found that 1 out of the 6 tests, namely the autocorrelation of the heart rate showed a significantly larger proportion of positive tests in the individuals who failed extubation. The autocorrelation of the heart rate also showed a significantly higher magnitude of increase over all patients in Cohort 1 over time, as compared to Cohort 2. When calculating the model performance using metrics such as sensitivity and specificity, we observe low sensitivities and positive predictive values, suggesting limited clinical utility in using these metrics by themselves for predicting an approaching transition. Similar analyses in Cohort 3 yielded worse results with none of the proportion of trends in Cohort 3 being significantly larger than Cohort 2. While Cohort 4 showed 1 out of 6 results to have significantly larger proportions than Cohort 2, the small sample sizes make these results more suspect. In our exploratory analysis, we observe that when a smaller window size of 15 minutes was used, the autocorrelation of the heart rate showed significantly larger proportion of positive tests in the individuals who failed extubation. This effect, however, was not observed when 30 minute windows were used instead. We also observed that the SpO2 time series showed a significantly higher value for the magnitude of increase in auto-correlation and variance in individuals who failed extubation, as compared to those who did not. Finally, we also observe in a logistic regression model that using the values of correlation coefficients leads to a significantly better model than one that uses only the mean values of the measures.

One of the interesting features of the study is the distributions of the Kendall τ coefficients in Fig 4. Apart from strong false positives and negatives, we also see a number of negative trends in both cohorts 1 and 2. There is some evidence that such negative trends could occur in autocorrelation in systems exhibiting flickering prior to transitions or in both autocorrelation and variance as a consequence of processes that exhibit critical speeding up [6567]. The variance is also known to decrease when there is insufficient data and the fluctuations in the system are dominated by low frequencies or when the data is highly noisy [68]. In complex biological data, such as this both issues related to data, as well as competing processes with opposite effects could lead to such negative trends.

While our analysis detected some statistically significant differences, notably in the autocorrelation of heart rate, low values for performance metrics indicate limited utility for prediction. Taken together, our findings indicate that CSD based metrics alone are not sufficient to predict extubation outcome and may require either substantial methodological refinement or combining with other complimentary metrics to yield acceptable predictability.

Our exploration is an important step in various directions. To the best of our knowledge, it is the first study that explores CSD in the dynamics of critically ill patients. It also adds to the growing literature on using CSD and EWS to study transitions in medicine [12, 19, 69]. Moreover, it is among the first studies that use dynamical systems theory to study problems in critical care. It also identifies variables and quantifiers that could aid in enhancing the prediction of extubation outcome in the future. It uses a data agnostic approach, with the understanding that the presence of CSD can offer information to add to the expert knowledge of a clinician making a decision. The significance of these findings extends to addressing challenges posed by public health crises such as the COVID-19 pandemic, where the demand for ventilators exceeded supply.

4.2 Relation to existing literature

While our work uses a novel technique different from any previous explorations on critical care data, our observations conform with the results in the literature. For instance, in neonates, Goel et. al. observed that the heart rate characteristics index, a score that incorporates the variability, asymmetry, and entropy of ECG data, was correlated to extubation outcome and enhanced predictability in regression models [51] Miu et. al. observed that for the first 60 minutes after reintubation, the SpO2 significantly differed between patients requiring reintubation and those who did not. They also observed significantly different mean heart rate, SpO2, and breathing frequency between the extubation failure and success group leading up to extubation [35].

4.3 Reasons for low model performance

While the results do not conclusively detect CSD in critically ill patients, they do show an effect in the expected direction in almost all cases. However, sensitivity is generally low, with even the best performing indicators detecting less than a third of extubation failures, implying that a significant number of extubation failures could go undetected. In the context of critical care, missed detections are more crucial than false detections, since the former can prevent timely interventions. Hence it may be more important for future studies to improve sensitivity, even at the cost of specificity.

A major reason for the observed low model performance could be that the timing of re-intubation is subjective. Hence, clinicians may be conservative in estimating when a person should be re-intubated and could lead to re-intubation before signs of CSD are strong enough to detect. Consequently, even if patients were approaching a critical transition point, the strength of EWS they would display and the time at which they would appear, would vary considerably. Moreover, this study assumes that the time scale at which the dynamics evolves for all three signals considered are the same, an assumption which is not true.

Studies that examine CSD in medical data are limited by the definition of what constitutes a positive. In many cases, as mentioned above, the decision to re-intubate may have been taken before signs of CSD become prominent. These would be treated as false negatives in our study, but may not be so in reality. In other cases, CSD may have happened in reality, since the patient was approaching a transition point, but the system could have steadied itself due to internal regulatory mechanisms of the body. Moreover, a number of potential confounding factors could influence the dynamics of the vital signs considered in this work. These include influences from drug based interventions which could alter heart rate and blood pressure, as well as the patient’s fluid balance which relates to respiratory and cardiovascular stability [70, 71]. These cases would not be false positives in the correct sense, but would be considered as such in our study [4, 72]. An example of this can be observed in the case of patient FR3426 who showed significant increases in 5 out of 6 of the measures considered. As Fig 5 shows, the patient showed a major change in physiology at around 06:00 on the 29th, even though this did not lead to subsequent re-intubation or death. This is made clear by the steep increase in variance which begins at this time. This would be an example of a false positive in our study, which may not be so in reality.

thumbnail
Fig 5. Raw data (in purple) of the heart rate, respiration rate and mean blood pressure, as well as the variance (in green) calculated over a moving window, for the first visit of patient FR3426 between the time of extubation and pseudo re-intubation.

https://doi.org/10.1371/journal.pone.0317211.g005

Although increasing variance and autocorrelation have analytically been proven to be signals of CSD, these have often been difficult to observe in complex biological systems [69, 73, 74]. Several factors may contribute to this. For instance, the system itself may not be undergoing a bifurcation that is expected to be theoretically preceded by CSD [75]or the variables under consideration may not be in the direction of slowest recovery, where EWS are most prominent [7678]. Moreover, competing processes could lead to critical speeding up of the dynamics or flickering of the transition which could mask detection of EWS in the system [65, 67, 69].

Beyond these theoretical considerations, practical issues such as under-sampling, insufficient data and time scale of occurrence could lead to false negatives in real time series [7981]. False positives on the other hand can occur due to serial correlations and time dependent trends [56, 61]. While detrending can remove long term trends, too small a bandwidth removes important trends, and increases false negatives whereas too long a bandwidth may not remove all irrelevant trends and consequently increase false positives [82]. It has been suggested that in systems where naturally strong fluctuations exist in the system response, other early warning signals that quantify flickering could be more useful in detecting transitions [13, 66]. Metrics derived from detrended fluctuation analysis and significance testing using bootstrapping methods have both shown potential as robust approaches for detecting CSD in real-world scenarios [4, 12, 58].

In our study we attempted to overcome these difficulties by systematically examining and overcoming them in a single test case [48]. However the largely negative results of our study on the entire dataset seems to suggest that either the issues identified on the single dataset did not generalize to the whole system. To avoid this, future studies in such complex biological datasets may benefit from using a small, representative subset (perhaps 5%) of the dataset to optimize parameters for identifying CSD.

4.4 Limitations

While the present exploration is unique in many ways, it represents a preliminary approach towards using dynamical systems to study extubation failure. Although the high sampling frequency of the vital signs time series used in this work is rare in studies on extubation failure, the sample size of the data is comparatively small. This is particularly true in comparison to the MIMIC datasets which have been used extensively to study extubation failure [83, 84]. Furthermore, we use only data from a single paediatric hospital in London, and our results may not directly translate to an adult population.

As mentioned above, a major limitation in studies on extubation is in identifying true physiological transition points. In clinical practice, the decision to re-intubate is made preemptively in anticipation of potential instability, rather than waiting for definitive evidence of a transition. As a result transition points in this study may not indicate a change in the dynamics of the system, unlike in other fields where EWS have been detected. This study uses the timing of re-intubation as an approximate transition point, which may often be far from the true transition point.

4.5 Future directions

We have a few suggestions for future exploration based on the results of our study. We have largely null results for most of our exploration, except for the autocorrelation of the heart rate. While reasons for this have been discussed in the paragraphs above, one other possible avenue to explore could be analysis of waveform data from the ICU. The time series used presently are quantities derived from waveform data like ECG or breathing waveform, and as such the nonlinear dynamics could be obscured in these derived quantifiers.

To overcome the limitation of the subjectivity associated with transition points, future studies could focus on instances where emergency reintubation was required. In such cases, the actual point of reintubation may be much closer to the transition point than in a general sample. Another possibility could be to quantify markers of instability, such as deviations in physiological baselines using methods such as change point analysis. Only reintubations that also include significant quantitative change would then be checked for early warning signals, as done in studies on CSD in other clinical contexts [85].

A significant outcome of the present study is the improvement seen by using CSD variables in a simple logistic regression model. At the group level, the combination of CSD variables showed lower AIC and higher R2 than models using individual CSD variables. This difference persisted even when the degrees of freedom were kept the same. This indicated that a combination of variables could perform better than individual CSD variables to detect extubations. Moreover Cohort 1 showed a higher strength of positive transitions across variables as compared to Cohort 2, and a higher predictive capacity (measured using AUC) as compared to the individual variables. This indicates that using a combination of EWS at an individual level could reduce misdetection and increase sensitivity in the sample. Setting an optimal threshold for this combined strength of positive transitions could increase sensitivity even at the cost of specificity, which may be desirable in clinical contexts. We also see that adding CSD metrics to the mean levels of the variables in the logistic regression model also showed a significant improvement over a model without them. At an individual level this could indicate that the mean and CSD metrics calculated over a sliding window could predict extubation outcome better than using either of them alone. Going even further, while CSD metrics on their own may not be a reliable predictor of extubation outcome, they could constitute novel features in machine learning based prediction models and enhance their predictive power [42]. In a real time setting, binary outcomes could be replaced with a continuous measure that gives the probability of failure, which could aid in clinical decision support [86]. In the context of the CSD, this could be the Kendall τ values, or the deviations from baseline for a sliding window mean, or prediction probabilities for a machine learning algorithm.

Another suggested path for future exploration is to study differences in patients who exhibit CSD in their dynamics from those who do not. Such differences could include medical comorbidities such as asthma, heart arrhythmia or pregnancy stressors in neonatal admissions. Beyond extubation, testing CSD prior to more objective markers where the intervention point is not arbitrary, such as cardiac arrests, is likely to be a fruitful direction for further research. While detecting CSD in extubated patients is an important first step, it is important to conduct an extensive analysis to find how far in time before reintubation CSD can be detected, since a longer time to reintubation is associated with higher mortality [33]. In fact, it would be even more clinically relevant to test whether slowing down exists leading up to extubation in patients who are likely to fail extubation in future.

4.6 Conclusions

Predicting extubation outcome is an important question that has been studied extensively in the past, with mixed results. The results of the present study found limited evidence to support the use of CSD based metrics to predict extubation outcome, with only the autocorrelation of the heart rate showing significant differences between the groups. These largely null results highlight the challenges of detecting critical slowing down in complex biological systems in general, and more so in critical care, where confounding factors often obscure potential early warning signals. Future studies should be wary of these challenges and should carefully consider whether investigating CSD in clinical contexts is likely to yield meaningful benefits. Our findings point towards the need for a marked change in the way CSD based metrics are currently used for prediction in the context of medicine. For instance, combining CSD based metrics along with other indicators of extubation failure, or using them as one of many features in a machine learning algorithm could lead to improved prediction. This could perhaps be the way for future researchers looking to use CSD metrics or other measures from dynamical systems theory for prediction in the ICU.

Supporting information

S1 File. Characteristics of the dataset.

In this supplementary we discuss the reasons for missing records in the data and present the results of how the mean characteristics of the data vary between Cohorts 1 and 2.

https://doi.org/10.1371/journal.pone.0317211.s001

(PDF)

S2 File. Comparisons of cohorts 3 and 4 with control.

In this supplementary we explore how the proportions of records that show critical slowing down in cohorts 3 and 4 vary from cohort 2.

https://doi.org/10.1371/journal.pone.0317211.s002

(PDF)

S3 File. Variation with smaller window sizes.

In this supplementary we explore how the results vary when using smaller window sizes of 15 and 30 minutes.

https://doi.org/10.1371/journal.pone.0317211.s003

(PDF)

S4 File. Critical slowing down in oxygen saturation data.

In this supplementary we explore whether critical slowing down can be detected in oxygen saturation data, leading up to extubation failure.

https://doi.org/10.1371/journal.pone.0317211.s004

(PDF)

Acknowledgments

We acknowledge John Booth, Thalitha Grossman and Yael Feinstein for their work in helping to prepare and validate the data set used in this study. We gratefully acknowledge comments from an anonymous reviewer on this manuscript which has helped improve the discussion section considerably.

References

  1. 1. Scheffer M, Bascompte J, Brock WA, Brovkin V, Carpenter SR, Dakos V, et al. Early-warning signals for critical transitions. Nature. 2009;461(7260):53–59. pmid:19727193
  2. 2. Tredicce JR, Lippi GL, Mandel P, Charasse B, Chevalier A, Picqué B. Critical slowing down at a bifurcation. American Journal of Physics. 2004;72(6):799–809.
  3. 3. Scheffer M, Carpenter SR, Lenton TM, Bascompte J, Brock W, Dakos V, et al. Anticipating critical transitions. Science. 2012;338(6105):344–348. pmid:23087241
  4. 4. George SV, Kachhara S, Ambika G. Early warning signals for critical transitions in complex systems. Physica Scripta. 2023;98(7):072002.
  5. 5. Dakos V, Carpenter SR, van Nes EH, Scheffer M. Resilience indicators: Prospects and limitations for early warnings of regime shifts. Philosophical Transactions of the Royal Society B: Biological Sciences. 2015;370(1659):20130263.
  6. 6. Prettyman J, Kuna T, Livina V. A novel scaling indicator of early warning signals helps anticipate tropical cyclones. EPL (Europhysics Letters). 2018;121(1):10002.
  7. 7. Litzow MA, Hunsicker ME. Early warning signals, nonlinearity, and signs of hysteresis in real ecosystems. Ecosphere. 2016;7(12).
  8. 8. George SV, Kachhara S, Misra R, Ambika G. Early warning signals indicate a critical transition in Betelgeuse. Astronomy & Astrophysics. 2020;640:L21.
  9. 9. Tan J, Cheong SA. The regime shift associated with the 2004–2008 US housing market bubble. PloS one. 2016;11(9):e0162140. pmid:27583633
  10. 10. Meisel C, Klaus A, Kuehn C, Plenz D. Critical slowing down governs the transition to neuron spiking. PLoS computational biology. 2015;11(2):e1004097. pmid:25706912
  11. 11. Freestone DR, Karoly PJ, Peterson AD, Kuhlmann L, Lai A, Goodarzy F, et al. Seizure prediction: Science fiction or soon to become reality? Current Neurology and Neuroscience Reports. 2015;15(11). pmid:26404726
  12. 12. Nannes B, Quax R, Ashikaga H, Hocini M, Dubois R, Bernus O, et al. Early signs of critical slowing down in heart surface electrograms of ventricular fibrillation victims. Lecture Notes in Computer Science. 2020; p. 334–347.
  13. 13. Olde Rikkert MG, Dakos V, Buchman TG, Boer Rd, Glass L, Cramer AO, et al. Slowing down of recovery as generic risk marker for acute severity transitions in chronic diseases. Critical Care Medicine. 2016;44(3):601–606. pmid:26765499
  14. 14. Lagro J, Laurenssen NC, Schalk BW, Schoon Y, Claassen JA, Rikkert MGO. Diastolic blood pressure drop after standing as a clinical sign for increased mortality in older falls clinic patients. Journal of hypertension. 2012;30(6):1195–1202. pmid:22473020
  15. 15. Lagro J, Schoon Y, Heerts I, Meel-van den Abeelen AS, Schalk B, Wieling W, et al. Impaired systolic blood pressure recovery directly after standing predicts mortality in older falls clinic patients. Journals of Gerontology Series A: Biomedical Sciences and Medical Sciences. 2014;69(4):471–478. pmid:23873962
  16. 16. Le VV, Mitiku T, Sungar G, Myers J, Froelicher V. The blood pressure response to dynamic exercise testing: a systematic review. Progress in cardiovascular diseases. 2008;51(2):135–160. pmid:18774013
  17. 17. Jirsa VK, Stacey WC, Quilichini PP, Ivanov AI, Bernard C. On the nature of seizure dynamics. Brain. 2014;137(8):2210–2230. pmid:24919973
  18. 18. Chang WC, Kudlacek J, Hlinka J, Chvojka J, Hadrava M, Kumpost V, et al. Loss of neuronal network resilience precedes seizures and determines the ictogenic nature of interictal synaptic perturbations. Nature neuroscience. 2018;21(12):1742–1752. pmid:30482946
  19. 19. Maturana MI, Meisel C, Dell K, Karoly PJ, D’Souza W, Grayden DB, et al. Critical slowing down as a biomarker for seizure susceptibility. Nature communications. 2020;11(1):2172. pmid:32358560
  20. 20. Wichers M, Groot PC. Critical slowing down as a personalized early warning signal for depression. Psychotherapy and Psychosomatics. 2016;85(2):114–116. pmid:26821231
  21. 21. Bayani A, Hadaeghi F, Jafari S, Murray G. Critical slowing down as an early warning of transitions in episodes of bipolar disorder: a simulation study based on a computational model of circadian activity rhythms. Chronobiology international. 2017;34(2):235–245. pmid:28060532
  22. 22. Bos FM, Schreuder MJ, George SV, Doornbos B, Bruggeman R, Van der Krieke L, et al. Anticipating manic and depressive transitions in patients with bipolar disorder using early warning signals. International Journal of Bipolar Disorders. 2022;10(1):12. pmid:35397076
  23. 23. Thibault S, Heyer L, Benchetrit G, Baconnier P. Ventilatory support: a dynamical systems approach. Acta Biotheoretica. 2002;50(4):269–279. pmid:12675531
  24. 24. Fontecave Jallon J, Abdulhay E, Calabrese P, Baconnier P, Gumery PY. A model of mechanical interactions between heart and lungs. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences. 2009;367(1908):4741–4757.
  25. 25. Ashwin P, Wieczorek S, Vitolo R, Cox P. Tipping points in open systems: bifurcation, noise-induced and rate-dependent examples in the climate system. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences. 2012;370(1962):1166–1184. pmid:22291228
  26. 26. Wunsch H, Wagner J, Herlim M, Chong D, Kramer A, Halpern SD. ICU occupancy and mechanical ventilator use in the United States. Critical care medicine. 2013;41(12). pmid:23963122
  27. 27. Epstein S. Decision to extubate. Intensive care medicine. 2002;28(5):535–546. pmid:12029399
  28. 28. Busl KM, Ouyang B, Boland TA, Pollandt S, Temes RE. Prolonged mechanical ventilation is associated with pulmonary complications, increased length of stay, and unfavorable discharge destination among patients with subdural hematoma. Journal of Neurosurgical Anesthesiology. 2015;27(1):31–36. pmid:24922337
  29. 29. Epstein SK. Extubation failure: an outcome to be avoided. Critical Care. 2004;8:1–3. pmid:15469587
  30. 30. Newth CJ, Venkataraman S, Willson DF, Meert KL, Harrison R, Dean JM, et al. Weaning and extubation readiness in pediatric patients. Pediatric critical care medicine: a journal of the Society of Critical Care Medicine and the World Federation of Pediatric Intensive and Critical Care Societies. 2009;10(1):1. pmid:19057432
  31. 31. Rothaar RC, Epstein SK. Extubation failure: magnitude of the problem, impact on outcomes, and prevention. Current opinion in critical care. 2003;9(1):59–66. pmid:12548031
  32. 32. Thille AW, Harrois A, Schortgen F, Brun-Buisson C, Brochard L. Outcomes of extubation failure in medical intensive care unit patients. Critical care medicine. 2011;39(12):2612–2618. pmid:21765357
  33. 33. Epstein SK, Ciubotaru RL. Independent effects of etiology of failure and time to reintubation on outcome for patients failing extubation. American journal of respiratory and critical care medicine. 1998;158(2):489–493. pmid:9700126
  34. 34. Mueller M, Almeida JS, Stanislaus R, Wagner CL. Can machine learning methods predict extubation outcome in premature infants as well as clinicians? Journal of neonatal biology. 2013;2. pmid:25419493
  35. 35. Miu T, Joffe AM, Yanez ND, Khandelwal N, Dagal AH, Deem S, et al. Predictors of reintubation in critically ill patients. Respiratory care. 2014;59(2):178–185. pmid:23882103
  36. 36. Hsieh MH, Hsieh MJ, Chen CM, Hsieh CC, Chao CM, Lai CC. An artificial neural network model for predicting successful extubation in intensive care units. Journal of clinical medicine. 2018;7(9):240. pmid:30149612
  37. 37. Zhao QY, Wang H, Luo JC, Luo MH, Liu LP, Yu SJ, et al. Development and validation of a machine-learning model for prediction of extubation failure in intensive care units. Frontiers in medicine. 2021;8:676343. pmid:34079812
  38. 38. Guzatti NG, Klein F, Oliveira JA, Ratico GB, Cordeiro MF, Marmitt LP, et al. Predictive Factors of Extubation Failure in COVID-19 Mechanically Ventilated Patients. Journal of Intensive Care Medicine. 2022;37(9):1250–1255. pmid:35422150
  39. 39. Torrini F, Gendreau S, Morel J, Carteaux G, Thille AW, Antonelli M, et al. Prediction of extubation outcome in critically ill patients: a systematic review and meta-analysis. Critical Care. 2021;25(1):1–12. pmid:34782003
  40. 40. White CE, Batchinsky AI, Necsoiu C, Nguyen R, Walker KP III, Chung KK, et al. Lower interbreath interval complexity is associated with extubation failure in mechanically ventilated patients during spontaneous breathing trials. Journal of Trauma and Acute Care Surgery. 2010;68(6):1310–1316. pmid:20539175
  41. 41. Seely AJ, Bravi A, Herry C, Green G, Longtin A, Ramsay T, et al. Do heart and respiratory rate variability improve prediction of extubation outcomes in critically ill patients? Critical Care. 2014;18:1–12. pmid:24713049
  42. 42. Keim-Malpass J, Enfield KB, Calland JF, Lake DE, Clark MT. Dynamic data monitoring improves predictive analytics for failed extubation in the ICU. Physiological Measurement. 2018;39(7):075005. pmid:29932430
  43. 43. Catling FJ, Wolff AH. Temporal convolutional networks allow early prediction of events in critical care. Journal of the American Medical Informatics Association. 2020;27(3):355–365. pmid:31858114
  44. 44. Li H, Xie X. Deep learning-based prediction of mechanical ventilation reintubation in intensive care units. In: INFORMS International Conference on Service Science. Springer; 2022. p. 15–22.
  45. 45. Zeng Z, Tang X, Liu Y, He Z, Gong X. Interpretable recurrent neural network models for dynamic prediction of the extubation failure risk in patients with invasive mechanical ventilation in the intensive care unit. Biodata Mining. 2022;15(1):21. pmid:36163063
  46. 46. Martinez A, Seymour C, Nam M. Minute ventilation recovery time: a predictor of extubation outcome. Chest. 2003;123(4):1214–1221. pmid:12684314
  47. 47. Seymour CW, Halpern S, Christie JD, Gallop R, Fuchs BD. Minute ventilation recovery time measured using a new, simplified methodology predicts extubation outcome. Journal of intensive care medicine. 2008;23(1):52–60. pmid:18320706
  48. 48. Khalil L, George SV, Ray S, Arridge S. Transitions in intensive care: Investigating critical slowing down post extubation. OSF. 2022.
  49. 49. Frutos-Vivar F, Esteban A, Apezteguia C, González M, Arabi Y, Restrepo MI, et al. Outcome of reintubated patients after scheduled extubation. Journal of critical care. 2011;26(5):502–509. pmid:21376523
  50. 50. Knox KE, Nava-Guerra L, Hotz JC, Newth CJ, Khoo MC, Khemani RG. High breath-by-breath variability is associated with extubation failure in children. Critical care medicine. 2020;48(8):1165. pmid:32697487
  51. 51. Goel N, Chakraborty M, Watkins WJ, Banerjee S. Predicting extubation outcomes—a model incorporating heart rate characteristics index. The Journal of pediatrics. 2018;195:53–58. pmid:29329913
  52. 52. Gupta P, McDonald R, Goyal S, Gossett JM, Imamura M, Agarwal A, et al. Extubation failure in infants with shunt-dependent pulmonary blood flow and univentricular physiology. Cardiology in the Young. 2014;24(1):64–72. pmid:23328580
  53. 53. Boettner C, Boers N. Critical slowing down in dynamical systems driven by nonstationary correlated noise. Physical Review Research. 2022;4(1):013230.
  54. 54. Kafantaris E, Hatami F, Naveed H, Yi J, French J, Mateen B, et al. Data Study Group Final Report: Great Ormond Street Hospital; 2020. Available from: http://doi.org/10.5281/zenodo.3670726.
  55. 55. Baig SuR, Iqbal W, Berral JL, Carrera D. Adaptive sliding windows for improved estimation of Data Center Resource Utilization. Future Generation Computer Systems. 2020;104:212–224.
  56. 56. Jäger G, Füllsack M. Systematically false positives in early warning signal analysis. PLoS One. 2019;14(2):e0211072. pmid:30726240
  57. 57. Chang WC, Kudlacek J, Hlinka J, Chvojka J, Hadrava M, Kumpost V, et al. Loss of neuronal network resilience precedes seizures and determines the ICTOGENIC nature of interictal synaptic perturbations. Nature Neuroscience. 2018;21(12):1742–1752. pmid:30482946
  58. 58. Lenton TM. Early warning of climate tipping points. Nature climate change. 2011;1(4):201–209.
  59. 59. Tu C, D’Odorico P, Suweis S. Critical slowing down associated with critical transition and risk of collapse in crypto-currency. Royal Society Open Science. 2020;7(3):191450. pmid:32269788
  60. 60. Hussain M, Mahmud I. pyMannKendall: a python package for non parametric Mann Kendall family of trend tests. Journal of Open Source Software. 2019;4(39):1556.
  61. 61. Hamed KH, Rao AR. A modified Mann-Kendall trend test for autocorrelated data. Journal of hydrology. 1998;204(1-4):182–196.
  62. 62. Chen S, Ghadami A, Epureanu BI. Practical guide to using Kendall’s τ in the context of forecasting critical transitions. Royal Society Open Science. 2022;9(7):211346. pmid:35911200
  63. 63. Oliphant TE. A guide to NumPy. vol. 1. Trelgol Publishing USA; 2006.
  64. 64. Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nature methods. 2020;17(3):261–272. pmid:32015543
  65. 65. Wang R, Dearing JA, Langdon PG, Zhang E, Yang X, Dakos V, et al. Flickering gives early warning signals of a critical transition to a eutrophic lake state. Nature. 2012;492(7429):419–422. pmid:23160492
  66. 66. Dakos V, van Nes EH, Scheffer M. Flickering as an early warning signal. Theoretical ecology. 2013;6(3):309–317.
  67. 67. Titus M, Watson J. Critical speeding up as an early warning signal of stochastic regime shifts. Theoretical Ecology. 2020;13:449–457.
  68. 68. Dakos V, van Nes EH, D’Odorico P, Scheffer M. Robustness of variance and autocorrelation as indicators of critical slowing down. Ecology. 2012;93(2):264–271. pmid:22624308
  69. 69. Helmich MA, Schreuder MJ, Bringmann LF, Riese H, Snippe E, Smit AC. Slow down and be critical before using early warning signals in psychopathology. Nature Reviews Psychology. 2024; p. 1–14.
  70. 70. De Jong A, Talmor D, Jaber S. How to optimize extubation? Intensive Care Medicine. 2023;49(3):337–340. pmid:36719457
  71. 71. Upadya A, Tilluckdharry L, Muralidharan V, Amoateng-Adjepong Y, Manthous CA. Fluid balance and weaning outcomes. Intensive care medicine. 2005;31:1643–1647. pmid:16193330
  72. 72. Freestone DR, Karoly PJ, Cook MJ. A forward-looking review of seizure prediction. Current opinion in neurology. 2017;30(2):167–173. pmid:28118302
  73. 73. Wilkat T, Rings T, Lehnertz K. No evidence for critical slowing down prior to human epileptic seizures. Chaos: An Interdisciplinary Journal of Nonlinear Science. 2019;29(9):091104. pmid:31575122
  74. 74. Helmich MA, Smit AC, Bringmann LF, Schreuder MJ, Oldehinkel AJ, Wichers M, et al. Detecting Impending Symptom Transitions Using Early-Warning Signals in Individuals Receiving Treatment for Depression. Clinical Psychological Science. 2021; p. 21677026221137006.
  75. 75. Thompson JMT, Sieber J. Predicting climate tipping as a noisy bifurcation: a review. International Journal of Bifurcation and Chaos. 2011;21(02):399–423.
  76. 76. Weinans E, Lever JJ, Bathiany S, Quax R, Bascompte J, van Nes EH, et al. Finding the direction of lowest resilience in multivariate complex systems. Journal of The Royal Society Interface. 2019;16(159):20190629. pmid:31662072
  77. 77. Patterson AC, Strang AG, Abbott KC. When and where we can expect to see early warning signals in multispecies systems approaching tipping points: insights from theory. The American Naturalist. 2021;198(1):E12–E26. pmid:34143719
  78. 78. Dakos V. Identifying best-indicator species for abrupt transitions in multispecies communities. Ecological indicators. 2018;94:494–502.
  79. 79. Wen H, Ciamarra MP, Cheong SA. How one might miss early warning signals of critical transitions in time series data: A systematic study of two major currency pairs. PloS one. 2018;13(3):e0191439. pmid:29538373
  80. 80. Arkilanian A, Clements CF, Ozgul A, Baruah G. Effect of time series length and resolution on abundance-and trait-based early warning signals of population declines. Ecology. 2020;101(7):e03040. pmid:32134503
  81. 81. van der Bolt B, van Nes EH, Scheffer M. No warning for slow transitions. Journal of the Royal Society Interface. 2021;18(176):20200935. pmid:33784883
  82. 82. Lenton T, Livina V, Dakos V, Van Nes E, Scheffer M. Early warning of climate tipping points from critical slowing down: comparing methods to improve robustness. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences. 2012;370(1962):1185–1204. pmid:22291229
  83. 83. Johnson AE, Pollard TJ, Shen L, Lehman LwH, Feng M, Ghassemi M, et al. MIMIC-III, a freely accessible critical care database. Scientific data. 2016;3(1):1–9. pmid:27219127
  84. 84. Johnson A, Bulgarelli L, Pollard T, Horng S, Celi LA, Mark R. Mimic-iv. PhysioNet Available online at: https://physionet.org/content/mimiciv/10/ (accessed August 23, 2021). 2020; p. 49–55.
  85. 85. Smit AC, Snippe E, Bringmann LF, Hoenders HR, Wichers M. Transitions in depression: if, how, and when depressive symptoms return during and after discontinuing antidepressants. Quality of Life Research. 2023;32(5):1295–1306. pmid:36418524
  86. 86. Sutton RT, Pincock D, Baumgart DC, Sadowski DC, Fedorak RN, Kroeker KI. An overview of clinical decision support systems: benefits, risks, and strategies for success. NPJ digital medicine. 2020;3(1):17. pmid:32047862