FormalPara Take-home message

Neurocritical care patients display an extubation failure rate of 19% and direct tracheostomy is used as the primary liberation strategy of invasive mechanical ventilation in 21% of patients. Our validated score of extubation success prediction can be used in various settings worldwide.

Introduction

Invasive mechanical ventilation (IMV) is a key intervention in critical care patients [1]. Timely weaning from invasive ventilation may reduce the risk of hospital-acquired pneumonia [2], chronic illness related to intensive care unit (ICU) stay [3], and costs of care [1]. Neurocritical care patients undergo longer duration of IMV [4] and higher extubation failure rates [5] compared to the general population of critically ill patients. The liberation of IMV is thus of major interest in this population. The latest guidelines in neurocritical care patients highlight the poor level of evidence for extubation management or use of tracheostomy [6]. Neurocritical patients are usually poorly represented in randomized-controlled trials [7] and in cohort studies [7, 8]. It is uncertain which factors contribute to extubation success, and it is also unclear which patients may benefit from direct tracheostomy, i.e., tracheostomy without an initial extubation attempt [9]. All have attempted to provide scores predicting extubation success [10,11,12,13], but suffer from methodological issues (monocentric studies, lack of validation cohort). Consequently, evidence-based clinical guidance for extubation and tracheostomy in neurocritical care patients is lacking.

The primary objective of the international, prospective, multicentre ENIO cohort (Extubation strategies and in neuro-intensive care unit patients and associations with outcomes, NCT03400904) was to validate a score predictive of extubation success. Secondary objectives were to describe the causes of extubation failure, and describe the association between liberation strategies (extubation attempt, extubation failure, tracheostomy when extubation strategy was not applied) and outcomes.

Methods

The ENIO study is an investigator-initiated prospective, multicentre, international, observational study examining a cohort of neurocritical care patients requiring IMV (NCT03400904). The protocol has been previously published, and the plan for the primary analysis was finalized before cleaning and closing of the database [14]. The study protocol was approved by local institutional review boards according to local regulations. Initial approval was obtained from Groupe Nantais d’Éthique dans le Domaine de la Santé, IRB No. 7-11-2017). Given the observational nature of the study, patient’s consent was generally waived. In centres, where it was not waived, informed consent was obtained from patients’ relatives, and afterwards from patients who recovered sufficiently. Oral and written information were provided, to the next-of-kin or legal representative, and to the patient whenever recovery was deemed adequate. The study was conducted according to the principles of the Declaration of Helsinki [15].

Participating centres and subjects

We recruited centres through the national and international intensive care and neurocritical care networks, and site investigators (mailing lists and websites of the PROtective VENTilation network, the European Society of Intensive Care Medicine, the Colegio Mexicano de Medicina Critica, the Atlanréa group and the Société Française d’Anesthésie-Réanimation–SFAR research network). Participating centres screened and included consecutive patients during a period of at least 6 months. Medical and research personnel prospectively screened patients for inclusion. Neurocritical care patients (defined as patients with traumatic brain injury (TBI), subarachnoid aneurysmal haemorrhage (SAH), intra-cranial haemorrhage (ICH), ischemic stroke, central nervous system infection (brain abscess, empyema, meningitis, encephalitis or brain tumour) were eligible to this study, if they were ≥ 18 years, admitted to the ICU with a baseline Glasgow Coma Score (GCS) ≤ 12 before endo-tracheal intubation, required invasive mechanical ventilation ≥ 24 h and underwent an attempt to liberate the patient from the ventilator, defined as an extubation trial and/or tracheostomy. Patients were excluded if < 18 years, pregnant, spinal cord injury above T4, resuscitated post-cardiac arrest, Guillain–Barré syndrome, motor neuron disease, muscular dystrophy and myasthenia gravis, death before extubation, withdrawal of life-sustaining treatment (WLST) in the first 24 h after ICU admission, end-of-life extubation, major respiratory co-morbidities (defined as chronic oxygen at home, chronic obstructive pulmonary disease grade III or IV of the Gold classification), and major chest trauma (Abbreviated Injury Score (AIS) ≥ 3). Patients who underwent tracheostomy prior to ICU admission were also excluded. Patients who died without any IMV liberation attempt were not eligible.

Data collection

Data were collected from the 26 of June 2018 to 15 of November 2020. Demographic and baseline data were collected [14]: age, height, weight, type and location (infra-tentorial) of brain injury, baseline GCS, neurocritical care management (barbiturate coma, therapeutic hypothermia, external ventricular drainage, decompressive craniectomy) and location of cerebral injury (posterior fossa). Respiratory data (e.g., mechanical ventilation parameters and laboratory results), sedation management, and the use of neuro-muscular blockade were collected at day 1, day 3 and day 7 after ICU admission. General in-ICU events such as health care-related pneumonia, trachea–bronchitis, acute respiratory distress syndrome and the occurrence of WLST were recorded.

The date of successful spontaneous breathing trial (SBT), first extubation attempt or tracheostomy was recorded. On the day of extubation, data on general management, such as the use of corticosteroids (post-extubation stridor prevention) or enteral nutrition discontinuation, were collected. A standardized clinical examination was performed on the day of extubation: vital signs (body temperature, heart rate, systolic arterial pressure), respiratory (including type and timing of SBT), physical examination (cough assessment, visual pursuit, eye-verbal-motor items of the GCS, gag reflex). The definition of these features was standardized according to previously described data (Online Resource, Text 1). For instance, cough strength was assessed using a 4-grade scale [16]: vigorous, moderate, weak, none. However, some quantitative indices such as the peak flow were not recorded. The exhaustive list of items collected the day of extubation is available on the Online resource. The timing and cause of re-intubation were recorded. Given the lack of consensus in the literature about the weaning and extubation of neurocritical patients [6], extubation strategies and post-extubation strategies (non-invasive mechanical ventilation) were performed according to each centre’s own protocol.

Objectives

The primary objective was to validate a score predictive of extubation success [14], and the primary endpoint was extubation failure [14]. Extubation failure was defined as the necessity to re-intubate patients, after the first planned or accidental extubation attempt [17]. We have screened for any re-intubation, from extubation attempt, until ICU discharge. In case the ICU length-of-stay was longer than 28 days, we stopped the screening of re-intubation.

Since there is no consensus about the time-frame that defines extubation failure in neurocritical care patients [6, 7, 18, 19], we selected a time frame that captured more than 90% of extubation failure [20], to propose a pragmatic approach. In our study, the time-frame for reintubation was set at day 5. However, given the various thresholds proposed in the literature to define extubation failure (2 days [7, 17], within 5 days [7, 18, 19]), and as planned, we provided data regarding the 5-day extubation failure in the results, and 2-day extubation failure in the Online resource [14].

The secondary objectives were to describe the timing and causes of extubation failure, describe the practices in the management of IMV at day 1, day 3 and day 7 after admission, describe sedation practices at day 1, day 3 and day 7 after admission, and compare the characteristics of patients with direct tracheostomy vs patients with extubation attempt. Finally, we explored the association between the IMV liberation strategies (extubation success vs extubation failure, direct tracheostomy without extubation attempt vs extubation trial), and clinical outcomes (duration of invasive and non-invasive mechanical ventilation (mask and high-flow nasal oxygen cannula), ICU length of stay (LOS), in-ICU mortality and in-hospital mortality).

Statistical analysis

The analysis and reporting of the study comply with the TRIPOD statement (EQUATOR network: https://www.equator-network.org/reporting-guidelines/tripod-statement/). As detailed in published study protocol [14], we aimed to include at least 1500 patients in our cohort to screen 300 patients with an extubation failure. Data were expressed as means (SD), medians (interquartile range) and proportions as N (%). The characteristics and the outcomes of the patients in the extubation success group, the extubation failure group and the direct tracheostomy group were compared. Student’s t test or Mann–Whitney U tests were used to compare continuous variables and chi-square tests were used for categorical variables.

To create a predictive score for extubation success, we excluded direct tracheostomy without extubation attempt, because these patients cannot be classified as extubation failure or success, withdrawal of life sustaining therapies during the ICU course which is a competing event with extubation failure. Patients who died without a context of withdrawal of life sustaining therapies and transferred to another facility were kept in the creation of the score, since reintubation was available. This data set was randomly split into a training set (2/3) and a validation set (1/3). The categorization of continuous variables was data-driven and assessment of plots of locally weighted regressions of the logit of extubation failure on each variable in the training set. Multiple imputation by chained equations (MICE) was performed in the setting of missing data [14]. Data were imputed five times via predictive mean matching. Swallowing attempts was the variable with the greatest proportion of missingness (8.3% overall).

Using this imputed data, we trained a logistic regression model with a group LASSO (Least Absolute Shrinkage Selection Operator) penalty [21]. The variables that the model were selected from was based upon literature review; the LASSO then retained variables with a non-zero coefficient. Collinearity is handled by the LASSO procedure, i.e., if 2 variables are highly correlated, only one will be retained in the final model [22, 23].

We used tenfold cross-validation to select the optimal ƛ based on the highest area under receiver operating characteristic curve (AUROC). To build a score that would be easier to calculate by hand at the bedside, we also built a simplified version by selecting the lowest ƛ for which the model retained a maximum of 12 coefficients. Each level of each variable was allocated points according to model coefficients. To create the score and convert the logit coefficients to points we took the following steps. First, we assigned all reference levels to have a coefficient of 0. Second, we identified the minimum and maximum value among coefficients for each variable (including reference level). Then, for each variable we calculated the difference between the minimum and maximum, and summed the differences to create a total weight. We took the difference between each coefficient and the minimum for that variable to create a raw point value. Finally, we multiplied the raw point value times 100 and divide by the total weight.

Model performance was estimated in both training and validation sets using only patients with complete data. Discrimination was evaluated with receiver operating characteristic (ROC) curves and AUCs. Calibration was evaluated with calibration plots and the Hosmer–Lemeshow test; the overall performance of the models was evaluated with the R2/Brier test. For the both models (complete and simplified), we then assessed sensitivity, specificity, and positive and negative predictive values for multiple thresholds chosen to maximize specificity. Positive and negative interval likelihood ratios were calculated for ranges of scores identified from the ROC curve and the distribution of scores in our data. The robustness of the model was tested by re-running the LASSO with 10 different seeds, and compared the list of variables selected by the LASSO.

Results

Patients and site characteristics

The first patient was enrolled in the Netherlands, in June 2018 and the last patient was included in France in October 2020. The flowchart of the study is available in Fig. 1. Of the final cohort (N = 1512), patients suffered from TBI (725 (47.9%)), ICH (521 (34.5%)), SAH (269 (17.8%)), with a median age of 54 [36–66] years and a baseline GCS of 7 [5–9]. The patients’ baseline characteristics are available in Table 1.

Fig. 1
figure 1

Flowchart of the study

Table 1 Baseline characteristics of patients with extubation trial and day 5 extubation failure

Primary objective

Extubation failure rate

In the entire ENIO cohort, 1193 (78.9%) patients had at least one extubation attempt, and 253 (21.2%) required reintubation within 28 days after the extubation attempt. National extubation failure rates varied from 0% up to 28.6% (Table 1). In our cohort, among 253 patients with extubation failure 231 (19.3%) patients required reintubation by day 5, which set the time-frame of extubation failure.

Patients in the 5-day extubation failure group had fewer TBI (82 (35.5%) vs 498 (51.8%), p < 0.001), were older (59 [45–68] years vs 54 [34–65], p = 0.002), and lower baseline GCS score (7 [5–8] vs 7 [5–9], p = 0.006 (Table 1). Results for the baseline characteristics of the day 2 extubation failure group are available in the Online resource, Table 1.

Extubation success factors: univariate analysis

There were no significant differences in general clinical management on the day of extubation between the two groups (day 5 extubation failure). The vital signs on the day of extubation significantly associated with success included higher body temperature, higher heart rate and lower SpO2. The airway clinical features associated with success were swallowing attempts, the presence of a gag reflex, and the frequency of endo-tracheal suctioning. The arousal and neurologic features associated with extubation success were visual pursuit, the total GCS and the motor score of the GCS (Table 2). Patients in the day 2 extubation failure group did not display significant differences with the success group (Online resource, Table 2) [17].

Table 2 Management and clinical features the day of extubation

Successful extubation prediction score

Of the cohort, 1106 patients were eligible to build the score to predict extubation success, resulting in 737 individuals in the training set and 369 in the validation set. The LASSO model with the optimal ƛ retained 20 variables (Online resource, Table 3). The AUC in the training cohort (N = 308) was 0.79 95% confidence interval (CI95) [0.71–0.87] and 0.71 CI95 [0.61–0.81] in the validation cohort (N = 166). The ROC curve, the calibration plot and the decision curve of the complete score in the validation cohort are available in Fig. 2. The ROC curve, the calibration plot and the decision curve of the complete score in the validation cohort are available in the Online Resource, Fig. R1.

Fig. 2
figure 2

ROC curve, calibration plot and decision curve and the complete score. The AUC of the ROC curve of the complete score was 0·71 CI95 [0·61–0·81] in the validation cohort. Hosmer–Lemeshow test in the validation set: p = 0·41. The Brier test of the global score was 0·08 in the training cohort and 0·14 in the validation cohort

Given the number of variables retained in the optimal model, a simplified user-friendly score was also validated. Only 7 predictors were kept in the simplified score (Online resource, Table 4): TBI, vigorous cough, gag reflex, swallowing attempts, endotracheal suctioning ≤ 2 times per hour, GCS motor score = 6 and body temperature the day of extubation. The AUC of the score was 0.79 CI95 [0.71–0.86] in the training cohort and 0.65 CI95 [0.53–0.76] in the validation cohort. The ROC Curve, the calibration plot and the decision curves of the simplified score are available in the Online resource, Fig. 1.

The sensitivity, specificity, positive and negative predictive values of the complete score are available in Table 3 and the values of the simplified score are available in the Online resource, Table 4. Interval likelihood ratios for both scores are in the Online resource, Tables 6 and 7. The likelihood ratio of extubation success for a complete score ≥ 70 points (theoretical range 0–91) was 3.67.

Table 3 Sensitivity, specificity, positive and negative values of the different thresholds of the complete score

In the analysis testing the robustness of our score (10 randomly generated seeds), the AUC in the test set ranged from 0.646 to 0.848, and the Brier score ranged from 0.089 to 0.119. This supplementary analysis is available in the Online resource, Text 2.

Secondary objectives

Causes of extubation failure

In the day 5 failure group, the primary causes of extubation failure were neurologic failure (92 (39.8%) patients), respiratory failure (126 (54.5%) patients) and airway failure (87 (37.7%) patients). Data regarding the causes of day 5 and day 2 extubation failure is available in the Online resource, Table 8.

Ventilator settings and sedation practices

Description of ventilatory settings, respiratory parameters, biology and sedation practices on day 1, day 3 and day 7 after ICU admission is available in the Online resource, Table 9.

Direct tracheostomy

Direct tracheostomy was performed in 319 (21.1%) patients, the median timing of tracheostomy was 9 [5–15] days after starting IMV. The main reasons for tracheostomy were severe neurologic impairment (237 (74.3%) patients), airway impairment (51 (16%) patients) and severe face/neck trauma (14 (4.4%) patients). We observed major differences in the use of tracheostomy between countries (p < 0.001) (Online resource, Table 10).

Extubation failure, tracheostomy and outcomes

Patients with day 5 extubation failure suffered from more frequent hospital-acquired pneumonia, Acute Respiratory Distress Syndrome, a longer duration of IMV and a higher in-ICU mortality rate (Online resource, Table 11). The analysis with day 2 extubation failure displayed similar results (Online resource, Table 12). Patients with direct tracheostomy experienced more hospital-acquired pneumonia, a longer duration of IMV and a higher in-ICU mortality rate, compared to patients without direct tracheostomy (Online resource, Table 13).

Discussion

In the international, prospective ENIO cohort, we found a wide variation in practices of extubation management and tracheostomy between geographic areas in neurocritical care patients. We also found that: (1) the extubation failure rate is around 20%; (2) a score can predict extubation success; (3) both patients with tracheostomy and extubation failure displayed prolonged duration of IMV, higher rates of respiratory infections, and higher rate of mortality.

Neurocritical patients present specific challenges regarding extubation due to the nature of their injuries (altered levels of consciousness complicate safe extubation and aspiration prevention) [24]. The WIND classification [25] may not be applicable to neurocritical care patients, as patients may easily pass SBT, but extubation can be delayed because of neurologic and airway impairment [16]. The poor level of evidence in the neurocritical care literature explains the major variability we observed between countries regarding the management of extubation and tracheostomy.

Extubation failure remains common in the general population of critically ill patients (10–15%) [5, 7] and is as high as 25% in patients with neurologic illness [5]. In previous cohorts with neurologic patients [11,12,13], the extubation failure rate was around 21–25%. The definition of extubation failure varies substantially between studies [12]. In addition, there is currently no consensus on the timeframe to define extubation failure. It was recently proposed to extend the timeframe for extubation failure from 3 days [18] to 7 days after extubation [19]. Miltiades et al. [20] proposed to use a timeframe that captures > 90% of extubation failures. Based on this pragmatic approach, a 5-day time frame was selected to account for > 90% of failures and perform the primary analysis. In addition, we decided to separate patients with direct tracheostomy from patients with extubation attempt, since these patients will not meet the primary outcome. Patients with a late tracheostomy after an extubation attempt, were kept in the analysis. We thus chose a clear definition of extubation failure by discarding tracheostomy or non-invasive ventilation [12].

More than 20 years ago, Coplin et al. [16] pointed out that delaying extubation in the context of successful SBT for safety reasons (neurological recovery) was associated with increased IMV duration and health-acquired pneumonia. Recently, few cohorts have developed specific scores to predict successful extubation in neuro-critical patients [11,12,13]. These scores include general features (age [11, 13], fluid balance [13]), level of consciousness (GCS [11], Coma-Recovery-Scale [12], specific features, such as visual pursuit [11, 12]) and airway evaluation (swallowing attempts [11]) to predict extubation success. However, most cohorts are monocentric, do not provide extensive neurologic and airway exploration at extubation (e.g., gag reflex [16]), and most of all lack external validation that could ensure generalizability of the scores [11,12,13]. To the best of our knowledge, ENIO is the first to propose a score developed on a large sample of international data and easily calculated at the bedside. The items identified as predictors of extubation success were also consistent with those identified in previous studies [11, 12, 16]. However, a threshold of 75 of the complete score with a perfect Positive Prediction Value (100%) could encourage systematic extubation, but bears low sensitivity and will be rarely seen in patients. On the contrary a threshold of 33 with a perfect Negative Predictive Value bears low specificity. The choice of the adequate balance between Positive and Negative Prediction value remains open to discussion. Future prospective trials are necessary to demonstrate whether our score could help physicians in a proactive extubation strategy to decrease morbidity associated with both delayed extubation and/or extubation failure [16].

Direct tracheostomy is a potential strategy to secure the liberation of IMV. There is conflicting evidence that tracheostomy timing could alter the short- and long-term outcomes of patients in the general ICU population [26, 27]. In a meta-analysis focusing on neuro-critical patients, it was reported that early tracheostomy could decrease mortality and the duration of IMV compared to late tracheostomy [9]. In a randomized-controlled trial testing two timing of tracheostomy in patients with an expected prolonged duration of IMV [28], nearly half patients in the late tracheostomy group did not undergo the intervention. These results underline the inability of clinicians to accurately select patients that could benefit from tracheostomy. Further data are needed to guide decision-making for direct tracheostomy in neurocritical care patients.

Strengths and limitations

Our study has strengths. First, the cohort has a large sample size, and was elaborated in various settings. Second, we elaborated a pre-planned analysis that was followed. Finally, the clear endpoints and objectives should end generalizability of our results.

The present study has nonetheless several limitations. Our data can be used to identify associations between outcomes and liberation strategy, but because of the study design, we cannot make causal inferences. Data collection was limited to specific timepoints, such as the first day of successful SBT or the day of tracheostomy. We deliberately chose to focus on major clinical features the day of extubation and in-ICU outcomes, to ensure feasibility of data collection. Since this was an open study, we cannot rule out a Hawthorne effect on extubation practices, with a modification of patient’s management. In addition, important ICU-specific factors such as nursing ratio, the presence of respiratory therapist, local protocols, post-extubation management such as high flow nasal cannula oxygen were not accounted for in this study. However, centres performed extubation according to local protocols. Their impact will be specifically studied in an ancillary study (Online resource, Text 3). Finally, the validation cohort is drawn from the same sample as the learning cohort. These two samples are not independent, and it will be mandatory to validate our findings in another prospective independent cohort.

Conclusions

In this international cohort of neurocritical care patients, extubation failure is high and should be monitored in the first 5 days after an extubation attempt. Neurocritical patients undergoing direct tracheostomy instead of extubation attempt, appear to be a selected group of patients with greater severity, and should be specifically explored.