Intracerebral hemorrhage (ICH) is the most morbid form of stroke and has no specific treatment [1]. Patient outcomes after ICH can be predicted with severity of injury scores, which generally include clinical severity (e.g., Glasgow Coma Scale) and neuroimaging (e.g., hematoma volume). The ICH Score, a composite prognostic scale, is predictive of functional outcome at hospital discharge and follow-up [2, 3]. Other severity of injury scales are similar, despite slight differences in accuracy [4, 5]. However, severity of injury scales for ICH identify neither modifiable predictors of outcome nor patient subgroups who might benefit from treatment. Improved techniques are needed to identify patients with a modifiable outcome and thereby select those most likely to benefit from treatment.

Most interventions improve patient outcomes in some subgroups of patients more than others. Therefore, if a treatment is administered to a group of patients in which a substantial subgroup has a non-modifiable poor outcome, the treatment will be deemed ineffective. Previous attempts to improve outcomes after ICH have included reducing inflammation (e.g., deferoxamine) [6, 7], surgical drainage (e.g., MISTIE III) [8, 9], and reducing hematoma expansion (e.g., FAST trials of Factor VII) [10, 11]. In previous trials aimed at reducing hematoma expansion, the inclusion of patients who were highly unlikely to have a good outcome with any putative treatment (e.g., hematoma volume > 60 mL) may have obscured a benefit in those with modifiable outcomes, despite rapid initiation of therapy and hemostatic efficacy [12, 13].

Previous studies on severity of injury scales in ICH have relied on regression models to identify outcome predictors. While regression models (e.g., logistic regression) quantify the contribution of independent predictors toward an outcome in the entire cohort, they do not readily identify subgroups within the cohort; such subgroups must instead be identified post hoc before they can form the basis for a subsequent trial [14]. To identify subgroups, decision tree-based methods are more intuitive and appropriate, as they operate by segmenting the observations into subgroups that maximize differences between groups. This enables group-assignment predictions to be made readily in different subgroups of patients, which is the goal of the current study.

Identifying effective ICH therapies may depend on the ability to determine which subgroups of patients have a potentially modifiable outcome, so that treatment can be focused on patients most likely to benefit. Our objective was to identify independent predictors of outcome in ICH that are potentially modifiable, in order to select patients for targeted treatment.

Materials and Methods

Study Overview

This study included the same patients as a previous study [15]. In brief, patients with spontaneous acute ICH were prospectively identified and enrolled in a registry between 2006 and 2016 at two unrelated institutions. ICH cases were diagnosed using CT by a board-certified neurologist, and patients were admitted to the ICU. Data were subsequently collected from patients, including Glasgow Coma Scale (GCS) scores, hematoma volume, hematoma expansion, demographics, and pre-ICH modified Rankin Scale (mRS) scores. The mRS, an ordinal scale from 0 (no symptoms) to 6 (death), was obtained at 14 days (or hospital discharge, whichever came first) and at 3-month follow-up, as previously reported. “Good outcome” was defined as mRS 0–3 (independence or better) and “poor outcome” as mRS 4–6 (dependence or death). This dichotomization has previously been implemented in trials with ICH patients [16, 17] and represents a common method of analyzing mRS results that can be implemented in binary classification algorithms [18]. Aspirin use was coded as either over-the-counter use or use as directed by a physician and is a predictor of hematoma expansion and of the mRS in patients with ICH [19]. Other variables were selected based on a combination of theoretical associations with ICH patient outcomes and the pattern of missingness across variables. Only complete observations were used in the final analyses, which meant that variables with large amounts of missingness were not included. In total, 19 features were included in the analyses for Institution 1.
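As a brief illustration, the dichotomization above can be expressed in a few lines of R; the data frame and column names below are hypothetical placeholders rather than the registry’s actual variables.

```r
# Minimal sketch of the mRS dichotomization (mRS 0-3 = "good", 4-6 = "poor").
# The data frame and column names are illustrative, not the study's actual fields.
ich <- data.frame(mrs_14day = c(0, 2, 4, 6), mrs_3month = c(1, 3, 5, 6))

ich$outcome_14day  <- factor(ifelse(ich$mrs_14day  <= 3, "good", "poor"), levels = c("good", "poor"))
ich$outcome_3month <- factor(ifelse(ich$mrs_3month <= 3, "good", "poor"), levels = c("good", "poor"))
```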

The analyses for this project can be broken into two main stages. In the first stage of analysis, we trained models on a single cohort of patients from Northwestern Memorial Hospital (Institution 1). Model construction was completed using a fivefold cross-validation framework replicated ten times with random splitting, with nested cross-validation within each of the outer folds implemented for parameter tuning. The outer folds of the cross-validation were used to estimate out-of-sample performance and to select parameters, which were then used to construct final models on the entire training dataset. In the second stage of analysis, we provided further evaluation of model predictive performance by testing a representative predictive model in a cohort of patients from the Johns Hopkins Hospital and Johns Hopkins Bayview Medical Center, part of Johns Hopkins Medicine (Institution 2). At Institution 2, only discharge disposition was available (i.e., no mRS scores), so discharge disposition was used instead, as this outcome was present in both datasets. Discharge disposition at both institutions was categorized as “good outcome” (discharged to home or rehabilitation) or “poor outcome” (discharged to long-term acute care, a nursing facility, or death).

In order to estimate model performance on unknown data, we implemented the above-mentioned fivefold cross-validation framework. In this approach, the dataset is split into five equally sized parts, hereafter referred to as “folds.” Subsequently, one of the folds is temporarily designated the “test” or “validation” fold (i.e., the data on which a model is evaluated) and the observations within that fold are left out of the model construction process. The observations in the remaining four folds of the dataset then serve as the training data (i.e., the data on which the model is built), and model construction proceeds on these data without the inclusion of the observations in the test fold. Upon completion of model training, the final model is evaluated on the data in the test fold, producing predictions that are entirely separated from the model construction process. Once the model is evaluated on the test fold, a new fold is selected as the test fold and the other four folds—including the fold that was previously the test fold—are considered the training data for a new round of model construction. This process continues until each of the five folds has served as the test dataset once. As a result of the cross-validation procedure, predictions are made for each of the observations in the dataset based on models that maintain training-test separation. In other words, the models are not built and evaluated on the same observations, thus reducing overfitting and enhancing the potential generalizability of the results beyond the training data. Within a given outer loop, parameter tuning occurred through a separate cross-validation procedure. This form of nested cross-validation ensures that the same separation between training and test data implemented for generalization predictions applies to the process of parameter tuning, thus reducing the bias of the models toward the training data [20]. The cross-validation process was repeated ten times with different splits each time to reduce the likelihood that a given split improperly influenced the results.
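The outer loop of this procedure can be sketched as follows. This is an illustrative example only, using a simulated stand-in dataset from the caret package; in the actual analyses, the loop was repeated ten times with different random splits, and a second, nested cross-validation for parameter tuning was run within each training set.

```r
# Illustrative sketch of the outer cross-validation loop (not the study's exact code).
library(caret)    # for twoClassSim() and createFolds()
library(rpart)

set.seed(1)
ich <- twoClassSim(200)                        # hypothetical stand-in dataset
names(ich)[names(ich) == "Class"] <- "outcome"

folds <- createFolds(ich$outcome, k = 5)       # indices of the five held-out ("test") folds

cv_predictions <- lapply(folds, function(test_idx) {
  train_data <- ich[-test_idx, ]               # remaining four folds: training data
  test_data  <- ich[ test_idx, ]               # held-out fold: evaluation only
  fit <- rpart(outcome ~ ., data = train_data, method = "class")
  predict(fit, newdata = test_data, type = "prob")[, 2]   # out-of-sample probabilities of the second class
})
```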

In addition to the repeated cross-validated analyses, we went one step further in the second stage of analysis, isolating an additional dataset on which to test the models built on the data from Institution 1. We did this additional step as a further test of generalizability, as there could be characteristics of Institution 1 shared across folds that might artificially enhance confidence in the generalizability of our results. By testing our models on data from a separate institution, we alleviate this concern, as characteristics that are shared among patients at Institution 1 may be less likely to be shared with patients at Institution 2. In summary, we used fivefold cross-validation from Institution 1 to construct models and subsequently used the dataset from Institution 2 as a pure test set drawn from a different sample of patients.

Algorithms

To identify the independent predictors of good or poor outcome, we used decision tree-based algorithms to automatically classify subjects based on the values of the independent variables. Specifically, the algorithm automatically selected the binary splits on variable values that led to the largest reduction in the Gini impurity index—a measure of impurity within nodes in a tree, with smaller values indicating that the observations in a node are more likely to share a class—at a given step along the tree [21]. Node splits were determined using a standard binary recursive partitioning algorithm, which splits the dataset into two groups at each node in a recursive manner, selecting the split on a variable’s values, among all possible variables, that leads to the largest decrease in Gini impurity. The decision trees were overgrown and subsequently pruned using a cost-complexity pruning parameter, determined by nested cross-validation and chosen as the value of the complexity parameter that led to the overall highest average cross-validated area under the receiver operating characteristic curve (AUC). AUC is a single numerical value that quantifies the relationship expressed in the ROC curve, which plots the false-positive rate against the true-positive rate across a variety of probability cutoffs. In concrete terms, the AUC represents the probability that, given two randomly chosen observations known to be from the “poor” and “good” outcome groups, the model assigns a higher predicted probability of a “good” outcome to the observation actually from the “good” group. Final model performance was quantified using AUC across all models. Decision tree results can be visualized as a diagram in which each variable split is represented by branches that segment the sample into increasingly smaller partitions according to specific variable values.
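A minimal sketch of this grow-then-prune procedure is shown below, using the rpart and pROC packages on simulated data with illustrative variable names; the pruning value shown is a placeholder, whereas in the study the cost-complexity parameter was selected by nested cross-validation to maximize cross-validated AUC.

```r
# Hedged sketch of growing and pruning a single classification tree (simulated data).
library(rpart)
library(pROC)

set.seed(2)
dat <- data.frame(
  outcome         = factor(sample(c("good", "poor"), 200, replace = TRUE)),
  hematoma_volume = rexp(200, rate = 1/30),
  gcs             = sample(3:15, 200, replace = TRUE),
  age             = round(rnorm(200, 65, 12))
)

# Grow an intentionally large tree (splits chosen by the default Gini criterion), then prune
# it back with the cost-complexity parameter (cp); cp = 0.02 is a placeholder value only.
full_tree   <- rpart(outcome ~ ., data = dat, method = "class",
                     control = rpart.control(cp = 0, minsplit = 10))
pruned_tree <- prune(full_tree, cp = 0.02)

# AUC: probability that a randomly chosen "poor" case receives a higher predicted
# probability of "poor" outcome than a randomly chosen "good" case.
probs <- predict(pruned_tree, type = "prob")[, "poor"]
auc(roc(dat$outcome, probs, levels = c("good", "poor")))
```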

To increase confidence in our results, we also performed a random forest analysis, an ensemble method that combines bootstrapping and averaging of multiple decision trees to build a more powerful predictive model [22, 23]. Random forest algorithms construct many possible trees by sampling observations with replacement from the overall sample and building decision trees based on these bootstrapped samples. Trees are de-correlated from each other by randomly sampling variables at each node split, meaning that individual trees can be based on different sets of predictor variables. Results from these trees are aggregated into a final model that uses information from all constituent trees to make a classification decision. The final model classifies individual observations by majority vote, meaning that each observation is assigned to the class predicted by the majority of the constituent trees. As this is a “black box” method, the individual steps in the random forest process cannot be illustrated in the same format as an individual decision tree. However, variables important to overall model performance can be summarized across the constituent trees. Predictions from random forest models tend to be more generalizable because they are less prone to the overfitting that can occur in individual decision trees.
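The following sketch fits a random forest of the kind described above on simulated data; the number of trees and the number of variables sampled at each split are illustrative defaults, not the study’s tuned values.

```r
# Minimal sketch of the random forest ensemble (hypothetical data, not the registry).
library(randomForest)

set.seed(3)
dat <- data.frame(
  outcome         = factor(sample(c("good", "poor"), 200, replace = TRUE)),
  hematoma_volume = rexp(200, rate = 1/30),
  gcs             = sample(3:15, 200, replace = TRUE),
  ivh             = factor(sample(c("yes", "no"), 200, replace = TRUE)),
  age             = round(rnorm(200, 65, 12))
)

rf <- randomForest(
  outcome ~ ., data = dat,
  ntree = 500,          # number of bootstrapped trees averaged into the final model
  mtry  = 2,            # number of randomly sampled candidate variables at each split
  importance = TRUE     # track variable importance across the constituent trees
)

rf                       # out-of-bag error estimate and confusion matrix
importance(rf)           # variable importance summarized across trees
```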

We further tested the decision tree and random forest algorithms in data from 206 patients from Institution 2 to test the generalizability of the models. As machine learning algorithms require identical variables across training and testing samples, and there is no standardized set of variables collected on all ICH patients at every institution, some amount of variable harmonization was necessary before the algorithms could be tested on the independent dataset. Thus, variables between Institution 1 and Institution 2 were harmonized to ensure the same meaning (e.g., the same definition and measurement of hematoma expansion across institutions) [24] so that models trained on variables from Institution 1 could be applied to predict outcomes from observations in Institution 2. This harmonization process involved identifying variables in both datasets that were identical in essence, if not in measurement, and re-coding them so that measurements matched across datasets. For example, both Institution 1 and Institution 2 had a measure of hematoma expansion; however, Institution 1 measured expansion as a continuous variable in milliliters, while Institution 2 recorded the presence of hematoma expansion as a true/false dichotomous variable using a previously validated definition [24]. For harmonization, hematoma expansion at Institution 1 was re-coded to match the definition at Institution 2. A similar process was conducted across the other variables, with each one changed so that measurements were identical across both datasets. Of note, Institution 2 did not have mRS scores, meaning the outcome variable for this analysis differed from that described previously. Specifically, the outcome used in both cohorts was the disposition of the patient at hospital discharge. “Good” disposition was defined at both institutions as discharge to home or to rehabilitation; “poor” disposition was defined as discharge to long-term acute care or a nursing facility, or death. In total, 15 features were included in the analyses for Institution 2. Decision tree and random forest models were then derived as described above for Institution 1 and tested in data from Institution 2.
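A hedged sketch of one such harmonization step is shown below; the volume and percentage thresholds are illustrative placeholders and should not be read as the validated expansion definition cited above.

```r
# Sketch of re-coding Institution 1's continuous expansion measure to a dichotomous variable.
# Thresholds (6 mL / 33%) are placeholders, not necessarily the validated definition in [24].
inst1 <- data.frame(baseline_volume = c(20, 45, 10), followup_volume = c(21, 62, 18))

inst1$expansion_ml <- inst1$followup_volume - inst1$baseline_volume   # continuous change in mL
inst1$expansion    <- inst1$expansion_ml >= 6 |
                      inst1$expansion_ml >= 0.33 * inst1$baseline_volume   # TRUE/FALSE, matching Institution 2
```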

We used standard statistical software to construct decision trees (R package “rpart” [25]) and random forests (R package “randomForest” [26]) implemented in R version 3.5.3 [27]. We implemented fivefold nested cross-validation to optimize parameters and minimize overfitting (i.e., a model that is more accurate in the data from which it was derived because it fits noise in addition to signal) using the tuning framework from the “caret” package in R [28] for both the decision tree and random forest algorithms. An overfit model would tend to be predictive in the training cohort but not in external data, as such models fit random variation in addition to true relationships in the data. Pruning of trees was explored to further reduce the likelihood of overfitting.
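For reference, a minimal sketch of the caret tuning setup is shown below; caret’s train function performs the inner (tuning) resampling, while the outer performance-estimation loop described earlier wraps around it. The simulated data and grid sizes are illustrative only.

```r
# Sketch of parameter tuning via caret's resampling framework (illustrative values and data).
library(caret)
library(pROC)

set.seed(4)
dat <- twoClassSim(200)          # hypothetical stand-in for the Institution 1 data

ctrl <- trainControl(
  method = "repeatedcv", number = 5, repeats = 10,      # fivefold CV repeated ten times
  classProbs = TRUE, summaryFunction = twoClassSummary  # report AUC ("ROC") for each candidate
)

# Tune the cost-complexity parameter (cp) of the decision tree to maximize cross-validated AUC
tree_fit <- train(Class ~ ., data = dat, method = "rpart",
                  metric = "ROC", tuneLength = 10, trControl = ctrl)

# Tune mtry for the random forest in the same framework
rf_fit <- train(Class ~ ., data = dat, method = "rf",
                metric = "ROC", tuneLength = 5, trControl = ctrl)
```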

Standard Protocol Approvals, Registrations, and Patient Consents

The study was approved by the institutional review board (IRB). Written informed consent was obtained from the patient or a legally authorized representative. The IRB approved a waiver of consent for patients who died during initial hospitalization and those who were incapacitated and for whom a legal representative could not be located. Separately, identification of patients and collection of data in the Hopkins cohort were approved by the Johns Hopkins IRB.

Data Sharing

Qualified investigators who wish to reproduce the analysis or utilize the code should contact the corresponding author.

Results

Demographics of the cohort are shown for the 278 included ICH patients from Institution 1 (Table 1) and the 206 included ICH patients from Institution 2 (Table 2), stratified by outcome. From Institution 1, the decision tree algorithm selected initial hematoma volume, hematoma expansion, intraventricular hemorrhage (IVH), and Glasgow Coma Scale (GCS) scores as the most important predictors of 14-day outcome, where variable importance is defined as the relative reduction in the loss function attributable to a variable at each node, summed across the overall tree [25]. The model to predict good or poor patient outcome at 3 months was similar, with the most important variables being GCS scores, hematoma volume at admission, and hematoma expansion. The decision tree for prediction of outcomes at 3 months is shown in Fig. 1. Hematoma expansion is the only modifiable predictor that can affect predicted outcomes in this tree. Of particular note is a subset of patients who have a GCS score of 13 or higher, hematoma volume at admission less than 44.5 mL, and a premorbid modified Rankin Scale score of less than 4. For these patients, hematoma expansion is a critical predictor of outcome, with hematoma expansion greater than 6.65 mL leading to the prediction of poor outcome for 5% of the total sample (10 patients). These patients may be of particular interest for future clinical intervention.

Table 1 Characteristics of the 278 patients from Institution 1, stratified by “good” (mRS 0–3) and “poor” (mRS 4–6) outcomes
Table 2 Characteristics of the 206 patients from Institution 2, stratified by “good” (discharged to home or rehabilitation) and “poor” (discharged to long-term acute care or a nursing facility, or death) outcomes
Fig. 1

Decision tree predicting good or poor outcome at 3 months, fit on the entire training dataset after repeated fivefold cross-validation to tune parameters and estimate model performance. Classification starts at the top, and each observation works its way down the branches until it reaches a terminal node (i.e., a node with no further branches below it). At this point, the observation is classified as either good outcome or poor outcome. Percentages of the total sample size are included in the terminal nodes to represent the number of observations in a given node, with the percentage of these observations that are accurately classified following a semicolon. This decision tree is provided to illustrate what one of the algorithms selected as the best way to classify patients at discharge and is not recommended for prescriptive use without further testing

Random forest models performed better than the decision trees across both outcome types in Institution 1. The cross-validated performance (measured using AUC) of the decision tree and random forest algorithms on the validation data from Institution 1 is shown in Table 3, along with 95% confidence intervals (CIs) derived using stratified bootstrap resampling. For 14-day outcomes, the AUC for the decision tree approach was 0.79 (95% CI [0.77, 0.81]), compared to an AUC of 0.85 (95% CI [0.84, 0.87]) for the random forest. For 3-month outcomes, the AUC for the decision tree approach was 0.75 (95% CI [0.73, 0.77]), compared to an AUC of 0.82 (95% CI [0.80, 0.84]) for the random forest. Accuracy rates assuming a 50% classification cutoff are provided in Table 4, along with proportional confusion matrix information.
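For readers wishing to reproduce this type of interval, a minimal sketch using the pROC package is shown below; the outcome and probability vectors are simulated placeholders rather than the study’s predictions.

```r
# Sketch of a stratified bootstrap confidence interval for AUC (simulated inputs).
library(pROC)

set.seed(5)
outcome   <- factor(sample(c("good", "poor"), 278, replace = TRUE))
prob_poor <- runif(278)   # stand-in for model-predicted probabilities of a poor outcome

roc_obj <- roc(outcome, prob_poor, levels = c("good", "poor"))
auc(roc_obj)
ci.auc(roc_obj, method = "bootstrap", boot.n = 2000, boot.stratified = TRUE)  # stratified bootstrap 95% CI
```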

Table 3 Area under the receiver operating characteristic curve (AUC) by each model in Institution 1
Table 4 Proportional confusion matrix information for all four models after fivefold cross-validation repeated 10 times

Both modeling approaches also weighted similar variables as important, as determined from the variable importance values produced by each approach. To predict good or poor outcome at 14 days, both the decision tree and random forest algorithms selected hematoma volume at admission, hematoma expansion, intraventricular hemorrhage (IVH), GCS scores, and age as the most important predictors. To predict good or poor outcome at 3 months, the algorithms selected GCS score, hematoma volume at admission, hematoma expansion, age, premorbid mRS scores, and IVH as the most important variables. There was remarkable consistency in these importance values across outcomes and modeling approaches, suggesting generalizability across time points and predictive approaches. These results are shown graphically in Fig. 2.
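Variable importance values of the kind plotted in Fig. 2 can be extracted as sketched below; the fitted objects and data are simulated stand-ins, not the study’s models.

```r
# Sketch of extracting variable importance from both model types (simulated data).
library(rpart)
library(randomForest)

set.seed(6)
dat <- data.frame(
  outcome = factor(sample(c("good", "poor"), 200, replace = TRUE)),
  volume  = rexp(200, 1/30),
  gcs     = sample(3:15, 200, replace = TRUE),
  age     = rnorm(200, 65, 12)
)

tree <- rpart(outcome ~ ., data = dat, method = "class",
              control = rpart.control(cp = 0))          # cp = 0 so the toy tree actually splits
rf   <- randomForest(outcome ~ ., data = dat, importance = TRUE)

tree$variable.importance   # impurity reduction attributable to each variable in the single tree
importance(rf)             # mean decrease in accuracy / Gini, summarized across constituent trees
varImpPlot(rf)             # quick graphical summary analogous to Fig. 2
```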

Fig. 2

Variable importance values for decision tree and random forest results predicting outcome at 14 days and 3 months

We then turned to establishing generalizability in an independent dataset. Upon completion of variable harmonization using the process described above, data from Institution 1 were treated as the training dataset and data from Institution 2 were utilized as the test dataset. In training, the analysis procedure mimicked the analyses described above for the full Institution 1 analyses; namely, repeated fivefold cross-validation was implemented to train both decision tree and random forest algorithms. Final models were evaluated on the test data from Institution 2. Predictions of discharge outcome in the test data resulted in an AUC of 0.69 (95% CI [0.63, 0.75]) for the decision tree and an AUC of 0.78 (95% CI [0.72, 0.84]) for the random forest model. Data on outcomes at 3 months were not available for the creation of comparison models. Similar to the models derived and validated at Institution 1, the most important variables were GCS scores, initial hematoma volume, and hematoma expansion; age was additionally important in the random forest analyses.
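A minimal sketch of this external validation step is shown below; both cohorts are simulated placeholders sharing a hypothetical set of harmonized predictors.

```r
# Sketch of training on Institution 1 (harmonized) and testing on Institution 2 (simulated data).
library(randomForest)
library(pROC)

set.seed(7)
make_cohort <- function(n) data.frame(
  disposition = factor(sample(c("good", "poor"), n, replace = TRUE)),
  gcs         = sample(3:15, n, replace = TRUE),
  volume      = rexp(n, 1/30),
  expansion   = factor(sample(c("yes", "no"), n, replace = TRUE)),
  age         = rnorm(n, 65, 12)
)
inst1 <- make_cohort(278)   # training data (Institution 1 analogue)
inst2 <- make_cohort(206)   # pure test set (Institution 2 analogue)

rf <- randomForest(disposition ~ ., data = inst1, importance = TRUE)

test_probs <- predict(rf, newdata = inst2, type = "prob")[, "poor"]
auc(roc(inst2$disposition, test_probs, levels = c("good", "poor")))   # external-test AUC
```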

Institution 2 did not have mRS data. In data from Institution 1, the mRS at 14 days was correlated with the mRS at 3 months (Spearman rho = 0.85, p < 0.001).

Discussion

We identified patients whose outcome could be either good or poor after ICH. Decision tree and random forest models consistently identified initial hematoma volume, hematoma expansion, GCS score, age, and intraventricular hemorrhage as the most important variables for the prediction of ICH outcomes at both 14 days and 3 months. Hematoma expansion was the only potentially modifiable predictor of patient outcome identified across modeling approaches and time periods. We identified a subset of patients with a GCS score of 13 or higher, hematoma volume at admission less than 44.5 mL, and a premorbid modified Rankin Scale score of less than 4 (see Fig. 1). These patients’ outcomes depended on hematoma expansion, and they therefore serve as a potential target for clinical intervention. In sum, across modeling approaches, datasets, and time periods, these data suggest that a subset of patients with ICH has an outcome that is potentially modifiable by reducing hematoma expansion.

Hematoma expansion predicts poor outcome after ICH. Most definitions of hematoma expansion have similar predictive value (e.g., an increase of 26% of the initial hematoma volume or an absolute increase of ≥ 3 mL) [29]. Clinical trials of blood pressure control [30], Factor VII [10, 11], and tranexamic acid [31] have sought to improve patient outcomes by reducing the likelihood of hematoma expansion, but have had mixed results in preventing hematoma expansion and improving the odds of good outcome. Our findings suggest that in each of the clinical trials conducted to date, many randomized patients may already have been highly likely to have a poor outcome at follow-up because of large initial hematoma volume or IVH, regardless of hematoma expansion. These results suggest that future research might further narrow eligibility for a clinical trial in order to select patients with an outcome that is potentially modifiable from poor to good. While we are not the first group to suggest restricting inclusion criteria for clinical trials in patients with ICH [13], we are suggesting an innovative method for selecting those criteria using a generalizable model derived from existing datasets.

Decision tree-based algorithms differ from regression models. Regression models estimate the marginal contribution of each independent variable (e.g., each additional year of patient age) to a dependent variable (e.g., good or poor outcome) and are intuitive for determining the overall effect of independent variables in a cohort. In patients with ICH, predictors of patient outcome considered in logistic regression models are unlikely to be truly independent (e.g., larger hematoma volume eventually impairs consciousness), even if interaction terms do not significantly improve a statistical model. We chose decision tree-based algorithms because they are intuitively more appropriate for subgrouping subjects, as our goal was to identify patients with a potentially modifiable outcome.

The methods described in this paper are intended to inform further mechanistic research to improve patient outcomes after ICH. Specifically, these data suggest which patients are most likely to transition from “poor” to “good” outcome, particularly for patients in whom hematoma expansion might be reduced. Additionally, these data may be helpful for prognostication after accounting for hematoma expansion, a potential improvement over scales that are measured at admission and do not account for hematoma expansion.

There are potential limitations to these data. The sample size may not be adequate to detect prognostic factors with smaller effects on functional outcome. Transitioning a patient from “poor” to “good” outcome requires a substantial effect size [12], and more sensitive patient outcomes might increase the power to detect an effect of treatment. The results of the analysis across two independent datasets somewhat allay concerns about model generalizability, as the model performed similarly even though it was arguably handicapped by the harmonization process, which reduced the number and specificity of the variables. The mRS is also insensitive to certain domains of function that are important from a patient’s perspective (e.g., cognitive function) and, as a result, to predictors of disability in those domains [32]. We used repeated fivefold cross-validation and found very strong performance of the decision tree and random forest models, as well as reasonable performance of a less-specific model on a separate dataset. Additional validation in external cohorts would be helpful; however, such efforts are costly, and external cohorts might record different independent variables that would require reconciliation before analysis. Our results should nevertheless be generalizable, given the efforts made to examine performance on data that played no role in the model construction process. The cohort used in this analysis included a small proportion of patients with ICH who were known to take anticoagulant medication (warfarin or a novel oral anticoagulant); however, the effect of anticoagulant medication is likely mediated through hematoma expansion, which is modifiable with specific treatment (e.g., prothrombin complex concentrate) [33].

In sum, we found that a subset of patients with ICH has a potentially modifiable outcome. Hematoma expansion is the most prominent modifiable determinant of patient outcome. Future research to develop or evaluate therapies for ICH might focus on this smaller subset of patients with a potentially modifiable outcome.