Background

Translating clinical research into patient care requires the ability to properly interpret clinical outcome scales. The modified Rankin Scale (mRS) and Barthel Index (BI) are designed for use in stroke patients to evaluate discharge disability [1], understand stroke prognosis [2], and assess stroke recovery [1, 3, 4]. With improvements in clinical practice leading to higher survival rates for intracerebral hemorrhage (ICH) patients [5], there is a growing emphasis on long-term recovery. Central to this understanding is how these clinical scales can capture recovery in survivors.

The mRS is a seven-point scale commonly used to assess global disability [1, 6], where 0 represents no symptoms nor disability and 6 represents death. Conversely, the BI assesses the extent of disability as a score from 0 to 100 across ten domains of activities of daily living [7]: bathing, bladder, bowels, dressing, feeding, grooming, mobility, stairs, toilet use, and transfer [3]. Among ischemic stroke patients, the mRS has been shown to better differentiate mild-to-moderate disability over 12 months [8]. Additionally, it has been suggested that ischemic stroke patients plateau in recovery on the BI between 2 and 5 months post-injury [9,10,11], suggesting maximal recovery within the first 6 months. However, this analysis of recovery has not been demonstrated in ICH survivors, whom exhibit much greater initial disability compared to ischemic stroke survivors [12], with studies in rehabilitation settings [12,13,14] suggesting differences in recovery profiles.

It has been proposed that 37.7% of ICH patients regain functional independence (i.e., BI ≥ 95) at 100 days post-injury [15]; however, to our knowledge the BI has not been used to evaluate survivors beyond this time-point in any prospective observational ICH study. Additionally, while the mRS is commonly used to assess outcome in ICH patients [16], there has been little focus on understanding the burden of disability at various time-points. As new interventions for ICH patients are being developed, choosing proper end points as well as clinically relevant assessments is important for well-designed trials. The current study was designed to prospectively evaluate the improvement in mRS and BI scores of ICH survivors across 1 year post-injury. The specific goal was to analyze the recovery profile of ICH survivors using both measures to understand how well these scales capture long-term recovery and how this may impact research practices. Our initial hypothesis was that both measures would show substantive improvement in scores between all follow-up time-points, suggesting a sustained recovery of ICH survivors up to 12 months post-injury.

Methods

This prospective observational study was conducted at a single medical center, Yale New Haven Hospital, between July 1, 2014, and October 5, 2016. Patients provided informed consent within 96 h of admission and evaluated at discharge, 3, 6, and 12 months using multiple neurological outcome measures. Inclusion criteria included patients greater than 18 years old, a computed tomography (CT) scan consistent with ICH, and a documented last known normal (LKN) time. The study subject’s medical decision maker was consented in cases in which the subject was intubated or incapacitated. Exclusion criteria included a clinical history of recent trauma or hemorrhagic conversion of an ischemic stroke.

Baseline Recordings

All demographic data were abstracted from Epic (Epic Systems Corporation, Verona, WI, USA) or obtained directly from study subjects. Intensive care unit hospitalization days, total hospitalization days, and likely etiology of ICH were all documented upon hospital discharge. If an etiology was not clearly established during the hospitalization (e.g., pending additional imaging or two potential diagnosis), the etiology was classified as “unknown.” All clinical information was stored in a REDCap™ database (Research Electronic Data Capture, Nashville, TN, USA). Two researchers (A.S. and A.L.) completed data abstraction, with an inter-rater reliability of 87.79% of the initial five subjects enrolled in the study.

Imaging Analysis

Baseline CT or CT angiography defined as the scan closest to LKN, was evaluated by two blinded independent raters (A.S. and A.L.) using SYNAPSE (Fujifilm, Tokyo, Japan). ICH location was classified as either: lobar, deep, cerebellum, brainstem, or unable to specify. All location classifications were validated by the official radiology report, and any discrepancies were adjudicated by the study P.I. (K.S.). ICH volume was determined using Analyze 11.0 (AnalyzeDirect, Overland Park, KS, USA) as previously described [17]. Volumes were determined by either of the independent raters (A.S. and A.L.), with a Pearson correlation between raters for the first ten measurements of r = 1.000, p < 0.001 for all measurements.

Outcomes Assessments

Baseline mRS and BI were collected at day 7 or discharge (±3 days). Follow-up assessments were conducted either in person or by phone at 3, 6, and 12 months (±2-week window). All evaluators completed online training regarding mRS administration and in-person staff training regarding BI administration. The mRS was collected via a semi-structured interview, and the BI was assessed using a pre-defined set of questions provided in the assessment form. Unclear cases were brought to group discussion among evaluators and adjudicated by the P.I. (K.S.).

Statistics

Initially, descriptive statistics were used to summarize demographic and baseline characteristics of the entire cohort. While mortality was included in the initial outcome characteristics, these data were subsequently excluded from any successive analyses in order to only focus on ICH survivors as stated in the overall goals. To directly compare differences between the mRS and BI, percentages of “good outcomes” were calculated using mRS < 3 and BI ≥ 85 as respective cutoffs. Cutoffs of good outcome were chosen based on their commonality within ICH research and clinical trials [16, 18, 19]. To specifically assess significant differences in the distribution of mRS scores (i.e., 0–5) between all time-points (i.e., 7 days, 3, 6, and 12 months), repeated measures nonparametric testing was conducted using the Friedman test. This test is analogous to a nonparametric version of an ANOVA, examining differences in the distribution of scores across multiple different time-points. If a significant difference was found in the distribution of scores across time, follow-up statistics were conducted using the Wilcoxon signed-rank Test. This specific test assesses for differences between individual time-points (i.e., 7 days to 3 months, 3–6 months). The same series of nonparametric tests were conducted to look for differences in the distribution of BI scores (i.e., 0–100) between time-points. Statistics were conducted using SPSS v.19 software (SPSS Inc., Chicago, IL, USA).

Ethics Approval

The study was approved by the Yale University School of Medicine IRB (HIC Approval #1405014045 and #1509016433).

Results

A total of 173 subjects were enrolled, with 83 subjects completing all follow-up time-points. We achieved follow-up rates of 88.5, 89.1, and 86.7% at 3, 6, and 12 months, respectively (electronic supplement Figure I). All demographics and descriptive statistics of the cohort are provided in Table 1. This cohort had 55.5% of participants over the age of 65, and 49.1% were male. The majority of subjects were Caucasian (69.4%), with hypertension (67.1%), hyperlipidemia (30.1%), and diabetes (19.7%) as the most common medical comorbidities. Additionally, 10.4% of subjects had a prior stroke, and 56.3% of these were previous ICHs. This resulted in 11.0% having a pre-admission mRS ≥ 2. At admission, the median NIHSS was 9 (±8.7), and the median GCS was 14 (±4.5).

Table 1 Baseline characteristics of the 173 subjects enrolled in the study

The average admission ICH volume was 26.1 ml (±31.9), with 24.9% of subjects with an additional intraventricular hemorrhage component. Additionally, 38.2% of patients exhibited a midline shift on admission CT, which was measured at 6.9 mm (±4.1) for this subset. The majority of ICHs were deep (41.0%), with almost an equivalent percentage of lobar ICHs (39.3%); a full breakdown of ICH locations for the cohort is included in electronic supplement Table I. During hospitalization, 19.7% required decompression and 23.7% were made comfort measures only (electronic supplement Table II). Hypertension (46.8%) was the leading identifiable cause of ICH; however, there were a large proportion of “undetermined” etiologies (27.8%).

Discharge mortality was 17.3%, which increased to 26.0, 28.3, and 30.6% at 3, 6, and 12 months, respectively. Outcomes for all surviving subjects are outlined in Table 2. By 3 months, the majority of survivors were home (64.2%). This proportion increased to 83.4% by 12 months. While the proportion of subjects with a good outcome on the mRS (mRS < 3) increased from 20.0% at 7 days to 62.0% at 12 months, these proportions were more pronounced for the BI (BI ≥ 85), 20.9% to 76.6% at 7 days and 12 months, respectively (Fig. 1).

Table 2 Outcome characteristics of the 173 subjects enrolled in the study
Fig. 1
figure 1

Proportion of patients observing a good outcome on the modified Rankin Scores (mRS), mRS < 3, and Barthel Index (BI), BI ≥ 85, from seven days to 12 months

Using repeated measures nonparametric testing, there was a significant difference between time-points with the mRS, χ 2(3) = 17.37, p = 0.001 (Fig. 2). Follow-up testing showed differences between 7 days and 3 (z = −5.28, p < 0.001), 6 (z = −5.48, p < 0.001), and 12 months (z = −3.93, p < 0.001). However, there were no differences between all follow-up time-points (i.e., 3–6, 3–12, 6–12 months). The BI showed significant differences between all time-points in nonparametric testing, χ 2(3) = 40.170, p < 0.001 (Fig. 3). There were also differences between most follow-up time-points, including 3 and 12 months (z = −2.49, p = 0.013) as well as 6 and 12 months (z = −2.23, p = 0.025). There was no significant difference between 3 and 6 months (z = −1.22, p = 0.222).

Fig. 2
figure 2

Improvement in distribution of mRS scores from 7 days to 12 months

Fig. 3
figure 3

Improvement in Barthel Index (BI) scores at 7 days, 3 months, 6 months, and 12 months. *Significant at p < 0.05. ***Significant at p < 0.001

Discussion

This study was designed to prospectively assess functional status of ICH survivors at repeated time-points to better characterize their recovery curves. Our results suggest significant differences in mRS scores for ICH subjects between discharge and follow-up time-points, however, not among the later time-points (e.g., 3–6, 6–12 months) (Fig. 2). Conversely, we were able to show differences on the BI between all short- and long-term time-points, except between 3 and 6 months (Fig. 3). Additionally, proportions of good outcome on either the mRS (mRS < 3) or BI (BI ≥ 85) showed more pronounced improvement via BI measurements compared to the mRS (Fig. 1). The majority of all ICH survivors were back home at 3 months, with 83.4% of living ICH subjects at home by the end of the study. Taken as a whole, these results suggest that survivors of the initial ICH injury will show improvement in overall outcomes over the course of 1 year post-injury.

The recovery profiles of our cohort suggest maximal recovery is observed on the mRS at 3 months post-ICH, but seen up to a year on the BI. Differences noted between the two measures likely stem from discrepancies in how survivors are evaluated with either measure. The mRS has a limited approach to assigning scores and lacks the specificity of the domains within the BI. For instance, the major discriminating factor within the mRS is the ability to walk, separating a score of 3 from 4. This excludes most other functional improvement from factoring into an evaluation. Conversely, the inclusion of specific activities of daily livings within the BI allows for evaluation of finer motor skills (i.e., brushing teeth, combing hair), which may take subjects longer to recover or adapt. While technically an ordinal scale, the BI is more consistent with an interval scale given the wide range of scores a subject could attain. This results in the ability to more easily discriminate outcome and achieve statistical significance. Our results suggest that the mRS is optimal for measuring short-term gross motor recovery for most ICH survivors, while the BI seems better suited to measure long-term finer motor recovery.

The utility of the BI in measuring long-term recovery in ICH patients seems to contradict research of ischemic stroke patients who plateau in recovery between 2 and 5 months [9,10,11]. It is likely that the higher initial morbidity of ICH patients [12] accounts for a delay in recovery, resulting in improved outcomes up to 1 year post-ICH. Ischemic stroke patients must compensate and undergo neuronal plasticity in the presence of dead cellular tissue. Conversely, hematoma formation in ICH may actually result in a lower burden of dead tissue. It has been suggested that neuronal plasticity only occurs once the hematoma resolves in ICH patients [20], which may take upward of months depending on volume. Consequently, ICH survivors may experience an overall good recovery that is simply delayed due to hematoma resolution.

There is currently no consensus on how these scales should be interpreted (i.e., ordinal scales or dichotomizations) [21]. Many mRS studies dichotomize outcomes [16, 22,23,24,25] into good vs. poor to improve the statistical power [26] and simplify analysis. Similarly, while some studies have used the BI with cutoffs [15, 27], others analyze the measure as a scale [28,29,30]. We chose not to dichotomize our statistical analysis, because of research suggesting that this process fails to capture smaller improvements in functional status [31] and limits applicability to other clinical trials [21]. By examining the distribution and shifts in distribution, this study is better able to evaluate finer improvements in functional status as evidenced by the improvements noted on the BI.

This study should be interpreted noting some limitations regarding its sample size. All subjects were enrolled at a single institution and reflect the clinical practices and outcomes of that institution. Additionally, effects of interventions (i.e., surgery) or baseline differences (i.e., previous stroke) were not considered in the analysis or interpretation of results. While these factors may affect overall outcomes, they would not account for the differences observed between time-points. Mortality in our cohort was approximately 30%; slightly lower than previous studies [32], but within the range of other larger cohort studies [33]. Additionally, our low initial discharge mortality may be attributable to the changes in clinical practice regarding aggressive care for ICH patients within 24 h as outlined in 2015 AHA/ASA ICH management guidelines. These were released during the course of this study [34]. This study did not include deceased subjects (mRS 6) in the analysis, which would skew overall outcomes at each of the time-points. Due to the fact that the focus of this study was on functional recovery of ICH survivors and not mortality, these results should only be applied to ICH survivors. Finally, since mRS and BI were assigned to patients at the same follow-up time-points, there could possibly be a bias when assigning scores together (i.e., rating higher on one scale can result in a favorability for higher rating on the other). This bias was minimized to an extent via semi-structured or standardized questions in evaluation. Conversely, the strengths of this study include its low loss to follow-up, with response rates over 85% for most time-points when controlling for mortality. While this study was unable to follow all its enrolled subjects to the 12-month time-point, the recovery profiles of the existing cohort already suggest a sustained recovery. We did not attempt any statistical method to impute missing data in order to avoid potential biases that may either strengthen or weaken overall conclusions.

Given the results of this observational study, further work is warranted regarding ICH recovery. Since our cohort showed improvement on the BI up to 1 year post-ICH, a longer-term study beyond 1 year would be required to evaluate the potential peak recovery in ICH survivors. These results should also prompt the inclusion of 12-month end points in ICH clinical trials, which commonly use 3- or 6-month primary end points. If the natural course of ICH injury requires a minimum at least 12 months to recover, these shorter-term end points will fail to evaluate peak efficacy of a potential intervention. Since most BI domains are scored up to 10 points, the noted improvement between 3 and 12 months (from 73.0 to 83.4; approximately 10-point difference) represents a clinically significant improvement. Finally, both of these measures focus on motor disability and do not examine quality of life, cognition, or emotional distress. Further work is still needed to evaluate non-motor-based recovery in this population.

Understanding when each of these neurological assessments should be used within the recovery process is important for both patient care and clinical research. This is the first study to our knowledge that tracks changes in both the mRS and BI in ICH survivors out to 12 months post-injury. Our results suggest that the mRS may be better suited for short-term recovery, while the BI could potentially be used up to 1 year post-injury. However, both measures are important for assessing functional status and specifically evaluate different aspects of recovery, gross versus fine motor recovery. The inclusion of both measures in clinical research seems warranted given the differences observed in ICH patients.