Introduction

Multiple sclerosis (MS) is a chronic inflammatory and neurodegenerative disease of the central nervous system (CNS) [1]. Eighty-five percent of patients are initially diagnosed with relapsing–remitting MS (RRMS) characterized by episodes of disability and recovery; after 20–25 years, about 90 % of untreated RRMS patients transition to secondary-progressive MS (SPMS) characterized by increasing disability without recovery [2, 3]. The gradual decline of both physical and neurological function associated with MS disease progression is attributed to brain atrophy caused by axonal and neuronal loss [4, 5]. Multiple sclerosis patients experience varying rates of brain volume loss (BVL) with whole brain atrophy ranging from 0.5 to 1.5 % per year among RRMS patients [6]. With the introduction of sequential volumetric magnetic resonance imaging (MRI) applied to MS patients, measurement of BVL over time and assessment of therapeutic effects on slowing brain atrophy have become possible.

In addition to brain atrophy, approximately 66 % of patients with MS experience a form of cognitive impairment, often resulting in a significant impact on daily activities [7]. The most common cognitive impairments experienced by MS patients are cognitive inefficiency and memory decline followed by verbal abilities and attention span. Studies have indicated that cognitive impairment is progressive and cognitive decline is associated with disease duration [810]. The MS Functional Composite (MSFC) score and its components, Paced Auditory Serial Addition Test (PASAT), Nine Hole Peg Test (9HPT), Timed 25 Foot Walk Test (T25FWT), and Symbol Digit Modalities Test (SDMT) are used to measure both cognitive and physical disability among MS patients. As the only cognitive component of MSFC, PASAT is the most frequently administered test for measuring information processing speed (IPS). Low test scores for PASAT, SDMT, and MSFC indicate worse performance. From these measures, cognitive impairment has been shown to be correlated with MRI measures, including both lesion and whole brain volume (BV) [11].

A number of studies examined the associations between BV and various patient outcomes. This study aimed to systematically catalogue the full set of published associations reported in 2003–2013, and to review published evidence of associations between BV and patient outcomes relating to cognitive impairment. In particular, this study evaluated the correlations between BV and selected patient outcomes measuring cognition in MS patients, including SDMT, PASAT, and MSFC, as well as the reported correlations between BV and Expanded Disability Status Scale (EDSS) and lesion volume (LV) and selected patient outcomes.

Materials and methods

Study selection criteria

We included longitudinal and cross-sectional studies that reported the relationship between whole BV measurements in both male and female MS patients and selected patient outcomes measuring cognition, including SDMT, PASAT, and MSFC scores.

Search strategy

We searched Medline, EMBASE, the Cochrane Central Register of Controlled Trials (CENTRAL), the NHS Economic Evaluation Database (NHS EED), and the Cumulative Index to Nursing and Allied Health Literature (CINAHL) for studies published in English between January 2003 and September 2013. We hand searched references of included reviews published in 2013 and reviewed the US clinical trial registry to identify additional studies that were not indexed in the electronic databases.

Study selection process and data extraction

Two independent reviewers applied the inclusion criteria and assessed the quality of the data collected using a standardized methodology. Each reviewer evaluated the data from the eligible studies and electronically entered the information into an Excel database developed specifically for the review with prepared fields. Disagreements between reviewers were resolved by consensus or by consultation with a third researcher, referring to the original sources.

We collected information on the study design, population, comparison or treatment groups, sample size, duration of follow-up, whole BV algorithm, average baseline characteristics of patients, MS type, MS disease duration, reported BV and associations. For BV measures, we extracted information for whole BV measures, including percent BV change (PBVC), brain parenchymal fraction (BPF), brain parenchymal volume (BPAV), normalized BV (NBV), relative BV (RBV), cerebral parenchymal volume (CPV), cerebral volume fraction (CVF), and whole BV (WBV). In addition, T1-hypointense LV (T1LV) and T2-hyperintense LV (T2LV) data were captured. We recorded physical function measured by EDSS and patient outcomes (i.e., PASAT, SDMT, and MSFC).

Data management and reporting

We used Endnote version X5 to store the bibliographic citations from the electronic search. For data entry and descriptive analyses, we used Microsoft Excel 2010. We followed the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) in reporting the systematic review [12].

Results

Search

The literature search identified 2837 publications to be screened based on the predetermined screening criteria (Fig. 1). Of the 599 publications eligible for full-text screening, 581 were excluded based on publication type (n = 13), study design (n = 28), population (n = 178), and outcomes (n = 362). A total of 18 studies meeting the study inclusion criteria were included in the qualitative synthesis [1330].

Fig. 1
figure 1

Study flow diagram on the relationship between brain volume loss and patient outcomes in multiple sclerosis patients

Overall study descriptions

The 18 studies included 13 cross-sectional studies and 5 longitudinal studies with study duration ranging from 12 to 96 months (Table 1). The study populations varied greatly across the 18 studies. Relapsing–remitting MS was the most common disease course with 12 (67 %) studies including patients with RRMS. In addition, nine (50 %) studies included patients with SPMS and five (28 %) studies included patients with primary-progressive MS (PPMS) (Table 2). Among the 17 studies reporting gender, 13 studies had over 50 % female patients. Among the 17 studies reporting age, the mean/median age of MS patients ranged from 34.2 to 55.7 years. In addition, the patient populations varied greatly with respect to disease duration; of the 17 studies reporting disease duration, the mean/median ranged from 1.4 to 19.9 years. The study with mean disease duration of 19.9 years was among a population of SPMS patients while the shortest disease duration (1.4 years) was among a population of RRMS patients (Table 1) [20, 23].

Table 1 Characteristics of included studies
Table 2 Average patient characteristics by patient group

The structural image evaluation using normalization of atrophy cross-sectional (SIENAx) algorithm was the most frequently used algorithm for cross-sectional studies [1315, 17, 19, 20]. Some studies reported SIENA as well as other algorithms such as Java Image, Statistical Parametric Mapping (SPM), MeVisLab Brain Volumetry, and in-house software programs to measure BV. BPF was the most frequently reported measure among the included studies (Table 2).

BV and SDMT

Six cross-sectional studies examined the association between BV and SDMT [1317, 30]. Four (67 %) studies reported unadjusted correlations (Table 3) and two (33 %) studies adjusted for age and premorbid conditions in linear regression models. The six studies varied based on patient characteristics with the mean/median age of patients ranging from 34.2 to 47.0 years. In addition, the studies varied by MS disease course; one (17 %) study included only RRMS patients, four (67 %) studies included both RRMS and SPMS patients, and one (17 %) study included RRMS, SPMS, and PPMS patients.

Table 3 Correlation between brain volume measures and SDMT scores

The four studies reporting correlations found positive correlations ranging from 0.40 to 0.54 (all p < 0.05) indicating that BVL is associated with lower SDMT scores (Table 3). One study using a linear regression model controlling for age and premorbid intelligence showed that in a sample of RRMS and SPMS patients, a partial correlation of 0.62 (p < 0.001) was found between BPF and SDMT scores [16]. Another study, in a sample of RRMS and SPMS patients, a partial correlation of 0.58 (p < 0.01) was reported for BV and SDMT scores after adjusting for age and years of education [14].

BV and PASAT

Fourteen studies, including 3 longitudinal and 11 cross-sectional studies, examined the associations of BV and PASAT [13, 1624, 26, 2830]. Patient characteristics varied across the studies with mean age ranging from 36.9 to 55.7 years. The disease composition of the included study populations varied by studies; five (36 %) studies included only RRMS patients, three (21 %) studies included only SPMS patients, two (14 %) studies included only PPMS patients, two (14 %) included both RRMS and SPMS patients, one (7 %) included RRMS, SPMS, and PPMS patients, and one (7 %) included patients with remitting MS.

Of these 14 studies, 12 (86 %) reported correlations (Table 4) [13, 1719, 2124, 26, 2830] (Table 4). Seven (58 %) studies reported statistically significant positive correlations ranging from 0.29 to 0.40 (all p < 0.05), demonstrating that BVL was associated with a decrease in PASAT scores [19, 2224, 26, 28, 30].

Table 4 Correlation between brain volume measures and PASAT scores

In addition, three studies demonstrated statistically significant associations between BV and PASAT scores in linear regression models. One study among SPMS patients demonstrated in a multivariable regression model controlling for age, gender, disease duration, and duration of SPMS that NBV was a significant predictor of MSFC (p < 0.0001) [20]. Another study using a linear regression model controlling for age and premorbid intelligence demonstrated that BPF predicted PASAT scores (R p = 0.34, p = 0.01) [16]. In addition to correlating BPF and PASAT, one study also analyzed the relationship while controlling for age, disease duration, and EDSS and found that BPF was a significant predictor of PASAT (R p = 0.60, p < 0.0001) [24].

BV and MSFC

Nine studies (five longitudinal and four cross-sectional) reported associations between MSFC and BV [1820, 22, 23, 25, 2729]. The mean age ranged from 36.9 to 55.7 years. The studies varied on disease characteristics: three (33 %) studies included only PPMS, two (22 %) studies included only RRMS, three (33 %) studies included only SPMS, and one (11 %) study included RRMS, SPMS, and PPMS patients. Of these nine studies, seven (78 %) reported the correlation of BV and MSFC and three examined the relationship in linear regression (Table 5).

Table 5 Correlation between brain volume measures and MSFC scores

Among the seven studies correlating BV and MSFC scores, six (86 %) studies found significant positive associations with correlations ranging from 0.31 to 0.52 (all p < 0.05). Linear regressions were performed in three studies to examine the associations between BV and MSFC. In a longitudinal study with only RRMS patients, the authors found that a decrease in MSFC score from baseline to year 2 was associated with a 0.57 % (95 % CI 0.06–1.08) decrease in BPF. The same study also found that a decrease in MSFC score from year 0 to 8 was associated with a 0.29 % (95 % CI 0.06–0.53) decrease in BPF [25]. In addition, one study among SPMS patients demonstrated in a multivariable regression adjusting for age, gender, and disease duration that NBV was a significant predictor of MSFC (p < 0.0001) [20]. Another study among SPMS patients also demonstrated that NBV was a significant predictor of MSFC (R p = 0.44, p = 0.001) [19].

LV and patient outcomes

Eight studies correlated LV with patient outcomes [13, 15, 1719, 22, 24, 30]. Total T2LV was correlated with SDMT in four studies with statistically significant negative correlations ranging from −0.36 to −0.55 (all p < 0.05), demonstrating that higher T2LV is associated with lower SDMT scores [13, 15, 17, 30]. In addition, two of the three studies reporting correlations between T2LV and MSFC found a statistically significant correlation (p < 0.05), demonstrating that an increase in T2LV is associated with a decrease in MSFC [18, 19, 22]. One cross-sectional study found a statistically significant negative correlation between T2LV and MSFC (r = −0.31, p < 0.01) [19]. One longitudinal study found a statistically significant positive correlation between change in T2LV and MSFC over 5 years (r = 0.31, p = 0.04), while a second longitudinal study found no significant correlation over 2 years (r = −0.11, p = 0.45) [18, 22]. T2LV was correlated with PASAT in six studies [13, 1719, 22, 30]. Of these studies, two cross-sectional studies found statistically significant correlations ranging from −0.37 to −0.42 (all p < 0.05) demonstrating that a higher T2LV is associated with lower PASAT [13, 30]. In addition, three studies correlated T2LV and EDSS [19, 22, 24]; one longitudinal study found a statistically significant negative correlation between change in T2LV over 5 years and EDSS at baseline (r = −0.35, p = 0.02), but no correlation between change in both T2LV and EDSS over 5 years [22]. One of the two cross-sectional studies found a significant positive correlation between T2LV and EDSS (p = 0.39, p = 0.03) [24].

Among the four studies correlating total T1LV with patient outcomes, the results were similar to those with T2LV. The two studies correlating T1LV and SDMT found statistically significant negative correlations ranging from −0.49 to −0.54 [13]. One study correlated T1LV with MSFC and found a significant correlation (r = −0.35, p < 0.01) [19, 30]. Three studies correlated T1LV and PASAT; two studies found significant negative correlations [17, 19, 30]. Of these two studies, one correlated T1LV and PASAT-3 (r = −0.41, p < 0.01) and the second correlated T1LV with both PASAT-3 (r = −0.31, p < 0.01) and PASAT-2 (r = −0.33, p < 0.01) [19, 30].

BV and LV

In addition to examining the relationship between BV and patient outcomes, seven studies reported the correlations between BV and LV [15, 16, 19, 20, 22, 24, 28]. Six studies reported correlations between BV and T2LV and four studies correlated BV and T1LV. Among the six studies examining the associations between BV and T2LV, five reported statistically significant correlations. The one longitudinal study demonstrated that change in BV over 5 years was associated with both change in T2LV over 5 years (r = 0.36, p = 0.017) and T2LV at baseline (r = 0.39, p = 0.009) [22]. Among the five cross-sectional studies, four studies reported significant correlations ranging from −0.29 to −0.53 (all p < 0.05) [16, 19, 20, 24, 28]. One study reported a statistically significant correlation between BPF and T2LV (r = −0.29, p = 0.05) [24].

Of the four studies examining the relationship between BV and T1LV, all four found a statistically significant correlation. One longitudinal study reported a statistically significant correlation between change in BV over 5 years and T1LV at baseline (r = 0.464, p = 0.001); however, the study found that change in T1LV over 5 years was not associated with change in BV over 5 years (r = 0.259, p = 0.696) or BV at baseline (r = 0.285, p = 0.60) [22]. Three cross-sectional studies reported correlations ranging from −0.29 to −0.52 [16, 19, 24].

BV and EDSS

Since EDSS is widely reported as a measure of disability in MS patients, further evaluation of correlations between BV and EDSS was made. Seven studies (three longitudinal and four cross-sectional) reported associations between BV and EDSS [19, 2124, 27, 29]. Mean age of patients ranged between 36.9 and 55.7 years. The studies differed by disease characteristics: four (57 %) studies included only RRMS patients, one (14 %) included only PPMS patients, one (14 %) included only SPMS patients, and one (14 %) included RRMS, SPMS, and PPMS patients.

Three cross-sectional studies demonstrated the association between BV and EDSS with correlations ranging from −0.39 to −0.27 (all p < 0.05). In addition to correlating BV and EDSS, one cross-sectional study also analyzed the association of BV and EDSS adjusting for age and disease duration, and found that EDSS was associated with BPF (r = −0.51, p = 0.005) [24].

Three longitudinal studies found that change in BV was associated with EDSS at different time points. One study found that the rate of reduction in BV over 5 years was significantly associated with EDSS at baseline (r = 0.43, p < 0.01); however, the same study also found that the rate of change in BV was not associated with change in EDSS (r = 0.12, p = 0.41) over the 5 year period [22]. Similarly, one study found that PBVC over 12 months was statistically significantly associated with EDSS at baseline (r = −0.51, p < 0.01), while another study found that PBVC over 2 years was statistically significantly associated with EDSS at the 2-year follow-up (r = −0.32, p < 0.01) [23, 27].

Discussion

Our systematic literature review documented the extent to which the relationship between BV and patient outcomes has been studied and qualitatively synthesized the findings. In particular, our study focused on patient outcomes measuring cognitive impairment, including SDMT, PASAT, and MSFC, to demonstrate that BVL is associated with a decline in cognitive function in MS patients. The findings for whole BV summarized in this review are in line with multiple studies demonstrating that regional BV is associated with cognitive outcomes. For instance, Batista et al. found that multiple regional BV measurements were all significantly correlated with PASAT and SDMT, after controlling for age [31]. Similarly, Amato et al. has published multiple studies demonstrating the statistically significant association between neocortical volume and cognitive outcomes [3235]. The findings of our review have implications for the use of BV in the monitoring and treatment of MS patients to prevent cognitive decline. Cognitive impairment is a key factor that may adversely affect a patient’s disease progression and quality of life [9, 36]. It is important to use measures of cognitive decline to identify cognitive impairment which may be evident early in the MS disease course.

In this study, we found that the relationship between BV and SDMT is more pronounced than BV and PASAT. Furthermore, the reported correlation estimates were consistent across the studies that reported SDMT. The strength of the correlations between BV and SDMT is supported by the literature suggesting that there is an underlying link between BV and cognitive impairment. The SDMT instrument is sensitive to cognitive impairment because it evaluates a number of domains, including visual scanning, visual attention and processing speed, and episodic memorization [37]. In addition, SDMT has been proven to be reliable when administered over multiple time points [37, 38]. Ease of use and test–retest validity of the instrument is important to the longitudinal study of cognitive decline in patients with MS.

Associations of BV and EDSS reported in the studies were examined because EDSS has been widely used and incorporates measures of physical disability based on individual functional system scores of CNS [39]. The results of our review suggest that cognitive outcomes such as PASAT and MSFC demonstrate a greater association with BV than EDSS. For example, Furby et al. found significant correlations between NBV and both PASAT-3 (r = 0.37, p < 0.01) and MSFC (r = 0.47, p < 0.01) but not between NBV and EDSS (r = −0.13, p = 0.17) [19].

We also reported the relationship between LV and patient outcomes to observe if there was a similar relationship when compared with the associations between BV and patient outcomes. T2LV has been used in the real-world setting as a measure of disease progression as well as a secondary endpoint in clinical trials [40]. It has been demonstrated that LV is associated with both physical and cognitive disability. In one review, the authors found that both BV and T2LV correlated well with information processing speed as measured by SDMT and PASAT and that there was a stronger relationship between T2LV and SDMT than T2LV and PASAT [11]. Another study analyzing the ratio of T1LV to T2LV and work productivity demonstrated an association between increased T1LV and lower chance of employment, which may be mediated through the cognitive domain [41]. However, studies also demonstrated that LV has a plateau relationship with disability over time [42]. In this study, we found that the correlations between BV and selected cognitive outcomes were greater when compared with those between LV and selected cognitive outcomes. The higher correlations found among studies correlating BV with cognitive outcomes are consistent with the literature demonstrating that BVL provides a more complete understanding of the full extent of tissue damage and its impact on patient outcomes [5]. Our systematic review further provides support for BV as a measure of disease activity and progression because of the strong correlations observed between BV and selected cognitive outcomes. In addition, a recent meta-analysis demonstrated that the size of the treatment effect of disease modifying therapies (DMTs) on brain atrophy is closely related to the size of the treatment effect on disability progression. In conjunction with the treatment effect on lesions, the two MRI measures could explain up to 75 % of the variance in disability progression [43]. The findings of these studies indicate that it is important to measure BV in patients in addition to LV to fully measure axonal loss and to monitor patient’s stability status. As a predictor of disability, BVL is an important endpoint to consider in the treatment of patients in both clinical trials as well as the real-world clinical setting.

The findings of this study highlight important characteristics of disability in MS that have implications for the on-going debate regarding the timing of treatment for patients with CIS and early stage clinically diagnosed MS. Patients with MS begin with a focal inflammatory disease and longer disease duration is associated with an accumulation of diffused brain inflammation, neuronal loss, cortical demyelination, and progressive axonal injury in white matter [44]. Studies have shown that BVL is present even among patients with CIS and early RRMS, and that overall cognitive performance begins to decline prior to the onset of clinical disease symptomology [9, 4446]. The US National MS Society recommends that early therapy should be given without interruption, particularly in patients at high risk for disease progression [2]. Furthermore, the early initiation of DMTs in clinically diagnosed MS has demonstrated improved effectiveness of treatment by reducing relapses as well as slowing disease progression in patients with RRMS [2]. Based on the demonstrated correlations between BV and patient outcomes and the evidence demonstrating that early treatment of MS reduces BVL and disease progression, literature suggests that MS patients may benefit from early treatment to prevent further disability. Withholding treatment from patients until the disease progresses may lead to reduced efficacy of DMTs and irreversible axonal and neuronal damage [47]. By treating early in the disease course and reducing BVL, the early initiation of effective DMTs may help to further improve patient outcomes and delay the onset of both cognitive and physical disabilities.

This study has a few notable limitations. We included studies published from January 2003 to September 2013 to adequately capture the use of MRIs in MS and focus on the technological advancement in imaging in the past decade. This restriction may limit the generalizability of the results to older studies published before 2003. In addition, we only included English studies which limited us from making inferences on studies published in other languages.

In our study, we were limited to reporting ranges of the correlation estimates instead of quantitatively synthesizing the relationships due to the high level of heterogeneity that exists across the studies. Cohen and Rudick reported that studies assessing cognitive impairment comprise different MS populations, variability of symptom onset, severity, and disease duration as well as instruments to reliably capture cognitive impairment [6]. One of the drivers of the variability is that there is no standardized method of measurement of BVL which complicates the evaluation of study results from various studies of BV and patient outcomes. Locatelli et al. correlated BPF and BPAV with PASAT and found that the correlation with BPF was significant, while the correlation with BPAV was not significant [24]. As demonstrated by Locatelli et al., the correlation of BV and patient outcomes varies greatly based on the BV measurement used; the normalized BPF consistently showed stronger correlations when compared to the non-normalized BPAV [24]. The variation in correlations between different BV measures and patient outcomes provides support for the establishment of a standard BV measurement to be used in studies examining BVL. BPF was the most frequently used measure reported in our study and studies have shown that BPF reduces variability between individuals and the high test–retest reproducibility improves the power to predict significant changes in a longitudinal setting [48, 49].

In addition, factors such as age and education (cognitive reserve) introduce considerable variability when examining the impact of BVL on cognitive function. Some patients can retain cognitive function while sustaining considerable disease burden due to higher levels of cognitive reserve [50]. Achiron et al. reported that the prevalence of cognitive impairment among MS patients ranged between 20 and 65 % and the wide range was due to the disease subtypes, disease duration, and level of disability of the MS population studied, as well as differences in the cognitive assessment scales, procedures, and tools used [9]. The studies included in our review varied greatly in both clinical and patient characteristics. Study populations included all disease courses and the mean age of patients varied by over 20 years across studies. As a result, we did not examine the relationship between BVL and cognitive impairment as a function of age because of the variability that existed in the studies. However, the studies reporting results that controlled for patient characteristics such as age and education demonstrated that BV was associated with cognitive decline.

Conclusion

Our study demonstrated that BVL is consistently associated with a decrease in SDMT score. Among the studies included in this review, lower BV was associated with lower SDMT, PASAT, and MSFC scores. Furthermore, correlations between BV and selected cognitive outcomes reported in the studies were higher when compared with those between LV and selected cognitive outcomes. This finding has implications for the use of BV in monitoring disease progression as well as early initiation of DMTs to minimize BVL and downstream effects of cognitive impairment. Further prospective studies examining the relationship between treatment, BVL, and cognitive functions may offer additional validations on the findings from this systematic literature review.