Introduction

Amyotrophic lateral sclerosis (ALS) is a progressive motor neuron disease showing signs of both upper motor neuron (UMN) and lower motor neuron (LMN) dysfunction, resulting in increasing muscle weakness in one or more body regions [1]. Due to different rates of degeneration and progression of loss of motor neurons in brain, brainstem, and spinal cord, clinical manifestations may vary widely between patients [2]. Adding to the heterogeneity in motor impairments, varying degrees of cognitive impairment and behavioral dysfunction are present in patients with ALS [3].

The heterogeneity of the ALS syndrome poses a challenge to multidisciplinary, neuro-palliative, rehabilitation management [4]. Because of the progressive course, the care needs form a broad and ever-changing spectrum. The timing of appropriate interventions is important and requires accurate prediction of the individual course of the disease. In clinical practice, individual prognostication with regard to disability progression and the timing of supportive interventions are largely based on the clinician’s cumulated experience. Despite attempts to construct prognostic algorithms for survival, prognostic tools for the functional course in ALS are lacking [5, 6]. Evidence on prognostic factors for disability progression may enhance clinical prediction and guide clinical decision making, including patient counseling and treatment selection [7, 8].

The aim of our study is to identify factors from the literature that have a predictive value for the course of the functional status of patients with ALS.

Methods

Literature selection

We searched the literature up to March 2014 to identify prognostic studies of the functional status in ALS. Two of the researchers (AB and HC) identified relevant articles by searching the computerized bibliographic databases EMBASE (1980–March 2014), CINAHL (1982–March 2014), PsycINFO (1995–March 2014), MEDLINE (1965–March 2013), and Web of Science (1988–March 2014).

A sensitive search strategy was built on the subjoined components: (1) patients with ALS (MESH term and text word), (2) prognostic studies (we used a revised version of the Yale University School of Medicine Prognosis and Natural History Filter), and (3) the functional status outcome—a self-compound filter of ALS-related functioning and disability assessment as measured with the ALSFRS/ALSFRS-R/ALSFRS-EX.

We decided to select the ALSFRS(-R) as outcome measure because it is a commonly used, validated, clinically meaningful, reliable, and easy to administer measure of functional status [912]. The full search strategy is available on request.

Two reviewers (HC and AB) independently screened the abstracts identified by the search strategy. Relevant publications, potentially eligible for inclusion, were read in full by the two reviewers and in case of disagreement on inclusion, subsequently discussed during a consensus meeting. One researcher (HC) screened the reference lists of each of the selected publications to retrieve relevant publications which had not been identified by the computerized search.

An eligible study aimed to identify prognostic factors (socio-demographic, disease-specific, psychosocial, and comorbid factors) on functional status assessed using (versions of) the ALSFRS in patients with ALS who were classified according to the El Escorial diagnostic criteria [1]. This had to be designed as a longitudinal cohort study, with at least one follow-up measurement. Both prospective and retrospective studies were included. Results were published in English as full length articles between January 1960 and March 2014.

Data extraction

One reviewer (HC) used a standardized form to record information and data regarding study design, sample size, study population, patients’ characteristics, treatment, outcome measure(-s), ALSFRS(-R) score(-s) at baseline, loss to follow-up, follow-up duration, prognostic factor(-s) studied and strength of association between prognostic factors and ALSFRS(-R)-outcome.

Methodological quality assessment

There is international consensus on how to appraise the quality of prognostic studies [13, 14]. The Quality in Prognosis Studies (QUIPS) tool is designed for systematic reviews of prognostic studies. We used an adapted QUIPS which contained five categories assessing potential sources of bias conducting prognostic studies—patient selection, study attrition, measurement of prognostic factors, outcome measurement, statistical analysis, and results presentation. We considered the original QUIPS items on confounding not relevant for our study, because included studies were designed to predict a specific outcome based on a combination of several potential prognostic factors. In advance, we determined the key characteristics specific for the source population, the baseline study sample, and the participants who completed the study and those who did not, to assess potential risks of bias in study participation and attrition.

Two reviewers (HC and AB) independently rated the methodological quality of the selected studies. Rating of adequacy of reporting in the included studies was carried out for each separate prompting item of the six categories using yes, partial, no, or unsure. Subsequently, potential bias for each of the six domains was rated: high quality when there was low risk of bias, moderate quality with moderate risk, and low quality when there was high risk of bias. The two reviewers discussed disagreement about whether a criterion was met, and resolved by consensus.

Data synthesis

Due to the expected heterogeneity of selected studies, we performed a qualitative best evidence synthesis. Most important elements of this synthesis were the prognostic factors, the strength of the association and the methodological quality of the studies (QUIPS). We applied the GRADE (Grading of Recommendations Assessment, Development, and Evaluation) approach to judge the quality of evidence across studies and to grade strength of recommendations in systematic reviews [1416]. Two reviewers (HC and AB) judged how the GRADE factors—phase of investigation, study limitations (QUIPS; subgroup analyses), inconsistency, indirectness, imprecision, publication bias, and moderate or large effect size—impacted the overall quality of evidence (BOX). We omitted the factor ‘exposure–response gradient’ as it was not relevant for the quality of evidence of the prognostic studies we selected. The level of evidence was rated as high, moderate, low, or very low according to the GRADE approach [17].

Results

Selection of studies

Thirteen publications with a total of 5,341 patients were included in this review (Fig. 1) [1931].

Fig. 1
figure 1

Flowchart study selection

Study characteristics

The studies were published between April 2006 and March 2014. The search yielded ten prospective cohort studies and three retrospective cohort studies. Four of the prospective studies were clinical trials. In one study, the data were generated from three clinical trials and one observational study [27]. The study characteristics are summarized in Table 1. The number of study patients varied from 31 to 2,452. Seventy-two percent of the total number of patients (n = 5,341) came from observational studies. Mean (±SD) age of the patients varied between 53.9 (12.5) and 63.4 (10.2) years. One study looked at two groups of patients whose presented median (range) ages were 61 (45–74) and 55 (35–73) years, respectively [29]. Bulbar type of onset varied from 16 to 40 %; this was not specified in two studies [19, 29]. The sample in one study consisted of patients from a population-based ALS register (PRO-ACT; https://nctu.partners.org/ProACT/) [19]. Duration of follow-up varied from 6 months to 5 years. Most studies did not report on co-interventions between baseline and follow-up (e.g., Riluzole use, Percutaneous Endoscopic Gastrostomy (PEG), non-invasive ventilation (NIV), multidisciplinary care). Mean (±SD) baseline ALSFRS(-R) score ranged from 27.0 (6.6) to 42.8 (5.8). Five studies included an inception cohort—the other studies used non-inception cohorts. Six studies presented the ALSFRS as outcome measure for disability accumulation. Three of these studies started patient recruitment prior to the revision of the ALSFRS in 1999 [10]. One study did not report the time period of inclusion [28].

Table 1 Characteristics of the included studies (n = 13)

Prognostic factors

Table 2 summarizes the candidate prognostic factors studied in relation to the outcome measures ALSFRS(-R) total score, and analyzed in more than one study (n = 14). A description of the candidate prognostic factors studied in relation to the outcome measures ALSFRS(-R) total score and ALSFRS-R bulbar domain score, and analyzed in single studies (n = 16), is provided as supplementary material (Appendix 1). The prognostic factors summarized involved socio-demographic characteristics (n = 11), clinical characteristics (n = 12), environmental factors (n = 1), psychosocial factors (n = 2), and biological markers of disease progression (n = 2). Ten studies used statistical analysis modeling for covariates to estimate the strength of the independent association between prognostic factor and outcome.

Table 2 Prognostic factors related to outcome measures

Methodological quality of included studies

The results of our risk of bias assessment using the QUIPS tool are summarized in the ‘risk of bias table’ (Appendix 2). The overall methodological quality of five studies was judged as ‘high’, five studies scored ‘moderate’, and three studies ‘low’ quality. In almost all studies, measurement of prognostic factors and ALSFRS(-R) outcomes were performed in a similar, valid, and reliable way for all participants. These quality domains were classified as ‘low’ risk of bias. Due to lack of reporting on key characteristics of the source population (‘study participation’) and of participants loss to follow-up (‘study attrition’), bias could not be ruled out. We were, therefore, compelled to classify studies as ‘moderate’ (n = 6) and ‘high’ risk (n = 5) of selection bias. The statistical analysis, model-building process or completeness of reporting were judged to be inadequate in six studies, resulting in ‘moderate’ to ‘high’ risk of bias. Some of the studies reviewed only presented results from univariate analysis on the prognostic factor(-s) studied, thus causing the data quality assessment to be downgraded [22, 26, 29, 31]. One study could be classified as ‘low’ risk of bias on all quality domains [23].

Synthesis of evidence

The GRADE qualitative synthesis of evidence for factors analyzed in two or more studies, resulted in a rating of ‘low’ quality of evidence for site of onset, age at onset, time from symptom onset to diagnosis, and ALSFRS-R score at diagnosis as independent prognostic factor for decline in ALSFRS(-R) total score (Table 3). Based on the GRADE approach, we concluded ‘high’ evidence for gender, ‘moderate’ evidence for weight and height, and ‘low’ evidence for years of education as factors that are not associated with a decline in ALSFRS(-R), and ‘very low’ evidence for age at diagnosis, initial rate of disease progression, forced vital capacity (FVC), body mass index (BMI), frontotemporal dementia (FTD), and comorbidities as prognostic factor for decline in ALSFRS(-R) total score. Age at onset and age at diagnosis could be considered as interrelated factors, representing the same prognostic variable. However, as the duration of the trajectory of diagnosis might differ within cohorts of patients or between health care systems, we present the prognostic value for each factor separately. We did not conduct sensitivity analyses because of the small number of studies for each prognostic factor.

Table 3 Adapted GRADE table for narrative systematic reviews of prognostic studies

The evidence according to the GRADE approach for the prognostic value of time from symptom onset to diagnosis, site of onset, ALSFRS-R score at diagnosis, and age at onset is summarized below.

Time from symptom onset to diagnosis

The ‘low’ quality of evidence according to the GRADE approach for the prognostic value of time from symptom onset to diagnosis was based on three studies. The independent association between longer time from symptom onset to diagnosis and a slower functional decline turned out to be significant in one study [30]. Another study presented a significant association in a univariate analysis [22]. The third study did not present the strength of the association and uncertainty estimation [29]. Imprecision and moderate to ‘high’ risk of bias within studies forced the quality of evidence to be downgraded to ‘low’.

Site of onset

Five studies investigated the prognostic value of site of onset for decline in ALSFRS(-R) total score. Prognostic value of bulbar onset versus limb onset was reported in four studies. Inconsistency due to non-significance in two of these four studies and imprecision of results, resulted in ‘low’ quality of evidence for greater decline in ALSFRS(-R) total score in bulbar onset ALS.

Baseline ALSFRS-R score

Within the four studies presenting baseline ALSFRS-R score as prognostic factor, we summarized the overall quality of evidence according to the GRADE approach as ‘low’ mainly due to inconsistent and imprecise findings. Two studies with ‘low’ risk of bias, showed inconsistent results. One study demonstrated a significant association between a higher baseline ALSFRS-R total score and a slower functional decline [25]. The multiple linear regression model in the other study, using the total ALSFRS-R slope as dependent variable, showed no statistically significant association [21]. We judged the other two studies to be ‘moderate’ quality or ‘low’ quality [19, 30].

Age at onset

Four studies investigated the prognostic value of age at onset for a decline in ALSFRS(-R) total score. Only one study presented a significant, independent association between older age at onset and greater decline in ALSFRS-R total score [25]. Inconsistency and imprecision in results, taking into account the small number of studies, resulted in an overall ‘low’ quality of evidence according to the GRADE approach.

Discussion

Our systematic review summarizes the evidence on prognostic factors for the rate of decline in functional status of (recently) diagnosed patients with ALS. The qualitative synthesis according to the GRADE approach resulted in ‘low’ evidence for age at onset, site of onset, time from symptom onset to diagnosis, and baseline ALSFRS(-R) score. We graded the evidence for age at diagnosis, initial rate of disease progression, FVC, FTD, BMI, and comorbidity as ‘very low’, mainly due to the heterogeneity, the limited number, and methodological quality of the selected studies.

In contrast to the limited number of studies reporting prognostic factors predictive for the patients’ functioning, a larger number of publications on prognostic factors for survival of patients with ALS are available. In their critical review, Chiò et al. [32] summarized the evidence on prognostic factors related to survival. Although several ALS outcome studies are based on register methodology (longitudinal cohorts, prospective population-based registers), the appraisal of the methodological quality of these studies was not conducted. In view of the strong correlation between rate of progression of ALSFRS-R and survival, certain prognostic factors for survival also have prognostic value for functional decline: age at onset, site of onset, diagnostic delay, FVC, BMI, and baseline ALSFRS(-R) [32]. This finding strengthens the relevance of the prognostic factors we found, contrary to our overall quality of evidence judgment of these factors based on the GRADE approach. Nevertheless, it seems reasonable to assume that other prognostic factors for survival will also be associated with functional decline, but these (e.g., El Escorial diagnostic categories, nutritional status, and cognitive functions) have either not yet been studied, or we found no evidence in the identified studies about their prognostic value for the functional course.

We selected the ALSFRS(-R) representing functional status in ALS as the most widely used outcome marker of disease progression in clinical practice and research. Recently, Franchignoni et al. (2013) [33] questioned the metric quality of the subjective ALSFRS-R and demonstrated that the scale as a single (total) score lacks unidimensionality. Treating an ordinal summed rating scale as an interval measure, assuming equal intervals, and subjecting such scale to parametric statistics (as regression analyses) will likely lead to invalid results [23]. This may well be one of the reasons for the low level of evidence we found for the prognostic factors, even for those that are often used by professionals in individual prognostication. Furthermore, most statistical models used in prognostic studies assume a linear decline in functioning during the entire disease trajectory; this assumption is not met for the total raw score of the ALSFRS(-R) in ALS [19, 25]. In contrast to the heterogeneity of the ALSFRS(-R) total score, the domains of the ALSFRS(-R) (bulbar, motor, and respiratory functions) appear unidimensional, and adequately represent the respective constructs [34]. It is surprising that only one study was identified that investigated prognostic factors for ALSFRS-R domains, and even this was limited to one domain (bulbar). It may well be that stronger associations can be found between potential prognostic factors and decline in bulbar, motor, or respiratory functions.

Most likely, ALS disease heterogeneity complicates the development of prognostic models. More knowledge about the genetic and molecular pathophysiological mechanisms is required to allow development of more accurate prediction models [35]. In this context, the identification of genetic subtypes in ALS might be of vital importance. The phenotypic heterogeneity in ALS is evident in view of the clinical manifestation of the region of symptom onset and disease course in ALS [22, 3739]. Although studies showed differing average rates of disease progression across ALS phenotypes, there was also a clear variation within these phenotypes [39, 40]. Within each subgroup, patients might have a relatively slow or fast disease course. Consequently, prediction of functional deterioration on the basis of the specific phenotype encompasses major uncertainty. Furthermore, there are no clear boundaries between phenotypes [41]. Future analysis of large numbers of patients within phenotypes might elucidate the within-phenotype variation and could identify phenotypes with consistent prognostic patterns [2].

Previous studies made use of prognostic categories for survival in ALS by applying prognostic models or staging systems [5, 42]. As far as we know, there are no studies about the validity and applicability of comparable models and systems for functional status. Further research is warranted to assess the validity of such models and systems and their clinical applicability in individual patient management [43].

This review has strengths and limitations. The robust search in five relevant databases, the use of the QUIPS tool for the appraisal of study quality and the GRADE framework for the quality of prognostic evidence are strengths of our study. It is possible that we have missed studies that are not indexed for these databases, but by checking references of included studies, we presume that no relevant articles were missed. Earlier studies reported negative consequences due to our selection criteria ‘published in English’. We conclude, based on the available English abstracts of the non-included, non-English articles, that it is unlikely that our results are biased because of language restrictions [44, 45].

As we selected studies with the ALSFRS (-R) as outcome measure for functional status, we may have missed prognostic studies with different outcome measures. One study suggested that the examination-based AALSS (Appel ALS Score) provides a better estimate of disease severity in ALS compared to the questionnaire-based ALSFRS [46]. Based on normalized scores of both scales, the ALSFRS underestimated disease severity defined by the AALSS; this increased with disease progression. Both scales, however, showed a significant change with disease progression. We do not expect that including other outcome measures for functional status in the present review would lead to higher levels of evidence for prognostic factors as the ALSFRS(-R) is the most frequently used clinical measure for disability progression in ALS.

The ability to predict the decline in functional status of the individual patient with ALS remains uncertain. This review has highlighted a gap in the current literature about prognostic value of clinical variables for functional decline. Comprehensive data collection—national and international standardized measurements of patient, disease, and intervention variables recorded in daily clinical practice and entered into large electronic databases—might enhance the possibility of identifying prognostic factors. Furthermore, it is likely that the genetic variation in ALS will be unraveled and genetic subtypes with less clinical heterogeneity will be identified in the years to come. This might enable the development of prediction tools stratified by ALS subtypes with better prognostic performance in individual patients.

Our results endorse the need for well-conducted prospective cohort studies on prognostic factors for the course of the functional status of patients with ALS. These studies should not only focus on socio-demographic and clinical prognostic factors, but also investigate the prognostic importance of psychosocial factors, cognition and comorbidity. Future prognostic studies preferably use an optimal design including selection of inception cohort, multivariate statistical models, registering reliable information about the (multidisciplinary) treatment given and all potential prognostic variables [47].

In conclusion, the current strength of evidence on prognostic factors for functional decline in ALS is insufficient to allow the development of a clinical prediction tool that can support the professionals’ judgement of the functional course. Methods for standardized assessment and registration of patients’ socio-demographic, clinical and psychosocial characteristics, symptom treatment, and outcome measures preferably may be agreed internationally; data sharing may enable more reliable evidence synthesis of prognostic factors for functional decline in ALS. With the future prospect of improved understanding of ALS genetics, more accurate prognostication may be achieved through the development of prediction tools stratified by genetically defined subtypes of ALS.