Background

There is increasing attention toward frailty as a clinically meaningful measure of geriatric health (1). Contemporary research has defined frailty’s clinical and physiological characteristics and highlights the vulnerability of frail, older adults to poor health outcomes (2). Accordingly, the number of publications on frailty has increased exponentially over the last few decades (3) as the determination of frailty status is emerging as a significant predictor of outcomes in other fields, including cardiology (4, 5), neurology (6, 7), oncology (8), orthopaedics (9), surgery (10), in addition to geriatrics in general. Consequently, the association between frailty and all-cause mortality has been investigated across different settings and populations.

While the concept of frailty is widely recognized, there is no single explicit criterion to define frailty. In 2013, a consensus statement by six major international scientific societies defined frailty as a medical syndrome with multiple causes and contributors that is characterized by diminished strength, endurance, and reduced physiologic function and increases an individual’s vulnerability for developing disability, dependency, or death (11, 12). In this definition, frailty is viewed as firstly, a clinical entity distinct from disability, sarcopenia, or multimorbidity; secondly, it affects a person’s physical or cognitive domains; and finally, it is considered as a dynamic state, which can improve or deteriorate over time (11). An intermediate or ‘prefrail’ stage has also been recognized (1315). Several frailty scales have been developed to characterize frailty in older adults, described in three broad categories. The first category includes focused physical scales, which most notably contain the Fried physical frailty phenotype from the Cardiovascular Health Study and adaptations derived from this original scale (13). It consists of five components: unintentional weight loss, muscle weakness, exhaustion or low energy level, slowness or slow gait, and low physical activity.

Persons are frail if three or more of the five criteria are met. The second category of frailty scales is a multidomain scale (16), which describes multidimensional characteristics of frailty containing more than one medical, physical, cognitive, or environmental factor. The third type of scale is a deficit accumulation frailty index (17). It consists of an inventory of various deficits covering multiple domains or body systems and the percentage of deficits calculated. These three types of scales capture different aspects of the frailty syndrome and, therefore, there may be differences in their association with health outcomes. Understanding these differences is important because it could inform how the various measures are best applied. All three categories of frailty assessment scales have some limitations. For example, the phenotype scale does not cover all frailty dimensions, such as cognition or affect (13). The multidomain scale and the deficit accumulation model-based scales are comprehensive but time-consuming. Previously, systematic collection of clinical information was not feasible in many settings, challenging the integration of this scale into regular healthcare practice (18). However, the growing popularity and implementation of electronic health records and automated frailty indexes are increasingly being developed in different countries, e.g., in the USA (19, 20), Australia (21) and various European countries (22). Furthermore, findings from the UK have shown that routine implementation of the electronic frailty index enabled the delivery of evidence-based interventions to improve outcomes in the older population (23).

Few systematic reviews have explored different frailty scales and determined whether frailty assessed by these scales is predictive of all-cause mortality. Furthermore, it remains unclear whether a particular frailty scale is a better predictor of mortality of community-dwelling older adults (13, 24). Therefore, the objective of this umbrella review is to qualitatively synthesize and evaluate the association between frailty determined by different frailty scales and all-cause mortality in community-dwelling older people.

Methods

A protocol was developed, and the review was conducted following the Preferred Reporting Items for Systematic reviews and meta-analyses (PRISMA) statement (25). The protocol was registered at the International Prospective Register of Systematic Reviews or PROSPERO (ID: CRD 42020155407).

Data sources and search strategy

The search strategy aimed to find published systematic reviews and meta-analyses that evaluated the association between frailty and all-cause mortality in community-dwelling older populations. Systematic and comprehensive searches were conducted in electronic databases: MEDLINE, Embase, PubMed, Cochrane Database of Systematic Reviews (CDSR), Joanna Briggs Institute Evidence-Based Practice (JBI EBP) Database, and Web of Science. The search was conducted in October 2019 and updated in July 2020. The search strategy and search terms are provided in Appendix I. Studies conducted on humans and articles published in English were considered eligible for this review, and duplicates were excluded. The searches were independently performed by two authors (ARMSE and CB). Any discrepancies were resolved by discussion.

Inclusion and exclusion criteria

We included systematic reviews and meta-analyses that have reported the association between frailty and mortality among community-dwelling older adults aged 65 years or above using any frailty scales, e.g., Fried physical frailty phenotype or modifications, deficit accumulation frailty index, and multidomain frailty index. We excluded systematic reviews and meta-analyses that included only hospitalized and institutionalized older adults or examined disease-specific outcomes (i.e., falls, fractures, heart failure, etc.) rather than mortality. However, we included two systematic reviews where few studies had participants less than 65 years of age as those were considered in their meta-analyses (26, 27).

Study selection and data extraction

Two reviewers (ARMSE and CB) independently searched titles. They screened abstracts before retrieving the full texts, assessed eligibility for the type of participants, study design, and outcomes. Data were extracted using a standardized form including author and year of publication, location, population characteristic, sample size, the proportion of female participants, age range, frailty scales used, number of deficits used to create the frailty scales, and follow-up period. We also noted the quality and bias assessment, effect sizes, and measure of variance, most commonly hazard ratios with 95% confidence intervals and heterogeneity assessments.

Methodological quality assessment

Manuscripts were assessed for methodological quality before inclusion in the review. The quality assessment of the included five systematic reviews were performed by ARMSE and CB. We used JBI Critical Appraisal Checklist for Systematic Reviews and Research Syntheses (28) (Table 1) and the online ‘A MeaSurement Tool to Assess systematic Reviews Version 2′ (AMSTAR 2) checklist (29).

Table 1 Critical appraisal checklist for systematic reviews and research syntheses

Data synthesis and analysis

The studies were combined using qualitative best evidence synthesis, as statistical pooling could not be done due to the high heterogeneity of the included studies’ meta-analyses. We extracted and reported the pooled effect sizes of the outcomes meta-analyzed within the reviews (Table 2).

Table 2 Summary characteristics of the included systematic reviews and meta-analyses

Results

Search results

A total of 969 records were identified from the six databases, and after removing the duplicates, 686 were screened for eligibility based on title and abstract. Twenty-three full-text articles were then reviewed for relevance, out of which 18 were excluded because aspects similar to, but not defined explicitly as, frailty were assessed, e.g., gait speed (30, 31), sarcopenia (32), various health indicators (33) or geriatric syndromes (34); outcomes other than mortality were examined, e.g., trauma (35), fractures (9, 36), falls (37), high blood pressure and cardiovascular outcomes (38) or heart failure (4); study population included were from clinical practice (39), nursing home (40) or critical care (41) but not from a community setting; the study involved interventions, e.g., treatment modalities (42); or the article was a systematic review protocol or an umbrella review which evaluated frailty scales for clinical outcomes from community, residential care and hospital settings (4345). This left five eligible systematic reviews and meta-analyses in the umbrella review (Figure 1). All five reviews were of moderate to high quality, as assessed by the JBI critical appraisal checklist (Table 1) and online AMSTAR-2 checklist. The reviews included 93 studies (some of which were included in multiple systematic reviews), and they assessed a range of outcomes. Of these studies, 77 examined the association between frailty and all-cause mortality over one to sixteen years of follow-up and were the focus of this review. Of the five systematic reviews, one review focused only on studies that used the frailty scale exclusively based on the Fried phenotype and its modifications (11 scales in a total of which four were original and seven modified) (46); one examined the FRAIL scale which is a questionnaire-based phenotype scale with five components, i.e., fatigue, resistance, ambulation, illness, and loss of weight (27); one review included studies assessing the deficit accumulation frailty index, with between 23 and 70 deficit items (26); one review included studies assessing either the Fried phenotype (7 studies) or the deficit accumulation frailty index (8 studies) (47); while the fifth review included 25 different scales of which five were Fried phenotype-based scales, 14 multidomain scales and six were deficit accumulation frailty index containing 23 to 83 deficits (16).

Figure 1
figure 1

PRISMA 2009 Flow Diagram(50): Frailty Status and All-Cause Mortality in Community-Dwelling Older Individuals: An Umbrella Review

Overall, the participants were predominantly over 65 years of age, with a minimum age for inclusion varying from 50 to 75. However, one study included participants with a minimum age of 15 years (48). The maximum age recorded in one study was 108 years (26). The participants included were community-dwelling individuals from Australia, Canada, China, Israel, Mexico, the United Kingdom (UK), the United States of America (USA), and multiple European countries. Female participants represented 42% to 74% of the sample in most studies (Table 2). Individual study’s frailty outcome was adjusted for a range of two to ten covariates (e.g., age, gender, education, smoking, alcohol intake, socioeconomic conditions) in their analysis.

Overall findings for the association between frailty and all-cause mortality

All five systematic reviews reported a significant association between frailty and an increased risk of mortality; however, the effect size between frailty and mortality varied across the included systematic reviews. For example, the meta-analysis that included 24 studies using three types of scales (i.e., Fried physical frailty phenotype, deficit accumulation frailty index, and multidomain frailty index) estimated an overall hazard ratio of 2.34 (95% CI:1.77, 3.09) between frailty and all-cause mortality (16). The estimated overall relative risk was 1.83 (95% CI: 1.68, 1.98). In their analysis, comparing the non-frail to frail groups, the risk associated with mortality varied depending on the frailty scales used. The Fried physical frailty phenotype was associated with a 2.6-fold increased risk of mortality (HR: 2.58; 95% CI: 1.83, 3.64; I2=89%, P <0.001); the multidomain frailty index with a 2.1-fold increased risk (HR: 2.13; 95% CI: 1.38, 3.29; I2=96%, P <0.001); and the deficit accumulation frailty index a 1.85-fold (HR:1.85; 95%CI: 1.30, 2.63; I2 = not available, P = not available) (16). Similar effect sizes were reported from the systematic review that included only the phenotype-based frailty index and found that frailty was associated with a two-fold increased risk of mortality than robust or non-frail persons (HR: 2.00; 95% CI: 1.73, 2.32) (46). Direct comparison of effect sizes from the other systematic review was not possible, given they considered the association between a one-unit increase in frailty score using the deficit accumulation frailty index and mortality (random effect model: HR:1.04; 95% CI: 1.03, 1.04; fixed effect model: HR: 1.28; 95% CI: 1.26, 1.31 per 0.1 increase in frailty index) (26). Only one systematic review included a questionnaire-based FRAIL scale to assess the relationship between frailty and mortality (27). From the eight studies included in this review, it was found that individuals classified as frail or prefrail, compared to non-frail individuals, had a 3.5-fold and 1.8-fold increased risk of mortality, respectively, over 2.4 years to 4.3 years of follow-up. The predictive value of mortality remained similar across definitions of frailty, ranging from 54% to 70% in the receiver operating characteristic curve areas using a questionnaire-based FRAIL scale (27) and remained around 70% if the Fried physical frailty phenotype or the deficit accumulation frailty index were used (47).

Gender differences

Three of the five reviews examined potential gender differences in the association between frailty and mortality and yielded some conflicting results (26, 46, 47). For example, one review using the Fried physical frailty phenotype and another utilizing the deficit accumulation frailty scale showed that older men with frailty had a higher risk of mortality than older women with frailty (26, 46). However, the third review (47) found mixed results depending on the individual study, with some reporting that men had an increased risk of mortality (4952). Still, others found that women had an increased risk (5153). One study reported a dose-response association between a more significant number of deficits and increased mortality in women across all age categories (51). However, this review (47) did not directly compare the risk between gender.

Age

Age did not appear to be an effect modifier of the relationship between frailty assessed using the Fried physical frailty phenotype or deficit accumulation index and mortality. Two of the five systematic reviews examined the association between frailty and mortality according to age groups (26, 46). The pooled estimates showed that the association between the deficit accumulation index and mortality did not vary between those aged below 65 years (HR:1.05; 95% CI: 1.03, 1.07) and those above 65 years (HR:1.04; 95% CI:1.03, 1.05) (26) per unit increase in a frailty index. Likewise, mortality risk was similar for those aged below 80 years (HR:1.62; 95% CI:1.39, 1.89) and above 80 years (HR:1.41; 95% CI:1.17, 1.70) estimated by the Fried physical frailty phenotype (46). Three other systematic reviews did not compare mortality based on age stratification (16, 27, 47).

Follow-up duration

The association between frailty and mortality varied according to follow-up duration. The risk of mortality was the lowest when the follow-up period was less than 12 months (HR:1.33; 95% CI:1.11, 1.60) and was the highest when the follow-up period was between two years to five years (HR: 3.25; 95% CI: 2.14, 4.94) (16). However, another review using the deficit accumulation frailty index found that the risk of mortality was higher when a shorter follow-up time was examined than a more extended follow-up, but effect sizes are not mentioned (26). Likewise, one systematic review compared a follow-up time of 4 years versus 11 years and observed that the strongest association between frailty and mortality was in the shorter follow-up group. However, individual values were not provided (47).

Discussion

This umbrella review synthesized evidence from five large systematic reviews (16, 26, 27, 46, 47) that examined major categories of frailty scales and the association of frailty identified by those scales their association with all-cause mortality in community-dwelling older individuals. A wideranging literature search identified five moderate to high-quality systematic reviews that included 93 primary studies comprising 434,115 participants from different countries. These primary studies used eighty different frailty scales, including Fried physical frailty phenotype, and various modifications of this scale, to multidomain scales. All the systematic reviews found that frailty is a predictor of mortality irrespective of the frailty scale used. These results will inform researchers and clinicians that frailty assessment is vital to predicting mortality.

Though all five systematic reviews reported a significant association between frailty and an increased risk of mortality, the effect size between frailty and mortality varied across the included systematic reviews. That means a person may be frail on one scale but not frail on another scale. Thus, the challenge remains which scale is to be used to predict frailty for researchers and clinicians. Only two of the five systematic reviews included in this umbrella review examined the predictability of frailty scales (27, 47). One review (47) compared the survival estimates based on age and adjusted relative risk using both the Fried physical frailty phenotype and the deficit accumulation frailty index. They found a 50% increased risk of mortality in frail older adults than non-frail older adults using the Fried phenotype. On the other hand, there was about a 15% increase in the risk of mortality per unit increase using the deficit accumulation index in frail older adults compared to those who were not frail (26). The variation in prediction values across the different frailty scales emphasizes the need for standardization across frailty scales for research purposes; however, clinically, it is essential that frailty be assessed and identified early such that appropriate preventive measures can be considered.

The included systematic reviews in this umbrella review examined gender differences (26, 46, 47), the role of age (26, 46) and follow-up duration (16, 26, 47) on frailty and mortality. Nevertheless, heterogeneity due to different population groups, diverse frailty scales and different follow-up periods made it challenging to draw definitive conclusions. However, age did not appear to be an effect modifier of the relationship between frailty assessed using the Fried phenotype or deficit accumulation index and mortality between those aged above or below 65 or those aged above or below 80 years. Gender differences were observed. The association between frailty and mortality also varied according to follow-up duration. These issues require further exploration in future longitudinal studies exploring and comparing different frailty scales’ ability to predict the development of frailty and mortality. Furthermore, most scales primarily focused on frailty’s physical and physiological aspects, although frailty’s social, cognitive and psychological elements are essential and merit future research.

Strengths and limitations

The current umbrella review has multiple strengths. The protocol was registered at PROSPERO, and the PRISMA guidelines were followed in completing this review. The review’s search strategy was robust and reproducible and utilized comprehensive search terms in multiple electronic databases. We evaluated five moderate to high-quality systematic reviews, which examined many participants from different parts of the world. Therefore, the generalizability of the results is high. We included systematic reviews that measured frailty using the commonly available scales, i.e., Fried physical frailty phenotype, multidomain frailty scale (including the questionnaire-based FRAIL scale), and deficit accumulation frailty index, meaning the findings will be relevant more broadly.

However, there are some limitations. This umbrella review did not include intervention studies, or systematic reviews of frail participants from hospitals or nursing homes were excluded. Thus, the findings apply to community-dwelling older individuals only. For researchers, this umbrella review shows that any category of frailty scale has utility for predicting mortality. Finally, this umbrella review focussed on the utility of frailty assessment to predict mortality though it could be considered that delaying mortality is not the only or best objective for the geriatric population. Improving the quality of life before death or extending life free of disability could be considered a critical outcome for assessing risk in frail old persons.

Conclusion

This umbrella review’s findings provide evidence that frailty is associated with mortality risk and highlight the importance of assessing frailty in primary community settings. This review has demonstrated that frailty is a significant predictor of all-cause mortality regardless of the specific frailty scale. For example, frailty assessed using five components that exclude cognition and affect Fried phenotype predicted mortality to a similar extent as did more comprehensive deficit accumulation frailty indices that included 83 items. As such, this implies that researchers and clinicians can use the most appropriate frailty scales given their circumstances, resources, and access to information. Together these findings emphasize that the assessment of frailty status itself may be more important than the choice of which type of scale is used. However, future longitudinal studies exploring the potential predictors for the development of frailty and its association with mortality using different frailty scales to determine the predictability would be beneficial.