Background

Health economic evaluation is nowadays a central component of health technology assessments for supporting reimbursement decisions throughout Europe. The health economic perspective is a key to health technology assessments, as healthcare expenditure is believed to increase while there is a trend towards budgetary constraints in healthcare [1]. The majority of European countries recommend cost-utility analyses in methodological guidelines as preferred analysis type for health economic evaluations [2].

In cost-utility analyses, health-related quality of life (HrQoL) is usually combined with a specific amount of time spent in a given health state into a single measure of effectiveness, namely quality-adjusted life years (QALYs). HrQoL is represented by utilities measured on a theoretical scale from 0 to 1, with 0 being defined as a health state corresponding to death and 1 being defined as full health [3]. One widely used generic instrument for estimating utilities is the EQ-5D that recently has been revised from a three-response levels version (EQ-5D-3L) to a five-response levels version (EQ-5D-5L) to increase sensitivity for detecting clinically important differences in HrQoL and to reduce potential ceiling effects [4]. Besides other questionnaire-based instruments for the derivation of HrQoL, the EQ-5D is the majorly recommended instrument throughout European countries [2].

The EQ-5D-5L consists of the five dimensions: mobility, self-care, usual activities, pain/discomfort and anxiety/depression. The dimensions are divided into the five ordinal levels ‘no problems’, ‘slight problems’, ‘moderate problems’, ‘severe problems’ and ‘extreme problems’ [4]. Based on these five dimensions and five levels, the EQ-5D-5L is able to describe 55 = 3125 different health states. To link the 3125 possible unique health states derived from the EQ-5D-5L with utilities, a set of preference valuations of the general population regarding different health states was derived and used to predict EQ-5D-5L index scores [5]. Recently, such a value set for EQ-5D-5L has been developed for Germany [6]. The German value set allows to assign EQ-5D-5L index scores to each of the health states ranging from − 0.661 (extreme problems in all 5 dimensions) to 1 (no problems in any dimension). As the German value set allows assigning negative EQ-5D-5L index scores, utilities that represent HrQoL worse than death are possible.

So far, certain studies are planned that intend to use the EQ-5D-5L for the economic analysis as measure of effectiveness (e.g., [7,8,9]). Yet, to be able to compare HrQoL of patients with a specific disorder or disease, e.g., derived from cost-utility analyses, with the HrQoL of the general population, normative values need to be derived and presented for different subgroups based on sociodemographic characteristics. Furthermore, German normative values of the EQ-5D-5L index are needed to adapt model-based economic evaluations to better reflect the German population and its characteristics and ultimately inform policy to improve health resources allocation [10,11,12].

To date, no normative values of the EQ-5D-5L based on societal preferences in a representative German sample have been published. Earlier publications focused on psychometric properties and normative values of the EQ-5D-5L in a representative German sample based on crude sum scores of the EQ-5D-5L response levels, or the visual analog scale of the EQ-5D (EQ-VAS) only [13, 14], or on normative values in representative samples of other countries, such as South Australia [15], Spain [11] or Poland [10]. Other publications focused on normative values of the EQ-5D-5L in samples of patients, such as asthma patients [16] or patients with chronic obstructive pulmonary disease [17].

Therefore, the primary aim of this study was to estimate normative values of the EQ-5D-5L for Germany based on a representative population sample and the recently developed societal preferences. A secondary aim of this study was to examine associations between HrQoL, represented by EQ-5D-5L utility scores and EQ-VAS scores, and selected sociodemographic factors.

Methods

Sample

The study is based on a cross-sectional phone survey conducted among 5005 participants of the German general adult population [18]. To select telephone numbers of households randomly and to ensure representativeness, they were drawn from an existing telephone sample comprising of registered and generated telephone numbers [19] proportional to the federal state residential structure, stratified by community size. Data collection took place from March to April 2014. Of the total 5005 participants, 4998 participants that had complete data for the EQ-5D-5L were included in this study. Observations were weighted according to age, gender, education and state of residence to increase representativeness of the results for the German general adult population. The German micro-census was used to generate a design weight that compensates underrepresented and overrepresented observations regarding the respective characteristics. Due to weighing, the total sample included in this study augmented to n = 5001. A more detailed description of the sample and the weighting can be found elsewhere [18].

Measures

The generic EQ-5D-5L was used to assess HrQoL [4]. The EQ-5D-5L consists of five questions (items) for self-classification of current problems in health by five ordinal levels. Furthermore, it consists of a visual analogue scale (EQ-VAS) that assesses subjective HrQoL on a scale ranging from 0 (worst imaginable health state) to 100 (best imaginable health state) [20].

Sociodemographic characteristics collected in the survey were age, gender, marital status, educational attainment, employment status, type of health insurance, area of residence and size of municipality. Age was categorized into the seven categories 18–24, 25–34, 35–44, 45–54, 55–64, 65–74 and ≥ 75 years. Furthermore, the number of medical conditions was calculated. For this purpose, participants were asked whether a doctor has ever diagnosed one of the following diseases: lung diseases, diabetes, other metabolic diseases, chronic pain, diseases of the digestive tract, cancer, cardiac and circulatory diseases, skin diseases, osteoporosis, mental disorders, joint diseases, injuries and intoxications, flu, urogenital diseases, neurological diseases, dental diseases or eye diseases. To maintain practicability of the survey, diagnoses were restricted to the most frequent (chronic) conditions or disease groups reported in common German surveys [18, 21,22,23].

Statistical analysis

Descriptive statistics were estimated for sociodemographic characteristics and number of medical conditions. Furthermore, descriptive statistics were estimated for item responses in EQ-5D-5L dimensions, the top 75% most frequently reported EQ-5D-5L health states and mean EQ-5D-5L index scores by sociodemographic characteristics and by gender. Correlation between EQ-5D-5L index score and the EQ-VAS score was evaluated with Spearman’s rank correlation coefficient (ρ). A weak correlation was defined by ρ < 0.30, a moderate correlation by 0.30 ≥ ρ < 0.50 and a strong correlation by ρ ≥ 0.50 [24].

To examine associations between EQ-5D-5L index scores, EQ-VAS scores and selected sociodemographic factors, a generalized linear model (GLM) with a Poisson family and log link function was used [25]. This model takes into account the skewed distribution and the heteroscedasticity of EQ-5D-5L index scores and EQ-VAS scores [26, 27]. As the Poisson family of the GLM is only defined for non-negative integers in the dependent variable, the EQ-5D-5L disutility score (1 − EQ-5D-5L index score) was used as dependent variable [15]. Following an earlier study that focused on normative values in representative samples of South Australia, the variables gender, categorized age and education attainment were entered into Model 1 [15]. In Model 2, the variables number of medical conditions, type of health insurance, employment status, and marital status were further added based on a backward stepwise multivariate analysis. From a model with a full set of potential explanatory variables, variables were manually considered for subtraction by a sequence of GLMs based on p values (deleted variables: smoking status, federal state and community size). For the Models 3 and 4, the EQ-VAS score was used as dependent variable, the regression model and the independent variables were chosen according to the Models 1 and 2, respectively.

All analyses were performed using Stata/SE 15.0 (StataCorp, TX, USA). All applied statistics were two-sided. In total, 32 tests for statistical significance of group differences in sociodemographic characteristics and mean EQ-5D-5L index scores, and one Spearman’s rank correlation were conducted. Therefore, all tests were assessed at the 0.0016 (0.05/32) alpha level, following Bonferroni’s correction for multiple significance tests to avoid a type I error [28].

Results

In the total weighted sample (n = 5001), 52% were female and the mean age was 51 years. Female participants were slightly older than males (52 vs. 49 years, p < 0.001). Of all participants, 49% were married or in a partnership, whereas 29% were never married. Furthermore, 39% of all participants reported a low level of qualification and 42% reported that they were not employed or retired. One or two medical conditions were reported by 29% and 17% of the participants, respectively, whereas three or more medical conditions were reported by 18% of the participants. Characteristics of the total sample are presented in Table 1.

Table 1 Characteristics of the sample

The most frequent item response for all EQ-5D-5L dimensions was ‘No problems’ in the total sample. Frequencies of item responses in each EQ-5D-5L dimension by age categories are presented in Table 5 (“Appendix 1”). The most frequently reported EQ-5D-5L health states were ‘11111’ (n = 1530; 31%), ‘11121’ (n = 715; 14%) and ‘21121’ (281; 6%). Those health states corresponded to a mean EQ-5D-5L index score of 1.00, 0.94 and 0.92 and were associated with a mean EQ-VAS score of 84.54, 78.10 and 72.98, respectively. In total, 371 individual health states of 3125 possible patterns (12%) were reported by the participants. The majority of the sample (75%) was represented by 23 out of all possible health states (Table 2).

Table 2 Most frequently reported EQ-5D-5L health states with index scores and EQ-VAS scores (weighted sample; n = 5001)

Both, the EQ-5D-5L index scores and the EQ-VAS scores were left-skewed with a clustering at 1.00 (n = 1530; 31%) and 80 (n = 944; 19%), respectively. Furthermore, the EQ-VAS scores clustered at 90 (n = 711; 14%), 70 (n = 579; 12%) and 50 (n = 552; 11%; Fig. 1). The mean EQ-5D-5L index score of the total sample was 0.88 (SD 0.18), corresponding to an overall mean EQ-VAS score of 71.59 (SD 21.36). The EQ-5D-5L index scores and the EQ-VAS scores were strongly positively correlated (ρ = 0.60, p < 0.001).

Fig. 1
figure 1

Scatterplot of EQ-5D-5L index scores and EQ-VAS scores (n = 4997; weighted sample), histograms of EQ-5D-5L index scores (n = 5001, weighted sample) and EQ-VAS scores (n = 4997; weighted sample)

Females had a statistically significantly lower mean EQ-5D-5L index score than men [0.86 (SD 0.16) vs. 0.90 (SD 0.20); p < 0.001]. There was no difference in mean EQ-VAS scores between genders. The mean EQ-5D-5L index scores were statistically significantly different for participants in different age categories (p < 0.001) and dropped from 0.94 (SD 0.08) for participants aged 18–24 to 0.80 (0.28) for participants aged 75 years and older. Mean EQ-5D-5L index scores differed statistically significantly in subsamples with different marital statuses, levels of education and employment statuses (all with p < 0.001). The mean EQ-5D-5L index scores by sociodemographic characteristics and by gender and the mean EQ-5D-5L index scores by age categories and by number of medical conditions are reported in Table 3 and in Table 6 (“Appendix 2”), respectively.

Table 3 Mean EQ-5D-5L index scores by sociodemographic characteristics by gender (weighted sample)

In Model 1, female gender was statistically significantly associated with a higher EQ-5D-5L disutility score (and, therefore, a lower EQ-5D-5L index score). The age categories from 35 to 44 years onwards were statistically significantly associated with increasingly higher EQ-5D-5L disutility scores (p < 0.001). Compared with low education, middle and high educations were statistically significantly associated with decreasingly lower EQ-5D-5L disutility scores (p < 0.001). By entering further variables in Model 2, the association between female gender and increased EQ-5D-5L disutility score became statistically insignificant. However, number of medical conditions was statistically significantly associated with increasingly higher EQ-5D-5L disutility scores (p < 0.001). Furthermore, the EQ-5D-5L disutility scores were statistically significantly lower in participants with private health insurance or private supplementary insurance(s) (p < 0.001). Not employed participants had statistically significantly higher EQ-5D-5L disutility scores than full-time employed participants (p < 0.001). In the Models 3 and 4, using the EQ-VAS score as dependent variable, the associations were in correspondence with the Models 1 and 2, respectively (Table 4).

Table 4 Generalized linear models of EQ-5D-5L disutility scores and EQ-VAS scores and selected sociodemographic characteristics (weighted sample)

Discussion

The aims of this study were to estimate normative values for the EQ-5D-5L for Germany and to examine associations between the EQ-5D-5L and selected sociodemographic factors. This was the first study that reported normative values based on a representative German population sample and the recently developed German value set [6].

The overall mean EQ-5D-5L index score of the representative German population sample was 0.88. This score was slightly worse than EQ-5D-5L index scores for representative population samples in South Australia (0.91) [15], Spain (0.90) [11] and Poland (0.89) [10]. The EQ-5D-5L index scores for both, the Spanish and Polish population were estimated by use of an interim value set, based on a mapping algorithm. According to Eurostat, in 2013, self-perceived health and overall life satisfaction were higher in Spain and lower in Poland when compared to Germany [29]. One earlier study calculated a crude overall mean EQ-5D-5L sum score of 91.5 based on equally weighted EQ-5D-5L health states with presumed equal widths of the response levels of a representative German population sample [13]. Another study reported an overall mean EQ-VAS score of 85.1, also based on a representative German population sample. Compared with the overall mean EQ-VAS score of the current study (71.6), the HrQoL in the other study seems to be considerably better [14]. However, the sample of the study by Hinz et al. [13] was younger (47 years vs. 51 years); yet, both samples were comparable with respect to gender, education and employment. The samples of those studies consisted of 2555 participants surveyed in 2011 [13] and 2040 participants surveyed in 2015 [14], respectively, and thus, not more than half the size compared to the current study.

In the sample of this study, almost one-third of the respondents (31%) reported an EQ-5D-5L health state with no problems across all dimensions. Such a ceiling effect of the EQ-5D-5L has already been reported elsewhere for German population samples. In two of those studies, even more than half to almost two-thirds of the respondents (60–64%) reported an EQ-5D-5L health state with no problems across all dimensions [13, 14]. According to an earlier study, a ceiling effect of 15% or higher of the population analyzed should be considered as serious [30]. Yet, based on the current representative German population sample, Konnopka and König [31] ascertained that the occurrence of a ceiling effect of the EQ-5D-5L is strongly related to morbidity. Accordingly, it was implied that EQ-5D-5L health states with no problems across all dimensions observed in population samples resulted from a correct measurement of good health rather than from a measurement limitation. For example, two-thirds of the German adult population did not report limitations in activities of daily living in the national health interview survey in 2014/2015 [32]. The EQ-VAS score of the respondents that reported an EQ-5D-5L health state with no problems across all dimensions in the current study was above 90.00 in more than half of the cases (mean EQ-VAS score 84.52).

Another 14% and 6% of the respondents reported having slight pain or discomfort only (health state ‘11121’) or both, having slight pain or discomfort and slight problems in walking about (health state ‘21121’), respectively. Thus, in more than half of the current population sample, HrQoL was explained by just three health states. In total, HrQoL was explained by 12% of all possible EQ-5D-5L health states. Two earlier studies were able to explain HrQoL using 50 and 63 different EQ-5D-3L health states in representative German population samples. In contrast, in the sample of this study, 370 different EQ-5D-5L health states have been used to explain HrQoL. Therefore, it is assumed that the discriminatory power of the EQ-5D-5L descriptive system has improved compared with the EQ-5D-3L [33, 34].

Female participants from the new German Federal States and Berlin had statistically significantly lower EQ-5D-5L index scores than females from old German Federal States (− 0.04; p < 0.001). Yet, female participants from the new German Federal States and Berlin were statistically significantly older (+ 6.44 years; p < 0.001) and had statistically significantly more medical conditions (+ 19.28%; p = 0.0016). Furthermore, female participants from new and old German Federal States differed statistically significantly in educational and employment status (data not shown). Notwithstanding, results of the German Federal Health Surveys show that middle-aged to old-aged females from old German Federal States stated good to very good subjective health more often compared with females from new German Federal States throughout all three surveys [35].

The multivariate regression analyses were able to reproduce findings from other studies regarding associations of the EQ-5D-5L index and sociodemographic factors [15, 36]. Older age, female gender and low level of educational status were independently associated with a lower EQ-5D-5L index score. The association between age and EQ-5D-5L index score diminished after additionally controlling for medical conditions.

Strengths and limitations

The first strength of this study was a large sample size and the weighting of the observations to ensure representativeness of the results. Second, this study was able to examine associations between HrQoL, selected sociodemographic factors and especially number of medical conditions by multivariate analyses. However, some limitations of the study have to be mentioned. First, there is a noteworthy chance of selection bias and recall bias due to the phone survey-approach. Despite weighting of the observations, there is some evidence for reduced representativeness of participants aged 65–74 years. Second, the enquiry of medical conditions was restricted to the most frequent, mainly chronic conditions or disease groups. It is possible that medical conditions not enquired in the survey used for this study introduced measurement error to the multivariate regression analyses and skewed the results. Yet, the association between HrQoL and the number of medical conditions in the multivariate analyses was statistically significant.

Conclusion

This study was able to provide normative values of the EQ-5D-5L based on societal preferences in a representative German sample. The normative values of the EQ-5D-5L can be used for model-based economic evaluations for the estimation of QALYs and for comparison of specific patient groups with the German general population. Further research is needed to investigate the association between HrQoL derived from the EQ-5D-5L and medical conditions using medical diagnoses of more chronic conditions and disease groups.