Introduction

The impact of informal care is taken into account in economic evaluation if a societal perspective is used, where the costs and benefits of an intervention include those that fall outside the health system. This can help avoid an inefficient allocation of resources as it takes into account the full societal effects of an intervention rather than just those falling on the health system. In England and Wales, the National Institute for Health and Care Excellence (NICE) reference case for health technology assessment does not recommend the routine inclusion of wider societal costs or benefits. The reference case takes the perspective that benefits should be measured using direct health effects using EQ-5D (although this can include carer health) and costs should only include those incurred by the National Health Service (NHS) and personal social services (PSS) [1]. However, there is an ongoing debate about the relevance of a societal perspective in health technology assessment as a means of capturing the full economic benefits of new medicines. In the United Kingdom, this interest was crystallized in the form of a policy proposal of value-based pricing for the pricing of new medicines that recognized the wider societal benefits and costs of interventions [2]. Notably, while there has been a tendency to equate wider societal benefits and costs with production losses, the value-based pricing proposals identified all non-health effects as potentially relevant, including informal care time. While the future of these proposals is uncertain, it is clear that wider societal benefits (and costs) are very much part of the health technology assessment research agenda and have the potential to change research conclusions markedly [3]. However, in order to take into account the costs of informal care for an intervention, informal care needs to be first measured and then valued using the appropriate costings for the type of care provided.

Informal care is care provided by friends, family, and volunteers to individuals who would have difficulty managing without this care. The informal care economy is substantial, with almost 6 million carers in the UK alone recorded in the 2001 census [4]. However, informal care is difficult to measure and value. In terms of measurement, the issue of joint production is important to recognize and correct for in order to avoid inflated estimates [5]. For valuation, it is important to recognize that informal care is a non-market or quasi-market of services supplied by carers who are often unpaid or who receive only nominal payments or state benefits that do not reflect the true cost or benefits of the care [6].

Typically informal care is measured using time spent caring, but it is difficult to distinguish informal care activities from usual household activities that would be undertaken regardless. A range of methods have been used to record time spent providing care, such as a time diary, recall [5, 7], experience sampling and direct observation, each with their own advantages and disadvantages. Time spent providing informal care can then be given a monetary value using a range of techniques: revealed preference using the opportunity cost, proxy good or wellbeing valuation method; or stated preference using discrete choice experiments or contingent valuation [79]. The monetary values that are produced by these methods can then be incorporated alongside other costs of an intervention.

The health effects of informal care that capture the effect on the carer of providing the care can also be valued using non-monetary valuation, for example by measuring their health-related quality of life using a generic preference-based measure such as EQ-5D [10] or by measuring their care-related quality of life [11] or carer experience [12]. These utility or quality of life values can then be included alongside other benefits of an intervention. Including the utility of the carer alongside the utility of the patient is straightforward if it is assumed that both should be given an equal weighting, but combining patient utility valued using EQ-5D and carer quality of life using a different measure is less straightforward.

The incorporation of costs of informal care into the overall cost of an intervention is more straightforward as they are all measured using an identical metric. However, it is important to ensure no double-counting, as the same impact from informal care cannot be measured using both costs and benefits. Willingness to accept valuations of carer time, for example, are likely to include an element of compensation for the reduction in health-related quality of life that is anticipated when caring duties are undertaken.

If informal care is to be included as a cost in economic evaluation, then the differences in informal care costs across interventions are required. However, many studies that are used to inform economic evaluation have not collected data that includes the direct measurement of informal care time across interventions. One solution is to use data that is typically collected on the health of the patient to predict the informal care required for each intervention. Given the use of EQ-5D to capture the direct health effects in economic evaluations submitted to NICE, there is interest in how informal care changes as the EQ-5D utility score of the patient changes. One exploratory study used OLS regression to estimate days of informal care used by EQ-5D utility score [13]. However, there were limitations with this study as it did not model the difference by health condition and did not use models that are most appropriate for the data.

This paper estimates informal care usage associated with the health of the patient, building upon the approach taken by Dixon et al. (2006) [13]. The analysis is based around the linkage between informal carer time and EQ-5D and International classification of diseases (ICD) chapter. Regressions are estimated that can be used to predict informal care using the EQ-5D and ICD chapter of the patient. This would enable prediction of the impact of health technologies on informal carer time via their impact on patient health. As such, the results could be applied to any cost-effectiveness analysis that uses the EQ-5D to produce its quality adjusted life years (QALYs). The informal carer time can then be given a monetary value using existing methods in the literature. These results can be applied to existing datasets to inform economic evaluation.

Methods

Data

The Health Outcomes Data Repository, HODaR, is a dataset generated from a prospective survey of inpatients and outpatients at Cardiff and Vale NHS Hospitals Trust, which is a large University hospital in South Wales, UK [14]. The survey is linked to existing routine hospital health data to provide a dataset with socio-demographic, health-related quality of life and ICD classification data. The survey includes subjects aged 18 years or older and excludes individuals who are known to have died. The survey also excludes people with a primary diagnosis on admission of a psychological illness or learning disability. This study focuses on the inpatient sample which has 96,282 eligible observations across 66,113 individuals discharged from hospital from April 2002 to January 2009, and of these there are 59,512 complete responses across 44,494 individuals for all variables used in the analysis and this is the sample used here.

Measures

Informal care

The informal care data is self-reported by the patient and records the number of days a friend or relative has needed to provide care or help with normal activities in the last 6 weeks. Raw observations had a maximum of 45 days; here the data has been censored at 42 days, the maximum number of days in 6 weeks.

Health and socio-demographics

Health of the patient is measured using EQ-5D, a preference-based measure with five dimensions (mobility, self-care, usual activities, pain/discomfort, anxiety/depression) each with three levels of severity [15]. The preference weights used here were elicited using time trade-off with a sample of 2997 members of the UK general population and modeled to produce a utility value for every health state [10]. ICD chapter dummy variables have a value of 1 where the ICD chapter is recorded in the first ICD classification of the patient during their hospital visit, see Table 1 for a description of each of the ICD chapters. An operation dummy variable has a value of 1 if the patient had an operation during their hospital visit, and a comorbidities dummy variable has a value of 1 if multiple ICD chapters were recorded during the hospital visit. Age of the patient is measured at the time of the survey.

Table 1 Summary of sample characteristics

Analysis of data

Descriptive analysis

Descriptive statistics of the sample are generated. The distribution of EQ-5D and days the patient needed informal care are examined.

Modeling the data for the econometric models

Regression analysis is used to estimate the relationship between the health of the patient, measured using EQ-5D and ICD chapter, and days of informal care required in the last 6 weeks. Individual patient-level data is used to make best use of the variability of responses in the available dataset.

Several properties of the data need to be considered before identifying the most appropriate method for modeling days of informal care in the last 6 weeks. The variable is a count data variable, the distribution is skewed with a very large spike at zero days and another spike at 42 days, and has some repeated observations per patient. The large spike in responses at zero days is usual in this type of data and is a reflection of the large number of people who do not require informal care when in less than full health. The spike at 42 days is, in part, a consequence of a decision to censor the distribution of observations, which has a maximum of 45 days. However, there was already a peak at 42 days prior to censoring the data at 42 days. There are also smaller spikes at multiples of 5 and 7 days usually due to individuals rounding off responses. To reduce the impact of these smaller peaks in the data on the analysis, models are often estimated in the literature grouping the dependent variable. In the analysis reported here these smaller spikes are ignored since grouping the dependent variable would not allow estimation of all parameters of interest. Although some respondents have multiple observations the majority do not. There are only 10,330 out of 44,494 respondents with multiple responses and these respondents have a median of two repeated observations.

Although linear regression models based on continuous variables are often applied to count data variables, this may result in biased, inconsistent, and inefficient estimates. A linear regression model ignores that the dependent variable is limited to zero and positive integers and it will only provide appropriate estimates in datasets with a high mean of the counts. Even then, it will only be able to provide an estimate of the average effect of a covariate on the conditional mean of the process [1618]. Alternative models that are appropriate for this type of data are estimated and compared before choosing the preferred model. For completeness some random effects models were estimated to address the issue that some respondents have multiple observations but given the small number of repeated observations robust standard errors can also be used to take into account clustering. The estimated models are Poisson; two-part model (using probit and truncated negative binomial); zero-inflated negative binomial and are described below (the random effects negative binomial model was also considered but was abandoned due to convergence difficulties). For the preferred model the exclusion of variables is explored for insignificant variables and variables that may not always be available to inform economic evaluation, e.g., the presence of comorbidities. Model fit was assessed using the distribution of the predictions of days in comparison to the observed distribution. Akaike information criterion (AIC) and Bayesian information criterion (BIC) were used to inform the selection of the model specification.

Models

This section describes the main models used in the analyses below. All of the models are capable of providing valid estimates when the dependent variable is characterized by count data. However, as each relies on different assumptions, the identification of the most appropriate model becomes an empirical issue that is also determined by the nature of the specific dataset used. The model is specified as:

$$y_{i} = {\mathbf{x}}_{i}^{'} {\varvec{\upbeta}} + \varepsilon_{i}$$
(1)

where the dependent variable, y i , represents days needed informal care for patient i; x i is a vector of explanatory variables including EQ-5D score, dummies for ICD chapter, comorbidities and operation as well as socio-demographic characteristics of age, age-squared, and female; \({\varvec{\upbeta}}\) is the corresponding vector of parameters to be estimated; and \(\varepsilon\) is the usual error term. Estimation is undertaken using Poisson, two-part model, zero-inflated negative binomial with constant inflation, and zero-inflated negative binomial with variable inflation.

The Poisson regression model assumes that the observed counts are drawn from a Poisson distribution with a mean \(\mu_{i}\) given by:

$$\mu_{i} = E\left( {y_{i} |{\mathbf{x}}_{i} } \right) = \exp \left( {{\mathbf{x}}_{i}^{'} {\varvec{\upbeta}}} \right)$$
(2)

The Poisson distribution rarely fits well in practice because of the well-known and restrictive equidispersion property, that is, the Poisson distribution variance equals its mean. In most applications, the variance usually exceeds the mean, a feature called overdispersion [1618]. The negative binomial overcomes this problem by introducing unobserved heterogeneity among observations in the form of an error term, \(\xi_{i}\), uncorrelated with the independent variables as follows:

$$\tilde{\mu }_{i} = \exp \left( {{\mathbf{x}}_{i}^{'} {\varvec{\upbeta}} + \xi_{i} } \right)$$
(3)

where \(\tilde{\mu }_{i}\) corresponds to Eq. (2) with unobserved heterogeneity included. Assuming that \(E\left[ {\exp \left( \xi \right)} \right] = 1\) the expected conditional mean of the negative binomial regression can be written as:

$$E\left[ {\tilde{\mu }_{i} } \right] = E\left( {y_{i} |{\mathbf{x}}_{i} } \right) = \exp \left( {{\mathbf{x}}_{i}^{'} {\varvec{\upbeta}}} \right) = \mu_{i}$$
(4)

It is clear from Eqs. (2) and (4) that both the Poisson and the negative binomial regression share the same mean structure. If the assumptions underlying the negative binomial regression are supported by the data the conditional expected rate of counts will be the same for both models but the standard errors of the Poisson regression model will be biased downwards.

Overdispersion can arise not only from unobserved heterogeneity but also from different processes generating the first and subsequent events, that is, more zeroes in the data than can be generated by the Poisson or even the negative binomial regression models. There are two leading modified count models that can be used to deal with this problem: the two-part model and the zero-inflated model.

The two-part or hurdle model relaxes the assumption that the zeroes and positive counts are generated by the same process. It combines a binary model (a probit or a logit) to predict the zeroes with a zero truncated Poisson or negative binomial regression to predict positive counts. In this model, zero is a hurdle to overcome before any positive counts can be attained and it is often interpreted as a two-stage decision-making process. Using a probit model for the binary outcome and a truncated negative binomial, the two-part model can be defined by the following two equations:

$$\Pr \left( {y_{i} = 0|{\mathbf{z}}_{i} } \right) = \varPhi \left( {{\mathbf{z}}_{i}^{'} {\varvec{\upgamma}}} \right)$$
(5a)

And

$$\Pr \left( {y_{i} |y_{i} > 0,{\mathbf{x}}_{i} } \right) = \frac{{\Pr \left( {y_{i} |{\mathbf{x}}_{i} } \right)}}{{1 - \left( {1 + \alpha e^{{{\mathbf{x}}_{i}^{'} {\varvec{\upbeta}}}} } \right)^{ - 1/\alpha } }}$$
(5b)

where \({\mathbf{z}}_{i}^{{}}\) is a vector of random variables which determine the probability of a zero in the data, \({\varvec{\upgamma}}\) is the corresponding parameter vector and \(\Pr \left( {y_{i} |{\mathbf{x}}_{i} } \right)\) is:

$$\Pr \left( {y_{i} = m|{\mathbf{x}}_{i} } \right) = \frac{{\varGamma \left( {y_{i} + \alpha^{ - 1} } \right)}}{{y_{i} !\varGamma \left( {\alpha^{ - 1} } \right)}}\left( {\frac{{\alpha^{ - 1} }}{{\alpha^{ - 1} + e^{{{\mathbf{x}}_{i}^{'} {\varvec{\upbeta}}}} }}} \right)^{{\alpha^{ - 1} }} \left( {\frac{{e^{{{\mathbf{x}}_{i}^{'} {\varvec{\upbeta}}}} }}{{\alpha^{ - 1} + e^{{{\mathbf{x}}_{i}^{'} {\varvec{\upbeta}}}} }}} \right)^{{y_{i} }}$$
(6)

where \(\varGamma \left( {} \right)\) is the gamma function and \(\alpha\) is the degree of dispersion. The vector \({\mathbf{z}}_{i}^{{}}\) could in principle be identical to \({\mathbf{x}}_{i}^{{}}\).

The zero-inflated negative binomial model allows the zeroes to be generated by two different processes. It supplements the negative binomial regression with a binary model to increase the zero count. It is often interpreted as heterogeneity in the population arising from two distinct unobserved groups. The outcome for one group (group 1) is zero with probability one. For the other group (group 2) the outcome might be zero but with a positive probability of a nonzero outcome. The zero-inflated negative binomial can be estimated using inflation of only the constant term or inflation of explanatory variables and the constant term. The zero-inflated negative binomial model can be defined by the negative binomial in Eq. (3) together with a logit model for the probability of group membership:

$$\Pr \left( {{\text{group 1}}|{\mathbf{z}}_{i} } \right) = \frac{{e^{{{\mathbf{z}}_{i}^{'} {\varvec{\upgamma}}}} }}{{1 + e^{{{\mathbf{z}}_{i}^{'} {\varvec{\upgamma}}}} }}$$
(7)

The predicted probabilities of the counts for the negative binomial model can be computed by substituting the estimated parameter values in Eq. (6). The expected counts for the model are found using the formula below:

$$E\left( {y_{i} |{\mathbf{x}}_{i} ,{\mathbf{z}}_{i}^{{}} } \right) = \left( {1 - \frac{{e^{{{\mathbf{z}}_{i}^{'} {\varvec{\upgamma}}}} }}{{1 + e^{{{\mathbf{z}}_{i}^{'} {\varvec{\upgamma}}}} }}} \right)e^{{{\mathbf{x}}_{i}^{'} {\varvec{\upbeta}}}}$$
(8)

STATA version 11 was used for all regression analysis and IBM SPSS version 19 was used for the descriptive statistical analysis.

Results

Descriptive statistics

Table 1 presents descriptive statistics of the sample. Mean EQ-5D score in the sample is 0.652 (SD = 0.321) and mean age is 59.00 (SD = 17.34). Figure 1 shows that the distribution of EQ-5D score is tri-modal and there are observations across the full range of utilities (1 to −0.594). The number of observations in each ICD chapter varies greatly, with larger proportions of patients in ICD chapter I (diseases of the circulatory system), K (diseases of the digestive system), and M (diseases of the musculoskeletal system and connective tissue), and few observations for A and B (infectious and parasitic diseases), F (mental and behavioral disorders), and Q (congenital malformations, deformations and chromosomal abnormalities). There are no observations for P (conditions originating in the perinatal period), and V, W, X and Y (external causes of morbidity or mortality).

Fig. 1
figure 1

Distribution of EQ-5D

Figure 2 presents the distribution of days the patient needed informal care, showing a huge spike at 0, smaller spikes at multiples of 5 and 7 and a large spike at 42 days (maximum number of days in the last 6 weeks).

Fig. 2
figure 2

Distribution of days needed care from a friend or relatives in the last 6 weeks

Table 2 presents the mean days the patient needed informal care by EQ-5D score. As expected the number of days increase as EQ-5D score decreases and health decreases, meaning that patients in more severe health needed more informal care. Patients with an EQ-5D score of 1 have a mean of 1.46 days (SD = 5.39) where they needed informal care. This may be expected given that EQ-5D measures health today whereas patients were asked to record their informal care need over the previous 6 weeks.

Table 2 Summary of days needed informal care in the last 6 weeks by EQ-5D score

Regression analysis

Table 3 presents regression analysis estimating informal care effects using days the patient needed informal care in the last 6 weeks. The sign and significance of the coefficients for the dummies for ICD chapters varies across the models. Coefficients have a consistent sign and are always significant for ICD chapters H, K, N, O, and R. EQ-5D score is negative and significant in all models as expected, meaning that as EQ-5D score increases the number of days the patient needed informal care decreases. The comorbidities dummy variable is positive and significant in all models, suggesting that patients with comorbidities need more days of informal care. The operation dummy is negative and significant in the Probit model of the two-part model, but is not significant in the other models. The coefficients for age/100 and (age/100)-squared are almost insignificant in the Poisson model but become highly significant when using alternative models. The signs of the coefficients of the age variables as well as the gender are in general consistent across models, with any remaining inconsistency explained by the model misspecifications described later in this Section. Table 4 presents the marginal effects for all models. There are substantial differences in the marginal effects across the different models. For example, changes in the primary ICD chapter H reduce the informal care needed by three and a half days in the Poisson model, the zero-inflated negative binomial with variable inflation predicts the reduction to be only two days. An increase in the EQ-5D score of 0.1, reduces the number of days of care needed by just under two days, less than a day and just over two days for the Poisson, zero-inflated negative binomial with constant inflation and the zero-inflated negative binomial with variable inflation, respectively. These differences are all statistically significant at 5 %.

Table 3 Regression output for days needed informal care in the last 6 weeks
Table 4 Marginal effects—days needed informal care in the last 6 weeks

Table 5 and Fig. 3 present predictions of days needed care for the estimated models. The generated predictions are the mean proportions of the population with a predicted given number of days, estimated using the mean probability across all individuals of the predicted probability for each given number of days. These predictions are presented to enable comparison of how well each model predicts the observed distribution of days needed informal care. The two-part model and the zero-inflated negative binomial models are the best at predicting. Overall the zero-inflated negative binomial with variable inflation and the two-part model most accurately predict the spikes at 0 days and 42 days but the two-part model seems to understate the frequency of the number of days between 1 and 7. The Poisson model is poor at estimating the large spike at 0 days, with the predictions instead forming a lower tail from 1 to 14 days, which does not reflect the observed distribution and there is no spike at 42 days. The Poisson model is appropriate when the mean and variance are similar, yet the distribution of days needed informal care suffers from overdispersion, where the variance is much larger than the mean. A test of the Poisson model versus the negative binomial model emphatically rejects the restrictions of the Poisson model at all standard significance levels. The Vuong test of the zero-inflated negative binomial against the standard negative binomial also rejects the standard negative binomial in favor of the zero-inflated versions at all standard significance levels.

Table 5 Predictions of days needed informal care in the last 6 weeks
Fig. 3
figure 3

Predictions of days needed informal care in the last 6 weeks (all models)

The specification of the model with the most accurate predictions, the zero-inflated negative binomial with variable inflation, is explored by excluding the comorbidity and operation variables and insignificant ICD chapters (Table TA1 in the Appendix). The comorbidity and operation variables are excluded, as these may not always be available in datasets used in economic evaluation, although it should be recognized that this lack of data may lead to biased estimates. An additional set of analysis is also undertaken by excluding statistically insignificant ICD chapters to obtain a more parsimonious model, although AIC and BIC indicate that the original specification of the model is preferred (Table TA1 in the Appendix).

Discussion

This paper has presented regression analyses that enable the estimation of informal care associated with the health of the patient. These analyses enable the prediction of days the patient needed informal care in the last 6 weeks using the health and sociodemographic characteristics of the patient. The linkage between EQ-5D and ICD chapter with informal care time enables the measurement of the impact of medicines on carer time via their impact on health. The estimate of days can then be converted into a cost value using available methods in the existing literature and used in economic evaluation of health interventions.

The approach is much simpler than the alternative approaches that have been used in evaluations to date, which have collected care data for the relevant patient-carer population [7]. It also removes the need to collect additional data on informal care to accompany the results of a randomized controlled trial for use in cost-effectiveness models. This method has the advantage that it can be used for all economic evaluations in submissions to agencies such as NICE where EQ-5D scores are available, providing consistency across all evaluations. The method can be used to evaluate both new treatments and displaced treatments. This is advantageous for agencies assessing new interventions where there is a need to consider displacement occurring in a health care system with limited resources. However, it must be recognized that direct measurement of informal care use will be more accurate than the informal care need predicted here, and for this reason the prediction approach presented here should be considered a second-best approach to direct measurement.

The distribution of days the patient needed informal care provided a variety of challenges for the regression analysis, with a very large spike at 0 days, spikes at multiples of 5 and 7 days and a large spike at 42 days. A variety of different regression models were estimated that are appropriate for the type and distribution of the data, and recommendations made on the preferred model. This can be used to inform future studies with similar data type and distribution. The Poisson model was poor at predicting the number of days, and taking into account repeated observations at the individual level using the random effects Poisson model did not improve these predictions. The two-part model was much better at predicting, but still underpredicted the very large spike at 0 days. The zero-inflated negative binomial models were consistently the best models at predicting the distribution of the number of days and were capable of describing the spikes at zero and 42 days. Not surprisingly, none of the models were able to accommodate the multiple spikes at multiples of 5 and 7 days.

The choice of preferred model between the zero-inflated negative binomial model with constant inflation and the zero-inflated negative binomial model with variable inflation needs to take into account their econometric performance. The zero-inflated negative binomial model with variable inflation was best overall at predicting both the spike at 42 days, which represents a large informal care cost as these individuals needed informal care every day in the last 6 weeks, and the very large spike at 0 days, which represents zero informal care. In contrast, the zero-inflated negative binomial model with constant inflation predicted the spike at 0 days but underpredicted the spike at 42 days. However, between around 8 days and 41 days all models overpredicted the number of days. This overprediction is possibly a consequence of the spike at 42 days which none of the models can fully capture, and as a result, this distorts the estimated distribution in the tail. Given that the spike appears to be partly an artefact of imprecise survey responses (as discussed below), this overprediction/distortion may represent a bias in estimated values in this range.

Model choice should also consider the decision-making context within which the predictions will be made. If a model fits well overall, but is poor at describing an important part of the distribution, then this should be taken into consideration. For this population, which represents patients recently undergoing treatment, accurate predictions of the observations at zero and 42 days is very important as these account for almost 70 % of the observations. Having considered both the fit of the models and the decision-making context within which the models will be applied, the zero-inflated negative binomial model with variable inflation was considered to be the most relevant. By way of example, the estimated equation needed to calculate the number of days of informal care based on model (4) and a 60-year-old female patient within ICD Chapter B, with no comorbidities or recent operation is given in the Appendix.

The relationship between EQ-5D and informal care was consistent and significant, where lower EQ-5D score meant higher informal care. Patients with comorbidities also consistently and significantly had higher informal care. The relationship between whether the patient had an operation and informal care was not clear. The sign and significance of many of the dummies for ICD chapter varied across models, but were consistent and significant for 5 ICD chapters. Exclusion of comorbidity and operation variables and insignificant ICD chapters from the full specification produced a deterioration in model performance (as indicated using AIC and BIC statistics), but had little impact on days predicted. The sign and significance of age, age-squared and a female dummy variable varied.

The analyses built upon the approach taken by Dixon et al. (2006) [13] using the same dataset but with additional observations (additional years of data) and regression models that are more appropriate for the data. Different model specifications are estimated here as Dixon et al. focused upon the use of dummy variables for the severity levels of each EQ-5D dimension rather than the utility score and did not incorporate ICD chapter or other health variables into their models. Dixon et al. found that using EQ-5D severity levels as independent variables, rather than the EQ-5D score, produced better models. However, such models were not explored here, as their applicability is limited because analysts rarely have access to patient level data. The results of these alternative formulations are consistent, with improvements in the health of the patient meaning reduced informal care.

Several other studies have identified relationships between informal care and patient health-related quality of life. Brouwer and colleagues [19] undertook a simple linear regression of patient health-related quality of life and caregiver employment status on amount of informal care, relating to a sample of one hundred and fifty-three patients with rheumatoid arthritis. Both explanatory variables were negatively related to amount for informal care provided.

Wimo et al. [20] undertook more complex linear regressions to explain the amount and type of formal care for ninety-two patients with dementia. In addition to patient characteristics, carer characteristics and amount of formal care received were also used as explanatory variables, although they were not consistently significant across all types of informal care. Neubauer and colleagues [21] extended this preceding research further by looking at the amount of informal care provided across multiple caregivers. This showed different relationships between patient health-related quality of life and amount of informal care for the primary caregiver and all caregivers.

While these studies have advantages over our study in terms of data quality, and have also extended the analysis to explore other important factors, they lack the breadth of disease coverage that we have been able to provide. Additionally, the ability of other researchers to use the results of these studies is limited by the need for data on all the explanatory variables. Our work is based primarily on sociodemographic variables that are commonly available in all clinical studies.

Krol et al. [22] adopt a different approach to predicting predict productivity loss (in terms of both absenteeism and presenteeism) using patient quality of life. In their study, a sample of the Dutch general population was asked to imagine the impact on their productivity associated with hypothetical EQ-5D states. This has the advantage that the EQ-5D health states were selected using an orthogonal design, meaning that the health states reflected a range of severity and there were a large and almost identical number of observations for each health state, meaning that the data was not dependent upon the incidence of observed health states in the sample. However, the use of stated preference where respondents imagine the impact on their productivity is likely to be less accurate than observational data where respondents report the actual impact on their productivity with their current EQ-5D health state. The advantage of the data used in this study is that there is a direct linkage between EQ-5D and informal care rather than relying upon hypothetical informal care values.

Limitations

One limitation of the analyses undertaken here is that they will not be appropriate for predicting informal care when either the EQ-5D is inappropriate for the patient population, such as for visual impairment in macular degeneration problems where the EQ-5D has been found to perform poorly [23]. The analyses may also be less reliable for the ICD chapters where there are fewer observations as the relationship is based on a smaller sample.

A further limitation of this study is that the dependent variable is generated from a question asking patients to recall how many days a friend or relative has needed to provide care or help them with normal activities in the last six weeks. It is well documented that questions of recall may be prone to bias and potentially systematic error. This may be exacerbated by the 6-week recall period as recalling care or help needed over the last 6 weeks is a difficult task. Potential problems with the data are illustrated by the fact that raw observations had a maximum of 45 days and spikes at multiples of 5 and 7 days, suggesting that respondents were approximating the number of days using weeks or months (for example, 6 weeks may be converted into one and a half months, which is approximately 30 + 15 = 45 days). Whereas the spikes at multiples of 5 and 7 days were not altered as it may reflect a real propensity for carers to plan their input in terms of whole weeks, responses of greater than 42 days were censored at 42 days. Alternative approaches are possible, for example, excluding or adjusting responses of over 42 days instead of censoring. While the recall period of the informal care question in HODaR may be an issue, the framing of the question in terms of ‘friends and relatives’ overcomes one problem with previous studies which have often focused on the primary caregiver. Neubauer and colleagues [18] have shown that estimating the time of the primary care giver can produce a substantial underestimate of the costs of informal care.

Another issue relates to the underlying patient population within HODaR. The vast majority of patients were recently discharged from hospital, and as such, are not in a stable health state. In addition, the EQ-5D captures the health of the patient at a single point at the end of the 6-week recall period. Consequently, a patient with a low EQ-5D on discharge (with informal care needs) could recover to full health (with no informal care needs) by the time they complete the EQ-5D at 6 weeks. This could lead to a systematic overestimation of the informal care needs for relatively mild health states.

A further limitation is that the dataset excludes people with a primary diagnosis on admission of a psychological illness or learning disability and therefore is not recommended for use in health interventions for these patients. Furthermore, the sample may suffer from response bias as only respondents with complete data on the variables used in the models were analyzed.

The most straightforward and inexpensive method of providing a monetary valuation of the days of informal care is to use existing values in the literature for specified tasks or hours. However, the dependent variable creates challenges for the monetary valuation of the number of days of informal care as the HODaR dataset does not include data on the number of hours in each day that the patient required informal care or the tasks that were undertaken during that time. We therefore recommend that future surveys of this nature should also collect data on the number of hours and tasks undertaken. Without this information the accuracy and precision of the monetary valuation of informal care will be compromised.

Our approach also assumes that health is exogenous to the amount of informal care received by an individual. If health is also determined by informal care our estimates will suffer from simultaneity bias. Unfortunately, the dataset does not provide appropriate instrumental variables with which to undertake two-stage least squares estimation. However, we consider that the amount of informal care would be expected to have a small effect on EQ-5D scores, given the descriptive system that underpins is heavily based around functioning. Impacts on mental health and wellbeing would be more likely, and only the former is included (crudely) within the EQ-5D.

The issues of measuring joint production and identifying care activities as opposed to incidental household activities were identified earlier. The dataset used here does not distinguish between the different types of input and output associated with the quantities of time recorded. However, the primary purpose for the analyses conducted here is for use in cost-effectiveness analyses, which are based on incremental differences between interventions. As such, any bias introduced by these issues should cancel each other out, as long as the nature of the interventions does not fundamentally change the level of joint production or mix of care/household activities across the interventions. Finally, the predictions of the models have not been validated in a separate dataset, and their predictive accuracy has not been tested in the context of incremental differences in informal care usage across different interventions.

This paper has shown that patients with lower EQ-5D scores have higher informal care needs than patients with higher EQ-5D scores. The preferred regression model estimated here can be used to predict changes in informal care associated with changes in the health of the patient measured using EQ-5D. This information can be used to inform economic evaluation of health interventions. The main limitations of this study relate to the underlying dataset, in particular, exclusions of some patient groups from the patient population and the imprecision of measurement. If a dataset can be generated that overcomes these issues, arguably the analysis presented here has shown that a robust relationship between patient EQ-5D utility scores and informal care time can be estimated. Such an approach has many advantages over the direct measurement of informal care in each individual patient population. However, the degree of imprecision introduced by estimation rather than observation is currently unknown and therefore this method is a second-best alternative to direct measurement. It is recommended that future research should specify and generate a more appropriate dataset for re-estimating the relationship between EQ-5D utility scores and amount of informal care. This relationship should then be used in studies that have collected informal care time data, so that the differences between the two methods can be calculated.