Abstract
Purpose
Using transformations of existing quality-of-life data to estimate utilities has the potential to efficiently provide investigators with utility information. We used within-method and across-method comparisons and estimated disutilities associated with increasing chronic kidney disease (CKD) severity.
Methods
In an observational cohort of veterans with diabetes (DM) and pre-existing SF-36/SF-12 responses, we used six transformation methods (SF-12 to EQ-5D, SF-36 to HUI2, SF-12 to SF-6D, SF-36 to SF-6D, SF-36 to SF-6D (Bayesian method), and SF-12 to VR-6D) to estimate unadjusted utilities. CKD severity was staged using glomerular filtration rate estimated from serum creatinines, with the modification of diet in renal disease formula. We then used multivariate regression to estimate disutilities specifically associated with CKD severity stage.
Results
Of 67,963 patients, 22,273 patients had recent-onset DM and 45,690 patients had prevalent DM. For the recent-onset group, the adjusted disutility associated with CKD derived from the six transformation methods ranged from 0.0029 to 0.0045 for stage 2; −0.004 to −0.0009 for early stage 3; −0.017 to −0.010 for late stage 3; −0.023 to −0.012 for stage 4; −0.078 to −0.033 for stage 5; and −0.012 to −0.001 for ESRD/dialysis.
Conclusion
Disutility did not increase monotonically as CKD severity increased. Differences in disutilities estimated using the six different methods were found. Both findings have implications for using such estimates in economic analyses.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Health economic analyses of new medical interventions require preference-weighted measures of quality of life, utilities, to estimate quality-adjusted life years (QALYs) [1]. However, estimating utilities can be resource-intensive. Using transformations of already existing quality-of-life data that are often descriptive and non-preference-based has the potential to efficiently provide investigators with utility information, based on large samples of responses. Such methods have been proposed by multiple investigators [2–8]. However, direct comparisons of methods remain sparse in the literature [9–15] and have not been conducted for chronic kidney disease patients with type 2 diabetes mellitus. In addition, the ability of such methods to estimate expected increases in disutility in patients with incremental disease severity has not been tested or compared across methods. Potential biases that may be inherent to using pre-existing data also require exploration and identification.
To address this information gap, we used existing Short Form 36 and 12 Item Health Surveys (SF-36/SF-12) responses from veterans with diabetes mellitus (DM) and chronic kidney disease (CKD) and applied six published transformations to discriminate increasing utility losses (disutilities) as CKD severity increased. Psychometric profile measures such as SF-36 capture information broken down by health dimensions. However, these scores are not easily used for program evaluation or cost-effectiveness analyses. Mathematical models that transform existing psychometric scores into preference-weighted utility scales offer investigators a cost-efficient alternative, relative to administering full-scale preference-based surveys to the population of interest, such as US veterans. Many such transforming methods have been proposed; in our analysis of US veterans with diabetes, we compared six such approaches.
The methods were chosen based on their intention to transform SF-36/SF-12 into well-recognized preference-based, multi-attribute utility scales such as HUI2, SF-6D or VR-6D, and EQ-5D [2, 4, 7, 8, 16–18]. We chose the example of type-2 diabetes and CKD as being of public health and policy importance. The prevalence of type 2 diabetes is projected to increase from 14% in 2010 to 21% in 2050 in the US adult population [19]. Among diabetics, the prevalence of CKD is estimated to be >40% and is a source of increased healthcare utilization and healthcare costs [20].
Investigators who are considering using pre-existing SF-36/SF-12 responses to develop utility estimates using pre-developed transformation methods need to ask themselves: Does each of the available transformation methods perform in a manner that suggests consistency (within-method comparison)? Among the methods that do perform consistently, how will choice of method affect utility estimates (cross-method comparison)? With this in mind, we posed the following analytic questions in conducting this study: (1) Within-method comparison: Did the transformation method under consideration yield a stepwise increase in disutility (when adjusted for covariates), as chronic kidney disease stage increased? and (2) Cross-method comparison: At each stage of chronic kidney disease, were the estimates of disutility generated by the six methods under consideration similar or did they differ?
Methods
Overview
A cross-sectional sample of a well-described longitudinal cohort of veterans with diabetes was used. SF-36/SF-12 responses were available for a subset of cohort members. International Classification of Diseases, 9th edition (ICD-9) codes determined whether respondents were on dialysis or had end-stage renal disease (ESRD). If subjects did not have ESRD/dialysis, serum creatinines were used to stage CKD. Six previously published SF-36/SF-12-to-utility transformations were used to estimate utilities for each subject [2, 4, 7, 8, 16–18]. Generalized linear regression models estimated the disutility associated with each CKD stage, after adjustment for demographics, socio-economics, and comorbid disease conditions.
Population and data sources
US veterans with DM were selected who were both members of a large research cohort of veterans with diabetes and respondents to the Large Veterans Health Survey (LVHS) in 1999 [21]. The research cohort used was the Diabetes Epidemiology Cohort (DEpiC), a well-established longitudinal cohort of veterans with diabetes used in health services research for the past 9 years [22]. DEpiC includes data files from the Veterans Health Administration (VHA inpatient, outpatient, and medication files) and the Center for Medicare and Medicaid Services (CMS) [23] (CMS eligibility, MEDPAR, and Part B files) for all VHA users with diabetes [22]. The diagnosis of diabetes was determined using a validated approach [23], based on having two or more diabetes-specific ICD-9 codes (250.xx, 357.2, 362.0, 366.41) from inpatient or outpatient physician visits.
We selected those US veterans who responded to the LVHS and were members of the DEpiC cohort in 1999, 2000, or 2001. Subjects were categorized as having recent-onset DM (duration of ≤3 years) or prevalent DM (duration >3 years), based on an LVHS survey question regarding duration of diabetes. This categorization was done, as utilities and disutilities experienced by patients may be influenced by duration of illness [24–27].
Included within the 1999 LVHS was the Veterans RAND 36-item health survey (VR-36), a version of the SF-36 that has been previously validated in US veteran populations [21]. Individual responses to the 36 questions, as well as eight subscales—general health (GH), physical function (PF), role physical (RP), role emotional (RE), vitality (VT), bodily pain (BP), social function (SF), mental health (MH)—and two summary scales—physical component scale (PCS) and mental component scale (MCS)—were utilized for the analysis. In addition, we also calculated the PCS-12 and MCS-12 summary scales derived from the subset of SF-12 questions. The subscales derived from VR-36 scores were converted to SF-36 scores by using conversion formulas previously validated for the RP and RE subscales, from 5 point ordinal scales to dichotomized yes/no choices for each of the items [18, 28, 29].
Inclusion and exclusion criteria
To be considered for inclusion in the study, subjects had to have responded to the LVHS in 1999 and have utilized the VHA or Medicare system at least once in fiscal year (FY) 2000. Further, they had to be a DEpiC cohort member in FY1999, FY 2000, or FY 2001. Exclusion criteria included failure to respond to LVHS items pertaining to diabetes, absence of demographic data, and absence of creatinine testing (see CKD staging below). Subjects with certain severe illnesses likely to impact quality of life and/or result in anticipated life expectancies less than ~ one year were also excluded from the study. These illnesses were defined as end-stage hepatic disease, active cancer (other than cancers of the skin and prostate), lower extremity amputations, advanced retinopathy, dementia, and other cognitive impairments. These subjects were considered likely to have large disutilities related to these dominant conditions, making discrimination of additional disutility due to CKD problematic. The presence or absence of these conditions was determined by ICD-9 and CPT codes (Supplemental Table 1). However, in a secondary analysis, these subjects with dominant conditions were added back to the study cohort.
Chronic kidney disease staging
First, it was determined whether respondents were on dialysis or had end-stage renal disease (ESRD), using ICD-9 diagnostic and procedure codes (Supplemental Table 1) [30]. If subjects did not have codes for ESRD and/or dialysis, the presence or absence of CKD and the stage of CKD (if present) were assessed using serum creatinine tests. Estimated glomerular filtration rate (eGFR) from serum creatinine was calculated using the modification of diet in renal disease (MDRD) formula [31]. CKD was defined as no CKD or CKD without alteration of eGFR: stage 0/1 (eGFR ≥ 90 mL/min); stage 2 (60 mL/min ≤ eGFR < 90 mL/min); early stage 3 (45 mL/min ≤ eGFR < 60 mL/min); late stage 3 (30 mL/min ≤ eGFR < 45 mL/min); stage 4 (15 mL/min ≤ eGFR < 30 mL/min); or stage 5 (eGFR < 15 mL/min), using an established approach [32, 33].
Two serum creatinine tests in FY 1999 or FY 2000 were required to establish CKD stage. The first test in FY 1999 was flagged as an index test. Subjects had to have a subsequent serum creatinine test at ≥90 days but <365 days after the index test, in order to be staged. If the index eGFR was ≥90 mL/min, then the subject was determined to have no CKD (stage 0) or CKD without altered eGFR (stage 1). If the index eGFR <90 mL/min, CKD was staged based on the subsequent serum creatinine/eGFR using the categories given above.
Other independent variables
Subjects’ claims data from DEpiC provided information about comorbid disease conditions. These conditions were classified into diabetes-related and non-diabetes-related categories using ICD-9-CM codes (Supplemental Table 2). The diabetes-related categories included macrovascular diseases (coronary artery disease (CAD), congestive heart failure (CHF), arrhythmia, cerebrovascular disease, peripheral vascular disease, gangrene), microvascular diseases (diabetic retinopathy, lower extremity ulcers, lower extremity amputations), and metabolic diseases (hyperglycemia, hypoglycemia, diabetic ketoacidosis, hyperosmolarity, and diabetic coma). The non-diabetes-related diseases were categorized as gastrointestinal, musculoskeletal, pulmonary, neurological, mental and substance-abuse disorders, and other disorders.
Control variables included in multivariate analysis were demographic variables (age, race, marital status, education, employment), VHA eligibility status (priority code), lifestyle behaviors (smoking, alcohol consumption, exercise frequency), and comorbid conditions (diabetes-related and unrelated, as described above).
Utility models
Six previously published transformations (A = SF-12 to EQ-5D, B = SF-36 to HUI2, C = SF-12 to SF-6D, D = SF-36 to SF-6D, E = SF-36 to SF-6D (Bayesian method), F = SF-12 to VR-6D) were used to estimate utilities. The six methods were chosen based on their intention to transform SF-36/SF-12 into well-recognized, preference-based, multi-attribute utility scales: HUI2, SF-6D (VR-6D), and EQ-5D [4, 6–8, 16–18]. Four methods (C, D, E, F) revalue SF-36/SF-12 items to a descriptive system and then apply weights in order to transform the revalued items into a utility score, whereas the two other methods (A, B) rely on regression-based transformation of SF-36 subscales or summary scales to a utility score. Two of six methods (A, B) were developed using US general populations, three methods were developed using a UK general population (C, D, E), while the final method (F) was developed using US veterans. Table 1 summarizes the models used. In each of the six models, the dependent variable is a utility score that ranges between 0 and 1. This utility score is not specific to CKD but reflects the overall health utility experienced by the respondent.
Descriptive analyses and multivariate analyses for the estimation of CKD-related disutility
Means, confidence intervals, medians, and interquartile ranges were used to summarize descriptive data, including utility scores generated by each of the six methods described above.
Generalized linear models were used to estimate the incremental disutility associated with each CKD stage. We took a step-wise approach to building nested models. These were built starting with an unadjusted model; then, we built models that adjusted for demographics and socio-economic variables. Finally, we developed a model that adjusted for demographics, socio-economic variables, and comorbid disease conditions. Generalized linear models were also used to test the null hypothesis that utilities estimated by different methods were equal.
In order to test the impact of mental health disease burden on the results, we calculated utilities among a subgroup of subjects with no mental health or substance-abuse disorders. In order to assess the impact of removing subjects with dominant conditions, a secondary analysis was performed including all subjects. SAS version 9.2 (SAS Institute, Cary, NC) was used for all analyses.
Results
Demographics and epidemiology of diabetes mellitus and chronic kidney disease
A total of 141,747 veterans with diabetes responded to the LVHS question about diabetes duration. Of these, 3,081 were excluded because of missing demographic information (age, sex, race) required for the conversion of serum creatinine into eGFR. A further 38,143 were excluded for having severe and/or life-threatening illnesses (dominant medical conditions), as defined in Methods. Of note, of the remaining 100,253 patients, a greater proportion of those both with mental illness and without CKD were found to have undergone creatinine testing compared to those with no mental illness and without CKD (29 vs. 22%). 22,341 of these 100,253 patients did not have a creatinine test done and were also excluded. Of the remaining, 10,219 had an index eGFR of <90 mL/min; however, we were unable to confirm the CKD stage with a qualifying serum creatinine test. After excluding these individuals, the final study population consisted of 67,963 respondents.
Of these patients, 22,273 patients had recent-onset DM and 45,690 patients had prevalent DM. Compared to those with recent-onset DM, subjects with prevalent DM patients were older, more likely to have diabetes-related diseases, and had lower scores on the SF-36 domain of physical functioning (Tables 2, 3, 4). As CKD severity increased in both recent-onset and prevalent DM, increasing proportions of subjects had macrovascular conditions such as CAD, CHF, and arrhythmia. Patients with CKD stage 0/1 had higher rates of mental illness and substance abuse relative to those with CKD stage 2 through 5. For example, among those with recent-onset DM, 26.4% of subjects with CKD stage 0/1 had a mental illness, while 12.5–19.9% of those with CKD stage 2–5 had a mental illness (Table 2).
SF-36 and SF-12 scores
Among those with recent-onset DM, as CKD severity increased, the SF-36 physical function (PF) and role physical (RP) scores declined (Table 4). This pattern was also seen in subjects with prevalent DM. SF-36 mental health (MH) scores among recent-onset DM ranged from a mean (±sd) of 59.4 ± 25.6 in those with CKD stage 0/1, increasing to 65.0 ± 23.0 among those with CKD early stage 3, then decreasing to a mean of 58.0 ± 24.0 for those with ESRD/dialysis (Table 4).
Utility and disutility estimates
The range of estimated utilities for individual respondents varied by transformation method used (Fig. 1). These ranges were SF-36 to HUI2, 0.249–1.023; SF-36 to EQ-5D, 0.377–1.000; SF-12 to SF-6D, 0.345–1.000; SF-36 to SF-6D (non-Bayesian approach), 0.301–1.000; SF-36 to SF-6D (Bayesian approach), 0.203–1.000; and SF-12 to VR-6D, 0.297–0.997. Mean utility values followed an “S” shaped curve, with improvements in utility at CKD stage 2 compared to stage 0/1 and an improvement in utility from stage 5 to ESRD/dialysis, although these latter estimates were associated with wide confidence intervals due to small numbers. Distributions were not overly skewed, and, generally, the mean utility for each CKD stage was centered, relative to extreme values (Supplementary Table 3). Furthermore, the utilities estimated at each CKD stage (stage 0/1, stage 2, early/late stage 3, stage 4/5 (no dialysis), ESRD/dialysis) were found to differ across transformation methods (p < 0.0001).
Figure 2 summarizes results of multivariate analyses estimating disutilities associated with CKD itself. Estimated disutilities by all six methods fell into different but overlapping ranges. For the recent-onset group, CKD-related disutility ranged from −0.004 to −0.001 for early stage 3; −0.017 to −0.010 for late stage 3; −0.023 to −0.012 for stage 4; −0.078 to −0.033 for stage 5; and −0.012 to −0.001 for ESRD/dialysis (full models available upon request to author). CKD stage 2 was associated with a positive disutility, relative to stage 0/1. At each CKD stage, the absolute value of the associated disutility was less for the prevalent diabetes group than for the recent-onset diabetes group. For both recent-onset and prevalent diabetes groups, differences in utility values were significant (p < 0.01) at each CKD stage and for each contrasted pair of methods.
Secondary analyses
In the first of these analyses, we calculated utilities among a subgroup of subjects with no mental health or substance-abuse disorders, as it was standard practice in VHA mental health clinics in 1999/2000 to send serum creatinine as part of a screening panel of tests and because this practice may have influenced results. The subgroup consisted of 53,060 patients divided into 17,004 subjects with recent-onset DM and 36,056 subjects with prevalent DM. In this analysis, for those with prevalent diabetes, utility decreased from stage 0/1 to stage 5 CKD and then increased for those subjects with ESRD/dialysis (Fig. 3). This was not the case for those with recent-onset diabetes, although the median utility increase from CKD stage 0/1 to stage 2 was 0.0083, less than the median increase in utility from stage 0/1 to stage 2 seen in the primary analysis (0.0166).
We conducted another secondary analysis without excluding patients with dominant illnesses (n = 88,503). In this analysis, 27,675 had recent-onset DM. Patients with dominant illnesses were slightly older than those without (37 vs. 35% over 70 years). Mean PCS and MCS scale scores were similar, with differences in MCS and PCS mean scores between those with and without dominant conditions of −0.62 and −0.54, respectively. Estimated CKD-related disutilities in the analysis including patients with dominant conditions were also similar to disutilities estimated in the primary analysis. As in the original analysis, each transformation resulted in CKD stage 2 being associated with a positive disutility, relative to stage 0/1 (CKD stage 2 disutility range across methods, 0.0037–0.0053, for those with recent-onset diabetes). As in the primary analysis, the magnitude of mean disutilities at each CKD stage was less for the prevalent diabetes group than for the recent-onset diabetes group (data not shown).
Discussion
We found that all six methods of transforming the psychometric profile measure SF-36 or SF-12 into a preference-based utility scale were able to discriminate among utilities associated with progressive stages of chronic kidney disease (CKD), in veterans with recent-onset or prevalent diabetes. The disutilities associated with each CKD stage, estimated from multivariate regression models using stage 0/1 as baseline, were all statistically significant and, for CKD stages 3 onwards, less than zero.
However, in the unadjusted utility analysis, CKD stage 2 had a higher mean utility than stage 0/1 CKD, in each of the six transformation methods. In the adjusted model, the disutility associated with stage 2 was also positive. Discussion with Veterans Health Administration clinicians suggests that this result may be a consequence of the routine practice of sending serum creatinine as part of the screening panel of laboratory studies for veterans receiving care in mental health clinics within the VHA system, during the time period under consideration. This explanation is supported by the finding that the subgroup with CKD stage 0/1 had not only a higher proportion of patients having mental illness but also lower SF-36 MCS scores, compared to all other stages except for ESRD/dialysis. In a sub-analysis in which patients with ICD-9 codes indicative of mental illness were removed, utilities decreased from CKD stages 0/1 through CKD stage 5 in both adjusted and unadjusted analyses, for those with prevalent diabetes. This finding indicates that care must be taken when using pre-existing quality-of-life data in the development of utilities, with a careful search for potential biases introduced by subject selection.
Utilities were also found to increase in patients with an established diagnosis of end-stage renal disease ± dialysis relative to those with stage 5 CKD not yet in receipt of the ESRD diagnosis or dialysis. This was the result of improved scores on most SF-36 subscales for those in the former group, relative to those in the latter. The improved utility for patients on dialysis may represent improved quality of life relative to that experienced by patients with advanced CKD who have not transitioned to renal replacement therapy, although estimates were associated with wide confidence intervals, a result of relatively small numbers of subjects in these groups.
Other investigators have found the SF-36 to SF-6D transformations to be associated with a floor effect in a variety of populations [3, 34, 35], perhaps due to its standard gamble valuation method [36] and a descriptive system that may not cover the most severe states [37]. The tendency to overestimate utilities at the lower end of the utility scale may be most problematic in the SF-12-based SF-6D, which may not be as suitable to assess utility in severely ill patients [4, 6]. The higher predicted utilities in the SF-36 to EQ-5D transformation method may be due to ceiling effects associated with EQ-5D’s valuation method [38]. However, unlike other investigators, we did not find that the estimated utility distributions experienced severe floor or ceiling effects, nor were distributions overly skewed. The range of utility values for the SF-36 to HUI2 transformation did exceed one using the published method, suggesting that some modification of the transformation algorithm at the extreme is needed for this population.
The advantages of our work include a large dataset of greater than 67,000 subjects, representative of the US veteran population with diabetes. To the best of our knowledge, we are the first group to report the discriminant validity of the SF-6D Bayesian non-parametric model as well as the VR-6D model, compared to other transformation methods.
There have only been a few studies publishing utilities for health states associated with CKD [39]. One such study investigated utilities by CKD stage in 205 renal clinic patients, and our results fall in the general range of HUI3 utilities estimated [40]. Another study of 62 dialysis patients reported a mean SF-6D utility of 0.62 [41], higher than SF-6D results for the current study’s subjects. The difference may result from the younger, non-diabetic dialysis population used, with fewer non-renal comorbid disease conditions relative to the veterans with diabetes in the current analysis. Both of these studies had small sample sizes and were not specific to diabetes, gaps in the literature that our study addresses.
Limitations to this study included missing data in VR-36/SF-36 responses. Some questions were not completed by survey respondents, preventing estimation of values for necessary independent variables used in the transformation models. For example, because the Bayesian model required most survey item questions answered to define explicit SF-6D health states, it had the greatest removal of respondents due to missing responses to certain items. In contrast, since the VR-6D method has a built-in algorithm for imputing missing data, it used most of the available patient surveys and had very limited removal of respondents. Because of missing data, including absence of creatinine testing, we assessed utilities in only a subset of total LVHS respondents with diabetes and thus may have introduced selection bias in either direction.
In addition, we had only a single survey response per subject and therefore can only report group (between patient) differences but not longitudinal (within patient) differences among these transformation methods. Although SF-6D has been shown to detect longitudinal differences [35], whether or not this will be true in veterans with diabetes and CKD cannot be tested using our study design. We used data from 1999/2000 and therefore classification of comorbid conditions based on ICD-9 coding may not reflect contemporary practice. Our study used surveys from CKD patients to estimate CKD utilities. Societal preferences are often desired in cost-utility analysis for resource allocation purposes. Because patient populations and general populations may value the same health state differently [42, 43], future work needs to take this into consideration. Finally, our work was not intended to identify a “best” transformation method or to rank methods. It was instead descriptive and evaluative in nature.
Other transformation methods have been published [44–54]. We chose not to include them in our analysis because either they may have inconsistent/paradoxical coefficient estimates [47], have been found to differentiate poorly between different health status states [12, 14], used UK weights for EQ-5D calculation [44–46, 50, 53], suffered a floor effect [10, 11], did not result in improvements in precision [6, 48], have been difficult to implement [51], or the target utility measure is not widely used in the US [49]. Some of our selected methods have already been shown to demonstrate construct and empirical validity [11, 13, 55, 56].
We have shown that all six selected methods transforming SF-36 into a preference-based utility scale discriminated disutilities associated with increasingly severe CKD stage well in a large US veteran population with diabetes, but that values differed by method and that increases in utility (decreases in disutility) between stage 1 and stage 2 were observed. Directly elicited utilities are preferable as transformed measures may be less accurate [12] and indirect methods such as SF-6D systematically yielded lower utilities than direct methods (SG or TTO) [34, 57, 58]. However, as direct measures are not always available, investigators often turn to indirect measures. Our results suggest that careful evaluation of such methods is necessary before incorporating indirectly derived disutilities for chronic kidney disease and diabetes, and possibly other disease conditions, into health economic models.
References
Gold, M. R., Siegel, J. E., Russell, L. B., & Weinstein, M. C. (1996). Cost-effectiveness in health and medicine. Oxford: Oxford University Press.
Brazier, J., Roberts, J., & Deverill, M. (2002). The estimation of a preference-based measure of health from the SF-36. Journal of Health Economics, 21(2), 271–292.
Brazier, J., Roberts, J., Tsuchiya, A., & Busschbach, J. (2004). A comparison of the EQ-5D and SF-6D across seven patient groups. Health Economics, 13(9), 873–884.
Brazier, J. E., & Roberts, J. (2004). The estimation of a preference-based measure of health from the SF-12. Medical Care, 42(9), 851–859.
Brazier, J. E., Yang, Y., Tsuchiya, A., & Rowen, D. L. (2010). A review of studies mapping (or cross walking) non-preference based measures of health to generic preference-based measures. European Journal of Health Economics, 11(2), 215–225.
Mortimer, D., & Segal, L. (2008). Comparing the incomparable? A systematic review of competing techniques for converting descriptive measures of health status into QALY-weights. Medical Decision Making, 28(1), 66–89.
Nichol, M. B., Sengupta, N., & Globe, D. R. (2001). Evaluating quality-adjusted life years: Estimation of the Health Utility Index (HUI2) from the SF-36. Medical Decision Making, 21(2), 105–112.
Sullivan, P. W., & Ghushchyan, V. (2006). Mapping the EQ-5D index from the SF-12: US general population preferences in a nationally representative sample. Medical Decision Making, 26(4), 401–409.
Donald-Sherbourne, C., Unutzer, J., Schoenbaum, M., Duan, N., Lenert, L. A., Sturm, R., et al. (2001). Can utility-weighted health-related quality-of-life estimates capture health effects of quality improvement for depression? Medical Care, 39(11), 1246–1259.
Glasziou, P., Alexander, J., Beller, E., & Clarke, P. (2007). Which health-related quality of life score? A comparison of alternative utility measures in patients with type 2 diabetes in the ADVANCE trial. Health Quality of Life Outcomes, 5, 21.
Hollingworth, W., Deyo, R. A., Sullivan, S. D., Emerson, S. S., Gray, D. T., & Jarvik, J. G. (2002). The practicality and validity of directly elicited and SF-36 derived health state preferences in patients with low back pain. Health Economics, 11(1), 71–85.
Kaplan, R. M., Groessl, E. J., Sengupta, N., Sieber, W. J., & Ganiats, T. G. (2005). Comparison of measured utility scores and imputed scores from the SF-36 in patients with rheumatoid arthritis. Medical Care, 43(1), 79–87.
Lobo, F. S., Gross, C. R., & Matthees, B. J. (2004). Estimation and comparison of derived preference scores from the SF-36 in lung transplant patients. Quality of Life Research, 13(2), 377–388.
McDonough, C. M., Grove, M. R., Tosteson, T. D., Lurie, J. D., Hilibrand, A. S., & Tosteson, A. N. (2005). Comparison of EQ-5D, HUI, and SF-36-derived societal health state values among spine patient outcomes research trial (SPORT) participants. Quality of Life Research, 14(5), 1321–1332.
Pickard, A. S., Wang, Z., Walton, S. M., & Lee, T. A. (2005). Are decisions using cost-utility analyses robust to choice of SF-36/SF-12 preference-based algorithm? Health Quality of Life Outcomes, 3, 11.
SF-6D Bayesian programme. (2011). Retrieved May 25, 2011, from http://www.sheffield.ac.uk/scharr/sections/heds/mvh/sf-6d/bayesian.html.
Kharroubi, S. A., Brazier, J. E., Roberts, J., & O’Hagan, A. (2007). Modelling SF-6D health state preference data using a nonparametric Bayesian method. Journal of Health Economics, 26(3), 597–612.
Selim, A. J., Rogers, W., Qian, S. X., Brazier, J., & Kazis, L. E. (2011). A preference-based measure of health: The VR-6D derived from the veterans RAND 12-item health survey. Quality of Life Research, 20(8),1337–1347.
Boyle, J. P., Thompson, T. J., Gregg, E. W., Barker, L. E., & Williamson, D. F. (2010). Projection of the year 2050 burden of diabetes in the US adult population: Dynamic modeling of incidence, mortality, and prediabetes prevalence. Population Health Metrics, 8, 29.
US Renal Data System. (2010). USRDS 2010 annual data report: Atlas of chronic kidney disease and end-stage renal disease in the United States. National Institutes of Health, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD, 2010.
Kazis, L. E., Ren, X. S., Lee, A., Skinner, K., Rogers, W., Clarke, J., et al. (1999). Health status in VA patients: Results from the Veterans Health Study. American Journal of Medical Quality, 14(1), 28–38.
Miller, D. R., & Pogach, L. (2008). Longitudinal approaches to evaluate health care quality and outcomes: The Veterans Health Administration diabetes epidemiology cohorts. Journal of Diabetes Science Technology, 2(1), 24–32.
Miller, D. R., Safford, M. M., & Pogach, L. M. (2004). Who has diabetes? Best estimates of diabetes prevalence in the Department of Veterans Affairs based on computerized patient data. Diabetes Care, 27(Suppl 2), b10–b21.
Damschroder, L. J., Zikmund-Fisher, B. J., & Ubel, P. A. (2008). Considering adaptation in preference elicitations. Health Psychology, 27(3), 394–399.
Kostopoulou, O. (2006). The transient nature of utilities and health preferences. Medical Decision Making, 26(4), 304–306.
Menzel, P., Dolan, P., Richardson, J., & Olsen, J. A. (2002). The role of adaptation to disability and disease in health state valuation: A preliminary normative analysis. Social Science and Medicine, 55(12), 2149–2158.
Dolan, P. (1996). Modelling valuations for health states: The effect of duration. Health Policy, 38(3), 189–203.
Kazis, L. E., Miller, D. R., Clark, J. A., Skinner, K. M., Lee, A., Ren, X. S., et al. (2004). Improving the response choices on the veterans SF-36 health survey role functioning scales: Results from the Veterans Health Study. The Journal of Ambulatory Care Management, 27(3), 263–280.
Kazis, L. E., Miller, D. R., Skinner, K. M., Lee, A., Ren, X. S., Clark, J. A., et al. (2004). Patient-reported measures of health: The Veterans Health Study. The Journal of Ambulatory Care Management, 27(1), 70–83.
Tiwari, A., Tseng, C. L., Kern, E. F., Maney, M., Miller, D. R., & Pogach, L. (2007). Facility variation in utilization of angiotensin-converting enzyme inhibitors and angiotensin II receptor blockers in patients with diabetes mellitus and chronic kidney disease. The American Journal of Managed Care, 13(2), 73–79.
Miller, W. G. (2009). Estimating glomerular filtration rate. Clinical Chemistry and Laboratory Medicine, 47(9), 1017–1019.
Levey, A. S., Coresh, J., Balk, E., Kausz, A. T., Levin, A., Steffes, M. W., et al. (2003). National Kidney Foundation practice guidelines for chronic kidney disease: Evaluation, classification, and stratification. Annals of Internal Medicine, 139(2), 137–147.
Tseng, C. L., Kern, E. F., Miller, D. R., Tiwari, A., Maney, M., Rajan, M., et al. (2008). Survival benefit of nephrologic care in patients with diabetes mellitus and chronic kidney disease. Archives of Internal Medicine, 168(1), 55–62.
Khanna, D., Ahmed, M., Yontz, D., Ginsburg, S. S., Park, G. S., Leonard, A., et al. (2008). The disutility of chronic gout. Quality of Life Research, 17(5), 815–822.
McDonough, C. M., & Tosteson, A. N. (2007). Measuring preferences for cost-utility analysis: How choice of method may influence decision-making. Pharmacoeconomics, 25(2), 93–106.
Green, C., Brazier, J., & Deverill, M. (2000). Valuing health-related quality of life. A review of health state valuation techniques. Pharmacoeconomics, 17(2), 151–165.
Tsuchiya, A., Brazier, J., & Roberts, J. (2006). Comparison of valuation methods used to generate the EQ-5D and the SF-6D value sets. Journal of Health Economics, 25(2), 334–346.
Kontodimopoulos, N., Pappa, E., Papadopoulos, A. A., Tountas, Y., & Niakas, D. (2009). Comparing SF-6D and EQ-5D utilities across groups differing in health status. Quality of Life Research, 18(1), 87–97.
Dale, P. L., Hutton, J., & Elgazzar, H. (2008). Utility of health states in chronic kidney disease: A structured review of the literature. Current Medical Research and Opinion, 24(1), 193–206.
Gorodetskaya, I., Zenios, S., McCulloch, C. E., Bostrom, A., Hsu, C. Y., Bindman, A. B., et al. (2005). Health-related quality of life and estimates of utility in chronic kidney disease. Kidney International, 68(6), 2801–2808.
Davison, S. N., Jhangri, G. S., & Feeny, D. H. (2009). Comparing the Health Utilities Index Mark 3 (HUI3) with the Short Form-36 preference-based SF-6D in chronic kidney disease. Value Health, 12(2), 340–345.
Chapman, B. P., Franks, P., Duberstein, P. R., & Jerant, A. (2009). Differences between individual and societal health state valuations: Any link with personality? Medical Care, 47(8), 902–907.
Mann, R., Brazier, J., & Tsuchiya, A. (2009). A comparison of patient and general population weightings of EQ-5D dimensions. Health Economics, 18(3), 363–372.
Ara, R., & Brazier, J. (2008). Deriving an algorithm to convert the eight mean SF-36 dimension scores into a mean EQ-5D preference-based score from published studies (where patient level data are not available). Value Health, 11(7), 1131–1143.
Franks, P., Lubetkin, E. I., Gold, M. R., & Tancredi, D. J. (2003). Mapping the SF-12 to preference-based instruments: Convergent validity in a low-income, minority population. Medical Care, 41(11), 1277–1283.
Franks, P., Lubetkin, E. I., Gold, M. R., Tancredi, D. J., & Jia, H. (2004). Mapping the SF-12 to the EuroQol EQ-5D index in a national US sample. Medical Decision Making, 24(3), 247–254.
Fryback, D. G., Lawrence, W. F., Martin, P. A., Klein, R., & Klein, B. E. (1997). Predicting Quality of Well-being scores from the SF-36: Results from the Beaver Dam Health Outcomes Study. Medical Decision Making, 17(1), 1–9.
Gray, A. M., Rivero-Arias, O., & Clarke, P. M. (2006). Estimating the association between SF-12 responses and EQ-5D utility values by response mapping. Medical Decision Making, 26(1), 18–29.
Hawthorne, G., Densley, K., Pallant, J. F., Mortimer, D., & Segal, L. (2008). Deriving utility scores from the SF-36 health instrument using Rasch analysis. Quality of Life Research, 17(9), 1183–1193.
Lawrence, W. F., & Fleishman, J. A. (2004). Predicting EuroQoL EQ-5D preference scores from the SF-12 health survey in a nationally representative sample. Medical Decision Making, 24(2), 160–169.
Le, Q. A., & Doctor, J. N. (2011). Probabilistic mapping of descriptive health status responses onto health state utilities using Bayesian networks: An empirical analysis converting SF-12 into EQ-5D utility index in a national US sample. Medical Care, 49(5), 451–460.
Lundberg, L., Johannesson, M., Isacson, D. G., & Borgquist, L. (1999). The relationship between health-state utilities and the SF-12 in a general population. Medical Decision Making, 19(2), 128–140.
Rowen, D., Brazier, J., & Roberts, J. (2009). Mapping SF-36 onto the EQ-5D index: How reliable is the relationship? Health Quality of Life Outcomes, 7, 27.
Sengupta, N., Nichol, M. B., Wu, J., & Globe, D. (2004). Mapping the SF-12 to the HUI3 and VAS in a managed care population. Medical Care, 42(9), 927–937.
Ferreira, L. N., Ferreira, P. L., Pereira, L. N., Brazier, J., & Rowen, D. (2010). A Portuguese value set for the SF-6D. Value Health, 13(5), 624–630.
Petrou, S., & Hockley, C. (2005). An investigation into the empirical validity of the EQ-5D and SF-6D based on hypothetical preferences in a general population. Health Economics, 14(11), 1169–1189.
Arnold, D., Girling, A., Stevens, A., & Lilford, R. (2009). Comparison of direct and indirect methods of estimating health state utilities for resource allocation: Review and empirical analysis. British Medical Journal, 339, b2688.
Schmidlin, M., Fritsch, K., Matthews, F., Thurnheer, R., Senn, O., & Bloch, K. E. (2010). Utility indices in patients with the obstructive sleep apnea syndrome. Respiration, 79(3), 200–208.
Acknowledgments
This work was funded by VHA Health Services Research and Development Grant “Medications and Diabetes Morbidity in the VA Diabetes Epidemiology Cohort” [Principal investigators: Donald Miller and Leonard Pogach]. Preliminary results from this paper were presented at the 2011 VHA Health Services Research and Development annual meeting (Baltimore, MD) and at the 2011 Society for Medical Decision Making annual meeting (Chicago, IL). The authors thank Heather Franklin for her assistance with manuscript preparation and Sri Ram Pentakota for his feedback. Financial support for this study was provided by the Veterans’ Health Administration/Health Services Research and Development (VHA/HSR&D). The funding agreement ensured the authors’ independence in designing the study, interpreting the data, writing, and publishing the report.
Conflict of interest
All authors report no financial conflicts of interest.
Author information
Authors and Affiliations
Corresponding author
Additional information
SF-36® and SF-12® are registered trademarks of the Medical Outcomes Trust.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Rajan, M., Lai, KC., Tseng, CL. et al. Estimating utilities for chronic kidney disease, using SF-36 and SF-12-based measures: challenges in a population of veterans with diabetes. Qual Life Res 22, 53–64 (2013). https://doi.org/10.1007/s11136-012-0139-1
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11136-012-0139-1