INTRODUCTION

Pay-for-performance (P4P) incentives continue to be the key strategy by which stakeholders align financial rewards with quality healthcare.1,2 Not only is the use of P4P incentives common, but recent health legislation mandates their continued use. Increasingly, public and private health plans are experimenting with tying payment rewards to quality healthcare through “value-based purchasing” efforts, “alternative quality contracts,” and accountable care organizations.2–4 Understanding how P4P programs work is important for shaping payment policy as health reform continues to unfold.

One ongoing concern about the use of P4P programs is that they could widen resource gaps among provider organizations and thus exacerbate disparities in care.5 Physician organizations (POs)—medical groups and independent practice associations (IPAs)—located in lower socioeconomic status (SES) areas may be less able to obtain bonuses than those in higher SES areas for at least three reasons. First, POs in lower SES areas may treat patients who are less educated or less wealthy, and therefore less able to seek needed health information or follow recommended care; for example, it may be more difficult to achieve high rates of screening for diabetic retinopathy or cervical cancer among these patients.6,7 Second, lower SES areas are likely to have fewer resources than more affluent ones—fewer specialist physicians, pharmacies, laboratories, imaging facilities, and transportation options.8–13 Third, POs in lower SES areas may have fewer financial and human resources because they have a worse “payer mix” (i.e., a higher proportion of Medicaid and uninsured patients), receive lower payment rates from health plans, and/or have difficulty recruiting highly qualified physicians and staff.6,7,14–17

Recent research has demonstrated that hospitals and health plans located in less affluent areas score lower on the types of measures used in pay-for-performance programs,6,9,15,18 and that patients are less likely to receive guideline-recommended care (e.g., screening mammograms) when they are cared for by individual physicians whose panels include higher percentages of low-SES patients.6,15,18 However, it is not known whether POs located in lower SES areas perform worse in large-scale P4P programs.

We address this question using data from the California Integrated Healthcare Association (IHA) P4P program, the largest non-governmental, multi-payer P4P program in the U.S.14–16 The IHA program represented a prevailing approach to P4P program design at the time, and tying bonuses to “all-or-nothing” achievement (i.e., payment only if a pre-determined quality threshold is crossed) is still the approach used by Medicare and most commercial health plans, even as some are beginning to combine P4P with global budgets.5,19,20 Since 2003, this program (which includes the seven largest health plans in California) has distributed between $38 million and $65 million in P4P bonuses annually to approximately 200 POs across the state.21

We used readily available SES information from the United States Census and geocoding techniques to characterize the SES of a PO, and to quantify the relationship between area-based SES and P4P performance. We hypothesized that POs with more primary care physicians in practice sites located within lower SES areas are less likely to score in the top tiers of performance within IHA’s P4P program than those located in higher SES areas.

METHODS

Setting and Participants

Our study included 160 of the 219 POs that participated in IHA’s P4P program in 2008 and 2009.21 IHA provided the name and address of the central administrative office for each of the 219 POs. We called POs directly and searched their websites to gather additional information on their 11,718 practice sites, which we needed to calculate the area-based PO SES measure. We excluded 43 POs for which we were unable to identify practice sites or the number of primary care physicians at those sites, seven for which IHA did not receive sufficient information to calculate P4P scores, and two that did not report their percentage of annual revenues from Medicaid. We also excluded five academic medical centers because they have access to funding not generated by clinical activities (e.g., research, graduate medical education) and therefore may be less affected when some of their practice sites are in low SES areas.22,23 Finally, we excluded two POs that underwent major organizational changes between 2008 and 2009 (e.g., went out of business).

The unit of analysis was the PO because that was the level at which P4P performance was assessed and bonus payments were made.

Measures

Area-based PO SES Measure

Our main predictor was an area-based PO SES measure based on Krieger’s area-based socioeconomic measure, a well-established method for characterizing the SES of a census tract.24–28 The area-based PO SES measure serves as a proxy for the characteristics of the individual patients likely to be seen by a PO located in an area and also for the resources within that area (e.g., number of imaging facilities). To determine PO SES (Fig. 1), we first identified all 11,718 practice sites within the 160 POs and the number of primary care physicians (PCPs) at each site; we considered physicians to be PCPs if they practiced in general internal medicine, geriatrics, general practice, family practice, or general pediatrics. Second, we geocoded each practice site to its census tract29 and linked tracts to Krieger’s 10 census-defined variables. Third, we calculated a z-score for each of these 10 variables, standardizing values against those for California as a whole (i.e., if a census tract’s value for a variable equaled the state average, the tract’s z-score for that variable was 0), and summed each tract’s z-scores, weighting each census variable equally per previously described methodologies,24–28 to generate a site-specific summary SES score. Finally, because practice sites varied in size and because the P4P program focuses on primary care performance targets, we weighted the site-specific SES scores by the proportion of the PO’s total number of PCPs located at each site; these weighted site-specific scores were then summed to arrive at the area-based PO SES measure, oriented such that larger values correspond to higher SES.
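As an illustration of this calculation (not the authors’ code; the column names, data structures, and the assumption that all census variables are already oriented so that larger values indicate higher SES are hypothetical), the following sketch computes the measure for one PO from site-level census data:

```python
import pandas as pd

def po_ses_score(sites: pd.DataFrame, ca_stats: pd.DataFrame) -> float:
    """Illustrative area-based PO SES calculation for a single PO.

    sites    : one row per practice site, with an 'n_pcps' column and one
               column per Krieger census variable (names hypothetical),
               already oriented so that larger values mean higher SES.
    ca_stats : statewide mean and SD for each census variable, indexed by
               variable name, with columns 'mean' and 'sd'.
    """
    ses_vars = list(ca_stats.index)                      # the 10 census variables
    # z-score each variable against the California statewide distribution
    z = (sites[ses_vars] - ca_stats["mean"]) / ca_stats["sd"]
    # equally weighted sum of the 10 z-scores = site-specific summary score
    site_scores = z.sum(axis=1)
    # weight each site by its share of the PO's primary care physicians
    pcp_weights = sites["n_pcps"] / sites["n_pcps"].sum()
    return float((site_scores * pcp_weights).sum())
```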

Figure 1. Method for determining the area-based physician organization socioeconomic status (PO SES) measure.

IHA P4P Performance Score

Our main outcome variable was the composite performance score used in IHA’s P4P program; this is the score health plans use to calculate the P4P bonus payments they make to POs. The IHA P4P composite performance score comprised measures of clinical quality (e.g., the percentage of eligible patients screened for breast, cervical, and colorectal cancer), patient experience (e.g., the percentage of patients who indicated that their providers communicated well), and use of information technology and registries (e.g., the percentage of physicians using clinical decision support, and whether registries were being used to coordinate care for patients with diabetes). IHA’s methodology for calculating the composite performance score is available on the IHA website, as is the total amount of bonuses paid by health plans annually.21,30

Covariates

We included three covariates. The first was PO size (the total number of PCPs, expressed in log base 10), because larger organizations may have more resources or may be better able to take advantage of economies of scale (e.g., installing information technology, supporting nurse care managers). The second was PO type (medical group versus IPA), since some data suggest that larger medical groups on average perform better than IPAs.31 The third was the percentage of annual revenues from Medicaid. We included the Medicaid variable because POs in both high and low SES areas vary greatly in their willingness to accept Medicaid patients (hence the low correlation between our Medicaid and SES variables). The Medicaid variable may thus provide some additional information, beyond that contained in our SES measure, about the characteristics of some of the patients seen by the PO and about the resources available to it (because Medicaid pays physicians at very low rates).32

Statistical Analysis

We used a non-parametric trend test to explore the bivariate relationship between the area-based PO SES measure and IHA’s P4P performance score.33 To mimic the fact that bonus payments were triggered by crossing a performance threshold, we examined the likelihood of POs scoring in the top two quintiles of performance using bivariate and multivariate regression models that estimate relative risks (RRs), as described by Zou.34 Because its distribution was highly skewed, the Diabetes Registry Use score (range 0–5) was dichotomized as a score of 5 versus less than 5.
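The published analyses were run in Stata; purely as an illustration of Zou’s modified Poisson approach (a Poisson model for a binary outcome fit with robust standard errors, whose exponentiated coefficients are relative risks), the sketch below uses Python’s statsmodels on simulated data with hypothetical variable names:

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm
import statsmodels.formula.api as smf

# Simulated stand-in for the analysis dataset: one row per PO with an
# indicator for scoring in the top 40% of performance, the area-based SES
# measure, log10 of the PCP count, PO type, and percent revenue from Medicaid.
rng = np.random.default_rng(0)
n = 160
df = pd.DataFrame({
    "ses_score": rng.normal(0, 5, n),
    "log10_pcps": rng.uniform(0.8, 3.1, n),
    "medical_group": rng.integers(0, 2, n),
    "pct_medicaid": rng.uniform(0, 0.8, n),
})
df["top40"] = (rng.random(n) < 0.4).astype(int)

# Zou's modified Poisson approach: Poisson GLM for a binary outcome with
# robust (sandwich) standard errors; exp(coefficients) are relative risks.
model = smf.glm(
    "top40 ~ ses_score + log10_pcps + medical_group + pct_medicaid",
    data=df,
    family=sm.families.Poisson(),
)
result = model.fit(cov_type="HC0")
print(np.exp(result.params))   # relative risks per unit of each covariate
print(result.summary())
```

Note that a one-unit increase in log10_pcps corresponds to a tenfold increase in the number of PCPs, which is how the PO-size effect is reported in the Results.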

We conducted three sensitivity analyses. First, we examined the likely impact of our inclusion criteria by comparing the characteristics of the 160 included POs to those of the excluded POs using Wilcoxon rank-sum tests.35 Second, we examined the impact of area definition by varying the size of the area upon which the PO SES measure was based: the practice site’s census tract only; the tract plus all tracts within a 1.5- or 10-mile radius; or the tract plus all immediately contiguous tracts (generally broader than the 1.5- or 10-mile radius).36–39 These radii were based on prior research showing that most patients seek primary care close to their homes—within a 1.5-mile radius in urban and suburban areas or a 10-mile radius in rural areas.36,37 Third, we varied the performance levels typically used to determine whether a bonus payment would be triggered (i.e., top 25%, 33%, or 50%, in addition to the top 40%).
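The geospatial tooling used for the radius-based sensitivity analysis is not described in the paper; as a rough sketch under assumed data structures (a tracts GeoDataFrame with an 'ses_score' column and geometries in a projected coordinate system measured in meters), one might select the tracts within a given radius of a practice site as follows:

```python
import geopandas as gpd
from shapely.geometry import Point

METERS_PER_MILE = 1609.34

def tracts_within_radius(site_xy, tracts: gpd.GeoDataFrame,
                         radius_miles: float) -> gpd.GeoDataFrame:
    """Return the census tracts intersecting a circular buffer around a site.

    site_xy : (x, y) coordinates of the practice site in the same projected
              CRS as the tracts layer (units of meters).
    """
    buffer = Point(site_xy).buffer(radius_miles * METERS_PER_MILE)
    return tracts[tracts.geometry.intersects(buffer)]

# Hypothetical usage: broaden the SES measure from the site's own tract to
# all tracts within 1.5 miles (urban/suburban) or 10 miles (rural).
# nearby = tracts_within_radius((site_x, site_y), tracts_gdf, 1.5)
# site_ses = nearby["ses_score"].mean()
```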

We used Stata version 11 (StataCorp, College Station, TX) for all analyses. The Institutional Review Boards of The University of Chicago, Weill Cornell Medical College, and Children’s Hospital of Boston approved this study.

Role of the Funding Source

This project was supported by a grant from the Robert Wood Johnson Foundation. The funding source had no role in the design, data collection, analysis, interpretation, decision to submit manuscript for publication, or manuscript preparation. Dr. Chien is supported by a Career Development Award (K08HS017146) from the Agency for Healthcare Research and Quality.

RESULTS

The 160 POs included 11,718 practice sites with 21,831 PCPs. (Table 1.) Within these POs, the total number of PCPs ranged from 6 to 1,248 (mean 136, SD 156) and the total number of practice sites ranged from 1 to 727 (mean 73, SD 104). Fifty-seven percent of the POs were IPAs; 43% were medical groups. Half of the POs received no revenue from Medicaid; among the POs that did, the mean reported percentage of total revenue from Medicaid-insured patients was 10% (SD 16%, range 0.2% to 80%). The area-based PO SES measure ranged from −11 to +11 (mean 0, SD 5; range for all census tracts within California was −24 to +26). POs’ composite performance scores ranged from 23 to 86 (mean 69, SD 15; possible range 0–100).

Table 1 Physician Organization Characteristics, Socioeconomic Status, and Composite Pay-for-Performance (P4P) Performance Score

The bivariate relationship between the area-based PO SES measure and the composite performance score was significant (trend test p < 0.001); POs in higher SES areas had better performance scores. (Fig. 2.) POs located in the highest SES quintile had a median performance score nearly 20 points higher than those located in the lowest quintile. However, four POs within the lowest SES quintile scored in the highest quintile of P4P performance, and four POs within the highest SES quintile scored quite low. In multivariate analysis, the PO SES measure continued to be significantly associated with the IHA P4P performance score. (Table 2.) A one-standard-deviation increase in the area-based PO SES measure (about 5 points) was associated with a 44% greater likelihood (RR 1.44, 95% CI 1.22-1.71, p < 0.001) of the PO being ranked in the top 40% of performance. As expected, the likelihood of a PO ranking in the top 40% was also higher for larger POs than for smaller ones (RR 2.55 for a PO with ten times more physicians; 95% CI 1.67-3.90, p < 0.001) and for medical groups compared to IPAs (RR 2.93, 95% CI 2.00-4.28, p < 0.001). POs with higher percentages of revenue from Medicaid were less likely to rank in the top two performance quintiles (RR 0.68, 95% CI 0.50-0.93, p = 0.017).
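For interpretation, these relative risks can be back-translated into log-scale regression coefficients (a back-calculation from the reported estimates, not the authors’ model output). The SES effect is reported per standard deviation (about 5 SES points), and the PO-size effect is reported per unit of log10(PCPs), i.e., per tenfold increase in PCPs:

$$\beta_{\text{SES}} \approx \frac{\ln(1.44)}{5} \approx 0.073 \ \text{per SES point}, \qquad \beta_{\log_{10}\text{PCPs}} \approx \ln(2.55) \approx 0.94,$$

so that $e^{5 \times 0.073} \approx 1.44$ per SD of SES and $e^{0.94} \approx 2.55$ for a tenfold increase in PCPs.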

Figure 2. Bivariate relationship between the physician organization socioeconomic status measure and the Integrated Healthcare Association’s composite P4P performance score.

Table 2 Bivariate and Multivariate Relationships between Physician Organization Socioeconomic Status and Performance in Integrated Healthcare Association’s Pay-for-Performance Program

We obtained similar results when we conducted multivariate analyses using the scores for individual domains of the composite performance score as the outcome variables. (Table 2.)

In sensitivity analyses, we found these associations to be robust whether we based the area-based PO SES measure on each practice site’s census tract alone, the site’s tract plus all tracts within a 1.5- or 10-mile radius, or the site’s tract plus all immediately contiguous census tracts (data not shown). The associations were also robust to varying the cutoff threshold for the IHA P4P performance score (top 25%, 33%, or 50%). Finally, excluded POs reported a significantly higher percentage of their revenues from Medicaid (median 43% for excluded POs versus 0.3% for included POs; p < 0.001) and had lower composite performance scores (mean 49 for excluded POs versus 69 for included POs; p < 0.001).

DISCUSSION

We found a significant association between the SES of the census tracts in which POs are located and their performance in IHA’s P4P program. This association was robust across three different methods for defining the areas in which a PO is located, and at a variety of cut-points for awarding bonuses in P4P programs.

To our knowledge, this is the first study to show that an area-based SES measure is associated with P4P performance for large medical groups and independent practice associations; an analogous association has recently been demonstrated for hospital P4P.9

This finding may be of particular interest to rank-and-file physicians because they will be increasingly subject to performance incentives as part of how they are reimbursed for their work. Tying rewards or sanctions to healthcare quality through pay-for-performance or public reporting programs remains a core strategy in healthcare reform efforts; these incentives are featured prominently in the Patient Protection and Affordable Care Act (ACA) and will be relied upon to counterbalance incentives aimed primarily at controlling costs.1 While early experience suggests that performance incentives may improve care quality, there are concerns that these programs may also inadvertently penalize providers serving more disadvantaged populations and thus potentially widen disparities in the quality of care.5,10,40–45

This finding is also of interest because it suggests that the healthcare “neighborhood” may matter even for the large organizations included in the IHA P4P program—large medical groups and IPAs thought to constitute an important foundation upon which the accountable care organizations featured in health care reform can be built.46–48

We found a significant association between the SES of the areas in which a PO is located and the PO’s performance on quality metrics in both bivariate and multivariate analyses. From an immediate policy point of view, the bivariate result—which shows a somewhat larger “effect” of SES—is most relevant because this is the manner in which payments are disbursed. If policymakers are concerned that their P4P program may be increasing resource gaps between POs in higher and lower SES areas, they first need to know whether POs in lower SES areas have lower performance scores, regardless of their other characteristics. The bivariate analysis provides this information. The multivariate analysis illustrates the strength of this relationship after accounting for other important explanatory factors.

Our study has six main limitations. First, the POs in the IHA program are quite large; it is possible that the association between performance and area-based SES is different for small physician practices. Second, IHA’s P4P program operates only in California, which differs from most of the U.S. in that it has many large medical groups and IPAs, more capitation, and more delegation of utilization management from health plans to POs; findings in this setting may or may not be generalizable to other parts of the country. Third, our analysis does not include detailed information about individual patients or about specific resources in the areas served by POs. A multi-level analysis—one that includes patient-level, PO-level, and area-level variables—would be needed to better determine the relative contribution of each factor. Fourth, we had to exclude 59 of the 219 eligible POs. Excluded POs served more Medicaid patients and performed less well in the P4P program; had we been able to include them, the association between the area-based PO SES measure and performance would likely have been even stronger. Fifth, although IHA’s program represents a prevailing approach to P4P program design, alternative tactics (e.g., those attempting to reward incremental improvements rather than all-or-nothing achievement) may yield different results.49,50 Sixth, P4P may have a different effect if achievement levels are set lower versus higher, although in our study results did not change across a variety of achievement levels.

Our data do not address the mechanisms by which POs in lower SES areas score lower on IHA’s performance measure. It is possible that these POs deliver poorer quality care, and/or that it is more difficult for POs in low SES areas to score well, even when they deliver high quality care, for the reasons discussed in the introduction to this article. Regardless of the mechanism, P4P programs that fail to account for the SES of the areas in which providers are located risk increasing resource gaps between providers in high versus low SES areas, and thus increasing disparities in health care delivery.

Public and private payers may want to consider alternative designs for P4P programs to make them less likely to increase disparities, and medical groups and IPAs may want to encourage these alternative designs. One approach would be to adjust for the SES of the areas a PO serves through risk adjustment or by placing POs into strata based on their SES score and paying P4P bonuses based on comparisons within the same stratum.5 Further work is needed to better understand how alternative incentive strategies may affect providers or which structural features are important for POs aiming to improve quality while being located in lower SES settings.17,50

Another option would be to design P4P programs to de-emphasize absolute thresholds (i.e., only paying providers for reaching a certain level of performance) and tournaments (where providers earn rewards for scoring higher than their competitors), and emphasize rates of relative improvement.49,51,52 In fact, in 2009, IHA suggested a methodology that gives performance attainment and relative improvement more equal weighting; since then, three of the seven participating health plans have adopted this recommendation. This strategy would not eliminate the possibility that P4P programs would increase disparities, but could mitigate it.

In summary, this paper supports the hypothesis that the SES of the locations in which a medical group’s or IPA’s physicians care for patients is likely to be associated with how well the PO performs in P4P programs. The strengths of this paper include the diversity of POs with respect to their area-based SES, and the use of well-established methodologies for defining area SES and for measuring P4P performance. As policymakers and researchers devote increasing attention to the reasons why POs in lower SES areas have lower scores on P4P quality measures, and to ways of redesigning P4P programs, it may be possible to reduce, or at least not increase, disparities in health care delivery between richer and poorer areas. Physicians will want to learn how some POs located in low SES areas are nevertheless able to provide high quality care, as four POs in our study did. Future studies should evaluate the relative contributions of patient-, practice-, and area-level factors when assessing healthcare performance.