Introduction

Federally qualified health centers (FQHCs) have incorporated elements of the patient-centered medical home (PCMH) model since their inception. FQHCs provide care to low-income and high-need populations in underserved areas (HRSA 2016). FQHCs have historically provided high-quality, team-based, coordinated primary care to patients regardless of their ability to pay. They must provide fees on a sliding scale according to a patient’s income, and they receive enhanced reimbursements and additional grant funding to support their operations (Quyen et al. 2011; Shi et al. 2015; Starfield et al. 2005).

FQHCs are increasingly serving both disabled and elderly Medicare beneficiaries. The number of Medicare beneficiaries served in FQHCs nearly doubled from 1 to 2 million beneficiaries between 2005 and 2014, according to the National Association of Community Health Centers (NACHC 2016). While Medicare does not limit provider selection for fee-for-service Medicare beneficiaries may prioritize care at FQHCs because they are often convenient, provide interpreters or transportation services, and offer culturally-sensitive care (HRSA 2017b).

Given the growth of FQHCs as a usual source of care for Medicare beneficiaries living in underserved areas (Chang et al. 2016), the Centers for Medicare & Medicaid Services’ (CMS) recent FQHC Advanced Primary Care Practice (APCP) Demonstration evaluated whether helping FQHCs to achieve PCMH recognition improved outcomes for Medicare beneficiaries (Kahn et al. 2015a; Kahn et al. 2015b). Advanced primary care practices build upon the joint principles of a strong primary care delivery system including improved access, coordinated and comprehensive care, and continuous quality improvement. Several organizations, including the National Committee for Quality Assurance (NCQA), have encouraged practices to codify their application of PCMH principles (American Academy of Family Physicians et al. 2007; Mahmud et al. 2018). With CMS’s APCP Demonstration, FQHCs participating in the Demonstration received financial support and technical assistance to achieve NCQA Level 3 (the highest NCQA level of recognition) within a 3-year period. Recognition depended on scoring according to six standards, each of which comprised multiple elements. Recognized sites achieved Level 1, 2, or 3 recognition based on their total number of points scored across elements and on the number of points scored on must-pass elements. Points were assigned for the following six standards: (1) Enhance Access and Continuity, (2) Identify and Manage Patient Population, (3) Plan and Manage Care, (4) Provide Self-Care Support and Community Resources, (5) Track and Coordinate Care, and (6) Measure and Improve Performance (NCQA 2016).

An important dimension of care quality as reflected in the PCMH standards is care coordination, where information is shared among providers, providers communicate with patients, and follow-up care is appropriate (NQF 2010; PCPCC 2011). Continuity of care (COC), which refers to an ongoing relationship between a patient and his or her provider or practice and is one of the key elements of advanced primary care models, has been associated with improved outcomes and reduced costs (Amjad et al. 2016; DuGoff et al. 2016; Hussey et al. 2014; Maarsingh et al. 2016). COC among primary care providers might be particularly important for FQHCs, since the vast majority of the services FQHCs provide are primary-care focused. One of the mechanisms through which we hypothesize that the demonstration would improve beneficiary health and lower spending is through improved COC.

The FQHC Demonstration provided an opportunity to assess whether primary care COC (PCCOC) for Medicare beneficiaries visiting FQHCs changed with the Demonstration. Several studies have evaluated the influence of primary care delivery on outcomes for Medicare beneficiaries in FQHCs compared with other primary care settings, but PCCOC has often not been included as an outcome in these studies (Bailey and Goodman-Bacon 2015; Jackson et al. 2013; Mukamel et al. 2016; Shi et al. 2017; Wright et al. 2015).

This analysis includes three main objectives. First, we characterize baseline levels of PCCOC, measured both at the provider and the practice level, among Medicare beneficiaries seeking care within a nationwide sample of FQHCs. Second, we examine whether differences emerged over time in PCCOC for Medicare beneficiaries attributed to demonstration versus comparison FQHCs. Finally, we explore whether PCCOC was influenced by the CMS Demonstration for pre-specified subgroups of Medicare beneficiaries.

Methods

Detailed descriptions of the intervention and evaluation methods are available in prior reports and articles (Kahn et al. 2017; Kahn et al. 2015b; Timbie et al. 2017).

Intervention

The CMS FQHC APCP Demonstration assisted participating FQHCs in becoming NCQA-recognized Level-3 PCMHs using NCQA’s 2011 specifications (NCQA 2016). Level 3 recognition requires documentation of enhanced access and better data systems to manage and track populations. Support provided to demonstration sites included care management fees ($18 per quarter per attributed Medicare beneficiary), transformation assistance in the form of webinars and one-on-one coaching, and periodic feedback reports on beneficiary outcomes and spending, as well as clinic-level performance summaries.

Study sample

The demonstration sample included 503 FQHC practice sites that CMS selected to ensure diversity with respect to region, urbanicity, electronic health record adoption, baseline scores on the NCQA PCMH instrument, and receipt of medical home-related payments from payers. The evaluation team selected 827 comparison sites that were selected based on comparability of patient, site, and geographic characteristics. There was no overlap in sites from the same organization in the treatment and control groups, although within each group, an organization could have multiple sites.

Beneficiaries eligible for inclusion in the evaluation sample had to be continuously enrolled in Medicare Parts A and B during the baseline year (November 2010 to October 2011). Beneficiaries were ineligible if they were enrolled in Medicare Advantage or had end-stage renal disease at any point during that period. Since the outcome measure for PCCOC can lead to spurious effects for patients with few visits (Eriksson and Mattsson 1983; Liss et al. 2011; Perry et al. 2016), we also required that the sample for this study have at least three primary care visits to an ambulatory provider in the baseline year and in each of the 3 demonstration years. The 123,653 beneficiaries in the final sample were attributed to a primary care practice responsible for the plurality of their primary care visits in the baseline year (the most common provider). If there was a tie between two providers, the beneficiary was assigned to the practice associated with the most recent claim. More details on the procedures noted here can be found in the appendix to Timbie et al. (2017).

Data

We used Medicare Parts A and B fee-for-service claims from November 2010 to October 2014 for beneficiaries attributed to demonstration and comparison FQHCs.

Outcome: measuring PCCOC

We measured PCCOC using the claims-based usual provider of care (UPC) index, which represents the proportion of primary care visits within a year to the provider responsible for the plurality of services over the same year. We chose to use the UPC index instead of the Bice–Boxerman index as our measure of continuity because the units of measurement provide a more straightforward interpretation of the concentration of visits with the most common provider. While both are measured on a 0 to 1 scale, the Bice–Boxerman is interpreted as the degree of care dispersion across all of a patient’s providers (Bice and Boxerman 1977; Pollack et al. 2016), while the UPC can be interpreted as the proportion of visits to the most common provider. Other studies have previously demonstrated that the UPC index is correlated with the Bice–Boxerman COC index (Pollack et al. 2016; Romaire et al. 2014; Smedby et al. 1984; Smedby et al. 1986).

Our initial examination of the UPC index indicated that approximately 30% of beneficiary-year measurements had a provider-level UPC value of 1, meaning that almost one-third of beneficiaries visited the same provider every year. In contrast, approximately 60% of the beneficiary-year measurements had a practice-level UPC value of 1, meaning that about three in five beneficiaries visited the same practice for an entire year, but not necessarily the same provider within the practice. To monitor changes in the proportion of beneficiaries who consistently see the same primary care provider in a given year, we also analyzed a binary measure of perfect COC (equal to a value of 1 if the beneficiary had a UPC value of 1 in the given year, and zero otherwise) over time, in addition to the continuous measure that is calculated on a 0 to 1 scale.

Analysis of provider- vs practice-level continuity

Practice-level continuity assesses the proportion of each patient’s primary care visits to the practice site with the plurality of services over the year, while provider-level continuity assesses the proportion of a patient’s primary care visits to a particular provider within the practice. Patients who consistently obtain care from the same practice may have better outcomes than patients who seek care from multiple, unaffiliated practices (Perry 2016).

For the practice-level COC analyses, a primary care practice was defined as an FQHC site, rural health clinic (RHC), or a non-FQHC/non-RHC primary care practice. We used provider transaction access numbers (PTANs) to identify unique FQHCs and RHCs, and tax identification numbers (TINs) for other primary care practices. For the provider-level analyses, a primary care provider was defined as a provider with one of the following taxonomy codes associated with their National Provider Identifier (NPI): internal medicine, general practice, family medicine, OB/GYN, nurse practitioner, physician assistant, or geriatrics.

Analysis of changes over time among demonstration and comparison FQHCs

We used a difference-in-differences study design to compare changes in the UPC index over time for beneficiaries attributed to the demonstration versus comparison sites. We analyzed changes in UPC indices across each of the 3 demonstration years (November 2011 to October 2014) compared with the baseline year (November 2010 to October 2011) between the demonstration and control groups. This study design allowed us to control for unobserved factors that may differ between the two groups and that may influence COC, so long as these factors do not vary over time.

To strengthen the internal validity of the analysis, we applied propensity score weights to each beneficiary’s data that reflected his or her propensity to be attributed to a demonstration site. These weights are a function of baseline beneficiary, site, and geographic characteristics, including 14 baseline claims-based quality and utilization measures as well as the baseline PCCOC. For all analyses, we used propensity-score weighted generalized estimating equation (GEE) models to assess the relationship between the demonstration and the UPC index, while controlling for beneficiary- and site-level characteristics that may influence COC independent of the demonstration. Each model used the appropriate family and link function to account for the varied distributions of each outcome. These characteristics and their description are summarized in Table 1. The standard errors account for the clustering of beneficiaries within sites and over time.

Table 1 Baseline sample characteristics, demonstration versus comparison clinics

Sensitivity analyses examining beneficiary and FQHC subgroups

To explore potential heterogeneity in the effect of the demonstration on PCCOC, we examined effects within beneficiary or FQHC subgroups where we hypothesized that demonstration effects on PCCOC might be strongest. The subgroups are as follows: 1) age 85+ versus 65–84 years; 2) black versus white race; 3) disabled versus non-disabled; 4) eligible for both Medicare and Medicaid, (i.e., “dual eligibility”) versus non-dual; 5) rural versus urban site location; and, 6) medium-sized FQHCs with 5–14 sites versus smaller FQHCs with less than 5 sites. We ran separate models for each subgroup using a three way interaction term between the subgroup, the intervention indicator and the post-period in our difference-in-differences model.

Full models and description of control variables can be found in Appendix Table 5. Appendix Table 6 presents the results of all sensitivity analyses. All analyses were conducted in SAS, Version 9.3. The RAND Human Subjects Protection Committee approved this study.

Results

Descriptive characteristics

Just over half the sample was dually eligible for Medicare and Medicaid, and a similar percentage of beneficiaries were disabled. Over 60% of beneficiaries in both the demonstration and comparison groups lived in urban areas, and the majority of beneficiaries were attributed to sites that were part of multi-site FQHCs. Beneficiaries in both groups had just over seven provider visits per year, on average. Table 1 summarizes other characteristics of the beneficiary sample before applying propensity score weights.

Baseline primary care COC

PCCOC was high among both providers and practices in the baseline (Fig. 1) year before the demonstration began. Figure 1 displays the unadjusted trends in the UPC index at the provider level and practice level over the analysis period. At baseline, the respective demonstration and comparison provider level PCCOC UPC index values were 0.77 and 0.78, and the practice-level PCCOC UPC index values were 0.88 and 0.89. Thirty-eight percent of beneficiaries had perfect provider-level continuity in the demonstration sites, and 37% in the comparison sites at baseline. Sixty-two percent of beneficiaries had perfect practice-level continuity at baseline for both demonstration and comparison groups.

Fig. 1
figure 1

Unadjusted trends in PCCOC over time, provider and practice level, demonstration group vs comparison. Notes: upper lines, continuous measure, lower lines, binary measure. PCCOC = primary care continuity of care is measured with the usual provider of care index (UPC). The continuous UPC index represents the proportion of visits to the provider that is responsible for the plurality of primary care services provided to each beneficiary. The binary UPC measure represents the proportion of beneficiaries with perfect provider-level primary care continuity (achieving a UPC index equal to 1). The ‘demonstration’ clinics are those participating in the Center for Medicare and Medicaid Innovation’s Federally Qualified Health Center Advanced Primary Care Practice Demonstration. The difference between the baseline and year 3 level for both the continuous and binary measures and the practice and provider levels are all significant at the p = 0.05 level.

Unadjusted changes in primary care COC with time

The trends in unadjusted PCCOC in both the demonstration and comparison groups declined slightly across all years from baseline to year 3 for both the provider and practice levels (Fig. 1), and the magnitude of the declines were similar between demonstration and comparison sites. The provider UPC index values decreased slightly from 0.77 to 0.75 in the demonstration sites and from 0.77 to 0.74 in the comparison sites. The practice-level UPC index values declined by a similar degree, as did the binary measure (Fig. 1).

Difference-in-differences regression results

The effect of the demonstration on the continuous PCCOC was small in magnitude, even though the coefficients are statistically significant for years 2 and 3 (Table 2). For example, the effect of the demonstration on provider-level PCCOC in years 2 and 3 is −0.8 percentage points (p = 0.001) for the continuous UPC outcome measures. The decreases are larger in magnitude for the practice-level continuous UPC index measure. For example, the effect of the demonstration on the practice-level continuous UPC outcome measure is −1.3 percentage points (p < 0.001). The binary measure shows a similar pattern (Table 3), with both the provider and practice levels decreasing over time and the practice dropping more than the provider level, and no practical effect of the demonstration.

Table 2 Regression results of the influence of the demonstration on beneficiary PCCOC, demonstration versus comparison clinics, continuous measure
Table 3 Regression results of the influence of the demonstration on beneficiary PCCOC Values, demonstration versus comparison clinics, binary measure

Some of the other coefficients in the regression models indicate associations with PCCOC. Advancing age (compared with < 65 years) is significantly associated with higher PCCOC for both the provider and practice levels. Being female or having a higher hierarchical condition categories (HCC) score (greater severity of illness) is associated with lower continuity at both the provider and practice levels. Participating in a CMS Shared Savings model or program is positively associated with provider-level, but not with practice-level, PCCOC.

PCCOC for subgroups

We hypothesized that the demonstration may affect particularly vulnerable groups differently. Across the analyses shown in Table 4, black beneficiaries in the demonstration sites showed improved COC across 2 of the 3 demonstration years relative to white beneficiaries at the provider level. FQHCs with 5–14 service delivery sites showed significant declines in provider-level UPC scores (for years 1 and 2) and practice-level UPC scores (for years 1, 2, and 3) compared to FQHCs with less than 5 service sites.

Table 4 Effects of the demonstration on continuous UPC Index for subgroups analyzed using the demonstration by year interaction

Discussion

Our study finds that in the year prior to the Demonstration’s onset, beneficiaries attributed to both the demonstration and comparison FQHCs already had high levels of PCCOC as measured by the UPC index. The continuous UPC index measure was over 0.70 for the provider level and over 0.80 at the practice level for both the comparison and demonstration practices. These UPC values are higher than those reported in published studies spanning multiple practice settings: Previous reports note provider-level UPC scores of 0.59 for practices caring for type 2 diabetic patients (Weir et al. 2016), of 0.69 for adult primary care patients (Ionescu-Ittu et al. 2007), and of 0.66 for geriatric primary care patients (Rodriguez et al. 2008). The higher baseline UPC index scores observed in our study may reflect characteristics of FQHCs and their efforts to promote team-based and continuous care (Quyen et al. 2011; Shi et al. 2013; Shi et al. 2003; Starfield et al. 2005).

An important finding of this paper is that practice-level PCCOC is higher at all time points than provider-level PCCOC. Strategies adopted by FQHCs to improve access and quality are likely to preserve or enhance practice-level PCCOC. If the regular provider is not available, FQHCs may substitute another team-based provider, resulting in a lower provider-level UPC index but not impacting the UPC index at the practice level. Increased use of electronic health records also allows multiple providers within the same practice to see the same patient more seamlessly. Additionally, FQHCs seek to accommodate urgent care needs, and offering same-day appointments could further reduce provider-level continuity.

For both groups, the UPC scores declined slightly across the 3-year CMS FQHC Demonstration, similar to previous studies that also found mixed results from initiatives to improve care coordination for Medicare beneficiaries (Peikes et al. 2009; Sinaiko et al. 2017). Given that FQHCs were already showing high levels of COC, this type of small decline may be inevitable. Many FQHCs adopted strategies that had potential to decrease both provider-level and practice-level COC, while improving access.

We find nearly identical values for demonstration and comparison sites at each point in time for both provider- and practice-level UPC indexes. This finding may reflect both demonstration and comparison sites implementing similar strategies to enhance PCCOC. By the end of the demonstration, approximately 37% of comparison sites had achieved some level of PCMH recognition compared with 84% of demonstration sites (Timbie et al. 2017). In our study, we observed both demonstration and comparison sites expanding hours, increasing staff, building teams to support primary care, and conducting patient- and community-level outreach. External funding from several other sources provided incentives for FQHCs to implement these strategies (Kahn et al. 2017; Timbie et al. 2017).

In subgroup analyses, higher levels of patient comorbidity are associated with lower continuity at both the practice and the provider levels. Finally, we found that continuity improved for black Medicare beneficiaries during the Demonstration. It is not clear why improvements were not observed for other subgroups. It could be that all FQHCs seek to improve care for underserved groups, so the differences between demonstration and comparison groups were minimal.

Limitations

In addition to the limitations of the COC measures noted in the methods section, it is not clear how to translate marginal changes in the COC score with meaningful impacts on beneficiaries’ experience of care coordination and resulting outcomes. Patient-reported care coordination may differ from claims-based PCCOC. However, claims-based measures do provide a comprehensive look at patterns of care. In the future, pairing quantitative findings with qualitative interviews of beneficiaries and providers about whether they perceive increases in continuity and whether there are impacts on health outcomes, or spending would be useful. Other limitations of the evaluation of the model itself have been described elsewhere (Kahn et al. 2017; Timbie et al. 2017), including a slower than anticipated achievement of Level 3 recognition in the demonstration sites, and practice improvements occurring in the control group over the same time period. Medicare beneficiaries represent a small, but growing, portion of FQHCs’ overall patient panel, and the demonstration payments for increased care coordination activities may not have been large enough to incentivize practice changes that significantly improved COC.

Conclusion

Consistent with FQHCs’ mission to improve access to care, our analyses show that Medicare FQHC users are achieving high levels of continuity of care at both demonstration and comparison site FQHCs. Overall, our demonstration and comparison site analyses demonstrated declines over time in provider-and practice-level COC, but the magnitudes of the changes were small.