Introduction

Myocardial perfusion imaging (MPI) remains a cornerstone for diagnosis and risk stratification in patients with coronary artery disease.1,2 The volume of MPI grew for the past 3 decades.3-5 Substantial growth in the use of MPI has generated concerns for overuse of this technology.6,7 To address this concern, the American College of Cardiology (ACC) and other specialty societies developed appropriate use criteria (AUC) for MPI in 2005, which was later revised in 2009.8,9

Since the development of the AUC for MPI, several studies have described the prevalence of inappropriate MPI. Most of these studies were conducted in single centers yielding variable rates of inappropriate MPI testing.10-19 The process of rating indications as “inappropriate” is supposed to account for low clinical utility in those scenarios; however, few studies have reported on the relationship between inappropriate MPI and cardiovascular outcomes.15,20-22 A recent analysis of appropriateness across imaging modalities showed a decrease in trans-thoracic echocardiography and computerized tomography angiography, but not with stress echocardiography or SPECT; however, this study did not evaluate all studies in temporal sequence and did not address the relationship between inappropriate studies and cardiovascular outcomes.23

We conducted this systematic review and meta-analysis to demonstrate the relationship between inappropriate MPI and the frequency of abnormal test results and myocardial ischemia. Secondarily, we intended to test the hypothesis that inappropriate MPI testing is less common when ordered by cardiology providers.

Methods

Data Sources

We performed a computerized literature search of the MEDLINE database without language restriction from January 2005 until December 2014 using the search strategy shown in Figure 1. To ensure that no potentially important studies were missed, the reference lists from the retrieved articles were also checked.

Figure 1
figure 1

Study selection flow diagram. Summary of how the systematic search was conducted and eligible studies were identified

Selection Criteria

We selected studies that (1) reported the prevalence of appropriate and inappropriate MPI testing according to either the 2005 or 2009 AUC for MPI, (2) described clinical outcomes among both groups, or (3) reported the variation among clinicians in ordering MPI tests. If the individual study used both criteria for evaluation of the MPI tests ordered, we utilized the data based on the 2009 AUC. We excluded studies that reported only a single arm (either appropriate or inappropriate MPI testing). If multiple reports were made from the same data, we preferentially used reports that included report of cardiovascular outcomes.

Data Extraction

Two authors (IYE and AM) independently extracted data on sample characteristics, sample size, intervention strategies, outcome measures, and other study characteristics from the included studies using a standardized form. Any discrepancies were resolved by consensus of the authors. For all clinical outcomes, we tabulated the number of events that occurred in each cohort within the study.

Outcomes and Definitions

The co-primary outcomes were the rate of abnormal test results and the presence of cardiac ischemia compared between inappropriate and appropriate or uncertain MPI. We used the definition of cardiac ischemia as reported in the individual studies. Other outcomes included all-cause mortality, non-fatal myocardial infarction (MI), the combined outcome of mortality or MI, cardiac catheterization, and revascularization.

Statistical Analysis

Since none of the studies had low event rates (arbitrarily defined as an expected event rate in any cell below 5.0), summary random effects odds ratio (OR) were constructed using the DerSimonian-Laird method.24 This meta-analysis was conducted based on the meta-analysis of observational studies in epidemiology (MOOSE) guidelines.25 An inverse variance weighted meta-regression for the rate of inappropriate testing vs the midpoint of the enrolment period in the individual studies was conducted. All P-values were 2-tailed, with statistical significance declared at P ≤ .025. All overall estimates are accompanied by 95% confidence intervals (CI). All meta-analyses were performed using Comprehensive Meta-Analysis version 3.0 (Biostat; Englewood, NJ), and the meta-regression was conducted using SAS 9.3 (SAS Institute, Cary NC).

Results

A total of 22 studies with 23,443 patients were included in our analysis.10-20,26-36 The results of our search strategy are illustrated in Figure 1. The 2005 AUC were used by 6 studies,16,18,19,31,34,36 and the remainder used the 2009 AUC.10-15,17,20,26-30,32,33,35 Single-photon emission computed tomography (SPECT) was the modality utilized in most of the studies,13-20,26-28,30,31,33-36 one study reported using positron emission tomography (PET),12 while the remainder of the studies did not specify.10,11,29,32 Among the included studies, 5 were conducted outside the United States.11,17,26,30,33 Patients were recruited from the outpatient setting in 3 studies,20,27,29 from an inpatient setting in 1 study,10 and from both settings in 13 studies.11-16,19,26,28,30,32,34,35 Overall, the summary rate for inappropriate testing was 14.8% (95% CI 11.6%-18.7%). The baseline characteristics and the percentage of inappropriate testing among the included studies are reported in Table 1. In Table 2, we summarize the quality of the studies included and report the most common reason for inappropriate MPI in the individual studies.

Table 1 Baseline characteristics of the included studies
Table 2 Assessment of study components

Primary Outcomes

The relationship between inappropriate MPI and abnormal test results was reported by 8 studies reported the incidence of abnormal test results,10,12,17,18,20,26,32,33 while 6 studies described on myocardial ischemia.10,12,20,26,28,32 Individual study definitions for ischemia are supplied in supplemental Table 1. Inappropriate MPI studies were less likely to be abnormal 15.6% vs 42.0% (OR 0.41, 95% CI 0.35-0.49, P < .0001) compared to appropriate tests (Figure 2). Additionally, inappropriate MPI testing was less likely to demonstrate cardiac ischemia 6.1% vs 18.5% (OR 0.40, 95% CI 0.24-0.67, P < .0001) when compared to appropriate testing (Figure 3).

Figure 2
figure 2

Summary plot for abnormal test results. The relative size of the data markers indicates the weight of the sample size from each study. CI, confidence interval

Figure 3
figure 3

Summary plot for detection of cardiac ischemia. The relative size of the data markers indicates the weight of the sample size from each study. CI, confidence interval

Additional Analyses

No significant difference was observed in the pattern of ordering inappropriate tests between cardiology and non-cardiology providers (OR 0.74, 95% CI 0.51-1.06, P = .10) (Figure 4). No significant correlation was observed between the rate of inappropriate testing and the midpoint of the enrollment year (0.52%, 95% CI −1.16% to −2.20% per year, P = .54) (Figure 5). Meta-analysis could not be performed for the risk of all-cause mortality, MI, the combined outcome of mortality or MI, cardiac catheterization, or revascularization due to the limited number of studies reporting these outcomes. In Table 3, we summarize the rate of all-cause mortality, MI, the combined outcome of mortality or MI, cardiac catheterization, and revascularization in the individual studies. The study by Doukky et al was both large and had a notably different point estimate for inappropriate MPI.20 To evaluate the impact of this study on our investigation, we excluded it in a sensitivity analysis. Doing this, the summary estimate of inappropriate testing was 14.0% (95% CI 12.1%-16.2%) and estimates for the co-primary outcomes were not changed; (OR 0.41, 95% CI 0.33-0.50, P < .0001) for abnormal test results, and (OR 0.35, 95% CI 0.18-0.62, P = .002) for ischemia.

Figure 4
figure 4

Summary plot for the rate of inappropriate testing according to provider. The relative size of the data markers indicates the weight of the sample size from each study. CI, confidence interval; MPI, myocardial perfusion imaging

Figure 5
figure 5

Rate of inappropriate studies over time. On the X-axis, studies were arranged chronologically based on the midpoint of patient enrollment for their study. MPI, myocardial perfusion imaging

Table 3 Cardiac events

Discussion

AUC are based on the premise that MPI testing for inappropriate indications will not benefit patients. In this meta-analysis of 22 studies including 23,443 patients, inappropriate MPI are consistently less likely to be abnormal or demonstrate ischemia as compared to appropriate studies. Using currently available data, we were unable to analyze relationship between inappropriate MPI and more substantial cardiovascular outcomes, such as catheterization, MI, and death. While the data for these outcomes were not sufficient to meta-analyze, individual studies demonstrated a substantial effect.20,21 Doukky et al showed a dramatic difference in cardiac events despite having a notably higher rate of inappropriate MPI as compared to other studies.20 Our analysis suggests that systems such as the AUC are valid strategies for focusing limited health care resources on those patients with the most to benefit from testing. It is important to mention that the 2013 AUC for stable ischemic heart disease now uses the term “rarely appropriate as opposed to inappropriate”; 18 out of 80 indications are rated as “rarely appropriate.”37

Since the first AUC for MPI were published 10 years ago, a reasonable assumption is that growing awareness of recommendations about inappropriate testing might alter patterns of care. Although Fonseca et al analyzed the temporal trends in AUC for SPECT,23 our analysis included 7 additional studies.10,11,13,14,26-28 Using meta-regression, we analyzed our data for any change in the rate of inappropriate testing over time. If any change has occurred, the magnitude of change (0.52%, 95% CI −1.16% to −2.20% in our analysis) is smaller than the wide range of reported inappropriate rates (5.2%-45.5%). Larger scale real-world data from programs such as FOCUS and ImageGuide™ may make it possible to ascertain such temporal trends in the future.38,39 We observed a wide range of rates of inappropriate MPI. In one study with the highest inappropriate rate, the patient population was based on multiple community practices, in contrast to the vast majority of the studies included in this analysis which were conducted in single-tertiary care centers. Differences in patient populations and behavior of ordering physicians may have contributed to this difference. In our sensitivity analysis excluding this study, our results did not change.20

Compared with non-cardiology providers, cardiology providers did not order significantly fewer inappropriate MPI in our analysis. The publications addressing this question have shown mixed results with some finding a difference,16,34 and other with no difference.12,40 Doukky et al were able to demonstrate that if a difference does exist, the likely explanation relates to the patient population being tested and not to an enhanced knowledge base by cardiology providers.40 The data were insufficient to make other comparisons about practice setting, such as geographic differences, US vs non-US investigations, academic vs community institutions, and inpatient vs outpatient setting. This last comparison may become more challenging as the next iteration of AUC shift away from modality-based rating systems to multimodality ratings based on patient presentations.

Few studies on interventions to reduce inappropriate testing have been published. Some of the heterogeneous approaches attempted include education, peer review and feedback, and point-of-care decision support systems.19,27,41 Among these studies, education alone appeared insufficient to reduce inappropriate MPI while the addition of other system changes resulted in a decrease. These findings are in accord with recent investigations and systematic reviews, which demonstrate that decision support systems can reduce unnecessary imaging.42-44 Another strategy that has been considered is prior authorization, although a recent analysis on this strategy suggests no effect on the rate of inappropriate MPI among patients insured by private carriers as compared to Medicare which does not implement prior authorization measures.45

Meta-analysis of AUC is subject to a number of limitations. We relied on the AUC classification systems used by the original authors. Multiple studies have demonstrated that inter-rater and intra-rater variation is not negligible; however, we could not ascertain in which direction this might bias our results.19,46 One study showed the AUC inter-rater variation for non-cardiologists raters was modest compared to cardiologists (κ = 0.51).47 As noted previously, we could not comment on the risk of cardiac events due to the limited number of studies reporting these outcomes. Because we lacked access to patient-level data, we could not analyze appropriateness of MPI based on patient characteristics such as sex, obesity, or diabetes since this has been described to increase the risk of bias.48 Publication bias is a potential limitation to meta-analyses. Finally, we included studies that used 2 different AUC (2005 & 2009); however, only one study utilizing the 2005 AUC was included in the analysis of the co-primary outcomes.18

New Knowledge Gained

Inappropriate MPI are less likely to yield abnormal results or determine myocardial ischemia. There has not been a significant reduction in the rates of inappropriate MPI testing with time. No differences were observed in the pattern of ordering inappropriate MPI between cardiology and non-cardiology providers.

Conclusions

We observed a wide range of inappropriate MPI rates, and estimate the overall rate among published studies to be 14.8%. Inappropriate MPI tests are consistently less likely to be abnormal or demonstrate ischemia than appropriate MPI, thus limiting their clinical utility. Currently available literature is insufficient to meta-analyze the relationship between death, MI, and inappropriate MPI. The rate of inappropriate MPI does not appear to have changed significantly over the time.