Introduction

Coronary artery disease (CAD) is still the leading cause of death in industrialized countries, and the prevalence is expected to increase worldwide.1,2 In patients with known or suspected CAD, stress myocardial perfusion imaging (MPI) with single-photon emission computed tomography (SPECT) accounts for the vast majority of tests currently performed for ischemia detection. The diagnostic and prognostic role of this imaging modality is well established, and a negative stress MPI is able to identify subjects at low risk of future cardiovascular events.3,4 However, conventional (C) SPECT (C-SPECT) systems utilize large sodium iodide crystals, photomultiplier tubes, and parallel-hole collimation and are therefore inherently insensitive, necessitating prolonged imaging times, and relatively large radioisotope doses. Recent innovations in camera technology, fast electronics, and reconstruction algorithms have addressed these issues.5 The novel gamma cameras with semiconductor cadmium-zinc-telluride (CZT) detectors directly convert radiation into electric signals, allowing an improvement in terms of image accuracy and acquisition time.6,7,8 Specifically, new multi-pinhole SPECT cameras with CZT solid-state detectors (CZT-SPECT) technology provide for faster image acquisition and lower radiation doses in comparison with traditional sodium-iodine Anger cameras. This allows for MPI protocols preserving diagnostic image quality and diagnostic accuracy.9,10 At the best of our knowledge, the diagnostic performance of C-SPECT and CZT-SPECT have not been compared. Therefore, the aim of this meta-analysis was to compare the diagnostic performance of C-SPECT and CZT-SPECT systems in detecting CAD as defined by invasive coronary angiography.

Materials and Methods

This meta-analysis followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement (see the supplementary material for PRISMA checklist).11

Data Sources and Study Selection

We searched the PubMed and Web of Science databases for English literature from January 2000 to February 2018 on the diagnostic accuracy of MPI for the detection of CAD. Studies search was restricted to data obtained in humans and adults and was conducted using the following key words: myocardial perfusion imaging (OR MPI), single-photon emission tomography (OR SPECT), cadmium-zinc-telluride (OR CZT) SPECT, Anger camera and diagnostic performance in combination with coronary artery disease (OR CAD). The full search strategy for PubMed and Web of Science is shown in the supplementary material.

The bibliographies of selected articles and relevant reviews were screened for potentially suitable references. Two reviewers (V.C. and R.G.) screened for appropriateness title and abstract of potentially relevant studies and disagreement was resolved by consensus. The full-published reports of the abstracts selected by the reviewers were retrieved, and the same reviewers independently performed a second-step selection based on the inclusion criteria; disagreements were resolved by consensus. We included a study if: (1) it assessed C-SPECT or CZT-SPECT as a diagnostic test to evaluate patients for the presence of CAD; (2) CAD was defined as at least 50% diameter stenosis on invasive coronary angiography; and (3) it reported cases in absolute numbers of true positive, false positive, true negative, and false negative results, or if these data were derivable from the presented results. A study was eligible regardless of whether patients were referred for suspected or known CAD. Studies were excluded if they were conducted with: (1) phantom-only models and (2) study population without coronary angiographic correlation. In case of multiple studies reported from the same research group, potential cohort duplication was avoided by including the largest study only.

Data Extraction

Each study was initially identified considering author, journal and year of publication. Population data were collected on age, prevalence of female sex, traditional cardiovascular risk factors (diabetes, dyslipidemia, smoking, hypertension, family history of CAD), angina-like symptoms, and history of CAD (including previous myocardial infarction and coronary revascularization). Further extracted variables consisted of patient characteristics, technical information and absolute numbers of true negative, true positive, false negative, and false positive test results. To improve the comparability of study results in the analysis of overall diagnostic performance, we selected a cut-off value of ≥ 50% whenever possible. However, if data were not reported for a cut-off value of ≥ 50%, we selected the cut-off value that was available (e.g. ≥ 70%). For C-SPECT, a cut-off of ≥ 50%% was used in 21 studies and a cut-off of ≥ 70% in 4 studies. For CZT-SPECT, a cut-off of ≥ 50%% was used in 10 studies and a cut-off of ≥ 70% in 5 studies.

Quality Assessment

The methodological assessments of the quality of eligible studies were graded by two reviewers independently, according to the Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2) tool (Agency for Healthcare Research and Quality, Cochrane Collaboration and the U.K. National Institute for Health and Care Excellence),12 which is recommended for use in systematic reviews of diagnostic accuracy based on sources of bias and variation. The following four aspects are required to use the QUADAS-2 tool: (1) summarize the evaluation question; (2) develop the tool and produce evaluation with guidance; (3) construct a flow diagram for the original study; and (4) judge bias and applicability. The QUADAS-2 tool can provide obvious grades of bias and applicability of primary diagnostic accuracy studies. It comprises four significant domains including: (1) patient selection; (2) index test; (3) reference standard; and (4) the flow and timing. Each domain contains several signal questions used to help judge the risk of bias (low, high, or unclear).12 The two reviewers completed the screening process independently. Disagreement in the process of answering questions was discussed until consensus was reached. A final decision of “yes (favorable scenario, “+”)”, “no (unfavorable scenario, “−”)” or “unclear (mixed scenario, “+/−”)” was made by the reviewers after systematic discussion. If the answers to all the signal problems were “yes”, a low risk of bias was attributed to the study; if the answers to all the signal problems had one or more “no” or “unclear” values, an unclear risk of bias was used; if the answers to all the signal problems contained at least one “no” but no “yes” answers, a high risk of bias was attributed.

Statistical Analysis

For each eligible study, data were extracted to estimate sensitivity, specificity, and diagnostic odds ratio (OR) with 95% confidence interval (CI). The bivariate random-effects model was used to calculate the pooled summary estimates for sensitivities and specificities and to construct their forest plots for C-SPECT and CZT-SPECT.13 The bivariate model estimates pairs of logit-transformed sensitivity and specificity from studies, incorporating the correlation that might exist between sensitivity and specificity. To measure the pooled accuracy, the parameters estimated by the bivariate model were used to construct a smooth summary receiver operator characteristic (ROC) curve, with the area under the curve and the summary operating point with the 95% confidence region calculated13; also, a prediction region that has a 95% probability of including true sensitivity and specificity of a future study was generated. The areas under the two summary ROC curves were compared using a formula provided by Hanley and McNeil.14,15 Multilevel mixed-effects logistic regression was used to compare the summary paired sensitivity/specificity data, adding test type (C-SPECT or CZT-SPECT) as covariate. Likelihood ratio tests were used to obtain the statistical differences between the sensitivities and specificities of the two tests type by fitting alternative models, adding or removing the covariate term from the model.16 Briefly, we compared the model without covariate with the including test-type as covariate allowing separate variances for each test. If a significant likelihood ratio test was found, to investigate if the difference in accuracy was due to sensitivity or specificity further analyses were done assuming the same sensitivity or specificity for the two test and dropping the relevant covariate terms from the model.17 Between-study heterogeneity was evaluated with Cochran’s Q and I2 statistics. When statistical heterogeneity was substantial, meta-regression analysis was performed to identify potential confounders.18 To evaluate the presence of diagnostic threshold effect, the correlation between sensitivity and specificity was examined by the Spearman rank correlation test. Publication bias was examined using the effective sample size funnel plot and associated regression test of asymmetry described by Deeks et al.19 All analyses were performed using Stata, version 15.1 (StataCorp, College Station, TX). Two-sided P values ≤ .05 were considered statistically significant.

Results

Search Results

The complete literature search is presented in Figure 1. The initial search identified 2973 potentially eligible citations. After removing 883 duplicate records, 2090 records were screened by the reviewers. After the titles and abstracts evaluation, 1877 citations were discharged because they were judged to be non-relevant or non-pertinent. Thus, 213 full-text articles were blinded assessed by each investigator for eligibility. After revision, 173 articles were excluded leaving 40 articles (25 C-SPECT and 15 CZT-SPECT studies) including 7334 patients (4997 in C-SPECT and 2337 in CZT-SPECT studies).

Figure 1
figure 1

Literature search and selection process of studies included in analysis

Characteristics of C-SPECT Studies

The characteristics of C-SPECT studies as well as demographic and clinical patient data are detailed in Table 1.20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43 Data were obtained using exercise stress test in only one study,23 exercise or pharmacologic stress test in 10 studies22,24,26,28,32,35,36,41,42 and pharmacologic stress test in 14 studies.20,21,25,27,28,30,31,32,34,37,38,40,43 The imaging tracers were Tc-99m labeled agents in 16 studies,20,21,22,24,25,27,28,30,32,33,34,35,38,40,42,43 thallium-201 in 3 studies,23,29,37 dual isotope thallium-201/Tc-99m sestamibi in 3 studies26,31,41 and Tc-99m labeled agents or thallium-201 in 3 studies.21,36,39 One study considered as cut-off of abnormality a summed stress score > 3,42 2 studies a summed stress score ≥ 4,32,36 one study a summed stress score > 4.41 One study considered as cut-off of myocardial ischemia summed difference score ≥ 235 while one study considered as cut-off of myocardial ischemia transient ischemic dilation ≥ 1.18.43 In 7 studies, myocardial perfusion was defined normal when a perfusion defect involved < 3 myocardial segments.20,21,23,24,26,33,39 In 2 studies, a normal response was defined as a normal uptake (> 70%) at rest and after stressor injection.27,31 Finally, in 10 studies the cut-off of abnormality was defined on visual analysis based on several different parameters.22,25,28,30,34,37,38,40 Study sample size ranged from 31 to 1853 subjects. The mean age ranged from 56 to 72 years and the proportion of women from 8% to 100%. The diagnostic performance in detecting CAD for each study is shown in Table 2.

Table 1 Demographic data and clinical characteristics of patients studied with C-SPECT camera
Table 2 Diagnostic performance in detecting coronary artery disease with C-SPECT camera

Characteristics of CZT-SPECT Studies

The characteristics of CZT-SPECT studies as well as demographic and clinical patient data are detailed in Table 3.44,45,46,47,48,49,50,51,52,53,54,55,56,57,58 Data were obtained using exercise or pharmacologic stress test in 7 studies45,46,48,50,51,52,57 and pharmacologic stress test in 8 studies.44,47,49,53,54,55,56,58 The imaging tracer was a Tc-99m labeled agent in 10 studies,44,46,49,51,52,53,55,56 thallium-201 in 2 studies,57,58 dual isotope thallium-201/Tc-99m sestamibi in 2 studies50,54 and Tc-99m sestamibi or thallium-201 in one study.45 Two studies considered as cut-off of abnormality a summed stress score > 3,53,57 2 studies a summed stress score ≥ 4,47,49 2 studies considered as cut-off of myocardial ischemia summed difference score ≥ 2.50,54 In 2 studies, myocardial perfusion was defined normal when a perfusion defect involved < 3 myocardial segments.44,56 In 7 studies, the cut-off of abnormality was defined on 17-segment visual interpretation using a 5-point score.45,46,48,50,52,55,58 Study sample size ranged from 44 to 695 subjects. The mean age ranged from 59 to 71 years and the proportion of women from 18% to 57%. The diagnostic performance in detecting CAD for each study is shown in Table 4.

Table 3 Demographic data and clinical characteristics of patients studied with CZT-SPECT camera
Table 4 Diagnostic performance in detecting coronary artery disease with CZT-SPECT camera

Quality Assessment

The methodological quality assessment of risk of bias within eligible studies for C-SPECT is shown in Figure 2, according to the QUADAS-2 tool. Overall, the number of high, unclear, and low risk of bias was 1, 23, and 1, respectively, for the four domains (patient selection, index test, reference standard, and flow and timing). The number of unclear and low concerns regarding applicability was 22 and 3, respectively, for the three domains (patient selection, index test, and reference standard). The methodological quality assessment of risk of bias within eligible studies for CZT-SPECT is shown in Figure 3 according to the QUADAS-2 tool. Overall, the number of unclear and high risk of bias was 13 and 2, respectively, for the four domains (patient selection, index test, reference standard, and flow and timing). The number of unclear concerns regarding applicability was 15 for the three domains (patient selection, index test, and reference standard).

Figure 2
figure 2

Methodological quality of the included C-SPECT studies assessed with QUADAS-2 tools for risk of bias and applicability concerns

Figure 3
figure 3

Methodological quality of the included CZT-SPECT studies assessed with QUADAS-2 tools for risk of bias and applicability concerns

Diagnostic Accuracy of C-SPECT and CZT SPECT Cameras

The pooled sensitivity and specificity were 85% (95% CI 79-89) and 66% (95% CI 56-74) for C-SPECT (Figure 4) and 89% (95% CI 86-91) and 69% (95% CI 61-75) for CZT-SPECT (Figure 5) imaging studies. The summary ROC curves for C-SPECT and CZT-SPECT are depicted in Figures 6 and 7, respectively. The area under the curve was slightly higher for CZT-SPECT (0.89, 95% CI 0.86-0.92), with a rather restricted confidence and prediction regions, as compared to C-SPECT (0.83, 95% CI 0.80-0.86) (P = .03); accordingly, the summary diagnostic OR was 17 (95% CI 13-22) for CZT-SPECT and 11 (95% CI 7-15) for C-SPECT (P = .04). The accuracy of the two tests slightly differs between C-SPECT and CZT-SPECT (chi-square 11.28, 5 df, P = .04). However, we were unable to demonstrate if the subtle difference in global accuracy was due to sensitivity (chi-square 2.13, 1 df, P = .14) or specificity (chi-square 0.21, 1 df, P = .65), also when separate variances for each test were allowed in the models.

Figure 4
figure 4

Forest plot of single studies for sensitivity and specificity for C-SPECT. Horizontal lines represent 95% confidence interval of the point estimates. Each solid circle represents sensitivity and specificity of individual studies, and the size of the circle indicates the study size. The diamond means the pooled sensitivity and specificity of all 25 studies. CI, confidence interval

Figure 5
figure 5

Forest plot of single studies for sensitivity and specificity for CZT SPECT. Horizontal lines represent 95% confidence interval of the point estimates. Each solid circle represents sensitivity and specificity of individual studies, and the size of the circle indicates the study size. The diamond means the pooled sensitivity and specificity of all 15 studies. CI, confidence interval

Figure 6
figure 6

Summary receiver-operating characteristic curve for C-SPECT. Each circle represents individual study estimates. The diamond is the summary point representing the average sensitivity and specificity estimates. The ellipses around this summary point are the 95% confidence region (dashed line) and the 95% prediction region (dotted line). AUC, area under the curve; SROC, summary receiver-operating characteristic

Figure 7
figure 7

Summary receiver-operating characteristic curve for CZT SPECT. Each circle represents individual study estimates. The diamond is the summary point representing the average sensitivity and specificity estimates. The ellipses around this summary point are the 95% confidence region (dashed line) and the 95% prediction region (dotted line). AUC, area under the curve; SROC, summary receiver-operating characteristic

An additional analysis was performed excluding a C-SPECT study43 with poor performance. The pooled sensitivity and specificity were 86 % (95% CI 81-89) and 65% (95% CI 56-74) for the remaining C-SPECT studies and the area under the curve (0.85, 95% CI 0.82-0.88) was still slightly lower (P = .03) as compared to CZT-SPECT.

Heterogeneity Analysis

Our analysis revealed that the heterogeneity in sensitivity was higher among C-SPECT (I2 95.1%, P < .001) than CZT-SPECT (I2 68.3%, P < .001) studies. Similarly, the heterogeneity in specificity was higher among C-SPECT (I2 94.9%, P < .001) than CZT-SPECT (I2 84.9%, P < .001) studies. The Spearman rank correlation test showed a weak threshold effect in C-SPECT studies (rho − 0.44, P = .02), but not in CZT-SPECT studies (rho − 0.32, P = .23).

Potential Bias and Meta-regression Analysis

The Deek’s funnel plot shows a trend of asymmetry for C-SPECT (bias 6.49, standard error 3.30, P = .06) (Figure 8). Conversely, the Deek’s funnel plot shows no evidence of asymmetry for CZT-SPECT (bias 2.29, standard error 4.77, P = .64) (Figure 9). At meta-regression analysis, no significant association between both sensitivity and specificity and demographic (age and gender) and clinical (diabetes, dyslipidemia, smoking, hypertension, angina-like symptom, and family history of CAD) variables considered was found for C-SPECT (Figure 10) and CZT-SPECT (Figure 11) studies. The prevalence of obstructive CAD was not different (P = .83) between C-SPECT (69%) and CZT-SPECT (53%) studies. Finally, the distribution of stenosis threshold and imaging tracers used were also not statistically different between C-SPECT and CZT-SPECT studies.

Figure 8
figure 8

The Deeks’ funnel plot asymmetry test for publication bias in the literature evaluation for C-SPECT. Each study is shown as a circle, and the regression line is shown by dashed line

Figure 9
figure 9

The Deeks’ funnel plot asymmetry test for publication bias in the literature evaluation for CZT SPECT. Each study is shown as a circle, and the regression line is shown by dashed line

Figure 10
figure 10

Forest plot of multiple univariable meta-regression and subgroup analyses for sensitivity and specificity of C-SPECT. Each solid circle represents sensitivity and specificity of studies considering covariates as the dependent variable and horizontal lines represent 95% confidence interval. CI, confidence interval

Figure 11
figure 11

Forest plot of multiple univariable meta-regression and subgroup analyses for sensitivity and specificity of CZT SPECT. Each solid circle represents sensitivity and specificity of studies considering covariates as the dependent variable and horizontal lines represent 95% confidence interval. CI, confidence interval

Discussion

To our knowledge, this is the first meta-analysis comparing the diagnostic performance of C-SPECT and CZT-SPECT in detecting CAD in a large number of subjects undergoing stress MPI. We found a good diagnostic performance for the two gamma camera systems, with a slightly higher accuracy for CZT-SPECT.

MPI with SPECT is the most widely used nuclear cardiac imaging technique for the non-invasive assessment of cardiac disease including prognosis and choice of the most appropriate treatment strategies for patients with known or suspected CAD.59,60 Nevertheless, drawbacks such as time-consuming acquisition, different protocols, and radiation exposure still affect SPECT MPI. Refinements regarding iterative reconstruction algorithms,61 early imaging protocols,62 tracer development24 and hardware equipment have been strengthened. Novel gamma cameras with semiconductor CZT detector technology have been introduced.6 MPI acquired with CZT-SPECT has, compared to C-SPECT, an improved image quality but comparable diagnostic confidence.63 Two studies have directly compared the performance of C-SPECT and CZT-SPECT in the same patient population.5,64 Gimelli et al64 demonstrated that CZT-SPECT was superior to C-SPECT for detecting global and regional ischemia and identified a higher number of vessels with obstructive CAD. In a multicenter study, Neill et al5 reported similar results, with a superior sensitivity, specificity and accuracy (92%, 83% and 90% for CZT-SPECT vs 84%, 50% and 76% for C-SPECT).

To increase the level of validity of these results by combining data from multiple studies, we performed a meta-analysis of diagnostic test accuracy comparing indirectly CZT and C-SPECT studies. We identified 25 articles for C-SPECT and 15 articles for CZT-SPECT. The results showed that the accuracy of CZT-SPECT was slightly higher (P = .04) than that of C-SPECT; furthermore, the confidence and the prediction regions of summary ROC curve were narrower for CZT-SPECT than C-SPECT. The confidence region is based on the CI around the summery point and indicates that we would expect the ‘real value’ to be within that region 95% of the time. The prediction region around the summary point indicates the region where we would expect results from a new study in the future to lie.17 Also, the diagnostic OR of CZT-SPECT was greater than that of C-SPECT, confirming that CZT-SPECT might be more accurate in assessing CAD. In fact, reflecting the combination of sensitivity and specificity, the diagnostic OR can be regarded as a single measurement of diagnostic accuracy, with higher values indicate better discriminatory test performance.65 Thus, our results suggest a slightly higher global accuracy of CZT-SPECT, also if the differences between pooled sensitivity and specificity of the two gamma cameras were not statistically significant, probably due to the high between study heterogeneity for both C-SPECT and CZT-SPECT studies. The additional analysis performed excluding a C-SPECT study43 with poor performance showed that the area under the curve was still slightly lower as compared to that of CZT-SPECT. This result is not surprising, as Ueki et al43 analyzed only 31 patients and the weight of this study did not affect considerably the result of the meta-analysis.

The slightly higher accuracy of CZT-SPECT may be explained by the better resolution and the better contrast and image quality of the solid-state camera as compared to the C-SPECT system. The shorter acquisition times and the novel camera design, allowing semi-upright acquisition with the patient’s arms resting on the gantry, enhance patient comfort and potentially reduce the likelihood of motion artefacts. The short imaging time and potential to image a variety of radiotracers within 1 day allows flexibility in the choice of protocol, facilitating increased patient throughput. The intrinsic sensitivity of the system facilitates radioisotope dose reduction with a slight compromise on the length of acquisition time.9,64 However, the performance of C-SPECT may be improved with software upgrading. Newer types of reconstruction software, such as wide beam reconstruction and ordered subset expectation maximization, have been proposed for MPI with C-SPECT systems.66 These technologies reduce image noise by modeling it from spectral analysis of the projections, thereby improving image interpretation without affecting image resolution. These methods have already demonstrated comparable performance and image quality of conventional filtered back projection method allowing though for half-time or half-dose MPI.

From our study, it emerged a trend of asymmetry in funnel plot among C-SPECT studies, suggesting the presence of bias. On the other hand, no significant asymmetry emerged in funnel plot among CZT-SPECT studies. Publication bias is a possible cause of funnel plot asymmetry. However, others possible sources are poor methodological quality, leading to spuriously inflated effects in smaller studies; true heterogeneity, in which the effects differ according to study size; and sampling variation, that can lead to an association between the effect and its standard error.67

Moreover, there was a greater heterogeneity for both sensitivity and specificity among C-SPECT than CZT-SPECT studies, attributed to differences in the spectrum of patients and to difference across studies in the choice of the cut-off of abnormality. As the statistical heterogeneity was substantial, we performed meta-regression to identify potential sources of bias.

At meta-regression analysis, we failed to find a significant search of heterogeneity associated with demographic and clinical characteristics for C-SPECT and CZT-SPECT studies. Hence, other confounding variables are the sources of the heterogeneity observed; therefore, we also performed the diagnostic threshold analysis to investigate the sources of heterogeneity.

Limitations

A limitation of our meta-analysis is that C-SPECT studies were over-represented and more than 50% of the included CZT-SPECT were published during the last ten years (from 2009). Another limitation is that differences in the distribution of study and patient characteristics potentially affecting the diagnostic performance of the imaging modalities. As regard the distribution of study quality, most studies gave a clear description of participants, index, and reference tests. However, the high number of items that were scored “unclear” was probably due to different study design. For instance, the risk of bias associated with patient selection domain was often attributed to studies including both subjects with known and suspected CAD. In addition, differences in demographic and clinical features among patients included in the primary studies might produce measures of diagnostic accuracy that vary considerably. As concern the flow and timing domain, the unclear risk of bias was primarily caused by the lack of an explicit description of the length of the time interval between the index test and the reference standard. Ideally, the results of the index test and the reference standard should be collected at the same time. Despite the presence of unclear items, the overall quality of the studies was reasonable to perform an adequate analysis. Finally, we performed an indirect comparison of the diagnostic value in detecting CAD of C-SPECT and CZT-SPECT due to the lack of head-to-head studies addressing the diagnostic performance by the two gamma cameras in the same patient population.

New Knowledge Gained

The current study underscores the clinical use of the CZT-SPECT camera in detecting CAD as a valid replacement of conventional systems due to the important technological development of state-solid system.

Conclusion

The results of this meta-analysis indicate that C-SPECT and CZT-SPECT have good diagnostic performance in detecting angiographic proven CAD, with a slightly higher diagnostic accuracy of CZT-SPECT. This result supports the use of the novel gamma cameras in clinical routine practices also considering the improvements in acquisition time and radiation exposure reduction.