Introduction

Biologic agents, such as tumor necrosis factor alpha (TNF-α) inhibitors, are approved for the treatment of several rheumatic diseases including rheumatoid arthritis (RA), ankylosing spondylitis (AS), psoriasis arthritis (PsA), and juvenile idiopathic arthritis (JIA). Although biologic agents provide profound clinical benefits, an increased risk of tuberculosis infection was reported to be associated with TNF-α inhibitors [15].

Tuberculosis is a granulomatous disease caused by infection with Mycobacterium tuberculosis. Most of the individuals latently infected with M. tuberculosis will never progress to active disease. In those latently infected cases, the host’s immune system controls the bacilli in a state of non-replicating persistence [6]. The immunological studies found that the cytokine TNF-α plays a key role in granuloma formation and maintenance [7], and TNF-α inhibitors result in the disintegration of the granuloma and dissemination of M. tuberculosis [2, 8]. For the patients on anti-TNF-α therapy, the risk of latent tuberculosis infection (LTBI) progression to active disease would accordingly increase. Current clinical guidelines mandate the screening for LTBI prior to anti-TNF-α therapy [913]. However, no agreement was reached on the best methodology for LTBI screening.

The tuberculin skin test (TST) is the long-established method to identify LTBI due to its simplicity and efficiency; although it has several inherent drawbacks, such as cross-reaction with BCG vaccination or other environmental mycobacteria infection, phenomena of boosting, operator bias and variability in result interpretation [14, 15]. Recently, interferon-γ release assays (IGRAs) have emerged as an alternative for TST [16, 17]. IGRAs measure the immune response to TB-specific antigens either from peripheral blood lymphocytes (T-SPOT.TB; Oxford Immunotec Limited, Abingdon, UK) or whole blood (QuantiFERON-TB Gold [QFT-G] and QuantiFERON-TB Gold In-Tube [QFT-GIT]; Cellestis Limited, Carnegie). Moreover, many rheumatic patient candidates for anti-TNF-α therapy also accept other immunosuppressive therapy (IST), such as steroid and disease-modifying anti-rheumatic drugs (DMARDs). In that both TST and IGRAs measure the magnitude of an adaptive immune response, the testing results may be affected by immunosuppressive drugs or by the autoimmune disease itself. With so much confounding factors associated with both IGRAs and TST, more sufficient data are needed to draw a reliable conclusion on which test is better. To date, limited data from published studies investigated the performance of IGRA for detecting LTBI in rheumatic patients. Most of them had small sample sizes and appeared with controversial conclusions. Besides, many studies included rheumatic patients who had already received biological therapy. While in clinical practice, screening of LTBI in rheumatic patients before initiation of biologic agents is more useful.

With these uncertainties, we conducted a systemic review and meta-analysis to evaluate the performance of IGRAs and TST in diagnosing LTBI in patients with rheumatic diseases before initiation of biological therapy.

Materials and methods

Search strategy and study selection

The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) [18] compliant literature search strategy was performed. The electronic databases MEDLINE (1966 to February 2014) and EMBASE (1974 to February 2014) were searched by two reviewers (Q.R. and S.Z.). All titles and abstracts generated from the search strategy were independently reviewed. After the initial screening process, the full text of eligible articles was reviewed against the predefined inclusion criteria by two reviewers (Q.R. and S.Z.). In addition, we scrutinized the reference lists of each eligible paper for any omitted studies.

From all citations of published relevant articles and bibliographies of relevant reviews and guidelines for inclusion, studies were eligible for inclusion if they were original researches published in English and assessed the performance of commercial IGRAs (including QFT-G, QFT-GIT and T-SPOT.TB), and TST in rheumatic patients, and studies were excluded when evaluating an in-house or older-generation IGRAs, lack of data specific to patients with rheumatic disease, lack of sufficient data on desired outcomes, or including patients who had already received biological therapy, such as anti-TNF-α therapy.

Data extraction

Two reviewers (Q.R. and S.Z.) conducted the data extraction using a pre-constructed data table. The data collection included published year, country, number of patients, type of rheumatic disease, demographic characteristics (such as gender and BCG vaccination), IGRA methods (assay used, test vision, cut-off point used), TST methods (dose of purified protein derivative (PPD) used, cut-off point used), positivity of these tests, agreement between IGRA and TST, indeterminate results of IGRAs, and outcomes assessing the impact of IST on IGRA and TST results.

Quality assessment

The Quality Assessment of Diagnostic Accuracy Studies (QUADAS) quality assessment tool is a validated tool to evaluate the diagnostic accuracy [19]. However, since there is currently no “gold standard” for the diagnosis of LTBI, specific modifications were made [20]. The following aspects were evaluated including: (1) whether patients were ambulatory or inpatients diagnosed with rheumatic diseases; (2) whether the selection criteria was clearly described; (3) whether the IGRAs was performed before TST; (4) Whether commercial IGRAs were used according to the manuscript; (5) whether the same clinical data were available as in practice; (6) whether indeterminate results were reported; and (7) whether exclusions after enrollment were explained. We scored each of the QUADAS items as Yes (2 points), Unclear (1 point), or No (0 point).

Statistical analysis

Due to no gold standard in assessing the performance of IGRAs and TST, meta-analysis methods in two recent meta-analyses that evaluated IGRAs in HIV-infected individuals and in inflammatory bowel disease patients were used as references [20, 21]. The agreement between two IGRAs and the TST, the proportion of indeterminate IGRAs results, the association between risk factors of LTBI and test positivity, and the impact of IST on IGRAs and TST results were meta-analyzed. The odds ratio (OR) was used to measure the association between test positivity and risk factors of TB. The impact of IST on IGRAs and TST was also measured by pooled OR. Most of the analysis was conducted by groups according to the IGRA used. Continuity corrections were performed when zero events exist in one or both arms of an outcome. The level of statistical significance was established as P < 0.05. Ninety-five percent CIs were presented for each outcome. Heterogeneity was assessed using the I 2 statistics [22]. If I 2 > 25 %, the pooled estimate was obtained using a random effects model. Otherwise, fixed effects model was used. The pooled estimate was not calculated if only three or less studies were available. All analyses were conducted with R (v.3.0) software.

Results

Search results and study description

Totally, 476 articles were identified from the databases and 7 articles were identified from the reference lists of eligible articles. Forty-one studies were reviewed by full text and 11 studies met our inclusion criteria and the data were extracted (Online Resource 1). Of the 11 included articles, a total of 1940 cases from 10 countries were assessed (Table 1). Among them, the majority of the patients suffered from RA, AS, PsA, and other spondyloarthropathies. All these studies used both TST and QuantiFERON test (four QFT-G and seven QFT-GIT) prior to anti-TNF-α therapy. Four of these studies evaluated both QFT and T-SPOT.TB. Most articles satisfied the modified QUADAS items and the scores ranged from 8 to 14, with a mean score of 12.4 (Table 1). The major quality problems of these studies were the unclearly described selection criteria and the uncertain sequence of TST and IGRAs testing.

Table 1 Characteristics of included studies

Agreement between IGRAs and TST

Both QFT and T-SPOT.TB in these studies were performed according to the manufacturer’s instructions. Eight studies performed TST by Mantoux methods using two tuberculin units [2326, 28, 30, 31, 33], two studies used five tuberculin units [27, 32], and one used about three tuberculin units [29]. The cut-offs for positivity were selected differently according to their national guidelines (Table 1). The percentage of agreement between TST and QFT was available in nine studies (Fig. 1). And four of these studies also provide sufficient data to calculate the agreement between and TST and T-SPOT.TB. Overall, the pooled agreement was 72 % (95 % confidence interval (CI) 65–78 %) between TST and QFT, and 75 % (95 % CI 67–83 %) between TST and T-SPOT.TB. Random effects model was used for these pooled estimate because of the heterogeneity between studies (I 2 = 85.8 and 75.8 %, respectively). Three studies [25, 26, 31] also reported the concordance between two IGRA tests, which were 98.2, 81, and 88.9 %, respectively. Of the nine studies with sufficient data of discordant results [2426, 28, 30, 32, 33], seven reported higher proportion of TST+/IGRA− results (ranging from 6.1 to 32.2 %) compared to TST−/IGRA+ results (1.4 to 32.4 %) in individuals with both tests available.

Fig. 1
figure 1

Agreement between IGRAs and TST

Proportion of indeterminate results

Ten studies [2327, 29, 31, 32, 34] reported the proportion of indeterminate results of QFT (Online Resource 2). The pooled percentage of indeterminate results was 3 % (95 % CI 2, 4 %). Three studies which measured the indeterminate results of T-SPOT.TB reported that the proportion of indeterminate results ranged from 0 to 6 % [26, 31, 32].

BCG vaccination and TB risk factors

The pooled data showed that BCG vaccination was associated with TST positivity (pooled OR 1.64, 95 % CI 1.06, 2.53) (Fig. 2). In the analysis between TB risk factors and test results, we found that the IGRA positivity was associated with the presence of one or more risk factors for TB (pooled OR 4.49, 95 % CI 2.73, 7.39), including previous close contact with TB patients, birth or extended living in TB-prevalent area, and abnormal chest radiograph (Fig. 3). However, the TB risk factors analyzed in different studies were not consistent. No pooled OR was available between TST positivity and one or more risk factors for TB. Among all risk factors, abnormal chest radiograph was the most frequently investigated; however, no significant association was found between abnormal chest radiograph and IGRA test positivity (Supplementary Figure S2).

Fig. 2
figure 2

BCG vaccination status and test results of IGRAs and TST

Fig. 3
figure 3

Tuberculosis risk and test results of IGRAs and TST

Association between IST and test performance

Most patients were treated with steroid and/or DMARDs at the time of screening for LTBI prior to anti-TNF-α therapy. There were no studies assessing the impact of IST on agreement between IGRA and TST. Two studies [24, 32] reported that odds of concordance of QFT and TST were higher in individuals who were on steroid compared to those who were not (OR 3.57, 95 % CI 1.03, 12.5 and OR 1.15, 95 % CI 0.71, 1.85). In addition, individuals with indeterminate results were often excluded from analysis, resulting in the failure to assess the impact of IST on indeterminate results. Only two studies reported that administration of steroid was significantly associated with occurrence of indeterminate results of QFT [24, 28].

We assessed the impact of steroid and DMARDs on IGRA positivity and TST positivity separately. Nine studies [2328, 3133] provided the sufficient data to assess the impact of steroid on the IGRA positivity rate and pooled estimate was calculated. The negative QFT results were not significantly associated with the use of steroid (pooled OR 0.90, 95 % CI 0.63, 1.28) or DMARDs (pooled OR 0.96, 95 % CI 0.69, 1.33) (Fig. 4). Four studies [25, 26, 31, 32] provided sufficient data to access the impact of IST on the T-SPOT.TB results. Similarly, pooled estimates revealed that neither steroid (pooled OR 0.69, 95 % CI 0.38, 1.27) nor DMARDs (pooled OR 1.53, 95 % CI 0.98, 2.39) significantly affects positive T-SPOT.TB results (Fig. 5). In contrast, the pooled estimates from nine studies [24, 2629, 3133] indicated that TST positivity was significantly impacted by the use of steroid (pooled OR 0.45, 95 % CI 0.30, 0.69), but less impacted by the use of DMARDs (pooled OR 0.78, 95 % CI 0.50, 1.21) (Fig. 5). However, considerable heterogeneity between studies were seen in assessing the impact of steroid on TST (I 2 = 54.7 %).

Fig. 4
figure 4

Impact of immunosuppressive therapy on IGRA results. IST immunosuppressive therapy, DMARDs disease-modifying anti-rheumatic drugs

Fig. 5
figure 5

Impact of immunosuppressive therapy on TST results. IST immunosuppressive therapy, DMARDs disease-modifying anti-rheumatic drugs

Discussion

Lack of a gold standard in diagnosing LTBI, assessment on diagnostic performance of IGRAs and TST becomes knotty and contentious. Especially for the patients with autoimmune disease before biological therapy, prevention from developing active TB necessitates reliable diagnostic approaches to detect LTBI. We reviewed 11 original studies and meta-analyzed the performance of IGRAs versus TST among rheumatic patients prior to anti-TNF-α therapy and evaluated the impact of immunosuppressive therapy on both IGRAs and TST. We found that the concordances between IGRAs and TST were 72 and 75 % for QFT and T-SPOT.TB, respectively. The proportion of indeterminate results of QFT was 5 %. Positivity rate of TST, but not IGRA, was significantly higher in the patients with BCG vaccination history. Compared with IGRAs, TST positivity was more significantly influenced by the use of steroid.

The development of active disease from LTBI in patients treated with TNF-α inhibitors might be an important outcome to evaluate the test performance. However, only four of our included studies [25, 27, 30, 33] reported the follow-up data and nine active cases were reported from one study [30]. Data that focus on the predictive values of IGRAs in developing active disease among rheumatic patients were very limited. In addition, almost all patients with suspected LTBI received preventive therapy according to the current guidelines. Therefore, the capacity of IGRAs of predicting subsequent tuberculosis could not be well assessed.

The concordances between IGRAs and TST were 72 and 75 % for QFT and T-SPOT.TB, respectively, which indicates differences do exist between these two tests. TST is traditionally used to identify the LTBI. As expected, TST results were significantly associated with BCG vaccination in rheumatic patients just as in immune-complete subjects. For IGRAs, confounding factors related to BCG vaccination were avoided. False-positive TST will result in unnecessary anti-tuberculosis prophylaxis bringing about drug adverse events. In addition, when compared with TST, positive IGRA results were more closely related with having one or more TB risk factors.

The existence of indeterminate results is a considerable problem for IGRAs in the clinical practice. Indeterminate results was defined as: (1) the negative control tests positive regardless of the response to TB-specific antigens or (2) the positive control tests negative as does the response to TB-specific antigens. Actually, most indeterminate results in immunocompromised patients resulted from the negative response against mitogen which was used as positive control [35]. For rheumatic patients, the proportion of indeterminate result of QFT was lower compared with patients with inflammatory bowel disease and HIV infection [20, 21]. However, a study found that over 75 % initial indeterminate results of IGRA performed under routine conditions gave clear positive or negative results upon retesting [36].

An important concern for patients with rheumatic disease is that the immunosuppression due to disease itself and IST will impact on the performance of both tests. It has been shown that TST is more likely to produce false negative result in rheumatic patients than the general population due to the weakened cellular immune response. An early study explored the size of the PPD induration in patients with RA and found that the median size was significantly less than that in healthy control (4.5 vs. 11.5 mm, P < 0.01) [37]. In this study, the pooled estimates revealed that only TST was significantly suppressed by the introduction of steroid, which was related to the higher false negative rate of TST. The impact of steroid on IGRAs was less and did not reach statistically significance. In addition, negative effects of steroid varied considerably with respect to different kinds and doses of steroid used. One study revealed that oral prednisolone, not long-acting corticosteroids, severely suppressed the QFT-GIT and TST performance [28]. However, most studies did not perform such sub-analysis and therefore, further relevant study is recommended. Moreover, it is remained unclear if and when this immunosuppression reverts after withdrawal of steroid therapy. All analyses related to the effect of DMARDs on IGRAs and TST did not reach statistical significance.

Recommendations in screening LTBI in current guidelines vary on the subject of replacing TST with IGRAs or utilizing both tests [38]. The US Centers for Disease Control (CDC) do not explicitly address the choice of test for screening [39]. The European CDC recommends using IGRA and TST in combination for detecting LTBI [40]. The TBNET supported the use of IGRA or, as an alternative in individuals without a history of BCG vaccination, TST in screening adult candidates for TNF inhibitor [41]. In general, IGRA is a more preferred choice in most published national guidelines [38]. However, one major disadvantage of IGRAs is the relatively high cost, and the cost-effectiveness studies remain unclear among patients receiving anti-TNF-α therapy. Based on the available data in the present study, in rheumatic patients with previous BCG vaccination or currently on steroid therapy, IGRAs would be the better choice to identify LTBI by decreasing the risk of false-positivity and false-negativity. The negative results of tests, especially negative TST result should be interpreted with caution in patients treated with any steroid. More information is needed if the test results are negative, including chest radiograph, history of TB exposure, and other risk factors for TB.

There are some limitations in our study. First, without sufficient data, we were not able to assess the capacity of IGRAs to predict the active tuberculosis development. Second, there was considerable heterogeneity between studies, especially in assessing the pooled performance of TST. The heterogeneity could be related to different cut-offs and BCG vaccination status. Third, for study populations on varied regimens of DMARDs and steroid, we could not assess the effect of specific kinds of DMARDs or steroid.