Introduction

Rheumatoid arthritis (RA) is a systemic autoimmune disease characterized by inflammation of the synovial tissue, with an estimated prevalence of 1–2 % (DiPiro et al. 2005). RA results in pain, joint destruction and diminished function, which are associated with impaired health-related quality of life and work disability (Strand and Singh 2008; Smith et al. 2011; Odegård et al. 2005; Strand et al. 2012b). Usually, this chronic and destructive condition requires lifelong medication and, therefore, finding an effective treatment is important to reduce the significant burden not only to patients with RA but also society at large (Miranda et al. 2012).

Disease-modifying anti-rheumatic drugs (DMARDs) are defined as medications that slow or stop the progression of RA by rapid and sustained suppression of inflammation (O’dell 2004). Conventional DMARDs (cDMARDs) are generally used as a first-line treatment strategy, of which methotrexate is the most commonly used. For patients with inadequate response to cDMARDs, biologic DMARDs (bDMARDs) are available (Smolen et al. 2010; Singh et al. 2012). Based on the mode of action, bDMARDs are classified into tumor necrosis factor-α (TNF-α) inhibitors and non-TNF biologics. TNF-α inhibitors include adalimumab, etanercept, infliximab, certolizumab pegol, and golimumab. Non-TNF biologics include abatacept (T cell activation inhibitor), rituximab (B-cell depleting agent), and tocilizumab (interleukin-6 blocking agent) (Genovese et al. 2005; Cohen et al. 2006; Emery et al. 2008).

Most patients with inadequate response to cDMARDs use a TNF-α inhibitor as bDMARD treatment. For patients who fail to respond to the initial TNF-α inhibitor treatment, alternative TNF-α inhibitors (cycling in TNF-α inhibitors), or non-TNF biologics (switching to non-TNF biologics) are recommended (Smolen et al. 2010; Singh et al. 2012). However, there has been an ongoing debate as to which treatment option is more efficacious—cycling of TNF-α inhibitors or switching to non-TNF biologics—due to lack of clinical evidence. Meanwhile, the safety aspects of these agents have been assessed that they do not seem to be differences in several studies (Horton et al. 2010; Ruderman 2012; Smolen et al. 2013).

While randomized controlled trials (RCTs) have been performed to compare these agents with placebo, there have been no head-to-head trials directly comparing cycling of TNF-α inhibitors with switching to non-TNF biologics. In the absence of head-to-head RCTs, Bayesian network meta-analysis (NMA) allows for indirect comparisons of all relevant treatment options. NMA (also called multiple treatment comparison, or MTC) is a generalization of standard meta-analysis for pairwise trials to a simultaneous analysis of multiple pair-wise comparisons (Jansen et al. 2008). The Bayesian approach has an additional advantage because it allows the probabilities of the treatment options to be best to be computed. Therefore, Bayesian analyses may assist clinicians and reimbursement decision makers (Lu and Ades 2004; Jansen et al. 2008). The objective of this study was to compare the effectiveness of cycling in TNF-α inhibitors with switching to non-TNF biologics in patients who do not respond to treatment with a TNF-α inhibitor by conducting a systematic review of previous studies and performing Bayesian NMA.

Methods

Search strategy

We performed a systematic review of the literature to identify studies that have investigated the efficacy of bDMARDs approved to treat patients with RA who failed to respond to previous treatments with TNF-α inhibitors. The systematic review was conducted independently by two reviewers, using the MEDLINE and Cochrane library (CENTRAL: Cochrane central registration for controlled trials) databases, to identify studies published before and up to April 04, 2013. Searches included appropriate keywords and Medical Subject Headings (MeSH) for disease and bDMARD names, and were limited to human RCTs published in English.

Inclusion and exclusion criteria

According to European League Against Rheumatism (EULAR) and the American College of Rheumatology (ACR) guidelines, TNF-α inhibitors (adalimumab, certolizumab, etanercept, infliximab and golimumab) and non-TNF biologics (abatacept, rituximab, and tocilizumab) may be used in patients with inadequate response to a TNF-α inhibitor (Smolen et al. 2010; Singh et al. 2012). These biologic agents were included in the search. The inclusion criteria were adult patients with RA who showed inadequate response to a TNF-α inhibitor (TNF-IR patients), and who used a bDMARD with cDMARD combination therapy in double-blind RCTs with outcomes assessments at 6 months. Six months was chosen as the assessment period based on the ACR guidelines (Singh et al. 2012). Apart from RA, ankylosing spondylitis, psoriatic arthritis and connective tissue diseases and biologic mono-therapy were excluded from the search.

Data extraction and outcome measures

Two researchers extracted predefined data from studies included in the literature review, and quality assessment was performed using Cochrane’s Risk of Bias (Fig. 1). Disagreement was resolved through a third researcher’s assessment, to reach a consensus. Extracted data were reviewed and checked before analysis.

Fig. 1
figure 1

Quality assessment of included studies

The main outcome measures chosen were the ACR20/50/70 and the health assessment questionnaire (HAQ) improvement. ACR20/50/70 are defined as a 20, 50, or 70 % improvement in tender and swollen joints, respectively, and the same level of improvement in three of the following five variables: patient and physician global assessments of overall disease activity, patient evaluation of pain, a score of physical disability, and blood acute-phase reactants (Orme et al. 2012). The HAQ disability index is one of the most widely used self-assessment instruments for measuring functional disability in patients with a variety of rheumatic diseases, and is expressed on a scale of 0–3 units (0: no functional disability, 3: severe functional disability) (Krishnan et al. 2004).

The total number of patients, number of respondents achieving the ACR response, and mean HAQ improvement from baseline and standard deviations (SDs) were extracted. For golimumab, this analysis used HAQ change from baseline from CADTH therapeutic review (2010) because the study selected for golimumab, GO-AFTER trial, did not provide the HAQ change value. Missing SD values were imputed by the average of SDs reported for all the other RCTs. Demographic data such as age and gender, mean disease duration, proportion of female patients, and baseline HAQ score were recorded.

Statistical analysis

A Bayesian NMA was performed to simultaneously conduct indirect comparisons between treatments that were not directly compared and to allow all pair-wise comparisons. This approach provides the flexibility of including a wide range of data and borrowing strength across the whole evidence network, and therefore makes the best use of available data (Schmitz et al. 2012; Jang et al. 2013). Evidence structures for the analysis are shown in a network diagram (Fig. 2). All analyses were performed with WinBUGS 1.4.3 statistical software. The first 10,000 simulations were discarded to allow for model convergence and the subsequent 50,000 simulations were used to estimate posterior probabilities.

Fig. 2
figure 2

Network diagram of evidence. There was no included study on adalimumab, cetolizumab, etanercept and infliximab

In our analysis, treatment effect parameters were modeled by using non-informative (vague) prior distributions that were normally distributed with a mean of 0 and a variance of 10,000. For the dichotomous ACR20/50/70 endpoints, NMA calculated odds ratios (ORs) and treatment effect as a percentage of patients who experienced an event. For the continuous HAQ score data, an NMA model was fitted to differences in HAQ improvement among various treatments. Posterior medians with 95 % credible intervals (CrIs) were reported as the best estimate for the central value, since means may be overly influenced by outliers. Additionally, posterior probabilities were calculated using posterior probability distributions for hypothesis testing. A probability of 90 % suggests that a treatment is more effective than the comparator. The probability of OR greater than one for ACR response was also computed. Bayesian methodology allows conclusions to be drawn regarding the probability that the ORs are greater than one in cases in which the associated 95 % CrI includes one (Moran et al. 2010). Similarly, the probability of a difference in HAQ score change of less than zero was computed and interpreted.

Results

The initial search yielded a total of 853 studies. Of these, 28 remained after 261 duplicates and 564 studies that did not meet the inclusion criteria were excluded based on a review of titles and abstracts (Fig. 3). After the review of full articles, another 22 studies were removed; 10 studies did not provide the main outcome measures of interest, six were duplicate studies, three did not evaluate the intervention of interest, and the remainder did not have the targeting study population, study objective, or outcome assessment time. As a result, a total of 6 studies were selected for analysis; one study of golimumab (Smolen et al. 2009), one study of rituximab (Cohen et al. 2006), two studies of abatacept (Genovese et al. 2005; Westhovens et al. 2006), and two studies of tocilizumab (Emery et al. 2008; Strand et al. 2012a). Two of the abatacept studies analyzed data from the ATTAIN trial. Genovese et al. (2005) included data for the ACR response but did not provide data for the HAQ change from baseline. Therefore, we included Westhovens et al. (2006) to provide the latter. For the same reason, we selected two tocilizumab studies that were based on the RADIATE trial. Golimumab was the only TNF-α inhibitor and the rest were non-TNF biologics.

Fig. 3
figure 3

Flow diagram of studies with inclusion/exclusion

All of the included studies were conducted in samples of patients who failed initial TNF-α inhibitor treatment, and baseline characteristics were similar across all studies (Table 1). The mean age of patients was 53.6 years. All of the studies had high percentages of females (74–84 %) and the mean disease duration was over 9 years. The baseline HAQ score ranged from 1.6 to 1.9. Based on these characteristics, we conducted analyses using the fixed-effect model.

Table 1 Baseline characteristics of included studies

ACR20/50/70 results are shown in Tables 2 and 3. The proportion of patients who achieved ACR20 was highest for tocilizumab (62.4 %; 95 % CrI 49.9–74.0 %), followed by rituximab (47.0 %; 95 % CrI 37.7–56.6 %), abatacept (43.7 %; 95 % CrI 32.9–55.4 %) and golimumab (32.1 %; 95 % CrI 22.3–44.0 %), and lowest for placebo (15.5 %; 95 % CrI 12.8–18.5 %). Similarly, the ACR50 effectiveness measure was highest for tocilizumab and lowest for placebo. Rituximab had the highest proportion of patients who achieved ACR70.

Table 2 Network meta-analysis results of ACR20/50/70: comparison of Placebo versus bDMARD
Table 3 Network meta-analysis results of ACR20/50/70: comparison of TNF-α inhibitor versus non-TNF biologics

Table 3, derived from the results in Table 2, summarizes ORs for non-TNF biologics in comparison to golimumab, a TNF-α inhibitor. For ACR20, abatacept had OR of 1.639 [95 % CrI 0.786–3.408; P(OR > 1) = 90.7 %], rituximab 1.871 [95 % CrI 0.937–3.725; P(OR > 1) = 96.2 %], and tocilizumab 3.52 (95 % CrI 1.567–7.946; P(OR > 1) = 99.9 %). The posterior probabilities (OR > 1) of all non-TNF biologics were over 90 %, suggesting that these agents were more effective. For ACR50, ORs were 1.623 [95 % CrI 0.454–6.247; P(OR > 1) = 72.2 %], 1.702 [95 % CrI 0.558–5.087; P(OR > 1) = 83.0 %], and 2.552 [95 % CrI 0.752–9.1; P(OR > 1) = 93.3 %] for abatacept, rituximab and tocilizumab, respectively. Based on the probability of OR > 1, tocilizumab has clear benefits in comparison to golimumab, a TNF-α inhibitor. For ACR70, ORs were 2.048 [95 % CrI 0.361–16.47; P(OR > 1) = 78.4 %], 3.876 [95 % CrI 0.685–35.37; P(OR > 1) = 93.5 %], and 3.107 [95 % CrI 0.532–25.49; P(OR > 1) = 89.2 %] for abatacept, rituximab and tocilizumab, respectively. In this case, rituximab was shown to be more effective than the TNF-α inhibitor, based on the probability of OR > 1.

HAQ score change results are shown in Tables 4 and 5. The magnitude of change was lowest for placebo at −0.078 (95 % CrI −0.115 to −0.042) and increased from golimumab (−0.218; 95 % CrI −0.325 to −0.112), rituximab (−0.378; 95 % CrI −0.467 to −0.29), tocilizumab (−0.418; 95 % CrI −0.525 to −0.311), and abatacept (−0.478; 95 % CrI −0.575 to −0.381). The probability of being best among five treatments was highest for abatacept at 74.4 %. Comparisons of each bDMARD with placebo showed that the magnitude of the change was highest for abatacept [−0.400; 95 % CrI −0.499 to −0.299; P(difference < 0) = 100 %] followed by tocilizumab [−0.340; 95 % CrI −0.453 to −0.227; P(difference < 0) = 100 %] and rituximab [−0.300; 95 % CrI −0.397 to −0.203; P(difference < 0) = 100 %], and lowest for golimumab [−0.140; 95 % CrI −0.255 to −0.026; P(difference < 0) = 99.2 %].

Table 4 Network meta-analysis results of HAQ change from baseline: comparison of Placebo versus bDMARDs
Table 5 Network meta-analysis results of HAQ change from baseline: comparison of TNF-α inhibitor versus non-TNF biologic

Table 5, derived from Table 4, presents the results of comparing each non-TNF biologic drug to golimumab, a TNF-α inhibitor. The magnitude of the difference in HAQ change was highest for abatacept [−0.260; 95 % CrI −0.411 to −0.107; P(difference < 0) = 100 %], followed by tocilizumab [−0.200; 95 % CrI −0.36 to −0.039; P(difference < 0) = 99.3 %]. It was lowest for rituximab [−0.160; 95 % CrI −0.31 to −0.01; P(difference < 0) = 98.2 %]. Based on posterior probabilities, non-TNF biologics improved HAQ scores compared with the TNF-α inhibitor.

Discussion

We found that the ACR responses of switching to non-TNF biologics were higher than those with cycling of alternative TNF-α inhibitors, even though the superior agent identified based on the probability of being best was different for each category. According to calculations of posterior probabilities of OR > 1, all three non-TNF biologics had higher proportions of patients who achieved ACR20 response than the TNF-α inhibitor. Among all non-TNF biologics, tocilizumab showed superior benefits in ACR50 and rituximab had the best effectiveness in ACR70, compared with the TNF-α inhibitor. Similarly, switching to non-TNF biologics resulted in greater changes in HAQ outcomes, compared with alternative TNF-α inhibitor cycling. In addition, in evaluations of posterior probabilities of differences less than zero, all of the non-TNF biologics were also shown to be more effective than the TNF-α inhibitor.

Our findings are consistent with those of several previous studies. The CADTH therapeutic review comparing three bDMARDs (abatacept, rituximab, and golimumab) concluded that based on RCTs, there are benefits associated with switching to non-TNF biologics rather than cycling of TNF-α inhibitors in TNF-IR patients (CADTH therapeutic review 2010). Several observational studies also are in good accord with our results (Chatzidionysiou and Van Vollenhoven 2013; Soliman et al. 2012; Gomez-Reino et al. 2012). Chatzidionysiou and Van Vollenhoven (2013) concluded that TNF-IR patients showed better overall results when treated with rituximab than with another TNF-α inhibitor. Soliman et al. (2012) also suggested that switching to rituximab may be more beneficial than cycling to an alternative TNF-α inhibitor in TNF-IR patients.

Although a number of indirect comparisons of bDMARDs have been performed, most previous studies dealt with inadequate response to cDMARDs, and only a few studies have been performed in TNF-IR patients (Guyot et al. 2012; Jansen et al. 2008; Lee et al. 2008; CADTH therapeutic review 2010; Devine et al. 2011; Launois et al. 2011; Salliot et al. 2011; Orme et al. 2012; Schmitz et al. 2012). Salliot et al. (2011) compared the efficacy of bDMARDs in TNF-IR patients using meta-analysis and indirect comparisons. This approach is useful for comparing two treatment strategies that have a common comparator, but is not appropriate for simultaneously comparing multiple treatment strategies. The CADTH therapeutic review (2010) could not include all recommended treatments because of the timing of the analysis, nor the HAQ change from baseline as an efficacy outcome measure, which recently began to be reported in the majority of clinical studies (Schmitz et al. 2012). The present study is the first NMA to analyze ACR response and HAQ score change for currently available agents in TNF-IR patients with RA.

This study has some limitations. First, there was limited clinical evidence for the efficacy of biologic agents in TNF-IR patients although a comprehensive search strategy was conducted to identify all relevant evidence. There was only one RCT available for each variable. Furthermore, there was no RCT regarding cycling in other TNF-α inhibitors such as etanercept, infliximab, and adalimumab in TNF-IR patients. Analysis of cycling in TNF-α inhibitors was restricted to only golimumab, for which such data are available. Therefore, the results of this analysis should be interpreted with caution and further studies on the other TNF-α inhibitors are needed. Second, the durations of included trials were too short to assess long-term benefits and harms in RA patients. The delayed and rare effects of biologic agents would not have been detected in the short-duration trials. Since RA is a chronic disease requiring prolonged treatment, further studies to evaluate long-term efficacy and safety are needed.

According to the clinical evidence to date, our findings suggest that non-TNF biologic agents such as rituximab, abatacept, and tocilizumab are more effective than TNF-α inhibitors for the treatment of RA patients after failure of initial treatment with a TNF-α inhibitor. However, more clinical evidence is needed in the future to recommend the best treatment options.