Introduction

Ankylosing spondylitis (AS) is a chronic systemic arthritis with a strong association with HLA-B27, characterized by the presence of enthesitis and arthritis of the axial joints [1]. AS affects males more often than females, with a male-to-female ratio of approximately 3 to 1. It is a disease of young adults with the peak incidence between 20 and 30 years old [2,3,4].

Non-steroidal anti-inflammatory drugs (NSAIDs) are the first line pharmacological therapy for AS. The efficacy of NSAIDs to improve AS symptoms and to slow down radiographic progression has been demonstrated in several clinical studies [5, 6]. Nonetheless, response to NSAIDs is not universal with the failure rate of approximately 20–30% [6]. In addition, side effects from NSAIDs, such as gastrointestinal ulcer, acute kidney injury, and cardiovascular disease, are common and lead to discontinuation of the medications in significant amount of patients [7,8,9]. Thus, non-NSAIDs treatments are often required. Unfortunately, studies have shown that traditional non-synthetic disease-modifying anti-rheumatic drugs (DMARDs), such as methotrexate and sulfasalazine, are not effective for AS [10, 11].

With the better understanding of the molecular pathogenesis of AS and other systemic autoimmune disorders, over the past few decades, a novel class of medication called biologic agent has been developed. In early 2000s, tumor necrosis factor (TNF) inhibitors were initially approved for treatment of rheumatoid arthritis (RA) and were subsequently approved for treatment of AS. The current American College of Rheumatology (ACR) guideline for management of AS [12] recommends treatment with TNF inhibitors for patients who fail or could not tolerate NSAIDs. The guideline does not prefer a specific TNF inhibitor over the others as there is no head-to-head randomized trial comparing efficacy between two TNF inhibitors. In addition, a 2015 indirect comparison meta-analysis from the Cochrane collaboration did not find a significant difference for the efficacy (defined by Ankylosing Spondylitis Assessment Study [ASAS] group response criteria) between older TNF inhibitors (infliximab, adalimumab, golimumab, and etanercept) [13].

Since the publication of that meta-analysis, studies of one more TNF inhibitor (certolizumab) and few non-TNF inhibitor biologic agents (secukinumab, apremilast, and tofacitinib) have been published. Those trials have demonstrated the superior efficacy of the medications compared to placebo. Whether these newer agents are more effective compare with older TNF inhibitors is not known due to the lack of head-to-head controlled trial. The current study aims to compare the efficacy of certolizumab and non-TNF inhibitor biologic agents to older TNF inhibitors in patients who are biologic agent-naïve using indirect comparison technique.

Material and methods

Search strategy

An experienced medical librarian (PJE) in consultation with the two investigators (P.U. and M.K.) searched for published studies indexed in Ovid Medline, Ovid CENTRAL, and Ovid EMBASE database from inception to January 2017 using the search terms described in the supplementary data 1. These terms included the controlled vocabulary of each database and text words (names of individual biologic agents and terms for ankylosing spondylitis). No language limitation was applied. The search retrievals were imported into EndNote X7, and duplicates removed. Search in clinicaltrials.gov was also performed to look for any additional unpublished studies. The bibliographies of selected review articles and the previous meta-analysis by the Cochrane collaboration were also manually searched.

Inclusion criteria

The following criteria were used to determine the eligibility of each study. (1) Eligible studies had to be randomized controlled trials (RCTs). (2) They had to compare the efficacy of biologic agents to placebo in patients with active AS who have failed or could not tolerate NSAIDs therapy. (3) Duration of studies was between 12 weeks to 30 weeks. (4) Ankylosing Spondylitis Assessment Study group response criteria 20 (ASAS20) was the primary or one of the major secondary outcomes. ASAS20 response is defined as at least 20% improvement in at least three of our four evaluated domains (patient global, pain, function, and inflammation) without worsening of more than 20% of the remaining domain [14]. The same two investigators independently determined the study eligibility. Different determinations were resolved by discussion.

Data extraction

A standardized data collection form was used to extract the following information from each study: first author, title of the article, year of publication, countries where the study was conducted, study design, inclusion and exclusion criteria, duration of treatment and follow-up, number of participants in treatment and placebo arm, baseline characteristics of participants, study interventions, concomitant treatments, and number of participants who achieved ASAS20 response in each arm.

To ensure the accuracy of the data extraction, this process was also independently performed by the two investigators. Any discrepancy was resolved by referring back to the original studies.

Statistical analysis

Data analysis was performed using Review Manager 5.3 software from the Cochrane Collaboration (London, UK). If at least two RCTs were available for a given biologic agent, the pooled odds ratio (OR) of achieving ASAS20 response and 95% confidence interval (CI) across studies were calculated using a random effect, Mantel–Haenszel analysis [15]. Effect estimates from intention-to-treat analysis were used in this meta-analysis. Random effect model, rather than fixed effect model, was used due to the difference in baseline characteristics of participants in each study. Cochran’s Q test was used to assess statistical heterogeneity of the ASAS20 response rate for each biologic agent. This statistic was complemented with the I 2 statistic, which quantifies the percentage of total variation across studies that is due to true heterogeneity rather than chance. A value of I 2 of 0 to 25% represents insignificant heterogeneity; >25% but ≤50%, low heterogeneity; >50% but ≤75%, moderate heterogeneity; and >75%, high heterogeneity [16].

Indirect comparison technique as described by Bucher et al. [17] and Song et al. [18] was then utilized to compare the relative efficacy of these biologic agents. This indirect comparison is made through a common comparator (placebo group). The efficacy of two biologic agents was considered significantly different if the 95% CI did not contain OR of one (which would correspond to the p value of less than 0.05).

Evaluation for bias

Risk of bias for individual study was evaluated in six domains including random sequence generation, allocation concealment, blinding of participants and personnel, blinding of outcome assessment, incomplete outcome data, and selective reporting. Visualization of funnel plot and Egger’s regression test were used for the evaluation of publication bias. Comprehensive Meta Analysis version 2.2 software (NJ, USA) was used to perform the Egger’s regression test.

Results

Systematic review of the literature

The search strategy yielded 698 potentially relevant articles (400 articles from EMBASE and 298 articles from Medline). After exclusion of 278 duplicate articles, 420 articles underwent title and abstract review. Three hundred and ninety articles were excluded at this stage as they were clearly not RCTs of biologic agents in AS, leaving 30 articles for full-length article review. Eleven of them were excluded at this stage as they were open-label extension phase of the original RCTs. One study (which is the only available study on apremilast) was excluded as it included both biologic agent experience and naïve patients and did not report ASAS20 response among the subgroup of patients who were biologic agent naïve [19]. Thus, 18 RCTs met the eligibility criteria and were included in our data analyses [20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37]. Additional search in clinicaltrials.gov and bibliographies of selected articles did not yield any other additional studies. The literature review process is summarized in Fig. 1. The methodology and baseline characteristics of participants of the included studies are illustrated in Table 1. It should be noted that the inter-rater agreement for eligibility of studies was high with the kappa statistics of 0.62.

Fig. 1
figure 1

Outline of literature review and study identification process

Table 1 Characteristics of included RCTs

Efficacy of biologic agents in active AS

We included 14 trials of older TNF inhibitors (2321 patients) [20,21,22,23,24,25,26,27,28,29,30,31,32,33], two trials of secukinumab (405 patients) [34, 35], one trial of certolizumab (142 patients) [36], and one trial of tofacitinib (103 patients) [37]. Baseline characteristics of participants were similar across these trials with similar female-to-male ratio, average age, and baseline disease activity as reflected by similar Bath Ankylosing Spondylitis Disease Activity Index (BASDAI). All studies used modified New York criteria to classify participants with AS. The definitions of active AS were consistent across studies (i.e., BASDAI ≥4 and spinal pain VAS ≥ 3 or 4). All studies allowed concomitant use of stable dose of NSAIDs, DMARDs, and steroid at the dose of not more than 10 mg daily of prednisone or equivalent. Nonetheless, the duration of disease varied considerably across the studies, ranging from 1.5 to 18.7 years.

First, the results of 14 trials of older TNF inhibitors were pooled together. The pooled OR of achieving ASAS20 response among older TNF inhibitor-treated patients compared with placebo-treated patients was 4.31 (95% CI, 3.57–5.20). The statistical heterogeneity was low with I 2 of 0%. Forest plot of older TNF inhibitors is shown as Fig. 2. Funnel plot was used to evaluate publication bias. The plot was symmetric and did not provide a suggestive evidence of publication bias (supplementary Fig. 1). Egger’s regression test was also not statistically significant (p = 0.56) which did not suggest the presence of publication bias.

Fig. 2
figure 2

Forest plot of older TNF inhibitors

Second, the results of two trials of secukinumab were pooled together. The pooled OR of achieving ASAS20 response among secukinumab-treated patients compared with placebo-treated patients was 3.45 (95% CI, 2.23–5.35). The statistical heterogeneity was low with I 2 of 0%. Forest plot of secukinumab is shown as Fig. 3. Evaluation for publication bias for secukinumab was not performed as there were only two eligible studies.

Fig. 3
figure 3

Forest plot of secukinumab

The four treatments were then compared to each other using placebo as the common comparator. The OR from the certolizumab study and the OR from the tofacitinib study were used for this analysis to indirectly compare with the aforementioned pooled ORs of older TNF inhibitors and secukinumab. There was no significant difference in any comparisons with the p values ranging from 0.12 to 0.74. The ORs with the corresponding 95% CIs and p values for every comparison are shown in Table 2.

Table 2 Indirect comparison between four treatments

Risk of bias

Risk of bias for individual study is shown in supplementary Fig. 2. The risk was low except for unclear risk of selection bias as most studies did not report the process of randomization in detail.

Discussion

Over the past three decades, biological agents were discovered and approved for clinical use. This meta-analysis aimed to answer a common clinical question in everyday practice. What would be the most effective biological agent for AS after the patients fail or could not tolerate NSAIDs? As there is no available direct head-to-head comparison between those agents, indirect comparison technique was utilized. Older TNF inhibitors, secukinumab, certolizumab, and tofacitinib were compared and their likelihood of achieving ASAS20 response was not significantly different from each other. Thus, from an efficacy standpoint, any one of them could be used as the first line therapy following NSAIDs failure. Of course, safety profile and cost-effectiveness need to be considered as well. For instance, congestive heart failure or history of multiple sclerosis is contra-indications for the use of TNF inhibitors [38]. As such, patients with these conditions should receive either secukinumab or tofacitinib.

Nevertheless, we acknowledge that there are several limitations in this study. Therefore, the results should be interpreted with caution.

The first limitation is inherent to indirect comparison technique as this analysis assumes that the common comparator (in this case, placebo) is transitive, which means that the placebo arms are adequately similar across the included RCTs [39]. This assumption is not always true if characteristics at study entrance of participants, additional treatments, compliance, and follow-up protocol are not similar across studies which would result in uneven distribution of certain confounders or effect modifiers across sets of comparisons. This uneven distribution can still occur even though this study included only RCTs since participants are randomized to treatment/placebo arms within a single trial but are not randomized to different trials.

The second limitation is related to the number of included studies as there is only one study available for certolizumab and tofacitinib. Therefore, the comparisons relied on limited number of participants and it is possible the analyses were underpowered to detect statistical significance. For instance, the upper bound of the CI of the OR of indirect comparison between older TNF inhibitors and certolizumab was 3.94 which means that the odds of achieving ASAS20 response may be as high as four times higher by older TNF inhibitors than certolizumab. Nonetheless, with the wide CI, statistical significance could not be established.

In conclusion, the current meta-analysis demonstrated that the odds of achieving an ASAS20 response in patients with AS who did not have an adequate response to, or could not tolerate, NSAIDs were not significantly different between older TNF inhibitors, secukinumab, certolizumab, and tofacitinib. However, the interpretation of the results was limited by the small number of included RCTs. Head-to-head RCTs are still required to establish the comparative efficacy.