Introduction

In recent years, anti-tumour necrosis factor (TNF)α monoclonal antibody (mAb) has become the main treatment for Crohn’s disease (CD) [1, 2]. Concomitant use of immunosuppressants with anti-TNFα mAb is controversial. In patients with biologic-naïve CD, a combination of the chimeric anti-TNFα mAb infliximab (IFX) with azathioprine (AZA) has shown clinical benefit [3]. Conversely, the continuation of thiopurines > 6 months has shown no clear benefit over scheduled IFX monotherapy [4].

Adalimumab (ADA) is a human anti-TNFα mAb, so antigenicity (which is a way of inducing anti-drug antibodies) seems to be weaker than that of IFX. Long-term use of thiopurines is a risk factor for opportunistic infections and development of malignancies [5]. Thus, the benefits and risks of combination therapy are controversial [6, 7].

A prospective randomised controlled trial comparing the efficacy of ADA monotherapy and combination therapy with AZA for CD patients naïve to biologic agents and thiopurines (DIAMOND study) did not show a benefit of concomitant use of AZA in the prevalence of clinical remission at week 26 as the primary endpoint [8]. However, concomitant use of AZA showed marginal benefits in generation of anti-adalimumab antibodies (AAAs) [9] and endoscopic improvement [10]. Considering these findings (especially for ADA), a suitable time for combination with thiopurines could be important.

The aim of the present multicentre, prospective, randomised, open-label study was to evaluate if continuing use of standard thiopurines in patients who had been responding to a combination of ADA and thiopurine for ≥ 6 months adds to the long-term efficacy of scheduled maintenance compared with ADA monotherapy.

Materials and methods

Ethical approval of the study protocol

The study protocol was approved by the Ethics Review Board at each institution and registered publicly (UMIN000009596). All patients provided verbal and written informed consent for blood testing and collection of clinical data before enrolment.

Patients

The DIAMOND2 was a multicentre, randomised, prospective, open-label study. Patients with CD who were in corticosteroid-free clinical remission (CFCR; Crohn’s Disease Activity Index (CDAI) < 150 [11]) with concomitant use of thiopurines with scheduled maintenance of ADA for ≥ 6 months were enrolled. The diagnosis of CD was based on criteria determined by the Japanese Ministry of Health, Labour and Welfare. The age range of patients was 15–65 years. Patients met all other inclusion criteria and none of the exclusion criteria.

Exclusion criteria were patients: (1) with a contraindication to anti-TNF-α (severe infection, active mycobacterial infection, past/present history of demyelinating disease, or clinically evident congestive heart failure); (2) with a contraindication for thiopurines (peripheral white blood cell count < 3000/ml); (3) who were pregnant or breastfeeding; (4) with a malignant neoplasm; (5) with an interval of < 6 months after their latest surgical procedure; (6) with short-bowel syndrome; (7) with an ileostomy or colostomy; (8) regarded as being unsuitable for the DIAMOND2 by the attending physician.

Study protocol

Each patient had CFCR for ≥ 6 months with scheduled maintenance using ADA (40 mg, s.c., e.o.w.) combined with thiopurines.

Patients assigned to the ‘continue’ (Con) group were treated further with ADA combined with oral thiopurines [AZA or 6-mercaptopurine (6-MP)] for 52 weeks. Patients assigned to the ‘discontinue’ (Dis) group received scheduled maintenance therapy using ADA for 52 weeks (Supplemental Figure 1).

The DIAMOND2 was an open-label study, so patients in the Dis group were not treated with a placebo either. Patients in the Con group were treated with 25–50 mg/day of AZA or 30 mg/day of 6-MP. The dose of thiopurines was maintained during investigation periods. The maintenance dose of thiopurines was chosen based on reports in Japanese patients with inflammatory bowel disease [12]. Follow-up data were collected several weeks after a patient completed the study at week 52 or immediately after withdrawal from the DIAMOND2. Randomisation was done centrally at the Clinical Research Centre within Keio University (Tokyo, Japan) using computer-generated randomisation system. All data were collected at the Clinical Research Centre within Keio University. During the investigation period, mesalazine (p.o.) or sulfasalazine (p.o) was maintained at a stable dose.

Efficacy and safety

CDAI scores were determined at weeks 0, 4, 12, 26 and 52. ‘Clinical recurrence’ was defined as an increase in the CDAI ≥ 70 from baseline and CDAI ≥ 150. Ileocolonoscopy was done at baseline and at week 52. The mucosal lesion at each ileocolonoscopy was assessed according to the Simple Endoscopic Score for Crohn’s Disease (SES-CD) [13]. Endoscopic images at sites of mucosal involvement were recorded as stills, and the SES-CD at each colonoscopy calculated by the attending physician. ‘Clinical remission’ was defined as a CDAI score < 150 [11]. Mucosal healing was defined as SES-CD ≤ 2 [10, 14]. Blood samples were collected from patients in both groups at weeks 0, 4, 12, 26, and 52. Blood samples were also collected if a patient withdrew from the study. The 6-thioguanine (6-TGN) concentration in red blood cells (RBCs) was measured at baseline (Con and Dis) and week 52 (Con). Samples of whole blood were collected in heparinised tubes and centrifuged. After removing plasma, RBCs were hydrolysed with acid and extracted with phenylmercuric acetate/ethyl acetate. 6-TGN levels were measured by high-performance liquid chromatography [15]. Trough levels of ADA and AAAs in serum were measured at baseline and week 52 in both groups. Enzymelinked immunosorbent assay (ELISA) and radio immunoassay were used to measure trough serum adalimumab concentrations and to detect the presence of anti-adalimumab antibodies by Sanquin (Amsterdam, the Netherlands). Based on manufacturer’s recommendations, a serum AAA level of ≥ 12 μg/ml was regarded as ‘positive’ for the antibody [16, 17].

Primary and secondary endpoints

The DIAMOND2 was designed to be a superiority trial. The primary efficacy endpoint was CFCR prevalence at week 52. Secondary efficacy endpoints were: the percentage of patients who showed a negative C-reactive protein (CRP) level; prevalence of mucosal healing (SES-CD ≤ 2) at week 52; serological remission (CRP ≤ 0.10 mg/dl) at week 52. Safety endpoints were the prevalence of: any adverse event during the study period; loss of response to ADA.

Statistical analyses

For the primary endpoint (CFCR prevalence at week 52), it was estimated that 90 patients in each group would be needed to attain a power of 80% to detect a difference in CFCR prevalence of 20% between Con and Dis groups (alpha = two-sided 5%). This calculation was based on the assumption that the CFCR prevalence would be 80% in the Con group and 60% in the Dis group at week 52.

Comparison between groups was done by Welch’s t test for continuous values, and Fisher’s exact test for categorical values. The difference in risk, relative risk, and proportion of each group with confidence intervals (CIs) was calculated for the primary endpoint. Significance was at the 5% level. The statistical power based on the number of people enrolled in the DIAMOND2 was 73.9% with the assumption of CFCR of 95% and 60% (alpha = two-sided 5%). Analyses were undertaken using SPSS v20 (IBM, Armonk, NY, USA).

Results

Study population

During the predetermined period of recruitment from 1 January 2013 to 31 December 2016, 53 patients were enrolled in the DIAMOND2. One patient relapsed before randomisation and was excluded. Fifty-two patients were randomised to the Con group (n = 23) or Dis group (n = 29). One patient in the Con group was excluded because the patient withdrew consent. One patient in the Dis group was excluded because the study was not carried out. Finally, 22 patients in the Con group and 28 patients in the Dis group were assessed (Supplemental Figure 2).

Twenty-one patients in the Con group and 27 patients in the Dis group completed the study in 52 weeks. One patient in the Con group and two patients in the Dis group withdrew from the study owing to an increase in disease activity. No patients in either group withdrew from the study owing to adverse effects. Table 1 summarises and compares the demographic data and baseline clinical characteristics of the study population between Con group and Dis groups.

Table 1 Demographic and clinical characteristics of the patients

Clinical outcomes

As a primary endpoint, CFCR prevalence at week 52 was not significantly different between the two groups. It was 95.5% (21/22, 95% CI 76.5–99.1%) in the Con group and 92.9% (26/28, 72.0–98.9%) in the Dis group (P = 1.000) (Fig. 1a). Kaplan–Meier analyses indicated that the time to CFCR was not significantly different between the two groups (log rank: P = 0.704) (Fig. 1b). Risk difference and relative risk between the groups was 2.6% (95% CI − 10.3–15.5%) and 1.03 (0.90–1.18), respectively.

Fig. 1
figure 1

Corticosteroid-free clinical remission (CFCR) at week 52. a CFCR at week 52 in the Con group and Dis group (Welch’s t test). b Cumulative prevalence of CFCR shown by Kaplan–Meier analyses. Statistical analyses were by log-rank tests. Con continue group, Dis discontinue group

Endoscopic outcomes

At baseline, the total SES-CD score (mean ± SD) was 5.46 ± 5.66 in the Con group and 4.39 ± 4.04 in the Dis group (P = 0.45). The prevalence of mucosal healing (SES-CD ≤ 2) at baseline was 63.6% (14/22) in the Con group and 57.7% (15/26) in the Dis group (P = 0.771) (Table 1). In patients who received ileocolonoscopy at week 0 and week 52, the total SES-CD score at week 52 was 5.81 ± 7.06 in the Con group and 6.54 ± 7.34 in the Dis group, and they were not significantly different (P = 0.942) (Fig. 2a). The proportion of patients with mucosal healing at week 52 was 30% (6/20) in the Con group and 32% (8/25) in the Dis group (P = 1.000) (Fig. 2b). Change in the SES-CD score at week 52 from baseline (mean ± SE) was 0.57 ± 0.97 in the Con group and 2.52 ± 0.96 in the Dis group (P = 0.162) (Fig. 2c). Worsening of endoscopic activity was observed in two patients in the Con group and two patients in the Dis group. Endoscopic improvement was observed in one patient in the Con group (SES-CD decreased from 11 at baseline to 0 at week 52) (data not shown).

Fig. 2
figure 2

Endoscopic evaluation. a The SES-CD score (mean ± SD) at week 52 in Con and Dis groups. b Proportion of the patients with endoscopic remission (SES-CD ≤ 2) at week 52. c The mean change in the SES-CD score at week 52 from baseline (∆SES-CD) (mean ± SD). Statistical analyses were by Welch’s t test. Con continue group, D discontinue group, SD standard deviation

Biological outcomes

At week 52, the proportion of patients with a negative CRP level (< 0.10 mg/dl) was 76.2% (16/21) in the Con group and 84.6% (22/26) in the Dis group (P = 0.711) (Fig. 3a). The change in the CRP level (mg/dl) at week 52 from baseline (mean ± SD) was 0.600 ± 1.574 in the Con group and 0.098 ± 0.535 in the Dis group (P = 0.164) (Fig. 3b).

Fig. 3
figure 3

Analyses of serum CRP levels (mg/dl) at week 52. a Proportion of patients with serological remission (CRP ≤ 0.10 mg/dl) at week 52. b Mean change in the CRP level at week 52 from baseline (∆CRP) (mean ± SD). Statistical analyses were by Welch’s t test. Con continue group, D discontinue group, SD standard deviation

‘Triple remission’ at week 52

The percentage of patients with triple remission (i.e., CFCR, endoscopic remission and a negative CRP level) at week 52 was 22.7% (5/22) in the Con and 28.6% (8/28) in the Dis group (P = 0.750) (Supplemental Figure 3).

Trough levels of ADA and AAA in serum

The trough level (mean ± SD) of ADA in serum (μg/ml) at week 52 was 7.08 ± 2.96 in the Con group and 6.48 ± 3.16 in the Dis group (P = 0.515) (Fig. 4a). The proportion of patients with AAA positivity at week 52 was 10.0% (2/20) in the Con group and 20.0% (5/25) in the Dis group (P = 0.437) (Fig. 4b). The three patients who suffered clinical relapse had AAA negativity when they relapsed. Among 44 patients who had AAA negativity at baseline, switching to AAA positivity during the observation period in three patients (one in the Con group and two in the Dis group) was noted. The trough level (mean ± SD) of ADA in serum (μg/ml) at week 52 was 4.02 ± 2.81 in patients with AAA- positivity and 7.38 ± 3.29 in patients with AAA negativity (P = 0.014) (Fig. 5a). Especially in patients with a high titre of AAAs, the trough level (mean ± SD) of ADA in serum was very low (Fig. 5b).

Fig. 4
figure 4

Trough level of ADA in serum and AAA positivity. a Trough level of ADA in serum (μg/ml) at week 52 (mean ± SD). b AAA positivity at week 52. Statistical analyses were by Welch’s t test. Con continue group, D discontinue group, AAA anti-adalimumab antibody, SD standard deviation

Fig. 5
figure 5

Correlation of the trough level of ADA in serum with AAA. a Trough level of ADA in serum (μg/ml) at week 52 in AAA-negative and AAA-positive patients (mean ± SD). b Scatter plot shows the trough concentration of ADA and AAA titre in each patient. Statistical analyses were by Welch’s t test. AAA anti-adalimumab antibody, ADA adalimumab, SD standard deviation

Endoscopic remission at week 0 and clinical outcomes at week 52

As described above, continuation of thiopurines > 6 months offers no clear benefit over the scheduled ADA monotherapy. We performed a sub-analysis to find out whether endoscopic activity at week 0 influenced the clinical outcomes (CFCR, endoscopic remission, serological remission, triple remission) at week 52 (Table 2). Endoscopic remission at week 0 was involved in endoscopic remission (P < 0.001) and triple remission (P < 0.001) at week 52 (Table 2).

Table 2 Correlation between endoscopic remission at week 0 and outcomes at week 52

Adverse effects

No patients in the Con group and two patients in the Dis group reported adverse effects (common cold), and serious adverse events were not observed in either group (Supplemental Table 1).

Discussion

The DIAMOND2 did not reveal a clear benefit of continuation of thiopurines > 6 months over scheduled ADA monotherapy. There was no difference in the primary endpoint (CFCR prevalence at week 52) between patients who received further concomitant use of thiopurines and patients for whom thiopurines were withdrawn. At week 52, continuation of thiopurines did not show benefit with regard to the serum level of CRP, endoscopic activity, triple remission, or trough level of ADA in serum. Only a change in the SES-CD at week 52 from baseline tended to be higher in the Dis group. The trough level of ADA decreased in patients who were AAA positive.

The benefit of concomitant use of thiopurines in scheduled maintenance therapy using anti-TNFα mAb is controversial. In the Study of biologic and immunomodulator naive patients in Crohn’s Disease (SONIC) study, the combination of IFX with AZA showed clinical benefit with regard to the prevalence of clinical remission at week 26 and appearance of anti-IFX antibodies [3]. Conversely, combination of ADA with thiopurines in biologic-naïve CD patients (DIAMOND study) did not show benefit in clinical remission at week 26 as the primary endpoint [8]. However, sub-analysis of the DIAMOND study suggested that concomitant use of AZA showed some clinical benefit in terms of inhibition of AAA induction and endoscopic improvement [9, 10]. Those findings suggested marginal benefits of concomitant use of thiopurines with anti-TNFα mAbs, even in humanised mAb ADA, as induction therapy in biologic-naïve patients. With regard to generation of anti-drug antibodies, aggressive disease could inhibit the induction of anti-drug antibodies, whereas the risk of induction of anti-drug antibodies could be lower in patients with clinical remission. Hence, concomitant use of thiopurines with ADA in the acute phase of disease may be beneficial.

The benefits and risks of long-term concomitant use of immunosuppressants with anti-TNF mAbs in patients in clinical remission are also controversial. The effects of withdrawal of immunosuppressant therapy in people with quiescent CD are not known [18].

Long-term, strict immunosuppression could be a risk factor for infection and development of malignancy in patients with inflammatory bowel disease. Long-term concomitant use of immunosuppressants with anti-TNF mAbs increases the risk of infectious disease. In the Crohn’s Therapy, Resource, Evaluation, and Assessment Tool (TREAT) Registry, treatment with thiopurines was associated with more frequent serious infections (adjusted odds ratio (OR), 1.23; 95% CI 0.96–1.57) [19, 20]. With regard to lymphoma risk, the Cancer and Increased Risk Associated with Inflammatory Bowel Disease in France (CESAME) study [combination therapy: standardised incidence ratio, 10.2 (95% CI 1.24–36.9); thiopurine monotherapy: 6.86 (3.84–11.31)] and the Kaiser Permanente IBD Registry [combination therapy: 6.6 (4.4–8.8); thiopurine monotherapy: 1.4 (1.2–1.7)] showed a higher risk of lymphoma on combination therapy than for thiopurine monotherapy [19, 21, 22].

In analyses of 1594 patients with CD who participated in clinical trials using ADA, ADA monotherapy did not show a significant increase in the risk of non-melanoma skin cancer and other cancers, whereas concomitant use of thiopurines with ADA showed a significant increase in the risk of non-melanoma skin cancer and other cancers [5]. In this regard, Van Assche et al. suggested withdrawal of thiopurines in patients with CD in CFCR for ≥ 6 months with scheduled maintenance using IFX combined with thiopurines. They found that the clinical course (a change in IFX dosing, treatment interval, or cessation of IFX therapy) and endoscopic findings were not significantly different between the thiopurines continue and the discontinue groups, but the CRP level was significantly higher and the trough level of IFX in serum was lower in the discontinue group [4].

In the DIAMOND2, the proportion of AAA-positive patients at week 52 tended to be higher in the Dis group, but not significantly so. In fact, AAA negativity changed to AAA positivity during the observation period in only three patients. Conversely, the trough concentration of ADA in serum of AAA-negative patients was significantly higher than that in AAA-positive patients. In patients with a high titre of AAAs, the trough concentration of ADA in serum was extremely low. This observation may suggest that, with regard to the trough concentration of ADA in serum, the AAA titre is more important rather AAA positivity. Although the impact of the titre of anti-drug antibodies upon treatment efficacy is controversial, high levels of AAA were associated with increase in disease activity and decreased ADA trough concentration in previous reports [9, 23, 24].

Taken together, we believe that all patients who are in clinical remission do not necessarily need long-term use of thiopurines with ADA maintenance. The risk of induction of anti-drug antibodies is relatively higher in patients with active disease, so it may be reasonable for concomitant use of thiopurines with anti-TNF mAbs as induction therapy especially in complicated patients with severe disease who have the wild type of NUDT15 genotype (codon 139, Arg/Arg) [25, 26], whereas in patients who can maintain remission for > 6 months, thiopurines could be stopped to reduce the risk of infection and malignancy.

Our study had three main limitations. First, the actual number of registered cases in the DIAMOND2 was lower than that in the schedule, so our study was under-powered statistically. Enrolment of participants required more time than anticipated. Therefore, the initial enrolment period fixed to 12 months was twice extended to ultimately 30 months in an effort to include the required sample size. Further prolongation of inclusion period was financially not feasible especially due to the costs of the study, necessitating closure of the study before reaching the calculated number of patients. In addition, not all the patients who enrolled this study came from DIAMOND1 study. Therefore, there could be selection bias for enrolment. Second, we assumed the superiority of continuation of thiopurines. The DIAMOND2 was not powered to demonstrate subtle differences in clinical benefit between the two groups. However, the lower limit of the CI for CFCR of both groups was ~ 75%, and the CIs of the risk difference and risk ratio were within a narrow range. Therefore, the DIAMOND2 suggested that CFCR prevalence was sufficiently high and not significantly different in both groups (of course, it is not confirmatory). However, we believe that the clinical importance of the DIAMOND2 should be taken into consideration fully even though the response and number of enrolled cases in both groups were unexpected. The CFCR prevalence of both groups in the DIAMOND2 was consistent with that of the long-term efficacy and safety of adalimumab maintenance therapy in Crohn’s disease in a follow-on randomised controlled trial (CLASSIC II) undertaken to evaluate long-term efficacy and safety of ADA maintenance therapy in CD. In the CLASSIC II, patients who were in clinical remission at week 4 were followed up until week 56. At week 56, 79% of patients who were maintained with ADA (40 mg, e.o.w.) and 83% of patients who were maintained with ADA (40 mg) weekly were in clinical remission, in contrast to 44% of the placebo group. Concomitant use of thiopurines was 21% in the ADA (40 mg, e.o.w.) group [27]. In the DIAMOND2, the statistical power was ~ 75% even if the CFCR of 95% in each group and non-inferior margin of 12.5% were assumed (alpha = 10% one-sided). Third, the observation period of the DIAMOND2 was 52 weeks. We may have missed the disadvantage of withdrawal of thiopurines in this observation period if it had a gradual influence. Long-term, extensive observation of patients enrolled in the DIAMOND2 should provide the risk of withdrawal of thiopurines in ADA maintenance in CD.

Conclusions

In this prospective, controlled study, continuation of combination of ADA with thiopurines > 6 months was not superior to withdrawal of thiopurines. This observation suggests that further consideration of the thiopurine combination in ADA maintenance therapy for CD is necessary from the viewpoints of benefit and risk.