Introduction

Anti-TNF therapy has been first developed 20 years ago and is now approved for the treatment of rheumatoid arthritis (RA), inflammatory bowel diseases (IBD), psoriasis, ankylosis spondylitis, juvenile idiopathic arthritis (JIA), and hidradenitis suppurativa [1]. In addition, TNF blockade is also used in Behçet disease, non-infectious ocular inflammation, adult-onset Still disease, and pyoderma gangrenosum [2]. Currently, there are five biological agents targeting the TNF pathway approved for treatment: infliximab, a chimeric antihuman monoclonal antibody; etanercept, a TNFR2 dimeric fusion protein associated with an IgG1 Fc; adalimumab and golimumab, two fully human monoclonal antibodies; and certolizumab, a PEGylated Fab fragment. Close monitoring of patient’s anti-TNF trough levels has proved useful in many ways, as well as cost-effective [3, 4]. Low infliximab trough levels are associated with future loss of response to treatment [5,6,7], while high trough levels correlate with future response to treatment during intensification therapy [7, 8]. Besides, relying not only on the clinical decision but also on drug monitoring and clinical data has been proved more effective in patients with Crohn’s disease (CD) who lose response to anti-TNF treatment, and is associated with better endoscopic outcomes and mucosal healing [9, 10]. Several algorithms based on therapeutic drug monitoring (TDM) have been proposed and assessed, suggesting efficiency in treatment de-escalation [11] and follow-up [12, 13]; however, those studies were mostly retrospective. A prospective study revealed the importance of therapeutic drug monitoring (TDM) with drug-tolerant assays to evaluate the patient clinical remission and guide the therapeutic decisions. The prospective TAXIT study showed that the rate of clinical relapse at 1 year in the TDM arm (treat-to-target) was significantly lower [14]. In addition, the approach enabled clinicians to reduce drug dosage among patients whom blood levels of infliximab were > 7 µg/mL, without hindering clinical response.

As of now, a moderate number of ELISA tests are available on the market, with comparability studies showing mixed results [15,16,17,18,19]. For instance, Lee et al. compared three different ELISAs and concluded that the Theradiag Lisa-Tracker was the most accurate one [20] but presented some false-positive samples in Vande Casteele et al.’s work [19]. All in all, currently available tests seem to be interchangeable, but it is recommended that one test should be used for the entire follow-up of one patient [21]. The need for harmonization grows as there currently is no clear recommendation on which technique to use.

Another point to consider is that the ELISA techniques require an incompressible amount of time (turnaround time of 4–8 h), whereas early dosages of anti-TNF therapies yield valuable information regarding endpoint clinical response [22,23,24]. In this context, Lu et al. developed a point-of-care technique which allowed clinicians to get the drug level within 1 h, and could then tailor drug dosage directly [25]. While the technique seems promising as it allows rapid and accurate drug monitoring, it still requires serum instead of total blood, which hinders the technique from being used by clinicians directly on site. The use of dry blood spot (DBS) via a capillary puncture could be a way of bypassing this issue [26] and might be used in the future either with ELISAs or with the point-of-care technique described above [27]. Two point-of-care techniques now coexist on the market (BÜHLMANN Quantum Blue® Infliximab and Ridaquick IFX from R-Biopharm), and both of them consist of a sandwich immunoassay. Recombinant tumor necrosis factor alpha (TNF-α) is conjugated to gold colloids, and infliximab present in the sample will bind to the conjugate and is revealed with a highly specific monoclonal antibody, and while several studies compared the robustness of those techniques versus the ELISA gold standard [28, 30] with both infliximab and adalimumab [29], none of them (to our knowledge) tested them against one another.

Here, we compared the BÜHLMANN Quantum Blue® Infliximab and the R-Biopharm Ridaquick® against one another and against most of the ELISAs on the market (Theradiag, Sanquin, R-Biopharm, and Grifols). Primary objective was to assess whether these techniques were non-inferior to ELISA technique and which of them was the most robust.

Materials and Methods

Patients

Residual trough level of infliximab and anti-infliximab antibodies (ATI) was measured in the serum of 90 CD patients followed in day hospital of gastroenterology. Eighty-five were undergoing a treatment with Remicade®, and five were treated with adalimumab. We also included to the starting population ten randomly selected patients with high IgM rheumatoid factor blood levels as controls.

Measurement of IFX Trough Levels and Antibodies to Infliximab (ATI)

We have used several commercially available ELISAs from different suppliers to monitor infliximab trough levels (Theradiag, R-Biopharm, Sanquin, and Grifols) and for the anti-infliximab antibodies measurement (drug-sensitive assays; R-Biopharm, Grifols, and Theradiag). Two point-of-care techniques were also tested and compared to ELISAs, supplied by R-Biopharm (RIDA®QUICK IFX Monitoring) and BÜHLMANN (Quantum Blue®) (Table 1).

Table 1 Description of the different assays used for the measure of IFX and ATI trough levels

Statistical Analysis

Quantitative data were summarized by median and interquartile range [IQR]. A D’Agostino and Pearson omnibus normality test was used to assess the normality of continuous variables. Quantitative differences between the results of serum IFX were analyzed using a Wilcoxon test, given the non-normal distribution of the assays. Spearman rank test was used for correlation analysis. Comparison of all pairs of IFX concentration means, distributions by quartile, was analyzed with one-way analysis of variance and Tukey’s multiple comparison test. For the quartile comparison, a one-side Cochrane–Armitage trend test has been used.

Regarding ADA, only a qualitative comparison was used, given the difference of units. The comparison of the number of fold increases in the threshold of positivity was also realized. A p value less than 0.05 was considered statistically significant. GraphPad Prism 5.0 (GraphPad Software, San Diego, CA) was used for all statistical analysis.

Results

Comparison of Different Assays for the Measurement of IFX Trough Levels

We measured the infliximab trough level in the serum of 85 patients presenting IBD and treated with infliximab, using four different ELISA kits from multiple suppliers (Theradiag, R-Biopharm, Sanquin, and Grifols) and two Quick methods (R-Biopharm and BÜHLMANN).

We determined the median of the different assays (Table 2). The median values were very close regardless of the used technique and varied from 1.2 to 2.2 µg/mL. We then divided into four quartiles the infliximab levels that we measured with the available kits (0–3, 3–5, 5–7, and > 7 µg/mL). The results show a homogeneous distribution of the values in all of the three first quartiles, and the values are more scattered in the last one because of the difference in the high measurement threshold depending on the BÜHLMANN assay which has a broader linear range for trough determination up to 20 µg/mL (Supplementary Fig. 1).

Table 2 Median and inter-quartiles range (µg/mL) comparison between the different assays
Fig. 1
figure 1

Nonparametric Spearman correlation between the different suppliers’ kits. a Patients trough level distribution within the different assays, the red line representing the median, b correlation between Theradiag and R-Biopharm (ELISA), c correlation between Theradiag and Sanquin (ELISA), d correlation between Theradiag and Grifols (ELISA), e correlation between Sanquin and R-Biopharm (ELISA), f correlation between Grifols and R-Biopharm (ELISA), g correlation between Grifols and Sanquin (ELISA), h correlation between Quantum Blue® (BÜHLMANN quick test) and Ridaquick (R-Biopharm quick test)

We used a nonparametric correlation with a Spearman r coefficient to compare the different results, which revealed an agreement ranging from 0.93 to 0.97 between the different ELISAs [the highest r value was between Theradiag and R-Biopharm ELISA in addition to Grifols and R-Biopharm (0.97); the lowest was between Sanquin and Theradiag (0.93)] and from 0.95 to 0.99 between the ELISAs and the Rapid monitoring kits [the best correlation was between R-Biopharm ELISA and Ridaquick (0.99) and the lowest between both of POCs and Sanquin ELISA (0.95)]. The rapid monitoring assays had a strong correlation with each other, with an r coefficient of 0.98 (Table 3, Fig. 1).

Table 3 Spearman r coefficient comparison between infliximab monitoring tests

In order to complete this latest nonparametric analysis, we decided to study the infliximab trough levels using Bland–Altman plots and to analyze the techniques two by two. Three different gaps of infliximab trough level were defined according to the recommended therapeutic levels in Crohn’s disease (infratherapeutic: IFX < 3 µg/mL—therapeutic zone: IFX 3–7 µg/mL—supratherapeutic: IFX > 7 µg/mL). We then obtained the average of the two measurements and the difference between both of them (Supplementary Fig. 2). All techniques showed a good agreement up to 3 µg/mL of infliximab trough level, but were less correlated in the therapeutic and supratherapeutic groups. POCs overestimated the infliximab levels superior to 3 µg/mL compared to ELISA techniques.

Fig. 2
figure 2

Comparison of assays to measure antibodies to infliximab (ATI). a Normalized quantitative values comparison of ATI independently of IFX trough levels (conclusive and inconclusive ATI) between the different ELISAs expressed with the fold of the positivity limit defined for each assay. b Normalized quantitative values comparison of ATI (conclusive) between the different ELISAs in patients with IFX < 0.3 µg/mL expressed with the fold of the positivity limit defined for each assay

Comparison of Different Assays for the Measurement of ATI

Anti-infliximab antibodies levels were quantified independently of the IFX trough levels in the 85 sera of the same patients treated by infliximab using three drug-sensitive ELISAs (R-Biopharm, Theradiag, and Grifols). Due to differences in measurement units between three suppliers, we compared the positive and negative rates between them also using a nonparametric correlation; the Spearman r coefficient varied from 0.74 to 0.90, and the highest r value was between Grifols and Theradiag (0.90), while the lowest was between R-Biopharm and Theradiag (0.74) (Table 4).

Table 4 Spearman r coefficient comparison between the anti-infliximab antibodies monitoring test (comparison of positive and negative anti-infliximab antibodies values)

We also tried to normalize the quantitative values between the different ELISA tests which have different units, and we arbitrarily chose to express them by using multiples of the positivity limits defined by each supplier (Fig. 2). We then compared the data using a Spearman r coefficient that ranged from 0.81 to 0.93 (the highest r value was between Theradiag and Grifols 0.93 and the lowest between Theradiag and R-Biopharm 0.81) (Table 5).

Table 5 Spearman r coefficient comparison between the anti-infliximab antibodies monitoring test after normalizing the quantitative values (multiples of supplier positivity limits)

We know that the presence of IFX in the sera interferes with the detection of antibodies to infliximab due to the formation of complexes between infliximab and antibodies to infliximab (ATI inconclusive). We finally compared the presence and the quantities of ATI in samples where IFX trough levels are very low or undetectable (< 0.5 µg/mL). The correlation between Theradiag, Grifols and R-Biopharm ranged from 0.56 to 1 when comparing positive/negative rates (Theradiag/Grifols r = 1, Theradiag–Grifols/R-Biopharm r = 0.56) and from 0.6 to 0.89 when comparing the normalized quantitative values.

Discussion

We report a comparison between two rapid monitoring techniques and four commonly used ELISA techniques for the measurement of infliximab and anti-infliximab antibodies in serum samples of CD patients. The results demonstrate the comparability between all ELISA kits with a great correlation ranging between 0.93 and 0.97. The best coefficient was found between the Grifols and R-Biopharm kits, both of them using a monoclonal antibody to quantify trough concentration of infliximab in patients’ sera and both having a very similar calibration curve, with a detection range from 0.2 to 14.4 µg/mL for Grifols and 0.5–12 µg/mL for R-Biopharm, when using the main dilution recommended by the supplier. This value was also found between Theradiag and R-Biopharm ELISA kits. The lowest r value was between Theradiag and Sanquin (0.93). Nevertheless, all ELISAs are perfectly correlated. However, it seems better to perform the follow-up of a patient with the same assay. The use of the Theradiag assay could be useful in the case of anti-TNF switch between infliximab and adalimumab as it allows measuring the cumulative trough levels between these two drugs.

The comparison between the ELISA methods and the two point-of-care (rapid test) also shows a great correlation (0.95–0.99), with the best Spearman r value being between R-Biopharm and both of the POC assays (0.98–0.99) and the lowest between both of POCs and Sanquin ELISA (0.95). The two POC assays had a good correlation (0.98), demonstrating that we can indifferently use either ELISA techniques or POC assays only for punctual infliximab residual level measurement. On the other hand, when testing inter- and intra-assay coefficients of variability, the ELISA techniques showed CV ranging around 7–8%, while the POC assays showed CVs were around 15–20%. From this point of view, the ELISAs are more precise for application of follow-up of patients, the trade-off being the time delay in result availability as compared to the rapid tests. Although the quantification limit can be modified by the dilution factor, the obtained results when using the supplier’s first recommended dilution vary between the different techniques. POC assays offer a standard range from 0.4 to 20 µg/ml for BÜHLMANN and from 0.5 to 10 µg/ml for R-Biopharm, which is comparable to the ELISA; the therapeutic zone (3–7 µg/ml) is included in the given limits (3–7 µg/ml). The current delay to obtain a result with the standard ELISA techniques is around 90–120 min, whereas the rapid methods are then much shorter (around 30 min including serum generation from venous blood). However, in every assay even the two POCs used to monitor infliximab, measurements are taken on serum and not whole blood samples, which add centrifugation to the process, thus hindering time to results. These are not in fact real point-of-care assays as defined for diabetes testing but rather quick assays. These two tests present two main advantages over ELISA techniques: Firstly, shorter time to obtain results, which would theoretically allow clinicians to rapidly optimize patients’ treatments and improve clinical care, and secondly the possibility to analyze single samples on demand while the ELISA methods lengthen the time-to-result by working in series. Nevertheless, when using those techniques in everyday practice, things get more complicated. ELISA techniques may remain more reliable to use when de-escalating the treatment and when treating to target because of its limited CVR measurement in time, which may allow us to better achieve an almost precise level of infliximab in the patient serum. Nevertheless, changes in the dosing regimen using an ELISA can only be performed at the next infusion of the patient, typically eight weeks later. On the other hand, POC techniques may be advantageous in case of non-responding patients, since they can allow the detection of low infliximab concentrations within the same hospital visit of the patient. Nevertheless, the immediate treatment adaptation would only be possible when POC measuring ATI will be available and combined with infliximab POC. Moreover, the importance of having an exact concentration in non-responders may be less necessary. Nevertheless, there are currently no POC ATI assays available to discriminate between immune- and non-immune-mediated pharmacokinetic treatment failure.

Despite a great correlation when using Spearman test, the Bland–Altman plots showed controversial results. In fact, when the patients have a low infliximab trough level, all the techniques were comparable, despite a small percentage of patients overestimated by POC techniques. Things get more complicated when the infliximab level is higher than 3 µg/mL. Some of the ELISA techniques (such as Sanquin) and especially POC overvalue the trough level compared to the other techniques, which can predict difficulties to optimize patients. A part of them will have a high level of infliximab (> 7 µg/mL) when measurements are taken with POC techniques and will be in the therapeutic gap when dosed with ELISA techniques.

Finally, the POCs are quantitative monitoring methods; therefore, their COFRAC accreditation will also be quantitative so their EQC will probably be based on ELISA values with ELISA CVRs. Since the POC coefficient of variation of reproducibility is around 15–20 versus 7–8% for ELISAs, their accreditation will be difficult.

Regarding the measurement of anti-infliximab antibodies with those drug-sensitive techniques, the comparison between Theradiag, R-Biopharm, and Grifols showed a very good correlation when comparing the positive and negative sera (Spearman r between 0.73 and 0.90). The same comparison using the sera with an undetectable trough infliximab level (< 0.5 µg/mL) showed an excellent correlation between Grifols and Theradiag (r = 1) and a poorer correlation between them and R-Biopharm (r = 0.56). A study in Drug test anal (02/2017) showed that we can transform a drug-sensitive technique into a drug-tolerant technique by pretreating the sample in order to dissociate the drug–antibody complexes to measure the anti-infliximab antibodies levels in non-responding patients with a detectable infliximab level in serum.

We then normalized the quantitative values of anti-infliximab antibodies and arbitrarily chose to express them by using the multiples of the positivity limits defined by each supplier (10 ng/mL for Theradiag, 20 ng/mL for R-Biopharm, and 5AU/mL for Grifols) (Fig. 2). The graphic shows dispersing values with Grifols compared to R-Biopharm and Theradiag which presented a less scattered data. This can be explained by the lower positivity limit of Grifols.

These results highlight the fact that every assay can be used to identify a positive level of antibodies in a patient serum. However, since the kits do not have standardized quantification units and therefore generate very different results in absolute value, they should not be interchanged when monitoring a patient. Besides, the main discrepancy was observed in the Grifols ELISA kit in which patients with a high level of rheumatoid factor showed false-positive responses as if they had anti-infliximab antibodies; this result was absent when using the other kits.

Several ongoing works are trying to standardize the monitoring of ATI by using highly specific antibodies (MA-IFX10F9 for example) that can be utilized as potential universal calibrators and thus improve the measures’ sensitivity and specificity.

In conclusion, despite the good correlation between the different assays, it is more accurate to use the same test when monitoring a patient’s infliximab and anti-infliximab antibodies trough level given the fact that they do not all have the same detection limit and differ within the measuring techniques. A standardization of all those methods could be the key for a complete interchangeable use during TDM. Standards to harmonize these different assays are already described in the literature for both IFX and ATI and can eventually be used to harmonize TDM [31, 32]. Indeed, some groups are working on the monitoring of infliximab through level with LC–MS techniques. The actual results remain controversial. Some are well correlated with ELISA [33], but others show an important variation between LC–MS assays techniques and ELISA commercial kits [34]. The main difficulties when using LC–MS are the determination of a suitable internal standard (IS) when monitoring infliximab and the specificity of the detected signature peptide. Moreover, the LC–MS assay is more difficult to apply for a simple follow-up of the patients.