Introduction

Systemic lupus erythematosus (SLE) is a chronic autoimmune inflammatory disease that affects multiple organs [1]. Renal involvement, i.e., lupus nephritis (LN), in SLE occurs in up to 60 % of cases and is a major determinant of the outcome of the disease [2]. Currently, glucocorticoids in combination with cyclophosphamide are the main regimen of induction therapy for lupus nephritis [3]. This regimen is quite effective and may improve the long-term prognosis of patients. However, adverse effects such as bone marrow suppression, infection, and gonadal toxicity limit its use in clinical practice. New immunosuppressant, such as mycophenolate mofetil (MMF) and calcineurin inhibitors (CNIs), has been used to treat LN in recent years. CNIs are potent immunosuppressants that inhibit the nucleus factor of activated T cell (NFAT) family of transcription factors, leading to the reduced function of effector T-cells [4] Meanwhile, CNIs inhibit the transcription of the early activation genes of interleukin (IL)-2 and suppress T cell-induced activation of tumor necrosis factor-α, IL-1β, and IL-6 [5]. Although several studies indicated that CNIs, such as cyclosporine (CyA) and tacrolimus (TAC), are effective at treating LN [6, 7], no systematic review has been performed to clarify their efficacy in the induction and maintenance treatment of LN in comparison with ivCYC and MMF. Therefore, this study sought to evaluate the efficacy and safety of these CNIs in treating LN.

Search strategy

A literature search was performed in PubMed, the Cochrane Library, and the Embase databases. PubMed (1966–February 2015), Embase (1974–February 2015), the Central Register of Controlled Trials (1999–January 2015), and the Cochrane Renal Group (1999–January 2015) were searched for the identification of relevant randomized controlled trials (RCTs). The following search terms were used: lupus nephritis, lupus glomerulonephritis, systemic lupus erythematosus, tacrolimus, TAC, FK506, cyclosporine A, CsA, CyA, and calcineurin inhibitors. Relevant text words relating to eligible interventions were also searched. We also hand-searched the bibliographies of articles for additional references. The results were limited to human studies with no restrictions on language.

Inclusion criteria and risk of bias

Articles were selected and subsequently screened based on the patient/problem intervention comparison outcome (PICO) principle. The studies included were RCTs and quasi-RCTs, whether published or unpublished, that evaluated any of the following treatment options: alone or in combination for more than 6 months in induction and 9 months in the maintenance period with corticosteroids, cyclophosphamide, MMF, tacrolimus, azathioprine, or cyclosporine. Only trials enrolling patients with biopsy-proven lupus nephritis and clearly defined remission criteria, remission outcome data and safety data were included. All potential articles were retrieved for the full text and reviewed independently by at least two investigators to determine whether the inclusion criteria were met.

Because of the variable quality of the articles included, no assessment of validity was made for qualifying studies. Moreover, a risk of bias table recommended by the Cochrane risk of bias tool [8] was used to assess the risk of bias of the included RCTs.

Data extraction and management

Two authors (Xiaoyan Zhang and Ling Ji) performed data extraction independently using standard data extraction forms, and Wei Qin was consulted when there was a discrepancy. For studies from which detailed data could not be extracted, the authors were contacted by e-mail. Basic information such as first author, year of publication, study design, inclusion criteria, study sample size, basic characteristics of the study subject, intervention regimen, drug dosage, follow-up time, outcome data, and adverse effects was recorded for each study included.

Outcome measures

The primary outcomes for the induction period were the complete remission (CR) rate, the partial remission (PR) rate, and the response rate, as defined by the sum of the complete and partial remission rates. The serum creatinine (sCr) level and 24-h urine protein level were used as efficacy indexes, whereas the rates of infection, leukocytopenia, hypertension, hyperglycemia, and menstruation disorders were used as safety indexes. Complete and partial remission criteria and rates were established within each article and are described in Table 1. All studies used proteinuria reduction as a criterion for remission and required a reduction in proteinuria to less than 0.5 g/day for complete remission. Some studies used even lower thresholds of less than 0.3 g/day, and several used serum creatinine levels, serum albumin levels, urine red blood cell numbers, and extra-renal lupus activity as additional criteria for remission. Partial remission criteria varied between articles, but all required a greater than 50 % reduction in proteinuria or less than 3.5 g/day (Table 2). When data were missing or incomplete, the investigators of the trials were contacted for clarification. The method provided by the Cochrane Handbook [8] was used to convert the mean and the 95 % confidence interval (CI) range to the mean and standard deviation.

Table 1 Definitions of clinical outcome in each RCT
Table 2 Reasons for exclusion of full-text articles

Statistical analysis

Review Manager 5.2 software was used to analyze the data. Risk ratios (RR) and 95 % CIs were used to express the results of dichotomous outcomes. The mean difference (MD) was used for results with continuous scales, and the standardized mean difference (SMD) was used when different scales were used. Heterogeneity was analyzed using a Cochran Q test (n − 1 df), with P < 0.05 denoting statistical significance and I 2 measuring the proportion of variation in efficacy estimates due to heterogeneity beyond chance [9]. Random-effects analysis (I 2 > 50 %) and fixed-effects analysis (I 2 < 50 %) were used in meta-analysis according to the protocol. A Z test was used to analyze the overall effect, with P < 0.05 denoting statistical significance. Publication bias was estimated using funnel plots.

Result

Study selection

We identified 2765 articles in the first search. Of these, 2747 articles, including duplicate references, reviews, case reports, basic studies, non-controlled trials, systematic reviews, and meta-analyses, were excluded after examination of the title and abstract. Full texts of the remaining 18 articles were retrieved for further selection. An additional eight articles were excluded, including four non-RCT studies [1013], one study of children [14], one historical controlled trial [15], one study without outcome assessment [16], and one early research study [17]. Eventually, 10 studies including eight induction period studies [1825] and two maintenance period studies [26, 27] were included in this systematic review and meta-analysis. Among them was El-Sehemy’s [18] study, which compared CyA and ivCYC or AZA, and Li’s [25] study, which compared TAC and ivCYC or MMF. Therefore, separate comparisons were performed during the meta-analysis. Miyasaka’s [20] study was not included in the quantitative analysis because it compared TAC with a placebo. Although El-Sehemy’s [18] and Griffiths’ [22] studies both compared CyA and AZA, a quantitative analysis was not performed because there were few identical indices between the two studies. Zavada’s [16] study was not included in the quantitative analysis of the maintenance period because it only contained subjects from the induction period rather than complete remission subjects into the maintenance period. The article search strategy used in our review is described in Fig. 1.

Fig. 1
figure 1

Flow diagram of articles considered for inclusion

Trial characteristics and qualities

Table 3 shows the characteristics of the studies that were included in the meta-analysis. With the exception of two studies, all patients had biopsy-proven proliferative lupus nephritis [18, 22]. Comparators for induction therapy included cyclophosphamide, MMF, tacrolimus, and cyclosporine. The efficacy and safety of the CNIs and ivCYC were compared in five studies [18, 19, 21, 23, 25], totaling 188 and 194 patients, respectively, and the efficacy and safety of the CNIs and MMF were compared in two studies [24, 25] with a total of 56 patients. Only the CNIs and AZA were compared in maintenance therapy, which included two studies [26, 27] and 139 total patients.

Table 3 Characteristics of the studies included in this systematic review of CNI therapy in lupus nephritis

A risk of bias assessment of the RCTs included in our analysis was performed using a risk of bias table recommended by the Cochrane risk of bias tool (Table 4). A lack of allocation concealment resulted in a high risk of bias in Zavada’s study, and selective reporting in El-Sehemy’s and Griffiths’ studies also increased the risk of bias.

Table 4 Risk of bias summary

Comparison of the CNIs and ivCYC in induction therapy

The comparison of the efficacy of the CNIs and ivCYC in induction therapy of LN patients included four RCTs. No significant difference was observed in the CR rate (RR 1.33, 95 % CI 0.93–1.90, P = 0.12), PR rate (RR 0.91, 95 % CI 0.60–1.36, P = 0.64), or response rate (RR 1.12, 95 % CI 0.94–1.33, P = 0.20). Moreover, there were no significant differences in the proteinuria or serum creatinine levels between CNI and ivCYC treated patients (Fig. 2). Five RCTs were used to compare the adverse effects of the CNIs and ivCYC during the induction therapy of LN patients. There were lower rates of infection (RR 0.65, 95 % CI 0.43–0.98, P = 0.04), leukocytopenia (RR 0.32, 95 % CI 0.11–0.93, P = 0.04), and menstrual disorder (RR 0.37, 95 % CI 0.17–0.80, P = 0.01) in patients treated with the CNIs than in patients treated with ivCYC (Fig. 3). However, no differences were found in the incidence of other adverse effects, such as liver dysfunction, hyperglycemia, and the transient increase of serum creatinine (Table 5).

Fig. 2
figure 2

Comparison of the therapeutic efficacy of the CNIs and ivCYC

Fig. 3
figure 3

Comparison of the adverse effects of the CNIs and ivCYC

Table 5 Summary of outcome meta-analysis comparing CNIs with other immunosuppressive agents in LN

Comparison of the CNIs and MMF in induction therapy

Two RCTs were used to compare the efficacies of the CNIs and MMF in the induction therapy of LN patients. No significant difference was observed in the CR rate (RR 0.57, 95 % CI 0.12–2.75, P = 0.48), PR rate (RR 1.33, 95 % CI 0.58–3.08, P = 0.50), or response rate (RR 0.94, 95 % CI 0.68–1.30, P = 0.71), and no difference was found in the incidence of infection or leukocytopenia (Fig. 4; Table 5).

Fig. 4
figure 4

Comparison of the therapeutic and adverse effects of the CNIs and MMF

Comparison of the CNIs and AZA in maintenance therapy

Comparison of the efficacies of the CNIs and AZA in the maintenance therapy of LN patients included two RCTs. No significant difference was observed in the relapse rates (RR 0.44, 95 % CI 0.10–1.89, P = 0.27), but a slight decrease in the infection incidence (RR 0.56, 95 % CI 0.28–1.10, P = 0.09) and a lower apparent leukocytopenia incidence (RR 0.26, 95 % CI 0.12–0.55, P = 0.0005) were observed in the CNI groups (Fig. 5; Table 5), indicating a lower incidence of adverse effects and a better tolerance for the CNIs in maintenance therapy.

Fig. 5
figure 5

Comparison of the therapeutic and adverse effects of the CNIs and AZA

Publication bias

The funnel plots of the five RCTs comparing the CNIs and ivCYC in induction therapy are shown in Fig. 6. The symmetric distribution suggests that there was no publication bias in these studies. No funnel plot analysis could be performed in the CNIs versus MMF or the CNIs versus AZA comparisons because only two RCTs were included.

Fig. 6
figure 6

Funnel plot for the partial remission rate

Discussion

SLE is a classic autoimmune disease with a range of multi-system disorders; the kidney is the most vulnerable target of SLE. Lupus nephritis is one of the most severe complications of SLE. Many guidelines recommend intravenous CYC [2830] as the first choice of induction treatment. Although the CYC regimen achieved a relatively high remission rate, 15 % of LN patients were unresponsive, and up to 50 % of patients developed end-stage renal disease (ESRD) during treatment [31]. Moreover, CYC could cause severe adverse effects such as liver toxicity, infection, malignancy, and infertility [3]. Thus, new immunosuppressants, such as MMF and the CNIs CyA and TAC, have been used recently to improve the clinical management of LN patients. The most effective immunosuppressive therapy, however, is controversial. A recently published systematic review [32] reported that there was insufficient evidence to determine which immunosuppressive agent provides the best induction treatment. To clarify the effect of the CNIs CyA and TAC on the induction and maintenance treatment of LN with the most reliable and credible results, we performed a systematic review and meta-analysis that included all of the high-quality RCTs ever published.

The effect of the CNIs on the induction and maintenance treatment of LN was analyzed in this study. Eight RCTs were included in the systematic review and meta-analysis of the CNIs in induction therapy, and of them, five compared the CNIs with ivCYC (two TAC and three CyA). A total of 202 LN III-V patients were included, with a 6 to 12 month follow-up period. Although some studies suggested that the effect of TAC or CyA was superior to ivCYC, our systematic analysis and meta-analysis indicated that the CNIs TAC and CyA are comparable to ivCYC in terms of their CR, PR and response rates. The advantage of the CNIs is a significantly lower incidence of the adverse effects (infection, leukocytopenia, and menstrual disorder) that limited the use of ivCYC in many patients. A quantitative analysis about hypertension was not performed because only few articles can be merged. However, Griffiths’s and Moroni’s studies showed that the incidence of hypertension in CyA group was higher than azathioprine group. This question should be addressed further after more data could be achieved in RCTs. Two RCTs that compared the efficacy of the CNIs and MMF in LN induction treatment were also included in this study. A total of 56 patients with LN III-V were enrolled in studies with 6 to 24 month follow-up periods. TAC was used in both trials. Moreover, a systematic review and meta-analysis reported no difference in therapeutic effectiveness (CR, PR rate) or adverse effects between TAC and MMF. Similar to induction therapy, the effect of the CNIs on LN maintenance therapy was comparable to that of AZA, but the leukocytopenia rate was much lower. However, only two RCTs were included in the meta-analysis, and this result needs further confirmation.

In the recently published KDIGO (Kidney Disease Improving Global Outcomes) guidelines for lupus nephritis treatment, ivCYC, and MMF were recommended for proliferative and membranous lupus nephritis. CNIs, by contrast, were suggested only as an alternative membranous lupus nephritis treatment. In the ACR guidelines, CNIs were also suggested as a treatment option for refractory LN [28]. Webster reported that tacrolimus could be applied in patients with lupus nephritis during pregnancy [33]. Based on the results of the current study, we recommend the CNIs (CyA and TAC) as an alternative induction LN III-V treatment when patients either refuse or are intolerant to ivCYC/MMF treatment.

Given that no “head-to-head” comparison of TAC and CyA in LN treatment has been performed, we could not determine which one was more effective. However, in this study, we found that, considering the CR, PR and response rates, neither TAC nor CyA is superior to ivCYC in LN induction treatment, but the renal toxicity of TAC is much lower than that of CyA, suggesting that TAC is safer, especially in patients with impaired renal function.

Previously, several systematic reviews [3437] comparing CNIs and ivCYC in LN induction therapy have been published. For maintenance therapy, Henderson LK et al. reported that MMF was more effective than azathioprine at preventing relapse [38]. A recent study [37] also compared CNIs and CYC (oral or iv) in four RCTs, one case control study and one cohort study. The results suggested that CNIs are superior to CYC in the response rate and adverse effect incidence. However, the inclusion of non-RCTs violated the principle of meta-analysis and compromised the reliability of the results.

The current study is the only systematic review to analyze the effect and safety of the CNIs CyA and TAC in the induction and maintenance therapy of LN in high-quality RCTs. No case–control or cohort studies were included to prevent bias and to guarantee the reliability of the results. Moreover, this study is the only systematic review comparing these CNIs and MMF in LN treatment; therefore, it provides very useful information.

Our study had several limitations. First, most of the RCTs included were small sample size, single-center studies, whereas large sample size, multi-center studies are needed to avoid inclusion bias. Second, only six RCTs [19, 2123, 26, 27] indicated the exact randomization method, whereas three RCTs [23, 26, 27] concealed the random assignment protocol, potentially compromising the reliability of these studies. Third, heterogeneities in pathological subtypes and drug dosages, and a combined regimen in different RCTs may have also obscured the meta-analysis results. Therefore, long-term, large-sample, multi-center RCTs are needed to confirm the efficacy of the examined CNIs in LN treatment.

Conclusion

In conclusion, our systematic review and meta-analysis of recent RCTs indicated that the CNIs CyA and TAC were equally as effective as ivCYC/MMF and AZA in the induction and maintenance treatment of LN patients. Moreover, the CNIs were safer than ivCYC, with lower rates of leukocytopenia, infection, and menstruation disorder. Therefore, the CNIs could be recommended to LN patients as an alternative method of induction and maintenance treatment, especially for those who refuse or are intolerant to ivCYC or MMF.