Introduction

Allogeneic hematopoietic cell transplantation (allo-HCT) is the most curative treatment options for hematological malignancies [1,2,3]. Conditioning regimens are among the important structural elements of HCT. This procedure has two primary goals: to provide sufficient immunoablation for the host to allow adequate engraftment of the donor cells, and to reduce the tumor burden of the underlying disease [4]. Historically, to achieve these goals, myeloablative conditioning (MAC) regimens such as cyclophosphamide combined with total body irradiation (TBI) [5] or with busulfan (Bu) [6] have been developed over the past 40 years. However, in the early years, toxicity from MAC was one of the major causes of mortality after HCT, so many patients were considered ineligible for transplantation because of advanced age or unacceptable risks associated with this treatment [7]. Over the past two decades, as immunologic reactions of donor cells against host leukemic cells after HCT were recognized, less toxic and more tolerable conditioning regimens called reduced-intensity conditioning (RIC) or nonmyeloablative conditioning have been developed. Though these regimens must have sufficient immunosuppression for engraftment, they can attenuate their cytotoxicity without compromise of their antileukemic action because of the presence of graft-versus-leukemia effects [7].

A purine analog, fludarabine (Flu), was introduced in the development of the conditioning regimens for allo-HCT in the 1990s [8]. Flu is generally well tolerated and has a sufficient immunosuppressive effect, along with a synergistic effect with alkylating agents [9]. This drug serves as the backbone of most RIC regimens with a reduced dose of alkylating agents and/or a reduced dose of TBI [7]. In allo-HCT for acute myeloid leukemia (AML), Flu with reduced doses of Bu (Flu + Bu) [10] and Flu with reduced doses of melphalan (Mel) (Flu + Mel) [11] are widely used RIC regimens [12,13,14]. There have been several studies for evaluation of Flu-based RIC regimens for AML, but most of them were retrospective analyses or prospective single-arm trials [15,16,17,18,19,20]. The results of these studies were fairly comparable. In some of these studies, the oral form of Bu (poBu) was used. However, interpatient variation of intestinal absorption is a problem with oral Bu [21]. In the late 1990s, an intravenous formulation of Bu (ivBu) was developed. IvBu is expected to stabilize the pharmacokinetics of Bu in each patient and perhaps improve clinical outcomes [22,23,24,25]. In Japan, ivBu was introduced in 2006 [26].

In this nationwide retrospective study, the clinical outcomes of allo-HCT for AML, especially focusing on Flu + ivBu-based (FBiv) RIC regimens, compared with Flu + poBu-based (FBpo) and Flu + Mel-based (FM) ones, were evaluated.

Patients and methods

Study design and data collection

This study was a retrospective multicenter study. Data were provided by the Transplant Registry Unified Management Program, which is managed by the Japan Society for Hematopoietic Cell Transplantation [27]. The population selection criteria included adult patients aged 16 years or older with AML, who received allo-HCT after RIC regimens between 2001 and 2010. We defined a RIC regimen as including the following dosage level according to the report from the Center for International Blood and Marrow Transplant Research [28]: TBI ≤ 5 Gy (nonfractionated) or ≤8 Gy (fractionated), poBu < 9 mg/kg or ivBu < 7.2 mg/kg and Mel < 140 mg/m2. Variables related to patients, diseases and transplants were extracted from the database. Transplant outcomes including engraftment, graft-versus-host disease (GVHD), complications, relapse or disease progression and survival were also collected. Patients lacking the information about key variables, i.e., sex, outcomes, endpoints, a stem cell source and a conditioning regimen, were excluded. The protocol was approved by the institutional review board of St. Luke’s International Hospital. Informed consent was obtained from recipients and donors in accordance with the principles of the Declaration of Helsinki.

Study endpoints and definitions

The primary endpoint of this study was leukemia-free survival (LFS). Secondary endpoints included engraftment, incidences of acute and chronic GVHD, non-relapse mortality (NRM), cumulative incidence of relapse (CIR) and overall survival (OS). All the times to the endpoint were calculated from the date of HCT (day 0). LFS was defined as time to progression of the underlying disease or death from any cause, whichever came first. OS was defined as time to death irrespective of the cause. Relapse was defined as hematological recurrence of AML, with NRM considered a competing event. NRM was defined as time to death while in remission, with relapse considered a competing event. Surviving patients who were free from events were censored at the date of last follow-up. If patients transplanted in active disease failed to achieve complete remission after HCT, the date of relapse was defined as day 0. Times to neutrophil and platelet recovery were defined as the first of 3 consecutive days with an absolute neutrophil count ≥500/µL and a platelet count ≥50,000/µL without transfusion, respectively. Acute GVHD and chronic GVHD were diagnosed and graded by standard criteria [29, 30]. For engraftment and GVHD, relapse and NRM were considered competing events. Cytogenetic abnormalities were classified according to the cytogenetic risk status classification system of the National Comprehensive Cancer Network [31].

Statistical analysis

Probabilities of LFS and OS were calculated by the Kaplan–Meier method. Engraftment, GVHD, NRM, and CIR were estimated by the cumulative incidence method. The log-rank test was used to compare LFS and OS curves and Gray’s test was used for the comparison of cumulative incidence curves. Multivariate analyses were performed using Cox proportional hazards model for LFS and OS, and the Fine–Gray model was used for engraftment, GVHD, relapse, and NRM. All p values were two-sided and p values < 0.05 were considered significant. All statistical analyses were performed with EZR (Saitama Medical Center, Jichi Medical University, Saitama, Japan), which is a graphical user interface for R (The R Foundation for Statistical Computing, Vienna, Austria, version 3.3.2). More precisely, EZR is a modified version of R commander (version 2.3–2) that was designed to add statistical functions frequently used in biostatistics [32].

Results

Patient characteristics

A total of 1743 patients with AML received first allo-HCT with RIC regimens from January 1, 2001 through December 31, 2010. The regimens were FBiv for 347 (20%) patients, FBpo for 444 (25%), and FM for 430 (25%). These 1221 patients were included in this analysis.

Patient, disease, and transplantation characteristics are summarized in Table 1. The median age of the 1221 patients included in this study was 58 (range: 16–82) years, and 38% (n = 470) were female. Patients whose disease status at transplant was high risk were more frequent in the FM group (61%) than in the FBiv (41%) and FBpo (47%) groups. Nearly half of the patients (48%) in the FBiv group received unrelated bone marrow transplantation, and 60% of the patients in the FM group underwent unrelated cord blood transplantation. FM might be expected to reduce the risk of graft failure after cord blood transplantation [33].

Table 1 Characteristics of adult patients (age ≥ 16 years) with AML who received first allogeneic hematopoietic cell transplantation between 2001 and 2010 with one of the three conditioning regimens; Flu + ivBu based, Flu + poBu based, or Flu + Mel based.

In addition to Flu and Bu or Mel, 66%, 53% and 76% of the patients (n = 806) received low-dose (≤8 Gy) fractionated TBI in the FBiv, FBpo, and FM groups, respectively. Low-dose TBI was expected to reduce the risk of graft failure [34]. The majority of patients in the three groups were given Flu (125–180 mg/m2), and ivBu (6.4 mg/kg), poBu (8 mg/kg), or Mel (80 mg/m2). In the 806 patients who received irradiation, 72% of them (n = 581) received 4 Gy of fractionated TBI. The median doses of TBI were similar in the three groups. The median follow-up of survivors was 600 days in the FBiv group, 1947 days in the FBpo group, and 897 days in the FM group.

LFS and OS

Both LFS and OS were significantly better in patients with FBiv conditioning than with FM (p = 0.018; p < 0.001, respectively) (Table 2). However, when the patients were stratified by disease status at transplant, LFS and OS among the three conditioning regimens were not significantly different in patients receiving HCT in both standard-risk (p = 0.64; p = 0.74, respectively) and high-risk (p = 0.32; p = 0.089, respectively) cases (data not shown). Multivariate analysis showed that LFS was significantly better in patients with FM (hazard ratio (HR) = 0.77, 95% confidence interval (CI): 0.63–0.95, p = 0.015) than with FBiv conditioning, but there was no significant difference in OS of the FBpo group (HR = 1.21, 95% CI: 0.97–1.51, p = 0.095) and the FM group (HR = 0.90, 95% CI: 0.72–1.13, p = 0.36) compared with the FBiv group (Table 3 and Supplementary Table 1). Adjusted LFS and OS at 3 years were 30.5 (95% CI: 24.3–38.3)% and 35.7 (28.7–44.5)% in patients with FBiv, 28.3 (23.3–34.4)% and 32.8 (27.5–39.2)% with FBpo, and 41.1 (35.4–47.6)% and 47.1(41.4–53.6)% with FM, respectively (Fig. 1). There were significant differences in both adjusted LFS (p = 0.003) and OS (p = 0.018) among the FBiv, FBpo, and FM groups. On the other hand, multivariate analysis only for patients who received allo-HCT between 2006 and 2010 showed that there were no significant differences in both LFS and OS of the FBpo group and the FM group compared with the FBiv group (Supplementary Table 2).

Table 2 Univariate analysis of transplant outcomes.
Table 3 Multivariate analysis of transplant outcomes.
Fig. 1: Adjusted probabilities of leukemia-free and overall survival for all patients according to conditioning regimen.
figure 1

a Leukemia-free survival. b Overall survival.

Relapse and NRM

CIRs at 3 years were 42.8 (37.2–48.3)% in the FBiv group, 39.3 (34.7–43.9)% in the FBpo group and 39.4 (34.6–44.2)% in the FM group (Table 2). There was no significant difference in CIR among the three groups. NRMs at 3 years were 18.8 (14.1–24.0)% in the FBiv group, 27.6 (23.5–31.9)% in the FBpo group and 30.2 (25.8–34.6)% in the FM group (Table 2). NRM was significantly lower in patients receiving FBiv regimens compared with FBpo (p = 0.003) and FM (p < 0.001) regimens. Stratified by disease status at transplant, CIRs in the FBiv group (63.5% at 3 years) were significantly higher than in the FM group (50.2% at 3 years) in high-risk (p = 0.008) cases (Fig. 2). NRMs in the FBiv group (17.1 and 21.1% at 3 years) were significantly lower than in the FM group (24.9 and 32.1% at 3 years) both in standard-risk (p = 0.026) and high-risk (p = 0.033) cases (Fig. 3). Multivariate analysis showed that the risk of relapse was significantly lower in patients receiving FM conditioning (HR = 0.66, 95% CI: 0.50–0.85, p = 0.002) than FBiv, and that NRM was significantly higher in the FBpo (HR = 1.84, 95% CI: 1.27–2.64, p = 0.001) and FM (HR = 1.60, 95% CI: 1.10–2.33, p = 0.013) groups than in the FBiv group (Table 3). Only for patients who received allo-HCT between 2006 and 2010, multivariate analysis showed that the risk of relapse was still significantly lower in patients receiving FM conditioning (HR = 0.73, 95% CI: 0.53–0.99, p = 0.042) than FBiv, but that there was no significant difference in NRM of the FBpo group and the FM group compared with the FBiv group (Supplementary Table 2).

Fig. 2: Cumulative incidence of relapse according to conditioning regimen, stratified by disease status at transplant.
figure 2

a Cumulative incidence of relapse according to conditioning regimen in standard-risk cases at transplant. b Cumulative incidence of relapse according to conditioning regimen in high-risk cases at transplant.

Fig. 3: Cumulative incidence of non-relapse mortality according to conditioning regimen, stratified by disease status at transplant.
figure 3

a Cumulative incidence of non-relapse mortality according to conditioning regimen in standard-risk cases at transplant. b Cumulative incidence of non-relapse mortality according to conditioning regimen in high-risk cases at transplant.

The major causes of NRM were graft failure/hematological disorder, infection, GVHD and organ failure in this study. The differences in cumulative incidences of NRM by graft failure, GVHD, and organ failure were not significant among the three conditioning regimens. However, on multivariate analysis, NRM by infection was a higher trend in the patients receiving FM than in those receiving FBiv (Table 4).

Table 4 Cumulative incidence and multivariate analysis of causes of non-relapse mortality.

Engraftment

Neutrophil engraftment was significantly higher in the FBiv group than in the FBpo and FM groups when the patients received related and unrelated bone marrow and unrelated cord blood. Platelet engraftment was not significantly different among the three groups irrespective of stem cell source (Table 2). On multivariate analysis, there was no significant difference in neutrophil and platelet recovery among the three groups (Table 3).

GVHD

The cumulative incidence of grade II to IV acute GVHD at day 100 was 33.3 (28.2–38.5) % in the FBiv group, 37.3 (32.5–42.2)% in the FBpo group, and 44.9 (39.7–50.0)% in the FM group (Table 2). It was significantly lower in patients undergoing FBiv regimens, compared with FM (p = 0.003). The cumulative incidence of grade III to IV acute GVHD was significantly lower in the FBiv group than in the FM groups. After adjustment for variables with a different distribution in the FBiv and FM groups, the incidences of grade II to IV acute GVHD and chronic GVHD were significantly higher in patients with FM than in those with FBiv (Table 3).

Discussion

In the present study, the clinical outcomes of adult patients with AML who underwent allo-HCT with RIC regimens were evaluated, focusing on FBiv conditioning compared with FBpo and FM.

In allo-HCT for AML, Flu + Bu, and Flu + Mel are widely used RIC regimens. Flu + Bu (Flu 180 mg/m2 and poBu 8 mg/kg) was first reported by Slavin et al. [10]. Since then, several investigators have further explored this regimen in myeloid malignancies [15,16,17,18]. Investigators at MD Anderson Cancer Center first reported results with a RIC regimen consisting of Flu 125 mg/m2 and Mel 100–140 mg/m2 in patients [11]. This regimen has subsequently been investigated at other centers, yielding similar results [19].

The studies of Flu + Bu conducted in the 1990s contained poBu, which was associated with individual variations in intestinal absorption and plasma Bu levels [21]. In the late 1990s, ivBu was developed and was expected to stabilize the plasma concentration of Bu in each patient. Decreased variability in plasma Bu levels with ivBu was reported to reduce sinusoidal obstruction syndrome and 100-day mortality [24, 35, 36].

In the present study, multivariate analysis showed there was no significant difference in OS among patients with AML receiving allo-HCT with the FBiv, FBpo, and FM regimens, but the FM group had a significant advantage in LFS compared with the FBiv group. The present results also indicated that the risk of relapse for the patients receiving FM decreased significantly compared with FBiv, while NRM was significantly higher in patients with FBpo or FM than that in patients with FBiv. These results could be affected by the progress in supportive care during the study period. Multivariate analysis only for patients who received allo-HCT between 2006 and 2010 showed no significant difference in survival outcomes among transplant recipients with the three regimens. We also analyzed the impact of conditioning regimens on transplant outcomes by stem cell source, but there was no significant difference in OS among the patients with the FBiv, FBpo, and FM regimens in each stem cell source. But about LFS, the FBpo group had a significant disadvantage among the patients who received related bone marrow transplantation, and the FM group had a marginally significant advantage among those who received unrelated cord blood transplantation, compared with the FBiv group (Supplementary Table 3).

There are already several retrospective studies comparing Flu + Bu with Flu + Mel. Three of these studies are summarized in Supplementary Table 4 (refs. [37,38,39]). The present study, as well as these three studies, showed that CIR was significantly lower for patients receiving FM than for those receiving FBiv. In the present study, the total Mel dose was limited to less than 140 mg/m2 according to the definition of RIC, and most patients were given 80 mg/m2 [40]. But in the three studies, the Flu + Mel regimens consisted of 100–140 or 130–150 mg/m2 of Mel. This result indicated that, despite the use of a reduced dose of Mel, Flu + Mel might be more intense and have inherently higher antileukemia potential than Flu + Bu.

In the present study, NRM was significantly lower in the FBiv group than in the FBpo and FM groups. This finding was in agreement with Shimoni et al. and Eapen et al., and the study from the European Society for Blood and Marrow Transplantation also suggested a lower NRM for patients with Flu + Bu than for those with Flu + Mel. This increase in NRM might be attributable to the severe toxicities of Mel. In the present study, NRM by infection was a higher trend in the patients receiving FM than in those receiving FBiv. Severe infection during the early phase of HCT following mucosal barrier damage caused by Mel as a part of a conditioning regimen may have contributed to the increase of NRM [37]. The higher incidence of acute GVHD with FM observed in the present comparison was also likely the result of the increased tissue injury caused by Mel and release of inflammatory cytokines involved in GVHD pathogenesis [41, 42].

Baron et al. suggested that patients receiving Flu + Bu were more likely to have a mixed chimera early after HCT and a higher incidence of graft failure than those receiving Flu + Mel [39, 43]. However, in the present study, multivariate analysis showed there were no significant differences in neutrophil and platelet recoveries between the FBiv and FM groups. We examined the impact of TBI on transplant outcomes. Multivariate analysis by conditioning regimen showed that TBI had a positive impact on LFS and OS only in the FBpo group (Supplementary Table 5). Another subgroup analysis stratified by stem cell source indicated that low-dose TBI significantly reduced the risk of relapse only in the HCT recipients of related peripheral blood or unrelated bone marrow (Supplementary Table 3). These results suggest that TBI had a certain role in engraftment and antileukemia effect, but the influence of low-dose TBI on the FBiv and FM regimens was limited in this comparative study.

It is important to recognize that this retrospective study had some limitations. First, there was no information related to why individual patients were designated to receive specific conditioning regimens. Second, there were diverse characteristics among the patient groups such as age, disease status at transplant, and stem cell sources. Therefore, multivariate analyses were performed to adjust for the effects of these differences on the results. Another limitation is that there was not enough information about late complications for analysis in the dataset. Quality of life in long-term survivors should be one of the critical factors in the comparison of RIC regimens.

In summary, a nationwide retrospective study was performed using a cohort of adult patients with AML undergoing allo-HCT to compare the clinical outcomes of FBiv with those of FBpo and FM. LFS was significantly higher and CIR was significantly lower in the FM group than in the FBiv group. Thus, Flu + Mel appears to have a more intense antileukemia potential than Flu + ivBu. On the other hand, NRM was significantly lower in the FBiv group than in the FBpo and FM groups. This finding indicates that Flu + ivBu is a less toxic RIC regimen than Flu + Mel. Introduction of ivBu instead of poBu also contributed to decreased toxicity. Finally, the three groups had almost the same OS. Though prospective randomized studies are needed to confirm these results, the results from the present study offer convincing evidence that both Flu + ivBu and Flu + Mel are effective and useful RIC regimens for patients with AML who receive allo-HCT.