Introduction

High-dose chemotherapy followed by autologous stem cell transplantation (ASCT) remains a standard of care for Multiple Myeloma (MM) [1,2,3,4,5]. The conditioning regimen used for ASCT has not changed since the Intergroupe Francophone du Myélome demonstrated superior survival for melphalan 200 mg/m2 (MEL200) versus 8-Gy TBI with melphalan 140 mg/m2 (MEL140) in 2002 [6]. Since then, a number of agents [i.e., alkylators, proteasome inhibitors (PI), immunomodulatory drugs (IMiDs)] and other strategies have been tested in combination with MEL200 [7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24]. With the exception of a sole phase III study showing improved progression-free survival (PFS) for conditioning with busulfan and MEL140 compared to MEL200, no other regimen has shown superiority to the standard MEL200 conditioning regimen in a randomized controlled trial (RCT) [24]. It is now well established that achievement of deeper responses post-transplant, as measured by sensitive measurable residual disease detection technologies (MRD), correlates highly with extended survival [25,26,27,28]. Therefore, augmenting transplant conditioning strategies with the purpose of increasing the depth of response after transplant, may result in further improvement in duration of remission and potentially improved survival.

Bendamustine is a synthetic agent that combines the alkylating properties of a mustard-group with the antimetabolic activity of a purine analog [29]. Similar to other alkylators, bendamustine is a DNA cross-linking agent that causes DNA breaks. However, DNA single- and double-strand breaks caused by bendamustine are more extensive and significantly more durable than those induced by other alkylators such as melphalan [30, 31]. Its antimetabolite activity includes induction of apoptosis and inhibition of mitosis [31, 32]. With this distinctive dual mechanism of action, bendamustine has proven to induce responses in MM resistant to other alkylators and is therefore a promising agent to test synergy in conditioning regimens [31,32,33,34,35,36,37,38,39]. Our group previously reported a phase I study of high-dose bendamustine up to 225 mg/m2 in combination with MEL200 in transplant-naive MM and showed that this regimen was well tolerated and a maximum tolerated dose of bendamustine was not reached [23]. We now report the results of the phase II study of chemotherapy conditioning comprised of bendamustine and melphalan in patients undergoing first ASCT.

Patients and Methods

Study design

This is a phase II, single-arm study using bendamustine 225 mg/m2 with MEL200 as determined in the prior phase I trial [23]. The primary objective was to determine the efficacy of bendamustine in combination with melphalan conditioning in patients with MM that would warrant further investigation. The IRB of Weill Cornell Medicine, in accordance with federal regulations and the ethical principles provided in the Helsinki Declaration, approved the protocol. All patients provided informed consent. Patients were enrolled between 05/2011 and 12/2015, and the trial is registered at ClinicalTrials.gov with identifier NCT00916058.

Study eligibility patient population

Patients eligible for entry were 18–75 years with confirmed MM undergoing first ASCT. Patients could be screened for inclusion either i) post-induction for newly diagnosed MM (NDMM), or ii) after relapsed or progressive disease for relapsed or refractory MM (RRMM). Eligibility for ASCT was assessed by adequate collection of ≥2 × 106 of CD34+/kg stem cells, Karnofsky performance status ≥70% and adequate organ function defined as ejection fraction >40%, lung diffusion capacity >45%, creatinine clearance (CrCl) >25 ml/min, liver enzymes ≤5× and total bilirubin ≤1.5×. Major exclusion criteria included previous ASCT, pregnancy, or uncontrolled medical conditions. The Revised International Staging System (R-ISS) at the time of the initial diagnosis was used for prognostication [40]. High-risk cytogenetics were defined as any of the following: del(17p), t(4;14), or t(14;16). Any percentage positive signal on FISH testing was considered a positive result. Prior lines of therapy, defined by the International Myeloma Working Group (IMWG), were obtained for each patient [41].

Conditioning Regimen and Supportive care

The highest dose assessed from the phase I study of bendamustine (125 mg/m2 on day-2 and 100 mg/m2 on day-1) in combination with melphalan (100 mg/m2 daily on day-2 and day-1) were chosen for the study. Melphalan was dose-reduced to 70 mg/m2 per day for CrCl <70 ml/min. At the time of melphalan infusion, each patient was offered oral cryotherapy for 6 h [42]. Stem cell products were infused 24–48 h after the last dose of melphalan. All patients received G-CSF support starting on day + 1 until absolute neutrophil count (ANC) was ≥1000 neutrophils/mm3 for 48 h. Patients received levofloxacin, acyclovir or valacyclovir, and fluconazole prophylaxis starting day + 1, red blood cell and platelet transfusions to support a hemoglobin >7 g/dL and platelet count >10,000/mm3, and other supportive care as an in-patient until ANC engraftment.

Response determination

Engraftment was defined as per CIBMTR criteria as ANC >500/mm3 and platelet count of 20,000/mm3 without platelet support for at least 3 days [43]. Serum and urine protein electrophoresis with immunofixation as well as free light chain determinations were done before transplantation, day + 30, day + 60, day + 100, day + 180 and thereafter every 6 months to assess disease response as per IMWG criteria [44]. Bone marrow biopsy was performed before ASCT and at day + 100. Karyotyping and FISH with CD138-cell selection were performed on all samples. PFS was defined as time elapsed between the stem cell infusion (day + 0) and disease progression or death. OS was defined as time elapsed between day + 0 and death from any cause. The response assessment at day + 100 was independently reviewed by two investigators.

Safety Assessment

Patients were followed prospectively for adverse events (AEs) from day-2 through day + 100. Transplant-related mortality (TRM) was defined as death within the first 100 days due to any cause other than disease progression. All AEs were first documented by the trial physicians, and thereafter collated and graded by two independent reviewers according to the National Cancer Institute’s Common Terminology Criteria for Adverse Events (CTCAEv4.0). The Cornell Data and Safety Monitoring Board oversaw the monitoring plan.

Maintenance Therapy

At the time of the study design, maintenance therapy was not a standard of care and it was therefore not part of the initial protocol. As the practice evolved, the use and type of maintenance or continuous therapy post-ASCT was left to the discretion of the treating physician starting after day + 100.

Statistical Analyses

The primary end point was to evaluate the proportion of patients with complete response (CR) or better at day + 100. Sample size was determined according to Simon’s two-stage minimax design assuming 10% level of significance and 80% power [45]. Previous reports with induction followed by ASCT have shown a CR rate of ~20 to 60% depending on the induction regimen [5, 46]. Based on the objectives of this early phase II study, and the expected heterogeneity of the study population, a CR proportion of ≤40% was defined as unacceptable and >60% as worthy of further exploration. The null hypothesis that the CR proportion is ≤40% was tested against the alternative hypothesis that the CR proportion is ≥60%. In the first stage, 16 patients entered the study and 7 or more patients needed to respond to enter the second stage. At Stage 2, the treatment would be declared effective and worthy of further testing if 15 or more patients respond among 28 patients. This design yielded a ≥0.80 probability of a positive result if the true percentage of complete responders was ≥60%. It yielded a ≥0.90 probability of a negative result if the true percentage of complete responders was ≤40%. Assuming that approximately 10% of patients were declared unavailable/ineligible, we calculated that a total 32 patients were needed for enrollment.

Secondary end points included OS, PFS, and rate of toxicities. PFS and OS were assessed by Kaplan–Meier survival analysis. Patients without OS/PFS events were censored at the last follow-up for OS and at the last disease evaluation for PFS. Post-hoc subgroup comparative analyses for overall response rate (ORR) and depth of response between NDMM and RRMM cohorts was performed by Χ2 test and Fisher-exact test. Comparisons of OS/PFS between groups of interest were evaluated by the log-rank test. A Cox proportional-hazards model was used to evaluate the impact of tumor/patient characteristics on the outcomes of interest. Ninety-five percent confidence intervals (95% CI) for all OS/PFS proportions were calculated to assess the precision of the obtained estimates. All analyses were performed in Stata version 15.0 (StataCorp, College Station, TX).

Results

Study population

At stage 1, 16 patients entered the study. As there were eight patients with a complete response/stringent complete response (CR/sCR), enrollment then increased to a total of 28 patients at stage 2. Sixteen out of the first 28 evaluable patients achieved a CR/sCR, meeting the bar for further exploration as prespecified in the statistical design. The study enrollment was then expanded to a total of 35 patients to improve the precision of the 95%CI. The baseline characteristics of the patients are summarized in Table 1.

Table 1 Patient and Disease Characteristics

ASCT was performed as part of the upfront therapy in 18 patients with NDMM (51%) and in 17 patients with RRMM (49%). Induction regimens for the NDMM included VRd (61%), BiRd [clarithromycin-lenalidomide-dexamethasone] (6%), Ixazomib/Rd (6%), CyBord (3%), V-DCEP (3%), Rd (3%). Of the RRMM group, 3 patients had relapsed disease, 4 had primary refractory disease, 7 had refractory disease to the last line of therapy, and 3 patients had received more than two lines of induction based on poor responses and were transplanted more than 12 months after initial diagnoses (for the purpose of this subgroup analysis, these patients were included in the RRMM group). Median lines of therapy prior to ASCT for the RRMM group was 2 (range: 1–4). Therapy exposure history is described in Table 1. All patients had received either an IMiD or a PI at some point during treatment, most commonly lenalidomide (16/17) and bortezomib (16/17). Four patients had received a newer-generation proteasome inhibitor or IMiD. Seven patients were refractory to lenalidomide, seven were refractory to bortezomib, three of them double-refractory. One patient was double-refractory to pomalidomide and carfilzomib. There was no statistical difference seen between the depth of response pre-transplant in the NDMM and RRMM groups (P = 0.63).

Stem Cell Infusion and Engraftment

Our algorithm for mobilization, based on preharvest HPC values and preemptive use of plerixafor, was followed [47]. Plerixafor was added based on CD34+ count on Day + 4 of G-CSF. Table 2 summarizes information for mobilization, melphalan dosing, CD34+ collection and engraftment.

Table 2 Transplant-related variables

Toxicity

Incidence of grade 3 AEs that occurred in at least 2 cases and all grade 4 AEs are listed in Table 3. There was no transplant-related mortality (TRM). The most common AEs (reported for more than 40% of the patients) were nausea (94%), fatigue (94%), hypocalcemia (94%), anorexia (91%), diarrhea (91%), hypoalbuminemia (91%), dry mouth (63%), dyspnea (60%), mucositis (57%), abdominal pain (51%), peripheral edema (49%), bloating (49%), constipation (49%) and febrile neutropenia (46%). The most frequently observed grade 3 AEs were febrile neutropenia (46%) and hypokalemia (20%). There was only one grade 4 AE, hyperbilirubinemia deemed to be secondary to a drug reaction to piperacillin/tazobactam. The patient had neutropenic fever on day + 5 and was started on piperacillin/tazobactam. The next day, the patient developed new bilirubinemia. Cultures came back positive on day + 6 (later identified as S.hominis). Piperacillin/tazobactam was discontinued after day + 7 and alternative antibiotic therapy was provided. Bilirubin peaked at day + 13 and thereafter resolved. Liver biopsy was performed and consistent with a drug reaction.

Table 3 Incidence of Treatment Related Adverse Events with Toxicity Grade 3 or more occurring in at least 2 subjects or any Grade 4

Disease Response

Day + 100 evaluation with bone marrow examination was performed at a median of 99 days (range 83–135). 51% of the 35 patients enrolled achieved a CR/sCR (95%CI 36%-67%). This included 8 patients (44%) transplanted with NDMM and 10 patients (59%) with RRMM (p = 0.31). The ORR for the cohort was 97%; 100% for the NDMM and 94% for the RRMM setting (p = 0.49). Figure 1 shows all response changes with respect to initial disease status prior to ASCT. Maximum CR/sCR response rate for the cohort was 54% as one additional patient achieved a CR during maintenance therapy (7 months after ASCT). Only 1 patient progressed before day + 100 and one additional patient progressed before Day + 180 (both with RRMM). Individual responses and outcomes for patients with dose-reduction of melphalan based on impaired renal function are described in the Supplementary Information.

Fig. 1
figure 1

Disease response assessment at 100-days post-transplant by disease status prior to transplant: patients transplanted in the newly diagnosed setting (NDMM) (a) and patients transplanted in the Relapsed/Refractory setting (RRMM) (b)

Maintenance

Maintenance was given in 12 out of 18 patients transplanted in the NDMM setting (67%) and in 10 out of 16 eligible patients (63%) transplanted in the RRMM setting. Maintenance was continued until progression or unacceptable toxicity. Seventeen patients started lenalidomide 21/28 days (13 at 10 mg and 4 at 5 mg). Five patients received PI-based regimens (1 carfilzomib and 4 bortezomib). Six patients with NDMM and six with RRMM did not receive maintenance. Reasons included patient refusal [4], recurrent infections [1], physician recommendation [7].

Clinical Outcomes

After a median follow-up of 65 months (range: 9–86 months), 21 patients had progressed (60%): 10 (56%) transplanted with NDMM and 11 (65%) with RRMM. Seven patients (20%) had died from MM progression (3 NDMM and 4 RRMM). Median OS was not reached. The 3-year OS was 88% (95% CI: 72–95%); 94% for NDMM and 81% for RRMM. The median PFS was 47 months (95% CI: 34 months- NE); 48 months for NDMM and 45 months for RRMM. The 3-year PFS was 78% (95% CI:51–91%) for NDMM and 57% (95%CI:30–77%) for RRMM. (Fig. 2). For RRMM patients, maintenance had a significant impact on the median PFS [49 months vs 9 months (p = 0.02)]. For NDMM, the use of maintenance had a clear trend but no statistically significant impact on PFS [median PFS 61 months vs 40 months (p = 0.38)]. Depth of pre-transplant response, categorized as CR/sCR, VGPR, PR and SD or PD, was significantly associated with PFS (p = 0.03). Multivariable analysis was not possible due to the small size of the patient cohorts.

Fig. 2
figure 2

Kaplan–Meier estimates of OS by timing of ASCT (a), PFS by timing of ASCT (b), PFS by maintenance for patients transplanted in the NDMM setting (c), and PFS by maintenance for patients transplanted in the RRMM setting (d)

Discussion

In this single-institution phase II interventional study, we confirmed that high-dose bendamustine in combination with standard high-dose melphalan was safe and effective in first transplant in both NDMM and RRMM. The observed complete response rate of the 35 patients enrolled was 51% (95% CI: 36–67%). High response rates resulted in excellent long-term outcomes, both in the NDMM setting with a median PFS of 48 months, and more remarkably in the RRMM setting, with a similar median PFS of 45 months.

A number of different agents have been tested in combination with melphalan conditioning. With acknowledgment that cross trial comparisons do not carry the same weight of evidence as a controlled study, the results of bendamustine/MEL200 conditioning can be put into context with previously performed trials. Table 4 summarizes the main conditioning studies for ASCT in MM with characteristics and outcomes reported. Initial combination of melphalan with oral/IV busulfan, cyclophosphamide alone or with idarubicin, showed a trend towards increased TRM or unacceptable toxicity [8, 11, 12, 15]. Combination with cyclophosphamide-topotecan-melphalan and mitoxantrone-melphalan showed response rates comparable to historical data [14, 17]. The most promising interventions in the RRMM setting have been with carmustine- bortezomib-thalidomide with median PFS of 28 months [18], high-dose lenalidomide with median PFS of 10 months [20], and with infusional gemcitabine with targeted-busulfan and melphalan with median PFS of 15 months [19]. Costa el al. published the results of carfilzomib with melphalan in the relapsed-setting with a 1-year PFS of 67% with carfilzomib maintenance [22]. For NDMM, a RCT of bortezomib-melphalan failed to show improved CR rates compared to MEL200, but mature data for PFS are pending [10]. A recently published RCT compared infusional busulfan-melphalan vs melphalan alone in NDMM and showed clear superiority of the combination arm with a median PFS of 64.7 vs 34.4 months [24]. Maintenance was used in both arms. The combination regimen had increased rates of non-hematological toxicity (febrile neutropenia, ALT elevation and mucositis) but there was no increase in TRM or SOS/VOD.

Table 4 A. Summary of selected interventional studies with intensified conditioning strategies. Study characteristics and outcomes reported. B. Characteristics of selected RCT trials listed as a historical reference for NDMM and RRMM

Our results should be carefully interpreted in the context ofhistorical results of melphalan conditioning. Specifically, for NDMM, median PFS of 48 months (61 months with maintenance) seems similar to the PFS of the transplant group of the IFM2009 study with VRd induction-consolidation (50 months from randomization prior to induction) [5], and to the PETHEMA/GEM Study with VTD induction (56 months from start of induction and with tandem-ASCT allowed) [48]. The limited number of patients, variable induction regimens and maintenance regimens used, and differences in study design preclude formal comparison. For example, it is notable that the control arm of the infusional busulfan-melphalan vs melphalan had a particularly short PFS of 34.4 months [24]. On the other hand, in RRMM with ASCT performed in the delayed setting, median PFS of 45 months seems promising when comparing to other intensified conditioning studies, as well as when compared to the reported PFS for newer therapies (i.e daratumumab, carfilzomib, pomalidomide), also listed in Table 4 [49,50,51,52,53,54,55,56,57,58,59,60,61,62,63]. Bendamustine/melphalan conditioning followed by maintenance may become a particularly cost-effective strategy for the management of MM at relapse for patients who did not receive ASCT upfront.

The efficacy of this regimen supports prior observations that bendamustine has anti-myeloma activity in patients with disease refractory to conventional alkylator chemotherapy. Single agent activity was initially shown in patients with progressive disease after ASCT [33]. Bendamustine-prednisone was then found to have higher potency than melphalan-prednisone as an induction regimen in NDMM where CR rates were doubled (32% vs 13%) and longer PFS were reached [64]. Different combination therapies of bendamustine have also shown clinical activity in heavily treated patients [35,36,37,38,39]. The efficacy and distinct toxicity profile of bendamustine provided the rationale for the combination with high-dose melphalan in multiple myeloma patients undergoing ASCT and explains the promising outcomes observed.

Limitations of this study include the relatively small cohort size, heterogeneity in patient characteristics, treatment history, variability in maintenance approaches and low number of advanced-stage patients. Furthermore, this early-phase study started in 2011 and was conducted during a time of rapid change in myeloma treatment. Nevertheless, we believe the excellent tolerance, encouraging response rates and duration of response justify a prospective randomized trial in more selected populations with a defined timing of transplantation and post-transplant approaches.

Conclusion

In summary, high-dose bendamustine with melphalan showed favorable safety and encouraging efficacy as an ASCT conditioning regimen in MM. CR/sCR was achieved in 51% of the patients at day + 100. This resulted in excellent long-term outcomes both in the NDMM setting, with a median PFS of 48 months, and most impressively in the RRMM setting, with a median PFS of 45 months. Based on our findings, further investigations of the role of this conditioning strategy are warranted. A RCT in the RRMM setting with MRD negativity as a surrogate outcome is currently planned.