Introduction

Adult acute myeloid leukemia (AML) patients with intermediate or high-risk features often undergo allogeneic hematopoietic cell transplantation (alloHCT) during first complete remission (CR1). Due to competing risks of relapse and non-relapse mortality (NRM), most patients with favorable genomic risk stratification do not benefit from alloHCT in CR1 [1, 2]. What remains less clear is which other patients in morphologic CR derive benefit from the procedure. The 2017 European LeukemiaNet AML guidelines categorize morphologic CR (<5% marrow blasts) according to whether it is accompanied by blood count recovery (CR rather than CR with incomplete count recovery, or CRi) or presence of measurable residual disease (MRD) [3].

CRi has been associated with an increased risk of relapse in AML patients receiving chemotherapy in several retrospective analyses [4, 5], although one study observed pre-HCT blood counts did not affect post-HCT outcomes [6]. The presence of MRD, commonly assessed by multiparameter flow cytometry (MFC) and molecular methods, is generally accepted as leading to increased relapse risk and decreased likelihood of survival regardless of receipt of HCT [4, 7,8,9,10]. This information has led some physicians to recommend against HCT given MRD or responses less than CR. Nonetheless, alternatives to HCT are also unsatisfactory in such patients. The graft-vs.-leukemia effect of HCT seems beneficial in patients in CR, with or without MRD [11, 12].

In this analysis, we used data collected through the Center for International Blood and Marrow Transplantation Research (CIBMTR) registry to examine the relative roles of pre-HCT blood counts and presence or absence of MRD in determining post-HCT outcomes.

Methods

Data source

Study data were obtained from the CIBMTR registry, a voluntary network of over 450 blood and marrow transplant centers in the USA and around the world. Participating centers contributed transplant-related information to the central data management and statistical centers at the Medical College of Wisconsin in Milwaukee and “Be the Match” in Minneapolis, Minnesota. As mandated for observational research conducted by CIBMTR, this study adhered to federal regulations for the protection of human research subjects. Protected health information was collected and maintained in CIBMTR’s capacity as a Public Health Authority under the HIPAA Privacy Rule.

Patients

Eligible cases were identified from 7346 adults with AML who underwent first alloHCT from any donor source between January 1, 2007 and December 31, 2015. We excluded patients who at time of transplant were classified as primary induction failure (n = 736), CR2 (n = 1438), CR3 or beyond (n = 124), relapse (n = 758), or missing disease status (n = 10), leaving a population of 4280 patients in CR1. We also excluded patients with syngeneic twin donors (n = 16), without appropriate data on comprehensive research forms (CRF; n = 435), without consent for data analysis (n = 45), or from embargoed centers (n = 78). We further excluded patients with AML transformed from MDS (n = 749) and those with missing or contradictory data (n = 465), leaving a population of 2492.

We used standard CRFs for baseline characteristics including blood counts prior to alloHCT. Blood counts and MRD status were defined at the pre-alloHCT patient evaluation. We defined CR as absolute neutrophil count ≥ 1000/μl and platelet count ≥ 100,000/μl along with no peripheral blasts and <5% blasts on morphologic assessment of the bone marrow; patients also needed normal maturation of all cellular components in the bone marrow. CRi was defined as <5% blast percentage in the marrow as in CR, but peripheral blood neutrophils, platelets, or both remained below the above stated levels. MRD was assessed based on answers to qualitative CRF questions that ask if the patient is in molecular, cytogenetic, and/or MFC remission (see Supplementary Material for full operational definition).

Statistical analysis

The primary outcome was overall survival (OS), defined as time from HCT to death due to any cause, and secondary outcomes were NRM, defined as time to death without evidence of relapse, relapse, defined as the reappearance of at least 5% blasts on morphological/cytogenetic/flow/molecular evaluation in bone marrow, blood, or an extramedullary site as per the reporting center, and disease-free survival (DFS), defined as time to relapse or death due to any cause. The Kaplan–Meier method was used to estimate survival and cumulative incidence function was used to estimate relapse and NRM. Multivariable analysis (MVA) was performed using the Cox proportional hazards model to adjust for patient-, disease-, and transplant-related factors. The covariates considered in the Cox models included age at transplant, Karnofsky performance score, hematopoietic cell transplantion comorbidity index (HCT-CI), MRD at time of transplant, white blood count at diagnosis, cytogenetic risk group [13], time to achieve first CR, de novo vs. therapy-related AML, number of cycles of induction and consolidation prior to transplant, conditioning intensity (using standard CIBMTR operational definitions), type of donor, and year of transplant. All clinically relevant patient-, disease-, and transplant-related variables were considered in the Cox model, and those that were significant to corresponding outcomes were kept in the final model. Adjusted probabilities of DFS and OS and adjusted cumulative incidence curves of NRM and relapse were generated from final regression models stratified on CR vs. CRi and weighted averages of covariate values using pooled sample proportion as weight function. Interactions between main effect (CR vs. CRi) and all covariates were tested at a significance level of p = 0.01.

Results

Characteristics of study population

The study population included 2492 patients (CR, n = 1799; CRi, n = 693). Patients with CRi were more likely than those with CR to have a Karnofsky score <90 (39 vs. 33%) and an HCT-CI score of 3 or higher (47 vs. 40%), but other demographic variables were similarly distributed (Table 1 and Supplementary Table 1). The time to achieve remission, type of pretransplant therapy, and number of cycles of pre-HCT chemotherapy were similar. Most patients received 7 + 3 chemotherapy for induction (87.3%). However, positive MRD at the time of HCT was more common in the CRi group (18 vs. 12%, p < 0.001), as was older age (p = 0.02). CR patients were more likely to undergo myeloablative conditioning (62 vs. 53% in the CRi group, p < 0.001).

Table 1 Baseline characteristics, n (%) unless otherwise specified.

Effect of incomplete count recovery on outcomes

Multivariate analysis found that CRi was associated with a statistically significant increased risk of death with HR 1.27 (95% CI: 1.13–1.44) even after accounting for other associated covariates such as older age, non-favorable cytogenetics, lower Karnofsky score, higher HCT-CI, MRD, and higher WBC count at diagnosis (Table 2 and Fig. 1). Conditioning intensity was not independently associated with survival, suggesting increasing conditioning intensity may not be sufficient to abrogate the deleterious effects of CRi. Donor type did not differentially affect survival in the CR or CRi groups.

Table 2 Multivariable analysis results.
Fig. 1: Adjusted survival curves and hazard ratios for AML patients in CR vs. CRi prior to alloHCT are shown.
figure 1

These include overall survival (a), disease-free survival (b), cumulative risk of relapse (c), and non-relapse mortality (d).

A longer time to achieve CR1 was associated with shorter survival. In the 622 patients who did not achieve CR1 within 8 weeks, the HR for death was 1.32 (95% CI: 1.11–1.57) compared to those who achieved CR1 in ≤4 weeks, suggesting that slower recovery after chemotherapy may identify patients with less responsive disease as has been shown previously [14]. Presence of MRD at time of HCT was also independently associated with a higher HR for death of 1.52 (95% CI: 1.31–1.77) compared to absence of MRD (independent of CR vs. CRi status). The adjusted OS probabilities at 5 years post HCT, after accounting for factors from the MVA model, are 50% (95% CI: 47–52) for patients with CR and 43% (95% CI: 39–47) for patients with CRi. The deleterious effect of CRi was also seen with DFS, relapse, and NRM (Table 2, Fig. 1, and Supplementary Tables 2a, b, 3, and 4). Though data regarding peri-transplant infections are limited, it is notable that 12% of CRi patients had an infection requiring continuation of antimicrobial treatment after transplant day 0, compared to 6.7% of CR patients (p < 0.001). Although patients who received reduced-intensity or nonmyeloablative conditioning had less NRM in the MVA (HR of 0.73 (95% CI: 0.56–0.94)), they also had a statistically significantly higher risk of relapse than those who underwent MAC with TBI (HR: 1.69, 95% CI (1.39–2.05)).

Effect of MRD on outcomes, and interaction with CRi

MRD status was available in 2267 (91%) patients who were classified as: CR/MRD− (n = 1450), CR/MRD+ (n = 214), CRi/MRD− (n = 477), and CRi/MRD+ (n = 126). As expected, presence of MRD was associated with shorter OS, shorter DFS, higher NRM, and increased relapse compared to absence of MRD (Table 3 and Fig. 2). Notably, the effect of MRD was similar in those in CR and those in CRi and the unfavorable effect of CRi was the same regardless of MRD status. Pairwise interactions between the main effects (CR vs. CRi) and MRD status were not significant at a level of p < 0.01, demonstrating independently significant negative effects of CRi and presence of MRD. Older age was associated with a greater likelihood of positive MRD (p value < 0.001 using Pearson chi-square test).

Table 3 Pairwise comparisons of remission and MRD status.
Fig. 2: Adjusted survival curves and survival probabilities for AML patients in CR vs. CRi and MRD+ vs. MRD− prior to alloHCT are shown.
figure 2

These include overall survival, disease-free survival, cumulative risk of relapse, and non-relapse mortality.

Discussion

Analysis of this large CIBMTR cohort with 2492 patients demonstrates that survival outcomes differ significantly among AML patients in morphologic CR at the time of alloHCT. In our analysis, patients with CRi or MRD prior to HCT had worse outcomes than those with CR or without MRD, respectively; the negative effects on survival of incomplete count recovery and presence of MRD were independent. Most clinical trial reporting combines the endpoints of CR and CRi, but our analysis suggests that morphologic CR with fewer than 5% marrow blasts is an inadequate assessment of disease status and that both count recovery and MRD status also need to be considered. DFS was significantly lower in the CRi patients, who also showed a trend toward increased risk of relapse. Data from pediatric AML patients indicate that MRD detected by flow cytometry is more important than morphologic assessments in determining outcomes [15], and we may reach a point when sensitive methodology such as flow cytometry or molecular analyses take precedence over morphologic evaluation in adult AML patients as well [16].

In our cohort, CRi was not only associated with lower DFS; patients with CRi also had an increased rate of NRM, meaning that the negative effects of responses less than CR were not solely a sign of persistent disease that was more likely to relapse. The reasons for poorer survival outcomes in CRi patients are not fully elucidated by our data set. Rate of infection may be higher in patients with incomplete count recovery prior to HCT, since many of those patients would have had a prolonged duration of neutropenia prior to HCT; in fact, CRi patients were significantly more likely to have an infection requiring antimicrobial therapy past transplant day 0 than CR patients. However, infection may not fully explain the increased NRM rate in CRi patients. One study of 459 patients who underwent nonmyeloablative HCT indicated that depth of neutrophil nadir in the first 21 days after transplant was associated with higher NRM because of higher rates of GVHD [17]. In our study, due to incomplete information, we could not adequately evaluate the effect of any post-HCT interventions such as maintenance chemotherapy, which are increasingly being employed in high-risk patients.

Our retrospective study used a definition of incomplete count recovery prior to alloHCT as that was directly available in our data set. Importantly, we found that CRi on the pre-HCT assessment had significant survival implications and thus may be generalizable to the broad population of AML HCT recipients. The completeness of available MRD data was variable, though missing data were generally <10%. Additionally, CIBMTR CRFs collect only qualitative data and rely on the transplant centers’ testing methodology without centralized confirmation, so it is possible that MRD status would be interpreted and reported in different ways at different centers. Definitions and standards in MRD terminology remain a moving target in the AML field, though most experts would agree that flow cytometric and molecular assessments (at least of NPM1 and the core-binding factor fusion proteins) are well validated [18, 19]. We used a binary operational definition of MRD with any detectable disease identified as “positive” (further described in Supplementary Information); ideally, we would have had more granular information about methodology and cutoffs for positivity at the level of reporting centers [19]. High rates of false positive and false negative results with MRD testing should be taken into account when making decisions about referral to alloHCT [20]. Even given these limitations, the assessment of CR and MRD status utilized in our study (those performed immediately pretransplant) may be useful in decision-making as well as guidance about treatment options and prognosis at the time of alloHCT.

Overall, AML patients with CRi or those with presence of MRD at the time of the pretransplant evaluation have inferior outcomes after alloHCT compared to those in CR or those without MRD. Prognostic counseling should be offered to patients so that they are aware of the increased risks of both NRM and relapse following HCT though with limited other treatment options, alloHCT often remains the best available choice. In the future, inclusion of CRi and MRD status could strengthen prognostic models that evaluate the effect of disease-specific characteristics including cytogenetic risk and CR status on post-HCT outcomes, such as the disease risk index or the HCT-composite risk [21, 22].

Strategies to eliminate MRD prior to alloHCT seem appealing, but are unproven since it is unknown whether additional therapy to eliminate MRD can lead to improvement in post-HCT outcomes. Additionally, no drugs in AML have shown the promise of blinatumomab, which is approved for MRD-level disease in acute lymphoblastic leukemia. Furthermore, efficacy of MRD eradication pre-alloHCT is unknown; that is, presence of MRD following chemotherapy may denote more resistant AML, an unfavorable bone marrow microenvironment, or other unknown factor, any of which may predispose to worse posttransplant outcomes even if MRD is temporarily eradicated.

Our data suggest that MAC with TBI is associated with a decreased risk of relapse compared to NMA conditioning for patients with MRD prior to alloHCT, which is consistent with findings from previous retrospective analyses [23,24,25]. However, conditioning intensity did not seem to affect outcomes for patients with incomplete count recovery prior to alloHCT. Additionally, CR and CRi prior to HCT appear to define prognostic groups with significantly different outcomes, and these patients should be analyzed separately in future analyses. Further prospective studies should focus on limiting NRM and reducing relapse to optimize post-alloHCT outcomes for AML patients with CRi or MRD.