Introduction

Although treatment outcomes have improved in patients with acute myeloid leukemia (AML), relapse and relapse mortality rates remain high [1]. Chemotherapy-resistant leukemic stem cells (LSCs) residing in the bone marrow niche facilitate disease recurrence [2,3,4]. Granulocyte colony-stimulating factor (G-CSF) is widely used to support the recovery from bone marrow suppression after chemotherapy for hematological malignancies [5] and mobilization of hematopoietic stem cells [6,7,8]. The concurrent use of G-CSF with anticancer chemotherapy against AML can induce leukemic blasts [9], including quiescent LSCs residing in the bone marrow niche [10], into the synthesis phase and increase susceptibility to cell cycle-dependent drugs. This is called the priming effect of G-CSF and contributes to enhancing the antileukemic effect in vivo, improving the outcomes of AML treatment. Several investigators have conducted randomized controlled trials (RCTs) regarding this [11,12,13,14,15,16]. In contrast, a combination of G-CSF and antileukemic chemotherapy can induce severe bone marrow suppression because G-CSF switches the status of normal hematopoietic stem cells from dormancy to self-renewal and may sensitize them to anticancer agents [17]. The American Society for Clinical Oncology guidelines do not describe the use of G-CSF in combination with chemotherapy aimed at a priming effect. The National Comprehensive Cancer Network guidelines recommend (category 2B) an intensive combination chemotherapy regimen using fludarabine, cytarabine, and G-CSF (FLAG) with/without anthracycline for patients with AML aged <60 years for standard or adverse risk [18]. In Japan, there has been no systematic review on the use of a G-CSF priming regimen to improve the remission induction rate and outcomes of patients with AML, and the importance of G-CSF priming remains to be determined. Therefore, we performed a systematic literature review to evaluate the effects and safety of priming malignant blasts with G-CSF for AML, which will assist in updating clinical guidelines for G-CSF use.

Methods

Search strategy

A systematic review of the literature was performed according to the “Medical Information Network Distribution Service (MINDS) Handbook for Clinical Practice Guideline Development 2014” [19] and “MINDS Clinical Practice Guideline Development Guide 2017” [20] using PubMed, Ichushi-Web, and the Cochrane Library. The search terms used in the combination of Medical Subject Headings and keywords were as follows: “leukemia, myeloid, acute/drug therapy,” “granulocyte colony-stimulating factor,” “prevent*, prevention, and control,” “prophylaxis*,” and “first, initial, induction” in all fields. Two reviewers (Y.N. and T.M.) of the systematic review team independently performed the initial screening based on the titles and abstracts of all articles for ineligible reports, followed by full-text screening (i.e., second screening) according to the inclusion and exclusion criteria. The reasons for exclusion were recorded, and duplicates were removed. Disagreements were resolved by consensus among the coauthors. These articles were examined for quality reporting data related to the selection criteria outlined in the following section.

Selection criteria

The inclusion criteria of this study were as follows: studies (1) designed as an RCT, a non-RCT, and a cohort or case–control trial; (2) aiming at adult patients diagnosed with AML; and (3) that included patients in the treatment group who received standard intensive induction therapy or salvage high-dose intensity treatment. Studies assessing treatments with low-dose cytarabine or similar low-intensity chemotherapy were excluded. We also excluded guidelines, reviews, letters, abstracts without articles, laboratory studies, systematic reviews, meta-analyses, and case reports.

Data extraction and quality assessment

After the second screening, the reviewer (Y.N.) of the systematic review team evaluated the articles and extracted data using standardized data abstraction forms. The evidence indicated by individual studies related to critical outcomes included within the clinical questions posed by the guideline creation team was divided into groups based on study design and quality. The following eight outcome indicators were evaluated: (1) infection-related mortality; (2) overall survival (OS); (3) disease progression/recurrence; (4) improvement in remission induction rate (priming effect); (5) duration of neutropenia or thrombocytopenia; (6) incidence of secondary cancer; (7) adverse events, such as musculoskeletal pain; and (8) quality of life (QOL). The authors determined the outcomes of the Population, Intervention, Comparator, and Outcome (PICO) framework on both the benefits and harms of concomitant G-CSF use with chemotherapy. The leader (S.Y.) of the guideline creation team resolved the conflicts and questions. The level of evidence was evaluated not for individual references but for each outcome for studies grouped by study design. The certainty of evidence was assessed based on the risk of bias, inconsistency, imprecision, indirectness, and publication bias. The literature quality and body of evidence were evaluated using the Grading of Recommendations, Assessment, Development, and Evaluation (GRADE) approach and then classified into four levels: “strong,” “medium,” “weak,” and “very weak.”

Statistical methodology

Review Manager (The Cochrane Collaboration, London, UK) version 5.41 was used for statistical analyses. After a qualitative analysis using Excel, studies were considered eligible and included in the meta-analysis if they were designed to compare the use of G-CSF combined regimens for AML with a control group. The risk ratio (RR) for each endpoint was calculated, and the effect size was described as a 95% confidence interval (CI) for each study. They were calculated using fixed- or random-effects models, depending on the level of heterogeneity. A forest plot was used to graphically present the results of the calculated RR for individual studies and overall meta-analyses. The degree of heterogeneity was assessed using the I2 and Chi-squared-based Q tests. A p value <0.05 in the Z test was considered significant. A funnel plot was used to graphically investigate the potential publication bias.

Results

Literature search

The initial literature search included 274 results: PubMed, 196; the Cochrane Library, 1; and Ichushi-Web, 77 (date of search: March 23, 2020). An additional four articles were manually selected and added. Among the 278 articles obtained, 216 were excluded after screening for the following criteria: human participants only, publication date ranging from January 1, 1990, to December 31, 2019, publications in English or Japanese, and selection criteria outlined in the section above, yielding 11 articles (Fig. 1). In most cases, the primary reason for exclusion was participants’ eligibility.

Fig. 1
figure 1

Modified PRISMA flow diagram of the literature search and study selection process. Number of studies included in the meta-analysis of disease progression/recurrence (*1), remission induction rate (*2), and adverse events, such as musculoskeletal pain (*3), were shown. PRISMA, Preferred Reporting Items for Systematic Reviews and Meta-analysis

Studies selected for the meta-analysis

Eleven studies [11,12,13,14,15,16, 21,22,23,24,25], six RCTs and five case–control studies, were included in the descriptive qualitative analysis, of which six RCTs [11,12,13,14,15,16] were examined in the meta-analysis. The six RCTs were conducted between 1994 and 2016. Meta-analyses of the study findings on the duration of neutropenia or OS were not feasible because of differences in the treatment benefit and harm assessment measurements. Three RCTs [11, 13, 16] were ultimately selected for the meta-analysis of disease progression/recurrence, 6 [11,12,13,14,15,16] for the G-CSF priming effect on the remission induction rate, and 2 [12, 15] for adverse events, such as musculoskeletal pain. Four case–control studies also performed meta-analyses on disease progression/recurrence [22,23,24,25].

Relationship between each outcome using the PICO framework and the G-CSF priming in AML

Relationship between infection-related mortality and the G-CSF priming

One evaluable RCT included 640 patients with newly diagnosed AML aged ≤60 years who received standard-dose remission induction chemotherapy with (n = 321) or without (n = 319) G-CSF priming [11]. This was not a placebo-controlled RCT; thus, there was a risk of bias. No significant differences were observed in infection-related mortality between patients who received chemotherapy with and without G-CSF priming [RR, 1.83 (95% CI 0.79–4.52); p = 0.175]. Although the report is high quality, involving over 300 patients in both the control and intervention groups, the evaluation depends on a single paper. According to the GRADE approach, the quality/certainty of evidence for this outcome was “medium.”

Relationship between OS and the G-CSF priming

Data from five RCTs [11,12,13,14, 16] and five case–control studies [21,22,23,24,25] were included in the qualitative analysis. Overall, 4,626 patients were included in the five RCTs: two RCTs which included 2347 patients of all ages [13, 14], two RCTs on 1557 patients aged ≤60 years [11, 16], and one RCT on 722 older patients aged ≥61 years [12]. Four RCTs studied newly diagnosed AML [11, 12, 14, 16], and one RCT focused on relapsed/refractory AML [13]. All five RCTs concluded that there was no significant difference in OS between groups with and without G-CSF priming. However, among the subgroups of patients with AML at standard risk [11] or those who received high-dose Ara-C [16], OS was significantly better in the G-CSF priming arm than in the control arm. A meta-analysis of this outcome was not performed because of differences in effect measures.

In total, 687 patients were included in the five case–control studies: three studies on 510 newly diagnosed patients of all ages [21,22,23,24,25], one study on 71 relapsed/refractory AML [24], and one study on 106 patients with secondary AML [25]. Regarding intensity of intervention, two studies were designed to compare the asymmetry between purine analogs with high-dose Ara-C and standard-dose chemotherapy. Thus, there was a non-negligible bias among the evaluated studies.

In two studies on relapsed/refractory AML, patients who received G-CSF priming chemotherapy had significantly better OS than those in the control group [24, 25]. However, the effect on OS was inconsistent; there were no differences in OS between the intervention and control groups in other three studies [21,22,23]. Overall, the G-CSF priming did not affect patient survival. Although this can be beneficial in some selected groups, it is unclear because of inconsistencies between reports. The quality/certainty of evidence was “medium.”

Relationship between disease progression/recurrence and the G-CSF priming

Data from the following three RCT studies were included (n = 1913): two RCTs on patients aged ≤60 years (n = 1557) [11, 16] and one RCT on patients of all ages (n = 356) [13]. Two RCTs were conducted on newly diagnosed AML [11, 16], and one RCT on relapsed/refractory AML [13]. Combination treatment was standard remission induction therapy in one RCT (n = 640) [11], and two RCTs included randomization allocating high- or standard-dose cytarabine (n = 1273) [13, 16]. None of these RCTs was placebo-controlled; thus, there was a risk of bias. The relapse rate was significantly lower in the G-CSF priming group than in the control group (p = 0.04) in one RCT [11]; however, there was no difference in the other two studies [13, 16]. The result of meta-analysis revealed the tendency for lower relapse rate in the G-CSF priming groups without inter-study heterogeneity [RR, 0.91 (95% CI 0.82–1.01), p = 0.08; I2 = 4%, p = 0.35] (Fig. 2a). No apparent asymmetry was observed in the funnel plot (Fig. 2b).

Fig. 2
figure 2

Association between disease progression/recurrence and the G-CSF priming. (a) Forest plot shows treatment effects versus the study size estimated from the standard error of log (RR). Open circles indicate individual studies in this meta-analysis. The broken line is a pseudo 95% confidence interval of effect measures in the study. (b) Funnel plot showing the symmetrical distribution of studies indicating the absence of a publication bias. CI confidence interval, G-CSF granulocyte colony-stimulating factor, IV inverse variance method, RR risk ratio, SE standard error

In the four case–control studies (n = 581), patient characteristics varied: two were on older patients in their 80 s (n = 313) [21, 23], three studies on newly diagnosed AML (n = 510) [21,22,23] (one study was solely on the favorable chromosomal risk group (n = 114) [21]), and one study on relapsed/refractory AML (n = 71) [24]. As an intervention, one study included an asymmetric comparison of treatment arms with purine analogs and high-dose Ara-C vs. standard induction therapy [23]. Compared with the control group, patients in the G-CSF priming group had a lower cumulative incidence of relapse in one study [22] and longer relapse-free survival in two studies [23, 24]. Owing to the differences in the effect measures in each study, a meta-analysis could not be performed. Overall, G-CSF priming did not affect the progression. In some specific patient populations, G-CSF priming may be beneficial, but the results were inconsistent. Moreover, the background of the patient characteristics varied; thus, careful interpretation of this evaluation is required. The quality/certainty of evidence on this outcome was “strong.”

The G-CSF priming effect on remission induction rate

This effect was defined as complete remission induction rate. Data from six RCTs [11,12,13,14,15,16] on 4684 patients included various subjects: two studies included 2347 patients of all ages [13, 14], one study included elderly patients aged ≥61 years (n = 722) [12], two studies included those aged ≤60 years (n = 1557) [11, 16], and one study included those aged ≤66 years (n = 58) [15]. Four studies included 4270 patients with newly diagnosed AML [11, 12, 14, 16] and two included 414 patients with relapsed/refractory AML [13, 15]. For combination therapy with G-CSF priming, three studies used standard remission induction chemotherapy (n = 1420) [11, 12, 15] and three studies were randomized with high-dose cytarabine, standard chemotherapy, or other treatments (n = 3264) [13, 14, 16]. Only one study used a placebo control arm (n = 58) [15]; thus, there was some risk of bias. Only one out of six RCTs revealed the benefit of the G-CSF priming in remission induction rate [12]. The result of the meta-analysis had some heterogeneities, and there was no difference between the groups with and without G-CSF priming [RR, 1.03 (95% CI 0.96–1.10), p = 0.42; I2 = 55%, p = 0.05] (Fig. 3a). No apparent asymmetry was observed in the funnel plot; thus, publication bias was not evident (Fig. 3b).

Fig. 3
figure 3

G-CSF priming effect on remission induction rate. (a) Forest and (b) funnel plots based on the RCTs and (c) forest and (d) funnel plots based on the case–control studies. Although there were significantly few studies and insufficient variations in standard errors to assess whether funnel plots were symmetric, there was no asymmetry visible in any of the funnel plots (b and d). CI confidence interval, G-CSF granulocyte colony-stimulating factor, IV inverse variance method, RCTs randomized controlled trials, RR risk ratio, SE standard errors

The patient characteristics in the selected four case–control studies (n = 573) were heterogeneous: three studies on 502 patients with newly diagnosed AML of all ages [22, 23, 25] and one study on 71 patients with relapsed/refractory AML aged ≤60 years [24]. One study consisted of an asymmetric comparison between treatment arms using a purine analog with high-dose cytarabine and standard chemotherapy (n = 199) [23]. Thus, there was a risk of bias. The results of the meta-analysis revealed that the remission induction rate was significantly better in the groups using combination therapy with G-CSF priming [RR, 1.27 (95% CI 1.12–1.43); p = 0.0002], and the result was consistent (I2 = 3%, p = 0.38) (Fig. 3c). Although the possibility of publication bias was not excluded owing to the small number of studies, no apparent asymmetry in the funnel plot was detected (Fig. 3d). The effect of the combined use of G-CSF priming on treatment effectiveness was insignificant in the evaluation using RCT studies, but it was significant in that using non-RCT studies. Thus, there was a divergence between the results of these meta-analyses. Overall, remission induction rates were significantly higher in the G-CSF priming group than in the control groups in one out of the six RCTs [12] and three of the four non-RCTs [23,24,25]. Although the G-priming group can be beneficial in some specific groups, including older patients with newly diagnosed disease [12] or patients harboring adverse prognostic factors [22,23,24,25], the benefit of G-priming has been inconsistent between reports and remains unclear. The quality/certainty of evidence on this outcome was “strong.”

Relationship between the duration of neutropenia and the G-CSF priming

Data from four RCTs [11, 13, 15, 16] included various participants (n = 1971): two included patients aged ≤60 years (n = 1557) [11, 16], one included patients aged ≤66 years (n = 58) [15], and one included patients across all age groups (n = 356) [13]. Two studies included newly diagnosed AML (n = 1557) [11, 16], and two included relapsed/refractory AML (n = 414) [13, 15]. The intensity of the intervention varied: two studies performed standard chemotherapy (n = 698) [11, 15] and two studies were designed to compare the asymmetry between purine analogs with high-dose Ara-C and standard-dose chemotherapy (n = 1273) [13, 16]. The duration of neutropenia was significantly shorter in the G-CSF priming group in two studies (n = 414) [13, 15], not significantly different between the G-CSF priming and control groups in one study (n = 640) [11], and significantly longer in the G-CSF priming group in one study (only cycle 2) (n = 917) [16]. Because of the divergence in the effect measures in each study, a meta-analysis could not be performed.

Two case–control studies (n = 303) [22, 25] were asymmetric in terms of patient characteristics and intervention: one study assessed patients with newly diagnosed AML and was designed to compare treatment arms with purine analog and high-dose Ara-C vs. standard chemotherapy (n = 106) [25]. In both studies, the G-CSF priming group had a shorter duration of neutropenia than the control group. Regarding the harmful effects of the G-CSF priming strategy on neutropenia, the results of the evaluated studies were inconsistent. Moreover, in most studies, G-CSF administration was designed to continue from the start of combination chemotherapy until neutrophil recovery. Thus, the relationship between G-CSF priming and the duration of neutropenia remains unclear. The quality/certainty of evidence on this outcome was “medium.”

Relationship between the incidence of secondary cancer and the G-CSF priming

Although the outcome was set before starting the evaluation, we could not find any studies aimed at this outcome; thus, we concluded that this relationship was unevaluable.

Relationship between adverse events, such as musculoskeletal pain, and the G-CSF priming

The following two RCTs were included in the meta-analysis (n = 780): an RCT on 722 elderly patients with newly diagnosed AML [12] and 58 patients with relapsed/refractory AML aged ≤66 years [15]. Only the latter was a placebo-controlled study; therefore, there was a risk of bias. The result of the meta-analysis was not heterogeneous, which revealed that there was no significant difference in adverse events, including musculoskeletal pain, between patients who received chemotherapy with the G-CSF priming and those who did not [RR, 1.39 (95% CI 0.26–7.31), p = 0.70; I2 = 0%, p = 0.54) (Fig. 4a). Although the small number of studies was a limitation, the funnel plot indicated no publication bias (Fig. 4b). The quality/certainty of evidence was “middle.”

Fig. 4
figure 4

Adverse events, such as musculoskeletal pain, associated with G-CSF priming. Forest (a) and funnel (b) plots of the RR of adverse events, such as musculoskeletal pain, comparing the G-CSF priming and control study arms for each study. Although there were remarkably few studies and insufficient variations in SE to assess whether funnel plots were symmetric, there was no asymmetry visible in the funnel plots. CI confidence interval, G-CSF granulocyte colony-stimulating factor, IV inverse variance method, RCTs randomized controlled trials, RR risk ratio, SE standard errors

Relationship between the QOL and the G-CSF priming

Although the outcome was initially set, there was no study evaluating this outcome. Thus, we concluded that this relationship could not be determined.

Discussion

In this review, we did not find a significant improvement in the OS and remission induction rates associated with the G-CSF priming strategy through RCT evaluation. However, the results of the meta-analysis of the three RCTs aimed at newly diagnosed AML on disease progression/recurrence revealed a tendency toward a lower relapse rate in the G-CSF priming group (Fig. 2a). Through the subgroup analysis of RCTs, patients in the standard-risk group [11] or those receiving high-dose cytarabine [16] showed a significant benefit in OS from G-CSF priming, although the reasons underlying this survival improvement remain unclear. Furthermore, the evaluation of two case–control studies for relapsed/refractory AML [24] or newly diagnosed secondary AML [25] revealed that G-CSF priming was associated with better OS. Moreover, meta-analysis of four case–control studies showed a significantly better remission induction rate in the G-CSF priming group than in the control group (Fig. 3c). Consequently, although some patients had survival benefits from G-CSF priming, the exact patient groups that benefited from this strategy are unclear. In terms of remission induction, the G-CSF priming group had better response rates than the control group in one RCT which only enrolled patients aged ≤61 with newly diagnosed AML [12], in a case–control study for secondary newly diagnosed AML [25], and in the specific subgroup harboring adverse chromosomal abnormalities in another case–control study [22]. Following standard remission induction regimens, patients in these subgroups tended to have lower response rates than those in the favorable/intermediate risk groups or younger populations; this may explain why the effects of G-CSF priming were more evident in unfavorable risk subgroups. Based on the above, there is room for future prospective evaluation.

Recent progress in the biology of AML has revealed that the mechanism of disease relapse can be caused by an increase in leukemic blasts originating from LSCs [3, 4]. Quiescent LSCs residing in the bone marrow niche have low chemosensitivity and survive through the lines of consolidation [2]. G-CSF priming sensitizes most leukemic blasts [26]. Moreover, it induces cell cycle quiescent LSCs into the cell cycling state, potentiates their sensitivity to cell cycle-dependent traditional antileukemic agents, and significantly enhances the induction of apoptosis and elimination of LSCs [10].

Although beyond the scope of the current evaluation, G-CSF priming with the most intensive treatment, the conditioning regimen before hematopoietic stem cell transplantation, is associated with better outcomes [27,28,29,30]. A nationwide prospective study is ongoing [31]. In contrast, several groups have reported the effectiveness of G-CSF priming combined with low-dose cytarabine or other low-intensity chemotherapeutic agents, including low-dose cytarabine and aclarubicin (CAG) [32] or CAG with decitabine [33]. Moreover, a recent study involving unfit or relapsed/refractory AML reported promising results of the combination with venetoclax, hypomethylating agents, and half-dose CAG [34], while interactions between these agents and the G-CSF receptor signal transduction pathway remain uncovered. Evaluation of G-CSF priming with these factors was not performed in this study. The G-CSF priming strategy in these settings, combined with a high-intensity preparative conditioning regimen or less toxic chemotherapy, awaits evaluation.

The current study revealed that G-CSF priming with remission induction treatment was not associated with an increase in the duration of neutropenia, incidence of infection-related mortality, disease progression/recurrence, and adverse events. Notably, most studies sequentially administered G-CSF as prophylaxis for neutropenia after the concomitant use of G-CSF with chemotherapy; thus, the true effect on prolonged period of neutropenia cannot be strictly evaluated.

It is important to consider Japan’s recent public approval of this strategy. The Japan Adult Leukemia Study Group conducted a phase II study to evaluate the efficacy of FLAG with a mitoxantrone regimen and revealed that it was an effective and safe salvage therapy to achieve complete remission in 73% of patients with relapsed/refractory AML [35]. Based on these evaluations, on February 4, 2022, a pre-evaluation of the public knowledge-based application (Kouchi-shinsei) of the combined use of G-CSF for chemotherapy in relapsed/refractory AML was completed. It was approved that lenograstim and filgrastim in combination with anticancer chemotherapy using fludarabine and cytarabine could be covered by Japanese insurance. Thus, the G-CSF priming regimen is now more prevalent in Japan than in the past.

In the current study, we evaluated the outcomes of G-CSF priming, including the response rate, OS, and relapse rate through 6 RCTs. Four of the included RCTs studied newly diagnosed AML (n = 4270) [11, 12, 14, 16], while two focused on relapsed/refractory cases (n = 414) [13, 15]. Considering that G-CSF priming for AML is currently approved only for relapsed/refractory cases with FLAG-based intensive regimens, further studies are required to determine whether G-CSF priming is also beneficial for newly diagnosed AML in Japan.

This systematic review and meta-analysis has some limitations. Patient characteristics, disease status, and treatment intensity varied. Moreover, we included articles from a wide range of published years; thus, the efficacy of supportive care may have influenced the response rate and survival. In addition, although we assessed the publication bias through funnel plot analyses and did not detect any bias, the number of studies included was small.

Conclusions

The current evaluation cannot confirm a clear benefit of G-CSF priming with concurrent chemotherapy in all patients with AML. However, the benefits of this strategy in improving the response rate and OS have been suggested for some specific subgroups. Further studies are required to identify the patient cohort for which G-CSF priming is recommended.