Introduction

Allogeneic hematopoietic stem cell transplantation (allo-HSCT) has become an important and accepted therapeutic treatment for malignant disorders. Hematopoietic stem cells can be obtained from bone marrow through multiple aspirations or alternatively from peripheral blood after stimulation with granulocyte colony-stimulating factor (G-CSF). G-CSF has been indicated to promote the mobilization and differentiation of hematopoietic stem cells and is now a widely used scheme in HSCT [1]. However, the mobilization efficiency of G-CSF varies individually; that is, poor mobilization occurs in 25% of patients with hematological malignancies and in 10–20% of healthy donors [2].

G-CSFR is a highly specific receptor that binds G-CSF and induces a series of signaling cascades, leading to different biological events [3, 4]. Genomic sequencing of the CSF3R locus has identified many intronic polymorphisms and four synonymous coding mutations [5], therefore discrepancies in the CSF3R gene, whether caused by SNPs or mutations, that have been proven to be associated with an abnormal response to G-CSF treatment [6,7,8]. However, most current studies focus on the effect of CSF3R polymorphisms on the occurrence or progression of disease in patients [9]. Donor-derived hematopoietic stem cells are the products responding to G-CSF treatment. Bahar et al used PCR-RFLP technology to study whether differences in CSF3R SNP loci of healthy donors and hematological tumor patients would affect the enrichment of peripheral blood CD34+ cells, and the results showed that different SNP loci have different mobilization efficiencies for different individuals. Among them, healthy donors carrying the rs3918018 A/A genotype had significantly less efficient CD34+ collections in the first two attempts than donors carrying the G/G genotype [10], therefore, CSF3R polymorphism may directly affect the prognosis of transplantation via regulation of CD34 mobilization. However, no reports clarified the specified effect of CSF3R polymorphisms on clinical outcomes of patients post allo-HSCT. Thus, the aim of the study was to screen CSF3R SNPs and therefore explore whether donor CSF3R SNPs can affect the clinical outcomes of patients post allo-HSCT.

Materials and methods

Patients

We performed a retrospective observational study to investigate the impact of donor CSF3R polymorphisms on the clinical outcomes of acute leukemia patients who underwent HLA-matched sibling HSCT or unmanipulated haploidentical allo-HSCT at the Peking University Institute of Hematology from April 2014 to April 2015. A total of 273 patients between the ages of three to sixty-two years old were enrolled in the present study, and all cases were diagnosed based on the World Health Organization diagnosis criteria. The study was conducted in accordance with the Declaration of Helsinki and was approved by the Institutional Review Board of Peking University.

All patients provided informed consent for the treatment under a protocol reviewed and approved by the Peking University Institute of Hematology. The patient and transplantation characteristics are summarized in Table 1.

Table 1 Subject characteristics and outcomes (N = 273).

Sequencing of CSF3R single-nucleotide polymorphisms (SNPs)

Sequencing was performed using the Ion Torrent platform. CSF3R gene exon region and exon-intro junction region primers were designed by Ampliseq design. The Ion Ampliseq Library Kit, 2.0(Thermo Fisher) was used for library construction. The Ion PGM Hi-Q OT2 Kit and Hi-Q Sequencing 200 Kit reagents were used to perform sequencing. Then, sequencing BAM files and variant caller files were acquired. We then performed data screening and judged the data quality based on mutation frequency, sequencing quality value, forward and reverse sequencing depth and chain bias. Then, SNP sites were screened by population frequency, mutation type and mutation function.

Conditioning regimen and stem cell collection

All patients received a myeloablative conditioning regimen (MAC) without in vitro T-cell depletion. The conditioning regimens used in our center were based on BU and total body irradiation (TBI). For patients who received haploidentical HSCT, the conditioning regimen was modified to include the following: BUCY+ATG (thymoglobulin) consisting of cytarabine (4 g/m2 per day in haploidentical HSCT) intravenously on days −10 to −9; busulfan (3.2 mg/kg per day, intravenously on days −8 to −6); cyclophosphamide (CY, 1.8 g/m2 per day) intravenously on days −5 to −4; Me-CCNU (250 mg/m2), orally once on day −3; and thymoglobulin (ATG, Sang Stat, Lyon, France; 2.5 mg/kg per day) intravenously for 4 consecutive days from days −5 to −2. Patients undergoing HLA-matched sibling donor HSCT received hydroxycarbamide (80 mg/kg) orally on day −10 and a lower dose of cytarabine (2 g/m2 per day) on day −9, but otherwise, these patients followed an identical regimen to haploidentical patients, without ATG. The TBI-based conditioning regimen was composed of the following: TBI (770 cGy for a single dose) on days −6 and cyclophosphamide (CY, 1.8 g/m2 per day) intravenously on days −5 to −4; and Me-CCNU (250 mg/m2), orally once on day −3. ATG was used for patients who received haplo-HSCT, and the dosage and time were consistent with the above description. All subjects received fresh granulocyte colony-stimulating factor (G-CSF)-mobilized bone marrow (BM) and peripheral blood cells. Patients also received prophylactic drugs to prevent infection by fungi and viruses.

GVHD prophylaxis

The prophylaxis for acute GVHD (aGVHD) was composed of cyclosporine, mycophenolate mofetil (MMF), and short-term methotrexate (MTX). The discontinuation time of MMF was different according to the transplantation type: (1) MMF was withdrawn when (White blood cells, WBCs) engrafted in patients who received matched related transplantation (MRD) within 30 days post-HSCT; and (2) MMF was withdrawn on day 45 post-HSCT in patients who received mismatched related/haploidentical transplantation.

Definitions and endpoints

Neutrophil engraftment was defined as an absolute neutrophil count of 0.5 × 109/L or more for 3 consecutive days. Platelet engraftment was defined as a platelet count of 20 × 109/L or more for 7 consecutive days without transfusion. Acute and chronic GVHD were diagnosed and graded according to established criteria. Risk stratification was differentiated into two groups: AML and ALL in first or second remission (CR1 or CR2), chronic myeloid leukemia (CML) in the first or second chronic phase (CP1 or CP2), and MDS with refractory anemia with or without ringed side oblasts were defined as the standard-risk group, and other malignant diseases were defined as the high-risk group. Overall survival (OS) times were measured from the date of HSCT until death from any cause. Leukemia-free survival (LFS) was defined as the time from transplantation to relapse or death from any cause. Surviving patients were censored on the date of their last follow-up. Transplantation-related mortality (TRM) was defined as death after allo-HSCT without disease progression or relapse. Relapse was defined by the appearance of morphological evidence of the disease obtained from testing samples from the peripheral blood, bone marrow, or extramedullary sites or by recurrence and the sustained presence of pre-transplantation chromosomal abnormalities.

Statistical analyses

The Mann-Whitney U test was used to compare continuous variables, and the chi-square test or Fisher exact statistic was used to compare categorical variables. Cumulative incidence curves were used in a competing risk setting, with TRM related as a competing event, to calculate relapse probabilities. The probabilities of LFS and OS were estimated with the Kaplan-Meier method. All variables in Tables 1 and 2 were included in the univariate cox analysis. Only variables with a P < 0.10, including PLT, rs3917980 and rs2275472 in CSF3R SNPs and disease risk, were included in a Cox proportional hazards model with time-dependent variables. P values were based on 2-sided hypothesis tests. The alpha level was set at .05. All data analyses were performed with the R Foundation for Statistical Computing 3.5.3 using the packages cmprsk, survival, surviminer.

Table 2 Different donors’ CSF3R SNPs and the percentages of polymorphic sites analyzed in this study.

Results

Clinical data

The demographic and clinical data of 273 patients post allo-HSCT are listed in Table 1. The average patient age at the time of the transplant was 29.71 years old. The M:F sex ratio was 171 (62.6%):102 (37.4%). Acute myeloid leukemia, affecting 156 (57.1%) patients, acute lymphoid leukemia, affecting 63 (23.1%) patients, and myelodysplastic syndrome, affecting 33 (11.9%) patients, were the most frequently found diseases. Among the patients, 254 had a standard risk status, and 19 had a high-risk status. At the time of allo-HSCT, 256 patients were in complete remission (CR), 2 were in partial remission, and 10 had disease activity. All patients received peripheral blood as well as bone marrow cells as their source of hematopoietic stem cells. Only 60 (22.0%) samples were HLA-matched from their relatives. A total of 146 (53.5%) patients developed acute GVHD, and 54 (19.8%) patients were found to have a chronic form. As of April 2020, fifty (18.3%) had relapsed, forty-three (15.8%) finally developed transplantation-related mortality, and 188 (67.8%) remained alive. The median follow-up time was 1799 days. The overall survival rate at five years was 67.8%, whereas the leukemia-free survival rate was 71.1%.

Frequency of polymorphisms

We analyzed 273 healthy donors for polymorphisms of the CSF3R gene, and the results showed that there are mainly six SNPs in CSF3R, which can be listed as rs2275471, rs78861150, rs3917981, rs3917980, rs150501885, and rs2275472.Among healthy donors, almost forty-percent had at least one CSF3R SNP polymorphism, and two SNP polymorphisms were found concomitantly in 32.96% of these donors, while only approximately 10% of healthy donors carried no SNP polymorphism. It is noteworthy that receipt from one CSF3R SNP locus donor accounts for the majority of AML and ALL patients. In order to roughly express the population distribution of donor CSF3R gene polymorphisms among transplant patients, we also associate the donor CSF3R gene polymorphisms with the disease classification of the patients. Further more details about the heterozygous genotype of the above SNPs and their distribution among healthy donors are listed in Table 2 and Fig. 1, respectively.

Fig. 1: The frequency of healthy donors’CSF3R SNPs distributed among the recipients with different diagnoses (n = 273).
figure 1

The inner cycle was composed of disease categories in this study, and the outer rings indicated the proportion of different cumulative CSF3R SNPs in the inner circle. 0,1,2,3 represent the sum of the number of CSF3R SNP loci from respective healthy donors among recipients with different diagnoses.

No association between donor CSF3R SNPs and engraftment of platelets and neutrophils

CSF3R SNPs from healthy donors or autologous transplantation from patients with hematologic malignancies have now been shown to be related to peripheral blood stem cell enrichment [9]. Therefore, we wanted to determine whether the distribution of CSF3R SNPs in this study correlated with CD34 enrichment. The results showed that no SNPs were associated with CD34 enrichment.

CSF3R is a special receptor for G-CSF signaling and thus plays an important role in neutrophil development, survival and function [11]. We therefore explored the associations of CSF3R SNPs with neutrophils and PLT engraftment. The results showed that no associations were observed between the genotype frequencies of these six CSF3R SNPs and the engraftment of platelets or neutrophils (data not shown).

Association of donor CSF3R SNPs with relapse and TRM

Relapse occurred in 50 patients, including 38 patients who died of relapse and 12 patients who remained alive after treatment. None of the rs2275471, rs78861150, rs3917981, rs150501885, or rs2275472 polymorphisms were associated with the rate of relapse except for the rs3917980G/G or A/G genotype was tentatively related (p = 0.082) (Fig. 2A). However, multivariate cox analysis found that rs3917980 showed no significant association (HR = 0.59; 95% CI: 0.246–1.40, p = 0.59) (Table 3 and Fig. 4A). Further subgroup analyses based on HLA loci showed that relapse rates had no association with rs3917980 in recipients receiving haplo-HSCT (p = 0.27), and none of those six SNPs were associated with relapse in the HLA-matched all-HSCT group (data not shown). Moreover, in this study, AML, MDS and CML were classified as myeloid leukemia, and ALL was classified as lymphocytic leukemia to elucidate whether there is a correlation between CSF3R SNPs and relapse in the above subgroup. The results showed that the rs3917980G/G or A/G genotype correlated with a lower incidence of relapse in the myeloid leukemia subgroup (p = 0.047) (Fig. 2B, n = 201).

Fig. 2: The associations of CSF3R SNPs with TRM and relapse.
figure 2

A The association between rs3917980G/G or A/G genotype with relapse rate. B The association between rs3917980G/G or A/G genotype with relapse rate in myeloid subgroup (n = 201). C The association between rs2275472T/C genotype with TRM. D, E The ass The association between rs2275472T/C genotype with TRM in HLA-mismatched (n = 213) and HLA-matched group (n = 60). F Cumulative analysis between platelet implantation and TRM.

Table 3 Univariate Cox analysis for clinical outcomes of allogeneic stem cell transplantation.

Regarding the association between CSF3R SNPs and TRM, only the rs2275472T/C genotype was associated with a lower incidence of TRM (Fig. 2C, p = 0.0031); and further multivariate cox analysis showed rs2275472T/C genotype was an independent risk factor for TRM (Fig. 4B). In subgroup analyses, this effect was confirmed in HLA-mismatched and HLA-matched recipients, as cumulative analysis showed that there was a significant association between the rs2275472T/C genotype and TRM (p = 0.043 and p = 0.032) (Fig. 2D, E), while no associations were found in any of the rs2275471, rs78861150, rs3917981, rs3917980, and rs150501885 CSF3R SNPs (p > 0.05).

In view of the close correlation between TRM and SNPs, we aimed to understand the reasons for the increasing incidence of TRM caused by the rs2275472T/C genotype. By analyzing correlations among post-transplantation infection, GVHD, platelet implantation, neutrophil implantation and SNPs, we found no evidence of an effect of the CSF3R SNP genotype on the incidence of aGVHD, and this result was further proven in all HLA-unmatched and HLA-matched patients. The genotypes of these six CSF3R SNPs did not influence the incidence of cGVHD and CMV (data not shown). However, the engraftment of PLTs was an independent protective factor against TRM (HR = 0.053, 95% CI: 0.024–0.11, p < 0.001) (Fig. 4B). Cumulative analysis showed that good platelet implantation was associated with lower transplantation-related mortality (p < 0.000) (Fig. 2F).

Association between donor CSF3R SNPs and patient survival

In univariate cox analysis, we found that rs3917980 and rs2275472 polymorphisms in donors were associated with the LFS and overall survival of recipients (Table 3), while none of the other CSF3R SNPs in donors were associated with LFS or patient survival. Although CSF3R SNPs had statistically significant trends with LFS and OS, the biological results of LFS and OS associated with these the rs3917980 and rs2275472 genotypes were completely opposite. Patients receiving stem cells from donors with the rs3917980G/G or A/G genotype tended towards increased LFS (82.8% versus 67.5%, p = 0.016) and OS rates (p = 0.044) (Fig. 3A, B), while those with the rs2275472T/C genotype trended towards lower LFS rates (p = 0.066) (Fig. 3C, D). Further subgroup analyses based on HLA loci showed no relations between CSF3R SNPs and LFS and OS in HLA-matched groups (p > 0.05); however, patients receiving stem cells from HLA-mismatched donors with the rs3917980G/G or A/G genotype showed better LFS and a trend towards a higher OS rate (p = 0.015 and p = 0.098, respectively) (Fig. 3E, F). Notably, no association was found between the rs2275472T/C genotype and OS in total HLA-unmatched recipients (p = 0.42), and patients receiving stem cells from HLA-unmatched donors with the rs2275472T/C genotype had no associations with incidence of LFS rates and OS (p > 0.05) (data not shown).

Fig. 3: The association between overall survival and LFS probabilities and rs3917980 and rs2275472 polymorphisms.
figure 3

A, B The association between rs3917980G/G or A/G genotype with LFS and OS rates. C, D The association between rs2275472T/C genotype with LFS and OS rates respectively. E, F The association between rs3917980G/G or A/G genotype with LFS and OS rates in HLA-mismatched group (n = 213).

The multifactor Cox analysis showed that no significance was found for any CSF3R SNPs for predicting LFS or OS (p > 0.05), while the engraftment of PLTs was a protective factor for LFS and OS and disease risk was a risk factor for LFS and OS (p < 0.001, Fig. 4C, D).

Fig. 4: Multifactor Cox regression of different clinical outcomes.
figure 4

A Multifactor Cox regression of relapse, B Multifactor Cox regression of TRM, C Multifactor Cox regression of LFS, D Multifactor Cox regression of OS.

The rs3917980 genotype was associated with better LFS and OS in patients post allo-HSCT

As rs3917980 and rs2275472 SNPs showed a completely opposite effect on clinical prognosis after transplantation in univariate cox analysis and approximately 33% of the healthy donors carried two SNPs, we speculated whether the biological effects caused by the above SNPs would be weaker than those caused by a single SNP. Then, we divided all patients into the following four groups according to whether their corresponding donors contained two SNPs: rs3917980G/G or A/G genotype single-positive group, rs2275472 T/C genotype single-positive group, double-positive group and double-negative group. Because the number of positive patients was only 4 out of 273, we created a group with double-negative SNPs. Cumulative analysis of LFS and OS showed that recipients receiving grafts from healthy donors with the rs3917980 G/G or A/G genotype had higher LFS rates than those who received grafts from healthy donors with the rs22754272 T/C genotype (p = 0.037) (Fig. 5A), while the LFS and OS rates of the wild-type genotype in the group with double-negative SNPs were below the values of the above groups, further proving that the rs3917980 G/G or A/G genotype may be a strong protective factor for a better LFS; whoever no significant differences were found among the groups regarding to OS (p = 0.14) (Fig. 5B).

Fig. 5: Analysis of LFS and OS among CSF3R SNPs.
figure 5

A Cumulative analysis of LFS among the three groups. B Cumulative analysis of OS among the three groups.

Discussion

In this paper, we would like to emphasize the following: (i) the CSF3R polymorphism rs2275472 in donors with the T/C genotype has a negative impact on the risk for TRM and LFS of patients post allo-HSCT; however, the rs3917980G/G or A/G genotype is considered a protective factor for LFS, and (ii) low platelet recovery identifies patients at greater risk for a poor prognosis.

The first observation that the two different SNPs of CSF3R in healthy donors have opposite effects on the clinical outcomes of recipients after allo-HSCT, which is perhaps not well documented in published studies. Previous studies have shown that CSF3R polymorphisms caused by mutation or alternative splicing lead to increased or decreased signaling cascades in response to G-CSF, leading to abnormal clinical outcomes [9]. CSF3R is widely distributed in hematopoietic cells, therefore, we hypothesize that as the donors’ hematopoietic stem cells gradually chimerize in recipients, different SNPs are likely to mediate signal cascades, eventually leading to diverse clinical outcomes post allo-HSCT. Our further subgroup analysis results suggest that patients receiving grafts from healthy donors with the rs3917980 genotype have higher LFS rates than those receiving grafts from donors with the rs227542 genotype, suggesting that CSF3R site screening in healthy donors may be used as an indicator to predict the outcome of patients. However, more research is needed. No one has discussed the potential effects of these two SNPs on clinical outcomes so far. Regarding why the CSF3R SNP site in the intron region can affect the prognosis of transplantation, we speculated that this effect may be caused by the intron region affecting mRNA expression by changing the transcription efficiency, splicing or maturation of mRNA [12]. Our next step may focus on verifying whether SNP site can affect the transcription and protein expression of the CSF3R gene by interfering with the intron or exon zone.

The second result of our study was that multivariate analysis showed that poor platelet recovery was a negative predictor of transplant-related mortality, leukemia-free survival and overall survival. This has been elucidated in various reports [13,14,15]. The reason for the above results may be that poor platelet implantation is related to the occurrence and development of acute and chronic GVHD, which leads to high TRM, low OS rates and better LFS [16]. A previous study elucidated that CSF3R mutations in AML patients were associated with significantly lower platelet counts [17]. In this study, we found no association between platelet implantation and SNPs of CSF3R, and further studies are still needed to explore the clinical characteristics of patients receiving grafts from healthy donors with different CSF3R SNPs due to the relatively small number of participants included in this study.

G-CSF is an important regulator cytokine for the development and function of neutrophils, and most studies have previously proven that mutations of CSF3R lead to increased or decreased numbers of neutrophils because of disorder of the CSF3R signaling cascade [18]. However, regarding the correlation of CSF3R SNPs with granulocytes, no reports have been found. In this study, we found no association between CSF3R SNPs and neutrophil recovery. Further large prospective studies are needed.

In summary, we have reported that two CSF3R polymorphisms in healthy donors predicted completely opposite results in recipients, indicating that a comprehensive assessment of donor CSF3R may be valuable in patients receiving allo-HSCT. Obviously, these data must be validated in larger patient cohorts, and the pathophysiological functionality of these polymorphisms should be further elucidated in mechanistic in vivo and in vitro models.