Introduction

Myelodysplastic syndromes (MDS) represent a heterogeneous group of malignant hematopoietic disorders that predominantly affect the elderly [13]. In patients suffering from MDS, the myeloid cells in the bone marrow are characterized by uni- or multilineage dysplasia, which is often accompanied by hyperplasia and enhanced apoptosis. This ineffective hematopoiesis may result in one or more peripheral cytopenias. Furthermore, approximately 30% of patients develop acute myeloid leukemia (AML). Due to considerable differences in the number and severity of cytopenias and in the propensity of developing AML, the median overall survival of distinct patient groups may vary from 5 months to ≥9 years [4]. The diversity in the clinical presentation of MDS emphasizes the need for tailored treatment of patients. Current therapy options comprise supportive therapy (transfusions, antibiotics), growth factor therapy (erythropoietin, granulocytic-colony stimulating factor), chemotherapy (cytarabine with/without anthracycline), immunotherapy (lenalidomide, thalidomide, anti-thymocyte globulin, cyclosporin A), epigenetic therapy (azacitidine, decitabine) and stem cell transplantation (SCT) [2, 3]. To date, SCT is the sole treatment that may result in cure; however, due to high toxicity of the treatment and the advanced age of many MDS patients it can only be applied in a subset of cases. Assessing the appropriate treatment for each patient requires accurate diagnosis, classification and prognostication of the diverse syndromes. This is based on bone marrow morphology, blast count, peripheral blood cytopenias and the karyotype of the malignant clone. Various systems that take these parameters into account have been developed and can be used to stratify patients with different clinical behavior into distinct categories. Systems comprise the International Prognostic Scoring System (IPSS), the World Health Organisation (WHO) classification system, and the WHO classification-based prognostic scoring system (WPSS) [13]. Recurrent aberrant karyotypes, which are incorporated in these systems, cannot entirely account for the genetic defects that are at the basis of the pathogenesis of MDS, as they are detected only in approximately 50% of patients [4]. A more accurate prediction of clinical outcome may require the incorporation of small genetic defects that have been revealed by the application of high-resolution SNP-arrays, Sanger sequencing, and whole exome sequencing [57]. Affected genes encode signal transduction proteins (BRAF, CBL, GNAS, JAK2, KRAS, NRAS and PTPN11), transcription factors and cofactors (ETV6, NPM1, RUNX1 and TP53), cell cycle regulators (CDKN2A and PTEN), components of the RNA splicing machinery (PRPF40B, SF1, SF3A1, SF3B1, SRSF2, U2AF35, U2AF65 and ZRSR2), and epigenetic regulators (ASXL1, DNMT3A, EZH2, IDH1 and IDH2, and TET2). Epigenetic regulators influence gene expression by methylation of DNA and modification of the DNA-associated histone proteins (see Box 1) [8]. The discovery of mutations in these epigenetic regulators might be in line with observations describing both deregulated DNA methylation patterns in MDS, which correlated with overall survival, and clinical response to hypomethylating agents, which was associated with a decrease in DNA methylation [912]. It will be interesting to investigate whether clinical responses also correlate with the presence of mutations in epigenetic factors. We will summarize the current knowledge about the mutated epigenetic factors in this review.

Box 1  

The ASXL1 protein

The epigenetic regulator ASXL1 (Fig. 1a) displays a dual function in that it mediates both gene repression and gene activation, depending on the cellular context [1315]. Gene regulation by ASXL1 in mammalian cells is exerted by interaction of ASXL1 with nuclear receptors, like RARα and PPARγ, upon ligand induction (Fig. 1a). Subsequent gene silencing involves complex formation with the epigenetic regulators HP1 and LSD1 [13, 14], whereas gene activation involves both the C-terminal PHD domain of ASXL1 and the interaction of ASXL1 with the histone methyltransferase SRC1 [15] (Fig. 1a). The human ASXL1 gene is located on chromosome 20q11.21. Although chromosome 20q may be affected by deletions (del(20q)) in MDS, ASXL1 does not reside in the commonly deleted region [4, 16]. However, truncating ASXL1 mutations, which are predominantly heterozygous and monoallelic, occur in 16% of patients with MDS and are closely associated with RAEB-1 (refractory anemia with excess blasts-1) and RAEB-2 (23 and 30% of patients, respectively); besides, ASXL1 mutations occurs in 22% of patients with secondary AML post-MDS [1720]. In line with this distribution is the observation that the presence of ASXL1 mutations showed a correlation with progression of MDS to AML [20]. In addition, the presence of ASXL1 mutations correlated with poor overall survival in various patient cohorts, and multivariate analyses indicated that ASXL1 mutations represent an independent poor risk factor for overall survival [5, 20]. These results might be of importance for future prognostication, especially since most ASXL1 mutations were detected in lower-risk (low and intermediate-1 risk) patients [5]. It is tempting to speculate that patients harboring ASXL1 mutations might benefit from epigenetic therapy since disruption of this epigenetic factor might confer an aberrant epigenetic profile. However, to date, both the impact of ASXL1 mutations on the epigenetic profile and the contribution of these mutations to the pathogenesis of MDS remain unknown. Notably, the adverse effects of ASXL1 mutations on prognosis are opposite to the correlation of isolated del(20q) with good prognosis [3, 4], suggesting that del(20q) and mutations of ASXL1 represent different pathogenic events in MDS. This is consistent with the observation that ASXL1 mutations (n = 63) co-occurred with 28% of isolated del(20q) deletions in a cohort of 439 MDS patients [5]. Of importance is that all ASXL1 mutations described in this cohort were heterozygous, including the ones that co-occurred with del(20q). This observation confirms the previous observations [16] that the ASXL1 gene does not reside in the commonly deleted region on chromosome 20q. As ASXL1 mutations and del(20q) seem to represent different events, this might indicate that ASXL1 mutations result in dominant-negative functions or gain of functions rather than loss of function; however, haploinsufficiency cannot be excluded at present. Mutations of ASXL1 also occurred in the context of diverse isolated cytogenetic abnormalities (30% of chromosome 7/7q deletions, 17% of trisomy 8 and 9% of del(5q)), and co-occurrence was seen with 20% of TET2 mutations, 47% of IDH mutations and 50% of EZH2 mutations, suggesting that these epigenetic factors might display complementary functions in epigenetic regulation and in the pathogenesis of MDS. To examine possible effects of the ASXL1 mutations, we will highlight some features of the discovered mutations. Mutations of ASXL1 affect the longest isoform (Fig. 1a) and target the last two exons, encoding amino acids 361–1541 [1720]. One of the most frequently occurring, but disputed, mutations observed in these cohorts was the p.G646WfsX12 frameshift mutation. At present, the status of this mutation is uncertain, since it was claimed to be a PCR-artifact [21]. However, large-scale analysis of 439 MDS samples by Bejar et al. [5] revealed just one p.G646WfsX12 mutation, while the total percentage of ASXL1 mutations in this study was similar to the percentage reported in previous studies [1720] (14 vs. 16%); furthermore, Bejar and colleagues affirmed the somatic origin of this mutation. This indicates that the status of this variant remains controversial. Probably, repeated PCR and sequence analysis of paired somatic and germ line samples should be executed to determine the true nature of this variant. In addition to the germ line analysis of the p.G646WfsX12 mutation, the somatic origin of 2 out of 5 missense variants (R402Q and A1486T) was confirmed by Bejar and colleagues. Missense variants had also been detected in the initial cohorts [18, 20], but had not been included in the calculations as their somatic origin had not been verified; some of them were identified as single nucleotide polymorphisms (SNPs) in subsequent studies [5]. Nevertheless, even when all possible missense mutations are included, then still more than 90% of all ASXL1 mutations are truncating mutations, predicting an ASXL1 protein that lacks its PHD domain in all cases and the RARα-binding domain in most cases. Since it is not clear whether mutant ASXL1 is transcribed and translated into protein, it would be interesting to assess the presence of mutant ASXL1 protein in MDS cells. If the mutated ASXL1 gene is expressed, the function of the mutant proteins should be investigated. Future functional analysis of the truncated proteins could be extended to the examination of the confirmed missense mutation R402Q [5] as well as two in-frame deletion–insertion variants that compromise amino acids 639–660 [20] (Fig. 1a); examination of these mutant proteins would be of particular interest since these variants alter a small region within the SRC1-binding domain without affecting the overall reading frame. The frequent disruption of ASXL1 domains that are implicated in gene activation (the PHD domain, the RARα-binding domain and the SRC1-binding domain) indicates that only the repressive domains (the HP1- and LSD1-binding domains) remain intact. Although the RARα-binding domain plays a role in RARα-dependent repression, this domain is not required for the RARα-independent repressive capacity of ASXL1 displayed in in vitro assays [13]. Therefore, if expressed, the repressive domains might interfere with epigenetic regulation. In summary, ASXL1 mutations could represent different pathogenic events from del(20q) and may result in dominant-negative function or gain of function. The repressive domains of ASXL1 might be important for the mutant function.

Fig. 1
figure 1

Epigenetic regulators that are mutated in MDS. Schematic pictures of proteins, including functional and interacting domains, are shown. a The ASXL1 protein; HP1 HP1-binding domain, LSD1 LSD1-binding domain, SRC1 SRC1-binding domain, RARα RARα-binding domain, PHD PHD domain, asterisk R402Q, black bar in-frame deletions. b The DNMT3A and DNMT3A3 proteins; PWWP PWWP domain, PHD PHD-zinc finger domain, MT methyltransferase domain, asterisk p.Q237X, p.L422fs, p.E599X. c The EZH2 protein; EED EED-binding domain, SUZ12 SUZ12-binding domain; CXXC cysteine-rich domain, SET SET domain. d The IDH1 and IDH2 domains; large (red) and small (pink) domains of the active region are indicated; asterisk IDH1-R132, IDH2-R140 and R172. e The TET2 protein; conserved domains are indicated with hatched bars

The DNMT3A protein

The DNMT3A protein (Fig. 1b) is a DNA methyltransferase that catalyzes de novo methylation of cytosines in genomic DNA (Box 1) [22, 23]. DNA methylation plays an important role in human development, self-renewal of stem cells, DNA repair and cellular differentiation: not only has it been linked to chromosomal integrity by both repression of transposable elements and stabilization of chromosomes, but it is also involved in gene repression and maintenance of active chromatin states, thereby controlling genomic imprinting, X chromosome inactivation and proper gene expression [2226]. These processes are severely deregulated in tumor cells, which exhibit widespread chromosomal loss of DNA methylation and concurrent chromosomal aberrations as well as altered gene expression: tumor suppressors are suppressed whereas oncogenes are activated as a result of gene hypermethylation and gene hypomethylation, respectively [26, 27]. These abnormalities are also implicated in the pathogenesis of MDS and may be elicited, at least in part, by the DNMT3A mutations that have been detected in MDS [6, 28, 29]. Mutations of the DNMT3A gene, located on chromosome 2p23.3, were detected in 6% of MDS patients and exhibited equal distribution amongst the different MDS subtypes [6, 28, 29]. The patient cohorts that were screened for DNMT3A mutations were too small to perform multivariate analysis for clinical outcome. Nevertheless, the presence of DNMT3A mutations correlated with poor clinical outcome in univariate analysis: patients harboring DNMT3A mutations showed increased progression toward AML and inferior overall survival. It will be interesting to assess the effect of treatment with hypomethylating agents (azacitidine and decitabine) on clinical outcome in patients carrying DNMT3A mutations, since acquired mutations of this DNA methyltransferase are expected to affect DNA methylation patterns. At present, it remains elusive whether DNMT3A mutations result in loss of function, dominant-negative function or gain of function. Although DNMT3A mutations in MDS seemed to have no effect on global methylation levels [29], similar DNMT3A mutations detected in AML correlated with either hypermethylation or hypomethylation of distinct genes, and concurrent gene repression and gene activation [30]. This duality might be explained by the observation that methylation by wild-type DNMT3A is required for repression of genes as well as for maintenance of active chromatin states [25]. The mechanism by which DNMT3A mutations result in hypermethylation and hypomethylation needs to be further investigated. Mutations of DNMT3A are predominantly heterozygous and constitute mainly monoallelic events. The observation that levels of mutant transcripts were similar to levels of wild-type transcript [6] suggests that patients carrying DNMT3A mutations may express both wild-type and mutant DNMT3A protein and implies that mutations may result in dominant-negative or gain of function rather than loss of function. Accordingly, overexpression of a DNMT3A gene that harbored a missense mutation promoted proliferation in myeloid 32D cells [31], which would not be observed if mutations confer mere loss of function. Mutations comprise missense mutations (86%) and truncating mutations (14%) [6, 28, 29]. The truncating mutations (n = 3) result in a predicted protein lacking either all known functional domains (p.Q237X) or the PHD-zinc finger and methyltransferase domains (p.L422fs and p.E599X) (Fig. 1b) [6, 29]. Notably, the remaining N-terminal amino acids have been implicated in DNA binding [32]; moreover, the truncated protein p.Q237X resembles isoform DNMT3A3 (166 amino acids; Fig. 1b), which is catalytically inactive and may display a regulatory function by targeting full-length DNMT3A to the chromatin [33, 34]. This isoform is not ubiquitously expressed; instead, it is present in developing gonads, testicular cells and resting fibroblasts. Therefore, the mutant (truncated) DNMT3A protein resembling DNMT3A3 might be regulatory active in cells where DNMT3A3 should not operate. If the missense mutations result in a catalytically inactive protein, they might possess the same function as the truncated proteins. The missense mutations target the methyltransferase domain (Fig. 1b), which binds to the methyl donor and catalyzes the methylation of DNA [23]. The most frequently targeted amino acid (R882, 50% of DNMT3A mutations in MDS) corresponds to DNMT3B-R823. The DNMT3B protein, a paralog of DNMT3A, catalyzes de novo DNA methylation too, and its R823 is mutated in ICF (immunodeficiency, centromeric instability, facial anomalies) syndrome [35]. The R823G mutant is catalytically active, but cannot dissociate from the DNA to search for a new target, which results in decreased activity observed in in vitro assays [35]. Consistent with this observation is the reduced activity of the R882-mutated DNMT3A proteins in in vitro assays [31]. Next to R823G, additional DNMT3B mutations in ICF syndrome have been described, which result in significant hypomethylation of heterochromatic genes in peripheral blood cells [35, 36]. Of note, these mutations comprise mostly missense mutations in the MT domain and some truncating mutations in the N-terminal region [36], thereby showing a strong resemblance to the DNMT3A mutations observed in MDS. In summary, we hypothesize that mutated DNMT3A proteins retain their DNA-binding potential but lose their potential to methylate DNA properly as they are captured onto the DNA. As azacitidine and decitabine have been described to function by capturing DNMT proteins to the DNA, the question arises whether these hypomethylating agents will be effective in cells harbouring DNMT3A mutations.

The EZH2 protein

The EZH2 protein (Fig. 1c) is a histone methyltransferase that catalyzes progressive mono- through trimethylation of H3K27 [37], for which it depends both on its C-terminal SET domain and on its interaction with the EED and SUZ12 proteins (Box 1) [38]. The different methylation levels of H3K27 play an important role in many cellular processes, including hematopoiesis, and H3K27me3 is involved in gene repression [39]. Overexpression of EZH2 has been described in various malignancies, and therefore EZH2 has been labeled as a potential oncogene [38]. In line with this, an activating point mutation in the SET domain of EZH2 (Tyr641) was identified in diverse lymphomas [37]. In contrast, no activating EZH2 mutations were found in MDS. Instead, mutations of EZH2 that result in loss of function were detected in 6% of patients with MDS [5, 4042]. The EZH2 gene is located at chromosome 7q36.1, a region that is frequently affected in MDS by chromosome 7/7q deletions, and occasionally by microdeletions [4, 40, 42]. Microdeletions of chromosome 7q36.1 co-occurred with truncating EZH2 mutations that deleted or disrupted the important SET domain on the remaining allele, indicating that no functional EZH2 protein could be expressed in these MDS cells [40, 42]. In addition to the hemizygous mutations that result in loss of EZH2 at both alleles, biallelic homo- and heterozygous mutations were observed [40, 41]. Mutations comprised truncating and missense mutations that were scattered throughout the coding sequence. All truncating mutations resulted in the disruption or deletion of the SET domain, and most missense mutations targeted the SET domain as well. Analysis of some of these mutants showed loss of H3K27me3 in affected cells [41, 42], thereby confirming the loss of function of mutant EZH2 proteins. Mutations of EZH2 occurred in the context of normal and non-complex karyotypes (mainly del(5q), del(20q) and trisomy 8) [5, 40, 41] and co-occurred with 8% of TET2 mutations, 13% of IDH mutations and 22% of ASXL1 mutations [5], indicating that these proteins function in different epigenetic processes and that their mutations represent complementary events in the pathogenesis of MDS. Until now, the co-occurrence of EZH2 and DNMT3A mutations has not been investigated in large patient cohorts. Most of the mutations that were found in EZH2 occurred in the absence of a chromosome 7/7q deletion [5, 4042]. The presence of EZH2 mutations correlated with adverse prognosis, comparable to the well-known poor prognosis of patients with a chromosome 7/7q deletion [35, 40, 41]. In multivariate analysis, the correlation with poor prognosis was shown to be independent from other known risk factors [5]. This observation is of particular relevance as most EZH2 mutations were detected in lower-risk patients [5, 40]. Since patients with chromosome 7/7q deletions particularly benefit from treatment with azacitidine [11], it might be hypothesized that patients harboring EZH2 mutations might benefit from this treatment as well. The use of EZH2 inhibitors has been suggested for the treatment of malignancies that show increased EZH2 activity [38]. The loss-of-function mutations of EZH2 that occur in MDS argues against the use of these inhibitors in myeloid malignancies.

The IDH1 and IDH2 proteins

The IDH1 protein (Fig. 1d) is located in the cytosol and peroxisomes of eukaryotic cells, whereas its paralog IDH2 (Fig. 1d) is situated in mitochondria [43, 44]. Both proteins convert isocitrate to α-ketoglutarate (α-KG; also known as 2-oxoglutarate (2-OG)) (Fig. 2a), while reducing nicotinamide adenine dinucleotide phosphate (NADP+) to NADPH. These reactions have important functions in the metabolic pathways and in the protection of mammalian cells from oxidative stress-induced tumorigenesis. Mutations in the IDH genes were first detected in brain tumours: the IDH1 gene, located at chromosome 2q33.3, is frequently mutated in grade II–III gliomas and secondary glioblastomas (50–100%), whereas the IDH2 gene, located at chromosome 15q26.1, is only occasionally mutated in these malignancies (0–7%) [45]. In MDS, the IDH genes are mutated in approximately 10% of patients [5, 19, 4649]. The mutations comprise missense mutations, which target either one single codon in the IDH1 gene (c.394-396CGT, encoding amino acid p.R132) or one of two codons in the IDH2 gene (c.418-420CGG, encoding amino acid p.R140, or c.514-516AGG, encoding amino acid p.R172). Notably, the R140 mutations in the IDH2 gene represent the most frequently occurring mutations in MDS (61%), but have not been identified in gliomas and glioblastomas. Although the IDH2-R140, IDH2-R172 and IDH1-R132 residues reside in the active site of the proteins (Fig. 1d) [43], cells carrying IDH mutations did not show impaired levels of α-KG and NADPH [50, 51]. Apparently, these products were generated by the unaffected wild-type protein that could be expressed because mutations of the IDH genes constitute predominantly monoallelic events, leaving one allele intact [5, 19, 4649]. The mutant IDH proteins showed a gain of function as they could convert the α-KG that is generated by wild-type IDH proteins into 2-hydroxyglutarate (2-HG) (Fig. 2a) [52]. This oncometabolite competitively inhibits α-KG-depending enzymes [52, 53], among which are the Jumonji family of histone demethylases (JHDM) [54] and the TET proteins, as will be described in the section on TET2. Since the oncometabolite 2-HG that is formed by mutant IDH proteins inhibits the function of the TET2 protein, mutations of IDH and TET2 are expected to be mutually exclusive. This hypothesis was confirmed as IDH mutations hardly co-occurred with mutations in TET2, whereas they co-occurred with approximately 7% of EZH2 mutations and 11% of ASXL1 mutations [5]. Furthermore, IDH mutations were found in the context of normal karyotype, isolated del(5q), isolated del(20q) and isolated trisomy 8 [5, 55, 56]. At present, the effect of IDH mutations on clinical outcome is not that clear, as different studies in various cohorts of patients showed different results [5, 47, 49]. Prospective studies involving large patient cohorts should be performed to clarify this issue.

Fig. 2
figure 2

Formation and function of the oncometabolite 2-HG. a Wild-type IDH proteins convert isocitrate to α-ketoglutarate (α-KG), which is converted to the oncometabolite 2-hydroxyglutarate (2-HG) by mutant IDH proteins. b 2-Hydroxyglutarate inhibits the conversion of 5-methylcytosine (5mC) into 5-hydroxymethylcytosine (5hmC) by competing with the cofactor α-KG

The TET2 protein

The TET family of proteins was shown to be able to demethylate DNA by converting 5-methylcytosine (5mC) into 5-hydroxymethylcytosine (5hmC) (Box 1) [57]. 5-Hydroxymethylcytosine can be further oxidized into formyl- and carboxylcytosine (5fC and 5cC), which can subsequently be recognized by thymine-DNA glycosylase base-excision repair enzymes and replaced by unmethylated cytosines [58, 59]. The TET proteins bind to the transcriptional start site of actively transcribed genes, indicating that they may act to keep the chromatin of active genes in an open configuration by the demethylation of CpG nucleotides. The conversion of 5mC into 5hmC by TET proteins depends on the cofactors Fe2+ and α-KG (Fig. 2b). This dependence reveals an interesting link between the TET proteins and the IDH proteins, as IDH1 and IDH2 convert isocitrate to α-KG (Fig. 2a), thereby supplying the cofactor for the TET proteins. As described in the previous section, mutations of IDH1 and IDH2 result in a gain of function; instead of catalyzing the conversion of isocitrate into α-KG, the mutant proteins convert α-KG, generated by the wild-type enzyme, into 2-HG (Fig. 2a). The generated 2-HG was shown to inhibit the function of the TET proteins by competitive binding to the α-KG binding site (Fig. 2b) [52, 53]. The TET2 protein (Fig. 1e) was also targeted by mutations in MDS and many different types of myeloid malignancies [5, 60, 61]. In MDS, mutations were found in 20–25% of cases. They appear to be more prevalent in lower-risk patients, but the presence of TET2 mutations does not appear to be an independent prognostic factor. In approximately half of the cases, TET2 mutations were bi-allelic, either by uniparental disomy, deletion or independent mutations in both copies of the gene. The occurrence of nonsense mutations at the very N-terminal part of the coding region in some patients, and deletions that span the entire gene in others, strongly suggested that the mutations result in loss of function. This was confirmed in mouse models, in which inactivation of TET2 in myeloid cells induced hematopoietic malignancies [6264]. In line with the idea that the function of the TET2 protein is inhibited either by the oncometabolite 2-HG or by loss-of-function mutations, is the observation that IDH mutations and TET2 mutations are mutually exclusive in most patients. As loss of function of the TET2 protein results in defective 5hmC formation and inhibition of demethylation of DNA [65], the effect of hypomethylating agents in patients with TET2 mutations might be particularly efficient. Two recent studies appear to indicate that this may be the case [66, 67]. This needs to be further confirmed in larger prospective clinical trials.

Conclusion

Mutations of epigenetic regulators (ASXL1, DNMT3A, EZH2, IDH1 and IDH2, and TET2) occur frequently in myelodysplastic syndromes. The presence of ASXL1 and EZH2 mutations represents an independent factor for poor clinical outcome, and the impact of DNMT3A and IDH mutations on overall survival requires further investigation. The concurrence of several mutant proteins in a MDS clone might indicate that these proteins perform different functions in epigenesis and pathogenesis, which seems to be the case for ASXL1 and EZH2, whereas mutations that are mutually exclusive might indicate that the corresponding proteins depend on each other for proper function, as is the case for TET2 and the IDH proteins. Future functional assays and prospective clinical studies might reveal the impact of mutations on the epigenetic landscape, on the pathogenesis of MDS and on response of patients to (epigenetic) therapy.