Dynamic posttranslational modifications (PTMs) of histone proteins, especially those that occur on their N-terminal unstructured tails, are key to the regulation of chromatin dynamics [1]. These modifications, which include acetylation, methylation, phosphorylation and ubiquitination, serve as hallmarks of the chromatin state and gene expression status [2]. In general, acetylation is associated with gene activation, whereas methylation can be associated with either gene activation or gene silencing, depending on the residues being methylated. For instance, methylation of histone H3 lysine 4 (H3K4), H3K36 and H3K79 is typically linked to active gene expression, whereas di- and trimethylation on H3K9, H3K27, and H4K20 are usually associated with gene silencing [35].

Almost all of the enzymes that deposit and remove methyl groups from lysine residues were originally discovered as histone-modifying enzymes; therefore, they were named histone lysine methyltransferases (HKMTs) and histone lysine demethylases (HKDMs), respectively [5, 6]. As more examples of these enzymes acting on non-histone proteins were uncovered, they were renamed as lysine methyltransferases (KMTs) and lysine demethylases (KDMs) [7], respectively, to reflect their general roles in modifying proteins beyond histones. Like histone lysine methylation, non-histone lysine methylation facilitates interactions with specific readers to execute downstream signaling, and the crosstalk between lysine methylation and other PTMs also impacts the biological functions of the modified proteins. Therefore, the interactions of lysine methylation and other PTMs on non-histone proteins can be seen as extending the “histone code” to a more general “PTM code” that is likely to regulate all eukaryotic proteins. This review will describe the current knowledge of the KMTs and KDMs, including SETD7, G9a, and SMYD2, that have more known non-histone substrates than other enzymes, focusing on their role in the methylation of non-histone proteins. In the second part of this article, we will use p53 and nuclear receptors, especially ERα, as examples to discuss the roles and mechanisms of lysine methylation in regulating non-histone proteins.

The enzymes that act on non-histone substrates

Most known KMTs contain a catalytic SET domain, an evolutionarily conserved domain named after its founding members: Su(var)3-9, a Drosophila position-effect variegation suppressor gene, Enhancer of zeste, and Trithorax [8]. Based on primary amino acid sequence, domain architecture and histone substrate specificity, KMTs are classified into eight distinct subfamilies [9]. A handful of KMTs have been shown to methylate non-histone proteins, among which SETD7, G9a, SMYD2, SET8, and EZH2 have multiple non-histone protein substrates (Table 1). We will briefly describe these enzymes below.

Table 1 Lysine methyltransferases and their histone and non-histone substrates in humans

Setd7

SETD7 (also known as KMT7 or SET7/9) is the only KMT7 family member due to its unique enzymatic activity and protein domain architecture. It was initially identified as a histone H3K4 mono-methyltransferase associated with transcriptional activation [10]. However, later studies revealed that SETD7 knockdown in cells did not alter histone H3K4 methylation, indicating that SETD7 does not play a major role in maintaining histone methylation [11]. Instead, SETD7 preferentially catalyzes the methylation of numerous non-histone proteins [12, 13] (Table 1). Structural and functional analysis revealed the minimal consensus sequence for SETD7 substrate recognition: K/R-S/T-K (K is the target lysine residue) [14]. The identification of this consensus sequence led to the discovery of over 90 putative non-histone substrates of SETD7 in mammalian cells [15]. To date, more than 30 non-histone proteins involved in diverse cellular processes have been confirmed as SETD7 substrates.

SETD7-mediated methylation of transcription factors and epigenetic regulators can lead to either gene activation or repression via modulating protein stability. For example, methylation of the tumor suppressor protein p53 and estrogen receptor α (ERα) by SETD7 stabilizes these proteins and is required for their activation [16, 17]. In contrast, SETD7-mediated methylation of transcription factors such as the RelA/p65 subunit of NF-κB and E2F1, or the DNA methyltransferase 1 (DNMT1) reduces their protein stability and impairs function [1821]. The functional outcome of lysine methylation is also site-specific: SETD7-mediated methylation of different lysine residues within the same protein is linked to distinct biological functions. For example, SETD7 methylates the FoxO transcription factor 3 (FoxO3) at two residues, K270 and K271. Methylation at K270 represses its DNA-binding activity and transactivation [22], whereas K271 methylation promotes its transcriptional activity, but leads to decreased protein stability [23].

SETD7 can also modulate its substrates’ functions without affecting protein stability. For example, SETD7 methylates androgen receptor (AR) and enhances AR-mediated transactivation. This methylation facilitates the inter-domain communication and the recruitment of AR to chromatin, while the overall AR protein level is not affected [24, 25]. Likewise, SETD7-mediated methylation of farnesoid X receptor (FXR) stimulates its transcriptional activation activity without altering protein stability [26].

Methylation of proteins by SETD7 can also influence protein–protein interaction. SETD7 methylates the tumor suppressor retinoblastoma protein (Rb) and regulates Rb-dependent cellular functions through modulating protein–protein interactions. SETD7-mediated methylation of Rb at K873 promotes its interaction with the heterochromatin protein HP1, whereas methylation of K810 impedes Rb binding to cyclin-dependent kinases [27, 28]. Similarly, SETD7 methylates TAF10, a component of the general transcription factor TFIID complex. This modification enhances the interactions between TAF10 and RNA polymerase II during the formation of the transcription pre-initiation complex [29]. In addition, another component of the TFIID complex TAF7 is also a putative target of SETD7 [14], suggesting that non-histone protein methylation may be a general means of transcriptional control that extends beyond the TAF10–RNAPII interaction. Finally, SETD7-mediated protein methylation may also regulate protein–RNA interaction. SETD7 methylates the HIV transactivator Tat protein and this methylation enhances the interaction between Tat and the HIV trans-activating response RNA element [30, 31].

In addition to the direct modification of transcription factors, SETD7 also modifies epigenetic regulators and transcription cofactors such as histone-modifying enzymes. For instance, SETD7 methylates SUV39H1 at two residues (K105 and K123), and this methylation inhibits the H3K9 methyltransferase activity of SUV39H1 leading to heterochromatin relaxation and genome instability [32]. SETD7 methylates poly-ADP-ribosyltransferase 1 (PARP1) and triggers its recruitment to DNA damage sites [33]. It has also been reported that SETD7 methylates the p300/CBP-associated factor (PCAF), but the biological effect of this modification is unknown [34].

So far, we have presented several examples of how SETD7 can directly influence chromatin and transcription, but SETD7 also methylates proteins in the cytoplasm that are important for various cellular responses and pathways. SETD7 methylates Yes-associated protein (YAP), a Hippo pathway transducer to promote its cytoplasmic retention [35], as well as the interferon-induced transmembrane protein 3 (IFITM3) to reduce its antiviral activity [36]. Other non-histone substrates of SETD7 include AKA6, CENPC1, MeCP2, MINT, PPARBP, ZDH8, Cullin 1, IRF1, and TTK, all of which were identified by a peptide array methylation-based analysis [15]; however, the biological functions of these methylation events remain to be determined.

G9a

G9a (also known as euchromatin histone methyltransferase 2 [EHMT2] or KMT1C) and its homolog, G9a-like protein, a.k.a. EHMT1 and KMT1D (GLP), are the main enzymes contributing to H3K9 mono- and dimethylation (H3K9me1/2) at euchromatin for transcriptional repression [3740]. G9a and GLP, which can form both homodimers and G9a/GLP heterodimers, are each required for the in vivo function of the other: G9a cannot compensate for the loss of GLP function and vice versa [41, 42]. In addition to its main target at H3K9, G9a was shown to also methylate H3K27, H3K56, and residues on certain histone H1 variants, including H1.4K26 and H1.2K187 [4346]. G9a-mediated H3K56 methylation serves as a chromatin docking site for PCNA prior to its function in DNA replication [46], and H1.4K26 methylation provides a recognition surface for the chromatin-binding proteins HP1 and L3MBTL1, promoting H1 deposition and retention on chromatin [43, 47].

G9a is ubiquitously expressed in all cell types during development and plays an important role in various biological processes, including mouse embryo development, germ cell development, immune response, and brain function [48]. At the molecular level, G9a is essential for transcriptional repression, gene imprinting, provirus silencing, and DNA methylation [4953]. G9a exerts most of these functions through the methylation of histone H3K9; however, this activity does not seem to be required in the regulation of DNA methylation in embryonic stem cells [42, 49, 51].

Besides methylating histones, G9a can also methylate a number of non-histone proteins (Table 1). The first identified non-histone substrate of G9a was G9a itself [54]. Automethylated G9a recruits HP1 to enhance the negative regulation of gene transcription. The other known non-histone substrates of G9a are mainly transcription factors. Through methylating these transcription factors, G9a suppresses target gene expression in addition to depositing the repressive H3K9me2 mark. G9a was shown to methylate C/EBPβ [55], p53 [56], and the myogenic regulatory factor MyoD [57] to inhibit their transactivation activity. Additionally, G9a also methylates the chromatin-remodeling factor Reptin [58], myocyte enhancer factor 2 (MEF2) [59], and metastatic tumor antigen 1 (MTAl) [60]. The G9a-mediated methylation on MEF2 and MTAl can be removed by LSD1/KMT1A, which is required for the precise control of their transcriptional activity in response to different signals [59, 60].

Given the wide array of G9a substrates, researchers sought to define a G9a methylation consensus sequence using a peptide array. The G9a substrate consensus sequence is composed of an R–K sequence flanked with a hydrophilic amino acid at position -2, a small amino acid at position -1, a hydrophilic amino acid at position +1, and a hydrophobic amino acid at position +2 [48, 61]. The identification of this motif led to the discovery of additional potential non-histone substrates of G9a, such as WIZ, CDYL1, CSB, ACINUS, HDAC1, DNMT1, KLF12 [48], and SIRT1 [62] (Table 1). However, it remains to be determined whether these proteins are bona fide G9a methylation targets in vivo.

G9a was previously found to be a coactivator of nuclear receptors, such as AR, ERα, and the glucocorticoid receptor (GR) as well as the osteoblast-specific transcription factor Runx2 [6366]. The coactivator role of G9a has been shown to be largely methylation independent [67]. However, unpublished data from our group indicate that G9a-mediated methylation of ERα protein contributes to the positive regulation of ERα target gene expression. Overall, G9a functions as either a transcriptional corepressor or coactivator in a context-dependent manner. Future studies will be needed to determine how G9a makes the switch between histone and non-histone substrates during transcriptional control.

SMYD2

SMYD2 (also known as KMT3C) was initially identified as a member of the SMYD family of KMTs that contains a SET domain and a MYND motif. SMYD2 was shown to methylate both H3K36 and H3K4 and act as a transcriptional coactivator [68, 69]. Like SETD7 and G9a, SMYD2 can methylate diverse non-histone proteins in addition to histones (Table 1). The currently known SMYD2 non-histone substrates include p53, Rb, ERα, PARP1 and the chaperone protein HSP90 [7074]. As opposed to the SMYD2-mediated methylation of histone proteins that leads to gene activation, SMYD2-mediated methylation of non-histone proteins is normally associated with an inhibitory effect. For example, SMYD2 represses the transactivation activity of p53 through the methylation of p53K370, which negatively influences the SETD7-mediated p53-activating methylation of p53K372 [70]. SMYD2 inhibits the positive role of Rb in transcription by methylating RbK810 [75] and K860 [71]. In addition, SMYD2 inhibits ERα-mediated transcriptional activation. Recently, our group reported that SMYD2 monomethylates ERα at K266 in the hinge region to prevent ERα from being recruited to chromatin, thus suppressing transcriptional activation of its target genes [72].

Other KMTs that target non-histone substrates

In addition to the three KMTs discussed above, several other KMTs have also been reported to methylate some non-histone substrates (Table 1). The histone H4K20 mono-methyltransferase SETD8 (also known as PR-SET7 or KMT5A) [7678] methylates p53 [79], PCNA [80] and the p53 associated tumor suppressor protein Numb [81]. Enhancer of Zeste Homolog 2, a.k.a. KMT6 (EZH2), the catalytic subunit of the Polycomb-repressive complex 2 (PRC2) that deposits the repressive H3K27me3 mark [82], can also methylate GATA4 [83] and the orphan nuclear receptor RORα [84]. Other examples include: methylation of the HIV Tat protein by the H3K9 methyltransferase ESET [85, 86], HSP70 methylation by the H3K4 methyltransferase SETD1A [87, 88], VEGFR1 and MAP3K2 methylation by the H3K4 methyltransferase SMYD3 [8991], and RelA methylation by both the H3K36 methyltransferase NSD1 [92, 93] and SETD6 [9496].

In addition to the SET domain-containing KMTs, there are other types of methyltransferases that can modulate non-histone proteins (Table 1). These enzymes include the calmodulin lysine N-methyltransferase (CaM KMT) and the methyltransferase-like (METTL) proteins. The calcium-binding messenger CaM is methylated at K115 by a class I, non-SET domain-containing protein methyltransferase, CaM KMT [97]. CaM KMTs and their catalytic activities on CaM are highly conserved across species from human to insect to plant. In addition to the class I methyltransferase signature domain, CaM KMTs possess unique flanking regions at their C- and N-termini that may dock the substrates’ methylation sites into the enzymes’ active sites for trimethylation. The METTL family contains METTL21D, METTL22, and METTL21A. METTL21D trimethylates K315 of VCP/p97, a type II AAA + ATPase. This methylation negatively affects various cellular functions of VCP/p97 including ubiquitin-dependent protein degradation [98100]. METTL22 trimethylates KIN/kin17 at K135, which regulates the association of KIN with chromatin and possibly its functions in DNA repair, DNA replication and mRNA processing [99, 101]. METTL21A appears to act on the HSP70 family of chaperone proteins including HSPA1, HSPA8, and HSPA5 [99, 102]. Trimethylation of HSPA8 leads to a decrease in the interaction of this chaperone with α-synuclein, which likely has implications in the etiology of Parkinson’s disease [102]. Together, these studies implicate protein methylation in regulating a broad range of cellular functions through direct impacts on non-histone substrates.

KDMs that act on non-histone proteins

There are only four KDMs known to demethylate non-histone proteins: LSD1 (a.k.a. KDM1A or AOF2), JMJD2A (a.k.a. KDM4A), FBXL1 (F-box and leucine-rich repeat protein 11, a.k.a. KDM2A), and PHF2 (Table 2). LSD1 antagonizes SETD7-mediated methylation on both histone H3K4 and various non-histone proteins including MYPT1, DNMT1, E2F1, p53, STAT3, and Tat [20, 21, 103107]. LSD1 is also involved in the removal of methylation marks on ERα and AR that are mediated by KMTs other than SETD7 [72, 108111]. Similar to the opposing functions of LSD1 and SETD7, the Jumonji domain-containing H3K9 and H3K36 demethylase JMJD2A [112, 113] appears to pair with G9a in modulating the methylation dynamics of various substrates including WIZ, CDYL1, CSB, and G9a itself [114] (Table 2). Another histone H3K36 KDM, FBXL11, opposes NSD1-mediated methylation on the RelA subunit of NF-κB [93, 115]; and the H3K9 demethylase PHF2 can act on its effector partner ARID5B [116, 117]. With time, it is likely that more non-histone KMT and KDM targets will emerge.

Table 2 Lysine demethylases and their histone and non-histone substrates in humans

Non-histone proteins regulated by lysine methylation

As we discussed above, transcription factors such as p53, NFκB and several nuclear receptors are modified by lysine methylation. Here, we will describe the roles and mechanisms of these methylation events on non-histone proteins using p53 and ERα as examples. Lysine methylation on non-histone proteins is generally involved in the regulation of protein–protein interaction, protein stability, and in some cases, protein localization. Similar to methylation on histones, lysine methylation on non-histone proteins often occurs in regions that are also enriched in many other PTMs. Orchestration of each modification and crosstalk among them lead to distinct functional outcomes under different biological conditions. Thus, similar to their role in creating the “Histone Code” for epigenetic regulation of chromatin, lysine methylation and other modifications on non-histone proteins constitute a more general “PTM code” that is likely to regulate most, if not all, eukaryotic proteins in diverse cellular processes.

p53

p53 is a well-known tumor suppressor that binds to specific DNA sequences and transcriptionally activates target genes that regulate several key cellular processes. These include cell cycle control, apoptosis and DNA repair in response to genotoxic stress [118]. p53 is probably the most studied non-histone protein that is regulated by lysine methylation. Interestingly, all known lysine methylation events on p53 occur in the C-terminal unstructured regulatory region of the protein, which mimics the N-terminal unstructured tail of histone H3 (Fig. 1).

Fig. 1
figure 1

Schematic representation of protein structure and posttranslational modifications on p53 and ERα. PTMs are represented by colored dots that are defined in the figure. Protein domains are abbreviated as follows: p53: TAD Transactivation domain, PRD proline-rich domain, DBD DNA-binding domain, TD tetramerization domain, CRD C-terminal regulatory domain; ERα, AF-1 activation function domain 1, DBD DNA-binding domain, LBD ligand-binding domain, AF-2 activation function domain 2; H3, HFD histone fold domain. The first and last amino acids of selected domains are indicated. The protein primary sequence and PTMs of the p53 C-terminal CRD domain, ERα DBD and hinge regions, as well as the histone H3 N-terminal tail, included for comparison, are shown in the zoomed-in boxes. The KMTs catalyzing these lysine methylation events are listed. Residues that are demethylated by LSD1 are also shown

Both the positions of the methylated lysine residues and the methylation states (e.g., mono or dimethylation) influence p53 functions (Fig. 1). For example, monomethylation of p53K370 (p53K370me1) by SMYD2 is linked to transcriptional inhibition [70]; whereas its dimethylation, by a yet unknown enzyme, is linked to p53 activation [105]. LSD1 demethylates p53K370me2 to p53K370me1 and thus can switch p53 from an active state to an inactive form [105]. Other KMTs that act on p53 are SETD7 and G9a/GLP. SETD7 monomethylates p53K372 whereas G9a/GLP dimethylates p53K373 [16, 56]. Like their contrasting roles in controlling transcription by methylating histone H3K4 to activate transcription or H3K9 to repress transcription, SETD7- and G9a/GLP-mediated methylation of p53 are associated with target gene activation and repression, respectively [16, 56]. Likewise, the H4K20 methyltransferase SETD8 monomethylates p53 at K382, which suppresses p53′s functions in checkpoint activation and apoptosis [79]. In addition, p53K386 can also be mono- and dimethylated, by enzymes that are yet to be identified [119].

As the methylated lysine residues of p53 are in close proximity, it is not surprising that modification on one residue can influence modifications on other residues. For example, the active mark of p53K372me1 deposited by SETD7 negatively impacts the repressive marks of SMYD2-dependent p53K370me1 [70] and the G9a-mediated p53K373me2 [56] (Fig. 2a, b). In addition to methylation, the lysine residues within the C-terminus of p53 are also modified by acetylation, ubiquitylation and sumoylation [120, 121] (Fig. 1). Crosstalk among these PTMs provides a molecular mechanism that mediates rapid switching of p53 function upon various stimuli such as DNA damage response. The role of p53 in DNA damage response is mediated by modification of p53K382, where methylation and acetylation compete with each other to inhibit or promote p53′s role in DNA damage response. SETD8-mediated p53K382me1 inhibits DNA damage-induced acetylation at the same residue (p53K382ac) thus impairing p53-mediated DNA damage responses [79] (Fig. 2c). In contrast, the dual methylation of p53K370me2 and p53K382me2 promotes p53 DNA damage responses by inhibiting MDM2-mediated p53 protein ubiquitination and degradation [122].

Fig. 2
figure 2

Crosstalk and functional outcomes of p53 lysine methylation. a SET7-catalyzed monomethylation of p53K372 (K372me1) (activating mark) competes with G9a-catalyzed dimethylation of K373 (K373me2) (repressive mark) to promote the transcriptional activity of p53. b SET7-catalyzed K372me1 (activating mark) competes with SMYD2-catalyzed K370me1 (repressive mark) to promote the transcriptional activity of p53. LSD1 demethylates p53K370me1. c SET8-catalyzed K382me1 (repressive mark) competes with p300-catalyzed K382 acetylation (K382ac) (activating mark) to repress the transcription activity of p53. d L3MBTL1 recognizes p53K382me1 to repress p53 target gene expression. Upon DNA damage, 53BP1 recognizes the dual marks of p53K370me2 and p53K382me2 to act as a transcriptional coactivator of p53. e Dual methylation of p53 K370me2 and K382me2 recruits the PHF20 dimer to inhibit MDM2-mediated p53 ubiquitylation and degradation, thus promoting p53 protein stability and transcription activity

Like the methylation marks on histones, methylation on p53 can be recognized by reader proteins. The chromatin compaction factor L3MBTL1 recognizes p53K382me1 through its triple malignant brain tumor (MBT) repeats [123]. The interaction between L3MBTL1 and p53 brings L3MBTL1 to promoters to repress the transcription of p53 target genes such as p21 and PUMA [123] (Fig. 2d). Upon DNA damage, monomethylated p53K370 and K382 are dimethylated by currently unknown enzymes. These dimethylation marks are recognized by the tandem Tudor domain of p53 Binding Protein 1 (53BP1) that acts as a transcriptional coactivator of p53 to promote p53 target gene expression during the DNA damage response [105, 119, 124] (Fig. 2d). In addition to 53BP1, p53K370me2 and p53K382me2 are also recognized by the second Tudor domain of PHF20. The two Tudor domains of PHF20 dimers each bind p53K370me2 or p53K382me2 simultaneously, and thus stabilizing and activating p53 during DNA damage response [122] (Fig. 2e).

Although a number of PTMs, including lysine methylation, have been identified within the p53 C-terminal regulatory region, these PTMs may only fine tune the functions of p53 and some modifications are likely functionally redundant. This could explain why the loss of p53K372 methylation in SETD7 knockout mice has no effect on p53-dependent cellular functions such as cell cycle arrest and apoptosis in response to radiation, genotoxic agents, and oncogenic agents [125, 126]. Future research should carefully combine in vitro and in vivo approaches to investigate not only the methylation events, but also their interplay with other PTMs and regulators of p53.

ERα and other nuclear receptors

ERα is a member of the nuclear hormone receptor family that controls cellular responses to estrogens [127]. ERα shares a conserved protein structure with other NRs, including an N-terminal regulatory domain, known as activation function domain 1; a central DNA-binding domain (DBD); a C-terminal ligand-binding domain, known as activation function 2; and two flexible domains: a hinge region that connects the DBD with the ligand-binding domain, and a regulatory domain at the C-terminus (Fig. 1). The binding of hormone ligands is an essential step in the activation of ERα and other NRs that leads to a sequence of events including conformational changes, protein dimerization, translocation from the cytoplasm to the nucleus, and binding to specific DNA sequences known as estrogen response elements [127]. The ligand-induced allosteric changes of ERα alter its interactions with different coregulators that are essential for ER-mediated transcription. Transcriptional coregulators, namely coactivators and corepressors, regulate the activity of DNA-bound transcription factors [128, 129]. One of the main functions of ERα coregulators is to modulate histone modifications, thus controlling the accessibility of the underlying DNA to the transcriptional machinery [128, 129].

Like histones, the ERα proteins are subjected to a number of PTMs including phosphorylation, methylation, acetylation, ubiquitination, and sumoylation. Except for phosphorylation, these modifications are enriched in the DBD domain (aa 181-263) and hinge regions (aa 264-302) (Fig. 1), and are believed to play important roles in regulating ERα protein stability, estrogen sensitivity, subcellular localization and DNA-binding affinity [130132]. All five lysine residues (K266, K268, K299, K302, and K303) at the border between the DBD and the hinge region are reported to be acetylated by p300/CBP [133, 134]. Depending on the target sites, lysine acetylation can lead to opposing outcomes: acetylation of K299, K302 and K303 inhibits ERα target gene expression, whereas acetylation of K266 and K268 promotes ERα target gene expression by enhancing its DNA binding.

To date, there are only two reported lysine methylation events of ERα that regulate ERα, each with a distinct outcome. In the first, SET7 monomethylates ERα at K302, which stabilizes ERα and promotes its transactivation activity [17]. This modification is negatively regulated by the K303R mutation [17] (Fig. 3a), which has been associated with breast cancer [135137]. The second methylation event was recently reported by our group. SMYD2 monomethylates ERα at K266, which prevents acetylation of the same residue, thus keeping ERα in a repressed or inactive status [72]. Upon estrogen activation, the repressive K266me1 mark is removed by LSD1, which then allows p300 to acetylate this residue to activate the ERα transcriptional response (Fig. 3b). In our own research, we have identified a third ERα methylation event. Through in vitro screen, we found that G9a can methylate ERα (unpublished results). The G9a-mediated ERα methylation is recognized by the Tudor domain of PHF20, which recruits the MOF histone acetyltransferase complex to deposit histone acetylation, thus coordinating the in trans crosstalk between ERα and chromatin modifications (Fig. 3c).

Fig. 3
figure 3

Crosstalk and functional outcomes of ERα lysine methylation. a SET7-catalyzed ERαK302me1 (activating mark) promotes ERα protein stability and transcriptional activity. The breast cancer-associated K303R mutation inhibits ERαK302me1. b SMYD2-catalyzed K266me1 (repressive mark) competes with the p300-catalyzed K266ac (activating mark) to repress the transcription activity of ERα. LSD1 demethylates K266me1. c G9a-mediated ERα methylation of ERα recruits PHF20 to promote the transcriptional activity of ERα. d EZH2-mediated RORα K38 methylation is recognized DCAF1, which facilitates the CUL4–DDB1-dependent ubiquitination and protein degradation, resulting in a loss of transcriptional activation of RORα

In addition to ERα, a few other nuclear receptors, such as AR, retinoic acid receptor (RAR), and RAR-related orphan receptor (ROR), are also regulated by lysine methylation. AR is methylated on two residues, K630 and K632, by SETD7 [24, 25]. As is the case for methylation of ERα by SETD7, these methyl marks on AR enhance its transcriptional activity. RARα is trimethylated at K347, which facilitates the interaction between RARα and its modulators including p300/CBP, receptor-interacting protein 140, and retinoid X receptor (RXR) [138]. Monomethylation of RARα at K109 also modulates its ligand-dependent activation and interaction with coregulators, probably through coordinating the synergy between the receptor DBD and the ligand-binding domain (LBD) [139]. Nevertheless, the enzymes responsible for K347 and K109 methylation have yet to be unveiled. RORα is another nuclear receptor that undergoes lysine methylation. The H3K27-specific methyltransferase EZH2 monomethylates RORα at K38 [84]. This modification is recognized by the DDB1 and CUL4-associated factor 1 (DCAF1) adaptor, resulting in ubiquitination-dependent degradation and a loss of transcriptional activation of RORα [84] (Fig. 3d).

Other non-histone proteins modulated by lysine methylation

In addition to those already discussed, several other proteins are modulated by lysine methylation. These include the NFκB subunit, RelA/p65, Rb, DNMT and HSP proteins (Table 1). The NFκB subunit, RelA/p65, is a component of the canonical NFκB signaling pathway that is important for immune responses. Six lysine residues in RelA have been reported to be methylated by three KMTs: K37, K314, and K315 are monomethylated by SETD7 [18, 19]; K218 and K221 are mono- and dimethylated by NSD1 [93]; and K310 is monomethylated by SETD6 [95, 96]. In response to either TNFα or IL-1β stimulation, RelA is methylated at K37 by SETD7, which promotes NFκB promoter binding and target gene expression [18]. In contrast, SETD7-mediated methylation of K314 and K315 triggers proteasomal degradation of NFκB [19]. Methylation of K314 and K315 is impaired by the acetylation of K310, leading to prolonged protein stability and enhanced RelA transcriptional activity [140]. Monomethylation of RelA at K310 by SETD6 recruits G9a/GLP, thus attenuating NFκB-induced transcriptional responses by depositing the histone H3K9 methylation mark [95]. In contrast, methylation of RelA at K218 and K221 by NSD1 is associated with NFκB activation [93]. Interestingly, NFκB signaling drives the expression of FBXL1/KDM2B to remove these methylation marks, thus forming a negative regulatory feedback loop [93].

Retinoblastoma protein was the first tumor suppressor found to function in inhibiting abnormal cell cycle progression and excessive cell growth. SETD7 monomethylates Rb at K873, which recruits the heterochromatin protein HP1, to facilitate Rb-dependent transcriptional repression, cell cycle arrest and cellular differentiation [27]. Similarly, the K860 monomethylation mark deposited on Rb by SMYD2 interacts with the transcriptional repressor L3MBTL1 to repress the expression of cell cycle genes in response to anti-proliferative signals [71]. Both SETD7 and SMYD2 monomethylate Rb on another lysine residue, K810 [28, 75]. Methylation of K810 by SETD7 impedes the binding of cyclin-dependent kinase (CDK) thus preventing Rb phosphorylation and maintaining Rb in the hypophosphorylated, growth-suppressing state [28]. In contrast, SMYD2-mediated methylation on Rb K810 enhances S807/S811 phosphorylation to accelerate E2F transcriptional activity and to promote cell cycle progression [75]. It is unclear how a single modification can lead to such divergent outcomes. One possibility is that SETD7 and SMYD2 may also modify other proteins involved in these processes, leading to activation or repression of the Rb-associated kinases. Another possibility is that SETD7 and SMYD2 may recruit other proteins that sterically block or open up one or more phosphorylation sites, which may be independent of their enzymatic activity.

DNA methyltransferase 1 is the only DNA methyltransferase responsible for the maintenance of CpG methylation. The discovery that DNMT1 itself is methylated bridges two important epigenetic regulatory mechanisms: histone methylation and DNA methylation. DNMT1 is monomethylated by SETD7 at K142 and is believed to increase DNMT1 protein turnover [20]. This methyl mark can be removed by LSD1, which protects DNMT1 from proteasomal degradation [141143]. However, DNMT1K142me1 can also be recognized by the Tudor domain of PHF20L1, which blocks the ubiquitination of DNMT1 and inhibits its degradation [144], thus making it less clear under what conditions K142me1 promotes or inhibits DNMT1 turnover. Regardless, DNMT1 can also be dimethylated by G9a at K70 in vitro, but any biological consequence of this modification has yet to be determined [48]. The de novo DNA methyltransferase DNMT3A is also methylated by G9a. G9a-mediated dimethylation of DNMT3A on K47 is recognized by the chromodomain of the methyl-H3K9-binding protein MPP8, forming a DNMT3A-MPP8-G9a silencing complex on chromatin [145].

HSPs function as molecular chaperones in various cellular processes. So far, HSP70, HSP90, and HSP90AB1 have all been shown to be substrates of KMTs. HSP70 is dimethylated at K561 by SETD1A [87], which promotes cell growth by enhancing HSP70 nuclear translocation and interaction with Aurora kinase B [88]. HSP70 isoforms such as HSPA1, HSPA8, and HSPA5 can be methylated by METTL21A to modulate protein–protein interactions [99, 102]. SMYD2 monomethylates HSP90 on K209 and K615 [73] and HSP90AB1 on K531 and K574 [146]. HSP90 K615 methylation promotes the formation of an HSP90-involved protein complex important for myocyte function [147, 148]. SMYD2-mediated HSP90AB1 methylation facilitates chaperone complex formation [146].

Perspectives

As we have discussed above, the substrates of KMTs and KDMs include not only histones, but also tumor suppressors, nuclear receptors, transcription factors, epigenetic regulators and chaperone proteins. Compared with the plethora of information about histone methylation, our understanding of non-histone protein lysine methylation is very limited. We do know that lysine methylation regulates protein function through two main mechanisms: interplay with other PTMs and effects on protein–protein interactions.

Similar to its role in the epigenetic regulation of histones, lysine methylation is associated with both positive and negative functions of the modified proteins in a site- and state-specific manner. Several groups have attempted to analyze and catalog protein lysine methylation at the proteome-wide level using different approaches: array-based enzymatic assays or mass spectrometry coupled with affinity or chemical enrichment (see review in [149]). With the increasing number of non-histone lysine methylated proteins being identified, it is likely that lysine methylation will be found to be a ubiquitous PTM that modulates proteins involved in all cellular processes. However, challenges still remain. For those interactions that have been identified in vitro, it is important to validate their occurrence in vivo. Further, it is imperative to determine the biological significance of these modifications. Future studies are needed for better understanding of the writers, erasers and readers of lysine methylation the signaling pathways that control protein lysine methylation and their dysregulation in diseases.