Introduction

In eukaryotes, gene expression is a tightly controlled process that involves many processes throughout the cell, which include transcription of mRNA, its processing in the nucleus, and translation of the exported mRNA in the cytoplasm. In past decades, researchers have made extensive efforts to unveil how the complicated steps of gene expression are coordinated and which steps are associated with which cellular processes [1]. In this regard, one of the most significant coordinators is RNApII, which primarily transcribes the protein-coding regions of genes. It also facilitates mRNA processing during transcription and further connects transcription with a variety of proteins that are associated with modifying histones or implicated in repairing DNA damage [2,3,4,5].

Rpb1 is the largest subunit of RNApII. It contains an unusual repetitive C-terminal domain (CTD) that consists of tandem heptapeptide repeats with the consensus sequence Y1-S2-P3-T4-S5-P6-S7, varying in number from 52 in humans to 26 in yeast [3]. Significantly, each CTD residue can be chemically modified by several CTD modifiers, allowing the repeats to serve as binding platforms to recruit and assemble proteins for the different stages of transcription [2, 5,6,7].

In Saccharomyces cerevisiae, four different cyclin-dependent kinases targeting the RNApII CTD for phosphorylation have been identified, namely, Kin28, Srb10, Bur1, and Ctk1 [3, 7]. Both Kin28 and Srb10 catalyze the phosphorylation of the fifth-position serine (Ser5) of the RNApII CTD during transcription initiation, but they have opposite effects; Kin28 phosphorylates the CTD after the formation of the RNApII initiation complex on the promoter and then promotes transcription, while Srb10 phosphorylates the CTD prior to initiation complex formation to inhibit transcription [8]. Srb10 also targets the subunits of the transcription factor IID complex in the transcriptional machinery [9] or Gal4 for transcription activation [10, 11]. The other two kinases, Bur1 and Ctk1, phosphorylate CTD Ser2 and belong to distinct complexes, Bur1/Bur2 and CTDK-I, respectively [12, 13]. Although both Bur1 and Ctk1 target the CTD Ser2, they regulate transcription elongation in different ways [14, 15]. Bur1 preferentially phosphorylates Ser2 at a low level near the 5′ end of the gene during early elongation, thereby enhancing the rest of the Ser2 phosphorylation, most of which is performed by Ctk1. Ser2 increases towards the 3′ ends of genes, then decreases beyond the poly(A) sites. This phosphorylation plays an essential role in recruiting 3′-end processing factors [3, 14].

Recent structural studies showed that the formation of the elongation complex is promoted by Bur1/Bur2 [16] or its mammalian homolog, the positive transcription elongation factor b (P-TEFb/Cdk9) [17]. In this context, Bur1/P-TEFb phosphorylates the RNApII linker to the CTD, which is bound by the elongation factor SPT6. Additionally, besides targeting the CTD of the RNApII, yeast Bur1 phosphorylates the Spt5, a subunit of the elongation factor Spt4/5 complex (also termed DRB sensitivity-inducing factor, DSIF), both in vivo and in vitro to stimulate recruitment of the PAF complex elongation factor [18].

In general, CTDK-I is considered the major kinase complex targeting CTD Ser2 during transcription elongation by RNApII in budding yeast [14]. It consists of three subunits: the cyclin-dependent kinase Ctk1 (homologous to human CDK12), the cyclin Ctk2 (homologous to human cyclin K) and the yeast-specific subunit Ctk3, a co-cyclin factor (Fig. 1a) [13, 19, 20]. The subunits of Ctk1, Ctk2, and Ctk3 in budding yeast are homologous to those of Lsk1, Lsc1, and Lsg1 in fission yeast [21, 22]. In addition, sequence alignments throughout eukaryotes, together with a previous report, show that Ctk1 and Ctk2 are highly conserved from yeast to human [23] (Fig. 1b, c). The Ctk3 subunit in budding yeast has a low level of homology with the Lsg1 subunit in fission yeast and lacks homologs with other higher eukaryotes (Fig. 1d). A previous report identifies Lsg1 as a functional homolog of Ctk3 [22].

Fig. 1
figure 1

The cyclin-dependent kinase complex is evolutionarily conserved. a The cyclin-dependent kinase complexes that are responsible for the phosphorylation of the RNApII CTD Ser2 are shown in several eukaryotes. Colors indicate homology among species; light red, gray, and light brown colors indicate the kinase, the cyclin, and the co-cyclin factors in each species, respectively. bd Multiple sequence alignments of the CTDK-I homologs. Sequence alignments were generated using Clustal Omega from the protein sequences of kinase, cyclin, and co-cyclin factor from S. cerevisiae, S. pombe, C. elegans, Drosophila and H. sapiens. The numbers in the sequences indicate the positions of amino acid residues. Amino acids are colored according to the degree of conservation and physicochemical properties; blue, red, magenta, green, pink, orange, yellow, cyan and white indicate hydrophobic, positively charged, negatively charged, polar, cysteine, glycine, proline, aromatic and unconserved residues, respectively [77]. The symbols at the top are as follows: (*) indicates identity between residues, (:) conservation and (.) less conservation. b and c show only the most conserved sequences of Ctk1/Ctk2 and their homologs, whereas d shows the whole sequence of Ctk3 and its homologs. Note that the level of homology between Ctk3 in S. cerevisiae and Lsg1 in S. pombe is not very high, but they are functionally conserved

The genes encoding yeast CTDK-I subunits are all not essential for viability, but each disruption causes a cold-sensitive phenotype and abolishment of the RNApII CTD Ser2 phosphorylation [13, 20, 24]. Notably, the CTD kinase activity of Ctk1 is dependent on its association with the stable heterodimer of Ctk2 and Ctk3, showing that CTD Ser2 phosphorylation requires an intact CTDK-I complex in cells [24]. Similarly, the three subunits of the CTDK-I complex in fission yeast interact with each other to perform their activity [25]. Consistent with those associations, in higher eukaryotes, the CTD Ser2 kinases always have a stable association with their cyclin partners [19, 26, 27] (Fig. 1a).

By phosphorylating the RNApII CTD at Ser2, the CTDK-I complex plays a pivotal role in coupling transcription with the 3′-end processing of nascent mRNA transcripts by recruiting 3′-end processing factors or in regulating different steps in transcription by recruiting different kinds of histone-modifying enzymes [14, 28,29,30,31,32]. However, recent studies on the CTDK-I complex have revealed a few additional pathways that take advantage of its localizations in both the nucleus and cytoplasm [33] (see Fig. 2). Here, we highlight these multiple aspects of the CTDK-I complex in addition to its conventional role in RNApII transcription to emphasize its widespread roles across the cell. Recent findings concerning the role of the mammalian CTDK-I in ovarian and prostate cancers will be briefly discussed.

Fig. 2
figure 2

CTDK-I functions throughout the cell. The CTDK-I kinase contributes to various cellular processes throughout the cells via its widespread localization. (1) Yeast CTDK-I associates with RNApI in the nucleolus, where the kinase is necessary for efficient 35S rDNA transcription. (2) In the nucleoplasm, CTDK-I functions as a primary kinase for Ser2 phosphorylation of the RNApII CTD to facilitate the transcription elongation of protein-coding genes, including genes for DNA damage repair response. In human ovarian and prostate cancers, inactivation of CTDK-I is associated with a particular genome instability pattern consisting of a high number of TDs. (4) In the cytoplasm, yeast CTDK-I is important for the formation of the intact 80S ribosomal complex during translation initiation and plays a vital role in enhancing accurate translational decoding fidelity during translational elongation by phosphorylating the ribosomal protein Rps2. Human CTDK-I phosphorylates 4E-BP1 to regulate mTORC1-dependent translation and mitotic genome stability

The function of CTDK-I in genome stability

The repair process of damaged DNA often occurs in DNA regions adjacent to the places where transcription proceeds. Indeed, RNApII contributes extensively to DNA repair and stress responses by playing a significant role in identifying and signaling the occurrence of DNA damage to other cellular machinery [4, 5]. Importantly, the forms of RNApII that are correctly modified at the CTD are regarded as crucial during the DNA damage repair response; the evidence shows that deletion of yeast CTK1 leads to high sensitivity to certain DNA-damaging agents, such as doxorubicin (DOX), hydroxyurea (HU), methyl methanesulfonate (MMS) and 4-nitroquinoline oxide, or to UV irradiation [34,35,36,37]. In CycK-depleted human HeLa cells, sensitivity to a variety of DNA-damaging agents such as mitomycin C, etoposide, or camptothecin is reported [27, 38].

Among many different types of DNA damage, DNA double-strand breaks (DSBs) are regarded as the type of lesion most dangerous to cells. The deleterious DSB is preferentially repaired by homologous recombination (HR), which requires an intact homolog or a sister chromatid to serve as a template for the DNA repair [39]. By employing a gene deletion study of diploid cells, a subset of conserved genes, including CTK1, were identified as needed for resistance to the DNA-damaging agent DOX [37]. The ctk1Δ/ctk1Δ diploid cells cause a severely impaired G1/S transition and an enhanced gene conversion frequency when exposed to DOX, suggesting a role of Ctk1 in HR-mediated DNA damage repair. In parallel, spontaneous mitotic recombination is believed to be strictly dependent on Ctk1 because, in ctk1Δ/ctk1Δ diploid cells, the rate of loss of heterozygosity is dramatically reduced at several auxotrophic loci, such as CAN1, LEU1, TRP5 and URA3 loci [34]. The intimate connection between the CTDK-I complex and HR-dependent DNA repair is further supported by a global genetic analysis; several single-deletion mutations in the Rad52 epistasis group, which are essential for DSB repair by mitotic HR, are synthetically lethal when combined with ctk1Δ, ctk2Δ or ctk3Δ [34, 40, 41]. In addition, each deletion of any CTDK-I subunit is synthetically lethal when combined with the deletion of many genes involved in DNA integrity [34, 40]. Thus, these reports support an indispensable role of the CTDK-I complex in maintaining genome integrity by facilitating HR in response to DSB.

CDK12, the metazoan orthologue of yeast Ctk1 in C. elegans [26], Drosophila [19] and human [27] (see Fig. 1), is critical in maintaining the genomic stability by regulating the expression of the functional DNA damage response genes, especially in the HR-induced DNA lesion repair [27, 42]. Genome-wide expression studies show that CDK12 globally suppresses intronic polyadenylation events in mouse embryonic stem cells, enabling the production of full-length HR gene products [43]. In CDK12-deficient cells, only a small set of the assayed human genes are altered (2.67%), of which the downregulated group includes genes encoding critical regulators of genomic stability, including BRCA1, ATR, FANCI, and FANCD2. Consequently, the knockdown of CDK12 or cyclin K sensitizes cells to a variety of DNA damage agents and increased the spontaneous DNA damage [27]. Sharp et al. further show that CDK12 loss enriches the intronic polyadenylation sites events at the expense of distal sites, which leads to a cumulative defect by downregulating multiple gene products and thus causes the enhanced sensitivity of HR gene expression to CDK12 loss [43].

It is particularly notable that inactivation of the CDK12 gene in ovarian and prostate cancers has been associated with a particular genome instability pattern consisting of a high number of tandem duplications (TDs) up to 10 Mb in size [44, 45]. TDs were quasi-randomly distributed along the genomes and affected more than 10% of the genomic content. The genomic instability caused by the inactivation of CDK12 gene, which was referred to as ‘the CDK12 TD-plus phenotype’, was observed in approximately 4% of serous ovarian carcinomas and 1–2% of prostate adenocarcinomas. Significantly, analysis of the TDs in the CDK12-mutated context gave no preference for TD localization and no presence of homologous fragments around the breakpoints, excluding any possibility of a recombination mechanism, such as misalignment of the sister chromatids or unbalanced translocations. To address the connection of CDK12 with HR, they examined the number of large-scale state transitions (LSTs), which are defined as chromosomal breaks between adjacent regions of at least 10 Mb and are considered a genomic HR signature in basal-like breast carcinomas [46]. They found that CDK12-inactivated tumors resemble the group of non-BRCA1/2 tumors and mostly display fewer LSTs than the minimal level of BRCA1/2-inactivated ovarian cancers, suggesting that genomic instability associated with CDK12 inactivation does not display substantial genomic HR deficiency [44]. Similarly, another recent study using integrative genomic analysis of metastatic castration-resistant prostate cancers found that CDK12 loss was associated with genomic instability and focal TDs, leading to highly recurrent gains of genes involved in the cell cycle and DNA replication [45]. Importantly, they did not observe any DNA mutational signatures typically associated with defects in HR, nor was the expression of BRCA1 or BRCA2 affected by CDK12 mutation. So far, it is unclear whether the CDK12 TD-plus phenotype results from an indirect effect of CDK12 mutation disrupting RNApII transcriptional regulation or is instead a direct result of aberrant cellular processes, e.g., a defect in DNA re-replication during S-phase.

The function of CTDK-I in the synthesis of rRNA and implications in maintaining rDNA integrity

In budding yeast, the rDNA region encoding the ribosomal RNAs (rRNAs) is organized as tandem arrays of a 9.1-kb-long unit that repeats 100–200 times on chromosome XII [47]. This rDNA unit consists of a 5S rRNA gene that is transcribed by RNA polymerase III (RNApIII) and a long precursor 35S rRNA gene that is transcribed by RNA polymerase I (RNApI). These two genes are separated by two nontranscribed spacers, namely, NTS1 and NTS2 [48]. Intriguingly, CTDK-I physically interacts with RNApI in vivo; its kinase subunit Ctk1 can physically interact with RNApI in vitro as well [49]. The copurification of Ctk1 with the RNApI–Rrn3 complex, which is the competent form of transcription initiation after assembling onto the 35S rDNA promoter, further supports the necessary association of CTDK-I with rRNA-transcribing RNApI [50, 51]. Loss of either the Ctk1 kinase or the Ctk2 cyclin subunit leads to reduced recruitment of RNApI to the 35S promoter region [49]. Indeed, the synthesis of 35S pre-rRNA, as well as the production of each 25S, 18S or 5.8S rRNA, is severely impaired by ctk1Δ or ctk3Δ and by a kinase-dead mutation of Ctk1 [49, 52]. Phosphorylated RNApI is a prerequisite to forming a stable RNApI–Rrn3 complex for efficient transcription initiation [53]. Although the kinase that is responsible for RNApI phosphorylation is not known, the previous findings, together with the fact that Ctk1 colocalizes in the nucleoplasm and nucleolus [49, 54], strongly suggest the importance of Ctk1 kinase for the synthesis of functional rRNAs.

The rDNA region is very unstable and prone to inducing HR primarily due to its tandemly repeated structure. Especially in the event of a DSB, the instability of the rDNA array may cause unequal sister chromatid exchange and lead to contraction or expansion of the rDNA repeats [48, 55, 56]. It is notable that the loss of Ctk1 results in contraction of the rDNA tandem array, a phenotype that is reversed by the reintroduction of CTK1 [52]. However, little is known about the molecular mechanism by which Ctk1 contributes to rDNA integrity. At the rDNA loci, Fob1, a protein that binds to the replication fork barrier within the NTS1 region of the rDNA repeat and induces HR in response to DSB, is required for rDNA expansion [52, 57, 58]. However, it appears unlikely that Ctk1 shares the pathway with Fob1 to regulate rDNA integrity because the rDNA contraction caused by ctk1Δ is not concomitant with the formation of Fob1-dependent extrachromosomal rDNA circles (ERC). Additionally, the loss of Ctk1 impairs RNApI transcription without affecting Sir2-dependent silencing at rDNA loci [52]. Thus, considering that Sir2 is critical in inhibiting ERC formation [59], it appears that Ctk1 affects rDNA integrity in a distinct manner from Fob1 and Sir2.

The function of CTDK-I in translation

Translation is one of the essential cellular processes during gene expression, where ribosomes produce polypeptides with the assistance of multiple factors in the cytoplasm. Several lines of evidence suggest that, in addition to the primary location and role of CTDK-I in the nucleus [20], the kinase translocates to the cytoplasm and even affects translation by interacting with ribosomal subunits, monosomes and polysomes [33, 60, 61].

In eukaryotes, translation initiation is an intricate process that involves the formation of the translation-competent 80S ribosome by the assembly of the 40S and 60S ribosomal subunits on the start codons of mRNAs with the help of ribosome-bound eukaryotic initiation factors (eIFs) [62]. Briefly, before joining with 60S ribosomal subunits, the 43S preinitiation complex (PIC) is formed from the 40S ribosomal subunits associated with the eIF2-GTP-Met-tRNAMet ternary complex (eIF2 TC) and a subset of initiation factors, such as eIF1, eIF1A, and eIF3. Subsequently, the 43S PIC attaches to the 5′-end of mRNA, associating with the eIF4F complex composed of eIF4E, eIF4G, and eIF4A to generate the 48S PIC. The 48S complex scans mRNA from 5′ to 3′ to recognize the AUG start codon, which triggers the hydrolysis of eIF2-bound GTP and phosphate release. Finally, a 60S ribosomal subunit joins the 48S initiation complex to form an elongation-competent 80S initiation complex (see reviews [62, 63]).

Notably, the phosphorylation of the α subunit of eIF2 (eIF2α) at Ser51 by protein kinase Gcn2 reduces eIF2 TC formation and blocks protein synthesis under various stressful circumstances, such as amino acid deprivation or treatment with chlorpromazine or peroxide [64,65,66]. Recently, Sträßer et al. reported that the yeast Ctk1 kinase is necessary for full translation initiation activity [60]. They found that the deletion of CTK1 increases the phosphorylated form of eIF2α, leading to impaired translation initiation and defective 80S complex formation. Nevertheless, it remains obscure how Ctk1 affects translation initiation activity. The approach of stable isotope labeling with amino acids in cell culture (SILAC) showed that the depletion of endogenous Ctk1 decreased the phosphorylation levels of several proteins that are involved in ribosome biogenesis and/or translation in vivo, but none of the proteins were found to be associated with Ctk1 in vitro [60].

Several studies further imply a role of Ctk1 in the elongation step of translation to maintain high decoding fidelity during protein synthesis. Loss of Ctk1 causes hypersensitivity against drugs such as paromomycin, hygromycin B and geneticin, which prevent aminoacyl-tRNA dissociation from the ribosomal A site [67,68,69]. Moreover, Röther and Sträßer previously demonstrated that Ctk1 is necessary for a high rate of decoding fidelity in translation elongation; depletion of Ctk1 impairs the explicit incorporation of amino acids and increases miscoding frequency [61]. In this maintenance step of high decoding fidelity, Ctk1 phosphorylates the Ser238 residue of Rps2, a protein component of the small 40S subunit, and tightly controls translational accuracy to enhance translational decoding fidelity [61, 70,71,72].

In higher eukaryotes, the kinase subunit of the mammalian P-TEFb, CDK9, shuttles between the nucleus and cytoplasm, similarly to yeast Ctk1 [73], and likewise associates with translating polysomes [61]. Moreover, a recent study in the human osteosarcoma U2OS cells reported that CDK12 is involved in the translation of a specific subset of mRNAs by collaborating with the mechanistic target of rapamycin (mTORC1) kinase [74]. CDK12 directly phosphorylates the mRNA 5′ cap-binding repressor, 4E-BP1, at Ser65 and Thr70, to promote the translation of mTORC1-dependent mRNAs. This phosphorylation is facilitated by the phosphorylation of 4E-BP1 by mTORC1 at Thr37 and Thr46 and followed by the exchange of 4E-BP1 with eIF4G at the 5′ cap of many mTORC1 target mRNAs. Additionally, by genome-wide ribosome profiling (Ribo-seq), they showed that CDK12 mediates a specialized translation network that coordinates the expression of crucial factors in the centrosome, centromere, and kinetochore proteins to direct mitotic progression. Parallel with this, cells deprived of CDK12 display profound mitotic chromosomal instability, including chromosome misalignment, bridging and segregation defects, raising the possibility that the CDK12-involved translation network correlates with the genomic stability, a defect hallmark of CDK12 mutant cancers [44, 45, 75, 76]. Unlike the global effect of mRNA translation initiation and elongation by yeast CTDK-I, CDK12 controls only a specific subset of mRNAs, including mTORC1 target mRNAs. As they mentioned, this may occur due to the absence of a 4E-BP1 homolog in yeast and reflects the greater diversity and regulation of mRNA cap complex assemblies in metazoans [74].

Conclusions and perspectives

The highly conserved CTDK-I complex plays a central role in RNApII transcription by phosphorylating its CTD at Ser2 and thus recruiting 3′-end processing factors or various histone-modifying enzymes to the chromatin. In this review, we focus on the previously underappreciated roles of CTDK-I, such as its involvement in the regulation of the genomic stability and the rDNA integrity, rRNA synthesis by RNApI, and translational initiation and elongation and mTORC1-dependent translation. Our review shows that CTDK-I is multifunctional and significantly influences many steps in gene expression across the cell (see Figs. 2, 3 for details). The widespread distribution of Ctk1 and its multiple functions within the cell reflect that the CTDK-I complex coordinates diverse cellular processes required for gene expression, from transcription to translation, and these functions are important to maintain genome stability and prevent deleterious mutations. Therefore, continued effort to identify novel substrates for CTDK-I kinase is important for comprehending the underlying mechanisms by which it contributes to the coordination of many cellular processes. Moreover, it appears necessary to define the pathways by which CTDK-I shuttles between the nucleus and cytoplasm; the pre-40S subunit that includes Rps2 would be one plausible partner because it is synthesized in the nucleoplasm but functions in the cytoplasm, where it interacts with and is phosphorylated by yeast CTDK-I. It will also be promising to test whether the multifunctional nature of CTDK-I contributes to the regulation of lifespan, as cellular aging is generally correlated with attenuated gene expression, including both transcription and translation, and with increased genome instability. Thus, a comprehensive understanding of the role of the evolutionarily conserved CTDK-I in many cellular processes would provide insight into how multiple steps in gene expression are coordinated and utilized under physiological conditions.

Fig. 3
figure 3

Multiple cellular roles of CTDK-I beyond mRNA transcription. A schematic diagram shows multiple roles of the CTDK-I complex. The primary roles of the CTDK-I complex are during transcription elongation and mRNA 3′-end processing. However, recent studies have found that the CTDK-I complex also has significant direct or indirect roles in other cellular processes in various locations across the cell, including the regulation of genome stability in both the nucleoplasm and the nucleus, rRNA synthesis in the nucleolus, and translation initiation and elongation in the cytoplasm. The CTDK-I complex is shown in the center, and its multiple cellular roles are indicated in the gray box. Detailed descriptions of the function of each pathway are listed below the box