Keywords

Introduction

Protein glycosylation is the most prevalent and complex posttranslational modification (Varki 1993). Glycan structures associated with proteins and lipids have been shown to regulate numerous biological and physiological events, such as protein folding and quality control, protein function and signaling, protein half-life and endocytosis, cell-cell interaction, autoimmunity, hormone bioactivity, gene expression, tumor metastasis, and many others (Moremen et al. 2012; Varki 1993). Most glycans on membrane and secreted proteins can be classified into two major types based on their linkage to polypeptide structures: N-linked glycans and O-linked glycans. Glycans attached to Asn side chains of polypeptides form N-linked glycans, while O-glycosylation is initiated by the addition of O-linked N-acetylglucosamine (GalNAc) to serine or threonine, catalyzed by a family of polypeptide N-acetylgalactosaminyltransferases (ppGalNAcT) enzymes. In addition to O-linked GalNAc, commonly referred to as mucin-type O-glycosylation, additional, distinct glycan types linked to Ser or Thr with defined functions have been identified, including O-mannose,O-Fuc, and O-GlcNAc (Moremen et al. 2012). Glycan structures on glycoproteins are complex with a great diversity for branching and anomeric linkages. Although the biosynthesis of the polypeptides of glycoproteins is controlled by the transcription and translation of their corresponding genes, the posttranslational glycosylation of these glycoproteins is not template driven. Instead, hundreds of proteins are involved in the biosynthesis of the various kinds of glycan structures. Among these, about 250 open reading frames in the mouse and human genomes are identified as glycosyltransferases, the enzymes that catalyze the addition of glycans to polypeptides using nucleotide or lipid-linked sugars as activated donor substrate. Therefore, the expression levels of glycans on glycoconjugates are determined primarily by the activities of corresponding glycosyltransferases. The transcripts and expression of glycosyltransferases are regulated during embryonic development, cellular differentiation, and proliferation. Aberrant expression of these enzymes results in aberrant glycans, which have been associated with a wide range of human diseases, including tumor, diabetes, cardiovascular, immunological, infectious, and congenital disorders (Marth and Grewal 2008).

The expression of glycosyltransferases is controlled by many biological effectors, including the Golgi complex itself, which can regulate the function of glycosyltransferases, and various transcriptional regulators that control glycosyltransferase gene expression, such as the binding of transcription factors to promoter regions, as well as epigenetic regulation. Transcription factors are a group of proteins that recognize and bind to a segment of DNA in the promoter and/or enhancer region, recruiting cofactors and RNA polymerase II and regulating the expression of specific genes (Lee and Young 2013; Spitz and Furlong 2012). There are two different types of transcription factors based on their regulatory responsibilities, those that control initiation and those that control elongation. Most transcription factors are believed to control transcription initiation by recruiting co-activators, such as the Mediator complex, P300, and general transcription factors (Juven-Gershon and Kadonaga 2010; Lee and Young 2013; Taatjes 2010). Gene regulation by transcription factors plays a pivotal role in many cell events, and several human diseases and syndromes have been linked to the mutations in transcription factors, including diabetes, cardiovascular diseases, autoimmunity, and certain cancers (Lee and Young 2013).

Regulation of Glycans by Transcription Factors

Many studies have investigated the transcriptional regulation of glycosyltransferase gene expression by transcription factors using various strategies, such as rapid amplification of 5′ cDNA ends (5′ RACE), luciferase reporter assays, series deletion of promoter regions, site-directed mutagenesis, electrophoretic mobility shift assays (EMSA), and chromatin immunoprecipitation (ChIP) assays. Results show that the expression of glycosyltransferases associated with the initiation, extension, branching, and modification of both N- and O-linked glycans are regulated by transcription factors in a tissue- or cell-specific manner. The types of transcription factors involved in this regulation vary from the common families of transcription factors, such as Ets and AP-2, to some differentiation inducers and cytokines that directly bind to cis-regulatory elements in flanking sequences of glycosyltransferase genes, such as ATRA and IL13. The binding of transcription factors to promoter regions influences either positively or negatively the transcription of specific genes. Table 1 summarizes the involvement of transcription factors in the regulation of the expression of particular glycosyltransferases.

Table 1 Regulation of glycans by transcription factors

Perspective

A study that cataloged changes of glycosyltransferase transcript expression, as well as changes in glycan structures, during mouse embryonic stem cell differentiation also compared these two sets of data to determine the degree to which the changes were associated (Nairn et al. 2012). The conclusion was that in by far the majority of cases, there was a positive association between changes in a particular glycosyltransferase transcript and the changes in glycan expression whose levels these enzymes should impact. This result suggests that the most common mechanism for regulation of glycan expression is due to changes in glycosyltransferase transcript expression. There were, however, a significant number of cases where there were changes in a transcript but no corresponding change in the glycan products synthesized by the enzyme encoded by the transcript, or vice versa, there were no changes in transcripts but there were observed changes in corresponding glycan products attributed to the enzyme encoded by these transcripts. These intriguing examples demonstrate not only that transcript expression is only one of many regulatory mechanisms for glycan expression but also that our understanding of these mechanisms is very incomplete. A study of some of these examples may, therefore, allow us to identify additional regulatory mechanisms for glycan expression. It is also clear that the expression of at least one glycosyltransferase is regulated epigenetically (Kizuka et al. 2011). Presently, the precise mechanisms that regulate glycosyltransferase transcripts have only been studied in detail for about two dozen enzymes (Table I). As more and more functions for specific glycan structures are discovered and the mechanisms by which their expression become relevant, this list of enzymes and the relevant transcription factors in play will undoubtedly grow. Some glycosyltransferase transcripts have extensive 3′-untranslated regions; therefore, it is quite likely that there is control of glycosyltransferase expression by posttranscriptional mechanisms , as well.

Further studies are needed to elucidate precisely how expression of glycosyltransferases is controlled by transcription factors in specific tissues and cell types. Since many diseases and syndromes are associated with mutations in both transcription factors and regulatory regions of promoters (Lee and Young 2013), analyses of genetic variations in transcription factors and regulatory regions of glycosyltransferase genes will likely define a research direction for the study of the transcriptional regulation of glycans expression. Indeed, mutations in the cis-elements of promoter regions of glycosyltransferases, leading to the generation of additional binding sites for certain transcription factors, have been reported recently (Raducu et al. 2012).