Truncation of the TPR domain of OGT alters substrate and glycosite selection

Ramirez, Daniel H.; Yang, Bo; D’Souza, Alexandria K.; Shen, Dacheng; Woo, Christina M.

doi:10.1007/s00216-021-03731-8

Truncation of the TPR domain of OGT alters substrate and glycosite selection

Paper in Forefront
Published: 02 November 2021

Volume 413, pages 7385–7399, (2021)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Analytical and Bioanalytical Chemistry Aims and scope Submit manuscript

Truncation of the TPR domain of OGT alters substrate and glycosite selection

Download PDF

Daniel H. Ramirez¹^nAff2,
Bo Yang¹,
Alexandria K. D’Souza¹,
Dacheng Shen¹ &
…
Christina M. Woo¹

2402 Accesses
14 Citations
5 Altmetric
Explore all metrics

Abstract

O-GlcNAc transferase (OGT) is an essential enzyme that installs O-linked N-acetylglucosamine (O-GlcNAc) to thousands of protein substrates. OGT and its isoforms select from these substrates through the tetratricopeptide repeat (TPR) domain, yet the impact of truncations to the TPR domain on substrate and glycosite selection is unresolved. Here, we report the effects of iterative truncations to the TPR domain of OGT on substrate and glycosite selection with the model protein GFP-JunB and the surrounding O-GlcNAc proteome in U2OS cells. Iterative truncation of the TPR domain of OGT maintains glycosyltransferase activity but alters subcellular localization of OGT in cells. The glycoproteome and glycosites modified by four OGT TPR isoforms were examined on the whole proteome and a single target protein, GFP-JunB. We found the greatest changes in O-GlcNAc on proteins associated with mRNA splicing processes and that the first four TPRs of the canonical nucleocytoplasmic OGT had the broadest substrate scope. Subsequent glycosite analysis revealed that alteration to the last four TPRs corresponded to the greatest shift in the resulting O-GlcNAc consensus sequence. This dataset provides a foundation to analyze how perturbations to the TPR domain and expression of OGT isoforms affect the glycosylation of substrates, which will be critical for future efforts in protein engineering of OGT, the biology of OGT isoforms, and diseases associated with the TPR domain of OGT.

Graphical abstract

The active site of O-GlcNAc transferase imposes constraints on substrate sequence

Article 03 August 2015

Polypeptide N-acetylgalactosamine transferase 3: a post-translational writer on human health

Article 02 September 2022

Cryo-EM structure of human O-GlcNAcylation enzyme pair OGT-OGA complex

Article Open access 31 October 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

O-Linked N-acetyl glucosamine (O-GlcNAc) is a monosaccharide post-translational modification installed on serine or threonine residues of thousands of nucleocytoplasmic proteins across multiple species [1]. O-GlcNAc is a dynamic and reversible modification that is regulated by a pair of essential enzymes, the writer O-GlcNAc transferase (OGT) and the eraser O-GlcNAcase (OGA) [2, 3]. Regulatory maintenance of O-GlcNAc levels in the proteome is critical for normal cellular physiology as loss of either OGT or OGA disrupts cellular growth and limits organismal survival [4, 5], and disruptions to O-GlcNAc homeostasis have been linked to genetic diseases (e.g., X-linked intellectual disability) [6] and various chronic diseases, including neurodegeneration [7], diabetes [8], and cancer [9].

Due to the essential requirement of OGT for substrate O-GlcNAcylation [10], deciphering the basis of OGT’s substrate selection mechanisms has attracted much interest [2, 11, 12]. The canonical nucleocytoplasmic form of OGT (ncOGT) is composed of two domains: an N-terminal tetratricopeptide repeat (TPR) domain and a C-terminal catalytic domain (Fig. 1A). The recognition and glycosylation of protein substrates are primarily mediated by the TPR domain, which consists of 13.5 (rounded to 13 hereafter) repetitive 34-amino-acid antiparallel alpha helices in ncOGT [13,14,15]. Two additional isoforms of OGT, the mitochondrial isoform (mOGT) and the short isoform (sOGT), have truncated TPR domains composed of 9.5 and 2.5 TPR units, respectively (rounded to 9 and 2 hereafter, Fig. 1A) [16,17,18]. Truncation of the TPR domain affects the OGT interactome [19,20,21,22,23,24] which may provide a molecular basis for observed cellular signaling and proliferation effects [10, 25, 26] or for human diseases associated with mutations in the TPR domain, such as X-linked intellectual disability (XLID) [6, 27,28,29,30]. The relationship of these OGT isoforms to substrate and glycosite selection has been explored in vitro, where variation of the TPR domain alters substrate selectivity and truncation of the TPR domain as in sOGT limits the resulting glycosyltransferase activity [11, 18, 23]. However, expression of sOGT in cells showed a robust increase in global O-GlcNAcylation [31, 32], indicative of differences between in vitro and cellular assays. Given the expression of OGT isoforms changes as a function of age [33] and their differential relationship to human disease [34], investigation of the substrate and glycosite selection of the OGT isoforms in cells is vital.

In our efforts to better understand the regulatory role of O-GlcNAc on substrates in cells, we recently reported the development of a pair of complementary methods to write and erase O-GlcNAc on a target protein using nanobody fusions to OGT and split-OGA, respectively [35, 36]. The nanobody is a small single-domain protein binder that recruits the enzyme to the desired target protein in cells for selective manipulation of O-GlcNAc levels. To enhance selectivity for a desired substrate, these enzymes were engineered to reduce their innate substrate selection mechanisms, which were then replaced with a nanobody. Previously, we achieved enhanced protein selection through the nanobody with OGT by truncation of the TPR domain to four repeats [35]. However, whether further engineering of the TPR domain would afford even higher selectivity through the nanobody or enable enhanced selectivity for protein regions and glycosites was unknown.

Here, we report a systematic investigation of the TPR domain and its relationship to the protein substrate and glycosite selection of OGT in cells. We generated truncations of each of the TPRs of OGT to characterize subcellular localization and activity on a global scale, followed by a detailed survey of the substrates and glycosites produced by overexpression of RFP-tagged isoforms corresponding to ncOGT, mOGT, sOGT, and an OGT with 4 TPR repeats in U2OS cells. We found that most of these OGT isoforms were active in cells, which results in altered substrate and glycosite profiles and find that the first four TPRs afford the broadest substrate selection, while the last four TPRs mediate the glycosite selection. These data will enhance future engineering of the O-GlcNAc proteome and provide insight to the potential protein substrates mediated by different OGT isoforms.

Methods

Cell culture, transfection, immunofluorescence, and cell lysate collection

Methods outlined below were performed as previously reported, with modifications [37]. Unless otherwise noted, the experiments were performed with U2OS cells (a gift from the Choudhary Lab, Broad Institute). Cells were cultured in high glucose with pyruvate Dulbecco’s modified Eagle medium supplemented with 10% FBS and 1% penicillin–streptomycin at 37 °C in a humidified atmosphere with 5% CO₂. Samples for western blot or immunofluorescence were prepared from cells seeded in a well of a sterile 6-well plate at a density ~ 0.4 × 10⁶ cells/well and transfected at ~ 80% confluency the next day. For mass spectrometry–based glycoproteomics experiments, cells were seeded at the density ~ 7 × 10⁶ cells/plate in sterile 150-mm tissue culture dishes (Corning, 25,383–103) and transfected at ~ 80% confluency the next day. Transient expression of the indicated proteins was performed by transfection using TransiT-PRO (Mirus Bio, MIR 5740) with the desired plasmids following the manufacturer’s protocol. Cells were incubated for 48 h after transfection for all experiments. For fluorescent imaging, cells were visualized with an ECHO microscope using the TX Red bandpass filter.

To generate lysates, the transfected cells were collected and lysed by probe sonication in lysis buffer [150 µL for western blot samples or 800 µL for MS samples of 2% SDS + 1 × PBS + 50 µM Thiamet-G + 1 × protease inhibitors (cOmplete™, EDTAfree Protease Inhibitor Cocktail, Sigma Aldrich; 11,873,580,001)]. A BCA assay was performed to determine protein concentration, and the concentration was adjusted to 2.5 µg/µL with lysis buffer for western blot samples and 5 µg/µL for MS samples. One hundred micrograms of protein sample was prepared with 5 × BME buffer, and the sample was heated for 5 min at 98 °C and centrifuged prior to gel loading.

Western blotting

The protein sample (15 µL) was loaded on 6–12% Tris–glycine SDS-PAGE gels and run on a Mini-PROTEAN® BioRad gel system. Gels were transferred with the Invitrogen iBlot. Membranes were stained with Ponceau stain to verify transfer and equal protein loading and blocked with 3% BSA + 1 × TBST for 1 h at 24 °C. Primary antibodies and the following dilutions were incubated with the membranes for 12 h: anti-HA (1:1000; Cell Signaling; 3724S), anti-O-GlcNAc RL2 (1:1000; Abcam; ab2739), and anti-GAPDH (1:1000; Cell Signaling; D4C6R). The membranes were washed 3 × 5 min each wash with 1 × TBST and incubated with the following secondary antibodies and dilutions: anti-rabbit HRP (1:10,000; Rockland; 611–1302) and anti-mouse IR 800 (1:10,000; LI-COR; 925–32,210). The membranes were washed 3 × 5 min each wash with 1 × TBST, and results were obtained by chemiluminescence or IR imaging using Azure c600. The membranes were quantified using LI-COR Image Studio Lite, and graphs were made with GraphPad Prism 9.

Molecular cloning

The RFP-OGT TPR truncations were generated by restriction enzyme digestion of the previously developed plasmid HA-RFP-OGT(13) [35] using the BamHI and NotI enzymes to extract the OGT(13) sequence. OGT TPR truncation regions of interest were amplified from the HA-RFP-OGT(13) with forward primers containing a BamHI restriction site and reverse primers containing a NotI restriction site. Standard molecular cloning techniques were used to generate the final plasmids.

Chemical enrichment of glycoproteins and sample preparation for IsoTag

Chemical enrichment of glycoproteins for MS analysis was processed as previously described with minor modifications [35, 38]. After the collection of transfected cells and the generation of reduced and alkylated cell lysates, a BCA protein assay (Pierce) was performed on cell lysates and protein concentration was adjusted to 5 μg/μL with lysis buffer. The protein lysate (2 mg, 400 μL) was placed on ice and mixed with water (490 μL), GalT1 labeling buffer (800 μL, final concentrations: 50 mM NaCl, 20 mM HEPES, 2% NP-40, pH 7.9), 100 mM MnCl₂ (110 μL), 500 μM UDP-GalNAz (100 μL), and 2 mg/mL GalT1 enzyme (100 μL). The sample reaction was then rotated for 16 h at 4 °C, and the GalT reaction was quenched by methanol–chloroform protein precipitation [methanol/chloroform = 4:1 (v/v)]. The sample was allowed to air-dry for 5 min at 24 °C before resuspension in 2% SDS + 1 × PBS (400 μL). The sample was treated with a pre-mixed solution of the click chemistry reagents [100 μL; final concentration of 200 μM IsoTaG silane probe (3:1 heavy:light mixture), 500 μM CuSO₄, 100 μM THPTA, 2.5 mM sodium ascorbate], and the reaction was incubated for 4 h at 37 °C. The click reaction was quenched by methanol–chloroform protein precipitation, and the protein pellet was allowed to air-dry for 5 min at 24 °C. The dried pellet was resuspended in 2% SDS + 1 × PBS (300 μL) by probe tip sonication and then diluted in 1 × PBS (3 mL) to a final concentration of 0.2% SDS. A 50% slurry of streptavidin–agarose resin [800 μL, washed with 1 × PBS (2 × 1 mL)] was added to the protein solution, and the resulting mixture was incubated for 12 h at 24 °C with rotation. The beads were washed using spin columns with 8 M urea (5 × 1 mL), 0.1% SDS + 1 × PBS (5 × 1 mL), 1 × PBS (5 × 1 mL), and water (5 × 1 mL). Spin columns were capped and the beads were resuspended in 470 μL 1 × PBS + 1 mM CaCl₂, urea (8 M, 30 μL), and trypsin (2 μg), and protein digestion was performed for 16 h at 37 °C with rotation. Supernatant was collected and the beads were washed three times with 1 × PBS (2 × 200 μL) and MilliQ water (200 μL). The trypsin fraction for protein identification was formed by combining the washes and supernatant digest. To cleave glycopeptides from beads, the IsoTaG silane probe was cleaved with 5% formic acid/water (2 × 200 μL) for 1 h at 24 °C with rotation and the glycopeptide eluent was collected. The beads were washed with 50% acetonitrile–water + 1% formic acid (2 × 200 μL), and the washes were combined with the glycopeptide eluent to form the cleavage fraction for site-level identification. The trypsin and cleavage fractions were dried in a vacuum centrifuge and desalted using C18 tips following the manufacturer’s instructions. The trypsin fractions were resuspended in 50 mM TEAB (20 μL), and the samples were labeled with the corresponding amine-based TMT 16-plex (5 μL) and reacted for 1 h at 24 °C. The TMT reactions were quenched with 2 µL of a 5% hydroxylamine solution and combined before concentration and fractionation into six samples using a High pH Reversed-Phase Peptide Fractionation Kit (Thermo Fisher Scientific). All samples were stored at − 20 °C until analysis.

Mass spectrometry parameters used for glycoproteomics and data analysis

The following instruments and parameters were used as previously described [36]. A Thermo Scientific EASY-nLC 1000 system was connected to an Orbitrap Fusion Tribrid with a nano-electrospray ion source. The mobile phases A and B were water with 0.1% (vol/vol) formic acid and acetonitrile with 0.1% (vol/vol) formic acid, respectively. For the trypsin fractions, peptides were separated using a linear gradient from 4 to 32% B for 50 min, then increased to 50% B for 10 min and finally increased to 98% B for 10 min followed by re-equilibration. For the cleavage fractions, peptides were separated with a linear gradient from 5 to 30% B for 95 min, then increased to 50% B for 15 min and finally increased to 98% B for 10 min followed by re-equilibration. The instrument parameters were set as previously described with minor modifications [35, 38]. Briefly, the MS1 spectra were recorded from m/z 400–2000 Da and if the oxonium product ions (138.0545, 204.0867, 345.1400, 347.1530 m/z) were observed in the HCD spectra, then ETD (250 ms) with supplemental activation (35%) was performed in a subsequent scan on the same precursor ion selected for HCD. Other relevant parameters of EThcD include an isolation window (3 m/z), use calibrated charge-dependent ETD parameters (True), Orbitrap resolution (50 k), first mass (100 m/z), and inject ions for all available parallelizable time (True). The raw data was processed using Proteome Discoverer 2.4 (Thermo Fisher Scientific). For the trypsin fraction, the data were searched against the UniProt/SwissProt human (Homo sapiens) protein database (20,355 proteins, downloaded on Feb. 21, 2019) and contaminant proteins using the Sequest HT algorithm. The database was adjusted by removing the OGT and JunB entries and replacing with RFP-OGT and GFP-JunB. Searches were performed as previously described [36]. For the cleavage fraction, both HCD and EThcD spectra were searched against the proteome identified in the trypsin fraction using Byonic algorithms. The searches were performed with the following guidelines: trypsin as enzyme, 3 missed cleavages allowed; 10 ppm mass error tolerance on precursor ions; and 0.02 Da mass error tolerance on fragment ions. Intact glycopeptide searches allowed for tagged HexNAc modifications (HexNAc + 203.0794 on cysteine, serine, threonine). Methionine oxidation and cysteine carbaminomethylation were set as variable modifications. Glycopeptide spectral assignments passing a FDR of 1% at the peptide spectrum match level based on a target decoy database were kept. Singly modified glycopeptides assigned from EThcD spectra passing a 1% FDR and possessing a delta modification score of greater than or equal to ten from > 2 PSMs were considered unambiguous glycosites. The label-free quantification of unambiguous glycosite was performed as previously described [39].

Gene ontology analysis

Differentially abundant proteins (DAPs) across treatments were calculated in R using one-way ANOVA and an adjusted p < 0.01 after Benjamini–Hochberg FDR correction. The Database for Annotation, Visualization and Integrated Discovery (DAVID) [40] was used for gene ontology (GO) enrichment analysis of DAPs with the whole Homo sapiens genome as the background and with a significant enrichment threshold of a Benjamini correction of p < 0.05. The DAPs were also subjected to protein–protein interaction (PPI) analysis using the STRING database [41].

OGT consensus sequence generation

Consensus sequences were generated by extraction of + /– 15 amino acid residues surrounding an unambiguous glycosite at S/T/C. Glycopeptides with only a single unambiguous glycosite were used for consensus sequence generation. Consensus sequences that were less than 31 amino acids were excluded. The final list of glycosites was entered into the Berkeley Weblogo program to generate the consensus sequences. We extracted amino acids around the glycosite in the − 5 and + 15 positions to afford the consensus sequence around these glycosites for clarity in figures.

Results

OGT TPR truncations exhibit altered localization, but most remain active in cells

We began our studies by generating sequential TPR truncations from ncOGT that are fused to the C-terminus of mRFP [RFP(X), X = number of TPRs], which we had previously characterized for high expression and activity (Fig. 1B) [35]. In addition, we included the catalytically inactive RFP(13,K852A) as a control. The resulting 14 RFP-fused OGT constructs with laddered TPR domains were transfected to U2OS cells, and the localization of the proteins was determined by fluorescence microscopy (Fig. 1B). We observed the localization of OGT transition from a primarily nuclear to a nucleocytoplasmic localization as the TPR domain was truncated, with a major inflection point occurring in constructs with five or fewer TPRs. The transfected cells were then collected to determine the levels of OGT expression and changes to the global O-GlcNAc proteome using the O-GlcNAc RL2 antibody (Fig. 1C). Excitingly, most of the TPR truncations exhibited elevated O-GlcNAc levels, with the exception of the catalytically inactive RFP(13,K852A) and RFP(0), which has the TPR domain completely removed. Interestingly, RFP(2) and RFP(1) displayed a reduced and altered O-GlcNAc band pattern relative to the other TPR truncations. The altered subcellular localization and global O-GlcNAc levels produced by the TPR truncations, particularly exhibited by RFP(2) and RFP(1), correspond with altered substrate selection previously shown by in vitro studies with single proteins and cell extracts [11, 18].

Overexpression of OGT isoforms enhances glycosylation of proteins associated with mRNA processing

To gain insight into the OGT substrates that are differentially modified by the TPR truncations, we next sought to gain deeper insight to the O-GlcNAc proteins and their modification sites by quantitative proteomics and glycoproteomics [35, 36]. We focused our efforts on the most studied OGT isoforms consisting of 13, 9, 4, and 2 TPRs. In addition, we co-expressed the RFP(X) construct with GFP-JunB to serve as an internal benchmark to determine the extent of modification on GFP-JunB by the different TPR truncations in comparison to endogenous OGT. U2OS cells co-transfected with GFP-JunB and the indicated OGT construct were collected, and the O-GlcNAcylated proteins were chemoenzymatically labeled for biotin-based enrichment. Following enrichment, the O-GlcNAcylated proteins were digested for protein identification and quantification by mass spectrometry (MS). The biotinylated glycopeptides were recovered by cleavage of the probe and analyzed separately. Over 1200 proteins were identified after O-GlcNAc enrichment (SI Table 1). To reduce complexity in the initial analysis and to identify the most significantly enriched proteins as a function of overexpression of at least one of the OGT isoforms, we performed a comparison of the samples transfected with an OGT isoform to the control by a one-way ANOVA, which revealed 32 significantly enriched proteins [fold enrichment ≥ 1.5, adjusted p value ≤ 0.01 (Fig. 2A, SI Table 1)]. As expected, JunB and OGT are the highest O-GlcNAcylated proteins in this dataset due to their overexpression. All 32 significantly enriched proteins are found in RFP(13) samples, and 24 are exclusively enriched with RFP(13). A subset of the other proteins are relatively insensitive to regulation through the TPR domain (i.e., SPART, ZC3HAV1, CEP170), while others are specifically modified by RFP(13) and RFP(9) (i.e., ANXA7, EIF2SI, MLF2, PHLDB1). A network analysis of the 32 proteins identified a highly enriched network cluster of 10 proteins involved in mRNA processes such as splicing, processing, and mRNA decay (Fig. 2B). The remaining proteins formed minor relationships related to lipid droplets, microtubules, or cellular signaling, or were not clustered to a network, and are not represented in the network analysis diagram for clarity.

RFP(13) has the broadest substrate scope of all of the OGT isoforms

Having identified the most significantly enriched proteins and pathways shared between the OGT constructs, we then sought to analyze an expanded set of affected proteins and pathways enriched by OGT overexpression using a reduced cut-off criterion. We analyzed differences in the significantly enriched O-GlcNAcylated proteins between the co-transfected OGT constructs and the GFP-JunB only control [log₂(RFP(X)/control) ≥ 1.5, p ≤ 0.05, Fig. 3A, SI Table 2]. Of the over 1200 proteins, 212 were significantly enriched in the RFP(13) sample, 124 with RFP(9), 114 with RFP(4), and 123 with RFP(2) in comparison to the control (Fig. 3B). Inherent glycosylation efficiency of the control protein GFP-JunB by each of the OGT isoforms was also variable, with the highest recognition by RFP(13) and the lowest glycosylation by RFP(4). In addition to GFP-JunB, a comparison of the significantly enriched proteins between the OGT constructs revealed a total of 59 highly enriched proteins across the four co-transfected samples, indicating that these proteins do not require the extended TPR domain for glycosylation. Conversely, the RFP(13) sample enhanced glycosylation of the most unique proteins at 71 proteins, which represents substrates that appear to require the full-length TPR domain (Fig. 3C). By contrast, selective O-GlcNAcylation of 3 proteins with RFP(9), 7 proteins with RFP(4), and 10 proteins with RFP(2) was observed (Fig. 3C).

To obtain a broad overview of the biological processes selectively enriched by expression of each of the OGT constructs, we performed a gene ontology (GO) analysis on all the enriched proteins from each condition. After a stringent filter for clusters of adjusted p value ≤ 0.05, two clusters involving the biological processes of cell–cell adhesion and regulation of translation initiation were shared between all the OGT constructs (Fig. 4, SI Table 3). As before, proteins that are O-GlcNAcylated by RFP(13) yielded the highest number of biological process clusters at 14, of which 9 of these processes involved disruptions to components of the nuclear pore complex. The mitotic nuclear envelope disassembly cluster was shared between proteins that are O-GlcNAcylated by RFP(13) and RFP(9), which represents the mitochondrial isoform of OGT.

Interestingly, GO analysis of O-GlcNAcylated proteins found in the RFP(2) condition was the only set that enriched for the negative regulation of the mRNA splicing process. RFP(2) represents the sOGT isoform, which has been described as a negative regulator of ncOGT by disrupting the Ataxin-10–ncOGT interaction [42]. Of the 5 enriched proteins associated with the negative regulation of the mRNA splicing process in the RFP(2) condition, three (splicing factor U2AF, polypyrimidine tract-binding protein 1, and transformer-2 protein homolog beta) are shared between RFP(13) and RFP(2). The remaining two proteins, RNA-binding protein 10 and apoptotic chromatin condensation inducer in the nucleus (Acinus), are unique to the RFP(2) construct.

In sum, this quantitative proteomic analysis of the O-GlcNAcylated proteome produced by four OGT isoforms confirmed that the full-length OGT possesses the broadest swath of substrates across the greatest number of biological processes and revealed specific substrates and biological pathways that are shared and unique to each of the examined OGT isoforms. Furthermore, additional truncation of the TPR domain for higher protein selectivity through nanobody fusions may not be advantageous given the similar global O-GlcNAc levels and gain of specific substrates with RFP(2).

Overexpression of TPR truncations shows an altered consensus sequence at the glycosite

As the enrichment step of our proteomics workflow allows us to separate trypsin-digested glycopeptides from unmodified peptides, we next performed glycoproteomics on the recovered glycosites to determine the effects of overexpression of truncated OGT constructs on the site level. Mass spectra were searched against the over 1200 enriched proteins using Byonic, and the results were filtered based on ambiguous versus unambiguous glycosites (SI Table 4). We defined an unambiguous glycosite as a modified glycopeptide having a ≥ 10 delta modification score by EThcD fragmentation. All other glycopeptides were categorized as ambiguous. In total, we identified over 3900 peptide spectral matches (PSMs) that were assigned to 891 unique unambiguous glycopeptides from 210 proteins across 6 biological replicates for each condition and a total of 30 samples. To determine if any new unambiguous O-GlcNAc proteins had been identified, we searched for these 210 proteins on the recently published O-GlcNAc database [1] and found 6 new proteins (Table 1). Of these 6 proteins, three were modified by RFP(13), two by RFP(9), three by RFP(4), and four by RFP(2). Two proteins, PLCB3 and MAP2K1, were unique to RFP(2), and WBP2 was unique to RFP(4).

Table 1 New glycoproteins identified in the OGT conditions. The > 200 glycoproteins with unambiguous glycosites were compared to the O-GlcNAc database, and 6 new glycoproteins were identified. The condition and glycosites in which the glycoprotein was identified are listed with the gene and protein name

Full size table

To obtain a broad overview of how the glycosites on proteins changed with the different OGT constructs, we next generated consensus sequences of the unique glycosites found in each sample (Fig. 5). These consensus sequences were generated from unique unambiguous glycosites identified in each condition. We identified 113, 246, 179, 232, and 205 unique glycosites in the GFP-JunB control, RFP(13), RFP(9), RFP(4), and RFP(2) respectively. These unique glycosites were then used to obtain an O-GlcNAc consensus sequence for each of the OGT conditions that could be used to compare to the control, which contains glycosites introduced by the endogenous OGT (Fig. 5A). As the TPR domain positions substrates for glycosylation through interactions toward the C-terminus of the peptide sequence, we extracted amino acids around the glycosite in the − 5 and + 15 positions to afford the consensus sequence around these glycosites (SI Table 4). As expected, the consensus sequence local to the glycosite (position 0) is enriched for a PPV sequence at the − 3 to − 1 positions on the peptide, which assists in extending the peptide in the catalytic site of the OGT [43]. In general, glycosylated protein regions have a high abundance of polar uncharged and hydrophobic residues in the 1 to 15 position, with some enrichment for K that may form contacts with acidic residues in the TPR domain, as previously observed in vitro [44]. Unique sequences observed in the RFP(13) largely mimic the consensus sequence observed in the control conditions. However, overexpression of the additional OGT constructs, RFP(9), RFP(4), and RFP(2), increases the prevalence of additional polar uncharged and charged residues like Q, N, K, E, and D.

We then generated a Venn diagram to visualize glycosites that are common between each OGT isoform and the GFP-JunB only control or unique to the overexpression of the indicated OGT construct. We found that the majority of glycosites are shared between two or more conditions, but that 66, 18, 42, and 55 glycosites from 48, 17, 36, and 35 proteins were identified only in the RFP(13), RFP(9), RFP(4), and RFP(2) conditions, respectively (Fig. 5B, SI Table 4). Most of these unique glycosites can be traced to a single protein though in some cases multiple unique glycosites are found on one protein (i.e., JunB, OGT, AHNAK, HCF-1, VIM, ZC3H14, PLEC). To identify glycosite features important for each OGT isoform, we generated an O-GlcNAc consensus sequence with these unique glycosites for each of the OGT conditions (Fig. 5C). Overexpression of the OGT constructs generally increases the prevalence of additional polar uncharged and charged residues like Q, N, K, E, and D. The consensus sequence from the RFP(2) condition was particularly enriched in negatively charged E residues. To determine if the changes in the O-GlcNAc consensus sequence were due to an altered glycosite selection or substrate preference, we generated a Venn diagram of the proteins identified from the unique glycosites in Fig. 5B. We found that the majority of these glycosites that were only observed in one condition also derived from proteins that were unique to that condition (Fig. 5D, SI Table 4). This analysis indicates that a major contributor to the altered O-GlcNAc consensus sequence in Fig. 5C derives from an altered substrate preference of the OGT isoforms. Collectively, the majority of glycosites detected during OGT overexpression are shared across two or more conditions and the unique glycosites identified can be traced to an altered substrate preference by the OGT isoforms.

Label-free quantification of GFP-JunB indicates increased glycosite abundance with RFP(13) expression

In addition to the glycosite level data obtained for the global O-GlcNAc proteome, co-transfection of GFP-JunB allowed us to compare all the OGT conditions and their effect on GFP-JunB glycosites. An analysis of the unambiguous peptide spectral matches (PSMs) obtained from GFP-JunB after a search of the MS data revealed that RFP(13) had the highest number of PSMs detected at 161 and the detection of PSMs decreased as the TPR was truncated with 96, 97, 69, and 20 for the RFP(9), RFP(4), RFP(2), and control conditions respectively (Table 2), indicating a reduced activity on GFP-JunB with the TPR truncations. To determine the relative abundance of the GFP-JunB glycosites across conditions, we employed label-free quantification as previously described [39], averaged the abundance across the three biological replicates for each condition, and generated a heatmap for the relative abundance of each glycosite (Fig. 6A). In line with the PSM data, all OGT conditions exhibited elevated levels of O-GlcNAc on GFP-JunB. Glycosites at positions 84, 85, 95, and 100 are modified by the endogenous OGT in the control condition, and the abundance and intensity of these glycosites increase with OGT overexpression indicating these are stable glycosites naturally found on GFP-JunB. Glycosites at positions 28, 84, 85, and 312 appear to increase in intensity with longer TPRs while glycosites 95 and 100 are relatively insensitive to the TPR length. Additional glycosites at positions 14, 28, 38, 76, 129, 251, and 312, and surrounding regions are more readily detected in OGT overexpression conditions. In general, all OGT isoforms can modify GFP-JunB, although a reduction to two TPRs appears to affect the efficiency and access to glycosites and at least four TPRs are necessary to modify similar glycosites as longer TPR isoforms. Thus, in terms of O-GlcNAc engineering, use of OGT isoforms with at least four TPRs appears to be optimal for generating a similar glycosite profile as ncOGT, but use of the sOGT isoform may access unique glycosites that cannot be targeted by the other constructs.

Table 2 Total unambiguous peptide spectral matches (PSMs) obtained from GFP-JunB in each indicated condition

Full size table

Finally, we cross-referenced the identified glycosites on GFP-JunB with previously identified phosphorylation sites on JunB (Fig. 6B) [45]. Interestingly, we observed novel glycosites at positions 311 and 312 in the bZIP domain of JunB indicating a potential relationship between O-GlcNAc and DNA binding of JunB. Glycosites that also overlapped with phosphorylation at positions 68, 70, 102, 117, 237, 251, and 312 may also play a role in PTM crosstalk. Phosphorylation at positions 79 and 102 has been described as altering transcriptional activity [46], and phosphorylation in regions 251–259 has an effect on JunB degradation [47]. We have previously observed that JunB expression levels appear to increase as it is O-GlcNAcylated by a nanobody-OGT [35]. Further study of these intersections with JunB or other enriched glycoproteins obtained in our dataset may reveal additional differences between the glycosites installed by the OGT TPR truncations that may have implications on downstream regulation of the protein substrate.

Discussion

In this study, we report the first direct comparison of overexpressed OGT TPR truncations and their effect on the global O-GlcNAc proteome and a specific target protein GFP-JunB by quantitative proteomics and glycoproteomics. Consistent with prior reports, we find that OGT isoforms with a truncated TPR domain have intrinsic activity in cells [31, 32, 35], and now map this activity to substrates and their glycosites. We find that RFP(13), representing the full-length ncOGT, possesses the greatest substrate scope amongst the OGT isoforms examined, while the TPR truncations have more limited and sometimes unique access to substrates and glycosites. The differentiation is particularly accentuated by RFP(2), which represents sOGT, where the enriched glycoproteins and unique glycosites corresponded to a unique mRNA biological pathway and differentiated consensus sequence. These data could serve to increase glycosylation on desired substrates and influence glycosite selection while minimizing perturbations to the cell for O-GlcNAc engineering, in addition to the biological implications arising from differential substrate and site selection of the OGT TPR isoforms.

The relationship between the TPR isoforms of OGT and biological function is a growing area of investigation as interactions and substrate selection through the TPR domain may be of particular relevance to normal O-GlcNAc regulation [6, 10, 31, 48]. Comparison of glycoproteins in the presence of RFP(13) to a recently determined TPR interactome of ncOGT via a proximity-labeling approach [24] shows enrichment for similar biological processes, indicating that the TPR interactome and the enriched O-GlcNAcylated proteome correspond. Future studies may extend this comparison to the OGT isoforms mOGT and sOGT, in addition to characterizing whether these isoforms are important for cell viability and organismal function, although the isoforms of OGT are not required for cell proliferation in tissue culture [10]. Our data indicate that the isoforms of OGT possess both a complementary and unique substrate scope, accompanied by altered efficiency at modifying glycosites, which is potentially indicative of separable biological roles in cells if they are required for normal cellular function. These observations were made in the presence of endogenous OGT, which could mask more subtle differences between the OGT isoforms, and future studies in cell lines lacking endogenous OGT function may reveal additional differences. Further investigation of these OGT isoforms may yield essential substrates, binding partners, and glycosites required for cell proliferation and organismal development.

Analysis of the glycoproteome enriched by each of the OGT isoforms revealed differences in the biological processes that were conserved or unique to each isoform. One of the most enriched pathways across the OGT isoforms was the mRNA splicing pathway, which points to a possible reciprocal regulatory relationship between ncOGT and sOGT on mRNA splicing. Observation of the mRNA splicing pathway is particularly interesting given the highly regulated expression of OGT and OGA that maintain steady levels of O-GlcNAc by a detained intron-splicing mechanism [49, 50], although the exact mechanisms monitoring the levels of O-GlcNAc to trigger splicing have not been identified. If the OGT isoforms are differentially involved in the detained intron-splicing mechanism, the dataset reported here may assist in the evaluation of glycoproteins underlying this biological function. Indeed, one of the mRNA splicing proteins, Acinus, is regulated in Arabidopsis by O-GlcNAc or O-fucose and these distinct sugar modifications regulate which introns are spliced [51]. Separately, of the 14 enriched biological processes of RFP(13), 9 involved components of the nuclear pore complex (NPC). The biological role of O-GlcNAc on the NPC includes roles in the permeability barrier [52], transport rate [53], and stability of the nucleoporins to degradation [54,55,56]. Future study of these targets with nanobody-OGT [35] and nanobody-split-OGA [36] fusions for increasing or decreasing O-GlcNAc on specific target proteins may assist in the dissection of the role of O-GlcNAc in these proteins and their functions.

Of particular interest to our protein engineering efforts is that truncations of the TPR are still active in cells and modify similar glycosites as RFP(13) with a reduced substrate scope. The reduced substrate scope after truncation of the TPR domain to four repeats should indeed enhance selectivity for a desired target protein through the nanobody. Based on our data herein, further truncation of the TPR domain may not be advantageous given the similar global O-GlcNAc levels and gain of specific substrates and glycosites with RFP(2). However, the altered prevalence of polar and charged residues in the consensus sequence of truncated OGTs hints at the ability to limit access to certain glycosites and opens a pathway to creating glycosite-specific OGTs in the future. Undoubtedly, the full-length OGT is the construct that consistently generates the most detectable number of glycosites on overexpressed and endogenous protein substrates. Extension of efforts to determine how substrates engage the TPR domain of OGT would define the features of the extended TPR that promote this selection [11, 12]. Nevertheless, our dataset can shed light on the TPRs that are required to engage certain substrates in a global manner and the particular glycosites affected by TPR truncations.

In summary, we report here the first dataset directly comparing the effects of overexpression of OGT TPR truncations on the O-GlcNAc proteome. We find that truncation of the OGT isoforms alters the substrate and glycosite selection in cells and identify that the first four TPRs afford the broadest substrate selection, while the last four TPRs mediate the glycosite selection. These profiles of the substrate and glycosite selectivity of the OGT TPR truncations will assist with understanding the biological roles of the OGT isoforms. Future studies of the OGT isoforms will further enhance our understanding of the regulatory relationship between O-GlcNAc and human disease, and eventually mechanisms to introduce O-GlcNAc to desired target proteins and glycosites to influence specific biological pathways.

Data availability

The MS data were deposited at the ProteomeXchange Consortium [57] via the PRIDE partner repository, and are available with the identifier PXD028141.

References

Wulff-Fuentes E, Berendt RR, Massman L, Danner L, Malard F, Vora J, Kahsay R, Olivier-Van Stichelen S. The human O-GlcNAcome database and meta-analysis. Sci Data. 2021;8:25. https://doi.org/10.1038/s41597-021-00810-4.
Article CAS PubMed PubMed Central Google Scholar
Joiner CM, Li H, Jiang J, Walker S. Structural characterization of the O-GlcNAc cycling enzymes: insights into substrate recognition and catalytic mechanisms. Curr Opin Struct Biol. 2019;56:97–106. https://doi.org/10.1016/j.sbi.2018.12.003.
Article CAS PubMed PubMed Central Google Scholar
Stephen HM, Adams TM, Wells L. Regulating the regulators: mechanisms of substrate selection of the O-GlcNAc cycling enzymes OGT and OGA. Glycobiology. 2021;1–5. https://doi.org/10.1093/glycob/cwab005.
Shafi R, Iyer SP, Ellies LG, O’Donnell N, Marek KW, Chui D, Hart GW, Marth JD. The O-GlcNAc transferase gene resides on the X chromosome and is essential for embryonic stem cell viability and mouse ontogeny. Proc Natl Acad Sci U S A. 2000;97:5735–9. https://doi.org/10.1073/pnas.100471497.
Article CAS PubMed PubMed Central Google Scholar
Yang YR, Song M, Lee H, Jeon Y, Choi EJ, Jang HJ, Moon HY, Byun HY, Kim EK, Kim DH, Lee MN, Koh A, Ghim J, Choi JH, Lee-Kwon W, Kim KT, Ryu SH, Suh PG. O-GlcNAcase is essential for embryonic development and maintenance of genomic stability. Aging Cell. 2012;11:439–48. https://doi.org/10.1111/j.1474-9726.2012.00801.x.
Article CAS Google Scholar
Pravata VM, Omelková M, Stavridis MP, Desbiens CM, Stephen HM, Lefeber DJ, Gecz J, Gundogdu M, Õunap K, Joss S, Schwartz CE, Wells L, van Aalten DMF. An intellectual disability syndrome with single-nucleotide variants in O-GlcNAc transferase. Eur J Hum Genet. 2020;28:706–14. https://doi.org/10.1038/s41431-020-0589-9.
Article CAS PubMed PubMed Central Google Scholar
Yuzwa SA, Vocadlo DJ. O-GlcNAc and neurodegeneration: biochemical mechanisms and potential roles in Alzheimer’s disease and beyond. Chem Soc Rev. 2014;43:6839–58. https://doi.org/10.1039/c4cs00038b.
Article CAS PubMed Google Scholar
Ma J, Hart GW. Protein O-GlcNAcylation in diabetes and diabetic complications. Expert Rev Proteomics. 2013;10:365–80. https://doi.org/10.1586/14789450.2013.820536.
Article CAS PubMed PubMed Central Google Scholar
Slawson C, Hart GW. O-GlcNAc signalling: implications for cancer cell biology. Nat Rev Cancer. 2011;11:678–84. https://doi.org/10.1038/nrc3114.
Article CAS PubMed PubMed Central Google Scholar
Levine ZG, Pottera SC, Joinerb CM, Feia GQ, Nabetc B, Sonnette M, Zacharaf NE, Grayc NS, Paulog JA, Walkera S (2021) Mammalian cell proliferation requires noncatalytic functions of O-GlcNAc transferase. Proc Natl Acad Sci U S A 118:2020.10.22.351288 . https://doi.org/10.1073/PNAS.2016778118
Joiner CM, Hammel FA, Janetzko J, Walker S. Protein substrates engage the lumen of O-GlcNAc transferase’s tetratricopeptide repeat domain in different ways. Biochemistry. 2021. https://doi.org/10.1021/acs.biochem.0c00981.
Article PubMed Google Scholar
Kositzke A, Fan D, Wang A, Li H, Worth M, Jiang J. Elucidating the protein substrate recognition of O-GlcNAc transferase (OGT) toward O-GlcNAcase (OGA) using a GlcNAc electrophilic probe. Int J Biol Macromol. 2021;169:51–9. https://doi.org/10.1016/j.ijbiomac.2020.12.078.
Article CAS PubMed Google Scholar
Kreppel LK, Hart GW. Regulation of a cytosolic and nuclear O-GlcNAc transferase. Role of the tetratricopeptide repeats. J Biol Chem. 1999;274:32015–22. https://doi.org/10.1074/jbc.274.45.32015.
Article CAS PubMed Google Scholar
Lubas WA, Hanover JA. Functional expression of O-linked GlcNAc transferase. J Biol Chem. 2000;275:10983–8.
Article CAS Google Scholar
Iyer SPN, Hart GW. Roles of the tetratricopeptide repeat domain in O-GlcNAc transferase targeting and protein substrate specificity. J Biol Chem. 2003;278:24608–16. https://doi.org/10.1074/jbc.M300036200.
Article CAS PubMed Google Scholar
Hanover JA, Yu S, Lubas WB, Shin SH, Ragano-Caracciola M, Kochran J, Love DC. Mitochondrial and nucleocytoplasmic isoforms of O-linked GlcNAc transferase encoded by a single mammalian gene. Arch Biochem Biophys. 2003;409:287–97. https://doi.org/10.1016/S0003-9861(02)00578-7.
Article CAS PubMed Google Scholar
Love DC. Mitochondrial and nucleocytoplasmic targeting of O-linked GlcNAc transferase. J Cell Sci. 2002;116:647–54. https://doi.org/10.1242/jcs.00246.
Article CAS Google Scholar
Lazarus BD, Love DC, Hanover JA (2006) Recombinant O-GlcNAc transferase isoforms : identification of O-GlcNAcase , yes tyrosine kinase, and tau as isoform-specific substrates. 16:415–421 . https://doi.org/10.1093/glycob/cwj078
Zhang Q, Liu X, Gao W, Li P, Hou J, Li J, Wong J. Differential regulation of the ten-eleven translocation (TET) family of dioxygenases by O-linked β-N-acetylglucosamine transferase (OGT). J Biol Chem. 2014;289:5986–96. https://doi.org/10.1074/jbc.M113.524140.
Article CAS PubMed PubMed Central Google Scholar
Martinez M, Renuse S, Kreimer S, O’Meally R, Natov P, Madugundu AK, Nirujogi RS, Tahir R, Cole R, Pandey A, Zachara NE. Quantitative proteomics reveals that the OGT interactome is remodeled in response to oxidative stress. Mol Cell Proteomics. 2021;20: 100069. https://doi.org/10.1016/j.mcpro.2021.100069.
Article CAS PubMed PubMed Central Google Scholar
Gao J, Yang Y, Qiu R, Zhang K, Teng X, Liu R, Wang Y. Proteomic analysis of the OGT interactome: novel links to epithelial-mesenchymal transition and metastasis of cervical cancer. Carcinogenesis. 2018;39:1222–34. https://doi.org/10.1093/carcin/bgy097.
Article CAS PubMed PubMed Central Google Scholar
Deng RP, He X, Guo SJ, Liu WF, Tao Y, Tao SC. Global identification of O-GlcNAc transferase (OGT) interactors by a human proteome microarray and the construction of an OGT interactome. Proteomics. 2014;14:1020–30. https://doi.org/10.1002/pmic.201300144.
Article CAS PubMed Google Scholar
Ortiz-Meoz RF, Merbl Y, Kirschner MW, Walker S. Microarray discovery of new OGT substrates: the medulloblastoma oncogene OTX2 is O-GlcNAcylated. J Am Chem Soc. 2014;136:4845–8. https://doi.org/10.1021/ja500451w.
Article CAS PubMed PubMed Central Google Scholar
Stephen HM, Praissman JL, Wells L. Generation of an interactome for the tetratricopeptide repeat domain of O-GlcNAc transferase indicates a role for the enzyme in intellectual disability. J Proteome Res. 2021;20:1229–42. https://doi.org/10.1021/acs.jproteome.0c00604.
Article CAS PubMed Google Scholar
Tseng TS, Swain SM, Olszewski NE. Ectopic expression of the tetratricopeptide repeat domain of SPINDLY causes defects in gibberellin response. Plant Physiol. 2001;126:1250–8. https://doi.org/10.1104/pp.126.3.1250.
Article CAS PubMed PubMed Central Google Scholar
Izhaki A, Swain SM, Tseng TS, Borochov A, Olszewski NE, Weiss D. The role of SPY and its TPR domain in the regulation of gibberellin action throughout the life cycle of Petunia hybrida plants. Plant J. 2001;28:181–90. https://doi.org/10.1046/j.1365-313X.2001.01144.x.
Article CAS PubMed Google Scholar
Vaidyanathan K, Niranjan T, Selvan N, Teo CF, May M, Patel S, Weatherly B, Skinner C, Opitz J, Carey J, Viskochil D, Gecz J, Shaw M, Peng Y, Alexov E, Wang T, Schwartz C, Wells L. Identification and characterization of a missense mutation in the O-linked β-N-acetylglucosamine (O-GlcNAc) transferase gene that segregates with X-linked intellectual disability. J Biol Chem. 2017;292:8948–63. https://doi.org/10.1074/jbc.M116.771030.
Article CAS PubMed PubMed Central Google Scholar
Selvan N, George S, Serajee FJ, Shaw M, Hobson L, Kalscheuer V, Prasad N, Levy SE, Taylor J, Aftimos S, Schwartz CE, Huq AM, Gecz J, Wells L. O-GlcNAc transferase missense mutations linked to X-linked intellectual disability deregulate genes involved in cell fate determination and signaling. J Biol Chem. 2018;293:10810–24. https://doi.org/10.1074/jbc.RA118.002583.
Article CAS PubMed PubMed Central Google Scholar
Willems AP, Gundogdu M, Kempers MJE, Giltay JC, Pfundt R, Elferink M, Loza BF, Fuijkschot J, Ferenbach AT, Van Gassen KLI, Van Aalten DMF, Lefeber DJ. Mutations in N-acetylglucosamine (O-GlcNAc) transferase in patients with X-linked intellectual disability. J Biol Chem. 2017;292. https://doi.org/10.1074/jbc.M117.790097.
Gundogdu M, Llabrés S, Gorelik A, Ferenbach AT, Zachariae U, van Aalten DMF. The O-GlcNAc transferase intellectual disability mutation L254F distorts the TPR helix. Cell Chem Biol. 2018;1–6. https://doi.org/10.1016/j.chembiol.2018.03.004.
Liu L, Li L, Ma C, Shi Y, Liu C, Xiao Z, Zhang Y, Tian F, Gao Y, Zhang J, Ying W, Wang PG, Zhang L. O-GlcNAcylation of Thr12/Ser56 in short-form O-GlcNAc transferase (sOGT) regulates its substrate selectivity. J Biol Chem. 2019;294:16620–33. https://doi.org/10.1074/jbc.RA119.009085.
Article CAS PubMed PubMed Central Google Scholar
Feng Z, Hui Y, Ling L, Xiaoyan L, Yuqiu W, Peng W, Lianwen Z. FBXW10 is negatively regulated in transcription and expression level by protein O-GlcNAcylation. Biochem Biophys Res Commun. 2013;438:427–32. https://doi.org/10.1016/j.bbrc.2013.07.091.
Article CAS PubMed Google Scholar
Liu Y, Li X, Yu Y, Shi J, Liang Z, Run X, Li Y, Dai C, ling, Grundke-Iqbal I, Iqbal K, Liu F, Gong CX, . Developmental regulation of protein O-GlcNAcylation, O-GlcNAc transferase, and O-GlcNAcase in mammalian brain. PLoS ONE. 2012;7:1–13. https://doi.org/10.1371/journal.pone.0043724.
Article CAS Google Scholar
Konzman D, Abramowitz LK, Steenackers A, Mukherjee MM, Na H-J, Hanover JA. O-GlcNAc: Regulator of signaling and epigenetics linked to X-linked intellectual disability. Front Genet. 2020;11. https://doi.org/10.3389/fgene.2020.605263.
Ramirez DH, Aonbangkhen C, Wu HY, Naftaly JA, Tang S, O’Meara TR, Woo CM. Engineering a proximity-directed O-GlcNAc transferase for selective protein O-GlcNAcylation in cells. ACS Chem Biol. 2020;15:1059–66. https://doi.org/10.1021/acschembio.0c00074.
Article CAS PubMed PubMed Central Google Scholar
Ge Y, Ramirez DH, Yang B, D’Souza AK, Aonbangkhen C, Wong S, Woo CM. Target protein deglycosylation in living cells by a nanobody-fused split O-GlcNAcase. Nat Chem Biol. 2021. https://doi.org/10.1038/s41589-021-00757-y.
Ramirez DH, Ge Y, Woo CM. O-GlcNAc Engineering on a target protein in cells with nanobody-OGT and nanobody-splitOGA. Curr Protoc. 2021;1:1–28. https://doi.org/10.1002/cpz1.117.
Article CAS Google Scholar
Darabedian N, Yang B, Ding R, Cutolo G, Zaro BW, Woo CM, Pratt MR. O-Acetylated chemical reporters of glycosylation can display metabolism-dependent background labeling of proteins but are generally reliable tools for the identification of glycoproteins. Front Chem. 2020;8:1–14. https://doi.org/10.3389/fchem.2020.00318.
Article CAS Google Scholar
Woo CM, Lund PJ, Huang AC, Davis MM, Bertozzi CR, Pitteri SJ. Mapping and quantification of over 2000 O-linked glycopeptides in activated human T cells with isotope-targeted glycoproteomics (Isotag). Mol Cell Proteomics. 2018;17:764–75. https://doi.org/10.1074/mcp.RA117.000261.
Article CAS PubMed PubMed Central Google Scholar
Huang DW, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009;37:1–13. https://doi.org/10.1093/nar/gkn923.
Article CAS Google Scholar
Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, Simonovic M, Doncheva NT, Morris JH, Bork P, Jensen LJ, Von Mering C. STRING v11: Protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019;47:D607–13. https://doi.org/10.1093/nar/gky1131.
Article CAS Google Scholar
März P, Stetefeld J, Bendfeldt K, Nitsch C, Reinstein J, Shoeman RL, Dimitriades-Schmutz B, Schwager M, Leiser D, Özcan S, Otten U, Özbek S. Ataxin-10 interacts with O-linked β-N-acetylglucosamine transferase in the brain. J Biol Chem. 2006;281:20263–70. https://doi.org/10.1074/jbc.M601563200.
Article CAS PubMed Google Scholar
Pathak S, Alonso J, Schimpl M, Rafie K, Blair DE, Borodkin VS, Schüttelkopf AW, Albarbarawi O, Van Aalten DMF. The active site of O-GlcNAc transferase imposes constraints on substrate sequence. Nat Struct Mol Biol. 2015;22:744–9. https://doi.org/10.1038/nsmb.3063.
Article CAS PubMed PubMed Central Google Scholar
Joiner CM, Levine ZG, Aonbangkhen C, Woo CM, Walker S. Aspartate residues far from the active site drive O-GlcNAc transferase substrate selection. J Am Chem Soc. 2019;141:12974–8. https://doi.org/10.1021/jacs.9b06061.
Article CAS PubMed PubMed Central Google Scholar
Hornbeck PV, Kornhauser JM, Latham V, Murray B, Nandhikonda V, Nord A, Skrzypek E, Wheeler T, Zhang B, Gnad F. 15 years of PhosphoSitePlus ® : integrating post-translationally modified sites, disease variants and isoforms. Nucleic Acids Res. 2019;47:D433–41. https://doi.org/10.1093/nar/gky1159.
Article CAS PubMed Google Scholar
Narayanan K, Srinivas R, Peterson MC, Ramachandran A, Hao J, Thimmapaya B, Scherer PE, George A. Transcriptional regulation, of dentin matrix protein 1 by JunB and p300 during osteoblast differentiation. J Biol Chem. 2004;279:44294–302. https://doi.org/10.1074/jbc.M403511200.
Article CAS PubMed Google Scholar
Pérez-Benavente B, García JL, Rodríguez MS, Pineda-Lucena A, Piechaczyk M, Font De Mora J, Farràs R. GSK3-SCF FBXW7 targets JunB for degradation in G2 to preserve chromatid cohesion before anaphase. Oncogene. 2013;32:2189–99. https://doi.org/10.1038/onc.2012.235.
Article CAS PubMed Google Scholar
Sacoman JL, Dagda RY, Burnham-Marusich AR, Dagda RK, Berninsone PM. Mitochondrial O-GlcNAc transferase (mOGT) regulates mitochondrial structure, function, and survival in HeLa cells. J Biol Chem. 2017;292:4499–518. https://doi.org/10.1074/jbc.M116.726752.
Article CAS PubMed PubMed Central Google Scholar
Tan ZW, Fei G, Paulo JA, Bellaousov S, Martin SES, Duveau DY, Thomas CJ, Gygi SP, Boutz PL, Walker S. O-GlcNAc regulates gene expression by controlling detained intron splicing. Nucleic Acids Res. 2020;48:5656–69. https://doi.org/10.1093/nar/gkaa263.
Article CAS PubMed PubMed Central Google Scholar
Park SK, Zhou X, Pendleton KE, Hunter OV, Kohler JJ, O’Donnell KA, Conrad NK. A conserved splicing silencer dynamically regulates O-GlcNAc transferase intron retention and O-GlcNAc homeostasis. Cell Rep. 2017;20:1088–99. https://doi.org/10.1016/j.celrep.2017.07.017.
Article CAS PubMed PubMed Central Google Scholar
Bi Y, Deng Z, Ni W, Shrestha R, Savage D, Hartwig T, Patil S, Hong SH, Zhang Z, Oses-Prieto JA, Li KH, Quail PH, Burlingame AL, Xu S, Wang Z. Arabidopsis ACINUS is O-glycosylated and regulates transcription and alternative splicing of regulators of reproductive transitions. Nat Commun. 2021;12:945. https://doi.org/10.1038/s41467-021-20929-7.
Article CAS PubMed PubMed Central Google Scholar
Labokha AA, Gradmann S, Frey S, Hülsmann BB, Urlaub H, Baldus M, Görlich D. Systematic analysis of barrier-forming FG hydrogels from Xenopus nuclear pore complexes. EMBO J. 2013;32:204–18. https://doi.org/10.1038/emboj.2012.302.
Article CAS PubMed Google Scholar
Yoo TY, Mitchison TJ. O -GlcNAc modification of nuclear pore complexes accelerates bidirectional transport. J Cell Biol. 2021;220. https://doi.org/10.1083/jcb.202010141.
Zhu Y, Liu TW, Madden Z, Yuzwa SA, Murray K, Cecioni S, Zachara N, Vocadlo DJ. Post-translational O-GlcNAcylation is essential for nuclear pore integrity and maintenance of the pore selectivity filter. J Mol Cell Biol. 2016;8:2–16. https://doi.org/10.1093/jmcb/mjv033.
Article CAS PubMed Google Scholar
Ruba A, Yang W. O-GlcNAc-ylation in the nuclear pore complex. Cell Mol Bioeng. 2016;9:227–33. https://doi.org/10.1007/s12195-016-0440-0.
Article CAS PubMed Google Scholar
Li B, Kohler JJ (2014) Glycosylation of the nuclear pore. Traffic
Vizcaíno JA, Deutsch EW, Wang R, Csordas A, Reisinger F, Ríos D, Dianes JA, Sun Z, Farrah T, Bandeira N, Binz PA, Xenarios I, Eisenacher M, Mayer G, Gatto L, Campos A, Chalkley RJ, Kraus HJ, Albar JP, Martinez-Bartolomé S, Apweiler R, Omenn GS, Martens L, Jones AR, Hermjakob H. ProteomeXchange provides globally coordinated proteomics data submission and dissemination. Nat Biotechnol. 2014;32:223–6. https://doi.org/10.1038/nbt.2839.
Article CAS PubMed PubMed Central Google Scholar

Download references

Funding

Support from the National Institutes of Health (U01CA242098-01, C.M. Woo), the Sloan Foundation (C.M. Woo), the Camille and Henry Dreyfus Foundation (C.M. Woo), and Harvard University is gratefully acknowledged.

Author information

Daniel H. Ramirez
Present address: Department of Plant Biology, Carnegie Institution for Science, Stanford, CA, USA

Authors and Affiliations

Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA
Daniel H. Ramirez, Bo Yang, Alexandria K. D’Souza, Dacheng Shen & Christina M. Woo

Authors

Daniel H. Ramirez
View author publications
You can also search for this author in PubMed Google Scholar
Bo Yang
View author publications
You can also search for this author in PubMed Google Scholar
Alexandria K. D’Souza
View author publications
You can also search for this author in PubMed Google Scholar
Dacheng Shen
View author publications
You can also search for this author in PubMed Google Scholar
Christina M. Woo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.H. Ramirez and C.M. Woo conceived the project and designed the experiments. D.H. Ramirez performed the experiments. B. Yang and A.K. D’Souza helped with quantitative MS analyses. D. Shen contributed IsoTag reagents. D.H. Ramirez and C.M. Woo analyzed the data and wrote the paper with input from all authors.

Corresponding author

Correspondence to Christina M. Woo.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

ABC Highlights: authored by Rising Stars and Top Experts.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (XLSX 411 KB)

Supplementary file2 (XLSX 573 KB)

Supplementary file3 (XLSX 44 KB)

Supplementary file4 (XLSX 4972 KB)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ramirez, D.H., Yang, B., D’Souza, A.K. et al. Truncation of the TPR domain of OGT alters substrate and glycosite selection. Anal Bioanal Chem 413, 7385–7399 (2021). https://doi.org/10.1007/s00216-021-03731-8

Download citation

Received: 04 September 2021
Revised: 25 September 2021
Accepted: 11 October 2021
Published: 02 November 2021
Issue Date: December 2021
DOI: https://doi.org/10.1007/s00216-021-03731-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Truncation of the TPR domain of OGT alters substrate and glycosite selection

Abstract

Graphical abstract

Similar content being viewed by others

Introduction

Methods

Cell culture, transfection, immunofluorescence, and cell lysate collection

Western blotting

Molecular cloning

Chemical enrichment of glycoproteins and sample preparation for IsoTag

Mass spectrometry parameters used for glycoproteomics and data analysis

Gene ontology analysis

OGT consensus sequence generation

Results

OGT TPR truncations exhibit altered localization, but most remain active in cells

Overexpression of OGT isoforms enhances glycosylation of proteins associated with mRNA processing

RFP(13) has the broadest substrate scope of all of the OGT isoforms

Overexpression of TPR truncations shows an altered consensus sequence at the glycosite

Label-free quantification of GFP-JunB indicates increased glycosite abundance with RFP(13) expression

Discussion

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation