Abstract
Main conclusion
74 phytocyanin genes were identified in the Populus trichocarpa genome. Phylogenetic analysis grouped the PC proteins into four subfamilies (UCs, PLCs, SCs, and ENODLs). Closely related PC proteins share similar motifs, implying similar functions. Expression profiles of PtPC genes were analyzed in response to drought and salt-stress.
Phytocyanins (PCs) are blue copper proteins associated with electron carrier activity that have a large influence on plant growth and resistance. The majority of PCs are chimeric arabinogalactan proteins (AGPs). In this work, we identified 74 PC genes in Populus trichocarpa and analyzed them comprehensively. Based on the ligands composition of copper-binding sites, glycosylation state, the domain structure and spectral characteristics of PC genes, PCs were divided into four subfamilies [uclacyanins (UCs), plantacyanins (PLCs), stellacyanins (SCs) and early nodulin-like proteins (ENODLs)], and phylogenetic relationship analysis classified them into seven groups. All PtPCs are randomly distributed on 17 of the 19 poplar chromosomes, and they appear to have undergone expansion via segmental duplication. Eight PtPCs do not contain introns, and each group has a similar conserved motif structure. Promoter analysis revealed cis-elements related to growth, development and stress responses, and established orthology relationships of PCs between Arabidopsis and poplar by synteny analysis. Expression profile analysis and qRT-PCR analysis showed that PtPCs were expressed widely in various tissues. Quantitative real-time RT-PCR analysis of PC genes expression in response to salt and drought stress revealed their stress-responses profiles. This work provides a theoretical basis for a further study of stress resistance mechanisms and the function of PC genes in poplar growth and development.
We’re sorry, something doesn't seem to be working properly.
Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.
Avoid common mistakes on your manuscript.
Introduction
Blue copper proteins containing a single type I mononuclear copper site are known as phytocyanins (PCs) in plants which they are associated with electron carrier activity (Giri et al. 2004). The phytocyanin domain has a core β-sandwich comprising seven β-strands, and a disulfide bridge closing the metal centre is a characteristic feature (Hart et al. 1996). Based on the ligand composition of the copper-binding site, the glycosylation state, the domain structure, and the spectral characteristics, PCs can be grouped into four subfamilies: uclacyanins (UCs), plantacyanins (PLCs), stellacyanins (SCs) and early nodulin-like proteins (ENODLs) (Cao et al. 2015).
Residues ligating the copper ion in SCs consist of two His, one Cys, and one Gln, whereas PLCs and UCs also have two His and one Cys, but the Gln is replaced by a Met (Nersissian et al. 1998). Although PLCs have the same four conserved residues as UCs, they lack putative glycosylation sites on the backbone (Nersissian et al. 1998). Interestingly, early nodulins (ENODLs) might be involved in Cu-independent processes since they lack key copper-binding residues (Nersissian et al. 1998). PC genes containing AG (arabinogalactan) glycomodules and signal peptides (SPs) are believed to be members of the arabinogalactan proteins (AGP) superfamily (Mashiguchi et al. 2009).
PCs exert an important part in growth and development of plants, in addition to the influence of their spectroscopic and redox properties. OsUCL29 and ZmUC22 are significantly expressed under various stresses. BrUCL16 and ZmUC19 are specifically expressed in the stem and silique, respectively, which show that the first of the four PC subfamilies (UCs) appear to function in the evolution of polyploid plants (Ma et al. 2011; Li et al. 2013; Cao et al. 2015). Six SCs are induced by oxidative stress and Al toxicity in Arabidopsis (Ezaki et al. 2001, 2005). PeSCL1 and PeSCL3 are strongly expressed in the stem and roots of Phalaenopsis equestris (Xu et al. 2017). The PLC subfamily participates in the growth processes of several specific plants, including pollination through S-Rnase binding in tobacco (Cruz-Garcia et al. 2005). AtPLCs were also regarded as miR408, the member of the microRNAs targets, which were related to plant growth as transcription factors (Sunkar and Zhu 2004). Dong et al. (2005) found that AtPLCs were strongly expressed in the pistil to prevent pollination and destroy the endothecium structure, thereby influencing the growth of the anther in promoter-β-glucuronidase transgenic plants, as was also shown by an immunohistochemical analysis of wild-type pistil tissues. The last subfamily (ENODLs) is related to many aspects of plant development and important function on transport nutrients, solutes, amino acids or hormones, and improves the fitness to pathogens during host colonization at the plant–microbe interface activities (Denancé et al. 2014). Additionally, ENODLs have high expression levels in the inflorescence of some plants, such as AtENODL3/4 (Mashiguchi et al. 2009), BrENODL22/27 (Li et al. 2013) and PeENODL5/7 (Xu et al. 2017). ENODLs are also relevant for the defence responses of plants. Mashiguchi et al. (2009) found that AtENODL2/18 were induced by osmotic and salt stress. Using hybridisation analysis, Yoshizaki et al. (2000) demonstrated that ENODLs were specifically expressed in tissues including apical buds in Pharbitis nil and root nodules of legumes, and the expression of these genes was distinctly down-regulated during floral induction. Furthermore, these genes might be involved in the organ differentiation of plants (Yoshizaki et al. 2000). AtFLA3, another type of chimeric AGP, is involved in microspore development and the formation of the pollen intine via the deposition of cellulose. Overexpression of this gene restricted the progress of pollination to lower the rate of seed production (Li et al. 2010). AtENODL14 localized at the synergid cell surface strongly and specifically interacts with the extracellular domain of the receptor-like kinase FERONIA, which could minutely control the reception of pollen tube (Escobar-Restrepo et al. 2007). Wild-type pollen tubes can not prevent growth and cause rupture after entering the ovules of quintuple ENODL mutants which loss the function, implying the core function ENODLs being in male–female communication and pollen tube reception (Hou et al. 2016). Furthermore, the overexpression of AtENODL15 by the endogenous promoter results in disturbed pollen tube guidance and reduced fertility (Hou et al. 2016).
To date, the features and functions of the PC gene family have been identified and investigated in several plant species, including Arabidopsis, rice, Chinese cabbage, maize, and orchid. However, no comprehensive analyses of the PC gene family in poplar have been conducted. In the present study, we identified 74 probable PtPC genes in Populus and performed comprehensive phylogenetic, structural, promoter, gene expansion and microsynteny analyses. Additionally, we selected 18 PtPCs to investigate their behaviour under drought and salt treatments. The results provide valuable information about their biological functions and stress responses. Furthermore, analysis of tissue-specific expression of the PtPC genes during development showed differences in their spatiotemporal expression patterns, and many were expressed at high levels in roots and xylem. Genome-wide analysis of the PC genes in Populus trichocarpa will facilitate a better understanding of the role of this gene family during poplar growth and development.
Materials and methods
Identification of PC family genes in poplar
The sequences of previously identified PC genes in Arabidopsis were downloaded from the NCBI database (http://www.ncbi.nlm.nih.gov/). We performed BLASTP searches (E value < 1e−6) with the Arabidopsis PC proteins as queries to identify PtPCs in the Phytozome database (http://www.phytozome.net, PF02298). Each protein sequence identified by BLASTP was checked for the existence of a plastocyanin-like domain (PCLD) to confirm membership of the PC gene family. The signal peptide (SP), glycosylphosphatidylinositol (GPI) anchor signal, and N-glycosylation sites of PtPCs were predicted by the SignalP 4.1 server (Petersen et al. 2011; Jeßberger et al. 2015), Big-PI Plant Predictor (Eisenhaber et al. 2003) and the NetNGlyc 1.0 server (http://www.cbs.dtu.dk/services/NetNGlyc/), respectively. Potential arabinogalactan glycomodules (AGs) were predicted based on previously reported criteria (Mashiguchi et al. 2004). The subcellular localization of all PCs was predicted using the CELLOv2.5 server (http://cello.life.nctu.edu.tw/).
Phylogenetic analysis
Protein sequences and alignments were analyzed using the DNAMAN program, and a phylogenetic tree was constructed with default parameters using the neighbour-joining (NJ) method in MEGA6.0 with 1000 bootstrap replicates (Hu et al. 2010; Tamura et al. 2013).
Analysis of exons/introns, conserved motifs, and chromosomal location
We analyzed the exon/intron structure of PC genes by comparing the coding DNA sequence (CDS) and the corresponding genomic DNA sequence using the online GSDS server (http://gsds.cbi.pku.edu.cn/). Conserved motifs were predicted using the online MEME program (http://meme.nbcr.net/meme/cgi-bin/meme.cgi). An image of the chromosomal location was constructed using MapInspect software based on the initial positional information provided in the Phytozome database.
Analysis of microsynteny and gene duplication
A syntenic block is defined as an area in which exons/introns in orthologs are located within 15 genes upstream or downstream in both genomes (Wang et al. 2015). Syntenic blocks within the PtPC gene family among chromosomes were acquired from the PGDD database (http://chibba.agtec.uga.edu/duplication). Microsynteny analysis was performed using MicroSyn software, and the online OrthoMCL program (http://orthomcl.org/orthomcl/) was used to analyze duplicated genes.
Evaluation of K a/K s values
Ka (number of synonymous substitutions per synonymous site)/Ks (number of non-synonymous substitutions per non-synonymous site) ratios were evaluated using the DnaSP software. Sliding window analysis of Ka per nonsynonymous locus Ka/Ks ratio was also performed with a window size of 150 bp and a step size of 9 bp. Divergence time (T) was estimated by T = Ks/(2 × 9.1 × 10−9) × 10−6 million years ago (Mya).
Promoter and microarray analysis
The 2000 bp upstream sequences of the PtPCs promoter regions were downloaded from the Phytozome database and used to identify the putative cis-elements in PlantCARE (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) (Goodstein et al. 2011). To better understand the expression levels of PtPCs, the GSE13990 array data from poplar expression profiling was downloaded from the Gene Expression Omnibus (GEO) database at the NCBI (https://www.ncbi.nlm.nih.gov/). The corresponding probes for PC genes were identified using the online ProbeMatch tool available at the NetAffx Analysis Center (http://www.affymetrix.com/analysis/index.affx). The final gene expression data was identified using the corresponding PC-Probe and the GSE13990 data.
Plant materials and stress treatments
The 8-week-old seedlings of P. trichocarpa (Torr. & Gray) used in all experiments were cultivated in the Tissue Culture Lab. Plants were treated with 20% PEG (polyethyleneglycol) and 200 mM NaCl for drought and salt stress, respectively. Untreated plants were used as controls. Leaves were collected for RNA extraction at four time points (4, 8, 12 and 24 h) after treatment.
RNA isolation and qRT-PCR analysis after different stress treatments
TRIzol reagent was used to extract RNA from young poplar leaves under different stress treatments and in different organs, and first-strand cDNAs were synthesized. Primer Premier 5.0 and the NCBI primer BLAST tool were used to design and check primers for amplifying PtPC genes (Table S1). The poplar housekeeping gene encoding ubiquitin (UBQ, gene ID no. Potri.001G418500) was used as an internal control for normalizing experimental expression profile data (Hui et al. 2014). qRT-PCR was performed in a 20-μl volume, including 10 μl of 2 × SYBR® Premix Ex Taq™ (TaKaRa, Otsu, Japan), 0.4 μl of 50 × ROX Reference Dye, 2 μl diluted cDNA template, 0.8 μl of each specific primer, and 6 μl ddH2O. The qPCR reaction conditions were as follows: 95 °C for 30 s, followed by 40 thermal cycles of denaturation at 95 °C for 5 s, and annealing at 55–60 °C for 34 s. The relative expression levels were calculated using the ΔΔCT method. It is noteworthy that relative gene expression [2−ΔΔCT, CK (0 h)] for each gene in the control plants was normalized to 1 as described previously for stress treatments (Schmittgen and Livak 2008). GraphPad software was used for statistical analysis, and three biological and technical replicates were performed for each sample.
Results
Identification of PC family genes in poplar
We identified 77 potential PC protein sequences in poplar, of which three were paired with identical sequences, resulting in 74 putative PtPCs (Table 1). Based on multiple sequence alignments (Fig. 1), PtPCs were classified into three subfamilies: uclacyanin-like proteins (PtUCs, 7), stellacyanin-like proteins (PtSCs, 19) and plantacyanins (PtPLCs, 3) based on the predicted copper-binding ligands (His, Cys, His and Met/Gln). The remaining 45 PtPCs belonged to the ENODL family based on the modified copper-binding residues. Moreover, the prediction of subcellular localization indicated that 93.2% of PtPCs were associated with the plasma membrane or were extracellular (Table 1), and only five PCs (PtENODL9, PtENODL27, PtUC6, PtSC8 and PtSC18) were localized in the nucleus.
Phylogenetic and structural analysis of PtPCs
To better understand their structure and function, we predicted N-terminal signal peptides (SPs), glycosylphosphatidylinositol (GPI) anchor signals (GASs), AG glycomodules and N-glycosylation (Fig. S1; Table 1). The results revealed that 60 PtPCs had a predicted SP involved in targeting to the endoplasmic reticulum. Additionally, 49 PtPCs had GASs related to the localization at the plasma membrane. About 58.1% of poplar genes contained hypothetical AG glycomodules in the PAST-rich region (Pro, Ala, Ser, Thr). There were 60 PtPCs with putative N-glycosylation sites in the PCLD and PAST-rich region.
According to the predicted domain structures, PtPCs were divided into six types (Fig. 2): Type I includes a N-terminal SP, a PCLD, an arabinogalactan-like region (ALR), and a C-terminal GAS; by contrast, GAS is absent from type II; type III resembles type I but lacks ALR; type IV only has SP and PCLD; type V lacks SP compared with type III; type VI only contains PCLD.
To investigate the evolutionary relationships of PC genes in Arabidopsis, Zea mays and poplar, we constructed a phylogenetic tree of AtPC, ZmPC and PtPC protein sequences. The 38 AtPCs, 60 ZmPCs and 74 PtPCs were divided into seven clades, consistent with previous Arabidopsis, rice and Phalaenopsis equestris studies (Fig. 3). The results indicated that clade VII had the most number of PC gene members (61), while clade VI contains the fewest PCs (12). All members of clade VI and clades VII entirely belong to the same subfamily, put it another way, the clade VI and the clades VII wholly contain SCs and ENODLs, respectively. Moreover, except for ZmUC18/26 and PtUC7, all PCs in the clade IV belonged to the ENODL subfamily.
Gene structure and conserved motifs
It was well known that gene structural diversity resulted from the evolution of multi-gene families. The exon/intron structures of PtPCs are shown in Fig. S2. The gene structures of PtPCs are not complex and most include two introns. Like PtSC8, PtENODL1, 3, 34, 35, and 38, have only one intron, and eight PtPCs contain no introns. In addition to exon/intron pattern, conserved motifs could also be important for the diversified functions of PCs. We identified 10 different conserved motifs (Fig. 4), suggesting that members of the same subfamily shared a similar motif structure. The details of each conserved motif are shown in Table S2. Motif 1, motif 2, motif 6 and motif 10 are related to intermolecular electron transfer reactions, while others have no functional annotation. Most PtPCs include motif 1, 3, and 6, in the order 1-6-3. Motif 1 is the most common motif, present in all PtPC genes. Motif 7, 9 and 10 are only present in PtENODLs. PtUC1 differs from other PtUCs by lacking motif 6, and all PtSCs have motif 4 apart from PtSC7.
Chromosomal location, gene duplication and conserved microsynteny
Based on the chromosomal location map, 74 PtPCs were randomly distributed on the 17 poplar chromosomes (Fig. 5). Chromosome 1 has the largest number of PC genes (14), while chromosomes 8 and 19 only have one PtPC gene, and chromosomes 4, 11, 13, 15, 16 and 17 only contain ENODL subfamily genes.
Many gene families in plants appear to be generated by expansion through segmental or tandem duplication. To better comprehend the evolution of poplar PC genes, we investigated genome duplication events in this family. Duplicated genes were confirmed using the Vista Synteny browser (http://pipeline.lbl.gov/cgi-bin/gateway2), which indicated that 11 pairs of genes originated from segmental duplication (Table S3). Neighbouring genes were also analyzed to determine whether tandem duplication had occurred (Ye et al. 2009), but there was no evidence of this phenomenon in PtPCs.
To examine the evolutionary selection process, we calculated Ka/Ks ratios of 10 pairs of PtPC paralogs (Table 2). Almost all Ka/Ks ratios were < 0.5, but the PtSC8/PtSC17 pair was > 1. Moreover, the duplication events of the 10 gene pairs were estimated to have occurred between 10.20 and 55.80 million years ago (Mashiguchi et al. 2004), based on sliding-window analysis of Ka/Ks ratios for all PC paralog pairs (Fig. S3).
To probe the relationships between homologous genes, we investigated the microsynteny between poplar and Arabidopsis sequences (Fig. S4). Pairwise comparison of flanking genes in chromosomal regions containing PC genes revealed three or more pairs displaying conserved microsynteny. Analysis of intraspecies microsynteny identified 23 collinear gene pairs in poplar (Fig. S5). Microsynteny analysis was also performed to evaluate the relationship between paralogous and orthologous PC genes, and 15 orthologous gene pairs with two-for-one microsynteny were identified between Populus and Arabidopsis sequences, but only six pairs were identified with one-for-one microsynteny, including AtENODL20-PtENODL5, and At1g22480-PtUC3/AtENODL8-PtENODL2.
Identification of cis-regulatory elements in the promoters of PtPCs
To investigate gene function and regulation, we analyzed cis-regulatory elements in the promoters of PtPCs (Table S4). In previous studies, we found that cis-regulatory elements play a pivotal role in controlling physical and reproductive growth, phytohormone responses, and abiotic and biotic stress responses. As shown in Fig. 6, endosperm expression elements (Skn-1_motif and GCN4_motifs) were found in 17 PtPCs and 6 PtPCs, respectively (Washida et al. 1999). The Skn-1_motif was the most abundant cis-element in the promoter sequences of 17 PtPCs, and the Skn-1_motif was the most populous in PtSC6. The CAT-box, RY-element and CCGTCC-box are present in two PtPCs, but MBSII, HD-Zip1 and HD-Zip2 are only present in one PtPC gene (Sessa et al. 1993; Bobb et al. 1997). We also identified the circadian control element and the O2 site, which were involved in zein metabolism regulation, in the promoters of PtPCs, and ten cis-elements related to phytohormone responses were detected (Anderson et al. 1994). The CGTCA and TGACG motifs that are related to MeJA responsiveness were found in seven PtPCs (Nejad et al. 2012). The TCA element, ABRE, ERE and TGA element were found in the promoters of 13, 8, 6 and 3 PtPCs, respectively (Goldsbrough et al. 1993; Shen and Ho 1995). Other elements, including the AuxRR core, TATC-box, GARE motif and P-box were also observed in 1, 2, 3 and 4 PtPCs, respectively (Washida et al. 1999). In addition, we also identified some cis-elements related to abiotic and biotic stresses, including HSE and TC-rich repeats in 14 PtPCs, but the WUN motif was only present in PtSC6.
Expression of PtPC genes
To reveal the different evolutionary fates of duplicated genes, we analyzed the expression patterns of PtPCs in six different tissues of poplar: young leaves, roots, xylem, female catkin, male catkin, and mature leaves (Fig. S6; Table S5). The heatmap showed that 11 PtPCs (PtPLC1, PtENODL43, PtENODL7, PtSC6, PtUC2, PtENODL45, P9tSC, PtENODL41, PtSC10, PtENODL25, PtUC1) were highly expressed in six organs. Notably, PtPLC1 and PtUC4 have relatively high expression levels only in roots and xylem, respectively.
Examination of PC gene expression by qRT-PCR
To confirm the expression of PtPCs in response to stresses, we used qRT-PCR to analyze relative expressions of 18 PtPCs under drought and salt treatments (Figs. 7, 8). The results of drought treatments showed that three PtPCs (PtSC9, PtENODL24 and PtENODL41) were highly expressed at all four-time points measured, and some PtPCs were up-regulated at particular time points. For instance, PtPLC1 was only up-regulated at 4 h after treatment. PtUC7, PtSC13 and PtSC17 were distinctly up-regulated at 24 h. By contrast, expression of PtENODL9 remained low at all time points, while PtENODL45 was observably down-regulated at 24 h after treatment.
Subsequently, we analyzed the expression patterns of PtPCs under salt treatments. The expression levels of PtEDNOL9 remained low at all time points, but PtPLC1 was down-regulated at 4 h yet up-regulated at other time points. Expression of PtENODL24 and PtENODL41 obviously showed a high expression under both, drought and salt treatment, at all time points, and expression of three PtPCs (PtUC7, PtSC13 and PtSC17) was significantly increased at 24 h after salt treatment. Additionally, PtUC2 was distinctly up-regulated at all time points except at 12 h. PtUC7, PtSC13 and PtSC17 had the similar expression after both drought and salt treatments at all time points.
To predict possible functions of PtPC genes in organ development, we also performed qRT-PCR analyses to examine the relative expression of the 18 PtPCs in five organs, including young leaves (YL), roots (RT), xylem (XY), mature leaves (ML), and phloem (PH) (Fig. 9; Table S6). As shown in Fig. 9, PtPCs showed the highest mRNA accumulation in all the tissues, including three in the roots (PtUC1, PtENODL45, PtENODL41) and young leaves (PtENODL24, PtENODL9, PtSC6), four in the mature leaves (PtENODL24, PtSC17, PtENODL7, PtUC7) and the phloem (PtENODL9, PtENODL25, PtENODL33, PtENODL43), five in the xylem (PtUC2, PtUC4, PtSC10, PtSC13, PtPLC1). However, most of the remaining genes had intensively different expression patterns. For example, PtENODL45 had low expression levels in mature leaves. It is worth noting that PtSC17 and PtENODL24 were both up-regulated in the mature leaves while they were expressed at a relatively low level in young leaves and xylem, respectively.
Discussion
Genome-wide analysis of PC genes in plants has been performed previously in various species including Arabidopsis thaliana (Mashiguchi et al. 2009), Oryza sativa (Ma et al. 2011), Brassica rapa (Li et al. 2013), Zea mays (Cao et al. 2015) and Phalaenopsis equestris (Xu et al. 2017). All of these studies indicated a role in plant growth and development (Fedorova et al. 2002; Ozturk et al. 2002; Diab et al. 2004; Ma and Jie 2010; Wu et al. 2011), but PC genes have not been investigated extensively in poplar. In the present study, we identified 74 PtPCs and grouped them into seven clades based on phylogenetic tree analysis. The phylogenetic tree showed that most genes of each subfamily in poplar grouped tightly with these genes of subfamily in Arabidopsis and Zea mays, which is consistent with the fact that maize, Arabidopsis and poplar diverged from a common ancestor before the divergence of monocot and dicot lineages. AGPs belong to a subfamily of hydroxyproline-rich glycoproteins (HRGPs), and are involved in the growth and development of the plant, such as stem strength, somatic embryogenesis in cotton, cell culture and extracellular signals transduction (Tan et al. 2004; Seifert and Roberts 2007; MacMillan et al. 2010; Poon et al. 2013; Ma et al. 2017). Several bioinformatics studies were used to identify the AGP gene family in plants. For instance, the PAST amino acid bias was calculated for AGPs in Arabidopsis as well rice (Schultz et al. 2002; Ma and Zhao 2010), the well-designed BIO OHIO program was employed for HRGPs in Arabidopsis and poplar (Showalter et al. 2010, 2016), a Python script named Finding-AGP was utilized in 47 plant specie (Ma et al. 2017), MAAB bioinformatics pipeline was conducted to classify HRGPs (Johnson et al. 2017), and BLAST searches were performed for AtENODLs, OsPCs and PePCs (Mashiguchi et al. 2009; Ma et al. 2011; Xu et al. 2017). Through these methods, the classical AGPs, lysine-rich AGPs, AG peptides, fasciclin-like AGPs, plastocyanin AGPs and other chimeric AGPs have been identified in Arabidopsis, rice and poplar. Chimeric AGPs were determined if the protein sequences contained at least one arabinogalactosylated domain and a domain with an unrelated motif (Schultz et al. 2002). In our study, 43 PtPCs contain AG glycomodules and SPs, including 27 PtENODLs, five PtUCs, nine PtSCs and two PtPLCs that might belong to the chimeric AGP group. Showalter et al. (2016) have identified 39 PtPAGs in poplar by BIO OHIO 2.0 bioinformatics program, and all PtPAGs belonged to the AGPs we identified in the present study except two PtPAGs (PtPAG25 and PtPAG27). Moreover, eight additional PtPCs (PtENODL2/7/8/12/19/21/25/31) might be the member of a AGP superfamily due to the presence of putative arabinogalactan glycomodules found in our study. The previous report has identified 18 ENOD-like, seven UC-like and four SC-like AGPs in Arabidopsis (Li et al. 2013). There were about 1.5 times more ENOD-like, 0.71 times more UC-like and 2.25 more SC-like AGPs in poplar than in Arabidopsis, which may result from the evolutionary diversification of the two plants.
Multiple sequence alignment confirmed that Cys residues involved in the formation of disulfide linkages are highly conserved in all 74 PtPCs, suggesting they are essential for maintaining PCLD structure, function and stability. PCs are involved in electron transport in the cytomembrane. This process is related to photophosphorylation and can affect ATP generation, and therefore influences physiological processes in plants.
Several reports on subcellular localization of PCs have provided information on their specific functions in plants. For instance, AtSC3 is located in the plasma membrane and related to aluminum as well oxidation stresses (Ezaki et al. 2001, 2005), which is possibly the response to other abiotic stresses. Khan et al. (2007) found that AtENODL9-1, one of the double mutants of AtENODL9, which is located in the sieve element plasma membrane, plays a more significant role in reproductive processes than irregular physiological processes, as revealed by analysis of the phenotype of homozygous T-DNA insertion mutants. All known PCs have secretion signals, which are essential to localize in the extracellular space. According to reports, more than half of ENODLs were predicted to be GPI-anchored proteins, implying theirs functions at the plasma membrane or extracellular matrix. Many proteins exist in the cell in a dynamic form, and they can perform diverse cell functions in different subcellulars. In addition, the plants often grow under a variety of environmental conditions, which may make some proteins, such as transcription factors, appear in different organelles during signal transduction. Our current study revealed that most PtPCs were localized in the plasma membrane or extracellular, and the phenomenon demonstrated that the different subcellular localization of phytocyanins may be due to diverse function in poplar. PtPCs may exert a significant part in the growth and the development of poplar, and they were divided into four subfamilies.
The exon/intron structure and motif arrangement are highly conserved in all subfamilies. The 74 identified PtPCs include different numbers of introns, indicating diverse functions during their evolution. The similar exon/intron and motif composition of members within subfamilies imply conservation of function, and the phylogenetic analysis can, therefore, be beneficial. The specific motifs are ordered 1-6-3 in all sequences, indicating strong conservation of domain structure. Motifs 7, 9 and 10 are only present in members of the ENODL subfamily, indicating a role in the functional divergence of PtENODLs. Notably, all PLCs lack motif 7 in poplar, demonstrating divergence from other subfamilies during evolution and its probable specific-role in plant development and growth, which could be important to their functions during electron transportation. The identified subfamily has different motifs which may play a crucial role in the different functions that their genes perform. These specific motifs present in the PtPCs are most probably the structural basis for their diverse functions.
The endosperm offers diverse nutrients to seedling growth after germination before it turns into photosynthetic and self-sufficiency (Zhang et al. 2016). Most PtPCs had Skn-1_motif cis-regulatory elements were relevant to endosperm expression. Analysis of promoter regions showed that the majority of PtPCs contained TC-rich repeats cis-regulatory elements, which suggested that PtPC genes also played a significant role in stress responses. In Boea crassifolia, BcBCP1, an ENOD-like gene, increases tolerance to osmotic stress in transgenic tobacco under the control of the CaMV 35S promoter. Ezaki et al. (2005) found that the Arabidopsis blue-copper binding gene restrained aluminum absorption so as to keep plants and Saccharomyces cerevisiae (yeast) out of aluminum toxicity. In the present study, PtENODL24 containing the MYB cis-element was strongly induced under drought treatment, which was supported by the promoter cis-element analysis.
Gene duplication is a vital source of new genes in the course of evolution (Lynch and Conery 2000; Gu et al. 2003; Khan et al. 2007). Gene duplication can also help organisms adapt to different environments during development and growth (Bowers et al. 2003). Gene duplication mechanisms such as unequal exchange, reverse transcription, or entire gene duplication produce a gene or base sequence that resembles the original gene (Zhang 2003). However, previous studies indicated that half of OsPCs in rice were the result of segmental duplication, and the others derived from tandem duplication, which indicated that these two types of duplication event played an equivalent role in the expansion of OsPC genes (Ma et al. 2011). The results of the present study indicated that the conserved region of 10 PtPC paralogs resulted from segmental duplication events, suggesting that segmental duplication exerted a crucial part in the expansion of the PtPC gene family.
Ka/Ks ratio can be used to measure the historical choice of coding sequences (Vandepoele et al. 2003; Wu et al. 2016). In this study, Ka/Ks sliding window analysis indicated that most PtPCs were under disadvantageous selection, imply a strong selection constraint and purifying selection in the PtPC genes. This suggests that functional divergence of duplicated genes might have been promoted by positive selection during evolution, which may facilitate the adaptation of plants in different environments. By evaluating the duplication time of paralogs, we inferred that all large-scale duplication events involving PtPCs occurred within the last 10.20–55.80 million years. We observed microsynteny in PtPCs and identified 23 collinear PC gene pairs, indicating a low degree of divergence during evolution. Simultaneously, we observed microsynteny between Populus and Arabidopsis PC genes and identified 15 pairs orthologs, hence orthologs derived from common ancestral genes are present in different species.
Since expression analysis can provide valuable information for further exploration of the relative expression levels and better understand the PC gene function in P. trichocarpa, we examined the expression of PtPCs in poplar in different tissues using microarray data. The results demonstrated that most genes were highly expressed in roots and xylem, indicating roles in vegetative growth. Through comparisons of microarray analysis and qRT-PCR analysis, we found that the majority of PtPCs was similarly expressed in the same organs. However, there were some different results. For example, PtENODL41 was expressed at low levels in roots according to microarray analysis but relatively high in the same tissue according to qRT-PCR, which might result from differences in the experiments, such as conditions, poplar ages, sample collection times and so on. In addition, abiotic stresses such as high salinity and high temperature influence plant growth and development. Many stress-response genes are activated to help plants to deal with stress in adverse circumstances. Hence, it is important to identify the major regulatory pathways of stress responses in poplar. The results of qRT-PCR experiments indicated an important function for PtPCs under drought or salt stresses. Previous reports showed that OsUC23/26/27 and BrUC6/16 were highly expressed under drought or salt stresses. Differences of gene expression among species indicated that PC genes in different plants may generate diverse responses to abiotic stress.
Conclusions
In the current study, we identified 74 PtPCs and comprehensively and systematically analyzed their PC domains, gene structure, gene replication, chromosomal distribution, and conserved microsynteny. The evolutionary relationships between PCs in Arabidopsis and poplar were also investigated, and the results revealed different expression patterns, indicating differentiated functions. To further understand the function and role of each PC family gene, multiple methods including molecular genetic analysis should be employed. The qRT-PCR analysis indicated a central role of PtPCs in many aspects of plant growth and development and plant stress response. Our findings provide a theoretical basis for further research on the function of PtPCs.
Author contribution statement
SSL and WFH designed and conceived the experiment, carried out the principal bioinformatics analysis, drafted the manuscript. Performed the experiments: SSL, WFH. Edited the data, figures and tables: YW, BL. Contributed reagents/materials/analysis tools: YX. All authors read and approved the final manuscript.
Abbreviations
- AG:
-
Arabinogalactan
- AGPs:
-
Arabinogalactan proteins
- ENODLs:
-
Early nodulin-like proteins
- PCs:
-
Phytocyanins
- PCLD:
-
Plastocyanin-like domain
- PLCs:
-
Plantacyanins
- SCs:
-
Stellacyanins
- SP:
-
Signal peptide
- UCs:
-
Uclacyanins
- K s :
-
Number of synonymous substitutions per synonymous site
- K a :
-
Number of non-synonymous substitutions per non-synonymous site
References
Anderson SL, Teakle GR, Martino-Catt SJ, Kay SA (1994) Circadian clock- and phytochrome regulated transcription is conferred by a 78 bp cis-acting domain of the Arabidopsis CAB2 promoter. Plant J 6(4):457–470
Bobb AJ, Chern MS, Bustos MM (1997) Conserved RY-repeats mediate transactivation of seed-specific promoters by the developmental regulator PvALF. Nucleic Acids Res 25(3):641–647
Bowers JE, Chapman BA, Rong J, Paterson AH (2003) Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature 422(6930):433–438
Cao J, Li X, Lv Y, Ding L (2015) Comparative analysis of the phytocyanin gene family in 10 plant species: a focus on Zea mays. Front Plant Sci 6:515. https://doi.org/10.3389/fpls.2015.00515
Cruz-Garcia F, Nathan HC, Kim D, Mcclure B (2005) Stylar glycoproteins bind to S-RNase in vitro. Plant J 42(3):295–304
Denancé N, Szurek B, Noël LD (2014) Emerging functions of nodulin-like proteins in non-nodulating plant species. Plant Cell Physiol 55(3):469–474
Diab AA, Teulat-Merah B, This D, Ozturk NZ, Benscher D, Sorrells ME (2004) Identification of drought-inducible genes and differentially expressed sequence tags in barley. Theor Appl Genet 109(7):1417–1425
Dong J, Kim ST, Lord EM (2005) Plantacyanin plays a role in reproduction in Arabidopsis. Plant Physiol 138(2):778–789
Eisenhaber B, Wildpaner M, Schultz CJ, Borner GH, Dupree P, Eisenhaber F (2003) Glycosylphosphatidylinositol lipid anchoring of plant proteins. Sensitive prediction from sequence- and genome-wide studies for Arabidopsis and rice. Plant Physiol 133(4):1691–1701. https://doi.org/10.1104/pp.103.023580
Escobar-Restrepo JM, Huck N, Kessler S, Gagliardini V, Gheyselinck J, Yang WC, Grossniklaus U (2007) The FERONIA receptor-like kinase mediates male–female interactions during pollen tube reception. Science 317(5938):656–660
Ezaki B, Katsuhara M, Kawamura M, Matsumoto H (2001) Different mechanisms of four aluminum (Al)-resistant transgenes for Al toxicity in Arabidopsis. Plant Physiol 127(3):918–927
Ezaki B, Sasaki K, Matsumoto H, Nakashima S (2005) Functions of two genes in aluminium (Al) stress resistance: repression of oxidative damage by the AtBCB gene and promotion of efflux of Al ions by the NtGDI1gene. J Exp Bot 56(420):2661–2671
Fedorova M, Mortel JVD, Matsumoto PA, Cho J, Town CD, VandenBosch KA, Gantt JS, Vance CP (2002) Genome-wide identification of nodule-specific transcripts in the model legume Medicago truncatula. Plant Physiol 130(2):519–537
Giri AV, Anishetty S, Gautam P (2004) Functionally specified protein signatures distinctive for each of the different blue copper proteins. BMC Bioinform 5(1):1–8
Goldsbrough AP, Albrecht H, Stratford R (1993) Salicylic acid-inducible binding of a tobacco nuclear protein to a 10 bp sequence which is highly conserved amongst stress-inducible genes. Plant J 3(4):563–571
Goodstein DM, Shu S, Russell H, Rochak N, Hayes RD, Joni F, Therese M, William D, Uffe H, Nicholas P (2011) Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res 40:1178–1186
Gu Z, Steinmetz LM, Gu X, Scharfe C, Davis RW, Li WH (2003) Role of duplicate genes in genetic robustness against null mutations. Nature 421(6918):63–66
Hart PJ, Nersissian AM, Herrmann RG, Nalbandyan RM, Valentine JS, Eisenberg D (1996) A missing link in cupredoxins: crystal structure of cucumber stellacyanin at 1.6 A resolution. Protein Sci 5(11):2175–2183
Hou Y, Guo X, Cyprys P, Zhang Y, Bleckmann A, Cai L, Huang Q, Luo Y, Gu H, Dresselhaus T, Dong J, Qu L (2016) Maternal ENODLs are required for pollen tube reception in Arabidopsis. Curr Biol 26(17):2343–2350
Hu R, Qi G, Kong Y, Kong D, Qian G, Zhou G (2010) Comprehensive analysis of NAC domain transcription factor gene family in Populus trichocarpa. BMC Plant Biol 10(1):145. https://doi.org/10.1186/1471-2229-10-145
Hui M, Lin F, Zhu C, Xue C, Hualin Z, Yan X (2014) Genome-wide identification and expression analysis of the IQD gene family in Populus trichocarpa. Plant Sci 229:96–110
Jeßberger N, Krey VM, Rademacher C, Böhm ME, Mohr AK, Ehlingschulz M, Scherer S, Märtlbauer E (2015) From genome to toxicity: a combinatory approach highlights the complexity of enterotoxin production in Bacillus cereus. Front Microbiol 6(6):560
Johnson KL, Cassin AM, Lonsdale A, Bacic A, Doblin MS, Schultz CJ (2017) A motif and amino acid bias bioinformatics pipeline to identify hydroxyproline-rich glycoproteins. Plant Physiol 174(2):886–903
Khan JA, Wang Q, Sjölund RD, Schulz A, Thompson GA (2007) An early nodulin-like protein accumulates in the sieve element plasma membrane of Arabidopsis. Plant Physiol 143(4):1576–1589
Li J, Yu M, Geng LL, Zhao J (2010) The fasciclin-like arabinogalactan protein gene, FLA3, is involved in microspore development of Arabidopsis. Plant J 64(3):482–497
Li J, Gao G, Zhang T, Wu X (2013) The putative phytocyanin genes in Chinese cabbage (Brassica rapa L.): genome-wide identification, classification and expression analysis. Mol Genet Genom 288(1–2):1–20
Lynch M, Conery JS (2000) The evolutionary fate and consequences of duplicate genes. Science 290(5494):1151–1155
Ma H, Jie Z (2010) Genome-wide identification, classification, and expression analysis of the arabinogalactan protein gene family in rice (Oryza sativa L.). J Exp Bot 61(10):2647–2668
Ma H, Zhao J (2010) Genome-wide identification, classification, and expression analysis of the arabinogalactan protein gene family in rice (Oryza sativa L.). J Exp Bot 61(10):2647–2668
Ma H, Zhao H, Liu Z, Zhao J (2011) The phytocyanin gene family in rice (Oryza sativa L.): genome-wide identification, classification and transcriptional analysis. PLoS One 6(10):e25184
Ma Y, Yan C, Li H, Wu W, Liu Y (2017) Bioinformatics prediction and evolution analysis of arabinogalactan proteins in the plant kingdom. Front Plant Sci 8:1–17
MacMillan CP, Mansfield SD, Stachurski ZH, Evans R, Southerton SG (2010) Fasciclin-like arabinogalactan proteins: specialization for stem biomechanics and cell wall architecture in Arabidopsis and Eucalyptus. Plant J 62(4):689–703
Mashiguchi K, Yamaguchi I, Suzuki Y (2004) Isolation and identification of glycosylphosphatidylinositol-anchored arabinogalactan proteins and novel β-glucosyl Yariv-reactive proteins from seeds of rice (Oryza sativa). Plant Cell Physiol 45(12):1817–1829
Mashiguchi K, Asami T, Suzuki Y (2009) Genome-wide identification, structure and expression studie, and mutant collection of 22 early nodulin-like protein genes in Arabidopsis. Biosci Biotechnol Biochem 73(11):2452–2459
Nejad ES, Askari H, Soltani S (2012) Regulatory TGACG-motif may elicit the secondary metabolite production through inhibition of active cyclin-dependent kinase/cyclin complex. Plant Omics 5(6):553–558
Nersissian AM, Immoos C, Hill MG, Hart PJ, Williams G, Herrmann RG, Valentine JS (1998) Uclacyanins, stellacyanins, and plantacyanins are distinct subfamilies of phytocyanins: plant-specific mononuclear blue copper proteins. Protein Sci 7(9):1915–1929
Ozturk ZN, Talamé V, Deyholos M, Michalowski CB, Galbraith DW, Gozukirmizi N, Tuberosa R, Bohnert HJ (2002) Monitoring large-scale changes in transcript abundance in drought- and salt-stressed barley. Plant Mol Biol 48(5–6):551–573
Petersen TN, Brunak S, Von HG, Nielsen H (2011) SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods 8(10):785–786
Poon S, Heath R, Clarke A (2013) A chimeric arabinogalactan protein promotes somatic embryogenesis in cotton cell culture. Plant Physiol 160(2):684–695
Schmittgen TD, Livak KJ (2008) Analyzing real-time PCR data by the comparative C(T) method. Nat Protoc 3(6):1101–1108
Schultz CJ, Rumsewicz MP, Johnson KL, Jones BJ, Gaspar YM, Bacic A (2002) Using genomic resources to guide research directions. The arabinogalactan protein gene family as a test case. Plant Physiol 129(4):1448–1463
Seifert GJ, Roberts K (2007) The biology of arabinogalactan proteins. Annu Rev Plant Biol 58(1):137–161
Sessa G, Morelli G, Ruberti I (1993) The Athb-1 and -2 HD-Zip domains homodimerize forming complexes of different DNA binding specificities. EMBO J 12(9):3507–3517
Shen Q, Ho TH (1995) Functional dissection of an abscisic acid (ABA)-inducible gene reveals two independent ABA-responsive complexes each containing a G-box and a novel cis-acting element. Plant Cell 7(3):295–307
Showalter AM, Keppler B, Lichtenberg J, Gu D, Welch LR (2010) A bioinformatics approach to the identification, classification, and analysis of hydroxyproline-rich glycoproteins. Plant Physiol 153(2):485–513
Showalter AM, Keppler B, Liu X, Lichtenberg J, Welch LR (2016) Bioinformatic identification and analysis of hydroxyproline-rich glycoproteins in Populus trichocarpa. BMC Plant Biol 16(1):229
Sunkar R, Zhu JK (2004) Novel and stress-regulated microRNAs and other small RNAs from Arabidopsis. Plant Cell 16(8):2001–2019
Tamura K, Stecher G, Peterson D, Filipski A, Kumar S (2013) MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol 30(12):2725–2729
Tan L, Qiu F, Lamport DTA, Kieliszewski MJ (2004) Structure of a hydroxyproline (Hyp)-arabinogalactan polysaccharide from repetitive Ala-Hyp expressed in transgenic Nicotiana tabacum. J Biol Chem 279(13):1315656–1315665
Vandepoele K, Simillion C, Van de Peer Y (2003) Evidence that rice and other cereals are ancient aneuploids. Plant Cell 15(9):2192–2202
Wang Y, Feng L, Zhu Y, Li Y, Yan H, Xiang Y (2015) Comparative genomic analysis of the WRKY III gene family in populus, grape, arabidopsis and rice. Biol Direct 10:48. https://doi.org/10.1186/s13062-015-0076-3
Washida H, Wu CY, Suzuki A, Yamanouchi U, Akihama T, Harada K, Takaiwa F (1999) Identification of cis-regulatory elements required for endosperm expression of the rice storage protein glutelin gene GluB-1. Plant Mol Biol 40(1):1–12
Wu H, Shen Y, Hu Y, Tan S, Lin Z (2011) A phytocyanin-related early nodulin-like gene, BcBCP1, cloned from Boea crassifolia enhances osmotic tolerance in transgenic tobacco. J Plant Physiol 168(9):935–943
Wu M, Li Y, Chen D, Liu H, Zhu D, Xiang Y (2016) Genome-wide identification and expression analysis of the IQD gene family in moso bamboo (Phyllostachys edulis). Sci Rep UK 6:24520
Xu L, Wang XJ, Wang T, Li LB (2017) Genome-wide identification, classification, and expression analysis of the phytocyanin gene family in Phalaenopsis equestris. Biol Plant 61(3):1–8
Ye H, Du H, Tang N, Li X, Xiong L (2009) Identification and expression profiling analysis of TIFY family genes involved in stress and phytohormone responses in rice. Plant Mol Biol 71(3):291–305
Yoshizaki M, Furumoto T, Hata S, Shinozaki M, Izui K (2000) Characterization of a novel gene encoding a phytocyanin-related protein in morning glory (Pharbitis nil). Biochem Biophys Res Commun 268(2):466–470
Zhang J (2003) Evolution by gene duplication: an update. Trends Ecol Evol 18(6):292–298
Zhang R, Tucker MR, Burton RA, Shirley NJ, Little A, Morris J, Milne L, Houston K, Hedley PE, Waugh R, Fincher GB (2016) The dynamics of transcript abundance during cellularisation of developing barley endosperm. Plant Physiol 170(3):1549–1565
Acknowledgements
We thank the members of the Laboratory of Modern Biotechnology for their assistance in this study.
Funding
National Natural Science Foundation of China (31370561) and National Science and Technology Support Plan Corpus (2015BAD07B070104).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Fig. S1
Protein backbones of PCs in P. trichocarpa. The coloured sequences at the N and C terminus indicate predicted signal peptides (green) and GPI anchor addition sequences (purple) if present in the sequences. Putative AG sites (blue) are also indicated. Red font indicates N-glycosylation site. Extensins SP3, SP4 and SP5 repeats are indicated (light blue) if present. Note that the putative AG glycomodules are only checked in the proteins containing N-secretion signals (TIFF 32125 kb)
Fig. S2
Exon–intron structures and conserved domains of the predicted PtPC proteins. Exons, introns and untranslated regions (UTRs) are represented by yellow rectangles, grey lines and blue rectangles, respectively (TIFF 437 kb)
Fig. S3
Sliding window plots of Ka/Ks ratios of representative duplicated PtPC gene paralogs. The x-axis denotes the nucleotide position, and the y-axis denotes the Ka/Ks ratio (TIFF 68 kb)
Fig. S4
Extensive microsynteny of PC regions between Arabidopsis and poplar. Arabidopsis and Populus chromosomes are shown in different colours. Numbers along each chromosome box indicate sequence length in megabases. Whole chromosomes of the two species harbouring PC regions are encircled. Black lines represent syntenic relationships between PC regions (TIFF 3195 kb)
Fig. S5
Synteny analysis of PC genes in Populus. Black lines represent syntenic relationships (TIFF 2627 kb)
Fig. S6
Hierarchical clustering of poplar PC gene expression. The heatmap shows hierarchical clustering of PtPC genes across various tissues/organs. Affymetrix microarray data under accession number GSE13990 encompassing results from six organ/tissue types including young leaves (YL), roots (R), xylem (XY), female catkin (FC), male catkin (MC) and mature leaves (ML) were re-analysed (TIFF 621 kb)
Rights and permissions
About this article
Cite this article
Luo, S., Hu, W., Wang, Y. et al. Genome-wide identification, classification, and expression of phytocyanins in Populus trichocarpa. Planta 247, 1133–1148 (2018). https://doi.org/10.1007/s00425-018-2849-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00425-018-2849-2