Introduction

Photoreceptor molecules in plants are responsible for light perception and the initiation of signal responses, which are crucial for survival and acquiring optimal status in a dynamically changing light environment. There are five phytochromes (phys) in Arabidopsis, namely phyA, phyB, phyC, phyD and phyE, which are responsible for perceiving and response to red (R) and far-red (FR) light spectrum (Clack et al. 1994; Legris et al. 2019). Phytochromes are converted between two interconvertible forms, when R absorbing, the inactive form Pr (R absorbing) located in the cytosol converted into active Pfr form the translocates into the nucleus (Klose et al. 2015). Photoactivated phyB regulates photomorphogenesis by integrating multiple signaling pathways by interacting with a set of basic helix-loop-helix (bHLH) transcription factors (TFs) known as PIFs (or PILs) (PHYTOCHROME INTERACTING FACTOR-LIKE) (Leivar and Quail. 2011). This kind of proteins belong to the group VII of bHLH superfamily with 15 members in A. thaliana (Heim et al. 2003). Thereinto, eight members are designated as PIF (PIF1-PIF8) in Arabidopsis, which all harbors a conserved APB (for active phyB binding) motif (also called PIL-motif) in N-terminal region (Khanna et al. 2004; Lee and Choi 2017). Moreover, PIF1 and PIF3 also contain additional APA (for active phyA binding) motif with a less conserved sequence, which is necessary for phyA binding (Al-Sady et al. 2006; Shen et al. 2008). Mutational analyses show that the four amino acids in the APB domain (ELxxxxGQ) provide the Pfr specificity for the physical interactions of phyB-PIF, and that the sequences on both sides of the amino acids may also affect the binding of phyB to a certain extent (Khanna et al. 2004). The C-terminus of PIF proteins bears the bHLH domain, which is responsible for the formation of homodimers or heterodimers with other bHLH proteins and DNA-binding with the protein (Bu et al. 2011; Hornitschek et al. 2009). In general, all PIFs can bind the G-box (5'-CACGTG-3') or PBE-box (5'-CACATG-3') of target gene promoter region to regulate target gene expression at the transcriptional level (Zhang et al. 2013).

PIFs are involved in maintaining the skotomorphogenic status of the etiolated seedlings and the repression of photomorphogenic development as negative regulators in darkness (Leivar and Monte 2014). However, they are promoted the photomorphogenic development through rapid phosphorylation, ubiquitination, and proteasome-mediated their degradation after the onset of light (Xu et al. 2015). The regulation of most light responses displays significant functional redundancy within the PIF family. For instance, AtPIF1, AtPIF3, AtPIF4, AtPIF5, and AtPIF7 inhibit phy-imposed seedling photomorphogenesis during deetiolation, under daytime growing conditions, and response to shading through varying degrees of overlapping redundancy (Leivar and Monte 2014). However, they also have distinct functions under certain environment. For example, AtPIF1 shows functional specificity in inhibiting light-dependent seed germination (Oh et al. 2004). Unlike other AtPIFs, AtPIF2 (also called PIL1) and AtPIF6 slightly positively regulates seedling deetiolation and seed dormancy in a certain light, respectively (Luo et al. 2014; Penfield et al. 2010). AtPIF1 and/or AtPIF3 regulate chlorophyll biosynthesis photosynthesis, and reactive oxygen species (ROS) responses by modulating the expression of a number of genes during deetiolation (Chen et al. 2013; Liu et al. 2013). In addition, AtPIF3 functions in hypocotyl growth by modulating the freezing tolerance (Xu and Deng. 2020) and stomatal opening (Wang et al. 2010a). Moreover, AtPIF4 and AtPIF5 directly interact with cryptochromes and bind to target gene promoters in response to blue light (Ma et al. 2016; Pedmale et al. 2016). AtPIF4 and AtPIF7 are essential for the early response to thermomorphogenesis and at least partially accompanied by the production of the growth promoting hormone auxin (Fiorucci et al. 2020; Koini et al. 2009). AtPIF4 also accelerates entering the reproductive stage of plants in a temperature-mediated manner by directly activating the expression of flowing locus T (FT) (Kumar et al. 2012). Interestingly, AtPIF8 inhibits phyA-induced FR light responses, rather than phyB-mediated (Oh et al. 2020).

Very little PIFs have been annotated in plants except for Arabidopsis, but at least the evidence of multiple PIFs have been found at genome-wide levels in other seed plants, including rice (Nakamura et al. 2007), tomato (Lee & Choi. 2017), Malus domestica (Zheng et al. 2020), and so on. Moreover, ZmPIF3.1 and ZmPIF3.2 have also been characterized that interact with the Pfr form of ZmphyB1 and possess presumptive APA motifs (Kumar et al. 2016). Likewise, rice PIF protein OsPIF14 interacts with the Pfr form of OsphyB (Cordeiro et al. 2016). Moreover, non-vascular plants Marchantia MpPIF and Physcomitrium PpPILs contain APA motifs, but not APB motifs. Interestingly, MpPIF preferentially interacts with the MpPhy-Pfr through this conserved motif, indicating that PIFs interaction with phyB depending on APB motifs is a derived characterization of spermatophyte (Lee and Choi. 2017). However, the degradation mechanism of the liverwort PIFs is similar to that of Arabidopsis, suggesting that it seems to originate and evolve in the common ancestor of land plants. Therefore, the further tackling of PIFs evolutionary history can benefit to better understanding their function. What’s more, the pattern of gene expansion is an important aspect of evolutionary history. Because gene duplication provides new opportunities and basic sources for evolutionary success, especially whole genome duplications (WGDs) (Van de Peer et al. 2009).

Brachypodium distachyon is a model plant for its small physical stature, self-pollinate, short life cycle, and a small diploid genome, to study relevant questions unique to the cereals and grasses (International Brachypodium. 2010). PIFs function as a hub that integrates hormonal and environmental signals to orchestrate regulation of the transcriptional network (Lau and Deng. 2010; Leivar and Quail. 2011). However, there is limited information on the origin and evolutionary history of them in entire plant species and their functional evolution in B. distachyon. Here, we investigated PIF genes using bioinformatics methods and analyzed their gene organization, conserved motifs, molecular evolution, gene duplication patterns and phylogenetic relationships throughout plant lineages. To better understand the function of B. distachyon PIF genes, expression patterns of these genes were surveyed under various tissues and stresses. Moreover, the subcellular localization analysis of BdPIFs was also performed and observed. Finally, heterogeneous overexpression of BdPIFs was performed in Arabidopsis displayed the phenotype of hypocotyl elongation in the light treatment condition and precocious flowering, especially the transgenic plants of BdPIF4/5C. In general, these results provided a new viewpoint on the origin and evolutionary history, and established a foundation for further functional research of plant PIFs.

Methods and materials

Identification of PIF family members in land plant species

Detailed information about the Arabidopsis PIF gene was obtained from The Arabidopsis Information Resources database (TAIR: http://www.arabidopsis.org/) (Lamesch et al. 2012). Putative PIF genes of other plant species were identified by BLASTP searches across the set of orthologous protein sequences of A. thaliana in the publicly available PLAZA database (http://bioinformatics.psb.ugent.be/plaza/) (Van Bel et al. 2018) and/or phytozome database (https://phytozome-next.jgi.doe.gov/) (Goodstein et al. 2012). When the protein harbored HLH domain (IPR011598) and APB motif used in this search, it was considered as a candidate protein and was scanned by InterPro software for final confirmation. In addition, all identified proteins were examined for redundancy, without considering any alternative splice variants.

Sequence alignment, protein motif, phylogenetic tree and molecular evolution analyses

Multiple sequence alignments were performed using the Clustal Omega (http://www.ebi.ac.uk/Tools/msa/clustalo/). The conserved residues and PIL motifs in the alignments were executed using InterProScan against protein database (http://www.ebi.ac.uk/interpro/). Visualization of alignment logos of the protein conserved domain were generated using WebLogo (http://weblogo.threeplusone.com/). The structure schematic of the PIFs was achieved based on the InterProScan results. The theoretical isoelectric point (pI) and molecular weight (Mw) of PIFs was predicted by the online Compute pI/Mw tool (http://web.expasy.org/compute_pi/). DNASP v5.10 was used to counting Ka and Ks values and their ratios of candidate PIFs aligned cDNA sequences (Librado & Rozas. 2009). The genetic distances of between each group were calculated with the Jones-Taylor-Thornton (JTT) model in the Molecular Evolution Genetic Analysis 6.0 (MEGA 6.0) (Tamura et al. 2013). MEGA6 software was also used to calculate the overall mean distances of all PIFs. The phylogenetic tree was reconstructed by IQTREE (Nguyen et al. 2015) using trimmed multiple alignments with the JTT + I + G model selected by ProtTest. A total of 1,000 bootstrap replicates were performed for support estimation.

Plant materials and growth conditions

B. distachyon and A. thaliana plants were grown in a soil mix (clay is mixed with soil at 3:1 ratio) and placed in a greenhouse with 22 ℃ and 70% room humidity under a 14-h-light (> 3000 lx) /10-h-dark photoperiod. Nicothiana benthamiana plants were grown in growth room conditions (16 h light at 26 °C and 8 h dark at 22 °C). All A. thaliana lines used in this study were in the Columbia-0 (Clo-0) background. Arabidopsis and B. distachyon seeds were surface-sterilized with 75% ethanol for 1 min, 10% bleaching powder for 10 min, then washed with sterilized water three times at least and sown on sterile Murashige and Skoog (MS) medium containing 1.5% agar. Seeds were placed for 1 week at 4 °C in darkness and then transferred to a phytotron set at standard conditions or in continuous light (100 μmol m−2 s−1) and constant temperature for specific experiments.

Expression analysis

The real-time quantitative polymerase chain reaction (RT-qPCR) was used to quantify the mRNA level of the BdPIFs genes, Actin as an internal control. For different tissue analysis, root, stem, leaf, caryopsis and seedling samples was harvested from 2-week-old B. distachyon Bd21 plants grown in 1/2 MS solid medium. For phytohormone analysis, 2-week-old seedlings were grown in 1/2 MS liquid medium containing 100 µM ABA for 1 h, 3 h, 6 h, 12 h and 24 h, respectively. For abiotic stress analysis, light treatment, UV-B and heat conditions were treated by placing 2-week-old seedlings in 1/2 MS liquid medium at shade, UV-B and 40 °C for 1 h, 3 h, 6 h, 12 h and 24 h, respectively. For biotic stress, the leaves of seedlings were exposed in spray inoculation with a suspension of Magnaporthe grisea (Guy11, avirulent ace1 genotype; PH14, virulent ace1 genotype) or Fusarium graminearum (F0968) conidia and sampled at 4 h and 12 h after treatment. Untreated plants were served as control samples. Seedlings were collected at different time points after treatment and immediately frozen with liquid nitrogen.

Total RNA was isolated using Trizol reagent and reverse-transcribed into cDNA using PrimeScript RT Master Mix Perfect Real Time (TaKaRa) according to the manufacturers instructions. The integrity of total RNA was evaluated by electrophoresis, and optical density readings were quantified using Nanodrop1000. The RT-qPCR assay was implemented in a 10 µl reaction with 5–50 ng of first-stand cDNA products (4 µl), 5 pmol of each primer (0.4 µl), 5 μl SYBR green master mix (Takara Co., Ltd, China) and 0.2 µl ROX. To maximize the visibility of the bands, the number of PCR cycles for each gene was adjusted to 45. Three replicates were performed independently for each treatment, using three biological replicates. The relative expression of BdPIF genes was counted using 2−ΔΔCt method. The up-regulated genes were defined as a fold-change greater than 2 with the p value of < 0.05 and a fold change of 0.5 or less was used to define down-regulated genes when the p value of < 0.05 with sample at 0 h as control. The primer-sets were listed in Table S1.

Plasmid construction, Arabidopsis transformation and subcellular localization assays

Full-length cDNAs encoding BdPIF4/5A was amplified from Bd21 plants using 12-p1300-F (KpnI) and 12-p1300-R (XbaI) primers (Table S2), and cloned into pCAMBIA1300 vector at KpnI/XbaI sites, yielding plasmid 35S:BdPIF4/5A which was used for Agrobacterium-mediated transformation of the Arabidopsis (Clough and Bent 1998). The transgenic plants were screened on 1/2 MS medium containing 25 µg/L hygromycin, and their mRNA levels were confirmed by RT-PCR. The acquisition for transgenic plants of other BdPIFs was executed in accordance with the above method.

For subcellular localization analysis, the coding sequence of BdPIF4/5A was also amplified with12-GFP-F (KpnI) and 12-GFP-R (XbaI) primers (Table S3), and cloned into modified pCAMBIA1300-GFP at KpnI/XbaI sites, yielding plasmid 35S:BdPIF4/5A-GFP, which was used for transient expression in N. benthamiana leaves. The agroinfiltrated leaves of N. benthamiana with 35S:BdPIF4/5A-GFP were cultured in a standard greenhouse and gathered at 48 h after Agrobacterium GV3101 infiltration. BdPIF4/5A-GFP fluorescence signals were excited by 488-nm ion argon laser and detected by Zeiss LSM 510 Meta confocal laser scanning microscope (Oberkochen, Germany) (Anderson et al. 2006). The subcellular localization analysis of other BdPIFs was also performed according to the above method.

Physiological and biochemical measurements

Measurements of hypocotyls length were recorded with IMAGE J software (http://www.rsb.info.nih.gov/ij). Flowering time was recorded by counting the number of rosette leaves and the number of days from the sowing day to the appearance of the first flower under long-day conditions (light/16 h: dark/8 h). The Arabidopsis seedlings treated by shading were used as samples for physiological and biochemical analysis. Simply, seedlings were grown in 1/2 MS solid medium under standard greenhouse for 8 days, then transferred to darkness condition (Other conditions being constant) for 2 days and collected for next analysis. Superoxide dismutase (SOD), peroxidase (POD) and catalase (CAT) activities were detected by spectrophotometry following previously described methods (Mittova et al. 2000; Wang et al. 2010b). A minimum of 20 independent seedlings were picked for hypocotyl measurement with Clo-0 as control. Each experiment was repeated with multiple biological times with similar results. For expression analysis of SODs, PODs and CATs, the seedlings of heterologous expression plants -were collected after 2 days under light treatments. The RT-qPCR analysis was performed according to the methods mentioned above.

Results

Identification and characterization of PIF genes in 40 land plant species based on the conserved PIL-motif

To explore the diversity and evolution of PIF proteins, identification of PIF gene family was performed from 40 publicly available genome of land plants that located at different evolutional positions (Fig. 1). The result showed that the numbers of PIFs varied from species to species across the entire land plant kingdom. However, canonical PIF were absent in several streptophyte algal lineages including Micromonas commoda, Ostreococcus lucimarinus, Volvox carteri and Chlamydomonas reinhardtii dwelling in seawater environments (Table S4). Finally, 246 non-redundant PIF sequences harboring the bHLH domain and PIL-motif were retrieved in total (Table S4). The diploid Brassica oleracea and Glycine max genome had the largest number of PIF genes, whereas the Marchantia polymorph contained one (Fig. 1). Moreover, Brassica rapa (13) and Gossypium raimondii (9) contained higher PIF numbers (Fig. 1, Table S4). It was worth noting that a novel member (OsPIF8) considered as PIFs in rice, which was slightly difference with previous research (Nakamura et al. 2007). Hence, the PIF members are all increased in corresponding monocotyledons, such as Setaria italica and Zea mays (Fig. 1, Table S4). In non-seed plants, only eight PIF genes were identified, including three in Selaginella moellendorffii, four in Physcomitrium patens and one in M. polymorph. In flowering plants, the expansion of PIF genes might be involved with the recursive polyploidy or ancient polyploidization events (Fig. 1). For example, the duplicated G. max genomes that retain evidence of two rounds of WGDs (Jiao et al. 2011) had 14 PIF genes. Similarly, Brassica genomes undergone a Brassiceae-lineage-specific whole-genome triplication (WGT) (Liu et al. 2014) which might be contribute to the increase of gene number reach to 14 (Fig. 1).

Fig. 1
figure 1

modified from Phytozome (https://phytozome-next.jgi.doe.gov/). Red and blue stars indicate whole-genome duplication and triplication, respectively

Phylogenetic relationships between 40 plant species investigated in this study. The total number of PIF proteins identified in each plant genome is indicated on the right. The phylogenetic tree is

Phylogenetic and molecular evolution analyses of PIFs

To investigate the evolution history of plant PIF proteins, the phylogenetic reconstruction of all identified 246 protein sequences composed of the conserved domains (APB and bHLH) from 40 land plants were performed based on sequence alignments of amino acids using ML methods. The phylogenetic tree clarified the number of main PIF lineages and the relationships between these. In this tree, PIF sequences resolved three main clades: Clade A (including PIF1, PIF4 and PIF5), Clade B (PIF7 and PIF8), and Clade C (PIF2, PIF3 and PIF6) (Fig. 2a). Because they include bryophyte sequences, all of sequences can probably be traced back to the origin of the land plants. Specially, Clade C can further split into two classes: class I (PIF3) and class II (PIF2 and PIF6), other clades have only one (Fig. 2a). Class I homologues comprised gymnosperms and angiosperms, forming a relatively well-supported clade (Fig. 2a, Table S4). In addition, Class I homologue was absent in Carica papaya (Table S4). As with the Class I PIF, Class V PIF contained gene members in gymnosperms and angiosperms, suggesting that they independently descended from a common ancestor of spermophyte. In addition, the sequence of Class V PIF was also absent in Hordeum vulgare. Class II PIF had gene members only in rosids including Brassicales, Malvales, Myrtales and Malpighiales, while Class IV PIF existed in almost all eudicots (Table S4). Ancient PIF consisted of only moss and fern PIF and had eight members. We named them based on the ancient roles of PIF3 clade than other members (although they were not on the same evolutionary branch with the Class I PIF), such as MpPIF3. The protein size of ancient PIF ranged from 211 (VvPIF8) to 853 (AcPIF3B) amino acids. Furthermore, Class III PIF exists only in angiosperms, indicating that they originated in the common ancestor of flowering plants.

Fig. 2
figure 2

Phylogeny and expansion of PIF genes in land plants. (a) Phylogram of the 246-taxon analyses obtained through maximum-likelihood analyses were conducted using JTT + I + G. Support values are shown for selected nodes, bootstrap replicates BP. A dot indicates support values of BP > 95. Scale bar indicates number of changes per site. (b) Evolutionary origin of the PIFs. Filled squares indicate the presence of genomic data, open squares indicate lack of data, and red asterisks indicate a whole-genome duplication event (according to Jiao et al. 2011)

Although the main three lineages of A, B, and C PIF have an ancient origin, their respective content has been shaped by many lineage-specific duplication events. Early duplications occurred in clades B and C of Gymnosperms (seed plants) to found the PIF2/3/6, and PIF7/8 lineages (Fig. 2b). Indeed, our phylogenetic tree showed that the present-day classes took place after the divergence of Lycophytes. In particular, the PIF sequences from ANA grade species indicated that these subfamilies were established before the initial radiation of the angiosperms. For instance, the PIF7 and PIF8 subfamilies each include sequences from ANA grade and other angiosperms (Fig. 2b). Interestingly, the duplication pattern of the PIF family is very consistent with the WGD that occurs during the evolution of seed plants (Jiao et al. 2011). Further analysis displayed that the parallel duplications occurred in several different PIF subfamilies prior to the radiation of the extant angiosperms, allowing what is called the preangiosperm duplication to play an important role in establishing the diversity of angiosperm PIF genes (Fig. 2b).

The genetic distance between Class I and II PIFs was 0.534, which was not the lowest than other counterparts, despite being located in adjacent evolutionary branches (Table 1), which was consistent with their evolutionary grouping (Fig. 2A). While the highest genetic distance difference occurred between Class II and Class V PIFs, suggesting that the sequence similarity between them were the least. In addition, the combination of Class V PIFs and other each group PIFs had relatively higher genetic distance than remaining counterparts (Table 1), indicating that Class V PIFs with other counterparts had higher sequence divergence. Interestingly, the average overall mean distance of PIFs was 0.452 (standard error 0.069). Moreover, the nonsynonymous-to-synonymous rates ratio (ώ = dN/dS) of each group PIFs have been comprehensively calculated under different codon substitution-based evolutionary models to deeper analyze their evolutionary basis of functional diversification (Table 2). Molecular evolution analysis displayed that Class III PIFs had the lowest Ka and ώ values (ώ = 0.311), reflecting strong purify selection during the evolution process (Table 2). While the mean ώ values of Class I, IV and V PIFs were greater than 1, suggesting that the PIF genes are positive selection during the evolution process. In addition, Class I and V PIFs had roughly similar Ka and ώ values when monocots were removed, which indicated that positive selection were mainly contributed to their evolution in Eudicots (Table 2). Moreover, we also surveyed the ώ values using the bHLH domain and APB motif sequences under different codon substitution-based evolutionary models. The results showed that all group PIFs had equally low Ka and ώ values, which were strong purify selection during the evolution process (Table S5). Molecular evolutionary analyses exhibited that the divergence of PIFs within each group were contributed to the sequences other than domain or motif, which indicated that their functional domains of PIFs were highly conserved.

Table 1 Genetic distance between different groups of PIF genes calculated based on the amino acid sequences with the Jones–Taylor–Thornton (JTT) model
Table 2 Molecular evolutionary analysis of the PIF genes using their whole cDNA sequences

Common conserved domain compositions and genomic analysis within plant PIF groups

To better understand the characteristics of different plant PIFs from an overview, the detailed sequence features were further surveyed. Not surprisingly, putative plant PIF proteins possessed common motifs in the same group, suggesting similar functions within each group. All of groups contained the bHLH domain (IPR011598) and the APB motif (Fig. 3a and b). It was noted that class I PIF (AtPIF3) and class IV PIF (AtPIF1) usually harbored APA motif (Fig. 3c). Moreover, a few PIF proteins also contained other domains, suggesting that PIF proteins might also have other functions. For instance, AtPIF7, BoPIF7, CruPIF7 and EgPIF1 extra contained a WD40/YVTN repeat-like-containing domain (IPR015943) (Fig. 3a). AcPIF1A and MaPIF8 also had a Zinc finger C2H2-type (IPR013087). An isopenicillin N synthase-like (IPR027443) and Remorin, C-terminal domains (IPR005516) also additional existed in NnPIF3B and FvPIF7, respectively. These data provided clues for us about other potential function of PIF proteins.

Fig. 3
figure 3

Gene structure and sequence features of conserved PIF genes. (a) Gene structure and protein motif. The structure of an A. thaliana gene (indicated on the left) is shown as an example for each class (in parenthesis on the protein motifs are shown as colored boxes, whereas introns of different phase are shown as colored vertical lines. Protein motif architectures of the full-length proteins were drawn based on a search of interPro program. IPR011598 indicates basic helix-loop-helix (bHLH) domain, IPR015943 means WD40/YVTN repeat-like-containing domain. Exons are drawn to scale. (b) Sequence feature of APB domain. Sequence features shown in the form of web logos representing the conserved APB motif of each class. (c) Sequence feature of APA domain. The red star indicates residues of functional or structural importance based on phylogenetic conservations. Logos were generated using the Weblogo3 application (http://weblogo.threeplusone.com/)

To elucidate the potential gene structural relationship among PIF orthologues and paralogues, their exon/intron organizations were also investigated using GSDS software. Most gene members within each group revealed similar exon/intron organizations in terms of intron number, intron phase and exon length. In particular, more similarities were observed in conserved regions such as bHLH domain (Fig. 3a). The numbers of intron varied from 0 to 10 among different members with wide variation (Table S6). Class II PIFs have introns ranged from 3 (BoPIF6A, BoPIF6B, and BoPIF6C) to 6 (SpPIF6) and 5 occupied the main points (58%), while the numbers of intron in Class V PIFs ranged from 0 (CpPIF8) to 9 (MaPIF8) and dominated with 5 (61%; Table S6). It was worth noting that most Class I, III and IV PIFs possessed 5 or 6 introns (Table S6).

Afterwards, the Mw and pI of different PIF proteins were also calculated using the online Compute pI/Mw tool. The Mw of PIF proteins ranged from 23.433 (VvPIF8) to 91.995 (MaPIF8) kDa and the pI varied from 4.82 (OsPIF4/5D) to 12.05 (SbPIF4/5C) (Table S6). It was interesting that the pI of all PIFs within each group ranged from acidic to alkaline. The average amino acid composition of PIF proteins were ranges from 0.71 (tryptophan) to 11.04 (serine) (Table S7). Notably, the average abundance of the most important amino acids such as glutamic acid, leucine and glycine (ELxxxxGQ) which is necessary and sufficient for phyB-specific binding were 5.38, 6.51 and 7.02, respectively. Moreover, the average abundance of their hydrophobic amino acids was relatively higher than ones of other amino acids such as alanine (8.28), leucine (6.51), proline (7.86) and valine (5.14) (Table S7).

Previous studies have reported several consensus sequences associated with the structural or functional roles, including the bHLH domain in C-terminal region, the APB motif and APA motif in N-terminal region. Most of all, the ELxxxxGQ in APB motif is required for phyB-specific binding (Khanna et al. 2004). To further identify the characteristic of consensus sequences among phylogenetic groups, the sequence logos motif analyses were then executed using WebLogo 3 online tool for each group to illustrate the sequences features of their key functional domain (Fig. 3b, c and Fig. S1). For example, the result displayed the detailed sequence information for APB or APA motif of each group (Fig. 3b and c). While the ancient PIFs may have difference (Inoue et al. 2016). The stars indicated that these residues were essential for the interaction PIFs with phytochrome. In all groups, the APB motif were highly conserved in spite of occasional variations such as EFPWEKDQ (AcPIF8) or EILWQNGQ (NnPIF4), indicating that most PIF had the function of binding to phyB (Fig. 3b). However, compared with APB motif, the APA motif was not highly conserved and existed only in Class I and IV PIF proteins (Fig. 3c). Furthermore, plant PIFs contained a highly conserved bHLH domain in C-terminal extension which comprised the HLH domain (dimerization) and basic domain (DNA binding capacity). The PIF proteins showed a group specific conserved basic domain except for Class II PIFs, H-N-X-S-E-R-R-R-R (Class I), H-N-L-S-E-R-R-R-R (Class III), H-N-L-S-E-R-R/K-R-R (Class IV) and H-N-Q/E-S-E-R-K/R-R-R (Class V), respectively (Fig. S1). The presence of group-specific basic domain in PIFs indicated that DNA binding capacity of PIFs might be have group specificity. Thus, it can be concluded that the evolution of different PIFs were orthologous based on group specificity.

Multiple duplication events were identified in each group

The main force in the evolution of different species is from gene duplication, which leads to the creation gene families from genes. To further comprehend the duplication and polyploidy or ancient polyploidization events of the PIF genes, the duplicated genes in land plant genome from each orthology group were also analyzed (Table S8). The number of duplicated genes was on behalf of the sizes of gene families which were usually called paralogous genes. As is well-known, plant species with duplicated genomes are more likely to cause gene duplications including PIF genes than those with unduplicated genomes. Thereby, B. oleracea, B. rapa and G. max had more duplicated genes. To further investigate the evolution and duplication events of each group of PIFs, their phylogenetic trees were reconstructed, respectively. The result showed that class I PIFs could be further divided into three subgroups according to the phylogenetic relationship of terrestrial plants, namely subgroup I1, I2, I3, I4 and I5 (Fig. S2). Of these five subgroups, subgroup I1, I2 and I3 contained genes in eudicots, whereas subgroup I4 contained genes from gymnosperms, and subgroup I5 only in monocots. Within the subgroup I3 clade, there were several independent duplications, suggesting that duplication events may have occurred in the ancestor of corresponding lineages (Fig. S2). Likewise, Class II and IV PIFs contained genes only in eudicots, whereas Class III and V PIFs contained genes from green plants and seed plants, respectively (Fig. S3-S6). Class III PIFs fell into seven subgroups, namely subgroup III 1–7 (Fig. S4). Of these subgroups, subgroup III 1–4 and III 5–7 contained genes in eudicots and monocots, respectively. While Class V PIFs were further divided into six subgroups, of which subgroup V1, V2, V4 and V5 contained genes from eudicots, while V3 and V6 only in monocots and gymnosperms, respectively (Fig. S6). Similarly, in the other clades, there were several independent duplications in Brassicales, Fabales and grasses (Figs. S4 and S6), which might be involved with two successive duplication events in Brassicales and at least three duplication events in grasses.

To further elucidate the causes of these duplication events occurred, we surveyed the genomic regions containing PIF genes that may be synteny. Most duplicate gene pairs emerged in syntenic genomic regions, implying that these multiple gene copies were resulted in whole genome or segmental duplications (Fig. 4). In particular, the duplication events of Class II PIFs, PIF2 and PIF6, have happed in the common ancestor of core eudicots which corresponded to the γ WGD (Fig. 4;(Tang et al. 2008)). The two duplications of Class II PIFs in Brassicaceae were associated with the α/β WGDs within the Brassicaceae lineage (Fig. S3; (Tang et al. 2008)). Based on extensive studies of WGD events in many species, these data about their paleopolyploidy histories were collected and deeper analyzed (Fig. 1). We then evaluated the impact of duplication events to the size of Class II PIFs. The result suggested that the WGDs might be beneficial to the orthology expansion of some species. For instance, compared with Medicago truncatula, G. max undergo one more genome polyploidization event, as a result, the soybean genome (14 PIFs) encoded more than two times numbers of family members (6 PIFs). To further confirm the roles of genome duplication in the expansion of gene family, the co-relationship analysis between rounds of genome duplication and encoded Class II PIFs were performed (Fig. S7). The correlation coefficient was counted as 0.939 (P < 0.01) for the Class II PIFs. The results indicated that the WGD prominently promoted to the gene expansion for Class II PIFs.

Fig. 4
figure 4

Examples of the detailed locations of representative pairs of genes duplicated in recent polyploidy events in the syntenic regions. At, Arabidopsis thaliana; Br, Brassica rapa; Pt, Populus trichocarpa; Sp, Schrenkiella parvula; chr, chromosome

Expression profiles of the PIF genes in B. distachyon

Transcript profiling would offer direct clues of functional divergence among all members of a gene family (Whittle and Krochko 2009). In the study, RT-qPCR was performed to investigate the expression patterns of BdPIF genes in various tissues. The results showed that BdPIF4/5A, BdPIF3B and BdPIF8 had expression in all tissues tested. BdPIF4/5C was highly expressed in root and the lowest in leaf, while BdPIF4/5B and BdPIF3A all had no expression in root (Fig. 5), suggesting that they may play distinct roles in the different tissues. To explore their roles in responses to hormone and abiotic stresses, the expression patterns of BdPIFs under ABA, light treatment, heat and UA-B conditions were also executed (Fig. 6). BdPIF4/5A, BdPIF4/5B and BdPIF3A showed increases after shading treatments and peaked at 3 h, then remained at relatively high levels. BdPIF3B exhibited increases and peaked at 12 h after treatment. BdPIF4/5C and BdPIF8 were down-regulated at 1 h after treatment and kept at relatively low levels (Fig. 6a). Moreover, compared other BdPIF genes, BdPIF4/5C displayed the highest expression after heat treatment and peaked at 6 h, then remained at relatively high levels (Fig. 6b). However, all BdPIF genes except BdPIF3B were suppressed after UV-B treatment in spite of the times and extents of response had a difference (Fig. 6c). Furthermore, BdPIF3A and BdPIF3B were not induced by ABA within 1 h and then strongly suppressed at 3 h after treatment, while other BdPIF genes were strongly inhibited within 1 h (Fig. 6d). We also investigated their expression profiles after phytopathogen applications. BdPIF4/5C, BdPIF3A and BdPIF8 were all significantly down-regulated after Guy11, PH14 and F0968 infection, while BdPIF3B only suppressed by Guy11 at 4 h and then recovered to near normal levels (Fig. 6e). In addition, BdPIF4/5A and BdPIF4/5B all revealed a slightly suppressed expression after inoculated pathogen (Fig. 6e).

Fig. 5
figure 5

Expression patterns of BdPIF genes in different tissues. Leaf, stem, seedling and root samples were collected 12 d after germination. Caryopsis samples were collected at their anthesis. The expression levels were normalized to the expression level of seedling. Stars indicate significant difference compared to seedling (P < 0.05). Bars represent SE (n = 3)

Fig. 6
figure 6

Expression profiles of BdPIF genes under various stress treatments (a) Light treatment, (b) Heat, (c) UV-B, (d) ABA and (e) Pathogen infection. The expression levels were normalized by the actin gene. Stars indicate significant difference compared to seedling (P < 0.05). Bars represent SE (n = 3)

BdPIF genes are nucleus localized TFs

To study the subcellular localization of BdPIFs, the six BdPIF open reading frames (ORFs) were cloned in pCAMBIA1300-GFP modified vector, respectively, followed by recombination and in-frame fusion to with GFP at C-terminal and expressed transiently in leaves of N. benthamiana plants and fluorescent signals were observed at 48 hpi (hours post-incubation), by confocal microscopy. Microscopic observations of the N. benthamiana leaves displayed that all BdPIFs-GFP fusion were distributed in the nucleus, being uniformly distributed in this compartmental localization, while GFP alone was localized in ubiquitously throughout the cell without specific compartment (Fig. 7). These results suggested that all BdPIF genes were nucleus-localized TFs, like the localization of Arabidopsis AtPIFs and maize ZmPIFs (Al-Sady et al. 2006; Wu et al. 2019).

Fig. 7
figure 7

Subcellular localization of BdPIFs-GFP. BdPIFs-GFP fusion proteins and GFP alone were each expressed transiently in leaves of N. benthamiana plants after 48 h post infiltration (hpi) and observed under a confocal laser scanning microscope in dark field for green fluorescence, white field for cell morphology and in combination, respectively. Visible, bright-field. Merged, overlay of the GFP and bright-field images. Scale bar = 100 µm

Heterologous expression of BdPIF genes in Arabidopsis plants

To better characterize the physiological functions of BdPIF genes, the genetic studies in B. distachyon with reference to the light responses of plants were performed. The Arabidopsis plants (ecotype Col-0) were successfully transformed with 35S::BdPIF4/5A, 35S::BdPIF4/5B, 35S::BdPIF4/5C, 35S::BdPIF3A, 35S::BdPIF3B and 35S::BdPIF8. The RT-PCR results displayed that all the genes were overexpressed in corresponding transgenic lines (Fig. S8). The transgenic lines obtained were designated BdPIF4/5A-OX (over-expressor), BdPIF4/5B-OX and so on. To accurately assess the results, an Arabidopsis transgenic line 35S::AtPIF4 which showed the long-hypocotyl phenotype (Sun et al. 2012) was employed as positive control. As expected, the various BdPIF-OX transgenic plants showed similar hypocotyls elongation phenotype, especially BdPIF4/5C-OX, although the extents of hypocotyls elongation ranged obviously from one BdPIF-OX to another (Fig. 8a and b). In addition, we also investigated numbers of rosette leaves (RL) and flowering time (Days) of BdPIF4/5C-OX. It was found that three BdPIF4/5C-OX lines showed little numbers of rosette leaves and early flowering phenotype, as similar with phenotype of AtPIF4-OX plants (Fig. S9).

Fig. 8
figure 8

Phenotypic analysis of the transgenic Arabidopsis lines expressing BdPIF genes under normal and light treatment conditions. Photographs (a) hypocotyl lengths (b) of Col-0, AtPIF4-OX and BdPIFs-OX seedlings grown in the 22 ℃ continuous irradiation. Plants were grown at 22 ℃ for 8 days before transfer to dark for 2 days. Control plants were maintained at light. (ce). Analysis of superoxide dismutase (SOD), peroxidase (POD) and catalase (CAT) activities. All experiments were repeated three times. Error bars indicate the standard deviation from triplicate experiments. Asterisks represent a significant difference from the light samples with Student,s tests (P ≤ 0.05)

Moreover, the activities of SOD, POD and CAT of various BdPIF-OX transgenic plants were also performed. The activities of SOD of all BdPIF-OX transgenic plants were lower than that in Col-0 under light treatment condition, which was opposite in AtPIF4-OX, while they had no obvious difference under normal condition except BdPIF4/5B-OX and BdPIF8-OX (Fig. 8c). The activities of POD and CAT of Col-0, AtPIF4-OX and BdPIFs-OX samples all had no apparent change (Fig. 8d and e). Furthermore, we next surveyed the expression of SOD, POD and CAT genes in various BdPIF-OX transgenic plants. A total of nine genes were screened for the present research including three SOD genes (At5g18100, At2g28190 and At3g10920), three POD genes (At5g66390, At5g58400 and At2g18140) and three CAT genes (At4g35090, At1g20620 and At1g20630), which have been verified previously to be related to SOD, POD or CAT activity. The transcripts of almost all SODs, PODs and CATs displayed an increased significantly levels in BdPIF-OX transgenic plants under normal condition (Fig. S10). However, in various BdPIF-OX transgenic plants, the expression levels of all SODs, PODs and CATs showed different responses under light treatment condition, even significantly increased or decreased (Fig. S10). Moreover, SOD activities of different plants and under different conditions were the most significant difference. Therefore, we further analyzed the relationship between SOD gene expression results and actual activities. To evaluate their relationship between SOD gene expression results and actual activities, their correlation analysis was performed (Fig. S11). The correlation coefficient of At5g18100 and At2g28190 achieved a significant difference (Fig. S11). They are positively and negatively correlated under normal and light treatment conditions, respectively. These results indicated that these SODs, PODs and CATs genes were induced by BdPIFs to influence on the SOD, POD and CAT activity under normal condition, especially SOD activity. Taken together, these results suggested that the identified BdPIF genes were counterparts of Arabidopsis PIFs and exhibited a similar biochemical function to the AtPIFs.

Discussion

Evolutionary history and functional evolution of the PIF genes in land plant lineages

Our data suggested that the PIF gene family exist only in land plant lineages and is not even present in M. viride dwelling in subaerial/terrestrial (Fig. 1, Table S4). The PIF gene has expanded and subjected to at least two ancient duplications during the history of land plants. Following the gene duplication, the first expansion occurred after the divergence of Gymnosperms, leading to the emergence of the Clade C (Class I and class II) and other Clades, as consistent with their sequence divergence (Fig. 2a), resulting that Class I and II PIF members may experience a certain functional divergence. For instance, unlike most PIFs suppression of the light response, PIF2 and PIF6 shows a mild activation response under certain conditions (Luo et al. 2014; Penfield et al. 2010). In turn, the second expansion appeared after the divergence of angiosperms, resulting in two main Clades including Clade A (class III and IV) and Clade B (Class V) (Fig. 2) revealing apparent functional redundancy. For example, plants exhibited constitutive photomorphogenic phenotypes only in quadruple mutant (pifq, i.e. PIF1, PIF3, PIF4, and PIF5) in the dark (Leivar et al. 2008b; Shin et al. 2009). In addition, most PIFs inhibited light response and rapidly degraded upon light exposure, while PIF7 was not detected light-induced phosphorylation or degradation (Leivar et al. 2008a). The protein accumulation pattern and function of PIF8 were different from other PIFs under diverse light conditions (Oh et al. 2020). Further studies displayed that all plant PIFs shared highly similar exon/intron organizations with conserved APB motif and bHLH domains within each group (Fig. 3), indicating that these genes of ancient origins still maintained a fairly conservative role. However, expression profiles showed that all BdPIF genes were distinct expressed in different tissues (Fig. 5), indicating that it might be resulted in the evolutionary variation. Taken together, PIF gene members exhibit overlapping and distinct roles in different light responses, similar to the patterns of HD-ZIP III genes (Prigge et al. 2005).

Whole genome duplication contributed to gene expansion of Class II PIFs

Our phylogeny suggested that there were two and/or three paralogs in the orthology groups of Class II PIFs. Further examination displayed that the genomic regions where the PIF genes were within syntenic genomic regions, implying that these multiple copies may be caused by large-scale duplication events such as WGDs or segmental duplications (Fig. 4). In particular, the duplication events of Class II PIFs took place in the common ancestor of core eudicots, corresponding to the γ WGD (Tang et al. 2008). Compared with A. thaliana, Brassica plants have undergone an extra recent whole-genome triplication (WGT) event (Jiao et al. 2012) resulted in the approximately twice as Class II PIF genes in B. rapa and B. oleracea than those of other species (Fig. 1 and Table S4), as in accordance with the discovery in other polyploids, including Arachis hypogaea (Wang et al. 2021) and G. max (Arya et al. 2018). Moreover, the correlation coefficient between the rounds of genome duplication and gene family size was 0.939 (Fig. S7), indicating a positive correlation of them. It have reported that the signaling genes and regulatory genes are more likely to be retained, mainly depending on their environmental adaptation after duplication events, rather than WGDs (Lei et al. 2012; Maere et al. 2005; Van de Peer et al. 2009). Although other forms of duplication also contribute to gene expansion, such as tandem duplication (Jiang et al. 2021), these independent and alike examples elucidated the function of WGDs in the gene expansion of Class II PIF genes, which existed in similar situations in other gene family such as MKK and/or rhomboid (Jiang & Chu. 2018; Li et al. 2015). The fact suggested that the WGDs and polyploidization events were helpful of the gene expansion of Class II PIF genes during their evolution process.

PIFs function as a cellular signaling hub in integrating environmental and hormonal pathways to modulate plant growth and development

Apart from being negative regulators of photomorphogensis, PIFs are also other crucial regulator that integrates various environmental and hormonal signaling for plant growth and development including temperature (Kumar et al. 2012; Xu and Deng. 2020), anthocyanin synthesis and accumulation (Liu et al. 2021; Shin et al. 2007), sugar metabolism (Liu et al. 2011), signaling pathways of plant hormones (GA, BR, NO, JA, Ethylene and auxin) (Leivar and Monte. 2014), and so on. In our study, expression analysis revealed that BdPIF genes could be induced by a variety of abiotic stresses and hormone (Fig. 6). Specially, six BdPIF genes were induced by shading and ABA treatments to produce variant response patterns, while BdPIF4/5C, compared with other BdPIF genes, was significantly up-regulated under heat stress (Fig. 6), as supported by previous studies (Choi and Oh. 2016; Koini et al. 2009). All BdPIF genes were inhibited after phytopathogen infection regarding of a difference in terms of time and extent (Fig. 6). The data suggested that the function of BdPIFs may be relatively conserved in response to external and hormonal signaling.

A growing body of evidence indicates that the identified BdPIF genes are orthologs of AtPIFs, as estimated from a biological perspective (Fig. 8). When the BdPIF TFs are expressed ectopically in Arabidopsis, the phenotype results of the transgenic plants were very similar to the morphology of AtPIF4 over-expressor, especially under light treatment condition, as in accordance with the elongated phenotype of phyB mutant (Leivar et al. 2012). Additionally, the nuclear-localization and conserved APB motif of BdPIFs imply that it may play possible roles in plant growth and development (Figs. 3 and 7). Additionally, DELLA directly interacts with PIF and mediates its degradation in Arabidopsis (Li et al. 2016), while BdPIF3/4 also interacts with BdDELLA proteins (Niu et al. 2019), suggesting that genes from these two different species may have similar functions in both light and gibberellin signaling pathways. Notably, the BdPIF4/5C over-expressor displayed the most pronounced phenotype of hypocotyl elongated, precocious flowering and rosette leaves reduction, as is consistent with those of AtPIF4 over-expressor (Fig. S8) (Kumar et al. 2012). In addition, the expression of BdPIF4/5C and other PIF genes were significantly different under heat stress (Fig. 6). Therefore, our data suggest that BdPIF4/5C may play a vital role in plant adaptation to environmental and hormonal signaling, especially high-temperature, similar to AtPIF4.

Conclusion

A total of 246 PIF genes were identified by bioinformatics methods from 40 land plants based on the conserved bHLH domain plus the existence of APB motif, additionally algae lineages dwelling in subaerial/terrestrial were also analyzed. Phylogenetic analyses showed that the PIF gene family fell into five classes and designated as Class I to Class V. It was noteworthy that the expansion of Class II PIFs benefited by the WGDs followed by diversification during the evolution process. Moreover, molecular evolutionary and protein domain analyses revealed that the functional domains of PIFs were highly evolutionary conserved, especially the APB motif. Furthermore, all BdPIF genes were nucleus-localized TFs like the localization of Arabidopsis AtPIFs. Interestingly, it was found that BdPIF4/5C had the highest expression level after heat treatment and his over-expressor had the most conspicuous hypocotyls elongation and precocious flowering phenotype, which was in accordance with the function and phenotype of AtPIF4. In short, these results provide new viewpoints into the origin and evolutionary history as well as function of plant PIFs.