A Comparison of Phenylpropanoid Pathway Gene Families in Common Bean. Focus on P450 and C4H Genes

Reinprecht, Yarmilla; Perry, Gregory E.; Peter Pauls, K.

doi:10.1007/978-3-319-63526-2_11

Yarmilla Reinprecht⁵,
Gregory E. Perry⁵ &
K. Peter Pauls⁵

Part of the book series: Compendium of Plant Genomes ((CPG))

1182 Accesses
7 Citations

Abstract

The focus of this chapter is on gene families encoding enzymes of phenylpropanoid pathway in common bean. The introductory section contains a short overview of the phenylpropanoid pathway. Section 11.2 introduces major gene families encoding enzymes of this pathway in common bean, soybean, and Arabidopsis in the current annotations of their complete genome sequences (Phaseolus vulgaris v1.0, Glycine max Wm82.a2.v1, and Arabidopsis thaliana TAIR10) deposited in Phytozome 10.2. For each of the 21 enzyme classes, their functional annotations were based on the commonly used Pfam and KOG databases, while the number of genes in each family was based on Phytozome and KEGG databases. Section 11.3 describes cytochrome P450s involved in the phenylpropanoid pathway with particular emphasis on ten families included in the general (central) phenylpropanoid pathway, C4H (family CYP73A), in the lignin/lignan branch, C3H (family CYP98A) and F5H (family CYP84A), in the flavonoid/anthocyanin/proanthocyanidin branch, F3′H (family CYP75B), F3′5′H (family CYP75A), and FNS (family CYP93B), and in the isoflavonoid branch IFS (family CYP93C), I2′H (family CYP81E), F6H (family CYP71D), and D6aH (family CYP93A). The availability of the complete genome sequences enabled a thorough inventory of putative P450 genes encoding enzymes of this metabolic pathway. The P450 gene sequences from common bean were compared to homologs from Arabidopsis and soybean and confirmed with the information published for both soybean and common bean genomes. Cinnamate 4-hydroxylase (C4H) is the first P450 enzyme in the phenylpropanoid pathway and is described in detail in Sect. 11.4. It belongs to the relatively small CYP73A gene family. Genome locations and gene structures including cis-regulatory regions in 5′UTRs (5′ regulatory sequences) are detailed for this family in common bean. In addition, the expression patterns of these genes in different tissues (Phytozome 10.2) and syntenic relationships (Plant Genome Duplication Database) between common bean and soybean were examined. Finally, genes encoding the C4H enzyme in landrace G19833 (Andean gene pool, Phytozome 10.2) and in cultivar OAC Rex (Mesoamerican gene pool) were compared and searched for polymorphisms. These sequence differences can be used to develop C4H gene-based marker(s) to explore the roles of these genes in various processes such as lignin or anthocyanin biosynthesis.

Access provided by CONRICYT-eBooks. Download chapter PDF

Soybean chalcone isomerase: evolution of the fold, and the differential expression and localization of the gene family

Article 11 November 2014

Genome-wide identification, comparative analysis and functional roles in flavonoid biosynthesis of cytochrome P450 superfamily in pear (Pyrus spp.)

Article Open access 03 October 2023

Characterization of genes coding phenylalanine ammonia lyase and chalcone synthase in four Pogostemon cablin cultivars

Article 01 June 2015

Keywords

In this chapter:

Common names in plants: Arabidopsis (Arabidopsis thaliana), soybean (Glycine max), common bean (Phaseolus vulgaris )
Chromosome-based locus (gene model) identifier (Phytozome)

11.1 Introduction

As sessile organisms, plants produce numerous secondary metabolites to overcome biotic and abiotic stressors, to attract pollinators and nitrogen-fixing microorganisms, and to communicate with other plants (Koes et al. 2005; Noel et al. 2005; Moura et al. 2010; Agati et al. 2012, 2013; Baxter and Stewart 2013). Many of these compounds are synthesized by the phenylpropanoid pathway, which is likely one of the most studied pathways in plants. It is relatively well understood and was extensively reviewed (Goujon et al. 2003; Raes et al. 2003; Wang 2011; Falcone Ferreyra et al. 2012; Petrussa et al. 2013). Individual branches of the pathway have been thoroughly characterized. Most of the enzymes that catalyze individual steps of the pathway have been identified, and the genes coding for them have been isolated in a number of plant species, including Arabidopsis and soybean (Graham et al. 2008; Fraser and Chapple 2011).

The core (general or central) pathway consists of three steps, including (1) the conversion of the aromatic amino acid phenylalanine into trans-cinnamic acid, which is catalyzed by phenylalanine ammonia-lyase (PAL); (2) the conversion of trans-cinnamic acid into p-coumaric acid, catalyzed by cinnamate 4-hydroxylase (C4H); and (3) the transformation of p-coumaric acid into p-coumaroyl-CoA, catalyzed by 4-coumarate:CoA ligase (4CL). The compound p-coumaroyl-CoA serves as a starting point for several branches of the phenylpropanoid pathway leading to biosynthesis of lignin, lignans, coumarins, stilbenes, flavonoids, anthocyanin, condensed tannins (proanthocyanidins), and isoflavonoids (Vogt 2010; Cheynier et al. 2013). These products have important functions not only for plant survival, growth, and development but they could also be powerful supplements to the human diet. For example, lignans, stilbenes, and isoflavonoids have been associated with the reduced onset/development of certain chronic disease in humans, including some forms of cancer and heart diseases (Cassidy et al. 2000; Chen et al. 2006; Adlercreutz 2007; Xiao 2008; Brunetti et al. 2013) (Fig. 11.1).

Lignin biosynthesis is a two-step process. First, monolignol is synthesized through a series of hydroxylations, O-methylations, and conversions of side-chain carboxyl into p-coumaryl, coniferyl, and sinapyl alcohols (Humphreys and Chapple 2002; Boerjan et al. 2003; Vanholme et al. 2010; Weng and Chaple 2010; Labeeuw et al. 2015). A second step involves monolignol polymerization by peroxidases (PER), laccases (LAC), and dirigent proteins (DP). In a reversible reaction, hydroxycinnamoyl-CoA:shikimate/quinate hydroxycinnamoyltransferase (HCT) converts p-coumaroyl-CoA and caffeoyl-CoA into their corresponding shikimate/quinate esters, which are then transformed by coumarate 3-hydroxylase (C3H) into their corresponding caffeoyl esters (Schoch et al. 2001). Caffeoyl-CoA O-methyltransferase (CCoAOMT) catalyzes methylation of caffeoyl-CoA to generate feruloyl-CoA. Cinnamoyl-CoA reductase (CCR) converts hydroxycinnamoyl-CoA esters into their corresponding aldehydes, and cinnamyl-alcohol dehydrogenase (CAD) catalyzes the conversion of cinnamyl aldehydes into their corresponding alcohols. Ferulic acid 5-hydroxylase (F5H) converts ferulic acid into 5-hydroxyferulic acid. F5H is also known as coniferaldehyde 5-hydroxylase (CAld5H), since the enzyme preferably transforms coniferaldehyde and/or coniferyl alcohol into synapaldehyde and/or sinapyl alcohol, respectively (Humphreys et al. 1999; Osakabe et al. 1999). Caffeic acid O-methyltransferase (COMT) converts 5-hydroxyconiferaldehyde and/or 5-hydroxyconiferyl alcohol into sinapaldehyde and/or sinapyl alcohol, respectively (Osakabe et al. 1999; Parvathi et al. 2001; Zubieta et al. 2002). COMT was previously thought to be a bifunctional enzyme, methylating caffeic and 5-hydroxyferulic acids.

Chalcone synthase (CHS) is the first enzyme in the flavonoid/anthocyanin branch of the phenylpropanoid pathway. It catalyzes the biosynthesis of chalcone from one molecule of p-coumaroyl-CoA with three molecules of malonyl-CoA. This basic flavonoid structure is then transformed by a set of various isomerases, reductases, hydroxylases, Fe²⁺/2-oxoglutarate-dependent dioxygenases, and transferases into different flavonoids, including flavanones, flavones, flavonols, anthocyanins, and condensed tannins (Winkel-Shirley 2001; Ralston et al. 2005; Ferrer et al. 2008; Saito et al. 2013). CHS and chalcone isomerase (CHI) catalyze the two-step condensation, producing a colorless flavanone (naringenin), which is then oxidized by flavanone 3-hydroxylase (F3H) into the colorless dihydroflavonol (dihydrokaempferol). Subsequent hydroxylation of this compound (at the 3′ or 5′ position of the B-ring), catalyzed by flavonoid 3′-hydroxylase (F3′H) and flavonoid 3′,5′-hydroxylase (F3′5′H), produces dihydroquercetin and dihydromyricetin. These two enzymes (F3′H and F3′5′H) can also hydroxylate flavanone (naringenin) to produce eriodictyol and pentahydroxy-flavanone, which are then hydroxylated by F3H into dihydroquercetin and dihydromyricetin, respectively. The next step in the pathway is the conversion of the three dihydroflavonols (dihydroquercetin, dihydrokaempferol, and dihydromyricetin). These compounds can be transformed into flavonols (kaempferol, quercetin, and myricetin) by flavonol synthases (FLS). Dihydroflavonol 4-reductase (DFR) converts dihydroflavonols into leucoanthocyanidins (colorless flavan-3,4-diols: leucocyanidin, leucopelargonidin, and leucodelphinidin), which are then oxidized by anthocyanin synthase [ANS, also known as leucoanthocyanidin dioxygenase (LDOX)] into colored but unstable anthocyanidins [cyanidin (red-magenta), pelargonidin (orange), and delphinidin (purple-mauve)]. Stable anthocyanins (colored) are produced by glycosylation of these compounds by the UDP-glucose:flavonoid 3-O-glucosyl transferases (UFGT). Some anthocyanins (cyanidin-3-glucoside and delphinidin-3-glucoside) may be further methylated by methyltransferases (MTs) to produce peonidin-3-glucoside and petunidin- or malvidin-3-glucoside, respectively.

Condensed tannins are synthesized through two branches of the anthocyanin pathway. The reduction of leucocyanidin to catechin (2,3-trans flavan-3-ols) is catalyzed by leucoanthocyanidin reductase (LAR), and the conversion of cyanidin into epicatechin (2,3-cis flavan-3-ols) is driven by anthocyanidin reductase (ANR). The subsequent steps catalyzed by polyphenol oxidases and condensing enzymes possibly take place in vacuoles.

Legume-specific isoflavonoids are produced through two branches of the isoflavonoid pathway having major reactions in common. The branch leading to the isoflavone genistein uses the same naringenin intermediate, which is synthesized in the flavonoid/anthocyanin branch of the phenylpropanoid pathway by a two-step condensation catalyzed by CHS and CHI (common to majority of plants) (Lozovaya et al. 2007). On the other hand, isoflavone daidzein is synthesized through the co-action of CHS and legume-specific chalcone reductase (CHR), yielding isoliquiritigenin (trihydroxychalcone), which is then transformed into liquiritigenin (dihydroxyflavanone), a core intermediate of this branch of the isoflavonoid pathway (Austin and Noel 2003). Isoflavone synthase [IFS, also known as 2-hydroxyisoflavanone synthase (2-HIS)] converts flavanone (naringenin or isoliquiritigenin) into 2-hydroxyisoflavanones (through an aryl migration of the aromatic B-ring from C-2 to C-3 position and hydroxylation in position C-2) (Steele et al. 1999; Jung et al. 2000), which are then dehydrated (formation of a double bond between C-2 and C-3) to the corresponding isoflavones (genistein and daidzein) by 2-hydroxyisoflavanone dehydratase (HID) (Akashi et al. 2005; Shimamura et al. 2007). They are further modified by isoflavonoid-specific enzymes to produce major phytoalexins, including medicarpin, biochanin A, glyceollin, pisatin, and maackiain (Latunde-Dada et al. 2001; Lozovaya et al. 2007; Artigot et al. 2013).

Biosynthesis of lignin, flavonoids/anthocyanins/proanthocyanidins, and isoflavonoids is under complex regulation. The expression of the lignin biosynthetic genes is coordinately regulated by a number of transcription factors. The majority of these genes contain a common AC cis-element, which is required for their expression in cells undergoing lignification. NST1/2/3 (NAC secondary wall thickening promoting factor 1/2/3) and Myb26/Myb83 transcription factors act as master switches to regulate biosynthesis of major secondary wall components, including cellulose, xylan, and lignin in Arabidopsis (Zhong and Ye 2009; Zhao and Dixon 2011; Hao and Mohnen 2014; Yoon et al. 2015). In Arabidopsis flavonoid pathway, genes for early biosynthetic enzymes (CHS, CHI, F3H, and F3′H) are regulated by the three functionally redundant R2R3-MYB transcription factors (MYB11, MYB12, and MYB111), while the activation of late biosynthetic genes is controlled by the R2R3-MYB/bHLH/WD40 (MBW) complex (Grotewold 2005; Hartman et al. 2005; Ramsey and Glover 2005; Gonzalez et al. 2008; Gou et al. 2011; Petrussa et al. 2013; Li et al. 2014; Xu et al. 2014, 2015). Genes of legume-specific isoflavonoid branch of phenylpropanoid pathway are regulated by a different set of transcription factors. For example, GmMYB176, a R1 MYB transcription factor, regulates CHS8 expression and isoflavonoid synthesis in soybean (Yi et al. 2010a, b; Dhaubhadel 2011). The constitutive over-expression of LjMYB14 was associated with the activation of dozen of genes coding for enzymes in the core phenylpropanoid pathway and isoflavonoid branch in Lotus japonicus (Shelton et al. 2012). At the same time, the expression of other transcription factors was altered resulting in coordinated down-regulation of the competing biosynthetic pathways.

Genes encoding the major enzymes of the phenylpropanoid pathway have been identified in a number of plant species (Tsai et al. 2006; Tohge et al. 2007; Xu et al. 2009). In most species, enzymes involved in the phenylpropanoid pathway are encoded by gene families of various sizes. For example, plants’ CADs can reduce various aldehydes, including those expressed in response to pathogens (Barakat et al. 2010; Miedes et al. 2014). The nine putative CAD genes that were identified in Arabidopsis are split into three classes based on protein phylogenetic analysis (Raes et al. 2003). Using Southern hybridization of genomic DNA, Ryder et al. (1987) identified six to eight CHS genes in common bean, some of them tightly clustered, which represented different loci, not allelic variation. The soybean CHS gene family consists of nine members (CHS1 to CHS9), some of which are clustered (Akada and Dube 1995; Yi et al. 2010a). They share a high degree of sequence similarity and play different roles in plant development and interactions with environment. Matsumura et al. (2005) mapped eight CHS genes on five linkage groups (A1, A2, B1, DIa, and K) in soybean. Duplicated CHS1 gene was associated with the suppressed seed coat pigmentation in yellow soybean (Senda et al. 2002).

Gene families arise from interspecific hybridization, polyploidization, and local duplication. Genome duplication results in biased gene content (Freeling 2009) and non-random divergence in gene expression (Casneuf et al. 2006; Wang et al. 2012, 2013). After a duplication event, the new gene copy (or the original copy) can retain the same function (subfunctionalization), undergo neo-functionalization, or become non-functional (loss of function) (Lynch and Conery 2000; Hanada et al. 2011; Barker et al. 2012). Gene clusters formed by gene duplication have been frequently found in multigene families, including plant specialized metabolism (Nutzmann and Osbourn 2014, 2015). For example, clusters encoding enzymes of all steps in lignin biosynthesis have been identified in the Eucalyptus grandis EST libraries (Harakava 2005). The authors also predicted co-localization of several phenylpropanoid pathway enzymes including PAL, C4H, 4CL, C3H, and F5H on the endoplasmic reticulum (ER) membrane. This may suggest the existence of metabolons involving P450 multienzyme complexes and channeling of pathway intermediates without their release into the general metabolic pool (Hrazdina and Wagner 1985; Winkel-Shirley 1999; Ralston and Yu 2006; Bassard et al. 2012).

The availability of complete genome sequences enabled genome-wide analyses of the phenylpropanoid pathway genes in several species (Naoumkina et al. 2010). Shi et al. (2010) identified 95 genes (ten gene families) associated with phenylpropanoid pathway in Populus trichocarpa and identified functional redundancy at the transcript level for six lignin biosynthetic genes [PAL, C4H, 4CL, HCT, CCoAOMT, CAld5H (F5H)]. Using an in silico approach, Costa et al. (2003) analyzed the organization and function of phenylpropanoid pathway gene network in Arabidopsis, while Lucheta et al. (2007) focused on genes encoding key enzymes in the flavonoid pathway in Citrus sinensis. Hamberger et al. (2007) conducted genome-wide analysis of phenylpropanoid pathway gene families in poplar and compared them to homologs in Arabidopsis and rice. The focus of these studies was on the genes of the core pathway and the lignin branch. To explore the evolution of phenylpropanoid pathway diversity, Tohge et al. (2013) compared 65 gene families involved in the pathway among 23 species, including Arabidopsis and soybean. Another evolutionary study was focusing on the isoflavonoid pathway (Chu et al. 2014). The research examined nine major isoflavonoid genes in seven plant species, including Arabidopsis, soybean, and common bean. Genes coding for PAL, C4H, 4CL, CHS, and CHI were identified in all analyzed species, while for CHR, IFS, IOMT (isoflavonoid O-methyltransferase), and IFR (isoflavonoid reductase) were confirmed to be legume-specific. Divergent evolutionary patterns were observed among different gene copies of centrally located branch-point enzymes (4CL, CHS, and CHI) regardless of the level of polymorphism or the evolutionary rate.

However, information about this important pathway in common bean is still fragmentary. In our previous study (Reinprecht et al. 2013), 35 phenylpropanoid pathway genes were cloned and mapped in silico in common bean genome (annotation Phaseolus vulgaris v1.0). The work also identified syntenic regions containing phenylpropanoid pathway genes in common bean and soybean (annotation Glycine max v1.1) (Reinprecht et al. 2013). In another study, 22 phenylpropanoid pathway genes have been mapped in the Bat93 × Jalo EEP558 (a core mapping resource for P. vulgaris) and OAC Rex × SVM Taylor recombinant inbred line (RIL) populations (Yadegari 2013). Currently, work on identifying an association between these genes and different seed phenolics in common bean using an association mapping approach is underway. Cytochrome P450 gene family encodes several key enzymes in the phenylpropanoid pathway. Alber and Ehlting (2012) reviewed P450s involved in lignin biosynthesis. The availability of the complete common bean genome sequence allowed Kumar et al. (2015) to identify members of this gene family. The focus of our work was to study gene families encoding enzymes of phenylpropanoid pathway in common bean, using an in silico approach.

11.2 Gene Families Encoding Enzymes of Phenylpropanoid Pathway in Common Bean

Currently, complete genome sequences for 55 plant species, including common bean (Schmutz et al. 2014; current annotation P. vulgaris v1.0), are deposited in Phytozome 10.3 (a comparative genomic database, available at http://phytozome.jgi.doe.gov/pz/portal.html; accessed 16 Nov 2015; Goodstein et al. 2012). This allowed us to study the complete gene families encoding enzymes of phenylpropanoid pathway in common bean, thus extending our previous work (Reinprecht et al. 2013). In particular, we examined their conservation and diversification through comparative analyses with previously sequenced soybean (Schmutz et al. 2010; current annotation G. max Wm82.a2.v1) and Arabidopsis (The Arabidopsis Genome Initiative 2000; Lamesch et al. 2012; current annotation Arabidopsis thaliana TAIR10) genomes. The basic information for the sequenced Arabidopsis, soybean, and common bean genomes is presented in Table 11.1.

Table 11.1 Basic information for the sequenced genomes of A. thaliana, G. max, and P. vulgaris

Full size table

Genome annotations for common bean (Schmutz et al. 2014), soybean (Schmutz et al. 2010), and Arabidopsis (The Arabidopsis Genome Initiative 2000) were obtained from Phytozome 10.2 (Goodstein et al. 2012). For each gene, identifiers and descriptions for all Pfam (Protein families), KEGG (Kyoto Encyclopedia of Genes and Genomes), GO (Gene Ontology), PANTHER (Protein ANalysis THrough Evolutionary Relationships), and KOG (EuKaryotic Orthologous Groups) classifications assigned to this gene can be found.

Table 11.2 contains the list and the number of putative genes in each of the major gene families encoding enzymes of the phenylpropanoid pathway in common bean, soybean, and Arabidopsis in the current annotations of their complete genome sequences (P. vulgaris v1.0, G. max Wm82.a2.v1, and A. thaliana TAIR10) deposited in Phytozome. For each of the 21 enzyme classes, their functional annotations were based on the Pfam and KOG databases (commonly used), while the number of genes in each family was based on Phytozome and KEGG databases. For example, with the KOG0222 search, four phenylalanine ammonia-lyase (PAL, EC:4.3.1.24) genes were identified in Arabidopsis, eight PAL genes were identified in soybean, and six PAL genes were found in common bean (Table 11.2). Several large gene families are involved in phenylpropanoid pathway, including the cytochrome P450 family.

Table 11.2 Major gene families encoding enzymes of phenylpropanoid pathway in A. thaliana, G. max, and P. vulgaris

Full size table

11.3 The Role of Cytochrome P450 Superfamily in Phenylpropanoid Pathway

11.3.1 Cytochrome P450

Cytochromes P450 (CYPs) are ubiquitous monooxygenase enzymes involved in the oxidation of various substrates using oxygen and NADPH. Plant P450s play vital roles in metabolism and detoxification (Mizutani and Ohta 2010; Hamberger and Back 2013). They catalyze reactions in both primary metabolism and secondary metabolism and are involved in the biosynthesis of various metabolites, including fatty acids, sterols, hormones, phenylpropanoids, terpenoids, and signaling molecules. Chemical diversity across plant species is well correlated with the heterogeneity of the P450s (Mizutani and Sato 2011; Mizutani 2012; Sezutsu et al. 2013). They contain a heme cofactor, which absorbs light at 450 nm, and are named for this trait (Pigment absorbing at 450 nm), as well as their cellular localization. Plant P450s are typically membrane-bound to the cytoplasmic surface of the endoplasmic reticulum (ER) by a short N-terminal segment.

The P450s are one of the largest families of enzymes in plants and, in most of plant species, exist as a superfamily. The number of P450 genes is highly variable among plants (Nelson 2006) and represents 0.57–1.07% of the protein coding genes in various plant species [1.07% in Arabidopsis (246/23,000) (Nelson et al. 2004), 0.71% in soybean (332/46,500) (Guttikonda et al. 2010), and 0.78% in common bean (247/31,638) (Kumar et al. 2015)]. The large number of P450s in higher plants is due to gene duplication and diversification (Werck-Reichhart and Feyereisen 2000).

The P450 gene superfamily is characterized by enormous structural and functional diversity (Nelson et al. 2008; Nelson and Werck-Reinchhart 2011; Nagano 2014). Homology and phylogeny were used to group P450s into families (>40% amino acid sequence identity) and subfamilies (>55% amino acid sequence identity) (Nelson et al. 1996). Plant P450 proteins are numbered as CYP51, CYP71 to CYP99, and CYP701 to CYP772. They belong to ten clans (group of genes originated from a single ancestor), which are named by their lowest numbered member [six single-family clans (CYP51, CYP74, CYP97, CP710, CYP711, and CYP727) and four multiple-family clans (CYP71, CYP72, CYP85, and CYP86)] (Werck-Reichhart and Feyereisen 2000; Nelson et al. 2004; Schuler and Werck-Reinchhart 2003; Schuler et al. 2006). Following recommendations of a nomenclature committee (Nelson et al. 1996), the name of P450s consists of a CYP italicized root symbol, followed by a number of the family, a letter of the subfamily and ending by a number of the gene (e.g., CYP71D9—family 71, subfamily D, gene 9), which is determined by the order of identification regardless of the origin.

Initially, P450s were divided into a large A-type clade, which included members that are involved in secondary metabolism (clan CYP71) and several smaller, non-A-type clades, involved in primary metabolism (such as fatty acids and sterols) (Nelson 2006). The occurrence of large numbers of A-type P450s, compared to the non-A-type, suggests a rapid expansion of A-type P450 gene families in plants (Bak et al. 2011).

11.3.2 Clan CYP71—P450s Involved in the Phenylpropanoid Pathway

Based on the current genome annotations [Pfam:00067 (cytochrome P450) functional annotation at Phytozome 10.2; http://phytozome.jgi.doe.gov/pz/portal.html—accessed 26 June 2015], there are 249 P450 genes in A. thaliana TAIR10, 443 P450 genes in G. max Wm82.a.v1, and 264 P450 genes in P. vulgaris v1.0. However, the number of published P450s in these species is slightly different, 272 genes (including 28 pseudogenes) in Arabidopsis (Bak et al. 2011) and 247 genes (including 15 pseudogenes) in common bean (Kumar et al. 2015). P450s in common bean were classified into ten clans that contain 47 families. The largest CYP71 clan (A-type) consists of 19 families with 144 genes. The majority of the genes (>70%) contain a single intron, but more than 20% of the genes have two introns and only a small number of genes (4%) are intronless. In addition, over 80% of the introns are of the zero phase (intron sequence inserted between two successive codons).

It was reported that over 16 P450s are involved in the synthesis and metabolism of phenylpropanoids (Werck-Reichhart 1995). They are placed at the several key positions in the phenylpropanoid pathway, and their roles in phenylpropanoid metabolism were extensively reviewed. For example, Ehlting et al. (2006) and Alber and Ehlting (2012) focused on P450s involved in the core phenylpropanoid pathway and lignin branch, Ayabe and Akashi (2006) in flavonoid metabolism, while Tanaka (2006) and Tanaka and Brugliera (2013) reviewed the role of P450s in flower color.

Seven gene families that encode P450 enzymes involved in phenylpropanoid pathway, as identified in the current genome annotations in common bean, soybean, and Arabidopsis, are listed in Table 11.3. It should be noted, however, that the number of genes in analyzed genomes may change as more work on annotations is done. For example, the CYP71D family in soybean had 81 genes (including 39 pseudogenes) in G. max v1.0 (Nelson 2009) and 52 genes (including 16 pseudogenes) in G. max Wm82.a2.v1. Eleven gene sequences did not correspond between the two genome annotations.

Table 11.3 Clan CYP71 cytochrome P450 gene families encoding enzymes of the phenylpropanoid pathway in A. thaliana, G. max, and P. vulgaris

Full size table

We used the standard nomenclature of chromosome-based locus (gene model) identifiers in plant genome annotations and assemblies (Phytozome), which consists of four segments:

species [AT or At (A. thaliana), Glyma. (G. max), Phvul. (P. vulgaris)],
chromosome number [1 to 5 (A. thaliana), 01 to 20 (G. max), 001 to 011 (P. vulgaris)],
gene (G or g), and
five-digit code [A. thaliana—At2g37040 for phenylalanine ammonia-lyase 1 (PAL1)] or six-digit code [G. max (Glyma.03g181700, PAL1) and P. vulgaris (Phvul.001g177800, PAL1)], numbered from top to bottom of chromosome.

These gene families encode enzymes that catalyze various reactions in different branches of the phenylpropanoid pathway (Fig. 11.1), including

1.
core phenylpropanoid pathway: cinnamate 4-hydroxylase (C4H, CYP73A),
2.
lignin/lignan branch: coumarate 3-hydroxylase (C3H, CYP98A) and ferulic acid 5-hydroxylase (F5H, CYP84A),
3.
anthocyanin/condensed tannin branch: flavonoid 3′-hydroxylase (F3′H, CYP75B), flavonoid 3′,5′-hydroxylase (F3′,5′H, CYP75A), and flavone synthase (FNS, CYP93B), and
4.
isoflavonoid branch: isoflavone synthase (IFS, CYP93C), isoflavone 2′-hydroxylase (I2′H, CYP81E), flavonoid 6-hydroxylase (F6H, CYP71D), and 3,9-dihydroxypterocarpan 6a-monooxygenase (D6aH, CYP93A).

11.3.3 Gene Structure, Conserved Domains, and Motifs of P450s Involved in the Phenylpropanoid Pathway

Seven P450s families (clan CYP71) that encode enzymes in the phenylpropanoid pathway in common bean, soybean, and Arabidopsis contain 135 members, with one to 36 genes per family (Table 11.3). Most of these genes contain introns. Only one gene is intronless (Phvul.009g244000, CYP81E51). The number of introns ranges from one to four. The majority of the genes contain one (63%) or two introns (32%). The proteins that they encode range in size from 408 amino acids (Phvul.001g139500, CYP93A57) to 543 amino acids (Phvul.002g014800, CYP81E44). The protein sequences were aligned using Clustal Omega at EMBL-EBI (http://www.ebi.ac.uk/Tools/msa/clustalo/), and conserved regions were displayed with a sequence logo generated from the alignment using a Web-based WebLogo 3.4 (Crooks et al. 2004; available at http://weblogo.threeplusone.com/). All of the P450 sequences included the following domains: a heme-binding region (FxxGxRxCxG), a PERF motif (PERF/W), a K-helix region (KETRL) involved in defining the heme pockets and stabilizing the protein structure, and an I-helix region (AGxDT) involved in oxygen binding (Fig. 11.2).

11.3.4 Phylogenetic Analysis of P450s Involved in the Phenylpropanoid Pathway

The alignment and tree construction of 135 protein sequences (Table 11.3) from seven P450 gene families (clan CYP71) involved in the phenylpropanoid pathway were performed in MEGA6 (Tamura et al. 2013). These analyses were based on the full-length genes from the three genomes, with one nearly intact soybean C4H pseudogene included (indicated by P at the end of the CYP name—CYP73A88 P). A member from the soybean CYP81E family (CYP81E220de1b, Glyma.16g149200) is truncated (101 amino acids) and was not included in the tree construction.

The phylogenetic tree (Fig. 11.3) separates P450 protein sequences (clan 71) from the two species into seven families:

CYP71—CYP71D is a legume-specific cluster and contains 36 genes in soybean (and 16 pseudogenes, not included) and 21 genes in common bean (and four pseudogenes, not included). A single flavonoid 6-hydroylase (F6H) in common bean was clustered with three F6H proteins in soybean.
CYP73—CYP73A family contains four genes for cinnamic acid 4-hydroxylase (C4H) in soybean (including one pseudogene), three genes in common bean, and a single gene in Arabidopsis. The C4H cluster splits into class I and class II enzymes.
CYP75 family is split into two subfamilies. CYP75A consists of two genes for flavonoid 3′,5′-hydroxylase (F3′5′H) (and one pseudogene, not included) in soybean and two genes in common bean. There are no genes for F3′5′H in Arabidopsis. Subfamily CYP75B contains five genes for flavonoid 3′-hydroxylase (F3′H) (and one pseudogene, not included) in soybean, two genes in common bean, and a single gene for F3′H in Arabidopsis.
CYP81—CYP81E is a legume-specific cluster and consists of 12 genes coding isoflavone 2′-hydroxylase-like (I2′H) genes (and four pseudogenes, not included) in soybean and 12 genes (and two pseudogenes, not included) in common bean.
CYP84—CYP84A cluster contains three genes encoding ferulic acid 5-hydroxylase (F5H) (and one pseudogene, not included) in soybean, three genes in common bean, and two genes in Arabidopsis.
CYP93—The family is clustered into three subfamilies. CYP93A is a legume-specific subfamily. It consists of eight genes for 3,9-dihydropterocarpan 6a-monooxygenase (D6aH) (and two pseudogenes, not included) in soybean and seven genes (and one pseudogene, not included) in common bean. The CYP93B subfamily contains two genes encoding flavonoid synthase (FNS) in soybean and a single gene in common bean. There are no FNS genes in Arabidopsis. CYP93C is a legume-specific branch. It consists of two genes for isoflavone synthase (IFS) in soybean and three genes in common bean.
CYP98—CYP98A cluster consists of two genes for coumarate 3-hydroxylase (C3H) in soybean and single genes in common bean and Arabidopsis genomes, respectively. There are two additional pollen-specific CYP98As in Arabidopsis (CYP98A8 and CYP98A9; Matsuno et al. 2009—not included in tree construction).

11.3.5 Genome Organization of the Clan CYP71 Gene Families Involved in Phenylpropanoid Pathway in Common Bean

A common bean in silico map that contained genes coding for enzymes of phenylpropanoid pathway, including nine P450s, was developed previously (Reinprecht et al. 2013). The map was created by BLASTing the genomic sequences of the phenylpropanoid pathway genes against the whole common bean genome (P. vulgaris v1.0, Phytozome) using the starting nucleotide positions of the resulting alignments with the chromosome as the map positions for each of the gene sequences.

A similar approach was used to develop a common bean P450-based in silico map, which contains 144 P450, clan CYP71 genes. The mapping was initiated with 134 genes that were identified at Phytozome by searching for KOG0156 functional annotations (cytochrome P450 CYP2 subfamily). Selected gene sequences were BLASTed against the complete common bean genome sequence (Phytozome) to identify their locations. Gene identity was confirmed with the published common bean P450s (Kumar et al. 2015), and ten new sequences (not annotated as KOG0156 in Phytozome) were added to the map. Gene families involved in the phenylpropanoid pathway (shown in larger font, color-coded) were found throughout the common bean genome, except for chromosome Pv05 (Fig. 11.4).

Within the same family, P450s are usually grouped into clusters and the structure of the same P450 family is generally conserved (Nelson et al. 2004; Paquette et al. 2009). In the common bean genome, clustering of genes from the same family was noticed on the chromosomes Pv03 for family CYP93C (all three IFS genes) and Pv09 for family CYP81E (three I2′H genes). Some of the CYP71 genes are tandem arranged with at least four genes from the same subfamily in a row. Many of these clustered genes are found in the same orientation on four chromosomes [Pv01 (four CYP712B, all forward), Pv02 (four CYP71D, all forward), Pv04 (ten CYP82A, all forward; five CYP71AU, all reverse; five CYP736A, all reverse) and Pv06 (eight CYP71D, all reverse; four CYP79D, all forward)] but in a different orientation on three chromosomes [Pv03 (four CYP71D), Pv04 (four CYP81E), and Pv06 (four CYP71D)]. However, members of the large CYP71D subfamily clustered in the same orientation on chromosomes Pv02 (four) and Pv06 (eight) but in a different orientation on the chromosomes Pv03 (four) and Pv06 (four). Therefore, the subfamily distribution may not follow a regular pattern. Due to clustered organization, the 144 CYP71 P450 genes (Kumar et al. 2015) were not evenly distributed in the common bean genome. They ranged from two genes on the chromosome Pv05 to 25 genes on the chromosome Pv04 (Fig. 11.4).

11.4 Cinnamate 4-Hydroxylase (C4H, EC:1.14.13.11, CYP73A)

11.4.1 C4H Catalytic Reaction and Position in the Phenylpropanoid Pathway

Cinnamate 4-hydroxylase (trans-cinnamate 4-monooxygenase, C4H, EC:1.14.13.11, CYP73A) is the first P450 enzyme in the phenylpropanoid pathway. It is an ER membrane-bound P450 and belongs to the family of oxidoreductases that act on paired donors with incorporation of molecular oxygen. The enzyme catalyzes an irreversible (and rate-limiting) region-specific hydroxylation of the aromatic ring of trans-cinnamic acid (only at the 4-position or para position) to produce p-coumaric (hydroxycinnamic) acid (Fig. 11.5), a precursor for many phenylpropanoids including flavonoids, phytoallexins, and monolignols (Hahlbrock and Scheel 1989; Anterola and Lewis 2002; Lu et al. 2006). For activity, C4H requires molecular oxygen and a cytochrome P450 reductase (CPR).

Mizutani et al. (1997) isolated a cDNA and a genomic clone encoding cinnamate 4-hydroxylase from Arabidopsis (CYP73A5) and found its coordinated expression with PAL and 4CL genes. Mutations in this gene affected phenylpropanoid metabolism, growth, and development (Schilmiller et al. 2009). The gene was mapped to the lower arm of chromosome 2 and was highly expressed in all Arabidopsis tissues, especially in roots and lignifying cells (Bell-Lelong et al. 1997). Genes targeted by the same transcription factors tend to show similar expression patterns, which usually suggest relationships among the genes. Down-regulation of genes coding for PAL and C4H was associated with reduced lignin content and altered lignin composition in transgenic tobacco (Sewalt et al. 1997). The position of C4H in the phenylpropanoid pathway protein network is shown in Fig. 11.6a. Highly connected proteins have a stable steady-state distribution of gene expression (Fig. 11.6b).

Separation of three common beans, four soybeans, and single Arabidopsis sequences into two groups (Fig. 11.7) confirmed earlier groupings of C4H into class I and class II proteins (Ehlting et al. 2006). This diversification occurred early in the evolution of vascular plants through gene duplication. Common bean and soybean have both classes of C4Hs, while Arabidopsis (Brassicaceae) contains only one gene encoding class I C4H. The alignment of C4H protein sequences (ClustalW2 at EMBL-EBI, available at http://www.ebi.ac.uk/Tools/msa/clustalw2/) revealed high conservation (60–98% identity) among the proteins (85–98% within five C4H class I proteins and 90% between two class II C4H proteins). However, when both monocots and dicots were compared, class I C4H was highly conserved (over 80% protein level), while class II C4Hs were more divergent (less than 70% protein level). This suggests that class I C4Hs “maintained an essential function that does not allow these genes to be lost or even changed much, and it is appealing to assume that this essential function is developmental lignification” (Alber and Ehlting 2012). Class II C4Hs are only present in some plant species, and the class seems to have more specialized functions.

The sequences of eight C4H proteins from common bean, soybean, and Arabidopsis were aligned using Clustal Omega (http://www.ebi.ac.uk/Tools/msa/clustalo/) and BoxShade (http://www.ch.embnet.org/software/BOX_form.html). The sequences were most divergent in their N-terminal membrane anchors. Conserved motifs found in plant P450s (Fig. 11.8, shown in bold) were present in all eight proteins, including proline-rich (PPGP) region, C helix (WrkmR), oxygen binding and activation I-helix (AAIETT), K-helix (EtlR), PERF motif (PeeFrPeRF), and heme-binding region (FgvGrRsCpG) at C-terminus. The only exception is soybean C4H (CYP73A88P) encoded by a pseudogene (Glyma.10g275600). It has truncated N-terminal region, and the generally highly conserved PERF motif has an arginine (R, Arg) to lysine (K, Lys) substitution (Fig. 11.8, highlighted).

Secondary structures of C4H proteins were predicted by programs GOR (Garnier-Osguthorpe-Robson), IV (Garnier et al. 1996; https://npsa-prabi.ibcp.fr/cgi-bin/npsa_automat.pl?page=npsa_gor4.html), and Phyre2 (Protein Homology/analogY Recognition Engine V 2.0) (Kelley et al. 2015; http://www.sbg.bio.ic.ac.uk/phyre2/html/page.cgi?id=index). Transmembrane helices were predicted by program TMHMM-2.0 (TransMembrane prediction using Hidden Markov Models; Krogh et al. 2001; http://www.cbs.dtu.dk/services/TMHMM-2.0/). All proteins have secondary structures similar to the previously published P450s (Graham and Peterson 1999) including alpha helices (blue), beta sheets (red), and random coils (pink) (Fig. 11.9a). They consist of 36–45% alpha helices, 14–18% extended (or beta) strands, and 40–46% random coils. There is a slight difference between the classes of common bean and soybean C4H proteins. Class I C4H proteins contain higher percentages of alpha helices, while class II C4H proteins were predicted to have higher percentages of extended (or beta) strands and random coils. Membrane anchors were predicted for all proteins except for soybean C4H (CYP73A88P) encoded by the pseudogene Glyma.10g026000 (Fig. 11.9b). All C4H proteins are globular proteins as predicted by Phyre2 (Fig. 11.9c). Common bean and soybean C4Hs have tertiary structures similar to the previously identified CYP73A5 in Arabidopsis (At2g30490) and also contain an alpha-domain and a beta-domain (Rupasinghe et al. 2003).

Gene ontology (GO) annotations for C4H proteins (Table 11.4) were predicted using the protein function prediction (PFP), a sequence similarity-based protein function prediction server at Kihara Bioinformatics Laboratory (http://kiharalab.org/; Hawkins et al. 2009). PFP takes into account weakly similar sequences as well as GO term associations observed in known annotations.

Table 11.4 Protein function prediction (PFP) GO terms predicted for common bean, soybean, and Arabidopsis C4H proteins

Full size table

11.4.2 CYP73A Gene Family—Structure and Genome Location of C4H Genes

C4Hs are encoded by the relatively small CYP73A gene family. It consists of three genes in common bean {Phvul.006g079700—CYP73A118, Phvul.007g026000—CYP73A15, and Phvul.008g247400—CYP73A [this P450 was incorrectly named as CYP73A2 in common bean (Kumar et al. 2015); however, CYP73A2 was identified in mung bean (Mizutani et al. 1993), Vigna radiata (previously Phaseolus aureus; recently moved from the genus Phaseolus to Vigna)], four genes (including one pseudogene, Glyma.10g275600—CYP73A88 P) in soybean, and a single gene in Arabidopsis (At2g30490; CYP73A5; REF3).

The gene is well conserved in plants, including soybean and common bean. It contains the Pfam domain (PF00067), found as a “duplication-resistant” gene (Paterson et al. 2006). The first C4Hs were identified in Jerusalem artichoke (Helianthus tuberosus—CYP73A1, GenBank accession Z17369; Teutsch et al. 1993) and mung bean (V. radiata—CYP73A2, GenBank accession L07634; Mizutani et al. 1993). Soybean C4H (CYP73A11), a class I C4H enzyme, was identified as an elicitor-induced cytochrome P450, using differential display of mRNA (Schopfer and Ebel 1998). In contrast, common bean C4H (CYP73A15) was identified as a class II C4H enzyme, whose expression was associated with differentiation (Nedelkina et al. 1999).

The genes coding for C4H in common bean, soybean, and Arabidopsis differ in their exon/intron structures. The exons are conserved, while introns are more variable. Genes encoding class II proteins in common bean and soybean consist of two exons separated by an intron of moderate size (354 and 463 bp, respectively). Both exons are split, resulting in four exons, in the two genes encoding class I C4Hs in soybean. These genes are characterized by a long intron 3 (1499 and 1272 bp, respectively). The class I C4H gene in Arabidopsis and the two genes in common bean all have three exons (Fig. 11.10).

11.4.3 Tissue-Specific Expression of Genes Encoding C4Hs

Using publicly available microarray data, Ehlting et al. (2008) created a tool for co-expression analysis of P450s in Arabidopsis. RNA sequencing (RNA-seq) atlases were developed for both soybean (Severin et al. 2010) and common bean (O’Rurke et al. 2014). Based on RNA-seq data (Phytozome 10), genes encoding C4H are differentially expressed in six common bean and soybean tissues (Fig. 11.11). In general, the expression of the genes encoding class I C4H enzymes [Phvul.008g247400 (CYP73A), Glyma.02g236500 (CYP73A11), and Glyma.14g205200 (CYP73A90)] compared to the class II enzymes [Phvul.007g026000 (CYP73A15) and Glyma.20g114200 (CYP73A87)] was higher in all tissues (flowers, pods, leaves, stems, roots, and nodules). Both common bean and soybean have two copies of genes encoding class I C4H enzymes. In both species, one of the genes (Phvul.008g247400 and Glyma.02g236500) is highly expressed in all tissues. The second copy of the genes (Glyma.14g205200 and Phvul.006g079700) is expressed at lower level. In soybean, Glyma.14g205200 had approximately half of the expression of Glyma.02g236500 in stems, roots, and nodules but very low expression in leaves, pods, and flowers. However, Phvul.006g079700 had very low expression in all common bean tissues compared to Phvul.008g247400 (Fig. 11.11).

Common bean C4H (CYP73A15) was characterized as a class II C4H enzyme, whose expression was more related to differentiation than the responses to stress (Nedelkina et al. 1999). Antisense and sense expression of cDNA coding for a truncated CYP73A15 gene from French bean led to a reduced and delayed production of lignin in tobacco (Blee et al. 2001). Three C4H genes were identified in the P. trichocarpa genome. Two of them (PtrC4H1 and PtrC4H2) were abundant in differentiating xylem, suggesting their importance in monolignol biosynthesis. Transcripts of PtrC4H3 had little or no expression in all examined tissues (Lu et al. 2006).

11.4.4 Cis-Regulatory Regions in 5′UTRs of C4H Genes

In order to understand the functions of individual members of the C4H multigene families, promoters of the common bean and soybean genes were analyzed and compared to Arabidopsis gene (At2g30490) promoter, which have known functions. Promoter sequences [1 kb of 5′ regulatory sequence upstream of the coding region (1 kb 5′UTR flanking region)] of C4H genes were retrieved from Phytozome (10.2) and aligned in Clustal Omega at EMBL-EBI (http://www.ebi.ac.uk/Tools/msa/clustalo/) to search for possible sequence similarities among these sequences in the two C4H classes. The analysis of the 5′ regulatory regions of C4H genes in Arabidopsis, soybean, and common bean C4H genes revealed a moderate degree of divergence in these regions (39–60% identity). Multiple sequence alignment was sent to ClustalW2_Phylogeny to produce a phylogenetic tree, which was visualized in TreeView. Based on the 5′UTR sequences, eight C4Hs were split into two clusters: a three-gene class I C4Hs (Phvul.008g247400, Glyma.02g236500, and Glyma.14g205200) and a two-gene class II C4Hs (Phvul.007g026000 and Glyma.20g114200) clusters. However, Arabidopsis (class I At2g30490), common bean (class I Phvul.006g079700), and soybean (class II pseudogene Glyma.10g275600) were not clearly included in any class (Fig. 11.12).

The 5′UTR sequence of C4H genes was analyzed for potential cis-acting regulatory elements using PlantCARE database (http://bioinformatics.psb.ugent.be/webtools/plantcare/html; Lescot et al. 2002). In total, 69 potential regulatory elements were identified in 5′UTR sequences of eight C4H genes (Fig. 11.13; Table 11.5). Twenty-six (38%) elements were present in four or more genes (Fig. 11.13, color-coded). In addition to the core TATA box and CAAT box (present in all genes), the list included a large number of light-responsive elements (27), as well as elements associated with tissue-specific expression (5), defense and stress responses (6), or hormonal responsiveness (9). A considerable number (14) of predicted regulatory elements were categorized as unknown function (Table 11.5), and two of these (AC II and unnamed_4) were present in all eight C4H genes.

Table 11.5 Potential cis-acting regulatory elements identified in the 5′ regulatory sequences (5′UTR) of C4H genes in common bean, soybean, and Arabidopsis using PlantCARE database

Full size table

A fraction of identified regulatory elements was specific only to class I or class II C4H genes (Fig. 11.13; Table 11.5). Twenty-six elements (37.7%) were present only in class I C4H genes. Four of these elements were identified in all five class I C4H genes. The CGTCA-motif and the TGACG-motif are cis-acting elements involved in the MeJA responsiveness, while the functions of the unnamed_1 and unnamed_3 are unknown. In addition, MBS (a MYB binding site involved in drought inducibility) and O₂ site (cis-acting regulatory element involved in regulation of zein metabolism) were identified in the 5′UTRs of all four legume class I C4H genes. Eleven elements (15.9%) were unique to the class II C4H genes. Three of these elements were identified in both soybean (Glyma.20g114200) and common bean (Phvul.007g026000) C4H genes. The MNF1 and CG motifs are light-responsive elements, while the function of the TATCCAT/C-motif is unknown. Lu et al. (2006) reported that four divergent C4H isoforms play distinct roles in P. trichocarpa. The divergent upstream sequences among the two group PtreC4H genes suggested that the mechanisms of gene regulation might be different.

The identification of the cis-acting sequences regulating differential expression of C4H genes and transcription factors that interact with these sequences in common bean, soybean, and Arabidopsis could lead to an understanding of the mechanism(s) of differential regulation of these highly similar genes in these plant species.

11.4.5 Syntenic Regions Containing Common Bean C4H Genes

The availability of the complete genome sequences for numerous plant species, including soybean (Schmutz et al. 2010) and common bean (Schmutz et al. 2014), allows the organization of the individual genomes to be studied, as well as enables comparison of the genomes at the nucleotide level. The size of the common bean genome (521 Mb) is approximately half of the size of the soybean genome (978 Mb). As a result of at least two rounds of polyploidization [~59 MYA (million years ago) and ~13 MYA], the soybean genome contains significant gene duplications and redundancy (Schmutz et al. 2010). In general, for any gene in common bean, two corresponding homologous genes could potentially be found in soybean. Moreover, because of the shared synteny between the two genomes, regions homologous to regions in two soybean chromosomes were found for all 11 common bean chromosomes, with a minor marker rearrangement and/or sequence orientation (Galeano et al. 2009; McClean et al. 2010; Reinprecht et al. 2013).

Synteny analysis was performed in Plant Genome Duplication Database (PGDD, available at http://chibba.agtec.uga.edu/duplication; Lee et al. 2013) against complete genome sequences available for 47 flowering plant species. Numerous syntenic regions (26–44) with other plant species were found for common bean, soybean, and Arabidopsis class I C4Hs. The blocks were of various sizes, ranging from 14 to 884 gene anchors. For example, common bean C4H on the chromosome Pv06, CYP73A118 (Phvul.006g079700), was syntenic to 44 regions in 31 different plant species including two regions in soybean, poplar, pear, watermelon, rice, kale, sacred lotus, and chickpea, three regions in Chinese cabbage, and four regions in kiwifruit (data not shown). In contrast, only five syntenic blocks were identified for common bean and soybean class II C4Hs. They were syntenic to each other and to another three legumes (Medicago truncatula, Cicer arietinum, and Cajanus cajan).

Several syntenic blocks containing C4H loci were identified among common bean, soybean, and Arabidopsis genomes (Table 11.6; Fig. 11.14). For example, Phvul.006g079700 (encoding common bean class I C4H) was syntenic to other four class I C4Hs: common bean Phvul.008g247400, soybean Glyma.02g236500 and Glyma.14g205200, and Arabidopsis At2g30490. Similarly, common bean class II C4H Phvul.007g026000 was syntenic to two soybean class II C4Hs: Glyma20g.114200 and Glyma.10g275600. They were contained in large syntenic blocks anchored by 641 and 561 genes, respectively (Table 11.6; Fig. 11.14, ). Synteny was also analyzed with SyMap v4.0 (Synteny Mapping and Analysis Program; available at http://www.symapdb.org; Soderlund et al. 2011) to produce circular alignments of multiple common bean and soybean chromosomes (Fig. 11.14, right).

Table 11.6 Syntenic blocks containing C4H loci in genomes of common bean, soybean, and Arabidopsis

Full size table

11.4.6 Sequence Polymorphisms in C4H Genes in Common Bean

Nucleotide polymorphisms for a number of phenylpropanoid pathway genes in various plant species have been described, including Arabidopsis (Savolainen et al. 2000; Aguade 2001; Wright et al. 2003) and maize (Brenner et al. 2010). In the current work, sequences of three C4H genes in the common bean landrace G19833 (Phytozome) were BLASTed against genome sequence of cultivar OAC Rex. The structure of C4H genes identified in OAC Rex was predicted with the HMM-based Fgenesh gene finder (Solovyev et al. 2006; available at http://linux1.softberry.com/berry.phtml?topic=fgenesh&group=programs&subgroup=gfind; accessed: 7 July 2015).

The C4H proteins in the two genotypes were very similar. The proteins encoded by the Phvul.006g079700 gene in G19833 and OAC Rex were identical. A single amino acid difference was identified at position 42 between OAC Rex (I) and G19833 (V) C4H proteins encoded by the Phvul.008g247400 gene (99.8% identity). OAC Rex and G19833 C4H proteins encoded by the Phvul.007g026000 gene were 99.1% identical. Differences were found in five amino acids at positions 4 (V in OAC Rex, F in G19833), 7 (N in OAC Rex, K in G19833), 18 (L in OAC Rex, S in G19833), 54 (K in OAC Rex, N in G19833), and 420 (I in OAC Rex, V in G19833) (data not shown).

The CH4 genomic sequences were also very similar between two common bean genotypes (97.2% identity for Phvul.006g079700, 98.5% identity for Phvul.008g247400, and 98.9% identity for Phvul.007g026000). However, by aligning the CH4 encoding sequences in the two bean genomes (G19833 and OAC Rex), polymorphism (SNPs, insertions, and deletions) was identified for all three C4H genes (Table 11.7; Fig. 11.15).

Table 11.7 C4H gene polymorphism between common beans cultivar OAC Rex and landrace G19833

Full size table

Although polymorphisms were detected in both the coding (one to 11 SNPs, shown in bold) and non-coding regions, the majority of the sequence differences that were identified occurred in the introns and UTRs. For example, the size difference of the Phvul.006g079700 gene (encoding class I C4H, CYP73A118) intron 2 (143 bp) in OAC Rex (272 bp) and G19833 (415 bp) can be used to develop gene-based marker(s). However, the usefulness of these polymorphisms as C4H gene-specific marker needs to be evaluated in additional germplasm from two common bean gene pools.

11.5 Conclusions

The availability of the whole genome sequences allowed us to identify gene families encoding major enzymes of the phenylpropanoid pathway in common bean, soybean, and Arabidopsis. The work focused on C4H, a cytochrome P450 that occupies an entry position in the pathway. Three genes encoding C4H proteins were identified in common bean genome compared to the four genes in soybean. The next step would be to functionally characterize these genes. The availability of the common bean genome sequence also makes it possible to identify and characterize the members of each gene family that are involved in the specific branches of the phenylpropanoid pathway. Furthermore, the identification of transcription factors that activate phenylpropanoid biosynthetic gene families could provide tools to potentially manipulate the amount of different phenylpropanoids in common bean.

References

Adlercreutz H (2007) Lignans and human health. Crit Rev Clin Lab Sci 44:483–525
Article CAS PubMed Google Scholar
Agati G, Azzarello E, Pollastri S, Tattini M (2012) Flavonoids as antioxidants in plants: location and functional significance. Plant Sci 196:67–76
Article CAS PubMed Google Scholar
Agati G, Brunetti C, Di Ferdinando M, Ferrini F, Pollastri S, Tattini M (2013) Functional roles of flavonoids in photoprotection: new evidence, lessons from the past. Plant Physiol Biochem 72:35–45
Google Scholar
Aguade M (2001) Nucleotide sequence variation at two genes of the phenylpropanoid pathway, the Fah1and F3Hgenes, in Arabidopsis thaliana. Mol Biol Evol 18:1–9
Article CAS PubMed Google Scholar
Akada S, Dube SK (1995) Organization of soybean chalcone synthase gene clusters and characterization of a new member of the family. Plant Mol Biol 29:189–199
Article CAS PubMed Google Scholar
Akashi T, Aoki T, Ayabe S (2005) Molecular and biochemical characterization of 2-hydroxyisoflavanone dehydratase. Involvement of carboxylesterase-like proteins in leguminous isoflavone biosynthesis. Plant Physiol 137:882–891
Article CAS PubMed PubMed Central Google Scholar
Alber A, Ehlting J (2012) Cytochrome P450s in lignin biosynthesis. Adv Bot Res 61:113–143
Article CAS Google Scholar
Anterola AM, Lewis NG (2002) Trends in lignin modification: a comprehensive analysis of the effects of genetic manipulations/mutations on lignification and vascular integrity. Phytochemistry 61:221–294
Article CAS PubMed Google Scholar
Artigot M-P, Dayde J, Berger M (2013) Expression of flavonoid 6-hydroxylase candidate genes in normal and mutant soybean genotypes for glycitein content. Mol Biol Rep 40:4361–4369
Article CAS PubMed Google Scholar
Austin MB, Noel JP (2003) The chalcone synthase superfamily of type III polyketide synthases. Nat Prod Rep 20:79–110
Article CAS PubMed Google Scholar
Ayabe S, Akashi T (2006) Cytochrome P450s in flavonoid metabolism. Phytochem Rev 5:271–282
Article CAS Google Scholar
Bak S, Beisson F, Hamberger B, Hofer R, Paquette S, Werck-Reichhart D (2011) Cytochrome P450. Arabidopsis Book 9:e0144. doi:10.1199/tab.0144
Article PubMed PubMed Central Google Scholar
Barakat A, Bagniewska-Zadworna A, Frost CJ, Carlson JE (2010) Phylogeny and expression profiling of CAD and CAD-like genes in hybrid Populus (P. deltoides × P. nigra): evidence from herbivore damage for subfunctionalization and functional divergence. BMC Plant Biol 10:100. doi:10.1186/1471-2229-10-100
Article PubMed PubMed Central CAS Google Scholar
Barker MS, Baute GJ, Liu S-L (2012) Duplications and turnover in plant genomes. In: Wendel JF, Greilhuber J, Dolezal J, Leitch IJ (eds) Plant genome diversity, vol 1. Springer-Verlag, Wien, pp 155–169
Google Scholar
Bassard J-E, Richert L, Geerinck J, Renault H, Duval F, Ullmann P, Schmitt M, Meyer E, Mutterer J, Boerjan W, De Jaeger G, Mely Y, Goossens A, Werck-Reichhart D (2012) Protein-protein and protein-membrane associations in the lignin pathway. Plant Cell 24:4465–4482
Article CAS PubMed PubMed Central Google Scholar
Baxter HL, Stewart N Jr (2013) Effects of altered lignin biosynthesis on phenylpropanoid metabolism and plant stress. Biofuels 4:635–650
Article CAS Google Scholar
Bell-Lelong DA, Cusumano JC, Meyer K, Chapple C (1997) Cinnamate-4-hydroxylase expression in Arabidopsis. Regulation in response to development and the environment. Plant Physiol 113:729–738
Article CAS PubMed PubMed Central Google Scholar
Blee K, Choi JW, O’Connell AP, Jupe SC, Schuch W, Lewis NG, Bolwell GP (2001) Antisense and sense expression of cDNA coding for CYP73A15, a class II cinnamate 4-hydroxylase, leads to a delayed and reduced production of lignin in tobacco. Phytochemistry 57:1159–1166
Article CAS PubMed Google Scholar
Boerjan W, Ralp J, Baucher M (2003) Lignin biosynthesis. Annu Rev Plant Biol 54:519–546. doi:10.1146/annurev.arplant.54.031902.134938
Article CAS PubMed Google Scholar
Brenner EA, Zein I, Chen Y, Andersen JR, Wenzel G, Ouzunova M, Eder J, Darnhofer B, Frei U, Barriere Y, Lubberstedt T (2010) Polymorphisms in O-methyltransferase genes are associated with stover cell wall digestibility in European maize (Zea mays L.). BMC Plant Biol 10:27. doi:10.1186/1471-2229-10-27
Article PubMed PubMed Central CAS Google Scholar
Brunetti C, Di Ferdinando M, Fini A, Pollastri S, Tattini M (2013) Flavonoids as antioxidants and developmental regulators: relative significance in plants and humans. Int J Mol Sci 14:3540–3555. doi:10.3390/ijms14023540
Article CAS PubMed PubMed Central Google Scholar
Cassidy A, Hanley B, Lamuela-Raventos RM (2000) Isoflavones, lignans and stilbenes—origins, metabolism and potential importance to human health. J Sci Food Agric 80:1044–1062
Article CAS Google Scholar
Casneuf T, De Bodt S, Raes J, Maere S, Van de Peer Y (2006) Nonrandom divergence of gene expression following gene and genome duplications in the flowering plant Arabidopsis thaliana. Genome Biol 7:R13
Article PubMed PubMed Central CAS Google Scholar
Chen J, Wang L, Thompson LU (2006) Flaxseed and its components reduce metastasis after surgical excision of solid human breast tumor in nude mice. Cancer Lett 234:168–175
Article CAS PubMed Google Scholar
Cheynier V, Comte G, Davies KM, Lattanzio V, Martens S (2013) Plant phenolics: recent advances on their biosynthesis, genetics, and ecophysiology. Plant Physiol Biochem 72:1–20
Article CAS PubMed Google Scholar
Chu S, Wang J, Cheng H, Yang Q, Yu D (2014) Evolutionary study of the isoflavonoid pathway based on multiple copies analysis in soybean. BMC Genet 15:76
Article PubMed PubMed Central Google Scholar
Costa MA, Collins RE, Anterola AM, Cochrane FC, Davin LB, Lewis NG (2003) An in silico assessment of gene function and organization of the phenylpropanoid pathway metabolic networks in Arabidopsis thaliana and limitations thereof. Phytochemistry 64:1097–1112
Article CAS PubMed Google Scholar
Crooks GE, Hon G, Chandonia JM, Brenner SE (2004) WebLogo: a sequence logo generator. Genome Res 14:1188–1190
Article CAS PubMed PubMed Central Google Scholar
Dhaubhadel S (2011) Regulation of isoflavonoid biosynthesis in soybean seeds. In: Ng T-B (ed) Soybean—biochemistry, chemistry and physiology. InTech, http://www.intechopen.com
Ehlting J, Hamberger B, Milliom-Rousseau R, Werck-Reichhart D (2006) Cytochrome P450 in phenolic metabolism. Phytochem Rev 5:239–270
Article CAS Google Scholar
Ehlting J, Sauveplane V, Olry A, Ginglinger J-F, Provart NJ, Werck-Reichhart D (2008) An extensive (co-)expression analysis tool for the cytochrome P450 superfamily in Arabidopsis thaliana. BMC Plant Biol 8:47. doi:10.1186/1471-2229-8-47
Falcone Ferreyra ML, Rius SP, Casati P (2012) Flavonoids: biosynthesis, biological functions, and biotechnological applications. Front Plant Sci 3:222. doi:10.3389/fpls.2012.00222
CAS PubMed PubMed Central Google Scholar
Ferrer JL, Austin MB, Stewart C Jr, Noel JP (2008) Structure and function of enzymes involved in the biosynthesis of phenylpropanoids. Plant Physiol Biochem 46:356–370. doi:10.1016/j.plaphy.2007.12.009
Article CAS PubMed Google Scholar
Fraser CM, Chapple C (2011) The phenylpropanoid pathway in Arabidopsis. Arabidopsis Book 9:e0152. doi:10.1199/tab.0152
Article PubMed PubMed Central Google Scholar
Freeling M (2009) Bias in plant gene content following different sorts of duplication: tandem, whole-genome, segmental or by transposition. Annu Rev Plant Biol 60:433–453
Article CAS PubMed Google Scholar
Galeano CH, Fernandez AC, Gomez M, Blair MW (2009) Single strand conformation polymorphism based SNP and Indel markers for genetic mapping and synteny analysis of common bean (Phaseolus vulgaris L.). BMC Genom 10:629. doi:10.1186/1471-2164-10-629
Article CAS Google Scholar
Garnier J, Gibrat JF, Robson B (1996) GOR method for predicting protein secondary structure from amino acid sequence. Methods Enzymol 266:540–553
Article CAS PubMed Google Scholar
Gonzalez A, Zhao M, Leavitt JM, Lloyd AM (2008) Regulation of the anthocyanin biosynthetic pathway by the TTG1/bHLH/Myb transcriptional complex in Arabidopsis seedlings. Plant J 53:814–827
Article CAS PubMed Google Scholar
Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, Mitros T, Dirks W, Hellsten U, Putnam N, Rokhsar DS (2012) Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res 40:D1178–D1186
Article CAS PubMed Google Scholar
Gou J-Y, Felippes FF, Liu C-J, Weigel D, Wang J-W (2011) Negative regulation of anthocyanin biosynthesis in Arabidopsis by a miR156-targeted SPL transcription factor. Plant Cell 23:1512–1522
Article CAS PubMed PubMed Central Google Scholar
Goujon T, Sibout R, Eudes A, MacKay J, Jouanin L (2003) Genes involved in the biosynthesis of lignin precursors in Arabidopsis thaliana. Plant Physiol Biochem 41:677–687
Article CAS Google Scholar
Graham SE, Peterson JA (1999) How similar are P450s and what can their differences teach us? Arch Biochem Biophys 369:24–29
Article CAS PubMed Google Scholar
Graham T, Graham M, Yu O (2008) Genomics of secondary metabolism in soybean. In: Stacey G (ed) Genetics and genomics of soybean. Springer Science+Business Media, LLC, Berlin, pp 211–241
Chapter Google Scholar
Grotewold E (2005) Plant metabolic diversity: a regulatory perspective. Trends Plant Sci 10:57–62
Article CAS PubMed Google Scholar
Guttikonda SK, Trupti J, Bisht NC, Chen H, An Y-QC, Pandey S, Xu D, Yu O (2010) Whole genome co-expression analysis of soybean cytochrome P450 genes identifies nodulation-specific P450 monooxygenases. BMC Plant Biol 12:243
Article CAS Google Scholar
Hahlbrock K, Scheel D (1989) Physiology and molecular biology of phenylpropanoid metabolism. Annu Rev Plant Physiol Plant Mol Biol 40:347–369
Article CAS Google Scholar
Hamberger B, Bak S (2013) Plant P450s as versatile drivers for evolution of species-specific chemical diversity. Philos Trans R Soc B 368:20120426
Article CAS Google Scholar
Hamberger B, Ellis M, Friedmann M, de Azevedo Souza C, Barbazuk B, Douglas CJ (2007) Genome-wide analyses of phenylpropanoid-related genes in Populus trichocarpa, Arabidopsis thaliana, and Oryza sativa: the Populus lignin toolbox and conservation and diversification of angiosperm gene families. Can J Bot 85:1182–1201
Article CAS Google Scholar
Hanada K, Sawada Y, Kuromori T, Klausnitzer R, Saito K, Toyoda T, Shinozaki K, Li W-H, Hirai MY (2011) Functional compensation of primary and secondary metabolites by duplicate genes in Arabidopsis thaliana. Mol Biol Evol 28:377–382
Article CAS PubMed Google Scholar
Hao Z, Mohnen D (2014) A review of xylan and lignin biosynthesis: foundation for studying Arabidopsis irregular xylem mutants with pleiotropic phenotypes. Crit Rev Biochem Mol Biol 49:212–241
Article CAS PubMed Google Scholar
Harakava R (2005) Genes encoding enzymes of the lignin biosynthesis pathway in Eucalyptus. Genet Mol Biol 28:601–607
Article CAS Google Scholar
Hartmann U, Sagasser M, Mehrtens F, Stracke R, Weisshaar B (2005) Differential combinatorial interactions of cis-acting elements recognized by R2R3-MYB, BZIP, and BHLH factors control light-responsive and tissue-specific activation of phenylpropanoid biosynthesis genes. Plant Mol Biol 57:155–171
Article CAS PubMed Google Scholar
Hawkins T, Chitale M, Luban S, Kihara D (2009) PFP: automated prediction of Gene Ontology functional annotations with confidence scores using protein sequence data. Proteins 74:566–582
Article CAS PubMed Google Scholar
Hrazdina G, Wagner GJ (1985) Metabolic pathways as enzyme complexes: evidence for the synthesis of phenylpropanoids and flavonoids on membrane associated enzyme complexes. Arch Biochem Biophys 237:88–100
Article CAS PubMed Google Scholar
Humphreys JM, Hemm MR, Chapple C (1999) New routes for lignin biosynthesis defined by biochemical characterization of recombinant ferulate 5-hydroxylase, a multifunctional cytochrome P450-dependent monooxygenase. Proc Natl Acad Sci U S A 96:10045–10050
Article CAS PubMed PubMed Central Google Scholar
Humphreys JM, Chapple C (2002) Rewriting the lignin roadmap. Curr Opin Plant Biol 5:224–229
Article CAS PubMed Google Scholar
Jung W, Yu O, Lau SM, O’Keefe DP, Odell J, Fader G, McGonigle B (2000) Identification and expression of isoflavone synthase, the key enzyme for biosynthesis of isoflavones in legumes. Nat Biotechnol 18:208–212
Article CAS PubMed Google Scholar
Kelley LA, Mezulis S, Yates CM, Wass MN, Sternberg MJE (2015) The Phyre2 web portal for protein modeling, prediction and analysis. Nat Protoc 10:45–858. doi:10.1038/nprot.2015.053
Article CAS Google Scholar
Koes R, Verweij W, Quattrocchio F (2005) Flavonoids: a colorful model for the regulation and evolution of biochemical pathways. Trends Plant Sci 10:236–242
Article CAS PubMed Google Scholar
Krogh A, Larsson B, von Heijne G, Sonnhammer EL (2001) Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol 305:567–580. doi:10.1006/jmbi.2000.4315
Article CAS PubMed Google Scholar
Kumar MS, Chakravarthy SS, Babu PR, Rao KV, Reddy VD (2015) Classification of cytochrome P450s in common bean (Phaseolus vulgaris L.). Plant Syst Evol 301:211–216
Article CAS Google Scholar
Labeeuw L, Martone LT, Boucher Y, Case RJ (2015) Ancient origin of the biosynthesis of lignin precursors. Biol Direct 10:23. doi:10.1186/s13062-015-0052-y
Article PubMed PubMed Central CAS Google Scholar
Lamesch P, Berardini TZ, Li D, Swarbreck D, Wilks C, Sasidharan R, Muller R, Dreher K, Alexander DL, Garcia-Hernandez M, Karthikeyan AS, Lee CH, Nelson WD, Ploetz L, Singh S, Wensel A, Huala E (2012) The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools. Nucleic Acids Res 40:D1202–D1210
Article CAS PubMed Google Scholar
Latunde-Dada AO, Cabello-Hurtado F, Czittrich N, Didierjean L, Schopfer C, Hertkorni N, Werck-Reichhart D, Ebel J (2001) Flavonoid 6-hydroxylase from soybean (Glycine max L.), a novel plant P-450 monooxygenase. J Biol Chem 276:1688–1695
Google Scholar
Lee TH, Tang H, Wang X, Paterson AH (2013) PGDD: a database of gene and genome duplication in plants. Nucleic Acids Res. doi:10.1093/nar/gks1104
Google Scholar
Lescot M, Déhais P, Moreau Y, De Moor B, Rouzé P, Rombauts S (2002) PlantCARE: a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res 30:325–327
Article CAS PubMed PubMed Central Google Scholar
Li S (2014) Transcriptional control of flavonoid biosynthesis—fine-tuning of the MYB-bHLH-WD40 (MBW) complex. Plant Sig Behav 9:e27522
Article CAS Google Scholar
Lozovaya VV, Lygin AV, Zernova OV, Ulanov AV, Li S, Hartman GL, Widholm JM (2007) Modification of phenolic metabolism in soybean hairy roots through down regulation of chalcone synthase or isoflavone synthase. Planta 225:665–679
Article CAS PubMed Google Scholar
Lu S, Zhou Y, Li L, Chiang VL (2006) Distinct roles of cinnamate 4-hydroxylase genes in Populus. Plant Cell Physiol 47:905–914
Article CAS PubMed Google Scholar
Lucheta AR, Silva-Pinhati ACO, Basílio-Palmieri AC, Berger IJ, Freitas-Astúa J, Cristofani M (2007) An in silico analysis of the key genes involved in flavonoid biosynthesis in Citrus sinensis. Genet Mol Biol 30:819–831
Google Scholar
Lynch M, Conery JS (2000) The evolutionary fate and consequences of duplicate genes. Science 290:151–1155
Article Google Scholar
Matsumura H, Watanabe S, Harada K, Senda M, Akada S, Kawasaki S, Dubouzet EG, Minaka N, Takahashi R (2005) Molecular linkage mapping and phylogeny of the chalcone synthase multigene family in soybean. Theor Appl Genet 110:1203–1209
Article CAS PubMed Google Scholar
Matsuno M, Compagnon V, Schoch GA, Schmitt M, Debayle D, Bassard J-E, Pollet B, Hehn A, Heintz D, Ullmann P, Lapierre C, Bernier F, Ehlting J, Werck-Reichhart D (2009) Evolution of a novel phenolic pathway for pollen development. Science 324:1688–1692
Article CAS Google Scholar
McClean PE, Mamidi S, McConnell M, Chikara S, Lee R (2010) Synteny mapping between common bean and soybean reveals extensive blocks of shared loci. BMC Genom 11:184. doi:10.1186/1471-2164-11-184
Article CAS Google Scholar
Miedes E, Vanholme R, Boerjan W, Molina A (2014) The role of the secondary cell wall in plant resistance to pathogens. Front Plant Sci 5:358. doi:10.3389/fpls.2014.00358
Article PubMed PubMed Central Google Scholar
Mizutani M (2012) Impacts of diversification of cytochrome P450 on plant metabolism. Biol Pharm Bull 35:824–832
Article CAS PubMed Google Scholar
Mizutani M, Ohta D (2010) Diversification of P450 genes during land plant evolution. Annu Rev Plant Biol 61:291–315
Article CAS PubMed Google Scholar
Mizutani M, Ohta D, Sato R (1997) Isolation of a cDNA and a genomic clone encoding cinnamate 4-hydroxylase from Arabidopsis and its expression manner in plants. Plant Physiol 113:755–763
Article CAS PubMed PubMed Central Google Scholar
Mizutani M, Sato F (2011) Unusual P450 reactions in plant secondary metabolism. Arch Biochem Biophys 507:94–203
Article CAS Google Scholar
Mizutani M, Ward E, DiMaio J, Ohta D Ryals J, Sato R (1993) Molecular cloning and sequencing of a cDNA encoding mung bean cytochrome P450 (P450_C4H) possessing cinnamate 4-hydroxylase activity. Biochem Biophys Res Commun 190:875–880
Google Scholar
Moura JC, Bonine CA, de Oliveira Fernandes Viana J, Dornelas MC, Mazzafera P (2010) Abiotic and biotic stresses and changes in the lignin content and composition in plants. J Integr Plant Biol 52:360–376. doi:10.1111/j.1744-7909.2010.00892.x
Article CAS PubMed Google Scholar
Nagano S (2014) Structural and functional diversity of cytochrome P450. In: Yamazaki H (ed) Fifty years of cytochrome P450 research. Springer, Japan, pp 95–106. doi:10.1007/978-4-431-54992-5_5
Naoumkina MA, Zhao Q, Gallego-Giraldo L, Dai X, Zhao PX, Dixon RA (2010) Genome-wide analysis of phenylpropanoid defence pathways. Mol Plant Pathol 11:829–846
CAS PubMed Google Scholar
Nedelkina S, Jupe SC, Blee KA, Schalk M, Werck-Reichhart D, Bolwell GP (1999) Novel characteristics and regulation of a divergent cinnamate 4-hydroxylase (CYP73A15) from French bean: engineering expression in yeast. Plant Mol Biol 39:1079–1090
Article CAS PubMed Google Scholar
Nelson DR (2006) Cytochrome P450 nomenclature, 2004. Method Mol Biol 320:1–10
CAS Google Scholar
Nelson DR (2009) The cytochrome P450 homepage. Hum Genomics 4:59–65
CAS PubMed PubMed Central Google Scholar
Nelson DR, Koymans L, Kamataki T, Stegeman JJ, Feyereisen R, Waxman DJ, Waterman MR, Gotoh O, Coon MJ, Estabrook RW, Gunsalus IC, Nebert DW (1996) P450 superfamily: update on new sequences, gene mapping, accession numbers and nomenclature. Pharmacogenetics 6:1–42
Article CAS PubMed Google Scholar
Nelson DR, Ming R, Alam M, Schuler MA (2008) Comparison of cytochrome P450 genes from six plant genomes. Trop Plant Biol 1:216–235
Article CAS Google Scholar
Nelson DR, Schuler MA, Paquette SM, Werck-Reichhart D, Bak S (2004) Comparative genomics of rice and Arabidopsis. Analysis of 727 cytochrome P450 genes and pseudogenes from a monocot and a dicot. Plant Physiol 135:756–772
Article CAS PubMed PubMed Central Google Scholar
Nelson DR, Werck-Reichhart D (2011) A P450-centric view of plant evolution. Plant J 66:194–211
Article CAS PubMed Google Scholar
Noel JP, Austin MB, Bomati EK (2005) Structure-function relationships in plant phenylpropanoid biosynthesis. Curr Opin Plant Biol 8:249–253
Article CAS PubMed PubMed Central Google Scholar
Nutzmann H-W, Osbourn A (2014) Gene clustering in plant specialized metabolism. Curr Opin Biotechnol 26:91–99
Article CAS PubMed Google Scholar
Nutzmann H-W, Osbourn A (2015) Regulation of metabolic gene clusters in Arabidopsis thaliana. New Phytol 205:503–510. doi:10.1111/nph.13189
Article PubMed CAS Google Scholar
O’Rourke JA, Iniguez LP, Fu F, Bucciarelli B, Miller SS, Jackson SA, McClean PE, Li J, Dai X, Zhao PX, Hernandez G, Vance CP (2014) An RNA-Seq based gene expression atlas of the common bean. BMC Genom 15:866
Article Google Scholar
Osakabe K, Tsao CC, Li L, Popko JL, Umezawa T, Carraway DT, Smeltzer RH, Joshi CP, Chiang VL (1999) Coniferyl aldehyde 5-hydroxylation and methylation direct syringyl lignin biosynthesis in angiosperms. Proc Natl Acad Sci U S A 96:8955–8960
Article CAS PubMed PubMed Central Google Scholar
Paquette SM, Jensen K, Bak S (2009) A web-based resource for the Arabidopsis P450, cytochromes b5, NADPH-cytochrome P450 reductases, and family 1 glycosyltransferases (http://www.P450.kvl.dk). Phytochemistry 70:1940–1947
Parvathi K, Chen F, Guo D, Blount JW, Dixon RA (2001) Substrate preferences of O-methyltransferases in alfalfa suggest new pathways for 3-O-methylation of monolignols. Plant J 25:193–202
Article CAS PubMed Google Scholar
Paterson AH, Chapman BA, Kissinger JC, Bowers JE, Feltus FA, Estill JC (2006) Many genes and domain families have convergent fates following independent whole-genome duplication events in Arabidopsis, Oryza, Saccharomyces and Tetraodon. Trends Genet 22:597–602
Article CAS PubMed Google Scholar
Petrussa E, Braidot E, Zancani M, Peresson C, Bertolini A, Patui S, Vianello A (2013) Plant flavonoids-Biosynthesis, transport and involvement in stress responses. Int J Mol Sci 14:14950–14973. doi:10.3390/ijms140714950
Article PubMed PubMed Central CAS Google Scholar
Raes J, Rohde A, Christensen JH, Van de Peer Y, Boerjan W (2003) Genome-wide characterization of the lignification toolbox in Arabidopsis. Plant Physiol 133:1051–1071
Article CAS PubMed PubMed Central Google Scholar
Ralston L, Subramanian S, Matsuno M, Yu O (2005) Partial reconstruction of flavonoid and isoflavonoid biosynthesis in yeast using soybean type I and type II chalcone isomerases. Plant Physiol 137:1375–1388
Article CAS PubMed PubMed Central Google Scholar
Ralston L, Yu O (2006) Metabolons involving plant cytochrome P450s. Phytochem Rev 5:459–472
Article CAS Google Scholar
Ramsay NA, Glover BJ (2005) MYB-bHLH-WD40 protein complex and the evolution of cellular diversity. Trends Plant Sci 10:63–70
Article CAS PubMed Google Scholar
Reinprecht Y, Yadegari Z, Perry GE, Siddiqua M, Wright LC, McClean PE, Pauls KP (2013) In silico comparison of genomic regions containing genes coding for enzymes and transcription factors for the phenylpropanoid pathway in Phaseolus vulgaris L. and Glycine max L. Merr. Front Plant Sci 4:317. doi:10.3389/fpls.2013.00317
Article PubMed PubMed Central Google Scholar
Rupasinghe S, Baudry J, Mary A, Schuler MA (2003) Common active site architecture and binding strategy of four phenylpropanoid P450s from Arabidopsis thaliana as revealed by molecular modeling. Protein Eng 16:721–731. doi:10.1093/protein/gzg094
Article CAS PubMed Google Scholar
Ryder TB, Hedrick SA, Bell JN, Liang XW, Clouse SD, Lamb CJ (1987) Organization and differential activation of a gene family encoding the plant defense enzyme chalcone synthase in Phaseolus vulgaris. Mol Gen Genet 210:219–233
Article CAS PubMed Google Scholar
Saito K, Yonekura-Sakakibara K, Nakabayashi R, Higashi Y, Yamazaki M, Tohge T, Fernie AR (2013) The flavonoid biosynthetic pathway in Arabidopsis: structural and genetic diversity. Plant Physiol Biochem 72:21–34
Article CAS PubMed Google Scholar
Savolainen O, Langley CH, Lazzaro BP, Freville H (2000) Contrasting patterns of nucleotide polymorphism at the alcohol dehydrogenase locus in the outcrossing Arabidopsis lyrata and the selfing Arabidopsis thaliana. Mol Biol Evol 17:645–655
Article CAS PubMed Google Scholar
Schilmiller AL, Stout J, Weng J-K, Humphreys J, Ruegger MO, Chapple C (2009) Mutations in the cinnamate 4-hydroxylase gene impact metabolism, growth and development in Arabidopsis. Plant J 60:771–782
Article CAS PubMed Google Scholar
Schmutz J, Cannon SB, Schlueter J, Ma J, Mitros T, Nelson W, Hyten DL, Song Q, Thelen JJ, Cheng J, Xu D, Hellsten U, May GD, Yu Y, Sakurai T, Umezawa T, Bhattacharyya MK, Sandhu D, Valliyodan B, Lindquist E, Peto M, Grant D, Shu S, Goodstein D, Barry K, Futrell-Griggs M, Abernathy B, Du J, Tian Z, Zhu L, Gill N, Joshi T, Libault M, Sethuraman A, Zhang XC, Shinozaki K, Nguyen HT, Wing RA, Cregan P, Specht J, Grimwood J, Rokhsar D, Stacey G, Shoemaker RC, Jackson SA (2010) Genome sequence of the palaeopolyploid soybean. Nature 463:178–183
Article CAS PubMed Google Scholar
Schmutz J, McClean PE, Mamidi S, Wu GA, Cannon SB, Grimwood J, Jenkins J, Shu S, Song Q, Chavarro C, Torres-Torres M, Geffroy V, Moghaddam SM, Gao D, Abernathy B, Barry K, Blair M, Brick MA, Chovatia M, Gepts P, Goodstein DM, Gonzales M, Hellsten U, Hyten DL, Jia G, Kelly JD, Kudrna D, Lee R, Richard MM, Miklas PN, Osorno JM, Rodrigues J, Thareau V, Urrea CA, Wang M, Yu Y, Zhang M, Wing RA, Cregan PB, Rokhsar DS, Jackson SA (2014) A reference genome for common bean and genome-wide analysis of dual domestications. Nat Genet 46:707–713
Article CAS PubMed Google Scholar
Schoch G, Goepfert S, Morant M, Hehn A, Meyer D, Ullmann P, Werck-Reichhart D (2001) CYP98A3 from Arabidopsis thaliana is a 3′-hydroxylase of phenolic esters, a missing link in the phenylpropanoid pathway. J Biol Chem 276:36566–36574
Article CAS PubMed Google Scholar
Schopfer CR, Ebel J (1998) Identification of elicitor-induced cytochrome P450s of soybean (Glycine max L.) using differential display of mRNA. Mol Gen Genet 258:315–322
Article CAS PubMed Google Scholar
Schuler MA, Werck-Reichhart D (2003) Functional genomics of P450s. Annu Rev Plant Biol 54:629–667
Article CAS PubMed Google Scholar
Schuler MA, Duan H, Bilgin M, Ali S (2006) Arabidopsis cytochrome P450s through the looking glass: a window on plant biochemistry. Phytochem Rev 5:205–237
Article CAS Google Scholar
Senda M, Jumonji A, Yumoto S, Ishikawa R, Harada T, Niizeki M, Akada S (2002) Analysis of the duplicated CHS1 gene related to the suppression of the seed coat pigmentation in yellow soybeans. Theor Appl Genet 104:1086–1091
Article CAS PubMed Google Scholar
Severin AJ, Woody JL, Bolon Y-T, Joseph B, Diers BW, Farmer AD, Muehlbauer GJ, Nelson RT, Grant D, Specht JE, Graham MA, Cannon SB, May GD, Vance CP, Shoemaker RC (2010) RNA-Seq atlas of Glycine max: a guide to the soybean transcriptome. BMC Plant Biol 10:160
Article PubMed PubMed Central CAS Google Scholar
Sewalt VJH, Ni W, Blount JW, Jung HG, Masoud SA, Howles PA, Lamb C, Dixon RA (1997) Reduced lignin content and altered lignin composition in transgenic tobacco down-regulated in expression of l-phenylalanine ammonia-lyase or cinnamate 4-hydroxylase. Plant Physiol 115:41–50
Article CAS PubMed PubMed Central Google Scholar
Sezutsu H, Le Goff G, Feyereisen R (2013) Origins of P450 diversity. Philos Trans R Soc B 368:20120428
Article CAS Google Scholar
Shelton D, Stranne M, Mikkelsen L, Pakseresht N, Welham T, Hiraka H, Tabata S, Sato S, Paquette S, Wang TL, Martin C, Bailey P (2012) Transcription factors of lotus: regulation of isoflavonoid biosynthesis requires coordinated changes in transcription factor activity. Plant Physiol 159:531–547
Article CAS PubMed PubMed Central Google Scholar
Shi R, Sun Y-H, Li Q, Heber S, Seferoff R, Chiang VL (2010) Towards a system approach for lignin biosynthesis in Populus trichocarpa: transcript abundance and specificity of the monolignol biosynthetic genes. Plant Cell Physiol 51:144–163
Article CAS PubMed Google Scholar
Shimamura M, Akashi T, Sakurai N, Suzuki H, Saito K, Shibata D, Ayabe S, Aoki T (2007) 2-hydroxyisoflavanone dehydratase is a critical determinant of isoflavone productivity in hairy root cultures of Lotus japonicus. Plant Cell Physiol 48:1652–1657
Article CAS PubMed Google Scholar
Soderlund C, Bomhoff M, Nelson WM (2011) SyMAP v3.4: a turnkey synteny system with application to plant genomes. Nucleic Acids Res 39:e68. doi:10.1093/nar/gkr123
Article CAS PubMed PubMed Central Google Scholar
Solovyev V, Kosarev P, Seledsov I, Vorobyev D (2006) Automatic annotation of eukaryotic genes, pseudogenes and promoters. Genome Biol 7:S10. doi:10.1186/gb-2006-7-s1-s10
Article PubMed PubMed Central Google Scholar
Steele CL, Gijzen M, Qutob D, Dixon RA (1999) Molecular characterization of the enzyme catalyzing the aryl migration reaction of isoflavonoid biosynthesis in soybean. Arch Biochem Biophys 367:146–150
Article CAS PubMed Google Scholar
Tamura K, Stecher G, Peterson D, Filipski A, Kumar S (2013) MEGA6: molecular evolutionary genetic analysis version 6.0. Mol Biol Evol 30:2725–2829
Article CAS PubMed PubMed Central Google Scholar
Tanaka Y (2006) Flower colour and cytochromes P450. Phytochem Rev 5:283–291. doi:10.1007/s11101-006-9003-7
Article CAS Google Scholar
Tanaka Y, Brugliera F (2013) Flower colour and cytochromes P450. Philos Trans R Soc Lond B Biol Sci 368:20120432. doi:10.1098/rstb.2012.0432
Article PubMed PubMed Central CAS Google Scholar
Teutsch HG, Hasenfratz M, Lesot A, Stoltz C, Garnier J-M, Jeltsch J-M, Durst F, Werck-Reichhart D (1993) Isolation and sequence of a cDNA encoding the Jerusalem artichoke cinnamate 4-hydroxylase, a major plant cytochrome P450 involved in the general phenylpropanoid pathway. Proc Natl Acad Sci U S A 90:4102–4106
Article CAS PubMed PubMed Central Google Scholar
The Arabidopsis Genome Initiative (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408:796–815
Article Google Scholar
Tohge T, Watanabe M, Hoefgen R, Fernie AR (2013) The evolution of phenylpropanoid pathway in the green lineage. Crit Rev Biochem Mol Biol 48:123–152
Article CAS PubMed Google Scholar
Tohge T, Yonekura-Sakakibara K, Niida R, Watanabe-Takahashi A, Saito K (2007) Phytochemical genomics in Arabidopsis thaliana: a case study for functional identification of flavonoid biosynthesis genes. Pure Appl Chem 79:811–823
Article CAS Google Scholar
Tsai C-J, Harding SA, Tschaplinski TJ, Lindroth RL, Yuan Y (2006) Genome-wide analysis of the structural genes regulating defense phenylpropanoid metabolism in Populus. New Phytol 172:47–62. doi:10.1111/j.1469-8137.2006.01798.x
Article CAS PubMed Google Scholar
Vanholme R, Demedts B, Morreel K, Ralph J, Boerjan W (2010) Lignin biosynthesis and structure. Plant Physiol 153:895–905
Article CAS PubMed PubMed Central Google Scholar
Vogt T (2010) Phenylpropanoid biosynthesis. Mol Plant 3:2–20
Article CAS PubMed Google Scholar
Wang X (2011) Structure, function, and engineering of enzymes in isoflavonoid biosynthesis. Funct Integr Genomics 11:13–22
Article CAS PubMed Google Scholar
Wang Y, Tan X, Paterson AH (2013) Different patterns of gene structure divergence following gene duplication in Arabidopsis. BMC Genom 14:652
Article CAS Google Scholar
Wang Y, Wang X, Paterson AH (2012) Genome and gene duplications and gene expression divergence: a view from plats. Ann N Y Acad Sci 1256:1–14
Article PubMed Google Scholar
Weng J-K, Chapple C (2010) The origin and evolution of lignin biosynthesis. New Phytol 187:273–285
Article CAS PubMed Google Scholar
Werck-Reichhart D (1995) Cytochromes P450 in phenylpropanoid metabolism. Drug Metabol Drug Interact 12:221–243
Article CAS PubMed Google Scholar
Werck-Reichhart D, Feyereisen R (2000) Cytochrome P450: a success story. Genome Biol 1:reviews3003.1–reviews3003.9
Google Scholar
Winkel-Shirley B (1999) Evidence for enzyme complexes in the phenylpropanoid and flavonoid pathways. Physiol Plant 107:142–149
Article CAS Google Scholar
Winkel-Shirley B (2001) Flavonoid biosynthesis. A colorful model for genetics, biochemistry, cell biology, and biotechnology. Plant Physiol 126:485–493
Article CAS PubMed PubMed Central Google Scholar
Wright SI, Lauga B, Charlesworth D (2003) Subdivision and haplotype structure in natural populations of Arabidopsis lyrata. Mol Ecol 12:1247–1263
Article CAS PubMed Google Scholar
Xiao CW (2008) Health effects of soy protein and isoflavones in humans. J Nutr 138:1244S–1249S
CAS PubMed Google Scholar
Xu Z, Zhang D, Hu J, Zhou X, Ye X, Reichel KL, Stewart NR, Syrenne RD, Yang X, Gao P, Shi W, Doeppke C, Sykes RW, Burris JN, Bozell JJ, Cheng Z-M, Hayes DG, Labbe N, Davis M, Stewart CN, Yuan JS (2009) Comparative genome analysis of lignin biosynthesis gene families across the plant kingdom. BMC Bioinform 10:S3
Article CAS Google Scholar
Xu W, Dubos C, Lepiniec L (2015) Transcriptional control of flavonoid biosynthesis by MYB-bHLH-WDR complexes. Trends Plant Sci 20:176–185
Article CAS PubMed Google Scholar
Xu W, Grain D, Bobet S, Le Gourrierec J, Thevenin J, Kelemen Z, Lepiniec L, Dubos C (2014) Complexity and robustness of the flavonoid transcriptional regulatory network revealed by comprehensive analyses of MYB-bHLH-WDR complexes and their targets in Arabidopsis seed. New Phytol 202:132–144. doi:10.1111/nph.12620
Article CAS PubMed Google Scholar
Yadegari Z (2013) Molecular mapping and characterization of phenylpropanoid pathway genes in common bean (Phaseolus vulgaris L.). Ph.D. thesis, University of Guelph, Guelph, Canada
Google Scholar
Yi J, Derynck MR, Chen L, Dhaubhadel S (2010a) Differential expression of CHS7 and CHS8 genes in soybean. Planta 231:741–753
Article CAS PubMed Google Scholar
Yi J, Derynck MR, Li X, Telmer P, Marsolais F, Dhaubhadel S (2010b) A single repeat MYB transcription factor, GmMYB176, regulates CHS8 gene expression and affects isoflavonoid biosynthesis in soybean. Plant J 62:1019–1034
CAS PubMed Google Scholar
Yoon J, Choi H, An G (2015) Roles of lignin biosynthesis and regulatory genes in plant development. J Integr Plant Biol 57:902–912
Article CAS PubMed PubMed Central Google Scholar
Zhao Q, Dixon RA (2011) Transcriptional networks for lignin biosynthesis: more complex than we thought? Trends Plant Sci 16:227–233
Article CAS PubMed Google Scholar
Zhong R, Ye Z-H (2009) Transcriptional regulation of lignin biosynthesis. Plant Signal Behav 4:1028–1034
Article CAS PubMed PubMed Central Google Scholar
Zubieta C, Parvathi K, Ferrer J-L, Dixon RA, Noel JP (2002) Structural basis for the modulation of lignin monomer methylation by caffeic acid/5-hydroxyferulic acid3/5-O-methyltransferase. Plant Cell 14:1265–1277
Article CAS PubMed PubMed Central Google Scholar

Download references

Author information

Authors and Affiliations

Department of Plant Agriculture, University of Guelph, Guelph, ON, N1G 2W1, Canada
Yarmilla Reinprecht, Gregory E. Perry & K. Peter Pauls

Authors

Yarmilla Reinprecht
View author publications
You can also search for this author in PubMed Google Scholar
Gregory E. Perry
View author publications
You can also search for this author in PubMed Google Scholar
K. Peter Pauls
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yarmilla Reinprecht .

Editor information

Editors and Affiliations

Facultad de Biología, Universidad de León, León, Spain
Marcelino Pérez de la Vega
Misión Biológica de Galicia, Grupo de Biología de Agrosistemas, Consejo Superior de Investigaciones Científicas, Pontevedra, Spain
Marta Santalla
Genomics and Biotechnology, London Research and Development Centre, Agriculture and Agri-Food Canada, London, Ontario, Canada
Frédéric Marsolais

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Reinprecht, Y., Perry, G.E., Peter Pauls, K. (2017). A Comparison of Phenylpropanoid Pathway Gene Families in Common Bean. Focus on P450 and C4H Genes. In: Pérez de la Vega, M., Santalla, M., Marsolais, F. (eds) The Common Bean Genome. Compendium of Plant Genomes. Springer, Cham. https://doi.org/10.1007/978-3-319-63526-2_11

Download citation

DOI: https://doi.org/10.1007/978-3-319-63526-2_11
Published: 21 December 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63524-8
Online ISBN: 978-3-319-63526-2
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Publish with us

Policies and ethics

A Comparison of Phenylpropanoid Pathway Gene Families in Common Bean. Focus on P450 and C4H Genes

Abstract

Similar content being viewed by others

Soybean chalcone isomerase: evolution of the fold, and the differential expression and localization of the gene family

Genome-wide identification, comparative analysis and functional roles in flavonoid biosynthesis of cytochrome P450 superfamily in pear (Pyrus spp.)

Characterization of genes coding phenylalanine ammonia lyase and chalcone synthase in four Pogostemon cablin cultivars

Keywords

11.1 Introduction

11.2 Gene Families Encoding Enzymes of Phenylpropanoid Pathway in Common Bean