Keywords

2.1 Introduction

Only a few decades back in the 1960s or even in the 1970s and 1980s, taxonomy involved almost exclusively the use of morphological traits to differentiate one species from another. Genetic variation of populations was also accounted for based on morphological variation. It was in the mid-1960s that Lewontin and Hubby revealed the application of protein electrophoresis to quantify genetic variation in populations of fruit flies (Lewontin and Hubby 1966). Soon enough, isozyme electrophoresis spread throughout the world. Numberless population genetic studies were produced, unraveling genetic systems of a diverse set of species, including plants. A revolution to that began with the use of markers directly at the level of DNA, in the 1980s. Restriction fragment length polymorphisms (RFLP) enabled not only population genetic structure but also the development of linkage maps (Tanksley et al. 1989). The development of polymerase chain reaction accelerated the discovery of molecular markers of various types in a few laboratory steps. Random amplified polymorphic DNA (RAPD) (Williams et al. 1990) and amplified fragment length polymorphisms (AFLP) (Vos et al. 1995) filled in countless issues of scientific journals in the 1990s and early 2000s. Simple sequence repeats (SSR) or microsatellites (Litt and Luty 1989) became more popular and are still being largely employed for population genetic studies that require the distinction of heterozygotes, such as in studies of genetic diversity, structure, paternity and system of mating. Currently, single-nucleotide polymorphisms (SNP) prevail, as those are the most abundant markers in organisms, accounting for the variation in a single base.

As members of the Poaceae family, bamboos comprise up to 1662 species and 121 genera described so far (Canavan et al. 2017), with broad distribution throughout the world. Extensive molecular marker-based surveys have been carried out to constantly improve the resolution of phylogenetic trees of such species (Bamboo Phylogeny Group 2012; Wysocki et al. 2015). Besides taxonomy and molecular phylogenetics, molecular markers have also been employed for the study of bamboos for diverse purposes such as genetic diversity and structure of populations, genetic fidelity assessment in clonal populations, candidate gene search for growth, and developmental processes, among others. The emerging field of epigenomics has also been explored in a few studies so far. It is the aim of this chapter to describe the fundamental applications of molecular markers to bamboos, briefly accounting for the main methods that have been employed, either PCR or gel-based, or obtained directly from sequencing. Up-to-date studies on genomics and perspectives into other “omics,” such as epigenomics and phylogenomics, are also discussed.

2.2 Molecular Markers for Comprehensive Population Genetic Studies in Bamboos

The possibility of inferring genetic diversity and structure of populations with molecular markers allowed unprecedented discoveries in natural populations of several plants, including bamboos. Uncovering phylogenetic bases of the high species diversity of bamboos, in general, has been the target of various other studies employing molecular markers. An increasing number of studies, though, have also been devoted to deciphering intrapopulation genetic variation. Molecular surveys can explain the clonal structure of natural populations and the implications of the rare flowering events in various bamboos.

Bamboos are, in general, clonal plant species, possessing both sexual and asexual propagation systems. Asexual reproduction is advantageous as it enables the successful establishment in novel environments. On the other hand, it does not provide new genetic variation. This is achieved by sexual reproduction (Kitamura and Kawahara 2011). It is frequent that bamboos flower only once in their life cycle and after several decades (Kitamura and Kawahara 2009). A study of Isagi et al. (2004) investigated the flowering pattern of culms regenerated from seeds of Phyllostachys pubescens and that have naturally propagated their genotypes through leptomorphic rhizomes. AFLP screening allowed the identification of distinct genets distributed in the population. The genets had distinct flowering times, suggesting a genetic architecture involved in this trait. Further analysis with microsatellite markers identified the clonal structure of a population of Sasa cernua and revealed an overall synchronism of flowering culms of the same clone, but also that not all culms flowered at once. Culms that flowered died, leaving others of the same clone still able to flower in future events. This may enable novel opportunities for cross-pollination (Kitamura and Kawahara 2009). In Sasamorpha borealis, after vegetative propagation by rhizomes, almost all adults produce flowers, set seeds in large amounts, and die (Lee and Chung 1999). However, dying after producing flowers is not a general rule, as revealed for S. pubiculmis. Among distinct genets that were monitored, one had both flowering and nonflowering patches for 4 years. One genotype can maintain their rhizomes and nonflowering patches alive after mass flowering (Miyazaki et al. 2009).

The rare flowering events in bamboos have important implications in outcrossing rates. The synchronism of flowering within clones (Kitamura and Kawahara 2009) limits the possibility for outcrossing between distinct genotypes. Using microsatellite polymorphisms, Kitamura and Kawahara (2011) studied the mating system of the dwarf bamboo S. cernua. The authors found an overall low outcrossing rate (tm = 0.148). The genotype pairs that presented the highest outcrossing rates also had between 2% and 17% of seeds with homozygote genotypes for the markers. Therefore, high inbreeding coefficients should be expected, but the authors observed some decline toward the stage of seedlings, suggesting some selection against inbred progenies. In P. pubescens, Lin et al. (2014) also detected low outcrossing rates (tm = 0.089) and excess of homozygotes (FIS = 0.195) after genotyping seeds with cDNA SSR markers from three locations separated by at least 100 km. This may be due to an auto compatibility system, such as shown in the Brazilian species Merostachys riedeliana (Guilherme and Ressel 2001).

The characteristics of bamboos are compatible with lower levels of genetic variation within populations and higher genetic differentiation among populations, as revealed by population genetic studies in some species (Table 2.1). However, various species still do not have population genetic surveys published, and high genetic variation is not that rare in data already available. It is not our goal to exhaust the literature on all the studies available for such a purpose, but to provide results and prospects of the use of molecular markers in bamboos.

Table 2.1 Compilation of molecular marker studies with bamboo populations, with genetic diversity and structure estimates

Although with less polymorphism and as indirect products of gene expression, isozyme markers have been used in bamboo germplasm characterization. The advantage of such markers relies on the possibility of discriminating heterozygotes. Four enzyme systems were screened in accessions of hill bamboo, Sinarundinaria anceps. Out of 12 markers, approximately 75% were polymorphic, and similarities among accessions ranged from 54 to 100% (Tiwari et al. 2019). For isozymes, considerably high genetic variation, measured from the observed heterozygosity (Ho = 0.219), was found in populations of S. borealis from Korea. Relatively moderate genetic differentiation, however, was also identified, as measured by the proportion of genetic variation among populations (GST = 0.310) (Lee and Chung 1999). In 17 populations of Pseudosasa japonica, the mean genetic diversity encountered through 28 isozyme loci (Ho = 0.099) was lower than the expectation under Hardy-Weinberg equilibrium (He = 0.167). A high and significant deficit of heterozygotes was then detected (FIS = 0.457), suggesting considerable inbreeding due to vegetative spread. In this species, most of the genetic variation was found within populations (84.8%) (Huh and Huh 2002).

AFLP, ISSR, and SRAP are dominant markers that, in general, are more suitable for genetic structure analysis rather than genetic diversity, as they do not allow the discrimination of heterozygotes. Sixteen cultivars of P. violascens were analyzed with these three types of markers. AFLP allowed the discrimination of 434 markers, while ISSR (209 bands) and SRAP (222) had a lower number. In general, AFLP showed the highest marker efficiency among other methods, although the polymorphism of AFLP (58.3%) was not higher than ISSR (65.1%) and SRAP (68.5%) (Lin et al. 2011). In the dwarf bamboo Bashania fangiana, AFLP markers showed considerable genetic polymorphism, and most of the genetic diversity was found within populations (GST = 0.057). The study indicated that the two populations investigated were multiclonal and diverse (Ma et al. 2013).

ISSR markers were used to analyze the genetic diversity and structure of seven populations of Melocanna baccifera, a non-clump and evergreen arborescent bamboo in India. Moderate values of Nei’s genetic diversity (H = 0.1639) were detected through the analyses, with 88.4% polymorphic bands. From the hierarchical partition of the genetic diversity, most of it was found within populations (GST = 0.1942) (Nilkanta et al. 2017). Similar differentiation was also detected among populations of Dendrocalamus membranaceus from China. Among all the 155 bands, 153 were polymorphic, and the proportion of genetic differentiation among populations was 25.2% (from GST) and 21.1% based on AMOVA (Yang et al. 2012). Conversely, ISSR proved useful in detecting low genetic diversity (H = 0.0418) and high genetic differentiation (GST = 0.8474) in populations of D. giganteus, one of the largest bamboos in the world. A Mantel’s test suggested that geographic and genetic distances were not significantly correlated, implicating in a high genetic differentiation across populations of the species. Differences in flowering times and limited pollen flow could explain the strong differentiation (Tian et al. 2012). Considerable genetic differentiation (FST = 0.38949) was also found among 13 populations of the Ethiopian lowland bamboo, through ISSR. In this case, considerably high genetic diversity was found at the species level (H = 0.2702) (Oumer et al. 2020).

The multiallelic nature and codominance of microsatellites markers, as well as their transferability to other species, make them important markers for more detailed and precise genetic variation and structure analyses (Table 2.1). Abreu et al. (2011) developed 13 novel microsatellite markers for a Brazilian species of bamboo, Aulonemia aristulata, native to the Atlantic rain forest. In a posterior study, those microsatellites were used to analyze the genetic diversity of two populations. The observed heterozygosity was calculated in saplings and seedlings of both populations, being higher in saplings of the populations (Table 2.1). The fixation indexes varied from 0.43 to 0.84, with higher values in the seedling stage. Inbreeding depression could be the main factor explaining a reduction in density of bamboos in the direction of the sapling stage (Abreu et al. 2014).

Inbreeding depression is a potential consequence of crossings between individuals identical by descent, as it happens among close relatives or by selfing (Frankham et al. 2010). As discussed before, the clonal structure of bamboos is favored by their limited flowering events, making opportunities from outcrossing scarcely available. Inbreeding depression can be manifested by several characteristics, such as low germination, mortality of seedlings due to factors such as abnormal germination, or lack of the ability to synthesize chlorophyll. Our group collected a source of seeds of Dendrocalamus asper that were obtained after a single flowering event of an individual in an urban area (Data unpublished). Significant mortality and abnormal germination were observed from such seeds. Moreover, several seedlings were not able to grow, as they lacked chlorophyll deposition in their leaves (Fig. 2.1a). Those that possessed partial green areas alternated with white surfaces on their leaves were able to grow normally (Fig. 2.1b). An ISSR profile with a few markers indicated low polymorphism on such materials, such as shown with the universal primer ISSR 827 (Fig. 2.1c).

Fig. 2.1
figure 1

Plants of Dendrocalamus asper originated from seeds collected from a single plant. (a) Illustration of a few plants after approximately 4 months after germination. The plant circulated in yellow did not survive (white leaves). (b) Plants after 6 months of germination. Plants that have partial ability to accumulate chlorophyll survived. (c) ISSR profile of some of those plants, showing limited polymorphism among them

With microsatellites, genetic diversity and structure of Kuruna debilis, a threatened bamboo species, were determined in populations located in Sri Lanka (Peng et al. 2013). High genetic diversity was found (Ho = 0.708 for all populations), but considerable deficiency of heterozygotes was also computed (FIS = 0.170). The differentiation among populations (FST = 0.113) revealed that most of the diversity was also found within populations (Attigala et al. 2017). P. edulis, the model bamboo which has been sequenced (Peng et al. 2013), was also the object of a microsatellite study in populations of China. Considerable genetic variation, measured from heterozygosity, was measured from 20 novel microsatellite markers developed by Jiang et al. (2013). Nine microsatellites were developed for the Australian endemic species Bambusa arnhemica, revealing an average genetic variation of Ho = 0.360 in four populations. Once again, the deviation from the expected heterozygosity was considerably high (mean of He = 0.690) (Kaneko et al. 2008). In 19 populations of D. hamiltonii from the Himalayas, low genetic variation (Nei’s H = 0.175 at the species level) and moderate genetic differentiation were found (FST = 0.165) (Meena et al. 2019).

From 16 microsatellite markers developed by Dong et al. (2012) for D. sinicus, 8 were screened in another study with natural populations of the species in its distribution range. The data on nuclear microsatellites enabled the discovery of high genetic differentiation among the populations, dividing them into two main subgroups that are consistent with different culm types. With an analysis of chloroplast DNA markers as well, one genetic group encompassed populations with straight culms, while the other genetic cluster grouped the sinuous-culm lineage. Based on both types of molecular markers, the authors concluded that the populations experienced dispersal and long-term isolation, with some dilution due to contemporary gene flow (Yang et al. 2018).

One of the most important food sources to the giant panda, in China, is the bamboo species Fargesia denudata. Therefore, it is straightforward to study the genetic resources available from natural populations of the species. The development of microsatellite markers for F. denudata is an important tool for exploring the genetic diversity of the species. According to a primer note, 14 markers were developed for further population genetic studies (Lv et al. 2016).

Due to the several related species of bamboos, microsatellites can usually be successfully transferred among species for genetic studies. Several of the microsatellites developed for the Brazilian species Aulonemia aristulata were successfully amplified in other species belonging to the genera Bambusa, Dendrocalamus, Gigantochloa, and Guadua (Abreu et al. 2011). Similarly, the microsatellites developed for D. sinicus were transferable to other species within the same genus (Dong et al. 2012). The microsatellites developed for F. denudate were transferable, in different numbers, to F. scabrida, F. rufa, F. ferax, Arundinaria fargesii, and Yushania lineolata (Lv et al. 2016). Bhandawat et al. (2015) dedicated a full study to analyze the crosstransferability between D. latiflorus and other bamboo species.

Another line of studies with molecular markers encompasses comparisons of distinct species and genera of bamboos (Table 2.2). RFLP markers were employed to characterize genetic relationships among several species of the genera Phyllostachys, Bambusa, Arundinaria, Pseudosasa, Sinobambusa, and Sinocalamus (Friar and Kochert 1991). RAPD markers were able to distinguish several bamboo species and genera (Nayak et al. 2003; Ramanayake et al. 2007; Konzen et al. 2017; Annisa et al. 2019). Moreover, RAPD markers were efficiently converted to RFLP markers by enzyme digestion. Those markers were able to distinguish four genera (Bambusa, Dendrocalamus, Guadua, and Phyllostachys) and species (Konzen et al. 2017). ISSR markers were also used in similar approaches (Mukherjee et al. 2010; Amom et al. 2018). Morphological variables and SRAP markers were also combined in a study with several bamboo taxa (Zhu et al. 2014). EST-derived microsatellites have been used in phylogenetic studies of bamboo species (Barkley et al. 2005; Sharma et al. 2009; Cai et al. 2019). EST-based random primers were also useful in the study of Mukherjee et al. (2010). cpDNA markers were the object of distinct studies as well, providing tools for examining genetic differentiation and haplotype diversity within populations (Abreu et al. 2014; Triplett et al. 2010; Yang et al. 2018) and among species (Triplett et al. 2010).

Table 2.2 Compilation of a few molecular marker studies that were conducted to compare distinct taxa of bamboos

To detect natural hybridization among North American woody bamboos of the genus Arundinaria, AFLP and cpDNA markers were used (Triplett et al. 2010). The distinction of three species (A. gigantea, A. appalachiana, and A. tecta) that were previously recognized based on morphological variation was conducted. Those species were also analyzed for their intraspecies genetic diversity. In general, relatively low levels of genetic diversity were found within each of the species, based on the AFLP data (Triplett et al. 2010). The authors discussed the intergradation of molecular and phenotypic data among the parental species. cpDNA analyses suggested multiple and reciprocal hybridization events (Triplett et al. 2010).

Transposable elements are ubiquitous in eukaryotic genomes, being able to self-replication and insertion to other sites, affecting the stability of genomes. In bamboos, long terminal repeat (LTR) retrotransposons are highly abundant (more than 40% of the genome of P. edulis) (Zhou et al. 2017). Retrotransposon-based markers were used to screen 58 bamboo taxa, including 47 distinct species in the genus Phyllostachys. Those markers behaved as dominant, producing an average of 208.75 bands per primer and average polymorphic information content of 0.327 for the whole set of taxa. After AMOVA, 25% of the genetic variation was detected among the taxa (Li et al. 2020a).

The emerging field of epigenetics deals with heritable variation beyond the level of DNA, such as methylation and histone acetylation and patterns of gene expression. The considerable genomic and transcriptomic data available for bamboo is enabling the discovery of epigenomic variation among taxa. Using methylation-sensitive amplification polymorphism (MSAP), Yuan et al. (2014) encountered distinct patterns of methylation for chronological age of P. edulis. The increase in chronological age was accompanied by an augmented DNA methylation rate. Lu et al. (2012) also detected great differences of DNA methylation rates among developmental stages of P. praecox, based on AFLP using methylation-sensitive enzymes. The rapid increase in sequenced genomes will enable the discovery of methylation patterns across taxa of bamboo, assisting in the clarification of several processes that are accompanied by epigenetic changes.

2.3 SNP Markers and their Application to Conservation and Breeding

Genomic technologies accelerated the discovery of single-nucleotide polymorphisms (SNP), the most abundant type of molecular markers in all organisms. The publication of the genomes of P. edulis (Peng et al. 2013) has opened the field for genomic and transcriptomic studies. Restriction site-associated DNA sequencing (RAD-seq) is a method that enables the detection of thousands of SNP, by a strategy of reduced genomic representation (Baird et al. 2008). SNP markers detected through RAD-seq were used to evaluate phylogenetic relationships in the tribe Arundinarieae. In general, eight lineages were supported by the data, two of them in agreement with previous studies on nuclear markers (Wang et al. 2017a).

After comparing three moso bamboo samples through genome resequencing, 4,700,803 SNP markers were detected (Zhou et al. 2019a). The study also discriminated INDEL markers, accounting for 268,150 distinct positions throughout the genomes. Moreover, copy number variation (CNV = 65,935) and structural variations (SV = 215,297) were also analyzed. The variation detected was present in genes associated with ribosome genesis, caffeine metabolism, nucleotide binding, and ribonucleoside binding, among several other categories, probably involved in adaptation of moso bamboo to their environment (Zhou et al. 2019a).

To date, the number of studies involving SNP markers is yet insipient, but there will be certainly many more publications to come shortly. High-density SNP profiles have provided important information on genetic variation and structure of populations. They also have been largely employed in genome-wide association mapping in plants and other species. Such discoveries enable the design of proper conservation and breeding strategies, as phenotypes can be precisely correlated with SNP and genes that are located throughout genomes. Data on SNP can rapidly be obtained from genotyping by sequencing (GBS) (Elshire et al. 2011), even in species that do not have a reference genome for the alignment of reads (Poland et al. 2012).

2.4 Phylogenomics and its Application to Bamboo Taxonomy

Until a few decades ago, bamboo taxonomy relied exclusively on morphological variation. Moreover, the focus was mainly on vegetative characters. Due to the rare flowering events in bamboos, the morphology-based taxonomy of bamboo has been a challenge (Bhattacharya et al. 2006). The proper distinction between bamboo taxa and comprehension of phylogeographic and phylogenetic relationships are essential to guide conservation and development decisions. The development of molecular marker technologies has provided more consistent phylogenetic trees, improving the taxonomic resolution of bamboos.

Currently, high-throughput sequencing technologies (NGS) allow accessing genome sequences of virtually all species. Through genomic data, researchers can infer species relationships, understand mechanisms of molecular evolution, and control for stochastic events. This intersection between evolution and genomics has been called phylogenomics (Eisen and Fraser 2003; Delsuc et al. 2005; Philippe et al. 2005; Yu et al. 2018). In this topic, we provide a few case studies that used phylogenomics for taxonomical classification and a better understanding of the evolutionary relationships of bamboo species.

The tribe Arundinarieae belongs to the subfamily Bambusoideae (Poaceae), containing more than 1600 species, a highly complicated taxonomy (Bamboo Phylogeny Group 2012; Clark et al. 2015; Canavan et al. 2017). Plastid markers enabled the subdivision of bamboos into 12 major lineages (Zeng et al. 2010; Yang et al. 2013; Zhang et al. 2016). Plastid genome sequencing has also been used to resolve the phylogenetic relationships in Arundinarieae and in obtaining robust relationships among lineages (Ma et al. 2014; Wysocki et al. 2015). Recently, other phylogenomic approaches based on restriction site-associated DNA sequencing (RAD-seq) were used to estimate the phylogenetic relationships among Arundinarieae genera from a nuclear evolutionary trajectory perspective (Wang et al. 2017a).

The phylogenetic relationships of Shibataea, a genus of tribe Arundinarieae and endemic to China, have been reconstructed through a phylogenomic approach based on double digest restriction site-associated DNA sequencing (ddRAD-seq), and whole plastid genomes were generated using genome skimming (Guo et al. 2019). Fargesia, other genus of tribe Arundinarieae, has also been its phylogenetic relationships reconstructed based on complete plastid genome sequences of 26 species from Fargesia (Zhou et al. 2019b).

Phylogenomic approaches based on complete plastid genome sequences have also been used in evolutionary studies of bamboos. Through genome-wide comparative analyses of 76 chloroplast genomes of bamboos, extreme heterogeneity of the evolutionary rate within the bamboos was demonstrated, with the lowest value found in tribe Arundinarieae (Wang et al. 2020). Plastid genome sequences have also been used to estimate phylogenetic relationships among Bambuseae species (do Vieira et al. 2016). Bambuseae is another tribe of subfamily Bambusoideae (Poaceae), containing 66 genera and 812 species (Bamboo Phylogeny Group 2012; Clark et al. 2015).

Whole plastid genomes of Olyreae species were generated using genome-skimming approach and used to strongly support the phylogenetic positions of Froesiochloa and Rehia in the Olyreae (Wang et al. 2018). Olyreae is the smallest tribe of subfamily Bambusoideae (Poaceae), containing 22 genera and 124 species (Bamboo Phylogeny Group 2012; Clark et al. 2015). Therefore, there are few phylogenomic studies involving the Olyreae species.

Finally, all these studies and another show the potential of the phylogenomic approach based on NGS technologies combined with traditional morphological analyses for resolving difficult phylogenetic relationships in the intractable bamboo species, evolutionary, and biogeography studies.

2.5 Protein and Nucleotide Motifs for Evolutionary Studies with Gene Families

So far in this chapter, we have treated molecular markers as tools for comprehending population genetic processes as well as to understand phylogenetic relations among species. By a functional characterization of the allelic variation of genes, sequence motifs affecting phenotyping variation can be identified, those being referred to as functional markers (Andersen and Lübberstedt 2003). Genes that have been functionally characterized may be a reference to annotations of paralogs and orthologs, which enable their categorization in major families. Phylogenetic reconstruction with gene families provides insights into the evolution and adaptation of plants (Neale et al. 2017). Genome-wide searches for gene families use model plants to find homologs in the species of interest, but experiments are required to functionally confirm the inferred genes (Vaattovaara et al. 2019).

Various gene families of transcription factors are implicated in the response of plants to biotic and abiotic stresses. In P. edulis, genome-wide surveys of genes are available for WRKY (responsive to cold and drought) (Li et al. 2017); no apical meristem (NAM); Arabidopsis transcription activation factor (ATAF) and cup-shape cotyledon (CUC), collectively referred to as NAC genes (participate in lignin catabolic process and cellulose biosynthetic process) (Shan et al. 2019), heat shock transcription factors (Hsfs) (participate in shoot and flower development; expression is changed under cold, high temperature, drought and high salinity stresses) (Xie et al. 2019), dehydration responsive element-binding (DREB) (differentially expressed under drought, cold and high salinity) (Wu et al. 2015). Among various other categories of gene families studied in P. edulis are aquaporins (AQP) (Sun et al. 2017), late embryogenesis abundant (LEA) (Huang et al. 2016), amino acid/auxin permease (AAAP) (Liu et al. 2017), plant homeodomain zinc finger (PHD-finger) (Gao et al. 2018), trihelix genes (Gao et al. 2019), SQUAMOSA-promoter binding protein-like (SBP-like) transcription factors (Pan et al. 2017), IQ67-domain (IQD) genes (Wu et al. 2016), and basic leucine zipper domain (bZIP) transcription factors (Pan et al. 2019). In this chapter, we highlight the APETALA2/ethylene-responsive element binding protein (AP2/EREBP) superfamily and aquaporins as examples for the application of protein or nucleotide motifs for evolutionary analyses in bamboos.

The AP2/EREBP superfamily of proteins has a high conserved domain named APETALA2 (AP2), with 55 to 70 amino acids (Jofuku et al. 1994; Okamuro et al. 1997; Konzen et al. 2019). This superfamily is divided into three families that have different numbers of domains and participate in different physiological processes, which are RAV, AP2, and ERF (Sakuma et al. 2002). The RAV family has one AP2 domain and one B3 domain (Kagaya et al. 1999), the AP2 family has two AP2 domains (Okamuro et al. 1997), and the ERF family has only one AP2 domain (Sakuma et al. 2002).

Wu et al. (2016) identified 116 genes from the AP2/EREBP family in P. edulis and divided them into three subfamilies: 28 AP2, 7 RAV, 80 ethylene response factors (ERF), and 1 soloist; the ERF group is divided into two subgroups, ERF and DREB. As an example in this chapter, we selected five amino acid sequences available in the Plant Transcription Factor Database (http://planttfdb.gao-lab.org/) from each group to analyze and identify motifs that can be used as molecular markers. First, we used the Clustal W algorithm for producing an alignment using BioEdit (Hall 1999). Afterward, we constructed a phylogenetic tree by maximum likelihood, using MEGA X (Kumar et al. 2018), and the amino acid substitution model WAG+I + G + F, as determined from Bayesian criterion information, with 500 bootstrap replications. The protein motifs were searched on the MEME-suite website (http://meme-suite.org/). From the phylogeny, two major groups were identified (Fig. 2.2a), one with RAV proteins and the other with AP2, DREB, and ERF, the last two being more similar between them (Data unpublished). Observing the protein motifs, similar motifs are shared within the major groups and according to the subdivision of the superfamily. The protein motifs one and two correspond to the domain AP2, a domain that is present in all sequences with certain differences among the subgroups. The protein motifs 8, 3, and 4 correspond to the domain B3, even though the motif 4 was not identified in the sequence RAV_PH01000581G0530.

Fig. 2.2
figure 2

Protein and nucleotide motifs as potential phylogenetic markers for the study of gene families in bamboos. (a) Protein motifs searched for some members of the AP2/EREBP superfamily. (b) Nucleotide motifs searched for 12 aquaporin genes These figures were elaborated based on analyses performed by the authors of the chapter, using MEME-suite396 website (http://meme-suite.org/) with sequences available from GenBank/NCBI (https://www.ncbi.nlm.nih.gov/) and the Plant Transcription Factor Database (http://planttfdb.gao-lab.org/)

We also performed a similar analysis of AQP proteins from sequences available in the public database. AQP proteins are channels to water transport along with CO2 and nutrients (Maurel et al. 2015). PeTIP4;1–1, isolated from P. edulis, conferred tolerance to transgenic Arabidopsis thaliana in abiotic stresses caused by drought and high salinity (Sun et al. 2017). PeTIP4;1 and PeTIP4;2 were expressed in the parenchyma and epidermal cells and roots, indicating their role in water absorption and transport (Sun et al. 2018). Sun et al. (2018) classified the aquaporins from P. edulis into four groups: (a) plasma membrane intrinsic proteins 22 (PIP), (b) tonoplast intrinsic proteins 20 (TIP), (c) nodulin 26-like intrinsic proteins 17 (NIP), and (d) small basic intrinsic proteins 4 (SIP); the uncharacterized X intrinsic proteins (XIP) were not found.

We used the 12 sequences from the work of Sun et al. (2018), available from GenBank/NCBI (https://www.ncbi.nlm.nih.gov/) to search for the correspondent nucleotide sequences in A. thaliana with BLASTn. Sequence alignment and phylogenetic reconstruction were conducted similarly to the AP2/EREBP sequences. The maximum likelihood tree, this time, was constructed with the nucleotide substitution model K2 + G, as determined with MEGA X, after 500 bootstrap replications. The tree of aquaporins with the nucleotide sequences from bamboo and A. thaliana was divided into two major groups, the first with SIP and PIP genes and the second with TIP and NIP genes from both species (Fig. 2.2b). Motif 5 was identified in all nucleotide sequences. Motifs 6 and 7 were found only in PIP genes, while motifs 8 and 9 were found in TIP genes, in similar positions. Motifs 10 and 17 were found only in PIP. Motifs 1, 2, and 3 were found in all sequences except SIP. Motif 4 was detected in TIP and PIP genes (Data unpublished).

Therefore, protein or nucleotide motifs can be used as markers to identify and compare gene families, to better understand the evolution and adaptation of plants. With the sequences available for P. edulis, other bamboos may be searched for genes already annotated in this species.

2.6 Functional Gene Markers for Improving Abiotic Stress Tolerance in Bamboos

Drought, salinity, and low and high temperatures are the mains abiotic constraints that negatively affect bamboo growth and development and reduce its industrial productivity (Ramakrishnan et al. 2020). Therefore, understanding the stress responses will increase the ability to improve tolerance in bamboo species. However, the mechanisms of molecular responses in bamboo species under stress conditions are still not completely understood.

The responses to drought, salinity, and low and high temperature stresses are under stronger biochemical and genetic control (Zhu 2016; Haak et al. 2017). As stated before, various gene families have been identified in moso bamboo. In general, no in-depth functional analyses were conducted for those genes. However, there are other gene families in that at least one member was functionally characterized. From these studies, we listed 12 genes that could be used as functional molecular markers in comprehending abiotic stress tolerance (Table 2.3). To select these stress-responsive genes, we considered their expression profile in moso bamboo plants under stress treatments. Moreover, we considered only those genes that were functionally analyzed in at least one model species such as Arabidopsis, rice, and yeast.

Table 2.3 Gene markers responsive to abiotic stresses in moso bamboo (P. edulis)

Several transcription factors have been identified and associated with drought, salinity, and cold-induced stresses in moso bamboo (Table 2.3). WRKY proteins are important transcription factors involved in different biological processes such as plant growth and abiotic stress responses (Rushton et al. 2010; Chen et al. 2012). A genome-wide survey identified 121 WRKY genes in P. edulis (Li et al. 2017). The authors showed that PheWRKY72–2 is a drought- and cold-inducible gene. Furthermore, the overexpression of PheWRKY72–2 in Arabidopsis enhanced plant tolerance to drought stress by promoting stomatal closure. In another study, PeWRKY83 gene was highly upregulated in moso bamboo seedlings subjected to salinity and drought stress. After ectopic expression in Arabidopsis, it was demonstrated that PeWRKY83 regulates the expression of genes involved in ABA biosynthesis, signaling, and responses to salt and improves salt tolerance (Wu et al. 2017).

The MYB gene family, first described as an oncogene from MYeloBlastosis virus (capital letters highlighting the attributed name of the gene family), also was identified in moso bamboo (Hou et al. 2018). Through in silico and expression analyses, these authors identified a PheMYB4–1 stress-related gene. After ectopic overexpression, PheMYB4–1 promoted improvement in tolerance to cold, drought, and salt stress in Arabidopsis seedlings. Two NAC transcription factors from moso bamboo were identified and functionally characterized. First, the expression levels of PheNAC3 gene increased in leaves of moso bamboo after NaCl treatment. Overexpression in Arabidopsis improved drought and salt tolerance (Xie et al. 2020). Second, drought and salinity also induced the expression of PeSNAC-1 in moso bamboo. Rice plants overexpressing PeSNAC1 gene were more drought- and salt-tolerant (Hou et al. 2020). Moreover, these authors suggested that PesSNAC-1 works as a positive stress regulator in moso bamboo.

The ABA-stress-ripening (ASR) gene family is a small group of chaperone-like proteins and plant-specific transcription factors involved in plant development, senescence, fruit ripening, and abiotic stresses (González and Iusem 2014). After being identified in moso bamboo, ASR genes were found upregulated under drought, NaCl, and ABA. Specifically, the overexpression of PheASR2 in rice improved drought tolerance (Wu et al. 2020b). Stress response-related functions of Teosinte branched1/Cincinnata/proliferating cell factor (TCP) members have been investigated in different plant species including moso bamboo (Liu et al. 2018). A member of this family was characterized in rice and Arabidopsis, and the results showed that PeTCP10 may have positive regulatory functions in drought tolerance (Liu et al. 2020). A drought-induced (Di19) protein, a zinc-finger transcription factor containing two conserved and atypical Cys2/His2 zinc-finger domains, isolated from moso bamboo, has been functionally characterized in Arabidopsis and rice (Wu et al. 2020a). Through transgenic lines, these authors concluded that PheDi19–8 significantly increased drought tolerance. Moreover, complementation analyses showed that PheDi19–8 works as a positive modulator of drought stress tolerance.

Twenty-six DNA binding with one finger (Dof) genes were identified in moso bamboo (Wang et al. 2016). A member of this transcription factor family has been functionally characterized by ectopic expression in Arabidopsis (Liu et al. 2019). These authors showed that the PheDof12–1 gene is highly induced by cold, drought, and salt in moso bamboo and their overexpression promotes early flowering in Arabidopsis. The Arabidopsis lines overexpressing PheDof12–1 gene were not analyzed under those abiotic stresses (Liu et al. 2019).

VQ motif-containing proteins are involved in responses to biotic and abiotic stresses by interaction with WRKY transcription factors (Lai et al. 2011; Hu et al. 2013). VQ genes were identified in moso bamboo genome, and the PheVQ28 gene has been functionally characterized (Wang et al. 2017b; Cheng et al. 2020). PheVQ28 was highly upregulated in moso bamboo seedlings under salt stress, and Arabidopsis plants overexpressing this gene were significantly more tolerant to salt stress (Cheng et al. 2020). Laccases (LAC) are oxidative enzymes involved in flavonoid and lignin biosynthesis (Pourcel et al. 2005; Berthet et al. 2011). Thus, LAC genes are promising for woody, biofuel, and other biotechnological applications. Moreover, LAC genes have also been involved in plant responses to environmental stress (Wang et al. 2015). Laccase gene members have been identified in moso bamboo. PeLAC10 gene was upregulated under ABA and NaCl, and its overexpression in Arabidopsis plants improved drought tolerance (Li et al. 2020b). These results suggest that laccase genes could be potential candidates for the molecular breeding of bamboo to increase the content of lignin and improve their adaptability to drought stresses.

Finally, although all genes list in Table 2.3 requires further functional characterization, they are a potential candidate for the breeding of stress tolerance in bamboo species and other crops through genetic engineering using approaches such as CRISPR (clustered regularly interspaced short palindromic repeats). Using molecular markers for deciphering genetic variation and its functional implications will certainly assist in improving stress tolerance and increase the productivity of bamboos in the climate change scenario that affects the whole planet.

2.7 Molecular Markers for Genetic Fidelity

Molecular markers are also a powerful tool to attest to the genetic fidelity of clonally produced plantlets of bamboos. The use of micropropagation has been a viable alternative for cloning superior genotypes, as well as for the production of seedlings with high quality and free of pathogens. There are strict precautions in micropropagation so that plants can express all their genetic potential, as on the course of in vitro cultivation regenerated plants may not maintain the genetic stability in relation to their mother plant (the donor of propagules) (Kaeppler et al. 2000; Ray and Ali 2017).

Changes in the genotype after micropropagation are referred to as somaclonal variation. Mutations may occur due to the use of growth regulators, the various subcultures of explants, the regeneration by callus induction or protoplasts, and somatic embryogenesis. Therefore, DNA can undergo irreversible and inheritable changes (Ray and Ali 2017). Genotypic changes can occur through chromosomal breaks and rearrangements, changes in the level of ploidy, activation of transposable elements, and DNA methylation (Kaeppler et al. 2000; Krishna et al. 2016). These are examples of genetic and epigenetic changes that may or may not cause phenotypic variations. In fact, the identification of possible phenotypic variation in early stages occurs by observing variations in leaf color or size (albinism, variegation, or dwarfism); however, in most cases, these variations do not become visible requiring analysis by molecular markers to identify somaclonal variants.

The most used markers for the identification of somaclonal variations in bamboo are AFLP, RAPD, ISSR, and SSR (Table 2.4). The principle of the analysis involves the comparison of DNA fingerprint among all propagules with their mother plant. Evidence of somaclonal variation occurs when the fingerprints differ from the original source (Fig. 2.3a). AFLP markers were used to verify the genetic stability of explants from a 2-year subculture of D. hamiltonii in a culture medium containing thidiazuron (TDZ) (Singh et al. 2013a). The combination of the enzymes EcoRI and MseI with 15 selective primers did not reveal any polymorphism between the fragments, which validated the protocol for the species. In another study, nodal segments of Bambusa nutans were used to obtain a somatic embryogenesis protocol in MS culture medium containing 2,4-D (dichlorophenoxyacetic acid) (Mehta et al. 2011)⁠. Six AFLP primer combinations produced 407 fragments, 98.8% of which were monomorphic. Therefore, low number of plants showed somaclonal variations. With this high percentage of monomorphism, the technique is efficient for verifying genetic fidelity.

Table 2.4 Assessment of genetic fidelity in bamboos through molecular marker studies
Fig. 2.3
figure 3

(a) Schematic illustration of the principle of the assessment of genetic fidelity after micropropagation of bamboo. A hypothetical fingerprint shows genetic fidelity vs evidence of somaclonal variation. Although no genetic variation may be present, novel technologies allow the identification of epigenetic mechanisms that might implicate in distinct phenotypic effects. Photo credit: Enéas Ricardo Konzen. (b) Bambusa vulgaris plant originated from micropropagation of mother plant. Photo credit: Siu Mui Tsai

There are also studies that report the success of RAPD markers for bamboo species as well as validating different types of protocols, such as the use of different growth regulators for the in vitro rooting of D. strictus (Goyal et al. 2015), regeneration of axillary buds of B. arundinacea (Kalaiarasi et al. 2014), and validation of commercial materials produced in vitro from D. asper (Singh et al. 2012). For bamboos, ISSR markers have become the most viable alternative for studies of genetic fidelity. The use of ISSR markers to verify the genetic fidelity of B. balcooa, a species of great importance in India, did not show somaclonal variation, even with variations in the concentrations of growth regulators (Rajput et al. 2020)⁠. Genetic stability using ISSR markers has also been shown in the in vitro cultivation of G. magna and G. angustifolia (Nogueira et al. 2019).

In D. asper, the use of four different markers, including SSR, did not show polymorphism for the use of nodal segments under different stages of in vitro development, with the use of cytokinin in the multiplication phase, indolebutyric acid, and acetic naphthalene for elongation (Singh et al. 2013b). The authors used 25 SSR primers based on the rice genome, obtaining 164 bands (all monomorphic) observing the absence of somaclonal variation, which shows the efficiency of these markers for studies of genetic fidelity.

The somaclonal variation observed in micropropagated bamboo species may be linked to the use of growth regulators, indirect organogenesis, as well as the number of subcultures. For the identification of variants, ISSR-type markers, which are based on the repetitive portion of eukaryotic DNA, are effective due to their low cost and repeatability. With the evolution of genomic studies in bamboo, other ways of identifying somaclonal variation will allow the selection of true-to-type planting material, such as studies of epigenetic variations, ploidy, and movement of transposable elements. Even if no mutations have occurred at the DNA level, epigenetic changes such as methylation of DNA, acetylation and methylation of histones, and altered gene expression are possible outcomes (Fig. 2.3a), which could implicate in distinct phenotypic effects and performances of the propagules. It is important to ascertain the genetic fidelity to obtain plants with similar performances to their sources (Fig. 2.3b).

2.8 Conclusion and Future Prospects

Molecular markers empowered the distinction of the diversity of bamboo species and enabled comprehensive analyses of their reproduction and population structure. The current genomic technologies will produce numberless novel reports on the genetic diversity and divergence within and among species. Phylogenomics offers novel methods for improving the resolution of phylogenetic trees, reaching a deeper understanding of the evolution of bamboo taxa. Moreover, in the era of systems biology, research is moving forward to integrative approaches for conservation and breeding of such important species. Further comprehension of bamboo DNA variation coupled with the modulation of methylation, acetylation, and gene expression in their environment is necessary to design proper management. From the classical to the ultimate technologies for their obtainment, molecular markers certainly have much more to contribute to future endeavors.