Introduction

Sorghum (Sorghum bicolor (L.) Moench, 2n = 20) is the fifth most important cereal crop and occupies the second position among the staple food grains in semiarid tropics. Sorghum is an excellent feedstock for bioenergy production because of three distinct features: high photosynthetic efficiency for converting solar energy into biomass; high water use efficiency; and high tolerance to stressed environments for growing in marginal lands.

Sweet sorghums accumulate sugars (10 to 25%) in their stalk juice at physiological maturity. Sweet sorghums are typically characterized by moderate grain yield, but high biomass production. Because of its high sugar content, sweet sorghum is cultivated mainly for fodder, syrup, molasses, sugar and ethanol production (Rao et al. 2009; Wang et al. 2009). The genetic diversity in the germplasm of any breeding program affects the potential for genetic gain through selection. Knowledge of genetic diversity permits the germplasm to be classified into heterotic groups, which is important in hybrid breeding (Menz et al. 2004). Crosses between unrelated and genetically distant adapted parents show greater hybrid vigor than crosses between closely related parents (Stuber 1994; Hallauer 1999). More studies to assess the genetic diversity of sweet sorghum are necessary for crop improvement. Ritter et al., (2007) suggested that sweet sorghum is of polyphyletic origin. As sweet sorghum traits such as sugar rich juicy tall stem, high biomass, high stem/leaf ratio and low grain were found in several races of subspecies bicolor, it raises questions about the origin, selection and genetic and genomic basis of sweet sorghum (Zheng et al. 2011).

Recently, single-nucleotide polymorphisms (SNPs) have become the marker of choice because they can provide accurate and reproducible data along with the potential for high throughput, low-cost genotyping (Yan et al. 2009). Many studies were reported to determine the genetic relationships among sweet sorghum accessions using different types of markers such as AFLPs, (Ritter et al. 2007), SSR markers (Ali et al. 2008; Murray et al. 2009) and SNPs (Murray et al. 2009; Burks et al. 2015; da Silva et al. 2017; Luo et al. 2020; Hao et al. 2021). The objectives of the present study were to assess the agronomic performance of sweet sorghum breeding material and their derived lines and also to assess the genetic variability among them using the SNP markers. This study could be useful to identify the genetically diverse material with superior agronomic characters to use in the future sweet sorghum breeding programs aiming at heterotic hybrids for sugar yield-related traits.

Results

Evaluation for Morpho-Physiological Traits

The two-way analysis of variance (ANOVA) for pooled data showed significant differences for all measured traits at P < 0.001 Table 1. The maximum and minimum mean, standard error, least significant difference (5%) and coefficient of variation (CV, %) for all observed traits for all 75 genotypes are presented in Table 1.

Table 1 Range and mean performance of sweet sorghum panel of genotypes for plant height and sugar yield-related traits

Plant height ranged from 1.3 to 3.4 m. High biomass landraces, specifically most of the durra race accessions, recorded plant heights over 3 m. These landraces are photosensitive lines (Fig. 1, most of clusters IIb and IIc) that grow tall and produce high biomass, under long day length conditions and are induced to flower only when the day length falls below 11 h 45 min. The average plant height for all the high biomass germplasm accessions was 2.7 m. Plant height for most of the remaining sweet sorghum varieties was in the range of 2 to 3.6 m, with the average of 2.6 m, which was almost same as the mean value of germplasm accessions (Fig. 1) clusters I and IIa and for the maintainer lines, range was 1.3 m to 2.1 m (Fig. 1, clusters I and IIb), with the average of 1.6 m. These results indicate, there was significant variability among studied genotypes for plant height. Significant variability was also observed among these germplasm lines, maintainer lines and improved sweet sorghum varieties for stalk yield, brix and sugar content. All the sweet sorghum varieties and breeding lines (Fig. 1) clusters I and II are corded average stalk yield of 37.54 t ha−1, whereas germplasm accessions (Fig. 1) cluster IIb recorded average yields of 54.4 t ha−1. The lowest average yields were observed for maintainer lines, which was 23 t ha−1. Highest brix value (19.4%) was recorded by the breeding line ICSV 12008 while lowest brix value (8.3%) was recorded for Nr 481. All the high biomass germplasm accessions (Fig. 1, most of clusters IIb and III) recorded lower brix values, averaging about 12.7%. Cluster I largely comprised of improved sweet sorghum lines, most of which exhibited higher brix values (> 13%). The average brix value for this group was 15.6%. Cluster II contained breeding lines, most of which were having brix value above 15%, and germplasm accessions, with brix values ranging from 9.2 to 16.1%.

Fig. 1
figure 1

Neighbor-joining tree using SNP data of 75 genotypes. Genetic distances between the sweet sorghum accessions were calculated using the identity-by-state (IBS) coefficient. Ia, Ib, IIa, IIb and III correspond to correspond to the clusters identified through the neighbor-joining method. IS International sorghum, ICSB ICRISAT sorghum female parent, ICSV ICRISAT sorghum variety

Genetic Diversity

Sweet sorghum samples were processed through the GBS pipelines, and a total of 135,162 SNPs were identified after filtering the total SNPs with minor allele frequencies (MAF) at 20 percent. The NJ cluster analysis grouped the 75 genotypes into five distinct major groups primarily based on their biological status (photoperiod-sensitive high biomass germplasm lines or sweet sorghum varieties and derived breeding lines) and also based on their morpho type (race) and pedigree. Grouping of genotypes according to the race is in conformity with previously published reports in diverse germplasm lines (Billot et al. 2013; Ramu et al. 2013). Cluster I consists of two sub-clusters, clusters Ia and Ib. Cluster Ia comprises of 40 genotypes, mainly belonging to bicolor race except for a ICSV 25266 belong to race guinea-caudatum). A majority of them developed at ICRISAT-Patancheru in its sweet sorghum breeding program are parents of that material and grouped based on their common ancestry. Cluster Ib consists of five genotypes all belonging to bicolor race, except IS 18521 belonging to caudatum race. Interestingly, IS 18521 clustered together with SPSSV30, a bicolor race cultivar, indicating latter’s origin from bicolor race.

In the remaining 46 genotypes, 44 genotypes formed cluster II, which has two subgroups (II a and II b) primarily consisting of high biomass and sweet sorghums developed by India’s National Agricultural Research System (NARS), a few ICRISAT-bred lines (some involving NARS parental lines) and 12 high biomass germplasm accessions. Cluster IIa consists of bicolor race breeding products including superior sweet sorghum and high biomass lines (> 2.5 m height). Grouping in this cluster is primarily based on their pedigree Table 1. Two genotypes (viz., SP-08 16427 and SP 08 16421-2) sharing a common ancestry grouped closely together in this cluster, near their stem borer and shootfly resistant parent ICSV 93046, several other of its derivatives, and its progenitor ICSV 700. Cluster IIb mainly consists of non-bicolor race genotypes (including caudatum, durra, guinea and their intermediate races), the majority of them are high biomass germplasm lines and a low biomass maintainer line (ICSB 27) Table 1. Interestingly, IS27206 falling under guinea race, and IS18542, an accession from durra race cluster together. The maintainer line, ICSB 27, was grouped with these germplasm lines and was found to be closely related to durra line IS27314. ICSB 27 although was found to be inferior for all the quantitative traits compared to other genotypes of the group. Surprisingly, two high biomass germplasm accessions IS 16575 and IS 8813 formed into a separate group, cluster III, though they belong to different races durra-caudatum and guinea, respectively. In the rainy season, these two landraces have the same maturity period with the capacity of producing huge biomass (IS 16575: ~ 100 t ha−1 and IS 8813: ~ 75 t ha−1). The pure line variety ICSV 25266 (selected from germplasm accession IS 23526) grouped closely with improved sweet sorghum derivatives of that same germplasm accession, namely 5655-3 and 5651-3, in cluster I along with NTJ2 and other improved sweet sorghum varieties and breeding lines. These two genotypes were showing superiority over their parent ICSV 25266 for plant height, stalk yield, juice yield, brix and sugar yield (Table 1). Also, SPSSV 30 grouped closely with germplasm line IS 18521, probably because of its similar genetic background.

Discussion

Sweet sorghum is one of the potential and promising biofuel feedstocks that has many characteristics such as high stalk sugar content, wide adaptability and tolerance to biotic and abiotic stress. The knowledge on genetic diversity is very important in hybrid breeding program since it permits the germplasm to be classified into heterotic groups (Menz et al. 2004). The genetic diversity in the germplasm of any breeding program affects the potential for genetic gain through selection.

Phenotypic Traits

The seasonal variance and variance for genotype × season interactions showed significant differences for the candidate sugar traits (plant height, juice yield and brix %) at P < 0.001 corroborating with the earlier observations (Rao et al. 2013a, b; Ratnavathi et al. 2010). The landraces are photosensitive lines and clustered in IIa, IIb and III, that grow tall and produce high biomass, under long day length conditions (Fig. 1). They flower only when the day length falls below 11 h 45 min, and it was evident from the mean values (Table 1) that germplasm accessions had lower brix values, and hence, less sugar content (clusters IIb and III) vis a vis improved sweet sorghum varieties and parental lines that recorded superior stalk yield and sugar yield (cluster I). Therefore, based on phenotypic data, entries from cluster Ia and clusters IIb and III could be used as potential parents for developing new heterotic hybrid combinations for sugar yield.

Most of the sweet sorghum genotypes that belong to bicolor race in this study were grouped together although they showed significant differences in terms of observed quantitative agronomic traits. Deu et al. (2006) earlier reported that the bicolor race has high genetic diversity and many rare alleles. This race is considered to be oldest and most widely distributed geographically due to its several uses (fodder, brooms, and sweet stems) (Pecina-Quintero et al. 2012). The three caudatum race genotypes (SSV 84, IS 22868 and IS 18521) were distributed in different clusters, supporting that this race is broadly dispersed (Billot et al. 2013; Ramu et al. 2013). It is evident that these sweet sorghums exhibit wide variability in multiple phenotypic traits such as stalk yield and plant height which corroborates the findings of earlier reports (Balaravi et al. 1997; Bennetzen et al. 2001; Biswas and Sundaralingam 1997) (Table 1). Genotypic differences were also reported by Rao et al. 2013a for extractable juice, stem sugar content and fermentation efficiency. The major roles of nonadditive gene action for plant height, total soluble solids, juice yield and stalk yield indicate the potential of heterosis breeding for improving these traits in sweet sorghum (Rao et al. 2013a, b). Significant genetic differences that were observed for all quantitative traits studied indicate that there is considerable amount of genetic variability existing among genotypes. Cultivar development is based on the exploitation of genetic variability for traits of interest (Makanda et al. 2009). Progress of breeding in such characters is primarily determined by the magnitude and nature of variation and interrelationship among them (Gandhi et al. 1964).

Breeding for Bioenergy

Sweet sorghum has high amounts of total reducing sugars, which prevent sucrose crystallization and has 90% fermentation efficiency (Kumar et al. 2013).Compared to grain sorghum, sweet sorghum grows rapidly producing high biomass and also has wider adaptation. It is important to identify suitable superior genotypes for ethanol production in terms of brix, biomass, juice yield and fermentation efficiency. The genotypes with high sugar yield in contrast to their high biomass and low sugar yield counterparts may possess alleles for increased amount of sugar yield in the stem juice. So, contrasting pairs of these lines that are otherwise most similar for flowering time and plant height can be used as parents to develop mapping populations, targeting genes responsible for sugar accumulation. The segregating population derived from the crosses between these diverse pairs of lines can be used to develop high density linkage maps and to map genes which influence sugar accumulation in the stems (Ali et al. 2008; Zhang et al. 2018; Kong et al. 2020; Hao et al. 2021).

The most elite sweet sorghum genotypes in cluster I in the present study (namely SSV 74 and lines derived from cross DSV 4 × SSV 84), which exhibited superior performance in terms of brix (> 15%), sugar yield (> 2.5 t ha−1), plant height (> 2.8 m) and biomass yield (> 50 t ha−1), can be predicted to show only limited genetic variation, which can be confirmed using SNP data available for them. Also, by performing genome-wide association studies with high density SNP genotyping, genes underlying natural variation in various traits can be identified (Morris et al. 2013a, 2013b). All these genomic studies would be helpful to better understand the diverse pairs of genotypes under study and aid in the incorporation of these lines in future molecular breeding programs.

In conclusion, this study provides detailed analysis of genetic diversity existing among materials under study in the ICRISAT-Patancheru sweet sorghum breeding program. In the present study, NJ analysis based on genetic background revealed that genotypes ICSV 25311, ICSV 25316, ICSV 25300 and SSV 74 show very limited diversity (cluster I), but were found to be superior in terms of sugar yield. Nevertheless, they are genetically distant from germplasm lines (clusters IIb and III) which have potential high biomass yield. Hence, these diverse groups of genotypes could be of valuable material for sweet sorghum breeding programs aiming for heterotic hybrid combinations for sugar yield (Li et al. 2019; Hao et al. 2021). Thus, the results suggest that the information on sugar yield-related traits and genetic relationships will be helpful in guiding future molecular breeding approaches to improve sugar and biomass yield while maintaining and exploiting a broad range of genetic diversity.

Materials and Methods

Phenotyping

Sweet sorghum genotypes used in the present study were selected from the ICRISAT-Patancheru sweet sorghum breeding program in India. A total of 75 sweet sorghum genotypes including germplasm lines with high biomass yield, maintainer lines and sweet sorghum varieties were selected based on their phenotype with an emphasis on panicle type, grain, days to flowering and plant height (Table 2). These genotypes were further tested to assess their sugar yield-related traits. Field studies were performed during October 2010, post-rainy season and October 2013, post-rainy season in vertisols of ICRISAT, Patancheru, India, using a randomized complete block design (RCBD) with three replications. In both the years, fields were fertilized with di-ammonium phosphate (DAP) at the time of sowing and top dressed with urea after 21 days of planting (N-P-K: 90-40-0). Each entry was planted in two rows of four meters each. Spacing between the rows and in between the plants were 60 cm and 15 cm, respectively. Twenty randomly selected plants from each entry were used for observations on agronomic traits. Days to 50% flowering were recorded during crop growth. Plants were harvested at the time of physiological maturity and data were collected following methodology described in (Munirathnam et al. 2013) for the traits: plant height (m), stalk yield (t ha−1), juice yield (t ha−1), brix (%), and sugar yield (t ha−1). The juice was extracted from the clean stalks of sweet sorghum after stripping the leaves harvested from 1 m−2 area at physiological maturity by passing through the three-roller Wiley mill thrice and expressed in per ha basis. The sugar yield (t ha−1) is calculated as brix (%) X juice yield (t ha−1)/100.

Table 2 Combined analysis of variance for sugar yield-related traits in sweet sorghum panel evaluated on vertisols at ICRISAT-Patancheru during 2010 and 2013

Genotyping

DNA was extracted from 10-day-old seedlings of the 75 entries using the CTAB method described in Mace et al. (2003). Samples were genotyped with genotyping-by-sequencing (GBS) technology at the Institute of Genomic Diversity (IGD, Cornell University, USA, (Elshire et al. 2011). Sequences generated were processed through GBS pipelines using TASSEL version 3.0 to identify SNP markers (Glaubitz et al. 2014). BTx623 was used as reference genome (v1.4, (Paterson et al. 2009) in the pipeline to identify SNP markers on all chromosomes.

Data Analysis

The data on the measured quantitative traits were pooled over both the seasons, and analysis of variance (ANOVA), mean, standard error, least significant difference and coefficient of variation for the recorded traits were calculated using Genstat v.14.0 program (VSN International 2011). SNP data generated were used in TASSEL version 3.0 to generate a distance matrix among the entries. The corresponding similarity matrix was used inDARwinv5 software (Perrier et al. 2003; Perrier and Jacquemoud-Collet 2006) to construct a dendrogram using the neighbor-joining (NJ) method.

Conclusions

The study established the genetic relationship among the elite sweet sorghum lines and germplasm accessions with tropical and temperate adaptation. In order to develop heterotic hybrid combinations for sugar yield it would be appropriate to attempt distant crosses between sugar-rich lines from cluster I (ICSV 25311, ICSV 25316, ICSV 25300 and SSV 74) and high biomass yielding lines from clusters IIb and III (IS 8813, IS 16575, IS 16529, IS 18542). Future studies should assess large pool of lines with much larger data points to harness the potential of genomic selection to accelerate genetic gain for sugar yield and associated traits.