Introduction

Information on genetic diversity and population structure is a prerequisite for efficient breeding programs as it assists plant breeders in the selection of parents for crossing, providing a more rational basis for expanding the gene pool, and for identifying materials that harbor genes of value for plant improvement. Furthermore, knowledge of population structure of genetic resources is necessary for the development of strategies for appropriate conservation of genetic diversity. Therefore, in recent decades a primary activity in genetic resource management has been to characterize the structure of diversity within crop species (Brown 1989). Increasing attention has been given to dissecting and understanding diversity in relation to genes underlying important agronomic traits in a number of crops (Buckler and Thornsberry 2002). This includes several important crops such as maize (Zea mays L.), rice (Oryza sativa L.), wheat (Triticum aestivum L.), barley (Hordeum vulgare L.), and sorghum. Sorghum is an annual, predominantly autogamous cereal (Ollitrault et al. 1997) and was domesticated in northeastern Africa. Cultivated sorghum (Sorghum bicolor ssp. bicolor) has been grouped into five basic botanical races (bicolor, caudatum, durra, guinea, and kafir) and ten intermediate ones (Harlan and de Wet 1972). These races can be recognized based on their grain and spikelet/panicle morphology and can be traced back to their specific environments and the nomadic peoples that first cultivated them (Smith and Frederiksen 2000).

Sorghum is adapted to hot and dry climates as found in West and Central Africa (WCA) where it is difficult to grow most other food grains. African sorghum production is centered in the Savannah zone of WCA, where the grain of this crop is a major component of the diet for millions of people (Purseglove 1985). Apart from grains for food consumption, sorghum stover may also be used as building material or as cooking fuel and is often used to feed cattle in many West African countries (de Vries and Toenniessen 2001). Therefore, sorghum is a pillar of food security in the semi-arid zones of WCA. Worldwide it is the fifth most important cereal crop grown over an area of 43.8 million hectares (FAOSTAT 2007).

Because of its agronomic importance, there is a worldwide growing interest in characterizing the levels of genetic diversity present within sorghum using phenotypic traits, but also molecular markers (Djè et al. 2000; Grenier et al. 2000; Ghebru et al. 2002; Folkertsma et al. 2005; Deu et al. 2008). Among molecular markers most widely used in diversity studies, simple sequence repeats (SSRs) represent an ideal marker system due to their codominant inheritance, locus specificity, and multi-allelic character. Therefore, SSRs have been established as popular genetic markers for characterizing genetic variation in many plant species (Goulão et al. 2001). With the alignment of the sorghum genome sequence based on shotgun sequencing (Paterson et al. 2009), much information is available to characterize genes associated with both qualitative and quantitative variation. Association studies based on linkage disequilibrium (LD) i.e. the nonrandom associations of alleles at different loci (Flint-Garcia et al. 2003) have been performed to determine marker-trait associations in various crop plants such as oat, rice, maize, sea beet, barley (Kraakman et al. 2004), and sorghum (Casa et al. 2008). It is possible to identify genetic regions (if LD extends for the distance of several centiMorgans) or genes (if LD decays within a few thousand base pairs) associated with a particular trait of interest by genome-wide scans for genomic regions or by individual SNPs (single nucleotide polymorphisms) or SNP haplotypes within a candidate gene (Szalma et al. 2005). While performing these marker-trait association studies, ‘population structure’ may give rise to ‘spurious associations’ that is, associations between a phenotype and a marker that is not linked to any causative locus (Lander and Schork 1994). Therefore, knowledge of population structure and kinship in association studies is critical (Yu and Buckler 2006).

In WCA, photoperiod sensitivity of the local genotypes is a key adaptation trait as it assures flowering before the end of the rainy season independent of variable dates of sowing (Vaksman et al. 1996). Due to the diversity in production environments as well as large year-to-year variation in climatic conditions, formal sorghum breeding efforts in WCA that have emphasized on non-sensitive sorghum for “wide adaptation” have had limited success. Modern cultivars of sorghum with improved yield potentials may occupy specific niches in the production systems of this region, but they cannot be adopted across larger areas in WCA. The reasons for this were: firstly, the focus on early maturity in the wake of the drought periods in the 1980’s led to a removal of photoperiodic sensitivity from much of the breeding material in e.g. millet (Niangado 2001) and secondly, lack of adaptation of improved varieties to low-input conditions and climatic diversity, vulnerability to bird damage, and other biotic constraints (Omanya 2005). Hence, there is a need to incorporate variability from photoperiod sensitive sorghum to deal with these problems. Our objectives were to (1) analyze the population structure in a set of sorghum accessions from WCA differing in their photoperiod sensitivity as a pre-processing step for a candidate gene association study of genes involved in photoperiod pathway of flowering time, (2) assess the level of genetic diversity among these accessions, and (3) determine the magnitude of LD between pairs of SSR marker loci in these accessions.

Materials and methods

Plant material

Field experiments were conducted within the framework of a larger project aimed at assessing marker-trait associations for genes involved in photoperiod pathway of flowering time in sorghum. During the rainy season of 2007, a total of 219 cultivated inbred accessions of sorghum were grown at the regional research station of the International Crops Research Institute for the Semi-Arid Tropics (ICRISAT) at Samanko, Mali. The accessions were chosen to represent a broad phenotypic diversity of photoperiodic response to flowering and were grouped into early, intermediate, and late flowering accessions based on previous analyses. It was projected that the collection should represent all sorghum material adapted to the different growing regions in WCA. This included the countries from Western West Africa (drier Savanna) like Burkina Faso, Mali, Niger, and Senegal and countries like Nigeria, Cameroon and Chad Eastern West Africa (wetter Savanna). A small proportion of accessions (n = 10) also originated from non-West African countries such as China, Ethiopia, India, Malawi, Nepal, Sudan, USA, Zambia, and Zimbabwe. The material included basic races like bicolor, caudatum, durra, and guinea. Intermediate races included in the material were durra-caudatum, kaura which is a dominating sub-race of durra-caudatum in Northern Nigeria, and breeding products from guinea × caudatum inter-racial crosses. The guinea accessions were primarily obtained from ICRISAT’s guinea core collection and included a photoperiod insensitive converted guinea accession with a short growth habit (Folkertsma et al. 2005). The inter-racial guinea × caudatum breeding lines were obtained from the Rural Economy Institute (IER), in Mali, and were comprised of sterility maintaining (B) and fertility-restoring (R) parents of the hybrid breeding program.

Genotyping

DNA extraction and polymerase chain reactions

For each accession, a leaf sample was taken for DNA extraction from the field experiment described above. The leaf material was cleaned with distilled water and dried in an oven at 40°C for 3 days. Total genomic DNA was extracted from leaf tissue using a CTAB protocol (Saghai Maroof et al. 1984) with modifications. Polymerase chain reaction amplifications of the SSRs were performed in 10 μl reaction volumes consisting of 2 μl (20–25 ng) genomic DNA, 1.5 μM of MgCl2, PCR buffer with 1 × 1.5 μM MgCl2, 0.2 μM per dNTPs, 250 μM of each primer, and 0.5 U/10 μl of HotTaq DNA polymerase (Fermentas Life Sciences). Amplifications were performed using MJ Research Thermal Cyclers (MJ Research, Watertown, MA). The cycling protocol consisted of 35 cycles with an initial denaturing step of 10 min at 95°C followed by 34 cycles at 95°C for 1 min for denaturation, followed by primer-specific annealing for 1 min and extension at 72°C for 1 min. After 35 cycles, there was a final extension step of 10 min at 72°C. Amplified PCR products were denatured and size-fractioned using capillary electrophoresis on a MegaBACE sequencer (Amersham Biosciences, Sweden). The MegaBACE Fragment Profiler v1.2 (Amersham Biosciences, Sweden) was used to size peak patterns, using internal standard ROX 400 HD for allele calling.

Simple Sequence Repeat (SSR) Markers

A total of 27 SSR primer pairs detecting single-copy loci were used to genotype the sorghum accessions (Table 1). These primer pairs have been used in earlier studies (Brown et al. 1996; Taramino et al. 1997; Bhattramakki et al. 2000; Kong et al. 2000; Schloss et al. 2002; Ramu et al. 2009), or were more recently developed to provide broader genome coverage for sorghum diversity analyses and marker-assisted selection. They were selected primarily based on the genome coverage they provide and robustness of their allele calls across laboratories (http://www.sat.cirad.fr/sat/sorghum_SSR_kitsat.kit/) and secondarily on information content (Casa et al. 2005). Out of 27 SSR markers we used, nine were genomic, 14 were EST-based and for the remaining four, their exact genomic locations were not known. For each primer pair, forward primer was labeled with fluorescent dyes 6-carboxyfluorescein (FAM), terachloro-6-carboxyfluorescein (TET) or hexachloro-6-carboxyfluorescein (HEX) and reverse primers were left unlabeled (Biomers GmbH, Germany).

Table 1 Annealing temperatures, the dye used for each primer pairs, the sequence of primers, and chromosomal location of each SSR locus used in genotyping sorghum accessions

Statistical analyses

The average number of alleles per locus and the number of group-specific alleles were determined for the germplasm groups and for various subsets within this collection. We calculated the allelic richness (R s ) for the races with more than ten accessions using the rarefraction method (Kalinowski 2004) by randomly sampling 10 genotypes 1,000 times and estimating the mean number of alleles per locus. The total gene diversity (H e ) was decomposed into gene diversity estimates between individual lines within each germplasm group according to Nei (1987):

$$ H_{e} = n/n-1\left( {1 - \sum {P_{i}^{2} } } \right) $$

where, P i is the frequency of the ith allele, and n is the number of samples. Confidence intervals for gene diversity estimates were obtained by a bootstrap procedure with resampling across markers.

The polymorphic information content (PIC) for each marker was determined using the following equation:

$$ {\text{PIC}} = 1 - \sum\limits_{i = 1}^{n} {P_{i}^{2} } - 2\left[ {\sum\limits_{i = 1}^{n - 1} {\sum\limits_{j = i + 1}^{n} {P_{i}^{2} P_{j}^{2} } } } \right] $$

where P i is the frequency of the ith allele, and n is the number of alleles (Botstein et al. 1980).

A model-based approach implemented in the software package STRUCTURE (Pritchard et al. 2000) was used to determine the presence of population structure and assign sorghum accessions to subgroups. In our investigations, the set of 219 entries was analyzed by setting the number of subgroups from 1 to 20 with five runs, using the method allowing for the admixture, correlated allele frequencies and no recombination information. For each run of STRUCTURE, the burn-in time as well as the iteration number for the Markov chain Monte Carlo algorithm was set to 100,000. Evanno et al. (2005) criteria were used to decide about the best K value. Accessions with membership probabilities less than 0.80 for both subgroups were assigned to a mixed group (cf. Stich et al. 2005). Fisher’s exact test was performed in order to test disequilibrium between earliness groups of the accessions and the subgroups identified by STRUCTURE.

A permutation test using a Monte Carlo procedure was applied to test for LD between pairs of SSR loci on a genome-wide scale (Lewis and Zaykin 2002; Weir 1996). An adaptive permutation procedure was used to reduce the computational effort. If the P value was smaller than 0.3 after 2,500 permutations, another 14,500 permutations were performed. If the P value was smaller than 0.075, another 933,000 permutations were performed. LD between pairs of polymorphic loci mapped on the same chromosome (intrachromosomal comparisons) and on different chromosomes (interchromosomal comparisons) as well as r 2 (square of correlation of the allele frequencies) was evaluated for the subgroups identified by STRUCTURE. Since the power of the test for LD depends on the number of accessions analyzed per group, we used a resampling strategy to obtain comparable estimates: random samples were drawn from the group with the higher number of lines using a sample size equal to the number of lines of the smaller group. This procedure was repeated 25 times and the results were averaged (cf. Stich et al. 2005). Genetic diversity analyses was performed with software plabsoft (Maurer et al. 2004), which is implemented as an extension of the statistical software R (Ihaka and Gentleman 1996).

Results

Description of genetic diversity

Genotyping of the 219 cultivated sorghum accessions from West Africa with 27 single-copy SSR markers identified a total of 523 alleles. The number of alleles per locus ranged from 3 to 44, with an average of 19 alleles per locus (Table 2). The PIC estimates for the 27 SSR markers ranged from 0.13 to 0.94, with an average of 0.70. The highest number of alleles per locus was detected for SSR Xtxp141 (A t  = 44). It was observed that SSRs with di-nucleotide repeats detected higher numbers of alleles than those with more than two repeated nucleotides. The mean number of alleles per locus for genomic SSRs (A t  = 23) was higher than for EST-based SSRs (A t  = 16), but was only slightly higher than that of the SSRs of unknown genomic location (A t  = 24). Mean polymorphic information content for the genomic SSRs (PIC = 0.74) was higher than for both SSRs of unknown genomic location (PIC = 0.70) and EST-based (PIC = 0.67) SSRs.

Table 2 Type of SSR marker, nucleotide repeat units, number of observations (N), number of alleles detected per locus (A t ), and PIC values for 27 SSRs in sorghum accessions from WCA

It was observed that guinea race accessions had the highest number of alleles per locus (14.8) compared to the other basic races in this study. Basic race caudatum and its intermediate races with guinea and durra showed between 5.23 and 7.18 alleles per locus (Table 3). Accessions belonging to basic race kaura showed 3.55 alleles per locus. Gene diversity for race guinea was higher than other basic races (H e  = 0.67). Race kaura showed lowest gene diversity among all the races (H e  = 0.53). Gene diversity estimates for races caudatum and durra were found to be similar (H e  = 0.56). Guinea-caudatum intermediate race accessions showed slightly higher gene diversity (H e  = 0.67) than durra-caudatum race accessions (H e  = 0.60), but similar to that of basic race guinea accessions. Mean allelic richness was found highest for race guinea (4.33) followed by races durra-caudatum (3.80), guinea × caudatum-bred (3.78), and caudatum (3.44).

Table 3 Genetic diversity estimates according to different sorghum races

STRUCTURE analysis

The STRUCTURE simulation demonstrated that LnP(D) value showed no clear peak for K values between 1 and 20 (data not shown). Thus, it was difficult to determine the true K (number of subgroups) using that measure. The magnitude change of LnP(D) relative to the standard deviation, called ΔK (Evanno et al. 2005), showed its highest peak at K = 2 (Fig. 1; Table 4). We also checked the structure patterns in 10 repeats of each K value from 2 to 9 (data not shown). Figure 2 provides a graphic representation of the estimated membership coefficients to the two clusters for each individual identified by a color, obtained by running STRUCTURE, setting K = 2. A clear and stable population structure was found with this approach. Subgroup one consisted of 64 and subgroup two of 109 accessions; 46 accessions had for both subgroups membership probabilities of less than 0.80, and thus, were assigned to a mixed group. After coding the sorghum accessions from each STRUCTURE subgroup by their race and maturity class (Fig. 2) it was revealed that subgroup one was dominated by the race guinea primarily belonging to early to intermediate maturity classes. The result of fisher’s exact test indicated that there were significantly (alpha = 0.06) more early genotypes in subgroup one than in subgroup two. Unlike subgroup one, subgroup two was rather mixed involving all the major accessions of sorghum belonging to intermediate to late maturity classes. After coding the accessions by their geographical origin (Fig. 3) it was found that in subgroup one, the accessions mainly originated from Burkina Faso and Mali and in subgroup two, the majority of the accessions originated from countries such as Nigeria, Niger and Mali.

Fig. 1
figure 1

Magnitude of ΔK for each K value

Table 4 Values of ΔK for each value of K
Fig. 2
figure 2

Summary plot of estimated population structure of 219 inbred sorghum accessions for K = 2. The probability of membership of every accession to each population is sorted by membership coefficients and identified by color. Each accession is represented by a vertical column divided into K colors. G = Guinea, C = Caudatum, GC = Guinea-caudatum, GC-bred = Guinea × Caudatum bred, CD = Caudatum × Durra, K = Kaura, C-bred = Caudatum bred; B = Bicolor; D = Durra; M = GM = Margaritiferum subgroup of basic race Guinea. Race and maturity class and the STRUCTURE subgroups are shown. The letters at the bottom of each bar denote to the maturity class of each accession where E: early, I: Intermediate, and L: late maturing

Fig. 3
figure 3

Distribution of major races of sorghum accessions among the subgroups identified by STRUCTURE. Country names are given below each bar, BF: Burkina Faso, SEN: Senegal, NIG: Nigeria CAM: Cameroon, WCA: Western Central Africa, TAN: Tanzania, and ZIM: Zimbabwe

Extent of linkage disequilibrium among SSR loci

Linkage disequilibrium was assessed for all combinations of SSR loci. An overview of the percentage of loci pairs in LD and magnitude of r 2 values with regard to the subgroups is given in Table 5. The percentage of locus pairs in significant (P < 0.05) LD was slightly higher for interchromosomal combination than intrachromosomal combinations when the complete set of 219 accessions was considered. The mean r 2 value was slightly higher for intrachromosomal combinations than interchromosomal combinations. The percentage of interchromosomal locus pairs in significant LD for the subgroups ranged from 16.6 to 49% and was higher than intrachromosomal LD values for all subgroups. The r 2 values for the subgroups ranged from 0.017 to 0.026. Subgroup two showed threefold more intrachromosomal and twofold more interchromosomal LD than did subgroup one. But subgroup one, showed based on r 2 values higher intra- as well as interchromosomal LD than subgroup two.

Table 5 Number of sorghum accessions and percentage of SSR locus pairs in significant (P < 0.05) LD within sorghum subgroups identified by STRUCTURE

Discussion

Genetic diversity

The average number of alleles per locus detected in our study (A t  = 19) was higher than found in previous studies by Djè et al. (1999), Ghebru et al. (2002), and Deu et al. (2008) (A t  = 10.4, 11, and 13.9, respectively). Possible reasons for this were, firstly, that in our study the material was compiled in such a way that it represented diverse sorghum materials adapted to the different agro-ecological conditions in West Africa. Furthermore, our study included a higher proportion of guinea race, which showed higher genetic diversity- compared to the study of Folkertsma et al. (2005) using Southern and Eastern African guinea germplasm which has been reported to be less diverse than West African guinea (de Wet et al. 1972; Doggett 1988). Secondly, among the SSRs we used, there were more di-nucleotide microsatellites (above 50%) than the studies of Djè et al. (1999) and Deu et al. (2008). Furthermore, we used a large proportion of Xgap and Xtxp series SSRs that were isolated from genomic libraries (Brown et al. 1996; Bhattramakki et al. 2000; Kong et al. 2000). Therefore, these markers are more likely to include non-coding regions and hence they detected more alleles (A t  = 23) leading to a higher average number of alleles per locus. The average gene diversity found in our study (H e  = 0.70) was slightly higher than reported by Deu et al. (2008) for 484 sorghum varieties from Niger (H e  = 0.60), and slightly lower than reported by Ghebru et al. (2002) for 28 Eritrean sorghum landraces (H e  = 0.78), and Djè et al. (1999) for 120 Moroccan sorghum landraces (H e  = 0.84). One reason for the higher H e values found in our study compared to Deu et al. (2008) may be due to the fact that the accessions of our study covered entire WCA, whereas sorghum collection of Deu et al. (2008) was limited to Niger. Furthermore, Deu et al. (2008) used two more Xcup SSRs, which are assumed to be located within or near genes, and indeed revealed fewer alleles (4.63 alleles per locus). In addition, the collection of Deu et al. (2008) had a higher percentage (24%) of accessions of race durra (which showed lowest gene diversity (H e  = 0.39)), compared to our study (1.4%), and therefore this might have resulted in lower diversity values. Another possible reason for higher gene diversity found in the study of Ghebru et al. (2002) in comparison with our study may be that Eritrean sorghums (mainly caudatum and durra) have been investigated which seem to be exceptionally polymorphic because (1) most of the sorghum is maintained in the field as farmers’ varieties, (2) they show high levels of outcrossing and heterozygosity, (3) and little selection is practiced by the farmers leading to lower level of allelic fixation. Therefore, Eritrean sorghum exhibits more size variation within 28 landraces than observed in hundreds of sorghum landraces of the world collection (Ghebru et al. 2002). Compared to the study of Deu et al. (2008) allelic richness for the race guinea in our study was higher (R s  = 4.33) but lower for the race caudatum. Based on the gene diversity estimates in our study, race guinea was the most diverse and race kaura was the least diverse race than other basic races.

Although we have found a high degree of genetic diversity present among accessions of sorghum in our study that may be similar to the above-mentioned studies, no direct comparison between the studies can be made, because estimates of gene diversity vary depending on the sampling schemes (single plant or bulked DNA), number and type of SSRs used, numbers of repeats of those SSRs, and their location in the genome i.e., coding or non-coding DNA regions (Deu et al. 2008). We detected 3–44 alleles per SSR locus in our analyses. These differences between numbers of alleles per locus detected among different races by SSRs can also be due to the mechanisms that generate variability for these different categories of DNA markers, e.g. point mutation and insertion-deletion for all DNA markers, plus polymerase slippage and/or unequal crossing-over for microsatellite markers (Grenier et al. 2000).

Population structure

Using the computer software STRUCTURE to infer population structure and employing Evanno’s method to estimate the number of subgroups, we found that our sorghum material was divided into two subgroups (Fig. 2). The STRUCTURE approach we used is different from a distance-based phylogenetic algorithm, such as neighbor joining used in previous studies (e.g. Deu et al. 2008), as STRUCTURE uses a model-based Bayesian clustering approach to assign individuals to the subgroups. The clusters identified using distance-based algorithms may be heavily dependent on both the distance measure and graphical representation chosen. Also, it is difficult to incorporate additional information such as the geographic sampling locations of individuals (Pritchard et al. 2000).

Between the two subgroups detected by STRUCTURE in our study, subgroup one mostly included accessions of race guinea (83%) while subgroup two showed a much lower proportion of guinea race accessions (28%) and additionally included other major races of our study such as caudatum and caudatum intermediate accessions (durra caudatum and guinea caudatum) and accessions of race durra and kaura. Similar observations were made in previous studies (Djè et al. 2000; Folkertsma et al. 2005) which reported that, in most cases, guinea-race sorghum accessions can clearly be distinguished from all other races. Similar to subgroup two, lack of clear distinction among races of sorghum was observed by Morden et al. (1989) and Djè et al. (2000) where 9–14% of the total variation detected was between sorghum races, which made it apparent that the races in subgroup two of sorghum were not substantially differentiated genetically. Another important observation was that Fisher’s exact test revealed that there are significantly (alpha = 0.06) more early maturing accessions in subgroup one than subgroup two. This can partly explain the structuring we found among sorghum accessions of our study. Coding sorghum accession by their country of origin (Fig. 3) showed clearly that subgroup one contained mainly guinea accession from western WCA such as Mali and Burkina Faso. The majority of these accessions showed early and intermediate maturity. In the case of subgroup two, countries as well as race-wise patterns were apparent. The accessions from countries like Nigeria and Niger were predominant followed by accessions from Mali. The accessions from Nigeria distinctively included mostly late maturing intermediate races like durra-caudatum and guinea-caudatum and sub-race kaura. The accessions originating from Niger included all caudatum accessions which were all early maturing and the Malian accessions included late maturing guinea accessions. Therefore, we can conclude that for the subgroups in our study race as well as geographic origin seems to be important for the observed structuring pattern of the accessions.

Linkage disequilibrium

In our study the percentage of locus pairs in significant LD (P < 0.05) was higher (Table 5) than reported by Hamblin et al. (2004) using RFLP markers with 8.7% of locus pairs in significant LD for 27 sorghum accessions. The reasons for this difference may be, firstly, that RFLP markers have a lower power to detect LD because they detect lesser alleles than SSR markers, therefore, much power is lost to detect LD (see Sham et al. 2000). Secondly, we surveyed a much larger number of accessions (217) than Hamblin et al. (2004). Furthermore, the accessions of the study of Hamblin et al. (2004) involved some wild sorghum accessions, which are generally higher outbreeding (Doggett 1988) and could have caused a downward bias of LD estimates.

The extent to which linked sites will have a correlated evolutionary history is a function of effective population size and recombination rate, both of which are affected by the mating system (Nordborg 2000). Sorghum is a predominantly self-fertilizing species (estimates of out-crossing range from 2 to 73% depending on panicle type and environment (Rooney and Smith 2000; Barnaud et al. 2008), and is therefore expected to show higher levels of LD than out-crossing species like maize (Nordborg 2000). Consistent with the predicted effects of self-pollination and reduced effective population size, sorghum showed a greater extent of LD than maize. However, the mating system alone cannot be a simple predictor of high levels of LD—barley is highly self-pollinating but has similar diversity and LD as maize (Morrell et al. 2003). Non-equilibrium processes like population structure, history, and selection may explain this phenomenon.

We found in our study that subgroup two exhibited a higher level of intrachromosomal as well as interchromosomal LD than subgroup one based on the percentage of loci pairs in significant LD. The elevated levels of LD in subgroup two based on loci pairs in significant LD can be due to the higher number of accessions investigated in subgroup two (109) compared to subgroup one (64). A similar observation was made by Malysheva-Otto et al. (2006) in cultivated barley germplasm, where the global collection of 953 barley accessions showed higher intrachromosomal as well as interchromosomal LD (100 and 98%, respectively) than a subset of 207 European 2-rowed spring barley accessions (45 and 42%, respectively). However, subgroup two in our study—even after resampling with 46 accessions (data not shown)—showed higher levels of both interchromosomal and intrachromosomal LD than subgroup one resampled with 46 accessions as well (data not shown). Higher interchromosomal LD for both subgroups showing relatedness may be the cause for LD between SSR loci (see Stich et al. 2006). The r 2 values in our study were comparable to the ones reported for maize and sorghum with SSRs/RFLPs (Hamblin et al. 2004; Remington et al. 2001). In our study, higher mean r 2 values of subgroup one than subgroup two indicate that an increase in LD can occur for accessions within subpopulations with similar genetic background (subgroup one) in contrast to the populations with different genetic backgrounds (subgroup two). This higher LD in subgroup one compared to subgroup two may have been the result of West African farmers’ strong preference to select (Rafalski and Morgante 2004) early-to-intermediate maturing sorghum accessions over late maturing ones. Sorghum accessions selected for earliness thus ensure that important growth stages of the crop are completed before the rainy season is over and the risk of crop failure is minimized.