Introduction

Mesoamerica is one of the main centres of the origin of agriculture and domestication of plants of the world (Vavilov 1926; Ranere et al. 2009; Zizumbo-Villarreal and Colunga-GarcíaMarín 2010). One of the most important cultural areas of Mesoamerica is the Mayan region which has been divided into two subareas: (1) The Mayan lowlands, which includes the Yucatan Peninsula (Yucatan, Campeche and Quintana Roo) and the state of Tabasco in Mexico, the region of Peten in Guatemala and Belize; and (2) the Mayan highlands, which includes the state of Chiapas in Mexico, Guatemala (except Peten) and the western part of Honduras and El Salvador (Ruz 1981; Garza et al. 1996; Sharer 1999). Within the Mayan area, evidence has been found about domestication of regionally important species such as jocote (Spondias purpurea L; Miller and Schaal 2005), scarlet runner bean (P. coccineus L.; Spataro et al. 2011) and Lima beans (P. lunatus L.; Andueza-Noh et al. 2013).

The Mayan area possesses a high richness of plant genetic resources because many farmers still practice the traditional Mesoamerican agricultural system called milpa (Colunga-GarcíaMarín and May-Pat 1992; Zizumbo-Villarreal and Colunga-GarcíaMarín 2010). However, in recent decades, the Mayan milpa has undergone a process of intensification that has placed the system in crisis. As a result, some crops of great importance to Mayan farmers, such as the Lima beans landraces are at risk of genetic erosion (Martínez-Castillo et al. 2012).

Lima beans are the second most important cultivated species of Phaseolus in the world. It includes two botanical varieties: P. lunatus var. silvester Baudet and P. lunatus var. lunatus (Baudet 1977). Recent studies indicate the existence of three major gene pools within the species: the Andean (A), Mesoamerican I (MI) and Mesoamerican II (MII), all of which contain wild and domesticated forms (Serrano-Serrano et al. 2010, 2012; Martínez-Castillo et al. 2014). Lima beans cultivated in the Yucatán Peninsula are part of the MII gene pool, which constitutes the domesticated forms of Lima beans with the highest richness in the whole Mexico (Ballesteros 1999). Molecular studies have shown that the cultivated gene pool of Lima beans in the Yucatan Peninsula have high levels of genetic diversity (Martínez-Castillo et al. 2008, 2012). While the genetic diversity of Lima beans of the Mayan lowlands is well documented, information on Lima beans grown in the Mayan highlands is still poor, and only germplasm has been collected (Debouck 2011). Here we hypothesized that Lima bean of the Mayan area have a high genetic structure. Genetic diversity is greater in the Mayan lowlands than in the Mayan highlands, and there is a pattern of clustering based on these two Mayan subareas.

The inter-simple sequence repeats (ISSR) allows detection of polymorphism without previous knowledge of DNA sequences. This is a dominant molecular marker that involves amplification of DNA using a single primer composed of a microsatellite (SSR) sequence (Zietkiewicz et al. 1994). The ISSR markers are rapid, simple and cheap for assessing genetic diversity, structure, and relationships among cultivars (Martins et al. 2003; González et al. 2005; Culley et al. 2007). In this study, ISSR markers were used to analyze the structure and genetic diversity of the domesticated gene pool of Lima beans throughout the entire Mayan area to generate basic information for designing conservation policies to the species and its varieties in this important cultural region.

Materials and methods

Plant material and DNA extraction

Sampling included 46 accessions (23 from the Mayan lowlands and 23 from the Mayan highlands) (Fig. 1) from the collection of the International Center for Tropical Agriculture (CIAT, Cali, Colombia) and from the collection of Dr. Martínez-Castillo held at the Centro de Investigación Cientifica de Yucatan (CICY). Most accessions of Dr. Martínez-Castillo were collected directly from farmers either from milpas or barns; some were purchased on local markets. Data on the origin of the CIAT accessions were only available for few accessions (Table 1). Ten seeds of each accession were germinated in a greenhouse at the CICY. The DNA was extracted from five seedlings of each accession using the protocol of Dellaporta et al. (1983), and the quality was verified by electrophoresis in 1% agarose gel with 1X Tris–borate-EDTA (1X TBE: 100 mM Tris–borate, pH 8.0, 2 mM EDTA) buffer and staining with ethidium bromide.

Fig. 1
figure 1

Geographical distribution of Lima bean landraces collected in the Mayan area

Table 1 Geographic location of 23 cultivated accessions of the Mesoamerican gene pool of P. lunatus in the Mayan area included in the study

ISSR analysis

The inter-simple sequence repeat (ISSR) technique was used following the work by Martínez-Castillo et al. (2008). Each ISSR band was considered as an independent locus and polymorphic bands were scored as absent (0) or present (1) for all samples. Four primers were used: 15 (GACA)3 RG, 16 YR (GACA)3, 30 (GACAC)3 AG and 32 (CACAC)3 RG and 73 ISSR loci were generated. PCR amplification was performed in a total reaction volume of 20 μL containing 10 mM Tris–HCl (pH 9.0), 50 mM KCl, 2 mM MgCl2, 200 μM dNTP, 1 μM of primer, 1 unit of Taq polymerase (Promega, Madison, WI), and 20 ng of template DNA. Amplification was performed in a GeneAmp PCR System 9700 (Applied Biosystems, Foster City, CA, USA), under the following conditions: 4 min at 94 °C for initial denaturing; followed by 35 cycles of 2 min at 94 °C, 1 min at 42 or 54 °C (depending on the primer), and 2 min at 72 °C; and a final extension of 5 min at 72 °C. Formamide (4 μL) containing 0.45% bromophenol blue and 0.25% xylene-cianol was then added to each reaction mixture, then 4 μL of the reaction mixture was loaded onto 6% nondenaturing 30:1 bis-acrylamide gels (320 × 380 × 0.4 mm) containing 3 M urea and 1X TBE buffer (Zietkiewicz et al. 1994). A 123-bp molecular marker standard was included in each gel. Electrophoresis was carried out at 300 V (SQ3 Sequence Hoeffer), and the products of the amplification were visualized with silver staining (Bassam et al. 1991).

Data analysis

Genetic structure

The genetic structure was analyzed through two methods: (1) Wright’s genetic differentiation index (F ST) obtained with the program POPOGENE ver. 1.31 (Yeh and Boyle 1999); (2) a hierarchical analysis of molecular variance (AMOVA) considering three levels: (a) the whole Mayan area (including all accessions), (b) between Mayan subareas (23 accessions from the Mayan lowlands and 23 from the Mayan highlands) and (c) between accessions in each subareas (including 115 individuals for each subarea), using the program Arlequin 2.0 (Excoffier et al. 2006). Long term Gene flow (Nm) was also calculated using the program POPGENE ver. 1.31 (Yeh and Boyle 1999).

Genetic relationships

To analyze the genetic relationships among landraces of the Mayan area, the genetic data were analyzed using Bayesian clustering approaches as implemented in the program STRUCTURE (Pritchard et al. 2000) to infer the number of K groups (i.e., populations or accessions). We used the admixture model with uncorrelated allele frequencies. Ten independent simulations were run for each value of K ranging from K = 1 to K = 5. The analyses were made with replications for a burn-in period of 10,000 and a run length of 100,000. With the results generated, the optimal value of K was determined according to Evanno et al. (2005) using the program Structure Harvester v. 0.6.93 (Earl and VonHoldt 2012). Finally, the graphs of ancestry for the optimum value of K were generated using STRUCTURE. Thereafther a dendrogram with the neighbor joining method (NJ) as implemented with the programs NEIGHBOR and CONSENSE in the Phylip package (Felsenstein 2005) was performed. The NJ was constructed using Nei’s genetic distance (D) (Lynch and Milligan 1994) obtained with the program AFLP-SURV ver. 1.0 (Vekemans 2002) with 1000 bootstrap replicates. The dendrogram was visualized using the program TREEVIEW (Page 1996). Finally, we carried out a principal coordinate analysis (PCoA) using the program GenAlex 6 (Peakall and Smouse 2006). This technique allows us to explore the genetic structure of the sample data without a priori criteria, using each allele as an independent variable.

Genetic diversity

The ISSR is a dominant molecular marker, and unlike for codominant markers, estimating allele frequencies poses some difficulties (Lynch and Milligan 1994). These problems have been solved in numerous studies using Bayesian methods (Zhivotovsky 1999); and better estimators for analyzing dominant markers (Lynch and Milligan 1994; Krauss 2000). In this study, genetic diversity was estimated at two geographic levels: (a) Mayan area and b) Mayan subareas. To avoid common problems associated with the analysis of dominant data (Lynch and Milligan 1994; Culley and Wolfe 2001), we did not use Hardy–Weinberg equilibrium (HWE) in the analyses because domesticated populations of P. lunatus from the Yucatan Peninsula are known to deviate from Hardy–Weinberg equilibrium with co-dominant microsatellite markers (Martínez-Castillo 2005). Assumptions included absence of co-migration of alleles from different loci, alleles shared by two individuals descended from a common ancestor and each locus consisted of only two alleles that follow Mendelian inheritance (Martínez-Castillo et al. 2008). The polymorphic loci percentage (%P) was calculated with the program POPGENE ver. 1.31 (Yeh and Boyle 1999). Although Hardy–Weinberg equilibrium was not considered applicable in this study, on the basis of the results of Kremer et al. (2005), we decided to evaluate average heterozygosity (H BAY) using the Bayesian approach proposed by Zhivotovsky (1999). This estimator was obtained with AFLP-SURV ver.1.0 program (Vekemans 2002). Paired Student t tests were performed to compare the diversity values (H BAY) between the two Mayan subareas (α = 0.05) using the program Statgraphics ver. 15.2.06 (StatPoint 2006).

Results

Genetic structure

Genetic structure in whole Mayan area and in its two subareas is shown in Table 2. The Mayan area presented high genetic differentiation (F ST = 0.66, AMOVA: 65% of total variation was found between accessions). At the subarea level, the Mayan lowlands presented higher genetic structure (F ST = 0.63) and lower gene flow (Nm = 0.25) than the Mayan highlands (F ST = 0.53, Nm = 0.34). Values for genetic differentiation between subareas and between accessions within each subarea were high (30 and 34%, respectively; Table 3).

Table 2 Estimators of structure (F ST) and gene flow (Nm) of Mesoamerican gene pool of P. lunatus cultivated in the Mayan area, using 75 ISSR loci
Table 3 Analysis of molecular variance (AMOVA) applied to Mesoamerican gene pool of P. lunatus cultivated in the Mayan area using 75 ISSR loci

Genetic relationships

The Bayesian approach indicated that the optimum K was 2, indicating the existence of two genetically distinct groups (Fig. 2). The bar graphs in Fig. 3 show the coefficients of membership for each of the 46 accessions analyzed. Individuals of the Mayan lowlands belonged to one group and individuals of the Mayan highlands formed the other group. In Fig. 3, also can be show the existence of some accessions with some degree of ancestry from both groups.

Fig. 2
figure 2

Estimating number of subpopulations using delta K values for K ranging from 1 to 5 using method proposed by Evanno et al. (2005)

Fig. 3
figure 3

Tests of assignment of individuals showing clustering of 46 accessions of Lima bean based on analysis of genotypic data using STRUCTURE. Each accession is represented by a vertical bar

Figure 4 shows the general pattern of grouping based on the two Mayan subareas. One group included 19 of the 23 accessions collected in the Mayan lowlands and had high bootstrap support (88%). The other group included 23 accessions collected in the Mayan highlands and 3 accessions belonging to Mayan lowlands with bootstrap support (79%). Six accessions were not grouped based on their geographical origin: accession G25240 collected in the Mayan highlands grouped with the Mayan lowlands accessions and the Bacalar, Trapichito, JMC 1030, JMC 1033 and JMC 105 accessions collected in the Mayan lowlands grouped with the Mayan highlands accessions.

Fig. 4
figure 4

Neighbor joining (NJ) analysis of 46 populations of Phaseolus lunatus cultivated in the Mayan area. Numbers at the nodes indicate percentage bootstrap support

The principal coordinate analysis (PCoA; Fig. 5) showed a pattern consistent with the NJ results. The presence of two groups were observed, with most individuals collected in the lowlands (gray triangles) grouped closer together than with individuals collected in the highlands (black squares). PCoA showed a greater dispersion of individuals in the highlands than in the lowlands, and some individuals of the Mayan lowlands were clustered with individuals collected in the highlands. These PCoA results were supported by 70 and 18% of the total variation accumulated in axis 1 and 2, respectively (Fig. 5).

Fig. 5
figure 5

Principal coordinate analysis (PCoA) of the populations of Phaseolus lunatus collected in the Mayan area. Black squares: highland populations, gray triangle: lowland populations

Genetic diversity

The results generated by the structure (Figs. 2 and 3), NJ and PCoA analyses supported the existence of the two groups corresponding to the Mayan subareas. Based on this information, we analyzed the genetic diversity for all the Mayan area and within each of the two subareas.

The four primers ISSR generated 75 loci, which were all polymorphic. Primer 30 (GACAC)3 AG, yielded the highest number of loci (21). The Mayan area level had 100% of the polymorphic loci (%P) and high levels of genetic diversity (H BAY = 0.44). At the Mayan subarea level, the percentage of polymorphic loci was slightly higher in the Mayan highlands than in the Mayan lowlands (%P = 100% and 97.33%, respectively). The estimator H BAY was higher in the Mayan lowlands than in the Mayan highlands (0. 44 and 0.36, respectively; Table 4), but the difference was not statistically significant.

Table 4 Estimators of genetic diversity of P. lunatus cultivated in the Mayan area measured at three levels

Discussion

Genetic structure of Lima beans landraces from the Mayan area

Nm estimates indicate that the average number of successful immigrants per generation should be estimated in one to favour the reduction of population structure (Slatkin and Barton 1989). The high genetic structure observed in the Mayan area can be a consequence of the low level of long term gene flow found (Nm = 0.29). It is generally accepted that gene flow is an important component of population structure (Slatkin 1994). Maquet et al. (1996) reported high genetic differentiation (G ST = 0.803) and very low gene flow (Nm = 0.056) between wild populations of Lima bean distributed throughout the Central Valley of Costa Rica. Martínez-Castillo et al. (2007) reported Nm = 0.28 between wild and cultivated populations of Lima beans from the Yucatan Peninsula. Castiñeiras et al. (2007) using AFLP markers reported G ST = 0.23 for cultivated accessions in three regions from Cuban home gardens. Montero-Rojas et al. (2013) using SSR molecular markers reported G ST = 0.036 in 50 landraces collected directly in Haiti, Puerto Rico and Dominican Republic. With the exception of the results of Montero-Rojas et al. (2013), our results are consistent with those previously reported, indicating the existence of high genetic structure within the cultivated gene pool of Lima beans. The mating system and life history traits of a species are usually highly associated with differences in G ST values (Hamrick et al. 1991). Hamrick et al. (1991) classified 473 taxa using eight characters of the species including life history traits, geographic range, mating system, mechanism of seed dispersal and reproductive method and reported G ST = 0.51 for autogamous species. These results indicate that the high levels of genetic structure found in Lima beans is due to its predominantly autogamous reproductive system (Maquet et al. 1997; Webster et al. 1979).

Genetic differentiation was high between the two Mayan subareas and between accessions within each subarea. The lowland accessions presented a higher genetic structure than the highland (F ST = 0.63 vs. 0.53, respectively), which could be explained by the difference between gene flow levels found in the highland (Nm = 0.34) and lowland (Nm = 0.25) subareas (Table 2). Another factor that could explain the genetic structure observed within each subarea is the presence of landraces from the Papa and Sieva cultigroups in the Mayan lowlands (Martínez-Castillo et al. 2004) and landraces from the sole Sieva cultigroup in the Mayan highlands. Fofana et al. (1997) indicated that both cultigroups are involved in a recent process of sympatric speciation that eventually will generate a greater degree of genetic differentiation among them. The level of genetic differentiation found in the Mayan lowlands in the present study was higher than the value reported by Martínez-Castillo et al. (2007) F ST = 0.63 versus G ST = 0.47 respectively. The discrepancy between current and previous results can be explained by the differences such as the molecular marker used; Martínez-Castillo et al. (2007) used SSR markers, and we used ISSR markers. The ISSR markers have a higher level of polymorphisms than do the SSR markers in landraces of Lima bean (Martínez-Castillo et al. 2008). In addition, more accessions and loci were analyzed in the present study.

Genetic relationships among landraces of Lima bean

The analysis of genetic relationships, with some exceptions, showed a general grouping pattern according to the two Mayan subareas. Exceptions to this pattern of clustering were five accessions that belonged to the lowlands and were grouped with accessions from the highlands of Guatemala and Chiapas. Even though there are no studies on the seed flow between lowland and highland Mayan communities, the discrepancies observed in five accessions could have been common in recent decades as a result of the displacement of Mayan peasants from the highlands (Chiapas and Guatemala) to the lowlands, promoting the flow and exchange of seeds among farmers from each subarea. Martínez-Castillo et al. (2004) reported a similar scenario in a study of the genetic diversity of landraces of Lima bean in the Yucatan Peninsula.

The clustering pattern observed using NJ and PCoA was supported by the high level of genetic structure and low levels of gene flow observed in the wide Mayan area. These results are similar to those found by Martínez-Castillo (2005), who used SSR markers and found that the wild and domesticated populations of P. lunatus present in the Yucatan Peninsula are grouped according to the region where they were collected. This pattern was concluded to be the result of low levels of gene flow and an isolation by distance.

Genetic diversity of Lima bean landraces grown in the Mayan area

Considering the hypothesis that the Mayan area is an important center of genetic diversity for Lima bean, we need to evaluate our results in relation to those reported by others, but with caution given that these other studies used different methodologies (e.g., estimates of diversity, molecular markers, sampling methods). Hamrick et al. (1991) in a review of the diversity and genetic structure of 473 taxa reported mean H E = 0.149 and H E = 0.12 for species with a mixed mating system and for short-lived perennials respectively. Using RAPD markers, Nienhuis et al. (1995) reported H E = 0.11 for 35 accessions belonging to Mesoamerican landraces obtained from CIAT germplasm bank. Maquet et al. (1997) using allozymes reported H E = 0.26 for a base collection of P. lunatus from the CIAT germplasm bank. Lioi et al. (1998) reported H E = 0.13 for 60 cultivated accessions that included both Mesoamerican and Andean landraces. Castiñeiras et al. (2007), using AFLP markers, reported H T = 0.13 for 65 cultivated accessions from Cuba. Montero-Rojas et al. (2013) reported H E = 0.337 and H O = 0.403 for 50 landraces collected directly from Haiti, Puerto Rico and the Dominican Republic.

Comparing our results with the studies referred to above, we consider that the Mayan area had high levels of genetic diversity. However, it is important to remember that some studies (Nienhuis et al. 1995; Maquet et al. 1997; Lioi et al. 1998) used accessions from seed banks, and management of germplasm held in seed banks can induce high levels of genetic erosion, starting with the sampling process and continuing with the seed regeneration (Gómez-Campo 2006; Maquet et al. 1997; Parzies et al. 2000). In our study, all accessions analyzed from lowland were collected directly from the farmers’ milpas, which tend to favor the conservation of higher genetic diversity levels and could explain our high values. The importance of using accessions collected in situ for studies of genetic diversity is also reflected in the work of Montero-Rojas et al. (2013) and Andueza-Noh et al. (2013) who reported high levels of genetic diversity in landraces from the Carribean and throughout Mesoamerica, respectively.

With respect to the Mayan subareas, high values of genetic diversity were observed, with higher level in the lowlands (Table 4), but these differences were not significant. However, the values of H BAY in favor of lowlands indicate the importance of this region of Mexico as a center of genetic diversity for the gene pool of cultivated Lima beans, even considering recent reports of genetic erosion of this gene pool. Camacho-Pérez (2009) and Martínez-Castillo et al. (2012) reported that this crop experienced a high degree of genetic erosion from 1979 to 2007 in northeastern Campeche as a result mainly, of the incorporation of small farmers into the market and the introduction of improved varieties of Lima beans into this region. Unfortunately, there are no studies in the Mayan highlands to analyze and understand the factors that generated the lower levels of genetic diversity.

One factor that could explain the differences in genetic diversity among the two subareas is the presence of landraces belonging to two cultigroups, Papa and Sieva in the lowlands compared with landraces only from the Sieva cultigroup in the highlands. Also, gene flow and genetic introgression demonstrated between wild and cultivated Lima bean populations present in the Mayan area could explain these differences (Martínez-Castillo et al. 2007). Dzul-Tejero et al. (2014) analyzed the genetic introgression wild-cultivated in three milpas from the Yucatan Peninsula, finding that the milpa had higher levels of genetic introgression and genetic diversity. In addition, in both studies cited, weedy populations of Lima bean that were present could act as genetic bridges between wild and domesticated gene pools. In the Mayan highlands, however, there are no reports of flow and genetic introgression in Lima bean because wild populations of P. lunatus in the Mayan area grow only at low altitudes (Serrano-Serrano et al. 2010; Andueza-Noh et al. 2013). Certainly, genetic introgression remains an important factor in increasing the genetic diversity of crops (Arnold 1992; Quiros et al. 1992; Altieri and Montecinos 1993; Zizumbo-Villarreal et al. 2005).

Levels of genetic diversity found in the Lima beans grown in the lowlands (Table 4) were higher than those reported by Martínez-Castillo et al. (2008) (%P = 78.9, H BAY = 0.31). Two aspects may explain the differences between their study and ours: (1) we analyzed a more complete geographical representation of germplasm, and (2) we analyzed five individuals per accession independently, while Martínez-Castillo et al. (2008) analyzed five individuals per accession in the form of genetic mixing, which could have resulted in a loss of information. Although the studies differed in the values of genetic diversity, they both point to the Mexican side (Mayan area) of the Yucatan Peninsula as an important center of genetic diversity for this crop.

Implications for conservation

The Mayan area has been identified as an important center of genetic diversity and domestication of Lima beans (Martínez-Castillo et al. 2008; Andueza-Noh et al. 2013); yet, in recent years, the landraces of this region, particularly in the Mayan lowlands, are at high risk of genetic erosion mainly from changes in traditional agriculture and market preferences that have led many farmers to abandon the cultivation of these landraces (Martínez-Castillo et al. 2008, 2012). Analyzing the results of previous studies (Martínez-Castillo et al. 2004, 2008, 2012) with regard to our results, we can see that, despite the recent loss of landraces in the Mayan lowlands, levels of genetic diversity in the Mayan area are still high, highlighting the importance of the Mayan culture in generating and preserving crops with high levels of genetic diversity. On the premise that the reduction in genetic diversity of landraces is a worldwide problem and poses a high risk for the food security of future generations, the high genetic diversity within landraces highlights the need for management strategies that allow us to retain the widest possible set of landraces to ensure the maintenance of this genetic diversity within this important cultural region where the Lima bean is the second most important food legume in the genus Phaseolus.

Although the results generated in this study are of great importance for the conservation of P. lunatus in the Mayan area, more studies are needed to address the problems facing Lima beans in the traditional agricultural system of the Mayan highlands and better implement programs for the ex situ and in situ conservation throughout the Mayan area and avoid the loss of landraces of this species, ensuring future increases in the genetic and biological diversity in modern crops. We thus recommend in situ collections of Lima bean in the Mayan highlands and an ethnobotanical study of the species in the Mayan highlands to assess production dynamics. In this sense, as part of a program of ex situ conservation, the populations collected in this study have been deposited in the gene bank of the Centro de Investigacion Cientifica de Yucatan Mexico (CICY), and a copy was sent to the gene bank of the Sistema Nacional de Recursos Fitogenéticos para la Alimentación y la Agricultura (SINAREFI), to ensure their conservation and sustainable use. Finally, the results revealed that ISSR markers could be used as a reliable method to assess the genetic diversity in Lima bean germplasm and may help to identify or to select groups of genotypes with desirable characters. To generate diversification of landraces of Lima bean in both gene pools analyzed, we recommend that these 46 genotypes be used with efficient hybridization techniques to increase the levels of genetic polymorphism in Lima bean landraces in the Mayan region.