Introduction

The effect of habitat fragmentation on genetic diversity and biodiversity is of increasing concern since more natural habitats have been modified as a result of human activities. Modern agriculture and inappropriate management of natural resources have produced an accelerated and sustained erosion of variability that leads to desertification in many areas of the world, one of the major environmental problems affecting humanity.

Fragmentation, decline or perturbation of environment can lead to genetic changes that can have adverse implications for the conservation of the species. These changes include loss of differentiated populations, alteration of differentiation between populations, loss of variation among members of the same population and changes in the level of inbreeding (Sherwin and Moritz 2000).

Some species of Prosopis genus, mainly those belonging to section Algarobia (locally known as “algarrobos”), have been subjected to unsustainable exploitation to obtain multiple products: wood, charcoal, forage, gum arabic, tannins, mucilage, ethanol and human food (Karlin et al. 1997; Pasiecznik et al. 2001). The accelerated and uncontrolled use of this important natural resource for fuel and timber has led to increasing deforestation and has caused the loss of many elite trees and the destruction of extensive Prosopis forests (Hunziker et al. 1986).

The genus includes approximately 45 species distributed in arid and semiarid regions of the Americas, Africa and West Asia. Argentina is the country with the greatest species diversity, with more than 27 species, 11 of which are endemic (Burkart 1976). The section Algarobia of this genus includes the most important species from an economic and ecological standpoint.

Prosopis chilensis is an important species within this section that exhibits high variability in morphological characters, and it has been suggested that it should be included in reforestation programmes as it is one of the fastest growing species under arid conditions in Argentina (Burkart 1976; Dutton 1989; Karlin et al. 1997). Although Burkart (1976) described three varieties, P. chilensis var. chilensis, P. chilensis var. catamarcana and P. chilensis var. riojana, different evidence lines suggest that these taxa would not be natural groups. According to Roig (1993), the differences in spine, fruit and leaf morphology are not too general, Dalmasso (1993) described six morphotypes in a single population of San Juan (Argentina), Verga (1995) suggested that P. chilensis var. riojana is actually a hybrid between P. chilensis and P. flexuosa and Vázquez-Garcidueñas et al. (2003) based on RAPD markers suggested that P. chilensis var. chilensis is more closely related to P. flexuosa var. flexuosa than to P. chilensis var. riojana.

According to Burkart (1976), the native range of P. chilensis includes southern Peru, Bolivia, Chile and Argentina. However, Galera (1996) states that this species was introduced into Chile at the beginnings of the XIX Century. In Argentina, it is distributed in the political provinces of Cordoba, Catamarca, La Rioja, Salta and San Juan, which correspond to the biogeographical provinces of Chaco, Monte and Prepuna (for a thorough description of biogeographical regions, see Cabrera and Willink 1980).

This species has wide climatic tolerance (Galera 2000) from an absolute maximum temperature of 48°C to an absolute minimum temperature of −7°C. P. chilensis, like most Prosopis species, can grow in many different soils, commonly in sandy soils; it exhibits a great resistance to salinity and tolerates alkaline soils (pH 7.7–8.9) with a high concentration of sodium. As many other legume species, its ability to develop a symbiotic relationship with Rhizobium, which allows N2 fixation, (Aiazzi et al. 1995) is important to resist and improve soils poor in nutrients. For these reasons, afforestation or reforestation with this species is promising to prevent desertification.

According to Verzino et al. (2003), P. chilensis exhibits throughout its distribution in Argentina an important variation for morphological and physiological quantitative traits. This result implies genetic differences associated with adaptation to particular environments, and these authors pointed out the importance of analysing the relation of adaptive genetic variation to particular environmental conditions.

Genetic markers provide a method for assaying levels of genetic variation and population structure in natural populations of forest trees and for measuring population processes of ecological importance and conservation forest management (Rieseberg 1996a; Shea and Furnier 2002; Lee et al. 2002; Renau-Morata et al. 2005; Belletti et al. 2008). Among several methods, random amplified polymorphic DNA (RAPD) has been commonly used because data can be generated relatively quickly and with less labour than other methods for comparative studies in plant collections. Some authors have suggested that the reliability of RAPD and isozyme data should be comparable at the intraspecific level and moderate and high correspondence between results of these markers has been reported (Nybom and Bartish 2000), while other authors state that RAPD provides a better genome random sample than allozymes do (Fristch and Rieseberg 1996; Nybom 2004). Thus, a complementary approach using both isozyme and RAPD markers may be suitable for generating accurate estimates of genetic diversity and population structure. Moreover, these markers have shown significant correlation with climatic factors (Bulinska-Radomska 2000; Lewinsohn et al. 2001; Semagn et al. 2000) and may be useful in predicting sampling strategies for the conservation of genetic resources.

In Prosopis, isozyme markers have allowed the characterization of many species of section Algarobia (Julio 2000; Saidman et al. 2000; Bessega et al. 2005; Ferreyra et al. 2007) while RAPD markers have provided species-specific bands (Juárez-Muñoz et al. 2002; Vázquez-Garcidueñas et al. 2003; Ferreyra et al. 2004; Landeras et al. 2005). Nevertheless, studies using both isozymes and RAPDs are scarce and have included few populations of each species (Bessega et al. 2000a; Ferreyra 2001; Ferreyra et al. 2007), and the studies so far conducted in species of section Algarobia have failed to show genetic differentiation associated with environmental variables (Julio 2000; Saidman et al. 2000; Ferreyra et al. 2007).

In this study, we analysed the variability of RAPD markers in nine Argentine populations of P. chilensis representative of the three biogeographical provinces: Chaco, Prepuna and Monte. RAPD data were compared with previous isozyme studies (Julio 2000), and the joint information from both markers was used to (1) explore the organization of genetic variability at different hierarchical levels (within populations, among populations within regions and among regions) and (2) investigate the association of genetic variation with climatic and geographic variables throughout the main distribution area of this species in Argentina, in order to apply this information to developing forest conservation strategies.

Methods

Populations sampled

The seeds analysed in this study were provided by the Banco Nacional de Germoplasma de Prosopis (NGBP) of the Facultad de Ciencias Agropecuarias, Universidad de Córdoba (Verzino 2005). Plants collected were taxonomically identified by Joseau and Castro Schule (2005) following the morphological criteria described in Burkart (1976). From each sampled tree, a voucher specimen was obtained and kept in the herbarium of the NGBP. According to R. Palacios, all material used in the present study belong to the same variety (P. chilensis v. chilensis). Seeds for isozyme and RAPD analyses were obtained from the same mother plants.

The present study involved nine sample sites representative of the range of P. chilensis in Argentina (Table 1; Fig. 1). Four of them belong to the biogeographical province of Chaco: Villa Dolores (DOL), La Higuerita (LHI), Soto (SOT) and Conlara (CON). Three populations are situated in the Monte biogeographical province: Patquía (PAT), Talampaya (TAL) and Astica (AST). The remaining two samples correspond to Prepuna biogeographical province: Las Talas (LTA) and Belén (BEL) (see Cabrera and Willink 1980).

Table 1 Geographic and climatic information of sampled populations of P. chilensis
Fig. 1
figure 1

Map of Argentina indicating the nine sample sites representative of the distribution range of Prosopis chilensis in Argentina. The corresponding population acronyms are indicated in Table 1

As P. chilensis predominantly outcrosses like most species of section Algarobia (Bessega et al. 2000b), most genetic variability is captured with seeds collected from a relatively small number of mother plants (Saidman and Vilardi 1993). In fact, studies in related species of Prosopis with mating systems similar to that of P. chilensis (Saidman et al. 2000; Ferreyra et al. 2004; Ferreyra et al. 2007) showed that more than 85% of the genetic diversity detected was found within population suggesting that the sampling error by applying this strategy is minimal. According to this, the seeds sampled (450 for isozyme and 141 for RAPD analysis) were obtained from seven to nine trees per population. Within each site, the sampled trees were separated at least 100 m from each other in order to reduce the probability of duplication in the samples.

Isoenzyme technique

Six isoenzymatic systems were previously analysed by Julio (2000) in populations of P. chilensis: (EST), (Enzyme Commission) E.C. 3.1.1.1; glutamate oxalacetate transaminase (GOT), E.C. 2.6.1.1; leucilamino peptidase (AMP), E.C. 3.4.1.1.1; alcohol dehydrogenase (ADH), E.C. 1.1.1.1; superoxide dismutase (SOD), E.C. 1.15.1.1; peroxidase (PRX), E.C. 1.11.1.7 using horizontal electrophoresis. A total of 450 individuals (seeds) were analysed, involving 50 seeds per system per population. The information of buffers, electrophoresis conditions and staining methods is given in Online Resource 1. Homogenates were obtained from cotyledons of 7-day-old seedlings for all systems but ADH, for which whole seeds soaked for 24 h were used. Seedlings were obtained under the same conditions of lighting, humidity and temperature (Saidman 1985) in order to assure reproducibility of isozyme banding patterns. Electrophoretic phenotypes of isozymes are available in Online Resource 2 and 3.

Band patterns in P. chilensis were homologous to those observed in related species of section Algarobia (Saidman 1985, Saidman et al. 1997; Bessega et al. 2000a; Ferreyra 2001).

RAPD technique

Genomic DNA was extracted from cotyledons of 8–13-day-old seedlings using the method of Dellaporta et al. (1983). DNA concentration was estimated by comparing electrophoretic patterns on 0.8% agarose (in 1× TAE buffer) gels with standard DNA marker sets (phage λ double-digested with EcoRI and HindIII).

The PCR involving arbitrary primers (Promega) was carried out in a thermal cycler (GeneE, Cambridge, Techne Ltd) that was programmed for one cycle of 94°C for 6 min, 45 cycles of 94°C for 1 min, 36°C for 1 min and 72°C for 2 min followed by a final extension step of 72°C for 6 min. PCR was carried out in 50 μl final volume using 40–60 ng of genomic DNA, 5 μl dNTP mix (1 mM each dNTP), 10 μl of primer (3 ng per μl), 10 μl Taq DNA polymerase buffer (10 mM Tris–HCl, pH 9 at 25°C, 50 mM KCl, 0.1% Triton X-100), 6 μl MgCl2 (25 mM) and 0.5 units of Taq DNA polymerase (Promega). These conditions were maintained in all experiments in order to avoid reproducibility and scoring problems. The amplification products were resolved on 1.4% agarose gels and stained with EtBr. Fragments were observed and photographed on a UV transilluminator (312 nm). The usual cautions needed to prevent contamination of PCR experiments with previously amplified fragments were observed. Reliability of PCR products was tested by several controls that were routinely used, one without primer, a second one with no Taq DNA polymerase and the third one with no genomic DNA. No amplification occurred in any of these negative controls. Re-amplification was performed routinely to ensure reproducibility of banding patterns. Each individual DNA sample was amplified three times and seeded in three different gels, and the same pattern was obtained. In each gel, we analysed individuals from different populations in order to avoid bias in the comparisons among populations attributable to experimental error.

Twenty primers were tested, and five of them were selected on the basis of reproducibility of bands retrieved and their ability to show polymorphisms. The selected primers were B01 (5′-TCG AAG TCC T-3′), B07 (5′-AGA TCG AGC C-3′), B10 (5′-CAG GCA CTA G-3′), A09 (5′-CTA ATG CCG T-3′) and A06 (5′-GAG TCT CAG G-3′). A total of 141 individuals (seeds) were analysed, involving 14–20 per population, with the exception of La Higuerita where the sample was considerable lower (8 individuals) (Table 4).

Homology of RAPD bands

The use of RAPDs for comparative purposes relies on the assumption that similarity of fragment size is a dependable indicator of homology (Rieseberg 1996b). To test the validity of this assumption, homology of the polymorphic RAPD DNA fragments (A09-3 and A09-5) between different populations of P. chilensis was determined and further investigated by DNA sequence alignment. These bands derived from the primer A09 (5′-CTA ATG CCG T-3′) were chosen because they showed association with environmental variables. We cloned and sequenced the bands taken from random selected individuals of different populations. A09-3 bands were sequenced in individuals from TAL and DOL, whereas A09-5 bands were sequenced in individuals from PAT, TAL, DOL and SOT. The bands were excised from 1, 5% agarose gels with a sterile gel slicer, and the DNA was purified using the QIAEX II Gel Extraction Kit (QIAGEN, USA). The eluted fragments were ligated into pGEM T Easy Vector (Promega, USA) following the supplier’s instructions. The recombinant plasmid DNA was used to transform Escherichia coli (DH5a) competent cells. Plasmid DNA was purified from five independent transformed clones as described by Sambrook et al. (1989). The size of cloned DNA inserts was checked by digestion with EcoRI (New England Biolabs). The complete sequences of the cloned fragments were obtained using an ABI Prism 3100 automated DNA sequencer (Unidad de Genómica, Inst. Biotechnology, INTA Castelar, Argentina).

Assembly and editing of sequences was performed with BioEdit version 5.0.9 (Hall 1999). Alignments were performed using CLUSTALW (Thompson et al. 1997).

Genetic variability

Genetic variability and structure parameters of nine populations of Prosopis chilensis were studied from RAPD markers. Binary matrix of band presence/absence was converted into allelic frequencies by a Bayesian method (Zhivotovsky 1999) with non-uniform prior distribution using the software AFLP-SURV 1.0 (Vekemans et al. 2002). Genetic variability was quantified by the unbiased expected heterozygosity (H e) (Nei 1978), percentage of polymorphic loci (P) and mean number of alleles per locus (A).

Expected heterozygosities were compared among populations by means of the asymptotic Friedman test, taking populations and loci as factors. Variability was also compared among regions. In this case, as the model was unbalanced (different numbers of populations between regions), we applied the K-sample permutation test. Pair-wise comparisons of H e between populations were conducted by Wilcoxon test. The correlation among variability from different markers was evaluated by Spearman′s rank correlation test. These non-parametric methods were chosen because they do not involve any assumptions regarding distribution of variability estimates. They were performed using the packages coin (Hothorn et al. 2007) and exactRankTests (Hothorn and Hornik 2007) of the program R (versión 2.9) (R Development Core Team 2009).

Gametic disequilibrium

Linkage between RAPDs loci was estimated by Otha’s (1982) method using the program RAPDLD from the RAPDs package (Black 1996).

Hierarchical analysis of genetic variability

The distribution of RAPD genetic variability at different hierarchical levels was studied by analysis of molecular variance (AMOVA) (Excoffier et al. 1992). Also, non-hierarchical Φ ST was estimated to evaluate overall genetic differentiation. These analyses were conducted using the software GeneticStudio (Dyer 2005–2007).

Clustering methods and linear discriminant analysis

Genetic differentiation between populations was measured by pair-wise Wright’s Φ ST obtained from the RAPD dataset using the software AFLP-SURV 1.0 (Vekemans et al. 2002).

From the genetic distance matrix, a phenogram was obtained by applying the unweighted pair grouping method based on arithmetical means (UPGMA) using the program R. The robustness of the phenogram branching was evaluated by the analysis of consensus of 100 trees obtained from 100 bootstrapped pseudoreplicates of the original data matrix.

Bootstrapping was performed by AFLP-SURV 1.0 (Vekemans et al. 2002), and the consensus analysis was made with PHYLIP program (Felsestein 1993).

The structures of the genetic distance matrices and trees obtained from RAPD loci were compared with those obtained by Julio (2000) from isozymes using Mantel test. The consistence of isozyme and RAPD-based phenograms was evaluated by comparing the corresponding cophenetic matrices by the same statistical method. To conduct Mantel tests, we used the package ape (Paradis et al. 2005–2007) of program R.

The relationships between populations retrieved in the matrix of pair-wise Φ ST were represented graphically by Kruskal’s non-metric multidimensional scaling (MDS) to determine whether populations are grouped according to their corresponding regions.

Linear discriminant analysis (LDA) was conducted, trying to determine how best to separate known groups of individuals and predict to which predefined class an individual belongs. LDA and MDS were made with the package MASS of program R.

Climatic and geographic information

The climatic and geographic variables were obtained using the New locClim program (ver. 1.10) (Grieser 2006). These variables were the following: average minimum temperature, mean annual temperature, average maximum temperature, evapotranspiration potential, mean number of rainy days, sunshine fraction, sunshine hours, mean annual rainfall per month, aridity, effective rain, dryness, latitude, longitude and altitude. From this information, standardized Euclidean pair-wise climatic distances were estimated between populations using the program R.

Isolation by distance

The hypothesis that the genetic differentiation between populations is due to isolation by distance was tested comparing geographic and genetic distance matrices by Mantel test. Distances between sample sites were estimated using the measuring tool of Google Earth ver. 4.1 (http://earth.google.com).

Correlation of genetic variables with climatic and geographic variables

The association of band frequencies with environmental variables cannot be properly analysed with simple regression of band frequencies on environmental variables because this method does not take into account that the environmental and geographical variables are not independent. Taking into account this issue, for each locus, a stepwise multiple regression analysis (backward and forward) was applied to determine direct effects of environmental variables on allelic frequencies. The procedure applied was a generalized linear model defined by the following equation:

$$ y = \mu + \beta_{1} x_{1} + \beta_{2} x_{2} + \cdots + \beta_{n} x_{n} $$

where y is response variable (band presence or absence), μ is the overall mean, the explanatory variables x i are the climatic or geographic records corresponding to each population and β i are the partial regression coefficients for each explanatory variable.

As in this case we deal with proportion data (band frequencies), we considered three important properties that affect the way the data should be analysed: (1) the data are strictly bounded (between 0 and 1), (2) the variance is non-constant and (3) errors are non-normal. With proportion data, if band frequency is 0, then there will be no band in repeated trials (sampled individuals), all the data will be zeros and hence the variance will be zero. Likewise, if band frequency is 1, then all sampled individuals will show the band, and again the variance will be 0. For proportion data, therefore, the variance increases with the mean up to a maximum (when band frequency is one half) then declines again towards zero as the mean approaches 1. The variance–mean relationship is humped, rather than constant as assumed in the classical tests. The final assumption is that the errors (the differences between the data and the fitted values estimated by the model) are normally distributed. This cannot be so in proportional data because the data are bounded above (1) and below (0). All these issues (boundedness, non-constant variance and non-normal errors) are dealt with by using a generalized linear model with a binomial error structure. To take into account sample sizes with binomial errors, we must give the number of “failures” (band absence) as well as the numbers of “successes” (band presence) in a two-vector response variable. To do this, we bind together two vectors into a single object, “y (the response variable), comprising the numbers of successes and the number of failures. The binomial denominator, “n , is the total sample. The procedure we applied carries out weighted regression, using the individual sample sizes as weights, and the logit link function to ensure linearity (see Crawley 2007). The method applies the Akaike (1974) information criterion (AIC) to select predictor variables to be included in the model. The regression analysis was conducted with the package stats of the program R (versión 2.9) (R Development Core Team 2009).

Results

RAPD analysis

With the five (out of 20) primers selected, we detected 46 reproducible and unambiguously scored bands. RAPD patterns obtained with the primers A01, A06, A09, B01 and B10 are given in Online Resource 4 and 5. The molecular weight of the bands ranged from 300 to 1,600 bp. All of them were polymorphic (sensu Ford 1940 and see Ridley 1993 p. 69, 639) showing variation within at least one population (data not shown). Allelic frequencies of RAPD loci in the studied populations of Prosopis chilensis are given in Online Resource 6. After applying Otha′s (1982) method, only 2 out of 1035 comparisons (0.19%) suggested epistasis disequilibrium between RAPD markers.

Analysis of band homology

The sequences of bands A09-3 and A09-5 were deposited in Gene Bank, with the information on accession numbers and molecular weights reported in Table 2. The comparison of the DNA sequences of band A09-5 in individuals from different populations (Table 3) indicated a high sequence homology ranging from 91.0 to 97.4%. The two sequences analysed of band A09-3 in one individual from DOL and one from TAL exhibit a homology of 99.7%. These results were consistent with the hypothesis of homology of comigrating bands observed in individuals from different populations.

Table 2 Molecular characterization of some RAPD fragments generated by primer A09 (5′-CTA ATG CCG T-3′) and their GenBank Accession numbers
Table 3 Matrix of DNA sequence identity between A09-05 RAPD bands of individuals from different populations

Genetic variability

The average number of alleles per locus showed low variation among populations ranging from 1.4 to 1.7 (Table 4). The percentage of polymorphic loci exhibited a large range (from 37 in LHI to 71% in LTA) (Table 4) indicating considerable differences in variability among populations.

Table 4 Measures of genetic variability estimated from RAPD loci

He estimated from RAPD (Table 4) showed significant differences among populations according to Friedman test (χ2 = 18.88, DF = 8, P = 0.01). Out of 36 pair-wise comparisons between populations, only five yielded significant or highly significant results when compared by Wilcoxon test (Table 5). According to the K-sample permutation test, He estimates between regions differed significantly (T = 2.85, P = 0.012).

Table 5 Pair-wise comparisons of RAPD based on H e values that showed significant differences among populations according to Wilcoxon test

Julio (2000) on the basis on 6 isoenzymatic systems detected a total of 17 loci, 12 of them variable. Isozyme patterns of ADH, PRX, EST, AMP, GOT and SOD are given in Online Resources 2 and 3. The variability estimates from these loci in the same populations studied here were not significantly correlated with those based on RAPD according to Spearman′s rank correlation test (Z = −0.33, P = 0.74).

Hierarchical and non-hierarchical genetic structure analysis

The analysis of the distribution of the genetic variability of RAPD loci by AMOVA (Excoffier et al. 1992) indicated that total differentiation among populations was highly significant (non-hierarchical Φ ST = 0.113, P = 0.001). The variability among populations (Φ ST) was partitioned into the differentiation among populations within regions (Φ SR) and among regions (Φ RT). The highest variance component was observed within populations (Table 6). The results, expressed in terms of gene diversity (Nei 1978), showed that most (88.1%) of the total diversity (1 − Φ ST) occurs within populations, the component among populations within regions (9.3%) (Φ STΦ RT) was intermediate, while the between-region component (Φ RT) was the lowest (2.6%) (Table 6). All three variance components were highly significant.

Table 6 Hierarchical analysis of genetic variability distribution estimated by AMOVA

Genetic differentiation between populations was also analysed by Kruskal’s non-metric multidimensional scaling from the matrix of pair-wise Φ ST values. The results of this analysis (Fig. 2) were consistent with the highly significant among-region differentiation indicated by the AMOVA. Axis 1 separates Prepuna (positive values) from Monte populations (negative values). Axis 2 separates Chaco populations (positive values) from the remaining populations (negative values).

Fig. 2
figure 2

Multidimensional scaling (MDS) obtained from the pair-wise Φ ST matrix among nine populations of P. chilensis. Elliptic drawings group populations belonging to the same biogeographical regions

Phenetic relationships

The structure of the genetic distance matrices obtained from isozyme (Julio 2000) and RAPD data was not correlated with each other according to Mantel test (r = 0.103, Z = 0.367, P = 0.34, based on 1000 permutations). The RAPD-based phenogram (Fig. 3) showed good cophenetic correlation respect to the corresponding distance matrix (r = 0.77, Z = 0.269, P = 0 based on 1000 permutations). Phenogram nodes showed relatively low (lower than 50%) bootstrap support. Populations from Prepuna province (BEL and LTA) are clustered together. PAT is not grouped with the other two populations (AST and TAL) from the Monte province. Genetic distances between populations from the Chaco (LHI, DOL, SOT and CON) are higher than those between populations within the other biogeographical provinces.

Fig. 3
figure 3

UPGMA phenogram based on Nei′s genetic distances from RAPD data showing relationships among nine populations of P. chilensis. Bootstrap support values are indicated over nodes

Linear discriminant analysis

The linear discriminant analysis succeeded in differentiating individuals from different populations on the basis of presence or absence of RAPD band. The consistency between prior and post classifications was 100%. The first three linear discriminant axes (LD1-LD3) explained about 77% of total variation, and the tridimensional dispersion plot according to these axes shows most of the individuals in each population grouped (Fig. 4).

Fig. 4
figure 4

Plot of canonical linear discriminant functions (LD1, LD2 and LD3) based on RAPD loci of nine populations of P. chilensis

Climatic, geographic and genetic distance

The genetic (Φ ST) distance matrix obtained from RAPD was compared with the matrix of geographical distances in order to determine whether they fit to the expectations according to the isolation by distance model. Mantel test (based on 1,000 permutations) indicated that genetic distances were not significantly correlated with geographical distances (r = 0.148, Z = 852.87, P = 0.24).

Band frequencies and geographic and climatic variables

All 12 variable isozyme loci showed significant or highly significant association with at least one geographic/climatic variable according to the stepwise multiple regression analysis (Table 7). We conducted a similar analysis for RAPD data. Out of 46 RAPD loci, 26 showed significant or highly significant association with at least one climatic or geographic variable (Table 8).

Table 7 Stepwise regression coefficients of isozyme allelic frequencies on climatic and geographic variables
Table 8 Stepwise regression coefficients of RAPD band frequencies on climatic and geographic variables

Discussion

Despite the ecological importance of Prosopis chilensis and its potential use for reforestation programmes in arid soils, little is known about the distribution of its genetic variability and population structure. This information is essential for establishing programmes of conservation of genetic resources.

Biochemical and molecular markers have been used to study the diversity within and among related forest species and to assist taxonomists in species determination. Among several efficient methods, allozymes have been the most widely used genetic marker in the study of the differentiation and variation at the population and species levels (Crawford 1997). Polymorphic DNA (RAPDs) has also been used as a genetic marker in forest plants (Lee et al. 2002; Shea and Furnier 2002; Renau-Morata et al. 2005). This marker has found the widest application in systematic studies (Bussell et al. 2005) and particularly in the quantification of genetic variability in closely related species (Crawford 1997; Nybom and Bartish 2000; Nybom 2004). The advantages and limitations of co-dominant versus dominant markers have been largely discussed. Although as a general rule highly polymorphic co-dominant loci offer the highest precision for most purposes, tens to hundreds of dominant polymorphic loci are commonly reported (Hardy 2003), which offers opportunities to make inferences at much finer scales of resolution (Ritland 2005). Furthermore, in comparison to co-dominant markers, dominant markers can be developed relatively easily even for species for which no prior genetic information is available and at a relatively low cost (Belletti et al. 2008).

In reference to genus Prosopis, Bessega et al. (2000a) obtained similar estimates of gene flow between populations of the North American species P. velutina and P. glandulosa from isozymes and RAPDs; however, two isoenzyme systems produced diagnostic loci, whereas no diagnostic bands were obtained from RAPDs useful to distinguish these species. Ferreyra et al. (2007) applied both techniques to analyse variability and differentiation among five Argentinean species of Prosopis and natural hybrids closely related to P. chilensis, obtaining similar conclusions from both methods in reference to population structure and phenetic relationships among species. Moreover, Ferreyra et al. (2004) found diagnostic RAPD band combinations useful to distinguish five species of section Algarobia of Prosopis, which in isoenzymatic analysis (Ferreyra et al. 2007) showed only small differences in allele frequencies.

In this paper, the ranges of the different genetic variability measures (A, P and H e) obtained from RAPD were similar to those observed in other related species of section Algarobia (Ferreyra et al. 2004). However, the variability estimates based on RAPD and isoenzymes were not correlated with each other. These results are explained because RAPD-based variability estimates showed significant differences among populations, whereas isozyme-based ones did not. A possible explanation for this different behaviour is that isozyme systems are more or less randomly chosen without taking into account the variability detected. By contrast, the primers for PCR are usually selected to obtain as many informative bands as possible. According to our results, RAPD-based heterozygosity estimates varied significantly among regions, suggesting that genetic variability is associated with some environmental factors.

The analysis of genetic structure from RAPD loci included populations that are representative of almost the entire range of this species in Argentina. The differentiation among populations was highly significant. Although the AMOVA showed that most variability occurs within populations, the diversity among regions and among populations within regions was highly significant. The high variability within populations may be explained by the open pollinated system with a preponderance of an outcrossing mating system as observed in the species so far studied of section Algarobia (Bessega et al. 2000b). The highly significant variability among populations is consistent with the relatively higher differentiation observed for insect-pollinated and animal-dispersed tree species in comparison with wind-pollinated and wind-dispersed trees (Hamrick et al. 1992; Jordano and Godoy 2000; Hilfiker et al. 2004). It is interesting that the differentiation among populations and regions persists in spite of the highly interactive relationship between humans and algarrobos, which probably inadvertently changed the original genetics of population structure by stimulating the mixing of populations: People returning to the trees with sweetest pods, which were later defecated and germinated, or allowing erect thornless trees to grow near their homes while eliminating spiny ones with poor growth. Also, the introduction of livestock by Europeans contributed to the mixing of populations as animals were moved between distant regions (D’ Antoni and Solbrig 1977). The finding of significant variation between regions should be considered in programmes of genetic resources conservation, and this implies that populations from all biogeographical provinces should be maintained to preserve the full genetic variation across the species range.

The structure of the genetic distance matrices obtained from isozyme and RAPD data was not correlated according to Mantel test. These results can be explained by the fact that RAPD markers provide information of a higher number of polymorphic loci and probably represent a more accurate picture of the real distances between populations.

The RAPD phenogram representative of genetic distances (quantified as pair-wise Φ ST) between populations did not agree with the expected one according to geographical distances. Moreover, the multidimensional scaling analysis indicated that populations tend to be grouped according to their corresponding biogeographical province. These results might be interpreted as evidence that the ecological differentiation among biogeographical provinces is reflected in allele frequency variation.

Earlier studies in this species (Verzino et al. 2003) reported a weak association between some macroenvironmental variables and quantitative physiological traits and pointed out the importance of the knowledge of adaptive genetic variation in relation to particular environmental conditions. In contrast, in this study, a deeper insight was obtained by the regression analysis of allele/band frequencies on environmental variables. Twenty-six out of 46 RAPD loci and all 12 isozyme loci showed dependence on climatic or geographic variables. The isozyme systems analysed in this study were previously applied to species belonging to Algarobia section by our group (Saidman 1985, 1990; Saidman et al. 1997; Bessega et al. 2000a; Ferreyra 2001; Ferreyra et al. 2007) and by other authors (Verga 1995). The isozyme banding patterns were consistent, and homology was proved even in different species. Saidman (1990) analysed hybrid segregation between P. caldenia and P. flexuosa and proved homology between isozyme loci of both species. Respect to RAPDs, in this paper, we analysed the homology of some bands that showed highly significant association with several geographic or climatic variables by DNA sequencing. The high homology observed between individuals from different populations reinforces our results. The variables that affected gene frequencies were associated with rainfall, dryness, sun incidence, temperature, altitude, latitude and longitude. The results obtained in the present study give support to the hypothesis that the morphological and physiological differences observed among populations have a genetic basis. Moreover, the discriminant analysis indicated that the individuals can be assigned unequivocally to their corresponding population on the basis of their RAPD pattern. This result together with the highly significant non-hierarchical Φ ST suggests that the gene flow between populations is relatively low, allowing adaptive local differentiation.

According to the neutral theory (Kimura 1983), most variation of molecular markers is caused by random drift and mutant genes selectively equivalent. RAPD and isozymes are usually considered neutral markers. However, several studies suggest that their variation may have adaptive significance (Nevo et al. 1988; Matrajt et al. 1996; Confalonieri et al. 1998; Semagn et al. 2000; Lewinsohn et al. 2001). Furthermore, it has been argued that the environmental heterogeneity is a major factor in maintaining and structuring the genetic variation in natural populations (Bulinska-Radomska 2000) and that the amount of variation may be regarded as an adaptive strategy for increasing population fitness in an heterogeneous environment (Nevo et al. 1982; Baum et al. 1997; Confalonieri et al. 1998; Lewinsohn et al. 2001; Fahima et al. 2002).

In the present paper, variability estimates (H e) differ among biogeographical provinces, and as it was previously stated, allele frequencies of more than half of RAPD loci and all polymorphic isozyme loci are affected by at least one climatic/geographic variable. These results imply that genetic distances between populations are better explained by biogeographical grounds than by physical distances.

The lack of previous molecular support to the morphological ecotypes described for P. chilensis might be explained by the fact that within-population variation explains nearly 90% of total diversity. However, a wider sampling of genetic markers coupled with environmental information might be a useful tool to identify selective variation. We hypothesize that the correlation of isozyme and RAPD loci with climatic and geographic variables reported in this paper as well as the highly significant differentiation among regions is at least partially associated with gametic disequilibrium between the genetic markers here analysed and loci related to local adaptation. These results provide some orientation for the genetic management of this important species.

Conclusions

RAPD and Isozyme markers proved to be a powerful tool to analyse population structure and to assess the importance of environmental and biogeographical factors in the distribution of genetic variability of Prosopis chilensis in Argentina. We detected significant differences of variability estimates among biogeographical provinces. The analysis of population structure indicated that most of genetic diversity (88%) occurs within populations, the diversity among populations within each province is intermediate (9%) and the lowest component corresponds to diversity among provinces diversity (3%). All three components were highly significant. The high variability within populations is consistent with the mostly outcrossing mating system found in this species, while the high differentiation among populations is explained by the insect-pollinated and animal seed dispersion systems. This among population differentiation persists despite anthropogenic interactions that probably increased the dispersion rate. The phenogram and MDS plots did not agree with geographical distances and were best explained on the grounds of biogeographical information. All polymorphic isozyme loci and more than half (26/46) of RAPD band are significantly dependent on at least one climatic or geographic variable. These results allowed us to hypothesize the occurrence of gametic disequilibrium between the genetic markers here analysed and loci responsible for local adaptation. The results of our study contribute to a better understanding of the distribution of genetic variability and population structure and provide orientation for the genetic management and preservation of this important species. The significant variation observed between regions has implications for conservation purposes because populations from all biogeographical provinces should be maintained to preserve the full genetic variation across the species range.