Introduction

The Argan (Argania spinosa L., Sapotaceae) is a thorny evergreen tree endemic of arid and semi-arid regions in Morocco. In 1998, the “UNESCO’s Man and the Biosphere Program” declared it as a Biosphere Reserve. This tree covers currently an area of 900,000 ha (HCEFLCD 2012) and stretches on the southwest region in addition to three small populations in the North (Oued Grou) in Northeast (Benié snassen) and in the South (Goulmime) (Ehrig 1974; Prendergast and Walker 1992).

Argan trees protect the soil from erosion and help fighting desertification. The tree wood is very hard, it is used for heating and construction materials. The main product is the oil extracted from its seeds providing income for rural population. Argan oil is rich in unsaturated fatty acids (Chimi et al. 1994) and built a worldwide reputation for its cosmetic and nutritional uses. Despite its numerous properties, cultivation of argan is rare and suffers from overexploitation. The assessment of the genetic variation for species located in narrow geographic areas is critically important for designing conservation strategies (Vicente et al. 2011). Therefore, the accurate analysis of genetic structure and the understanding of distribution patterns of genetic diversity is necessary to preserve the evolutionary potential of species to cope with environmental changes which is one of the main conservation goals (Milligan et al. 1994; Reisch et al. 2003).

In recent years, different molecular techniques have been used to study genetic diversity of argan tree including RFLP of chloroplast DNA (El Mousadik and Petit 1996), SSR (Majourhat et al. 2008), ISSR (Yatrib et al. 2015) and AFLP (Pakhrou et al. 2016). However, all these studies have not taken into consideration global distribution of argan tree in Morocco and most studies have involved a limited number of samples. In this study, the set of samples selected provides global representation of the natural distribution area of argan tree. In order to obtain a reliable estimate of genetic diversity assessment, two molecular markers ISSR and IRAP were used. The inter-retrotransposon amplified polymorphism (IRAP) is a technique developed by Kalendar et al. (1999) for the amplification of intervening region between two retrotransposons sequences. This method requires a single polymerase chain reaction (PCR) and uses two LTR primers (Kalendar and Schulman 2006). Because of the high copy number of this molecular marker throughout the genome (Kumar and Bennetzen 1999) and the change caused by their insertion, this method has been applied in several genetic diversity studies of plant species including Quercus brantii (Alikhani et al. 2014), Quercus libani (Khadivi-Khub et al. 2015), Pinus massoniana (Fan et al. 2013) and Pistacia (Sorkheh et al. 2016). To our knowledge, no IRAP based study has been conducted on argan tree. The inter-simple sequence repeat (ISSR) is one of the most informative markers able to detect polymorphism with a limited budget and in a short time (Gupta et al. 1994; Wu et al. 1994; Zietkiewicz et al. 1994). Also, any previous knowledge of DNA sequence is not required for the primer design (Godwin et al. 1997). The present work was carried out to provide new molecular evidence and baseline information about the overall genetic variability of the natural areas of argan tree using two molecular markers ISSR and IRAP. In this context, we aim to investigate the distribution structure and genetic variability within and among the populations of argan tree considering its natural distribution in Moroccan areas. The obtained information will contribute to the foundation of the best conservation and management programs.

Materials and methods

Plant materials

A total of 240 individuals of A. spinosa L. were collected from 24 populations of different regions in Morocco. The sampled individuals were selected to provide global representation of the distribution area of argan. All the information and the geographical indicators of sampling sites are presented in Table 1. Fresh foliage samples were harvested from adult trees and stored at −80 °C for genomic DNA extraction.

Table 1 Locations and sample size of the 24 populations of A. spinosa L

DNA extraction

Isolation of genomic DNA (gDNA) was done according to the ISOLATE II Plant DNA Kit. The extracted total DNA was checked by electrophoresis on 1% agarose gel and the concentration was spectrophotometrically estimated (NanoDrop Technologies Inc., USA).

IRAP analysis

A total of 28 IRAP primer combinations were tested (Alavi-Kia et al. 2008). Initially, one individual from each population was used to perform a preliminary screening. Those producing a very low proportion of polymorphic markers were excluded. Finally, 4 primer combinations that provided reproducible and describable bands were chosen for final evaluation. The amplification reaction was performed according to the Alavi-Kia et al. (2008) protocol. The PCR conditions were optimized and the final PCR reaction mixture (15 μl) included 20 ng of template DNA, 1 × PCR buffer, 2 mM MgCl2, 0.2 mM dNTPs, 0.3 μM of primer and 1 U of Taq polymerase. The cycling program consisted of an initial denaturation at 94 °C for 4 min, followed by 35 cycles at 94 °C for 1 min, annealing temperature Ta (Table 2) for 1 min and 72 °C extension for 2 min with a final elongation step at 72 °C for 7 min. The PCR amplification products were separated by electrophoresis using 2% agarose gel (UltraPure™ Agarose, Invitrogen) in 1 × TAE. Gels were visualized under UV light after being stained with ethidium bromide.

Table 2 Degree of polymorphism and information content for ISSR primers and IRAP primer combinations

ISSR analysis

The ISSR analysis was carried out according to the modified protocol described by Zietkiewicz et al. (1994). A total of 7 ISSR primers selected by Yatrib et al. (2015) were used in this study. The final PCR reaction was performed in 25 μl containing 1X PCR buffer, 1.5 mM of MgCl2, 1 U/μl Taq DNA polymerase, 1.5 mM of dNTPs, 1.5 mM ISSR primer and 75 ng of total genomic DNA. PCR amplification was performed in GenAmp® thermal cycler (Applied Biosystem, CA, USA) following the reaction program: 94 °C for 30 s; 45 cycles of: 94 °C for 30 s; annealing temperature (Table 2) for 45 s; 72 °C for 2 min; and 72 °C for 7 min after the last cycle. The DNA amplification products of ISSR markers were separated by electrophoresis in 2.8% agarose gels.

Data analysis

The clear and visible amplified bands for ISSR and IRAP were considered as potential polymorphic markers and scored as 1 for presence and 0 for absence to generate a binary data matrix for each primer. The efficiency of each primer was evaluated by calculating the total number of polymorphic bands, the polymorphism information content (PIC) following Roldán-Ruiz et al. (2000) and the marker index (MI) according to Powell et al. (1996). The ability to distinguish ISSR and IRAP markers between the individuals was determined by calculating resolving power (Rp), using formula by Prevost and Wilkinson (1999). The different parameters of genetic diversity and the structure analysis were performed using the combined data of ISSR and IRAP. Nei’s genetic diversity index (H), effective number of alleles (Ne) and Shannon diversity index (I) were calculated using GenALEx 6.5 (Peakall and Smouse 2006, 2012). The coefficient of genetic differentiation (Gst), gene flow (Nm = 0.5(1 − Gst)/Gst) and total genetic diversity (Ht) were estimated for total population using POPGENE software version 1.32 (Peakall and Smouse 2006; Yeh et al. 1999). The variation among the four regions, among and within populations were analyzed based on the molecular variation (AMOVA) using GenAlEx version 6.5 (Peakall and Smouse 2006). In order, to test the correlation between geographical and genetic distances among populations, a Mantel test was carried out with 999 permutations. The genetic similarities according to Jaccard’s coefficient of 240 individuals of A. spinosa L. were calculated using the SMIQUAL program of the analysis system NTSYS-pc (Numerical Taxonomy System, version 2.11) (Rohlf 2000). The resulting matrices were used to construct the dendrogram through unweighted pair-group method with arithmetic mean (UPGMA) using SAHN module as recommended by most studies (Alikhani et al. 2014; Lind and Gailing 2013). The principal coordinate analysis (PCoA) was conducted on a covariance matrix using GenALex6.5 to describe the relationship between populations. The population structure was determined for 240 genotypes of argan tree. The molecular data were processed using the STRUCTURE software based on the Bayesian estimate which assigns individuals to a number K of genetically homogeneous groups without a priori group designation. For the analysis with the program STRUCTURE, the length of burn-in and MCMC (Monte Carlo Markov Chain) were set to 10,000 for each K based on the work of Evanno et al. (2005), suggesting that this level is sufficient. Twenty independent replications (runs) were performed for each simulated value of K, varying from 2 to 20. Subsequently, the likely number of populations was calculated based on the highest log value of the likelihood (LnP(K)) using STRUCTURE Harvester (Earl and vonHoldt 2011) based on the method developed by Evanno et al. (2005).

Results

Level of polymorphism and informativeness obtained with ISSR and IRAP

Four IRAP primer combinations and seven ISSR primers were selected to amplify a total of 240 DNA samples of argan tree, the level of polymorphism and informativeness are presented in Table 2. The 4 IRAP primer combinations produced a total of 164 polymorphic amplicons with a range of 4 to 18 bands per primer combinations. The total number of fragment ranged from maximum 37 (LTR6150/LTR6149) to minimum 33 (5′LTR1/Sukkula). Also, the total number of amplified fragments by ISSR was 248 polymorphic bands that could be unambiguously scored. Primer ISSR 6 generated the largest number of fragment (53) while the primers ISSR 3 generated the smallest number of fragment (24). Furthermore, polymorphic information content (PIC) values of the four IRAP primer combinations ranged from maximum 0.31 (3′LTR/Sukkula) to minimum 0.22 (Sukkula/Sukkula) while marker index (MI) varied from 8.88 (Sukkula/Sukkula) to 12.58 (LTR6150/LTR6149) with an average of 10.81. The values of PIC of the 7 ISSR primers ranged between 0.21 (ISSR 7) and 0.30 (ISSR 3 and ISSR 5) with an average of 0.27 and the mean value for MI was 9.16 ranging from 11.64 (ISSR 6) to 7.16 (ISSR 4). Resolving power (Rp) of the IRAP primer combinations ranged from 12 (Sukkula/Sukkula) to 19 (LTR6150/LTR6149) with an average of 15 while Rp of the ISSR primers varied from 10 (ISSR 3 and ISSR 4) to 15 (ISSR 6). The primers ISSR 6 and LTR6150/LTR6149 were the most informative according to the Rp and MI values.

Genetic diversity and population structure analysis

The diversity parameters of all 240 individuals were calculated based on the combined data of ISSR and IRAP. The highest values of Shannon index (I), Nei’s genetic diversity (H), observed number of alleles (Na), effective number of alleles (Ne) and the percentage of polymorphic bands (PPB) was observed in Lakhsass population (I = 1.194, H = 0.125, Na = 0.864, Ne = 1.202, P = 42.48%) (Table 3). The total genetic diversity (Ht) of A. spinosa L. was 0.1879. Furthermore, the AMOVA showed that genetic variation within populations was 49% while 40% was among populations and 11% of molecular variation was partitioned among regions (Table 4). The proportion of total diversity residing among populations (Gst) was 0.5391 which indicates a high level of genetic differentiation among A. spinosa L. populations. The gene flow (Nm) based on the Gst was measured to be 0.4274. The results of this analysis suggest that the gene flow among populations throughout the entire distribution area is restricted with a high genetic differentiation. The Mantel test revealed that there was no significant correlation between genetic and geographic distance (r = 0.0246; p < 0.05).

Table 3 Genetic variation and polymorphic features estimated using IRAP and ISSR markers among 24 populations of the A. spinosa L
Table 4 Analysis of molecular variance (AMOVA) of 24 populations of A. spinosa L

To elucidate the genetic relationships among the studied genotypes of argan tree, the dendrogram based on all of the molecular data (IRAP + ISSR) was constructed using Jaccard coefficient in NTSYS 2.11 (Applied Biostatistics, Setauket, New York, USA) (Rohlf 2000). The genetic similarity coefficient ranged from 0.14 to 0.74 (Fig. 1). The dendrogram grouped the studied populations into two major clusters and the individuals were separated according to their populations. Some populations of each region were placed into the same sub-cluster and showed high similarities with each other. In fact, populations of region 1 (MR, RB, JB, RT, OH, TM, AI, TS, NK, TO) were grouped in cluster I and the populations of region 3 (GU, AZ, LA) were grouped in cluster II while populations of region 2 and 4 were scattered in cluster I and II. These findings suggest that there is no important correlation between genetic relationships and geographical origins.

Fig. 1
figure 1

UPGMA dendrogram produced using Jaccard’s coefficient based on combined data of two molecular markers (IRAP and ISSR) in the studied 240 individuals of A. spinosa L. I: Cluster I grouped all populations of region 1 and region 4 and few populations of region 2; II: Cluster II grouped all populations of region 3 and few populations of region 2. a: populations of regions 1; b: populations of region 2, c: populations of region 4; d: populations of region 3

Besides, the principal coordinate analysis was performed to visualize the relationships among populations in more details and it explained 48.22% of total variation. The first PCo explained 8.01% of the total variability while the second PCo and the third PCo accounted for 15.05 and 21.16% of the variance respectively. The PCoA presents the relationship among populations and the results were comparable to the cluster analysis UPGMA dendrogram which showed the same distribution (Fig. 2).

Fig. 2
figure 2

Principal coordinate analysis (PCoA) of 240 A. spinosa L. individuals from 24 populations, based on IRAP and ISSR analysis by GenAlEx 6.5. Percentage of total variance was 48.22% (axis1 = 8.01; axis2 = 15.05%; axis3 = 21.16%)

In order, to provide further evidence of hierarchical population structure, powerful computational tools meant to deduce population structure were used. The STRUCTURE software v2.1 (Pritchard et al. 2000) performs Bayesian assignments of individuals to a given number of genetically homogenous clusters (K) of populations. The most likely K for the given range of 2–20, was computed using STRUCTURE HARVESTER (Earl and vonHoldt 2011) based on both the log likelihood and maximum ΔK (Evanno et al. 2005). The maximum value of ΔK was detected at K = 7, followed by K = 2. Furthermore, the results of neighbour-joining tree analysis and PCoA indicated that 24 populations of argan tree were divided into two groups which is in agreement with the STRUCTURE results at K = 2. This result indicated that the genetic structure of A. spinosa L. could be grouped into two populations and seven subpopulations. The first cluster mainly consisted of 13 populations, these populations were split into 4 sub-cluster. The second cluster consisted of 11 populations. This group was further separated into 3 sub-clusters (Fig. 3). One of these clusters was containing the population of region 1 and the other genetic cluster comprised all of the populations of region 3 while populations of region 2 and 4 were placed into both clusters.

Fig. 3
figure 3

Estimated populations structure of A. spinosa L. obtained with the STRUCTURE program. Results are based on IRAP and ISSR markers of the 240 samples. The maximum value of Delta K was detected at K = 7 (a), followed by K = 2 (b). In the barplots, each population is represented by a single vertical line divided into K colors, where K is the number of population assumed. Each color represents one cluster. Population’s codes see Table 1

Discussion

This is the first detailed report of genetic diversity of argan tree that took into consideration the global natural distribution of argan tree in Morocco. In this study we combined the performance of two molecular markers IRAP and ISSR to evaluate conclusively the genetic variation of a set of samples collected from 24 populations. This species is the only one representative of the Argania genus and it’s endemic to a restricted area in Morocco.

The IRAP and ISSR techniques were used to study the genetic diversity and structure of different forest tree such as Quercus libani G. Olivier (Khadivi-Khub et al. 2015) and Quercus brantii (Alikhani et al. 2014). Although, the mean of polymorphic bands of IRAP (41) was higher than ISSR (35.43) their percent of polymorphism bands and the diversity index consisted of PIC value were similar. Also, the two-marker techniques target different regions of the genome. In fact, IRAP marker amplifies genomic DNA of the region between inserted retrotransposons (Kalendar et al. 1999) while ISSR system detects the polymorphism of the region between simple sequence repeats (Zietkiewicz et al. 1994). This explains the importance of combining two markers in obtaining reliable estimates of genetic diversity (Souframanien and Gopalakrishna 2004). During the present investigation, we have demonstrated the effectiveness and power of IRAP and ISSR markers to detect polymorphism and estimate the genetic diversity by calculating the MI and Rp parameters. Our results are comparable to those found by Baraket et al. (2009) on Ficus carica and Pecina-Quintero et al. (2013) on Ricinus communis. It is considered to be informative in distinguishing the individuals and could be successfully applied to assess genetic variation among and within A. spinosa L. populations.

Our analysis of IRAP and ISSR molecular markers indicated that argan tree has maintained a moderate genetic diversity (Ht = 0.1879) despite of its fragmented area. This genetic diversity reflects the historic genetic variability which is quite common in long lived plant species (Geert et al. 2007; Honnay et al. 2005).

The genetic analysis of populations based on combined data (ISSR and IRAP marker systems) showed divergence among the studied populations. The analysis of all clusters did not show clear clustering pattern of geographically closer populations. This result is consistent with those found by Yatrib et al. (2015) using ISSR markers on 17 populations suggesting that the geographic distribution is not the principal factor shaping the present population genetic structure. However, the dendrogram revealed that most of individuals were separated according to their populations. This result can be explained by the reproduction system of argan tree and its pollen scattering mechanism by wind and insects (Bani-Aameur et al. 1999; Hamrick and Godt 1996; Scotti-Saintagne et al. 2004).

The distribution of genetic differentiation among and within populations depends on gene flow rate between populations. The gene flow of a species is a function of the size of populations, the movement of seeds and pollen, the degree of isolations and the distribution of the habitats it occupies (Dumolin-Lapègue et al. 1997). The present analysis revealed a high genetic differentiation among the studied populations (Gst = 0.5391), this could be seen as evidence of low gene flow between the populations (Nm = 0.4). Similar results were found by Mouhaddab et al. (2015) using ISSR markers and by Pakhrou et al. (2016) using AFLP markers. The high differentiation among A. spinosa L. populations is consistent with studies of other endemic species like Cupressus gigantean (Gst = 0.36) (Xia et al. 2008) while it was higher than other forest species such as sessile oak (Gst = 0.026) (Kremer et al. 2002). Moreover, the high genetic differentiation and gene exchange between studied populations are expected for several reasons. Firstly, the studied populations are located in different geographical areas under variable bioclimates (sub-humid, semi-arid, arid and saharan) and the distribution range of argan tree is limited to a specific area and naturally fragmented. This geographical distance indicates a barrier that can lead to difficulties in pollen and seeds scattering (Ding et al. 2008) and it’s a probable cause of low gene flow in argan germplasm. Secondary, the strong differentiation observed could come from divergent adaptation of the species due to environmental factors causing a differentiated selection. Furthermore, the Mantel test indicated that there was no significant relationship between genetic and geographic distances. This result was confirmed by the UPGMA clustring, PCoA and STRUCTRE which indicates that the geographical distribution is not the main factor that shaped the current population genetic structure.

The main goal of current conservation plans is focused on maintaining species diversity (Margules and Pressey 2000; Rodrigues et al. 2013). The results revealed in this study enhance our understanding of population structure of argan tree and provide information which would have important management implications for this species.

Despite political and strategic actions implemented by the Moroccan government to conserve world heritage of the argan tree, the size of populations has decreased in recent times (de Waroux and Lambin 2012) due to a combination of loss of trees, lack of regrowth resulting from overgrazing, increasing aridity and human activities. Based on our results we recommend possible management strategies for the argan tree conservation. The first most important management practice is a periodic monitoring of natural distribution range and establishing a protection area where the exploitation of this tree is forbidden and especially in populations with low levels of genetic diversity such as Tmanar. This strategy can reduce the impact of human and animals activities and allow populations to increase in size through natural regeneration. The second component of proposed strategy concerns the establishment of an ex situ conservation program by seed storage. Although the seeds from different population are needed to preserve most of the genetic variation and not to lose specific genes, it is suggested that the Lakhssas population will be the main source because of its higher levels of genetic diversity. In addition, considering the high genetic differentiation, a field gene bank should be built by a large scale plantation in a nature reserve. This strategy must take into account all existing populations and genetic groups in order to increase the chance of gene exchange and ensure a long-term conservation of most existing genetic variation. Finally, the safest way to preserve these genetic resources is probably through a combination of all these in situ and ex situ approaches but also increased the awareness of the local community about argan tree conservation. This last action can have a major influence on the efficiency of conservation strategy. Furthermore, owing the economic value of this tree, it is important to combine the studies on the distribution of genetic diversity with other information such as oil yield per population.