Introduction

Indiscriminate exploitation of natural forest resources has significantly decreased the size of many natural populations, resulting in fragmented habitats and population isolation (DeFries et al. 2005). Studies have debated the impact of habitat fragmentation and population reduction on genetic diversity in natural populations (Aguilar et al. 2008; Jump and Penuelas 2006; Honnay and Jacquemyn 2007). The fragmentation can significantly affect the movement of animals, pollen, and seeds (Tewksbury et al. 2002), which can alter populations’ genetic structure (Bacles et al. 2004; Sebbenn et al. 2011). The reduced size of natural areas and fragmentation may also lead to a loss of the genetic diversity contained within and among populations (Young et al. 1996; Newman and Pilson 1997).

Anthropogenic disturbances can have a significant impact on population genetic diversity and structure (Santos et al. 2015; Omondi et al. 2016). Consequently, studies of genetic structure and diversity in populations of key biological resources are needed to understand how diversity is distributed within and between populations and factors that affect this distribution (Schwartz et al. 2007). The influence of these factors vary with life-history traits and include effective population size, mode of reproduction, and breeding systems (Degen and Roubik 2004), as well as the geographical range of the species (Rouger and Jump 2014). Furthermore, gene flow also has an impact on genetic structure within and among populations (Provan et al. 2008; Araújo et al. 2017), which is influenced not only by the ability of dispersers and pollinators to reach other populations, but also by geographical barriers that may exist between populations (Dias et al. 2016).

The Caatinga biome represents one of the largest areas of the South American seasonally dry tropical forest. It has been severely deforested as a result of wood consumption, livestock grazing, and fire, and more than half of all ‘poor’ Brazilians in the country live within the biome (Silveira-Neto 2014). Furthermore, most areas of Caatinga are ecologically sensitive, with particularly amplified responses to climate variability (Seddon 2016), and are currently experiencing a trajectory of drying (da Silva 2004). Native to the Caatinga, Copernicia prunifera, known as carnauba palm, is economically significant because of the commercially important wax (carnauba wax) that covers its leaves (IBGE 2018), especially younger leaves. However, extensive and unsustainable harvesting practices, agricultural expansion, and an absence of sustainable management programmes represent major threats to the long-term continuation of C. prunifera populations. Continued unsustainable harvesting of non-timber forest products (NTFPs) is expected to have cascading ecological impacts, from individual and population to community and ecosystem function (Ticktin 2004). Over-exploitation of carnauba populations has had a negative impact on associated wild fauna, for example forcing wild triatomines to seek other habitats (Lima and Sarquis 2008).

Copernicia prunifera populations have rapidly declined because of anthropogenic disturbance over the last century primarily due to deforestation and agricultural expansion (D’alva 2004). The use of carnauba wax dates back to the 18th Century for the production of candles. From the second half of the 19th Century, the discovery of new uses for the wax intensified its exportation and allowed the development of economically important extractive, agroindustrial and commercial activities. From the 1960s, the modernization of agriculture led to the deforestation of extensive areas of the Caatinga, significantly reducing the C. prunifera habitat (D’alva 2004), while exploitation of carnauba has increased. An additional, and substantial, contemporary threat relates to a changing climate given that the whole of the species’ distribution is located in semi-arid regions subject to desertification (MMA 2005). Ecological niche modeling (ENM) allows correlating a set of environmental variables with the geographical occurrence of a species. The ENM become a useful method to address ecology issues such as conservation practices, indicating regions with habitat suitability under ongoing climate change (Zacarías-Correa et al. 2020).

Assessments of genetic diversity for key species can provide important contributions when defining conservation strategies and developing management programs (Duarte et al. 2015) and should be taken into consideration in development of public policies aimed at conserving biodiversity (Laikre et al. 2010). Molecular markers based on amplification of DNA provide valuable tools to study genetic structure and diversity between individuals and within and between populations (Nybom 2004). The use of inter-simple sequence repeat (ISSR) markers provides a quick and simple method to effectively analyse the genetic diversity of natural populations across a large number of polymorphic bands. This method is low cost and does not require prior information of the genome, which is particularly important for genera such as Copernicia as there is no previous knowledge of microsatellite regions of the genome (Reddy et al. 2002). While ISSR markers cannot differentiate heterozygous from homozygous individuals since they are dominant markers, they do permit the analysis of multiple loci in a single reaction (Wolfe 2005) and can be an alternative in cases where a high number of null alleles exist in microsatellite markers (Rosa et al. 2017).

Given the importance of C. prunifera to local communities and the potential impacts of its overexploitation on resource sustainability and biodiversity, we sought to determine if recent rapid increases in the exploitation of C. prunifera populations are associated with negative impacts on the genetics of the species. We hypothesised that genetic bottlenecks would accompany high levels of genetic differentiation among populations due to unsustainable management practices over the years in a harvest-intense area. Furthermore, we sought to determine the extent to which landscape boundaries result in current genetic discontinuities within the species and potential interactions of exploitation and habitat suitability predicted by ENM.

Material and methods

Target species

Copernicia prunifera individuals can be found in river valleys and in seasonally flooded areas in the semi-arid region of northeastern Brazil, where they generally form monodominant populations known as carnaubais. The species is highly resistant to the prolonged absence of water and permanent floods (Arruda and Calbo 2004). The wax produced from its leaves is used in cosmetics, pharmaceutical capsules, electronics, food products, polishing waxes, and coatings (Sousa et al. 2015), and the stems are commonly used in house construction (Fig. S1). The production value of its wax and fibers brings in more than $55 million per year, according to the official government data (IBGE 2018). The species presents multiple inflorescences, which are made up of yellowish and hermaphroditic flowers (Silva et al. 2017). Flowering is subannual, with greater intensity between November and February and ripe fruits between January and March (Rocha et al. 2015). The flowers are visited by insects like the irapuá bee (Trigona spinipes) and the maribondo-caboclo wasp (Polistes canadensis), and the species has a mixed mating system that is preferentially allogamous (Silva et al. 2017). Fruits are likely dispersed by the palm tanager (sanhaçu-do-coqueiro; Tangara palmarum) (Silva et al. 2017) and bats (Sousa et al. 2015), demonstrating the relevant interactions between species (animal-plant) that need to be preserved.

Sampling

This study was conducted in eleven natural populations located in Rio Grande do Norte and Ceará States, Brazil, which represents one of the areas in which the species is most intensely harvested in Northeast Brazil (D’alva 2004; IBGE 2018). One-hundred and eighty individuals were sampled (Table 1 and Fig. 1), and sampling ranged from 11 to 24 individuals per population, which is consistent with other studies using ISSR markers (Duarte et al. 2015; Rosa et al. 2017). Pairwise distance between populations ranged from 4.6 km between SER and LGP to 310.4 km between LGP and AR1 (Fig. 1). Small pieces of leaves were cut using a tree trimmer, placed in plastic tubes containing 2 mL CTAB 2X (cationic hexadecyltrimethylammonium bromide), labelled, and stored in a freezer at – 20 °C until DNA extraction.

Table 1 Location of sampled Copernicia prunifera populations, population code, sample size (n), total quantity of powder and wax (tons) produced from vegetal extraction (between 1986 and 2018), and geographical information
Fig. 1
figure 1

Geographic location of the sampled Copernicia prunifera populations in northeast Brazil (a), and altitudinal gradients (b). Populations are identified according to genetic groups established by Structure (see Fig. 3 and Fig. 4). Group distribution is shown in comparison with ecological niche modelling for the species at present day (c) and the future scenario (d). Red corresponds to regions with the highest probability of C. prunifera occurrence, blue corresponds to the least suitable regions, green lines correspond to rivers. Both figures (c and d) show the main genetic boundaries indicating three barriers among populations (dotted lines a–a, b–b, c–c) obtained with Monmonier’s maximum difference algorithm (see Fig. S3). The coordinates of each population are shown in Table 1

Historical anthropogenic disturbances

Although change in population size was not measured directly, a previous ethnoecological and ethnobotanical survey indicates substantial population decrease over recent decades (Sousa et al. 2015) that has accelerated since the 1960s (D’alva 2004). All sampled populations have been subjected to recent disturbance, showing signs of fire, intensive leaf extraction, timber harvesting, and trampling by cattle resulting in damage to regeneration (Fig. S1). Government data showing powder and wax production derived from C. prunifera are given in Table 1 and are based on the Brazilian Institute of Geography and Statistics Automatic Recovery System—SIDRA (IBGE 2018).

DNA extraction, PCR, and electrophoresis

DNA extraction was performed using the CTAB method, as described by Doyle (1990). We tested 29 ISSR primers and selected seven that best amplified C. prunifera DNA. For polymerase chain reaction (PCR), the Veriti automatic thermocycler was used with a volume of 12 μL containing genomic DNA. The PCR mix was composed of buffer (10x), BSA (1.0 mg mL−1), MgCl2 (50 mM), dNTP (2.5 mM), primer (2 μM), Taq polymerase (5.0 U μL), DNA (diluted 1:50), ISSR primer (2 µM), and ultrapure water. The reaction sequence consisted of denaturation at 94 °C for 2 min followed by 37 cycles of 94 °C for 15 s, 47 °C for 30 s, and 72 °C for 1 min. The process was completed with a final step at 72 °C for 7 min and then cooled to 4 °C. Amplification products were subjected to 1.5% horizontal agarose gel electrophoresis, stained with GelRed in 1 X TAE (Tris–Acetate-EDTA) buffer at a voltage of 100 V for two and a half hours against a 1 kb molecular weight size marker. Subsequently, the gels were visualised and photographed in ultraviolet light using the E-Box VX2 (Vilber Lourmat, Marne la Valle, France).

Genetic diversity

Polymorphic information content (PIC) was calculated to test the ability of the ISSR primers to distinguish polymorphism between individuals, with the absence or presence of bands as indicators. For the calculation, we used the formula proposed by Anderson et al. (1993): \({\text{PIC}}_{{{\text{i }}}} = 1 - \sum\nolimits_{{{\text{j}} = 1}}^{{\text{n}}} {{\text{P}}_{{{\text{ij}}}}^{2} }\), where \({\text{P}}_{{{\text{ij}}}}\) is the frequency of allele “j” in marker “I”. To estimate the genetic diversity parameters, we used the software PopGene v.1.32 (Yeh et al. 1997) to assess the total number of observed alleles (na), number of effective alleles (ne), Nei's (1973) genetic diversity (h), and Shannon index (I) for each population. The Bayesian approach to determine genetic diversity (hs, Holsinger 1999) was also estimated using the program Hickory v.1.1 (Holsinger and Lewis 2007).

Genetic structure and discontinuity

Genetic differentiation among populations was calculated using both Nei’s (1978) standard genetic distance (Ds) and a Bayesian approach (theta), in which we assessed the theta-II statistic (Holsinger and Lewis 2007) that corresponds to theta-B of Holsinger and Wallace (2004). This provides the best estimate of the proportion of genetic diversity due to differences among contemporaneous populations in the program Hickory v1.1 (Holsinger and Lewis 2007). Mantel’s test was performed using GenAlex v.6.503 (Peakall and Smouse 2012), resampled using the Monte Carlo method (999 permutations), to test for the existence of a correlation between geographic distance and both Nei’s genetic distance (Ds, 1978) and theta-II (Holsinger and Lewis 2007).

The program Ntsys (Rohlf 1993) was used to produce a dendrogram based on the unweighted pair-group method using arithmetic averages (UPGMA) to simplify interpretation of genetic identity based on Nei’s (1978) distance obtained with PopGene. The stability of the clusters was verified with bootstrap analysis using 1000 permutations implemented in the program Bood-P, version 1.2 (Coelho 2001). Bayesian analysis was performed using the program Structure v.2.3.4 (Pritchard 2000) to infer the number of genetic groups (K) that represent the sampled populations. Ten independent runs for each K (ranging from 1 to 13) were conducted, with the estimates of K based on the model of mixed ancestry (admixture) and the frequency of correlated alleles. Each run was comprised of 250,000 simulations via Markov Chain Monte Carlo (MCMC) and a burn-in of 500,000 iterations. The number of K populations was identified according to the method ∆K (Evanno et al. 2005), as implemented in the Structure Harvester program (Earl and Vonholdt 2012). We used the program Arlequin 3.5 (Excoffier and Lischer 2010) for the analysis of molecular variance (AMOVA) to understand how genetic variation is partitioned within and among clusters (according to Bayesian analysis), using 10,000 permutations to test for significance.

Subsequently, a fully Bayesian clustering approach was implemented in the program Barrier 2.2 (Manni et al. 2004) to identify any potential discontinuity of genetic data across the geographical area. The sampled populations were connected by Delaunay's triangulation according to their geographical coordinates. Monmonier's algorithm was implemented to identify zones with the greatest genetic differences (Ds).

Environmental variables

BIOCLIM variables (Booth et al. 2014) included in the model to predict the availability of suitable environments for the species were obtained from the WorldClim database, version 2.0 (worldclim.org/; Fick and Hijmans 2017). Climate projections (average for 2061-2080) were downloaded from WorldClim version 1.4 (Hijmans et al. 2005). Projections were based on the representative concentration pathway 8.5 or ‘business as usual’ scenario (Riahi et al. 2011) from the Earth system configuration of the 2nd Hadley Centre Global Environmental Model (HadGEM2-ES, Collins et al. 2011). Climate distributions were projected at a spatial resolution of 30 arc-s (~ 1 km2). To derive a model with a reduced set of variables, we used Pearson's correlation coefficient for each pairwise comparison to eliminate highly correlated, redundant variables (r ≥ 0.85 or r ≤ − 0.85, Table S1), with the program ENMTools 1.4.3 (Warren et al. 2010). Then, a reduced final set of six current bioclimatic variables that maximized training gain (Quipildor et al. 2018) and the area under the curve (AUC) were utilized, based on the preliminary MaxEnt model (Table S1).

Niche modeling

We obtained C. prunifera occurrence records (n = 35) using self-collected data and from Brazil’s speciesLink network (splink.cria.org.br; Canhos et al. 2015), an e-infrastructure that provides free and open access to primary biodiversity data and associated tools. Errors, duplicates, and records of cultivated plants were identified and eliminated inside a geographic area of approximately 260,500 km2, in order to avoid bias caused by uneven sampling. The distribution model to predict the availability of suitable environments for the species was obtained using the machine-learning maximum entropy model, Maxent version 3.4.1 (Phillips and Dudík 2008). Ten replicates of multiple runs of cross-validation were used, in which the occurrence data are randomly divided into a number of equal-sized groups (Phillips and Dudík 2008). As a threshold, we chose the 10th percentile training presence to optimize the correct discrimination between presence and pseudo-absences in the test data, using the raw output of Maxent (Merow et al. 2013). We explored a range of regularization coefficient values (1.0 to 5.0) to compare competing models (Merow et al. 2013). The Bayesian (BIC) and sample size corrected Akaike information criteria (AICc) were employed for model selection (Warren and Seifert 2011), showing that 2.0 was the most appropriate level of regularization (Table S2).

Detection of genetic bottlenecks

Recent reductions in effective population size were assessed using the Bottleneck program, version 1.2 (Cornuet and Luikart 1996). The Infinite Allele Model (IAM) and Stepwise Mutation Model (SMM), based on Kimura and Crow (1964) and Kimura and Otha (1978), respectively, were used to infer the presence of genetic bottlenecks. The mutation model of the ISSR loci is an intermediary between IAM and SMM (Luikart et al. 1998), thus we used both models. The sign test was applied (α = 0.05) based on the frequency of alleles to determine the existence of recent, significant genetic bottlenecks (Cornuet and Luikart 1996).

Results

Genetic polymorphism

The seven selected primers amplified 101 loci. The number of loci per primer ranged from 13 to 18 with an average of 14.4 (Table 2). The PIC of each primer used varied from 0.339 to 0.446, with an average of 0.418.

Table 2 Nucleotide sequence of ISSR primers, number of loci, and PIC value of each primer

Genetic diversity

The percentage of polymorphic loci of the populations ranged from 16.83% in SER to 79.21% in SMG. The mean Nei’s genetic diversity (h) was 0.213, the mean Bayesian genetic approach (hs) was 0.236, and the Shannon index (I) was 0.312 (Table 3). The estimates of hs based on Bayesian approach were less variable (Coefficient of Variation = 19.89%) than Nei's genetic diversity h (CV = 36.30%) and Shannon index I (CV = 36.11%) (Fig. S2).

Table 3 Genetic diversity parameters of Copernicia prunifera natural populations

We found a positive and significant correlation between estimates of h and hs (rPearson = 0.986; P < 0.0001), between estimates of h and I (rPearson = 0.999; P < 0.0001), and between hs and I (rPearson = 0.986; P < 0.0001). The populations SMG, MOS, ICA, and RUS presented higher values of Nei’s genetic diversity (h ≥ 0.280 Table 3). The Shannon index (I) showed that the SMG, MOS, ICA, AR1, and RUS populations have higher values (I ≥ 0.400).

The greatest genetic distance was between SMG and SER (0.581) according to Nei’s Ds (Table S3), and between APD and SER (0.657) according to theta-II genetic distance (Table S4). The smallest genetic distance was between AR1 and AR2 for both methods (Ds = 0.017; theta-II = 0.005). The mean Ds was 0.213 and the mean theta-II was 0.375.

Population genetic structure and ENM

According to Bayesian inference, the full statistical model had the smallest DIC (Table S5; Spiegelhalter et al. 2002). Thus, the analyses of genetic diversity (hs) and pairwise genetic differentiation among populations (theta-II) were run using the full statistical model.

The Mantel test revealed the existence of a positive and significant correlation between geographic and genetic distances using both Nei’s (r = 0.423; P = 0.006) and theta-II genetic distance (r = 0.449; P = 0.003) (Fig. 2). C. prunifera populations are geographically structured and the results obtained from Bayesian analysis suggest the existence of three genetic groups (ΔK = 3; Fig. 3); this structure is congruent with the UPGMA dendrogram and Bayesian subdivisions (Fig. 4).

Fig. 2
figure 2

Relationship between geographic distances and Nei’s genetic distance (A) and theta-II genetic distance (B) for Copernicia prunifera populations

Fig. 3
figure 3

Plot of the mean values of LnP (D) of the Bayesian analysis (solid line) and ΔK analysis (dotted line). The bars indicate standard deviations of LnP (D) values

Fig. 4
figure 4

UPGMA dendrogram based on Nei’s genetic identity (left). Bootstrap values, when ≥ 50%, are given at each of the forks in the dendrogram. Bayesian analysis with the proportion of genotypes in the sampled populations (right), whereas the dark horizontal lines delimit populations. SE – Southeast (red); NC—North Coast (blue); NW—Northwest groups (green)

The AMOVA indicated the existence of significant population structure, with 14.61% variation among the Northwest, North Coast, and Southeast groups (\(\Phi_{{{\text{CT}}}}\), P = 0.005), 25.84% among populations within groups (\(\Phi_{{{\text{SC}}}}\), P < 0.0001), and 59.56% within populations (\(\Phi_{{{\text{ST}}}}\), P < 0.0001) (Table 4). The Southeast group had a smaller total h (0.151), hs (0.206), and I (0.221) than the Northwest group (h = 0.221; hs = 0.235; I = 0.324) and North Coast group (h = 0.281; hs = 0.285; I = 0.414).

Table 4 Analysis of molecular variance (AMOVA) in Copernicia prunifera populations

The mapping of Ds using Delaunay's triangulation showed three genetic discontinuities (barriers) that separated even geographically proximal populations, as follows: (1) SER and LGP; (2) MAC; (3) ICA, SMG, AR1, AR2, RUS, MOS, APD and JUC, as shown in Fig. 1 and Fig. S3. The identified genetic discontinuities correspond to the most unfavourable geographical range for the species according to niche modelling (barrier a-a, Fig. 1c and d) and to altitudinal gradients (barriers b–b and c–c, Fig. 1b). According to the ENM analyses, the most favourable region for the occurrence of C. prunifera is in the Northwest of the sample area (Fig. 1c). The species does not grow well at high altitude, where the current range was identified as unsuitable for the species (Fig. 1b and c). The environmental variables that most influenced the current range were minimum temperature of coldest month (bio06) and mean temperature of warmest quarter (bio10) (Table S6). For the future scenario, the most influential variables were bio06, and the annual temperature range (bio07). In the future scenario, the extent of potentially suitable habitat for C. prunifera within its native range is reduced (Fig. 1d).

Genetic bottlenecks

Populations SER, MAC, JUC, APD, and RUS revealed a highly significant deficit in heterozygosity under both IAM and SMM models, thus demonstrating the occurrence of population bottlenecks (Table 5). MOS, ICA, and AR1 populations showed a significant bottleneck based on the IAM model and only the LGP population showed a significant genetic bottleneck based on the SMM model. Populations AR2 and SMG demonstrate equilibrium between mutation and drift.

Table 5 Tests of equilibrium between mutation and genetic drift for the studied Copernicia prunifera populations based on IAM and SMM models

Discussion

The markers used in the present study were moderately informative (Botstein et al. 1980), with PIC values ranging from 0.339 to 0.446. We found a high percentage of polymorphic loci for the whole population (99.09%), which demonstrates that the ISSR molecular markers used in this study are effective for estimating genetic diversity. ISSR markers have been used successfully in recent studies of genetic diversity (Pádua et al. 2021; Torres-Silva et al. 2021). Based on AMOVA, greater genetic variation occurred within than among populations. However, the genetic differentiation among populations was relatively high (\(\Phi_{{{\text{ST}}}} = 0.371;\;37.1\%\)) according to the expectations for species with similar life-history traits (Nybom 2004), and likely related to the large geographical distances between populations as discussed below.

Historical range and recent changes to the size and distribution of populations can influence the diversity within and genetic differentiation between populations (da Silva et al. 2015). According to Monmonier’s algorithm, our analysis indicates that populations from the Southeast group (LGP, SER, and MAC) are more isolated than the other population groups, with less genetic diversity (Table 3) and were clustered by Structure as sharing genotypes (Fig. 1 and Fig. 3). The Bayesian analysis revealed that C. prunifera populations occurring in the most favourable region of the species’ geographical range showed the highest levels of genetic diversity (Northwest and North Coast groups). The likely absence of genetic discontinuities in the Northwest region and the indication that this is the most favourable area of the species’ range may have enabled the maintenance of high levels of genetic diversity in these populations. This finding is of particular interest for the understanding of the local adaptation of C. prunifera populations and to make conservation decisions, since the genetically informed ecological niche models (gENMs) improve the predictions of species distributions under ongoing climate change (Ikeda et al. 2017).

The high suitability in the Northwest and the average suitability in the Southeast can be explained by the native range. C. prunifera populations generally occur at river valleys (Fig. 1c; green lines) and seasonally flooded areas in the semi-arid. Furthermore, the Northwest populations belong to the Caatinga biome, a seasonally dry tropical forest. On the other hand, the populations in the Southeast are influenced by the Atlantic Forest biome, a rainforest. The humidity coming from the ocean currents of the Atlantic Ocean (Xie and Carton 2004) added to the presence of the Atlantic Forest (da Silva and Tabarelli 2000) probably are not enough to provide high suitability for the wide distribution of the species in the Southeast of the sample area. However, in the future scenario, the extent of potentially suitable habitat for C. prunifera within its native range is reduced, mainly in the coastal region of the Northwest and Southeast occurrence area (Fig. 1d), which is also subject to the greatest anthropogenic pressure (e.g. urban and agricultural expansion, wind power plants) from human populations (Scarano and Ceotto 2015).

The Mantel test confirms that the most geographically remote sampled populations were also less genetically similar. Nei’s (1978) standard genetic distance between populations had an average of 0.21, which is high for species with animal-ingested seed dispersal mechanisms (\({\text{G}}_{{{\text{ST}}}} = 0.16\) ; Nybom 2004). Although bats are potential dispersers (Sousa et al. 2015), C. prunifera individuals present an aggregated spatial pattern and spatial genetic structure up to 12.3 m which may be related to restricted seed dispersal (Pinheiro et al. 2017a). The greatest genetic similarity was found between populations AR1 and AR2, and between RUS and MOS, which are geographically proximal to each other and belong to the Northwest group. Despite the considerable geographic distance between the ICA and SMG populations, they are nearest the coast and grouped by both the dendrogram and Bayesian analysis. However, phylogeographic data are necessary to better understand the colonization history of the species in different habitats (e.g. Zhang et al. 2020).

Alongside potential future reductions in habitat suitability, as well as overexploitation and anthropogenic disturbances, it is essential to identify populations that have undergone reductions in effective population size to understand the risks of possible local extinction due to reduced population size (Cobo-Simón et al. 2020). A reduction in effective population size may lead to a reduction in genetic diversity within populations, likely as a result of genetic drift after demographic bottlenecks (Jacquemyn et al. 2009), especially given the predicted reduction in suitable habitat for C. prunifera under ongoing climate change. Most of the populations showed a genetic bottleneck (Table 5), which is likely due to the significant anthropogenic pressure related to intense exploitation of carnauba wax in these areas since the 18th Century, as well as deforestation for the expansion of agriculture (D’alva 2004; Sousa et al. 2015). Although the SMG population showed no evidence of a recent bottleneck, it is currently affected by extensive anthropogenic impacts due to the expansion of wind power generation and the occurrence of fires in the neighbouring vicinity (personal observations), which may result in future genetic bottlenecks.

Although C. prunifera is not currently listed as an endangered species (Martinelli and Moraes 2013), it has been substantially affected by the expansion of agricultural activities over time, contributing to reductions in its natural populations (D’alva 2004; Sousa et al. 2015). In addition to recent reductions in population size and loss of diversity, we can infer that the studied populations have high genetic divergence, indicating current genetic isolation. Consequently, conservation measures for natural C. prunifera populations are needed to minimize further loss of alleles and to ensure sustainability resources for traditional communities. While herein we assessed neutral diversity, parallel losses in functional diversity might have consequences for the future of the species as its environment continues to change. Climate change will have profound effects on the semi-arid region (Marengo et al. 2017; Pinheiro et al. 2017b), and alterations in the potentially suitable habitats showed in our study should be considered (Fig. 1). In addition to in situ conservation of natural populations, and given the substantial economic importance of this species, one strategy would be the creation of germplasm banks for ex situ conservation, with seeds coming from the most diverse populations. Since the seeds are recalcitrant (Araújo et al. 2013), we recommend in vivo germplasm banks. Another approach could include the preservation of several populations across the geographic distribution of the species, considering the divergent genetic groups identified herein.

In order to avoid or minimize the deleterious effects of bottlenecks observed in most populations, one approach to mitigation would be to enhance gene flow between populations (Luikart et al. 1998). However, given the likely interaction between genetic and demographic decline, we suggest that in situ conservation to induce natural regeneration is a priority. Nevertheless, most of the populations are likely to be subjected to limitations in terms of palm establishment, for example due to NTFP extraction and soil compaction and trampling through animal husbandry. Consequently, management strategies should also focus on practical measures to improve regeneration success, such as pausing extractive activity during reproductive periods and introducing rotation cycles for leaf harvesting to recover over-exploited areas. Also, there is a need to consider the current social and economic conditions of harvesters to reach successful ‘social’ forests (Pritchard and Brockington 2019). This means that harvesters in poorer areas need additional support, including longer-term investments, to keep the equilibrium between the socioeconomic demand and forest conservation (Poudyal et al. 2018; Oldekop et al. 2019). These strategies can occur alongside the development of productive C. prunifera forests to support a more sustainable resource supply by reducing pressure from wild harvesting. The sustainable management of non-timber C. prunifera products is urgently needed to limit the negative impacts resulting from the deforestation of these populations which can contribute to developing a sustainable supply that can provide financial income for rural communities into the future.