Introduction

The population history of most plant species in East Africa is still in the dark, but accumulated evidence suggests that the variable climate in the region over the millennia has resulted in plastic and complex patterns of diversity (Livingstone 1975; Hamilton 1982). In particular, Afromontane forest species have been subject to a number of population expansion and retraction dynamics, following changing precipitation conditions (Street-Perrott and Perrott 1993). Among the better studied species is Prunus africana (Hook. f.) Kalkman (synonym Pygeum africanum Hook. f.), a keystone species of the Afromontane forests. It is a medium to large multipurpose tree of 30–40 m height, commonly known as African cherry. P. africana is hermaphroditic, insect pollinated, and bird dispersed. It mainly occurs in montane forests but also along creeks at lower elevation. The species is an economically important and endangered member of the Afromontane flora and occurs geographically widespread in mainland Africa and outlying islands (Bioko, Grande Comore, Sao Tome, and Madagascar) (Kalkman 1965). In Ethiopia, P. africana is one of the victims of omnipresent deforestation (Reusing 2000). During the last century, most of the Afromontane forests have been cleared and only a very small proportion of the original vegetation remains (Teketay and Granström 1995). Besides timber and numerous traditional medicinal uses of this species, its bark extracts are commercially used to treat benign prostatic hyperplasia (Kadu et al. 2012, and references therein). Bark is especially exploited in Cameroon and Madagascar (Cunningham and Mbenkum 1993; Cunningham et al. 1997). In addition to this, overexploitation for pharmacological use, deforestation, habitat fragmentation, wildfires, and invasive alien plants threaten this species at different levels throughout its native range (Jimu 2011). Moreover, the modeled future distribution of P. africana indicates that it will be negatively affected by climate change (Vinceti et al. 2013). Consequently, the “Convention on International Trade in Endangered Species of Wild Fauna and Flora” includes African cherry as a vulnerable species that may become threatened by extinction unless its trade is regulated (http://www.cites.org/eng/app/appendices.php accessed 8/2015; Dawson et al. 2000). At the national level, policies have been established in various African countries aiming to ensure sustainable utilization and management of P. africana, although enforcement problems persist (Vinceti et al. 2013). It is very likely that valuable genetic resources have already been lost as some of the populations have been overexploited or gone extinct.

A high level of molecular diversity is still maintained in the species despite its heavy exploitation. Previous studies using randomly amplified polymorphic DNA (RAPD) markers indicated high diversity levels within certain regions of its native range (Barker et al. 1994; Dawson and Powell 1999; Muchugi et al. 2006). More recent studies using microsatellite markers also pointed to high genetic diversity (Farwig et al. 2008; Kadu et al. 2011, 2013).

In regard to its phylogeography, Aubréville (1976) suggested that P. africana or its progenitor migrated from its possible origin in the Middle East into the northeast of Africa. Based on DNA findings, it is likely that this assumed southward migration route split into a western and eastern lineage, which finally contributed to the colonization of southern Africa (Muchugi et al. 2006; Kadu et al. 2011). Populations in Madagascar are highly diverged and were isolated for an extensive time period (Kadu et al. 2011, 2013). The Eastern Rift Valley (ERV) was identified as a major barrier to gene flow between population groups (Kadu et al. 2011, 2013). The Ethiopian highlands, which form the northern part of the ERV, are of particular importance, because they represent the suggested primary immigration and spreading route of the species or its predecessors into Africa. Unfortunately, Ethiopian samples were unavailable at the time of the genetic inventory of P. africana across its native range in the previous analysis conducted by Kadu and her colleagues (2011, 2012, 2013). Six Ethiopian populations have already been studied using RAPDs by Atnafu (2007), resulting into two main clusters geographically separated by the main Ethiopian Rift. Nevertheless, an extensive sampling was desirable to elucidate the phylogeography and population history of the species in this important region and in particular the integration of Ethiopian data with common methods into the available data set to provide an overall understanding of the phylogeography of this important species. In the present study, we investigate the population genetic structure of this species in Ethiopia and study whether Ethiopia is an ancestral region for the continental diversity of this species, by adding chloroplast and microsatellite analysis of Ethiopian populations to the data sets of Kadu et al. (2011, 2013). The results of our study are expected to increase our understanding of the phylogeography of this Afromontane tree and should also increase the efficiency of conservation efforts toward the species. The data set presented here is one of the most complete concerning any African tree species with a wide-ranging distribution.

Material and methods

Sampling

Twenty-one populations were sampled in Ethiopia from 35° 32′ (Gore) to 41° 49′ (GaraMuleta) longitude east and 5° 53′ (Kibre Mengist) to 11° 50′ (Debre Tabor) latitude north covering an altitudinal range from 1584 m (Harenna) and 2859 m (Debre Tabor) above sea level (Table 1). The geographic distance between populations ranged from 26 km (Addis Ababa—Menagesha) to 700 km (Gara Muleta—Gore). From each population, young leaves from 10 trees were collected and dried in ziplock plastic bags containing silica gel. To minimize the collection of closely related individuals, sampled trees were at least separated by a distance of 100 m. Due to the very small population size in Denkoro (DE) and Wof Washa (WW), this minimum distance was abandoned and every available tree was sampled. Compared to the sampling scheme of Kadu et al. (2011, 2013), the average sampled population size was smaller but more populations were sampled over a comparatively restricted geographic area.

Table 1 Description of 21 sampling localities of Prunus africana in Ethiopia

DNA isolation

Total genomic DNA was isolated from 40 to 60 mg of dried leaf samples using a modified triple cetyltrimethyl ammonium bromide (CTAB) extraction technique (Borsch et al. 2003). DNA samples were run in 1 % agarose gelelectrophoresis in order to check their quality and quantity. Concentration and purity of the DNA samples were further determined using a ND-1000 spectrophotometer (NanoDrop, USA).

DNA amplification and genotyping

In total, 11 microsatellites originally developed for other species were used to genotype all samples. Nuclear DNA was studied using six microsatellite loci (nuclear simple sequence repeat (nSSR)) originally characterized in peach [Prunus persica, primer pairs U3 (UDP9-403) and U5 (UDP96-018) (Cipriani et al. 1999) and P2 (PS12A02) Sosinski et al. 2000] and wild cherry (Prunus avium, primer pairs EMPaS01, EMPaS06, and EMPaS10; Vaughan and Russell 2004). These microsatellites were previously transferred to P. africana (Cavers et al. 2009; Kadu et al. 2013). For the investigation of plastid DNA variation, five microsatellites originally developed for Japanese plum (Prunus salicina, primer pairs TPSCP1, TPSCP5, and TPSCP10; Ohta et al. 2005) and Sorbus aucuparia (primer pairs rps16pm2 and trnT-Lpm1; Chester et al. 2007) were used. These chloroplast simple sequence repeat (cpSSR) markers were also previously transferred to P. africana, and for rps16pm2 and trnT-Lpm1, two indels were detected by Kadu et al. (2011). Indels were treated as four additional loci, i.e., rps16pm2a, rps16pm2b, trnT-Lpm1a, and trnT-Lpm1b.

PCR reactions were performed in a 10 μL volume containing 1 μL (10–70 ng) total genomic DNA, 2.3 μL KAPA2G Buffer A (Kapa Biosystems), 0.05 mM of each dNTP, 0.21 mM of each primer, one of the two being 5′ labeled with a fluorescence dye, 0.45 U KAPA2G Fast DNA Polymerase (Kapa Biosystems), and autoclaved deionized water. Amplifications were run on a PTC-100 thermocycler using the following heating profile: a first step initial denaturing at 95 °C for 3 min followed by 35 cycles, each consisting of 30 s denaturing at 94 °C, 30 s annealing at a specific temperature (54 °C for all nSSR primers, 49 °C for trnT-Lpm1, 50 °C for TPSCP5 and rps16pm2, 53 °C for TPSCP1, and 55 °C for TPSCP10), and 5 s extension at 72 °C. The last cycle was ended by additional 30 s at 72 °C to complete extension.

PCR-amplified DNA fragment size was determined by capillary gel electrophoresis using a CEQ8000 sequencer (Beckman Coulter, USA). Selected DNA samples from the previous studies of Kadu et al. (2011, 2013) were used as reference to standardize genotyping and to link the results with those of previous studies.

Data analyses—Ethiopia

In a first step, the Ethiopian nSSR data set was checked for the presence of genotyping errors and null alleles using the program Micro-Checker (van Oosterhout et al. 2004). Thereafter, the number of observed alleles (N a ), the number of effective alleles (N e ), the observed heterozygosity (H o ) and unbiased expected heterozygosity (H e ), the fixation index (F), the number of private alleles (A p ), and the percentage of polymorphic loci (PPL) per population were calculated, and deviations from Hardy–Weinberg expectations (HWEs) were assessed using GenAlEx 6.501 (Peakall and Smouse 2012). The number of alleles corrected for equal sample size (allelic richness, R s ) was calculated with FSTAT (Goudet 1995). The presence of a phylogeographic pattern within the Ethiopian populations was tested using SPAGeDi1.4b (Hardy and Vekemans 2002) by evaluating the contribution of the stepwise mutation to the differentiation patterns at the nSSRs. Here, the observed differentiation R ST is compared to the differentiation obtained after 1000 allele size permutations (pRST) as an approximate for F ST. If stepwise mutations do not contribute to differentiation, R ST should be equal to F ST, but if stepwise mutations have contributed to differentiation, R ST is expected to be significantly higher than pRST, respectively, F ST.

The model-based Bayesian clustering method implemented in STRUCTURE 2.3 (Pritchard et al. 2000) was used to infer clusters by assigning individual multilocus genotypes probabilistically to a user-defined number of K clusters. The admixture model without incorporation of population information was used, assuming correlated allele frequencies applying K values ranging between 1 and 21 as well as run lengths of 800,000 iterations with a burn-in period of 200,000. Five runs per K were performed for the Ethiopian populations separately. The most likely number of clusters was inferred using the ΔK statistic of Evanno et al. (2005) implemented in STRUCTURE HARVESTER 0.6.93 (Earl and von Holdt 2012). Analysis of molecular variance (AMOVA) was carried out in Arlequin 3.5 (Excoffier and Lischer 2010) to investigate population differentiation at various subdivisions based on geographical units, tree seed zones, and individual-based population clusters from STRUCTURE using both F ST and R ST in separate analyses based on 10,000 permutations. Isolation by distance was evaluated by a Mantel test (Mantel 1967) using Nei’s unbiased genetic distance (Nei 1978).

The chloroplast data were transformed into multilocus haplotypes and for a unique identification of haplotypes combined with the range-wide data set of Kadu et al. (2011). The number of haplotypes per population (N a ), the effective number of haplotypes (N e ), the number of private haplotypes (N p ), and haplotype diversity (H e ) were calculated using GenAlEx 6.501 (Peakall and Smouse 2012). Total haplotype diversity (h T ) and average within population haplotype diversity (h S ) were calculated according to Pons and Petit (1995, 1996) using the software PerMut (http://www.pierroton.inra.fr/genetics/labo/Software). To test for the existence of phylogeographic pattern in haplotype diversity, we compared the two measures of differentiation N ST and G ST, which are based on ordered and unordered alleles, respectively. Significance was tested based on 1000 permutations.

The data sets of both the nSSRs and the cpSSRs were split into two groups (north-northeastern vs. south-southwestern populations, low vs. high elevated population, and farmland vs. non-farmland) in order to test for differences by t- tests. The results of these tests were corrected for multiple comparisons by Bonferroni corrections (one-sided or two-sided). Estimates of diversity derived for nuclear and plastid genomes, respectively, were rank-correlated.

Both AMOVA and Mantel test were computed using GenAlEx 6.501 (Peakall and Smouse 2012), and significances were tested based on 9999 permutations. The pollen-to-seed migration proportion was calculated after Ennos (1994) by \( r=\frac{\left[\left(\frac{1}{F_{\mathrm{ST}}}-1\right)\left(1+{F}_{\mathrm{IS}}\right)-2\left(\frac{1}{N_{\mathrm{st}}}-1\right)\right]}{\left(\frac{1}{N_{\mathrm{ST}}}-1\right)} \)

Data analysis—range wide

To analyze the relationships between the Ethiopian populations and the range-wide collection of Kadu et al. (2011, 2013), an unweighted pair group arithmetic mean dendrogram (UPGMA) based on Nei’s standard genetic distances (Nei 1972) was computed for the nSSR data using 1000 bootstrapped distance matrices created in MSA (Dieringer and Schlotterer 2003). The computer programs NEIGHBOR and CONSENSE in the PHYLIP 3.63 package (Felsenstein 1989) were used for tree construction. As an additional method to identify population clusters, principal co-ordinate analysis was conducted using GenAlEx 6.501 (Peakall and Smouse 2012) to explore multivariate relationships among inter-individual Nei’s standard genetic distance (Nei 1972). Also, the STRUCTURE analysis as described above was performed for the total data set (including the data from Kadu et al. 2013).

Haplotype relationships of all chloroplast polymorphisms were inferred by constructing a statistical parsimony network following a two-step strategy according to Bänfer et al. (2006). This procedure accounts for the (presumably) different mutation rates underlying indel and microsatellite variation. First, haplotype data from the indel variation were employed to construct a backbone network using TCS 1.18 (Clement et al. 2000). Second, the network was enlarged by adding the variation at microsatellite loci manually at the respective positions of the backbone.

Results

Genetic structure within Ethiopia

In total, 89 alleles were detected at six nuclear microsatellites within the Ethiopian populations (mean number of alleles per locus was 14.83). The most variable locus was EMPAS10 with 27 alleles, while the least variable one was U3 at which only two alleles were detected (Suppl. Table S1). Total gene diversity for all populations (H T ) was 0.725, and overall observed heterozygosity (H o ) was 0.678 resulting into a positive fixation index (F) of 0.056. Observed number of alleles (N a ), number of effective alleles (N e ), fixation index (F), and number of private alleles (A p ) are shown in Table 2.

Table 2 Genetic diversity indices based on six nSSRs for 21 Prunus africana populations in Ethiopia

Very few populations showed heterozygote deficiency or excess at single loci. Population DE is remarkable, because here, for five out of six markers, significant deviations from HWE were found (Suppl. Table S2). Genetic comparisons based on the effective number of alleles between cohorts [low vs. high altitude (number of population equally split) and farmland vs. non-farmland] were not significant. Overall, differentiation among populations was highly significant (P < 0.001) both based on unordered alleles (F ST = 0.122) and ordered alleles (R ST = 0.186). Comparison of R ST and pRST values (pRST = 0.125) revealed a weak phylogeographic signal as R ST > pRST (P > 0.05).

STRUCTURE analysis for the Ethiopian populations resulted in two biologically meaningful groups; also, STRUCTURE HARVESTER 0.6.93 (Earl and von Holdt 2012) identified the most likely group structure to be two (Suppl. Fig. S1 and Suppl. Table S3). These two population groups were largely located east and west of the Ethiopian rift, and an extensive admixture gradient between the populations was evident (Fig. 1), showing ongoing exchange of migrants between both sides of this gene flow barrier. AMOVA partitioned the variation based on geographical units (north, southwest, central, and southeast), tree seed zones, and STRUCTURE-based groups (east and west of the Ethiopian Rift). Although significant differentiation was found, the proportion of the variation due to variation between the assumed regions (geographic units, seed zones, and genetic clusters) was generally small (Table 3). The pairwise population matrix of Nei’s unbiased genetic distances and geographic distances for the Mantel test is given in Suppl. Table S4. Isolation by distance has not significantly contributed to the nuclear population structure as only a very weak positive correlation between genetic and geographic distance matrices was found within Ethiopia (Rxy = 0.20, P = 0.057) (Fig. 2).

Fig. 1
figure 1

Results of individual population assignment (admixture model) performed with STRUCTURE for the combined data set (Ethiopian populations and data set of Kadu et al. (2013) using K = 6 and separate for the 21 Prunus africana populations from Ethiopia (K = 2)). Stars indicate populations east of the Ethiopian rift

Table 3 Partitioning of variation based on six nuclear microsatellite loci among 210 Prunus africana individuals in Ethiopia computed by analysis of molecular variance (AMOVA))
Fig. 2
figure 2

Unbiased Nei’s genetic distance based on six nuclear SSRs plotted against the geographic distance

Even though five chloroplast DNA (cpDNA) and four cpDNA indels (the same as used in Kadu et al. 2011) were considered in generating data from 210 individuals from Ethiopia, only three of these microsatellite loci were polymorphic (Suppl. Table S1). At TPSCP5 and TPSCP10, four haplotypic variants were detected at each locus, while at TPSCP1, two variants were found resulting in a total of 14 haplotypes in the Ethiopian samples. The monomorphic indel loci were not excluded from further analysis for the sake of consistency in comparing the present data with previously published results (Kadu et al. 2011). Haplotypic variation strongly differed among populations (Suppl. Table S5). Sixteen of the 32 multilocus haplotypes, which were constructed from cpSSRs of the range-wide samples, were found in Ethiopia. Total haplotype diversity within the Ethiopian sample amounted to h T  = 0.703, which could be significantly partitioned into 47.8 % among populations and 52.2 % within populations. The mean within population plastid haplotype diversity was estimated to be h S  = 0.367. Comparison of observed N ST (0.483) with mean-permuted N ST (0.473) did not reveal a significant phylogeographic pattern. A Mantel test showed only a very weak positive correlation between geographic and Nei’s genetic distance matrices (Rxy = 0.107, P < 0.001).

The ratio between pollen flow to seed flow was estimated as r = 7.1, indicating a several fold more extensive gene flow by pollen compared to seed dispersal. A test for correlations between the effective number of nuclear alleles and the number of plastid haplotypes amounted to low and insignificant rank correlation (Spearman’s r 2 = −0.15).

Genetic structure across Africa

The relationship among populations from the range-wide sample (including populations from Kadu et al. 2011, 2013) based on a bootstrapped Nei’s standard genetic distance of the nSSRs is shown in the UPGMA tree (Fig. 3). With one exception (population DE), all Ethiopian populations fell within one major branch of the tree. Ethiopian populations east of the Ethiopian Rift also grouped close together on a sub-branch. Interestingly, all Ethiopian populations clustered with populations of the western lineage from the range-wide sample (Kadu et al. 2013). In the principal co-ordinate analysis, the following six groups were identified: (1) Ethiopia, (2) Cameroon and Equatorial Guinea, (3) East Africa west of the ERV (Uganda and western Kenya), (4) East Africa east of the ERV (Tanzania and eastern Kenya), (5) Zimbabwe and South Africa, and (6) Madagascar (Fig. 4). Consistent with the UPGMA, the Ethiopian cluster was more closely related to the western lineage populations than to populations originating east of the ERV. A similar pattern was obtained in the STRUCTURE analysis of the combined nuclear data set (Fig. 1): the results of K = 6 show the grouping already reported by Kadu et al. (2013) plus a well-defined Ethiopian group that exhibits some admixture with the western lineages of the range-wide sample.

Fig. 3
figure 3

UPGMA dendrogram for the combined data set of P. Africana from Ethiopia and Kadu et al. (2013). Populations located east of the Ethiopian Rift Valley are marked by a star

Fig. 4
figure 4

Principal co-ordinate analysis showing the multivariate relationships of 46 P. africana populations including the data set of Kadu et al. (2013). Population acronyms for non-Ethiopian populations are from GQ1 Moka (Equ. Guinea), CM1 Lower Man’s Spring (Cameroon), CM2 Ngashie Mountains (Cameroon), CM3 Mount Danoua (Cameroon), UG1 Bwindi (Uganda), UG2 Kalinzu (Uganda), UG3 Kibale (Udana), UG4 Mabira (Uganda), KE1 Kibiri (Kenya), KE2 Kakamega (Kenya), KE3 Kapcherop (Kenya), KE4 Londiani (Kenya), KE5 Chuka (Kenya), KE6 Lari (Kenya), KE7 Ol Danyo Sambuk (Kenya), KE8 Taita Hills (Kenya), TZ1 Kilimanjaro Catchment Forest Reserve (Tanzania), TZ2 Kindoro Catchment Reserve (Tanzania), TZ3 Shume Mamba Forest Reserve (Tanzania), TZ4 Meru Catchment Reserve (Tanzania), TZ5 Udzungwa (Tanzania), TZ6 Kidabaga (Tanzania), ZW1 Nyanga (Zimbabwe), ZW2 Cashel Valley Chimanimani (Zimbabwe), ZW3 Chirinda Forest Reserve Chipinge (Zimbabwe), SA1 Mpumalanga (South Africa), SA2 Kwazulu-Natal (South Africa), and MG1 Marovoay (Madagascar); for more information on non-Ethiopian populations, see Kadu et al. (2013). For Ethiopian population acronyms, see Table 1

The relationship between the 32 haplotypes is demonstrated by the genealogical network in Fig. 5. The backbone of the network consists of five main haplotypes (denoted HT1–HT5). Ten of the Ethiopian haplotypes (HT1p–HT1y) were not found in other countries. Five of the sixteen haplotypes (HT1a, HT1c, HT1g, HT1h, and HT1i) were shared with Eastern (excluding Uganda) and southern African populations, but only one haplotype (HT1k) was shared with a West African population (Equatorial Guinea) (Fig. 6). All haplotypes found in Ethiopia were members of the HT1 family, which is the most commonly detected haplotype in Kadu et al. (2011). In the Ethiopian samples, the predominant haplotype HT1h mainly was found on the northwestern side of the Ethiopian Rift. This haplotype occurred in 16 Ethiopian populations, while it was rare in the non-Ethiopian populations occurring only within one Kenyan and one Tanzanian population, respectively.

Fig. 5
figure 5

Genealogical network of 32 haplotypes (HTs) of 53 P. africana populations from 10 African countries (including 32 populations of Kadu et al. 2011). Non-white colors represent haplotypes found in Ethiopia, and haplotypes shared with populations of other African countries are represented by concentric circles. The size of the haplotypes is shown proportional to their frequencies. Very small white circles represent hypothetical intermediate haplotypes

Fig. 6
figure 6

Distribution of 16 “Ethiopian” haplotypes of P. africana found in Africa. Codes for non-Ethiopian populations are according to Kadu et al. (2011) GQ Moka, KA Chuka, KB Kinale, KC Kapcherop, KD Kakamega, KE Londiani, KG Taita, KS Kibiri Forest, SA Mpumalanga, SB KwaZulu-Natal, TB Kilimanjaro, TD Shume Magamba, TE Kidabaga, TT Udzungwa, ZWA Nyanga, ZWB Chimanimani, and ZWC Chipinge, and for Ethiopian populations, see Table 1 (map adapted from Kadu et al. 2011)

Discussion

While the phylogeographic patterns of northern hemisphere tree species are well analyzed and found to be mainly affected by repeated poleward expansion-contraction cycles in the Quaternary (e.g., Hewitt 1999), the population history of trees in the low-latitude zone is less studied and seems to be shaped by interaction dynamics between topography and changing climate patterns (e.g., Hamilton and Taylor 1991). A frequent difficulty in developing likely scenarios of migration and population expansion at low latitudes is the identification of the founder populations and origins of spread. The present study makes use of one of the most complete data sets of an African tree species that—although restricted to Afromontane forests—is widespread across the continent. Until today, populations from the Ethiopian highland were thought to be the founder populations in Africa, because it was suggested that Afroalpine and Afromontane species colonized Africa through the coastal mountain ranges of the Arabian Peninsula and the Ethiopian highlands (Koch et al. 2006; Aubréville 1976). However, our joint analysis of Ethiopian populations with data derived from the remaining species’ range reported by Kadu et al. (2011, 2013) suggests that the Horn of Africa is a hot spot of plastid diversity and evolution but not the ancestral origin for a majority of populations across Africa. Ethiopian populations seem to form an additional distinct entity with independent plastid evolution for an extended period of time resulting in the highest values of plastid diversity across the species’ range. Total plastid haplotypic diversity (h T ) for all Ethiopian populations nearly reached the estimate calculated for 32 sub-Saharan populations from nine countries and within populations diversity (h s ) averaged to 0.367 throughout Ethiopia but only to 0.234 for the rest of the species’ range (Kadu et al. 2011). Besides this remarkable diversity, the genetic relatedness among the cpDNA haplotypes provides new answers to the questions whether P. africana has initiated its migration from the Horn of Africa or from locations further south; all haplotypes found in Ethiopia were members of the HT1 lineage, while East African populations from Kenya, Tanzania, and Madagascar also contained haplotypes from the more diverged and probably “older” lineages HT2–HT5. Thus, the initial diversification very likely did not start in Ethiopia but probably in the Kenyan or Tanzanian Mountains. This model would also be supported by one of the oldest continuous pollen records of Eastern Africa retrieved from Sacred Lake, north-east of Mount Kenya spanning more than 40,000 years before present (BP). It is remarkable that since approximately 30,000 years BP, Prunus pollen was constantly found here with a slight increase in the mid-Holocene (Coetzee 1967). Very old pollen records of approximately 11,000 years BP were also reported close to Lake Victoria in eastern Kenya (Hamilton 1982; Vincens 1986), as well as in Burundi, Rwanda, and western Uganda (Jolly et al. 1997). It is therefore tempting to speculate that from this region, P. africana had spread to north and south after the Last Glacial Maximum. However, maximum time scales covered among different palynological studies vary erratically. But to the best of the authors’ knowledge, pollen of P. africana has been only reported during the last 3000 years in the Arsi Mountains in southeastern Ethiopia (Bonnefille and Umer 1994). Since all major occurrences now have been sampled with inconclusive results, it may be that the primary immigration route into Africa and the origin of the species stay dubious. Several climate oscillations have taken place in East Africa over an extensive time period, and it is possible that original patterns have been blurred by several cycles of population expansion and contraction or that ancestral populations went extinct or lost the majority of its genetic diversity by drift.

At present, Ethiopian populations are well isolated from populations further south (Uganda, Tanzania, and Kenya), since the Turkana Basin forms a strong gene flow barrier. At least by the middle of the Holocene (around 5000 years BP), the climatic conditions had become similar to those found today (Owen et al. 1982) and the arid environment in this basin inevitably has disrupted gene flow.

Six out of 16 haplotypes found in Ethiopia were also present in Kenya and Tanzania, Zimbabwe, and South Africa (Kadu et al. 2011), while populations in Cameroon, Uganda, Nigeria, and Madagascar did not share haplotypes with the Ethiopian samples. Only HT1k was shared between one Ethiopian and one Equatorial Guinean population, maybe as a result of homoplasy which may have influence haplotype patterns in closely related variants (Kadu et al. 2011). But the absence of “Ethiopian” haplotypes in West Africa and in populations west of the Albertine Rift rejects certain migration scenarios of P. africana proposed by Kadu et al. (2011), such as (i) the southward migration of P. africana from the southern fringe of the Ethiopian highlands to the Albertine Rift Valley (Uganda) and then to West Africa and (ii) the independent colonization of West Africa directly from the Ethiopian highlands using the Marrah Mountains (Sudan) as stepping stones via a northwestern migration corridor and the subsequent colonization of western Uganda from West Africa. Instead, it seems more likely that West Africa was colonized via the southern migration tract (White 1993) or that populations from west of the Albertine Rift, which so far showed the highest similarities to West African populations, are early descendents of Kenyan or Tanzanian populations. An analysis of populations in southwestern Africa (e.g., Angola) is strongly needed to clarify this issue. The time period of the western immigration is difficult to determine, but pollen records of P. africana obtained from central Cameroon imply that the species has already been present in western Africa 7000 years BP (Vincens et al. 2010).

Despite missing evidence for migration of P. africana from the Ethiopian highland to West Africa, the analysis of the nuclear markers revealed a closer clustering between populations from West Africa and populations west of the ERV than populations east of the ERV. A possible explanation is that gene flow via pollen between Ethiopian and West African populations took place during a comparatively recent climatic episode when the species may have been distributed also at lower altitudes. Such favorable conditions existed between 9000 and 6000 years BP in the area (Street-Perrott and Perrott 1993). After this humid period, precipitation in the region decreased and climate changed to drier conditions leading to retraction into higher altitudes and the strong isolation of present populations.

The high diversity found in the plastid genomes was not reflected in the nuclear data, where we obtained only moderate estimates of molecular diversity. For instance, the range of expected heterozygosities (0.452–0.732) in our study was similar to those obtained for 25 P. africana populations across the remaining native distribution area of sub-Saharan Africa (H e  = 0.430–0.827; Kadu et al. 2013). Similar results were found for eight cohorts of an Ethiopian population in the southern Amhara region (H e  = 0.77) by Yineger et al. (2014) who employed identical microsatellites as we did. Therefore, the highest nuclear diversity across the species’ range was found in Ugandan and Kenyan populations west of the Eastern Rift Valley averaging to 6.723 for R s (N = 10) and 0.755 for H e (eight populations studied; Kadu et al. 2013), while our respective Ethiopian values amounted to only 4.989 (arithmetic mean across regions) and 0.678, respectively. One might speculate that landscape fragmentation and overexploitation might have caused the low nuclear genetic diversity, since populations of P. africana are especially fragmented on farmland. However, our comparison of farmland versus non-farmland population did not result in significant differences and suggests that the genetic structure is so far not or at least not heavily affected by overexploitation.

Haplotypic and nuclear diversities were not significantly correlated in Ethiopian populations. However, there are only very few reports to our knowledge which tackle the correlation between nuclear and plastid data in forest trees (Vriesea gigantea, r rank = 0.47, not significant, calculated based on data in Palma-Silva et al. 2009; Sorbus domestica, r rank = 0.077, not significant, George 2015), and all of them are not significant. The levels of genetic differentiation found in the Ethiopian populations both for nuclear (F ST = 0.122) and plastid markers (N ST = 0.483) were expectedly lower than in the remaining species’ distribution area ranging from Madagascar to Equatorial Guinea and from South Africa to Uganda (F ST = 0.27, Kadu et al. 2013; N ST = 0.735, Kadu et al. 2011), likely due to the much smaller region covered. Moreover, when Afromontane forests were widely distributed in Ethiopia between 9000 and 6000 years BP (Street-Perrott and Perrott 1993), many populations were presumably connected genetically, resulting in low among-population differentiation. Today, most Ethiopian populations of P. africana are genetically isolated due to anthropogenic fragmentation of the forests (Teketay and Granström 1995), which probably started approximately 2500 years ago and has accelerated strongly during the twentieth century (Hurni 1988). But some remnant trees still exist and may function as stepping stones for gene flow, e.g., between populations like Bonga-Bedele and Chilimo-Nekemte, as relatively small genetic distances (for both populations pairs 0.048) across moderate geographic distances (134 and 168 km, respectively) indicate.

The genetic differentiation among populations in cpDNA was expectedly higher than in nuclear DNA (nDNA), which could be explained (1) by the fact that gene flow via seeds and pollen is generally higher than by seeds alone and (2) by genetic drift which acts stronger on cpDNA with smaller effective populations sizes than on nDNA (Birky et al. 1983). In support of the first explanation, Berens et al. (2013) reported different effective gene flow ratios of pollen over seed dispersal derived from propagule samples at different ontogenetic stages (from seeds to older seedlings) that varied from 3.4 to 24.8 in P. africana, while our indirect measures of this ratio resulted in a 7-fold preponderance of pollen flow.

In the Kenyan and Tanzanian parts of the African Rift Valley, eastern and western populations were strongly diverged both at nuclear and plastid markers (Kadu et al. 2011, 2013). Such a gene flow barrier was also present at the Ethiopian rift, although the pattern was weaker and included a strong admixture cline with the STRUCTURE analysis. We presume that during humid periods (interglacials) of the Pleistocene, the Afromontane vegetation belt shifted to lower altitudes and covered the highlands, while the valley bottom in Ethiopia (between 1000 and 1500 m asl) provided probably ample growing conditions for P. africana and thus may have enabled effective gene flow between eastern and western populations. Further south (main ERV), montane forests probably did not expand to the valley floor due to dryer conditions.

AMOVA revealed weak genetic differentiation for R ST and a lack of differentiation if stepwise mutation were neglected and F ST was used. Expectedly, also the seed zones (Aalbæk 1993), which had been delineated based on ecological conditions (Aalbæk 1993), were not supported by the molecular data. In our study, isolation by distance was very weak and probably of little biological significance. Environmental barriers, historical processes, and life history traits may shape the genetic structure of populations (e.g., Poelchau and Hamrick 2012) and eventually lead to isolation by distance due to limited pollen and seed dispersal; e.g., in a strongly fragmented Ethiopian P. africana population (Yineger et al. 2014), where both for adults (r = 0.68) and young seedlings (r = 0.43), a significant Mantel correlation each has been reported.

Although the Ethiopian highland was not confirmed to be the point of origin for continental genetic diversity, the present study demonstrates that Ethiopian populations harbor unique genetic variation, which requires urgent conservation measures to counteract deforestation and climate change. Our data suggest that P. africana in Ethiopia has been isolated for an extensive period of time evidenced by independent plastid evolution, although probably, extensive gene flow via pollen in more recent millennia facilitated gene exchange with West African nuclear lineages. The contrasting relatedness of Ethiopian populations to East African populations on the chloroplast genome, but to populations of West African and west of the Rift Valley on the nuclear genome, corroborate the importance of the complex interaction between the topographical structures (i.e., the Rift Valley) and climate oscillations as drivers of population structure and diversity. Further studies are needed to investigate local adaptation but also differences in bark compounds that could be utilized for medicine and would create a further stimulus to plant and conserve the species in the region.