Introduction

In recent years, many ecologically and economically important tree species have come under threat from a range of emergent pathogens. The outbreaks of the fungus Ophiostoma novo-ulmi, the agent of Dutch elm disease in the 1900s, led to extensive losses of several Ulmus species, including an estimated two thirds of the elm population of the UK during the 1970s (Webber 1981). In the last decade in the UK and Ireland, notable fungal and oomycete pathogens have included sudden oak death, chestnut blight and red needle blight. Most recently, outbreaks of ash dieback, a potentially serious threat which affects several species of ash (Fraxinus spp.), have been reported in continental Europe, and have subsequently spread to Great Britain and Ireland. Common or European ash (Fraxinus excelsior) is a key species of mixed broadleaved woodlands across Europe, with a natural range that extends from southern Scandinavia to northern Spain and the Balkans, and from Ireland in the west to continental Russia in the east. European ash within woodlands forms mixed stands, usually with beech (Fagus sylvatica), pedunculate oak (Quercus robur), sessile oak (Quercus petraea), alder (Alnus glutinosa) and sycamore (Acer pesudoplatanus), and is an important component of woodland ecosystems, as well as being a valuable timber species (FRAXIGEN 2005). The symptoms of ash dieback were first reported in Poland in the early 1990s (Pautasso et al. 2013), but it was not until 2006 that the causative agent of ash dieback was identified as Chalara fraxinea (Kowalski 2006), which has since been found to be synonymous with the ascomycete fungus Hymenoscyphus pseudoalbidus (Queloz et al. 2011). The disease was first recorded in Britain in February 2012, and the first case of ash dieback in Ireland was reported in October 2012.

Replanting of forests will have to be considered if ash dieback outbreaks result in substantial loss of trees, either via pathogenic mortality or anthropogenic clearance to prevent possible spread. In Great Britain, the Forestry Commission has developed recommendations to maintain provenance of replanted individuals, by using seed sourced from the same area (Herbert et al. 1999). Consequently, a map of “seed zones” that divide Great Britain into 24 areas delineated by geographic features and general climatic similarity has been drawn up to assist restorative conservation programmes. However, a recent study on ash in England, Scotland and Wales (Sutherland et al. 2010) found limited genetic differentiation between 42 populations from 21 of the 24 seed zones, indicating large-scale genetic homogeneity. This suggests that all populations of ash in Britain could be treated as a single management unit (DeSalle and Amato 2004), a more efficient and cost-effective approach to replanting, contrary to recommendations based on previously identified “seed zones”.

Seed dispersal plays a central role in the demography of natural plant populations across a broad range of geographic scales, from initial colonization to shaping community structure and regeneration (Howe and Smallwood 1982; Nathan and Muller-Landau 2000; Levine and Murrell 2003). Despite the importance of dispersal in plant population ecology, the logistics of tracing dispersal events accurately from source are not straightforward. Methods involving “tagging” of seeds are generally less than optimal due to factors such as extremely low recovery rates and the effects of the tags themselves on the dispersal process (reviewed in Wang and Smith 2002; Forget and Wenny 2005; Ashley 2010). Most attempts to estimate seed dispersal distributions have instead relied on seed trapping, coupled with models that generally make a priori assumptions about seed source (Nathan and Muller-Landau 2000). In recent years, climate change, habitat loss and fragmentation, and increased mortality associated with emergent plant pathogens, such as H. pseudoalbidus, have increased interest in more direct, precise measurements of seed dispersal to determine the capacity of plant populations to recover from these threats.

With the recent report of the first case of ash dieback in Ireland, and the lack of population genetic information for the species across the island, the main aim of the present study was to determine the levels and patterns of genetic diversity in naturally seeded trees in ash woodlands and hedgerows. We focused on Northern Ireland which, like the rest of Ireland, has no map of “seed zones” on which to base management units, and the development of rational conservation and replanting strategies. We analysed populations from the northern, eastern, southern and western extremes of Ireland to ensure our findings are applicable to the island as a whole. We also used a molecular genetic approach to quantify fine-scale seed dispersal distances in two natural ash woodlands, employing a combination of high-resolution nuclear and chloroplast microsatellite markers. Our results suggest that ash woodlands across Ireland could be treated as a single management unit, and thus material from anywhere in Ireland could be used as a source for replanting. We also identified potential for seed dispersal over hundreds of metres, which will be important in addressing both post-ash dieback recolonization, and assessing the capacity of ash to migrate in response to global climate change.

Materials and methods

Study species reproductive ecology

European ash (F. excelsior) has protandrous, anemophilous flowers. The species exhibits a wide range of complex polygamy, ranging from pure male and female trees, through combinations of male/female and hermaphroditic flowers in the same individual, to sexual changes across successive years (Wardle 1961; Bacles and Ennos 2008). Although F. excelsior is preferentially outcrossing, hermaphrodites are self-compatible, and whereas females and hermaphrodites exhibit high seed set, hermaphrodites exhibit reduced male fertility. Fruits are winged and wind-dispersed and generally contain a single seed. Seedlings are shade-tolerant but need good light levels to promote full growth, generally only establishing in clearings within woodlands (Marigo et al. 2000).

Sampling and DNA extraction

For the broad-scale study, samples were collected from 33 sites across Northern Ireland and three sites in the Republic of Ireland that had been previously designated as ancient or semi-natural woodland based on data collected for the Woodland Trust Inventory of ancient and long-established woodland in Northern Ireland (www.backonthemap.org.uk) and the National Survey of Native Woodlands 2003–08 in the Republic of Ireland (www.npws.ie; Fig. 1 and Table 1). The congeneric Fraxinus angustifolia has been planted in the Republic of Ireland but is not found in the vicinity of any of the native woodlands analysed in the present study. A single leaf was collected from each of 30 trees per site and stored in silica gel, and GPS coordinates recorded for every tree sampled. DNA was extracted using the cetyltrimethylammonium bromide (CTAB) method of Doyle and Doyle (1987).

Fig. 1
figure 1

Locations of sites sampled in this study. Numbers correspond to those in Table 1

Table 1 Details of populations studied

For the fine-scale study, two sites were chosen. The first, Barnett Demesne, was also used for the broad-scale study. It is a ca. 40-ha public park in South Belfast, Northern Ireland (54.55° N, 5.96° W—Fig. 2), and is an area of mixed parkland and woodland, the woodland being semi-continuous stands of mixed deciduous trees, primarily beech and oak. The ash is found in the northern part of the main wooded area, with a few scattered trees in the adjoining parkland. The second site, Cregagh Glen, is a narrow (50–60 m), steep-sided ravine ca. 700 m long on the eastern outskirts of Belfast (54.56° N, 5.89° W—Fig. 2). It is the surviving remnant of a former ca. 400-ha forest and comprises mixed woodland of Scots pine, sycamore, beech and ash. The ash is distributed sporadically throughout the length of the glen. For both sites, samples were obtained from all reproductive (adult) trees, as well as from selected saplings (96 from Barnett Demesne and 48 from Cregagh Glen; Fig. 2). A single leaf was collected from each individual and stored in silica gel, and GPS coordinates recorded (Table S1, Supporting Information). DNA was extracted using the CTAB method of Doyle and Doyle (1987).

Fig. 2
figure 2

Two woodland sites sampled at a Belfast, Northern Ireland (insert) showing tree cover from orthophotographs of b Barnett Demesne and c Cregagh Glen. Adult trees that were sampled are shown as red dots and saplings as blue dots; surrounding trees, hedgerows and woodlands are clearly visible. North is aligned with the top of the page. Image © 07/08/2006 DigitalGlobe, Google Earth

Genotyping

All trees and saplings were genotyped for eight nuclear and ten chloroplast microsatellite loci. For nuclear microsatellite genotyping, we used six previously reported loci which have been widely used in population genetic studies on ash: Femsatl-4, Femsatl-8, Femsatl-11, Femsatl-16 and Femsatl-19 (Lefort et al. 1999) and M230 (Brachet et al. 1999), as well as two loci developed for the present study (FR639485 and FR646655). As previous studies highlighted the possibility of null alleles using the Lefort et al. (1999) and Brachet et al. (1999) primers (Morand et al. 2002; Ferrazzini et al. 2007; Sutherland et al. 2010), we designed new primers for all loci (Table 2) using the Primer3 program (v 0.4.0; http://primer3.ut.ee). The F IS values calculated in the present broad-scale study were lower than those from several previous studies, which is consistent with the occurrence of null alleles when using the original primers. To investigate this further, we also genotyped a subset of our samples for comparison using the original Femsatl-4, Femsatl-8 and Femsatl-16 primers, since these exhibited the highest F IS values in the earlier studies. To develop further markers, we also tested five pairs of primers developed from EST sequences in GenBank, but only two of these (FR639485 and FR646655) consistently gave clear, reproducible products.

Table 2 Nuclear microsatellite loci analysed in this study

F. excelsior chloroplast sequences in the GenBank database were searched for mononucleotide repeats of ten or more (Provan et al. 2001). Primers were designed using the Primer3 program to amplify the ten loci in four multiplexes (Table S2, Supporting Information). One of these (AF528042.2) corresponds to the highly polymorphic CPFRAX6 locus described in Harbourne et al. (2005) but was monomorphic across all samples tested. Consequently, we screened a subset of our samples using the original CPFRAX6 primers, but these did not reveal any additional variation to that displayed using the AF528042.2 primers.

PCR was carried out in a total volume of 10 μl containing 100 ng genomic DNA, 5 pmol 6-FAM- or HEX-labelled M13 primer, 0.5 pmol M13-tailed forward primer, 5 pmol reverse primer, 1× PCR buffer, 200 μM each dNTP, 2.5 mM MgCl2 and 0.25 U GoTaq Flexi DNA polymerase (Promega, Sunnyvale, CA, USA). PCR was carried out on a MWG Primus thermal cycler (Ebersberg, Germany) using the following conditions: initial denaturation at 94 °C for 3 min followed by 40 cycles (30 for chloroplast loci) of denaturation at 94 °C for 30 s, annealing at 55 °C for 30 s, extension at 72 °C for 30 s and a final extension at 72 °C for 5 min. Genotyping was carried out on an AB3730xl capillary genotyping system (Applied Biosystems, Foster City, CA, USA). Allele sizes were scored using the GeneMapper software package (v4.1; Applied Biosystems) using LIZ-500 size standards, and were checked by comparison with previously sized control samples. Chromatograms were all inspected visually to check for large allele dropout (see “Discussion”).

Data analysis—broad scale

GenePop (V3.4; Raymond and Rousset 1995) was used to test for linkage disequilibrium between nuclear microsatellite loci. To estimate genetic diversity within the populations, levels of observed (H O) and expected (H E) heterozygosity, levels of allelic richness (A R) and fixation indices (F IS) were calculated using the Fstat software package (V2.9.3.2; Goudet 2001). Significance of F IS was determined by 10,000 randomization steps. We also estimated null allele frequencies using the Cervus software package (V3.0.3; Kalinowski et al. 2007), as previous studies using the same microsatellites (Femsatl-4, Femsatl-8, Femsatl-11, Femsatl-16, Femsatl-19 and M230) have suggested the possibility of null alleles. Chloroplast microsatellite allele sizes were combined into haplotypes, and levels of genetic diversity (H) based on haplotype frequencies were calculated using the Arlequin software package (V3.5.1.2; Excoffier and Lischer 2010).

The overall level of genetic differentiation between populations was estimated using Φ ST, which gives an analogue of F ST (Weir and Cockerham 1984) calculated within the analysis of molecular variance (AMOVA) framework (Excoffier et al. 1992) using Arlequin. In addition, as the high numbers of alleles and high levels of diversity associated with microsatellite loci can lead to an underestimation of genetic differentiation between populations, we also calculated Hedrick’s GST (Hedrick 2005) for the nuclear microsatellite data set. To further identify possible patterns of genetic structure, the software package BAPS (V5; Corander et al. 2003) was used to identify clusters of genetically similar populations using a Bayesian approach. Ten replicates were run for all possible values of the maximum number of clusters (K) up to K = 36, the number of populations sampled, with a burn-in period of 10,000 iterations followed by 100,000 iterations. Multiple independent runs always gave the same outcome.

A test for isolation by distance (IBD; Rousset 1997) was carried out to test the null hypothesis of a stepping-stone model of gene flow between populations of F. excelsior. The Isolde test implemented in the Genepop software package was used to assess the relationship between genetic distance, measured as Hedrick’s GST (Hedrick 2005), and geographical distance between population pairs. One thousand permutations were used for the Mantel test.

To test for spatial genetic structuring (SGS) within populations, which could give rise to Wahlund effects, we carried out spatial autocorrelation analyses using SPAGeDi (V1.4; Hardy and Vekemans 2002). Mean coancestry coefficients (θ xy ; Loiselle et al. 1995) between pairs of individuals were calculated for both the 0–50- and 50–100-m distance classes for each population, with the remaining size intervals (50 to 500 m) reflecting the overall size of each population, and plotted as a correlogram, with 95 % confidence intervals calculated from 1000 permutations of individuals within each distance class, and for estimates of θ xy using 1000 permutations. Finally, for comparison of levels of SGS with those of other species, we calculated the Sp statistic of Vekemans and Hardy (2004) as −b k  / (1 − θ 1), where b k is the slope of the regression of θ xy against the logarithm of the distance, and θ 1 is the mean value of the pairwise coancestry coefficients calculated between all pairs of individuals within the first distance class (0–50 m).

Data analysis—fine scale

Only seven nuclear microsatellite loci were used in the fine-scale study, since locus Femsatl-19 exhibited alleles that differed by only a single base pair, and we wanted to ensure exact matches between putative parents and offspring. We employed two approaches to determine parentage of saplings. The first was a simple exclusion approach, based on the premise that we had sampled all potential adult parents in each stand. Any adult that did not have at least one allele matching those exhibited by a sapling at all seven loci was excluded as a potential parent of that sapling. The second was a likelihood-based approach implemented in the Cervus software package (V3.0; Kalinowski et al. 2007). This was used in addition to strict exclusion, since the program can allow for potential genotyping errors, and the fact that not all putative parents may have been sampled. Simulations were run for 10,000 iterations, with a genotyping error rate of 0.01, since we had manually scored all markers to check for automated miscalls and allelic dropout, and assuming 95 % sampling of putative parents. Parent pairs or individual parents were assigned based on the critical values for the 95 % strict log-likelihood (LOD) scores.

Results

Broad-scale study

No evidence of linkage disequilibrium was detected between any of the eight nuclear microsatellites analysed. Between nine (FR646655) and 51 (M230) alleles were detected per locus, with a total of 261 (mean = 32.625 per locus; Table 2). Levels of observed (H O) and expected (H E) heterozygosity ranged from 0.442 (FR646655) to 0.909 (M230; mean = 0.709), and from 0.477 (FR646655) to 0.937 (M230; mean = 0.765), respectively. Levels of F IS ranged from −0.004 (Femsatl-16) to 0.236 (Femsatl-8), with a mean value of 0.067. The estimated frequency of null alleles ranged from zero (Femsatl-16) to 0.142 (Femsatl-8), with a mean value of 0.041. The proportion of large alleles not called by the GeneMapper software under the default settings in the four loci where there was significant large allele dropout (Femsatl-4, Femsatl-8, Femsatl-11 and M230) ranged from 2.98 % (Femsatl-4) to 11.59 % (M230).

Within populations, levels of allelic richness (A R) averaged over loci ranged from 9.52 (Glenarm Forest) to 11.52 (Killeter Forest), with a mean value of 10.53 (Table 1). A total of 41 private alleles was detected, with the number per population ranging from zero to four. The majority (38) of these were restricted to a single individual, with the remaining three being found in two individuals. Levels of observed (H O) and expected (H E) heterozygosity ranged from 0.637 (Castle Hill) to 0.823 (Glenarm Forest; mean = 0.709), and from 0.712 (Trassey Road) to 0.809 (Rostrevor and Randalstown; mean = 0.765), respectively. The heterozygote deficit observed in the majority of the populations gave rise to F IS values which were significantly higher than zero in 27 of the 36 populations studied, ranging from 0.053 (Killeter Forest) to 0.168 (Letterfrack; mean = 0.067). Diversity statistics for individual loci by population are given in Table S3, Supporting Information, and indicate that significant F IS values were generally due to high values at locus Femsatl-8, which were significantly higher than zero in 32 of the 36 populations studied, suggesting the presence of null alleles at this locus.

Five of the ten chloroplast microsatellite loci studied were polymorphic in the samples analysed, exhibiting between two and four alleles (Table S1, Supporting Information). Combining allele sizes across loci gave eight haplotypes (See Fig. S1, Supporting Information for a network of evolutionary relationships between haplotypes). One of these (H1) was found in the vast majority (995 out of 1052) of the trees studied. Levels of haplotype diversity (H) ranged from zero (several populations) to 0.572 (Barnett Demesne; Table 1).

Levels of population differentiation were Φ ST = 0.0131 and Hedrick’s GST = 0.0547 for the nuclear microsatellites and Φ ST = 0.2629 for the chloroplast microsatellites (results of the AMOVA are given in Table 3). The BAPS analysis assigned all 36 populations to a single genetic cluster, although a weak but significant isolation by distance (P = 0.005) was observed across all populations but not across NI populations only (P = 0.09; Fig. 3). Finally, the spatial autocorrelation analyses revealed very little significant within-population spatial genetic structuring, with structuring only observed up to 50 m in the Dromora, Rostrevor, Randalstown and Lemnagore Wood populations (Fig. S2, Supporting Information), and Sp values ranging from 0.000 (several populations) to 0.020 (Knockninney Hill; Table 1).

Table 3 Analysis of molecular variance (AMOVA)
Fig. 3
figure 3

Mantel test for isolation by distance (IBD) between populations including (above) and excluding (below) the three Republic of Ireland populations

Fine-scale study

We successfully genotyped 140 adult trees and 93 saplings from Barnett Demesne and 44 adults and 39 saplings from Cregagh Glen. For the Barnett Demesne stand, there was extremely good agreement between parentage based on strict exclusion and assignments based on likelihood implemented in Cervus: in only five cases, Cervus identified a second parent where strict exclusion only identified a single parent, and there were four cases where a sapling/parent combination was identified by exclusion, but where the adult had a LOD score below the threshold calculated by Cervus. Based on the Cervus results, a single parent was identified for 42 saplings, both parents were identified for 41 saplings, and no parent within the stand was identified for five saplings. Three putative parents above the LOD threshold were identified for a single sapling, which was not included further in the analysis, as were the four saplings where a parent was identified by exclusion, but this adult had a LOD score below the threshold. Three chloroplast microsatellite haplotypes were identified, and in the 41 cases where both parents were identified, differences in chloroplast haplotypes between the parents allowed the identification of the seed parent in 13 cases. Consequently, seed dispersal distances could be calculated for 55 of the saplings: 13 where the seed parent was identified in the parent pair and for the 42 saplings where a single parent was identified, since the assumption that a single parent is the seed parent is far more parsimonious than the alternative explanation of the adult being the pollen parent, which pollinated another tree outside the stand, with the seed subsequently dispersing back into the stand. Furthermore, in all cases of single parent assignment, there was a match between the adult and sapling chloroplast haplotype, consistent with the adult being the seed parent. This includes the rarest haplotype, found in a single sapling and a single adult, which were classed as parent-offspring pairs by both Cervus and strict exclusion. Dispersal distances ranged from 3 to 223 m (mean = 42 m; median = 31 m; Fig. 4). Pollination distances were calculated for the 13 parent pairs and ranged from 2 to 266 m (mean = 93 m; median = 83 m). Realized pollen dispersal distances, i.e. from pollen parent to sapling, ranged from 7 to 168 m (mean = 65 m; median = 47 m; Table 4).

Fig. 4
figure 4

Summary of identified seed dispersal events in Barnett Demesne. Left Histogram showing dispersal distances in 10-m classes. Black shading indicates assignment of a maternal plant from a parent pair. Grey shading shows assignment of a maternal plant to a single identified parent. Right Distance and direction of identified dispersal events. Broken arrows represent dispersal distances of greater than 50 m

Table 4 Pollination distances and realized pollen dispersal distances, i.e. from father to sapling for the 13 saplings where the maternal parent was identified in the parent pair

In the Cregagh Glen stand, very few putative parents were identified by Cervus which had LOD scores above the critical value. This was due to a combination of lower overall genetic variation and the occurrence of high-frequency alleles at several of the microsatellite loci. Four dispersal events from separate single parents were identified, with distances of 23, 82, 123 and 148 m. In 11 cases, however, no parent was identified, suggesting immigration of seed into the stand. All adults and saplings shared a single chloroplast microsatellite haplotype.

Discussion

Lack of genetic structure and implications for restorative planting

For over 50 years now, the concept of provenance has been integral to forestry practices, particularly with respect to restocking and/or replanting of woodlands (reviewed in Jones and Burley 1973). This reflects observed phenotypic and underlying genetic variation across species’ distributions and recommends that where possible, woodlands should be restocked with local seeds or seedlings. Our finding that ash woodlands across Ireland are genetically uniform suggests that the concept of provenance might more usefully reflect the geographic distribution of genetic variation, and that all could be treated as a single management unit, given the lack of genetic differentiation between populations and the low incidence of private alleles. The observed level of population differentiation based on nuclear microsatellites (Φ ST = 0.0131) was the second lowest reported for European ash, with previous studies estimating between 1.2 % (Hebel et al. 2006) and 8.7 % (Heuertz et al. 2001) of nuclear diversity partitioned between populations, and is consistent with wind pollination and seed dispersal (Wardle 1961). Unfortunately, these previous studies did not calculate comparable statistics to the GST value of 0.0547 observed in the present study, but future studies using microsatellites should calculate the equivalent statistic to take into account underestimation of levels of differentiation when using highly variable markers (Hedrick 2005). Replacement of native trees for whatever reason should be based on knowledge of the geographic distribution of genetic variation (Godefroid et al. 2011). Our results clearly indicate that the source of material for replanting ash, at least in Ireland, is largely irrelevant, given such low levels of differentiation. The inclusion of samples from the extreme east (Co. Wicklow), south (Co. Cork) and west (Co. Galway) of Ireland in the same genetic cluster as the 33 populations from Northern Ireland indicates that our findings are probably applicable to ash woodlands across the island of Ireland as a whole. Furthermore, the Irish samples of ash exhibited similar levels of nuclear genetic diversity to those in Great Britain and continental Europe, including populations from putative refugial areas that should harbour the highest levels of variation (Heuertz et al. 2001; Morand et al. 2002; Heuertz et al. 2004a; Heuertz et al. 2004b; Ferrazzini et al. 2007; Sutherland et al. 2010; Gérard et al. 2013). However, such replanting should be carried out using native material from long-established, unplanted/unstocked woodlands, since recent studies have shown that material used for replanting in Ireland often contains individuals that possess alleles introgressed from the congeneric F. angustifolia (Thomasset et al. 2013).

Levels of chloroplast genetic diversity were very low, with a single haplotype found in almost 95 % of all the individuals studied. This haplotype most likely corresponds to haplotype H04 from Heuertz et al. (2004a), which is also the dominant haplotype in Britain as a result of postglacial recolonization from Iberia. Populations in the east of Northern Ireland tended to exhibit the highest levels of chloroplast diversity, with most of the populations in the west fixed for the most common haplotype. This could be due to founder effects associated with westward recolonization from Britain or to predominantly eastward seed dispersal by prevailing westerly winds, since the chloroplast genome is maternally inherited in ash, or to a combination of both.

Several previous population genetic studies on ash have reported significant, and often high, levels of F IS, which have been attributed to various factors including inbreeding, null alleles, large allele dropout and Wahlund effect (Morand et al. 2002; Ferrazzini et al. 2007; Sutherland et al. 2010). The mean F IS calculated for ash populations across Northern Ireland (0.067) is significantly lower than that reported by Sutherland et al. (2010), who used six of the eight loci analysed in the present study to examine populations throughout Great Britain (mean F IS = 0.182; Mann-Whitney test, z = 6.07, P < 0.0001). We designed new primers to amplify previously characterized microsatellite loci with the aim of circumventing any potential problems with null alleles, and our estimated null allele frequencies were generally much lower than those reported in Sutherland et al. (2010). However, on genotyping a subset of our samples using the same primers as Sutherland et al. (2010), we did not observe an increase in F IS or estimated null allele frequencies, suggesting that the previously observed homozygote deficiencies were not due to null alleles as a result of non-amplification. Furthermore, although Femsatl-8 exhibited the highest F IS among the loci analysed in both cases, which might suggest null alleles associated with this particular marker, the same locus exhibited the lowest F IS in a previous study in Germany (Hebel et al. 2006). The fact that this locus was significantly higher than zero in 32 of the 36 populations studied, even where the majority of the other loci in these populations did not yield significant F IS values, however, does suggest the presence of null alleles.

Another potential cause of heterozygote deficiency is large allele dropout, where short alleles are preferentially amplified during the PCR. Automated scoring software packages, such as Genotyper and GeneMapper, will often not identify these long alleles. We took the precaution of manually checking each chromatogram and using the default settings for allele scoring in the GeneMapper package; we identified uncalled large alleles at frequencies ranging from 2.98 to 11.59 % at four of the eight loci studied (see Fig. S3, Supplementary information for an example). The phenomenon is particularly prevalent at loci with a high number of alleles over a large size range, and with such high frequencies of uncalled alleles, analyses based on the raw outputs from these genotyping packages would result in apparent homozygote excesses and subsequently inflated F IS values.

Sutherland et al. (2010) suggested that the F IS values observed in their study might be due to a Wahlund effect, namely the occurrence of spatial genetic structuring within populations, a scenario also proposed to a lesser degree by Ferrazzini et al. (2007). Because we recorded GPS coordinates for each of the trees sampled in the present study, we were able to carry out spatial autocorrelation analyses to test for such structuring. With the exception of significantly higher levels of relatedness up to 50 m in only four populations out of the 36 studied, we found no evidence of Wahlund effects.

Given that we can exclude null alleles (with the possible exception of locus Femsatl-8, which consistently exhibits high F IS levels across most studies), large allele dropout and Wahlund effects, the F IS values observed would appear to give a true measure of the levels of inbreeding in populations of ash in the present study. Our spatial autocorrelation analyses found little evidence for the breeding “subunits” previously suggested to exist within ash woodlands (Heuertz et al. 2001; Morand et al. 2002). The mean value of Sp calculated for the 36 populations studied (0.006) was lower than the mean value reported for trees (0.010) by Vekemans and Hardy (2004) and the mean value from six microsatellite-based studies in tropical trees (0.017; Hardy et al. 2006), although higher than that calculated for a Romanian population of F. excelsior (0.002; Heuertz et al. 2003). This may be due at least in part to the differing densities of ash trees in the various woodlands analysed in the present study. For example, the Knockninney Hill population, which presented the highest Sp value, had only a single pair of individuals within 100 m of each other.

Evidence for frequent long-distance dispersal events

Although the use of molecular genetic techniques, particularly high-resolution microsatellite markers, has provided valuable insights into seed dispersal in natural plant populations (Powell et al. 1996; Ashley 2010), there still remain problems associated with identifying the seed parents of established plants and/or seedlings in monoecious species. Estimates of pollen-mediated gene flow can be relatively easily obtained by genotyping seeds and “subtracting” the genotype of the maternal plant, thus leaving the paternal genotype which can be matched to putative parent plants. Such an approach has been employed previously in ash, indicating pollination events at distances over several hundred metres and up to nearly 3 km (Bacles et al. 2005; Bacles and Ennos 2008; Thomasset et al. 2014). Estimating seed dispersal, however, is a more difficult process, particularly in monoecious species (Sork and Smouse 2006). Previous studies have attempted to genotype the endocarp tissue to identify the maternal parent of dispersed seeds (Godoy and Jordano 2001; Garcia et al. 2007), but this only provides estimates of initial dispersal and does not necessarily provide an indication of true population demography and recruitment (effective dispersal), for which identification of the mothers of seedlings or established plants is necessary. For seedlings in a population of an outcrossing species, it may be possible to identify both parents; one parent, which will be either the pollen or seed parent; or neither parent, indicating immigration of seed into the population. In angiosperms, chloroplast-specific markers can be used in conjunction with biparentally inherited nuclear markers to assign the maternal and the paternal parent where both parents are identified, since the chloroplast genome is usually maternally inherited. One drawback of such an approach is the low mutation rate in chloroplast genomes, meaning that often there is inadequate resolution to assign the maternal parent (Wolfe et al. 1987). By using highly polymorphic chloroplast microsatellite markers, which allow the high resolution of maternal genotypes, it may often be possible to determine which is the mother plant in cases where both parents are identified using nuclear microsatellites (Provan et al. 2001).

By using a combination of nuclear and chloroplast microsatellite markers, we were able to assign seed and pollen parents unambiguously for 13 out of 41 saplings for which both parents were identified within the Barnett Demesne stand, as well as assigning putative seed parents to a further 42 saplings. Previous studies using genetic markers to identify the source of established seedlings relied on genotyping any maternal tissue associated with the seedling, but these approaches can be problematic due to the low quality of DNA typically recovered from the pericarp (Grivet et al. 2009; Smouse et al. 2012). Chloroplast microsatellite markers provide a convenient, high-resolution, uniparental assay (maternal in the majority of angiosperms and paternal in the majority of gymnosperms) that can be run on leaf material from established plants and thus allow the assignment of seed and pollen parents where both parents are identified (Provan et al. 2001; Ebert and Peakall 2009a), circumventing the need to rely on genotypes from maternal tissues. Primers to amplify chloroplast microsatellites are available for a wide range of species, and the high levels of conservation of the chloroplast genome mean that primers developed for a particular species often give polymorphic markers in related taxa (Provan et al. 2001). In addition, sets of universal primers are available to facilitate de novo development of these markers, particularly for non-model organisms or taxa with little sequence information available in DNA sequence databases such as GenBank (e.g., Ebert and Peakall 2009b).

Our findings indicate frequent seed dispersal over distances greater than 100 m, with six known within-stand dispersal events (over 10 %) exceeding this range. We also identified immigration into the stand in 16 cases (five out of 93 [5 %] from Barnett Demesne and 11 out of 39 [28 %] from Cregagh Glen). Barnett Demesne is located in a largely urban area, and the nearest stand of ash trees was 400 m to the east, at Shaw’s bridge (Fig. 1), suggesting that this was the minimum dispersal distance of immigrant seed into the stand. Despite the lower genetic diversity in the Cregagh Glen population, which led to a lower level of parentage assignment, the higher rate of immigration appeared to result from extra-stand fertilization. Cregagh Glen is in a more agriculture-dominated landscape on the eastern edge of Belfast, and it is possible that immigrant seed dispersed from neighbouring isolated individuals, low-density hedgerow trees, or from the next-nearest substantial stand of ash, which was a similarly sized stand in a ravine ca. 500 m to the east (Fig. 1).

Only two previous genetic studies on seed dispersal in ash have been carried out. Heuertz et al. (2003) used simulation studies coupled with estimates of kinship from five biparentally inherited nuclear microsatellites, including four of the seven loci used in the present study, to infer levels of seed and pollen dispersal in a continuous forest in southeast Romania. The estimated levels of seed dispersal, which were ≤14 m, were lower than both the mean and median values calculated for the Barnett Demesne stand in the present study and far lower than the majority of individual events identified. Bacles et al. (2006) used a direct, individual-based approach to assess seed dispersal in a highly fragmented landscape in southern Scotland. They detected multiple long-distance events, often between fragments of up to 1.4 km, but this is most likely due to the chronically fragmented nature of their study site, with far fewer barriers to dispersal, and the increased chance of the winged seeds being uplifted in the initial stages of dispersal. This scenario is very different to the closed, semi-continuous woodlands analysed in the present study, and our findings may better reflect patterns of dispersal in more typical mixed deciduous woodlands. Interestingly, since Bacles et al. (2006) had no means to identify the seed parent where parent pairs were identified, they assumed that the closer of the two to the seedling was the seed parent, but our analysis indicated that the seed parent was the more distant parent of the two in four out of 12 cases (and in one case they were equidistant). This further highlights the utility and importance of our approach in unambiguously identifying maternal and paternal parents to accurately quantify dispersal.

The ability to identify the seed sources of established plants allows insights into the end results of dispersal in population demography, i.e. initial dispersal followed by germination and survival/recruitment into the population. This means that post-dispersal processes, such as competition and density-dependent mortality, can be addressed. This was not possible in early genetic studies on dispersal, which relied on genotyping seeds, and thus could only assess initial seed dispersal (e.g. Godoy and Jordano 2001; Ziegenhagen et al. 2003; Grivet et al. 2009). Although we did not specifically test for such effects, our plot of effective seed dispersal distances within Barnett Demesne is consistent with a Janzen-Connell recruitment process (Janzen 1970; Connell 1971; Augspurger 1983). Dispersal in the stand peaked at 30–40 m, before tailing off quickly, suggesting density-dependent mortality close to the mother plant. A similar pattern was observed in a genetic study on Aleppo pine (Pinus halepensis) specifically designed to test for Janzen-Connell effects (Steinitz et al. 2011).

Conclusions

Our results suggest that although there is considerable genetic variation in ash trees across the whole of Ireland, there is no evidence of population genetic structure. Hence, the imposition of “seed zones” as part of a recovery plan for ash trees in the aftermath of near total mortality due to ash dieback may not be justified, and is an avoidable cost. Our findings of frequent, long-distance dispersal events have further implications for the survival and persistence of ash woodlands in the face of a range of threats. Infection by the causal agent of ash dieback, C. fraxinea, may lead to loss of woodlands, either by pathogenic mortality or by anthropogenic clearance as a means of control (Pautasso et al. 2013). The high capacity for dispersal indicated by our results suggests good potential for natural regeneration, as well as for the spread of resistance to the disease, both via seeds and via pollen-mediated gene flow from individuals exhibiting inherent resistance. In addition, high levels of migration will be necessary to respond to global climate change, although this is very much dependent on the rate and extent of these changes.