Introduction

The likelihood of speciation-with-gene-flow is still one of the most debated topics in evolutionary biology with the difficulty of isolating the effect of gene flow and the effect of time since population divergence (Nosil 2008), while gene flow is frequently considered as a constraining force in evolution. Nevertheless, a growing number of examples of speciation-with-gene-flow are being reported, and are advancing our understanding of speciation in the absence of geographic obstacles (Jonsson et al. 2014; Keller et al. 2013; Martin et al. 2013; Stacy et al. 2017). Several conceptual frameworks have been developed for the comparison of speciation with and without gene flow (Feder et al. 2014; Smadja and Butlin 2011). In the speciation-with-gene-flow process, the effective migration experienced across the genome depends on many factors such as recombination rate, strength of selection, and number of selected loci. These dynamic changes occurring across the genome indicate that whole genomic analysis can provide better empirical understanding of the genetic differentiation and speciation (Gagnaire et al. 2013).

Due to the sequential geological formation of islands in the Hawaiian archipelago, founder events, and speciation within the Hawaiian Drosophila, the iconic example of adaptive radiation, appears to have occurred in sequential order from the oldest (northwest) to the youngest (southeast) islands (Price and Clague 2002). Drosophila silvestris (SIL) and D. heteroneura (HET) from the Big Island (Hawai'i) are presumably the youngest species in the group, and D. planitibia (PLA) from Maui is their closest sister species living in allopatry (Magnacca and Price 2015). The two sympatric species, D. silvestris and D. heteroneura, have been known to hybridize in nature in spite of mate discrimination (Carson 1989). However, neither the extent of hybridization footprints at the whole genome level nor a possibility of gene flow between allopatric species has yet been tested in this system. Here, gene flow between allopatric species would entail surviving the crossing a 26-miles wide channel between Maui and Hawai'i and then finding a suitable mate.

One of the most productive attempts to test for interspecies gene flow explores a genome-wide excess of shared derived alleles between taxa using Patterson’s D statistic (Durand et al. 2011). Known as the four-taxon test, or ABBA/BABA test, the method was originally developed to determine whether there was a genetic admixture between modern human and neanderthal populations (Green et al. 2010). We use a modified ABBA/BABA method which incorporates the fixation index (F ST) with genome sequences of D. silvestris, D. heteroneura, and D. planitibia that we recently obtained (Kang et al. 2016) and the genome sequences of D. grimshawi, a more distantly related Hawaiian Drosophila, that had its genome first reported by the Drosophila 12 Genomes Consortium (Clark et al. 2007). We test for gene flow in sympatry, as well as allopatry, between the three Hawaiian picture-winged Drosophila species.

Materials and Methods

Genome Sequencing and Analysis

Genomic sequences from D. silvestris, D. heteroneura, and D. planitibia were obtained, processed, pre-assembled, and annotated as described by us elsewhere (Kang et al. 2016). Briefly, pooled genomes from ten non-inbred males per species were used for construction of paired-end and mate pair libraries that were Illumina-sequenced at a total >80 × coverage. De-duplicated reads were then assembled with Spades (Bankevich et al. 2012) and protein-coding genes were predicted using MAKER2 (Holt and Yandell 2011). The completeness of assembly estimated with CEGMA (Parra et al. 2007) was in the range of 93–98% (‘complete’) and 98–99% (‘partial’).

F ST, Tajima’s D, π, Watterson’s θ, and Ne Calculations

Sequences were mapped using BWA (Li and Durbin 2010) with default parameters and D. grimshawi assembly as reference. Samtools (Li et al. 2009) was used to generate the pileup result. SNPs within 10 bp of indel were discarded, and PoPoolation2 (Kofler et al. 2011b) was used to estimate the F ST value for each SNP. All pairwise analyses used the maximum number of sites, i.e., F ST estimates were based on sites that were polymorphic in at least one of the three species or divergent (if monomorphic) between at least two species. The progressiveMauve (Darling et al. 2010) was used for multiple sequence alignments of D. silvestris, D. heteroneura, D. planitibia, and D. grimshawi. PoPoolation (Kofler et al. 2011a) was used to estimate Tajima’s π and Watterson’s θ, with the window size set to 10 Kb. Ne, was calculated as 4 µ/θ, where mutation rate µ = 3.5 e−9 was taken from Drosophila melanogaster (Keightley et al. 2009).

ABBA/BABA Approach

We used a modified ABBA/BABA method (Green et al. 2010) using D. grimshawi (GRI) as an outgroup. Reference D. grimshawi genome sequences were downloaded from FlyBase (r1.3, http://flybase.org), and PLA, HET, and SIL reads were mapped using BWA (Li and Durbin 2010) with default parameters. We generated the genotypes for each species using GATK (DePristo et al. 2011) with default parameters except for setting heterozygosity to 0.01. Only sites with genotyping quality greater than 30 and minimal depth 10 were kept. After genotyping, we determined the G1G2G3G4 pattern for each SNP position, where G1, G2, G3, and G4 represent genotypes in HET, SIL, PLA, and GRI, respectively. GRI alleles were considered ancestral (A), and sites with more than two alleles were filtered out. For each site, species were either assigned a “A” or “B” in the four taxa pattern according to the agreement of its genotype with GRI. The “AABA” pattern is the position in which PLA carries derived allele “B,” but HET and SIL carry ancestral allele “A” as GRI. Similarly, the “BBAA” pattern indicates that HET and SIL carry derived allele, while PLA carries the ancestral allele, and so forth. We used similar D statistic (Green et al. 2010) for AABA and BBAA and other combinations to test if the two patterns were of equivalent abundance/counts using a jackknife block size of 2 Mb. To evaluate the linkage between the four taxa pattern and sequence diversity, we estimated F ST values in the neighborhood of each pattern p \(({\text{p}} \in \left\{ {{\text{AABA}}, {\text{ABAA}},{\text{ABBA}},{\text{BAAA}},{\text{BABA}},{\text{BBAA}}\left. {,{\text{BBBA}}} \right\}} \right.).\) First, for each SNP from the pattern p, we calculated Rs (where \({\text{s}} \in \left\{ {{\text{HS}}, {\text{HP}}, {\text{SP}}} \right\}\) and H, S, P stands for HET, SIL and PLA, respectively), which is an average normalized F ST of N downstream and N upstream SNPs (excluding the SNP itself). Normalized F ST is obtained from the raw F ST value divided by the mean F ST for each species pairwise comparison s. In each block of the size of n bases, the statistic D for s1 and s2 (\({\text{s}}1,{\text{s}}2 \in \left\{ {{\text{HS}}, {\text{HP}}, {\text{SP}}} \right\}\)) was calculated:

$$D_{{{\text{s}}1,{\text{s}}2;p}} = \frac{{\mathop \sum \nolimits_{i = 1}^{n} B_{{{\text{s}}1,{\text{s}}2}} (i)}}{{\mathop \sum \nolimits_{i = 1}^{n} C_{\text{p}} (i)}},$$

where C p(i) can be 1 or 0 according to the pattern p is seen in position i or not, B s1,s2(i) is given by:\(B_{{s1,s2}} \left( i \right) = \left\{ {\begin{array}{*{20}c} {1,\quad if\,R_{{s1}} (i) > R_{{s2}} (i)} \\ {0,\quad if\,R_{{s1}} \left( i \right) = R_{{s2}} (i)} \\ { - 1,\quad if\,R_{{s1}} \left( i \right) < R_{{s2}} (i)} \\ \end{array} } \right.\) Other than Fig. 3 with various Ns, results for N = 100 only are reported throughout the text. Mean D and its variance were used to obtain Z-scores for jackknife tests, with the block size of 2 Mb, and a total of 100 blocks were generated. To reduce the effect of recent HET–SIL ancestry as a confounding factor in F ST calculations for the HET–SIL pair, we used only fixed SNPs because fixation permits more time (~4Ne) for eroding the association by HET–SIL ancestral LD under the neutral model. For comparison, we used the same method and tested for all sites including non-fixed genotypes (total 7,359,805 sites, including 5,216,097 sites found in SIL and 5,215,281 sites found in HET, thus indicating similar mutation rates in SIL and HET). We obtained very similar results, including F ST dynamics and ancestral branch estimates (see below). We also simulated alternative demographic models on ABBA/BABA expectations using MSMS software (Ewing and Hermisson 2010) and confirmed that our results are best explained by gene flow between species (Supplementary Table 1 and Supplementary Fig. 1).

The Inference of Ancestral Relationships

Using a neutral coalescent model without gene flow, incomplete lineage sorting, or recurrent mutations, let us assume that AABA (a total of 118,181 mutations) accumulates only along the PLA branch (ac, Supplementary Fig. 2). We also assume that XBAA (X can be either A or B) accumulates along the branch abd leading to SIL (Let us ignore HET branch for now, as HET may have hybridized with an unknown species—see “Results and Discussion” section), and we note that lengths of branches a–c and a–b–d are the same (assuming equal mutation rates in all three species). Therefore, the number of XBAA should be equivalent to AABA. XBAA can be divided into three sets, with the first set accumulating divergence along the branch a–b belonging to the common SIL–HET ancestor, quantified as S 1, or the count of ancestral BBAA genotypes. The second set includes variants accumulated along the branch b–d in SIL and exchanged with HET through hybridization (S 2, the count of recombinant BBAA genotypes). The third set includes variants accumulated along the branch b–d and unique to SIL, i.e., not exchanged with HET (S 3, the count of ABAA genotype = 42,187 mutations). Then S 1 + S 2 + S 3 = AABA, and thus S 2 = AABA − S 3 − S 1 = 75994 − S 1. If we assume that SIL and HET had equal contributions to the number of recombinant BBAA, then the BBAA count cannot exceed \(2S_{2}\), that is, <151988 − 2S 1. Since the total number of BBAA is 132558 and should not exceed \(151988 - 2S_{1} + S_{1}\), we obtain \(132558 < 151988 - S_{1} \Rightarrow S_{1} < 19430\). The estimate indicates that the ancestral branch a–b does not exceed 16.5% (19430/118181) of the a–c PLA branch (see “Materials and Methods” section for details). We obtained a similar estimate (17.3%) including both fixed and non-fixed alleles.

ABBA/BABA Simulations

MSMS software (Ewing and Hermisson 2010) was used to simulate different demographic models and their consequences on SNP patterns related to ABBA/BABA tests. Gene flow was simulated by adding a migration event (-m). A total of seven demographic models were simulated, each demographic model representing gene flow between.

  1. a)

    HET and SIL (m(HS));

  2. b)

    SIL and PLA (m(SP));

  3. c)

    HET and PLA (m(HP));

  4. d)

    SIL and PLA, and between HET and PLA (m(SP + HS));

  5. e)

    HET and PLA, and between HET and SIL (m(HP + HS));

  6. f)

    SIL and PLA, between HET and PLA, and between HET and SIL (m(HP + SP + HS)); and

  7. g)

    No gene flow (m(NULL)).

We assumed Ne to be 100,000, adjusted across species according to their estimated effective population sizes (-en 0.28 1 2 -en 0.28 2 1.40), and we simulated the recent decrease of population size (-en 0.002 1 0.20 -en0.002 2 0.20 -en 0.002 3 0.04, (Foote 1995)). A total of 377,627 segregated sites were simulated, corresponding to the number of fixed genotype differences relative to D. grimshawi. Estimated divergence times from mcmctree were used to adjust the time of joint events (−ej). Average numbers of genotypes were counted based on 100 simulation cycles. The command line for m(HS) simulation was given as follows:

java -jar msms.jar -N 100000 -ms 3 100 -s 377627 -I 3 1 1 1 -t 1400 -r 1400 -en 0.28 1 2 -en 0.28 2 1.40 -ej 0.145 3 2 -ej 0.274 2 1 -en 0.002 1 0.20 -en 0.002 2 0.20 -en 0.002 3 0.04 -em 0.05 3 2 5 -threads 12.

We found the model m(HS) to be most consistent with our empirical results, thus corroborating the close sympatric relationship between these two species (Supplementary Table 1 and Supplementary Fig. 1).

Results and Discussion

We used the ABBA/BABA method (Green et al. 2010) to contrast the two phylogenetic hypotheses, based on a total of 3,736,586 sites with fixed differences. To determine if paleo-PLA could be an ancestor of SIL and HET, we used a modified version of the ABBA/BABA method (Green et al. 2010) with D. grimshawi (GRI), a more distantly related Hawaiian picture-winged Drosophila (Clark et al. 2007) as an outgroup. We estimated the frequency of the BBAA (in the order: HET, SIL, PLA, GRI throughout the text) SNP pattern as a measure of the inter-node divergence time (a–b) between the most recent common SIL–HET ancestor and PLA ancestor (Supplementary Fig. 2). The BBAA pattern was abundant. It was enriched beyond a frequency easily attributable to inter-node ancestral divergence, as it exceeded the divergence of the PLA (AABA) branch by 11% (jackknifed Z-score = 3.78, P < 0.0002). This also suggests that at least some SNPs shared by SIL and HET (BBAA) may have originated through introgression after sympatric interbreeding between SIL (ABAA) and HET (BAAA). This potential genetic admixture between SIL and HET is consistent with previous observations that up to 2% of F 1 and backcross SIL–HET hybrids were found in some localities of Hawaii (Carson 1989). After subtracting the estimates of recombinant BBAA from the total BBAA count (see “Materials and Methods” section for details), we found that the length a–b (to HET–SIL putative ancestor) branch was 16.5% of the a–c (to PLA) branch or less. This coalesces PLA very close to the most recent ancestral node for HET and SIL, indicating that the PLA lineage was likely ancestral to both HET and SIL.

The relatively high frequency of the PLA-specific AABA SNP pattern indicates divergence time almost twice as long as that in the PLA branch relative to the HET (BAAA) and SIL (ABAA) branches. This is roughly consistent with the difference in the geological ages of Maui and Hawaii Islands and the hybrid sterility observed between PLA and both HET and SIL (Brill et al. 2016). Additionally, the HET-specific BAAA SNP pattern is enriched relative to the SIL-specific ABAA pattern (jackknifed Z-score = 10.54). This possibly reflects elevated genetic drift in HET due to the dramatically reduced population size that has led to its endangered species status (Muir and Price 2008), or reflects the admixture between HET and an unknown species or population having occurred after the HET–SIL split. Using Tajima’s estimates of nucleotide diversity π and Watterson’s θ (Fu 1994) in HET (π = 0.00647, θ = 0.0073), SIL (π = 0.00798, θ = 0.0101), and PLA (π = 0.0109, θ = 0.0125) (Fig. 1), as well as mutation rate estimates µfrom D. melanogaster (µ = 3.5 e−9 (Keightley et al. 2009)), we estimated effective population sizes Ne for HET, SIL, and PLA to be 504,286, 728,571, and 1,057,143, respectively. This smaller Ne of D. heteroneura suggests that drift effects in this species may indeed be stronger compared with the other species. From the shapes of the site frequency spectrum (or allele frequency spectrum) of the three species, SIL and HET have more similar demographic configurations compared to that of PLA (Fig. 2).

Fig. 1
figure 1

a Distribution of the mean number of pairwise differences (Tajima’s π) and b Watterson’s θ for D. heteroneura (red), D. silvestris (blue), and D. planitibia (green). Both π and θ values were calculated based on a window size of 10 Kb (Color figure online)

Fig. 2
figure 2

Site frequency spectrum of three species: D. heteroneura, D. silvestris, and D. planitibia

We also found a high frequency of both ABBA and BABA, two SNP patterns that are incongruent with phylogeny. Their presence can be explained by (1) hybridization between PLA and SIL (ABBA) and PLA and HET (BABA), (2) recurrent mutations, or (3) incomplete lineage sorting (Green et al. 2010). Remarkably, the ABBA/BABA test (Green et al. 2010) indicates significant hybridization between HET and PLA (mean = −0.087, SE = 0.022, jackknifed Z-score = −4.03). However, caution is needed in interpreting these results, since BAAA is overrepresented relative to ABAA, which could be a violation of substitution or fixation rate constancy. We notice that the ABBA/BABA test is in fact no longer significant (mean = −0.006, SE = 0.007, jackknifed Z = −0.92) when both polymorphic and fixed sites are counted, as opposed to fixed sites only; and BAAA and ABAA numbers are then less dissimilar (301,326 and 226,184, respectively).

To explore hybridization as a source of ABBA/BABA enrichment, we reasoned that recently introgressed haplotypes have had insufficient time to be broken down by recombination, and that closely linked introgressed alleles should therefore occur in linkage-disequilibrium ‘islands’ (Martin et al. 2013; Sankararaman et al. 2012), which, in turn, should exhibit reduced sequence divergence in recipient-donor species comparisons relative to non-introgressed regions. To narrow down potential introgression ‘islands,’ we focused on the neighborhoods of all informative SNP patterns from the ABBA/BABA approach and their species pairwise F ST values (Table 1). A number of very specific predictions can be tested using this new method. For example, if ABBA and BABA reflect interbreeding between PLA and, respectively, SIL and HET, F ST values from comparisons involving PLA should be significantly decreased in the neighborhoods of these SNP patterns. Such a pattern was indeed observed. In fact, ABBA and BABA produced the largest and most statistically significant differences between normalized F ST values (jackknifed 7.05 < |Z-scores| <17.58, Supplementary Table 2), exceeding the estimates for BBAA. This is expected only if BBAA was a mixture of ancestral and HET–SIL post-hybridization introgressive genotypes, whereas ABBA and BABA were mostly post-hybridization-introgressed genotypes. For comparison, relatively low differences between normalized F ST values were found within AABA, a genotype specific of PLA divergence with F ST estimates least likely to have been affected by introgression/recombination on Hawai'i Island. We also predicted that if BBAA derives from recent interbreeding and introgression between HET and SIL, there should be a significant decrease in F ST between HET and SIL, but not in PLA–SIL or PLA–HET, across the BBAA neighborhoods. Indeed, not only did we observe a dramatic (2.1×) decrease in mean HET–SIL F ST values, but we also found that the lowest F ST values within the HET–SIL comparison were produced by BBAA, as expected under introgressive hybridization (Table 1; jackknifed |Z-scores| >11.03).

Table 1 Frequencies of trans-species single nucleotide polymorphisms (SNPs) with their corresponding F ST values (normalized by mean F ST value for all three pairwise comparisons)

Alternatively, F ST values around BBAAs will necessarily be biased downwards if the speciation time is much more recent than the ultimate time to coalesce or, in other words, BBAA are surrounded by neighborhoods of LD spikes originating from the HET–SIL common ancestor rather than from recent introgressions. To minimize this bias, we focused on fixed SNPs, which under a neutral model require on average ~4Ne generations for fixation, a long and likely sufficient time to erase most of the internal branches corresponding to a shallower coalescent history. Although this time-dependent homogenizing effect of recombination applies to both ancestral and newly introgressed linkage groups, our approach introduces an ascertainment bias in favor of recent LD spikes. Additional support for enrichment of the configurations BBAA, BABA, and ABBA due to introgressive hybridization comes from the analysis of species pairwise F ST with respect to genomic window sizes used (Fig. 3). Unlike HET–PLA and SIL–PLA, HET–SIL F ST values from BBAA neighborhoods tend to grow with the window size, consistent with introgression erosion due to recombination (Fig. 3a). Similarly, SIL–PLA F ST values tend to increase only in ABBA neighborhoods (Fig. 3b), while HET–SIL F ST values rise only in BABA neighborhoods (Fig. 3c). This suggests possible introgressive hybridization between PLA and SIL, as well as between PLA and HET, respectively. All other SNP patterns produce F ST values slightly declining with the window size, which is a typical genome-wide tendency due to the greater sampling error in smaller windows (Beissinger et al. 2015). Thus in addition to previously reported SIL–HET hybridization (Carson 1989), there may have been occasional interbreeding between PLA and the other two species, after migrants had successfully crossed the 26-mile wide Alenuihaha Channel separating Maui from Hawai'i. Although the direction of the inter-island migration and hybridization remains unknown, a Maui-Hawaii flyover by PLA is a more parsimonious scenario as compared to HET and SIL both migrating back to Maui.

Fig. 3
figure 3

Average F ST values in the neighborhoods of SNP patterns with respect to species pairwise comparisons and window sizes (N). a BBAA. b ABBA. c BABA

In addition to D. silvestris and D. heteroneura, natural hybridization has been documented only for few other Hawaiian picture-winged Drosophila species: D. setosimentum and D. ochrobasis, and D. hawaiiensis and D. engyochracia (Carson et al. 1975; Kaneshiro 1990; Price and Muir 2008). However, the frequency of hybridization in Hawaiian picture-winged Drosophila in nature assessed with traditional methods has likely been underestimated because of the extensive analyses required to detect hybrids in the wild (Carson et al. 1975; Kaneshiro 1990; Price and Muir 2008. New genome-wide methods hold promise for investigating the extent and time course of genomic admixture, both before speciation and during different time periods following speciation (Martin et al. 2013; Sousa and Hey 2013). The rapid speciation and radiation of Hawaiian picture-winged Drosophila (Craddock 2000), and plausible changes in mating behavior in small populations (Kaneshiro 1983), suggest that the Hawaiian picture-winged Drosophila may be particularly prone to hybridization, which in turn may compromise the genetic integrity of some of the species, potentially posing a serious conservation challenge.