Introduction

As human activity increasingly shapes the distribution and abundance of species at scales ranging from landscape to global, an understanding of the ecological consequences must follow if we are to respond appropriately to conservation issues raised by these biogeographical alterations. The significant and wide-ranging consequences of anthropogenic influences on biogeographic patterns, biodiversity and ecosystem stability are exemplified in human-mediated species introductions. Once established, introduced species can have significant ecological and evolutionary impacts on native species, communities and ecosystems, and severe consequences for the economic welfare of invaded environments (Sakai et al. 2001). In recent years, it has been increasingly recognised that the genetic diversity and demographic stability of introduced populations are key indicators of the potential for self-sustaining establishment in novel environments (Suarez and Tsutsui 2008). Thus, population genetic and demographic investigations are valuable in indicating the requirement and direction for conservation focus in empirical introductions (Costa-Pierce 2003).

The rapid increase in global anthropogenic movements over the past century has led to an associated dramatic rise in the number of reports of species outside their natural ranges (Allendorf and Lundquist 2003). Although widespread commercial transportation has provided copious opportunities for species rafting, proportionately few imported populations successfully establish in new environments (Shigesada and Kawasaki 1997). Past investigations of this phenomenon have focused on the susceptibility of environments and characterisation of invasive species traits to account for this differential establishment ability (Schloesser et al. 1996). Furthermore, it has been recently recognised that imported species are frequently exposed to novel selective pressures in the new environment (Lee 2002). Although genetic characterisations of introduced populations (e.g. using mitochondrial DNA sequencing) are based on neutral genetic variation, they can infer whether a degree of genomic genetic diversity exists, in turn indicating a potential for adaptive evolutionary change (Sakai et al. 2001). The identification of the geographic source for introduced non-natives using phylogenetic techniques is fundamental to elucidating the influence of multiple introductions, gene flow and hybridisation on introduction success in addition to the reconstruction of introduction histories (Durka et al. 2005). Moreover, genetic analyses of introduced founder populations have valuable applications outside the field of biological invasions. Molecular investigations of founder populations also provide an invaluable model for the genetic consequences of demographic bottlenecks faced by many endangered species and fragmented populations (Frankham et al. 2002), and contribute to the understanding of natural colonisation and radiation processes.

Human-mediated invasions have often been documented within decades of initial import (Williams 1969). It is less common, however, to discover an invasion less than 5 years from the initial importation. Reports of the non-native Anolis sagrei (Sauria: Iguanidae) on the island of St Vincent, Lesser Antilles, dating back to 2003 (F. Providence, personnel communication), have provided a fortuitous opportunity for investigating the genetic processes involved in the early stages of introductions. Anoles have been widely transported throughout the Caribbean both by natural (Calsbeek and Smith 2003; Stenson et al. 2004) and human mediated dispersal (Campbell 1996). However, the St Vincent discovery is rare in that the single point of import is known, its current range within the island is small, and two native competitors (A. trinitatis and A. griseus) already occupy and partition Anolis niche space on the island (Schoener and Gorman 1968).

A. sagrei has been known to be a widespread and successful coloniser (Williams 1969). It has spread throughout the entire Florida peninsula (Campbell 2003), well into the south eastern US (see historical overview in Campbell and Echternacht 2003) and into mainland Belize/Mexico (Lee 1992). It has even reached some very isolated islands including Hawaii (McKeown 1996), Taiwan (Norval et al. 2002), Grenada (Greene et al. 2002) and St Vincent (this paper). An extensive phylogeographical study by Kolbe et al. (2004) identified the source of many of these populations using mitochondrial DNA data, and indicated that the high genetic diversity in introduced populations may result from multiple introductions from genetically different native populations.

In this study we use molecular sequencing techniques to reveal the source population for introduced A. sagrei in St Vincent. A phylogeny of A. sagrei was produced from mitochondrial DNA (mtDNA) sequence data comprised of haplotypes obtained from new sampling in St Vincent and published sequences for this species from throughout the native and introduced range (Kolbe et al. 2004). Qualities such as relatively low effective population size and rapid mutation rate make mtDNA useful for assessing genetic relationships of individuals or groups within a species, accounting for the popularity of mtDNA as a marker in phylogeographic studies (Avise 1994). We use haplotype networks and phylogeographic analysis to reveal the likely geographical origin for the introduced population, and discuss the impact of native range within-population genetic diversity on the reliability of identifying geographical source populations. We compare the genetic diversity of the St Vincent population to other (including A. sagrei) introductions, suggest its basis, and discuss the relative importance of genetic diversity and introduction history for establishment ability.

Methods

Thirty-six tail tip biopsies, preserved in 95% ethanol, were taken from hand-caught and released lizards at the site of invasion, adjacent to the dock in the industrial port of Campden Park, on the south west coast of St Vincent, Lesser Antilles (Fig. 1a), following the conditions of permit and strictly adhering to the protocol of the Academic Ethics Committee (AEC). Locality positions were recorded using a handheld Garmin 60CS GPS. Samples were collected throughout the limited range of the introduced population (~400 m2). To locate the geographic origin of the introduced population, A. sagrei mtDNA sequences representing areas from across the native (74 populations) and introduced (29 populations) range were used for comparison with St Vincent samples. These mtDNA sequences were deposited in GenBank by Kolbe et al. (2004) accession numbers AY655172-484; DQ846752-771. Eight outgroup haplotypes were used comprising three congeneric Cuban natives; A. homolechis, A. bremeri and A. quadriocellifer (GenBank accession numbers AY655164-171).

Fig. 1
figure 1

a Map showing geographic distribution between A. sagrei haplotypes within the SE Cuba, WC Florida, W Jamaica, St Vincent clade. Grey squares indicate haplotypes from Cuba, open squares indicate haplotypes from Florida/St Vincent, lined squares indicate haplotypes from Jamaica. 257 A. sagrei haplotypes from 74 native range populations and 77 haplotypes from 29 introduced populations from Kolbe et al. (2004) were included in this study (not marked on the map). b Network analysis showing genealogical relationship between A. sagrei haplotypes, shown in a, the number of nucleotide substitutions between haplotypes is shown by the number of dots connecting them in the network (Haplotype Identifiers: StV/TII = St Vincent/Tampa, Florida II; TI = Tampa, Florida I; StPI = St Petersburg, Florida I; StPII = St Petersburg, Florida II; JMI = Manchester, Jamaica I; JMII = Manchester, Jamaica II; JWI = Westmoreland, Jamaica I; JWII = Westmoreland, Jamaica II; PI = Portillo, Cuba I; P II = Portillo, Cuba II; NML I = N Media Luna, Cuba I; NML II = N Media Luna, Cuba II; NML III = N Media Luna, Cuba III)

Total genomic DNA was extracted using the DNeasy Tissue Kit (QIAGEN) following the manufacturer’s protocol for animal tissues. Approximately 1,200 bp of mtDNA, including NADH2 and two flanking tRNAs (tRNATrp tRNAAla) was amplified using the primers H5730 (Glor et al. 2004) and L4437b (Macey et al. 1997). The amplifications were performed in a total volume of 25 μl using 50 ng total genomic DNA, 20 μl (1.5 mM MgCl2) ReddyMix® PCR Master Mix (ABGene Surrey, UK) and 1 μM of each primer. PCR amplifications were performed using an MJ Research PTC-200 thermal cycler (Waltham, MA) using the following temperature cycling profile: a 15-min denaturation step at 95°C followed by 30 cycles of denaturation for 30 s at 94°C, annealing for 30 s at 50°C and extension for 1 min at 70°C. A final step at 72°C for 10 min followed the 30 cycles. Unincorportated deoxynucleotides and primers were removed using Promega 10× SAP and New England BioLabs Exonuclease I, following the manufacturer’s protocols. The forward and reverse primers were used for sequencing using dye-labelled terminators (ABI PRISM™ BigDye™ Terminator Cycle Sequencing Ready Reaction Kit) and subsequently run on an ABI Prism 377 DNA sequencer.

Sequences were aligned using CODONCODE ALIGNER 1.2 (CodonCode Co.) and replicated haplotypes were excluded from the analysis in order to reduce computational time. The substitutional model that best fit the data was assigned according to distance analyses performed using the log likelihood function of modeltest 3.0 (Posada and Crandall, 1998), and the chosen model was then used for subsequent distance-based analyses. McDonald and Kreitman’s (1991) test was used to evaluate the possibility of non-neutral evolution of the NADH2 gene (Ballard and Kreitman 1995). The genetic data analysis software packages PAUP × 4.0b10 (Swofford 1998) and MRBAYES version 3.1 (Huelsenbeck and Ronquist 2001) were used to reconstruct phylogenetic trees by maximum parsimony (MP) and Bayesian methods, respectively. Analyses included 334 unique haplotypes (excluding eight outgroups) from native and introduced populations across the native and invasive ranges, between one and ten unique haploypes per population. This strategy enabled all available information to contribute towards producing the most structurally informative phylogeny possible.

In all analyses, all sites were equally weighted. MP was performed using a random addition heuristic search with tree-bisection-reconnection with 1,000 random addition sequence replicates. Bootstrapping (1,000 replicates) was performed to obtain a relative measure of node support for the resulting MP tree (Felenstein 1985). The model of evolution used for Bayesian analysis was estimated by modeltest 3.0 and the parameter values were estimated by MRBAYES and fixed for the analysis. Four chains were run, with 2 million generations, sampling the chains every hundred generations. Time to stationarity was estimated by plotting tree log-likelihood score against generation number and determining the number of generations until the values reached an asymptote. The initial 25% of trees generated prior to stationarity were discarded as the “burn-in”. A majority rule consensus tree (‘Bayesian’ tree) was calculated from the posterior distribution of trees, and the posterior probabilities calculated as the percentage of samples recovering any particular clade (Huelsenbeck and Ronquist 2001). Three further independent Bayesian analyses were run to check for local optima.

A haplotype network was constructed using the network building software TCS (Clement et al. 2000) which uses a statistical parsimony procedure (Templeton et al. 1992). TCS produces a network linking different haplotypes only if they have a 95% probability of being justified by the parsimony criterion. Although network analysis is traditionally used to show genealogical relationships between haplotypes within populations (Templeton et al. 1992), it gives a useful visual indication of the number of steps (nucleotide differences) between haplotypes. Within-population nucleotide diversity (π) was calculated using MEGA3.1 (Kumar et al. 2004) for all populations sampled with more than one unique haplotype.

Results

A 1,194 bp fragment of mtDNA from a total of 342 sequences were used in the analysis. A single haplotype was identified from 36 sequenced St Vincent A. sagrei individuals. This haplotype was identical to a haplotype from Tampa, Florida, and the duplicate sequence was removed from further analyses. No insertions, deletions or stop codons were found in the data and the substitutional model of molecular evolution rejected the null hypothesis of neutral evolution for any comparison. Of the 1,194 sites, 666 were variable, 547 of which were parsimony-informative.

Bootstrapped MP and Bayesian analyses gave congruent tree topologies with similar, high bootstrap values, summarised in the Bayesian tree (Fig. 2). Both trees group St Vincent A. sagrei in a clade with 12 haplotypes from 6 localities across South East Cuba, West Central Florida and West Jamaica (shaded clade shown in Fig. 2; inset). Phylogenetic tree support was high for the clade (posterior probability 0.98, MP bootstrap support 81%, Fig. 2). Support for the 4 taxa clade grouping the St Vincent haplotype with Tampa and St Petersburg was high (posterior probability 1.00, MP bootstrap support 85%). The geographical distribution of A. sagrei haplotypes within the SE Cuba, WC Florida, W Jamaica clade is presented with geneological relationships among these haplotypes obtained from the network analysis (Fig. 1b). This clade was unconnected to other haplotypes because only connections with a probability higher than 95% are shown. Network analysis separates St Vincent/Tampa II from St Petersburg by 1 step, Tampa I by 4 steps, W Jamaica by 6 steps and from SE Cuba by 7 steps (Fig. 1b). Native populations have typically higher within-population nucleotide diversities, while introduced populations have typically lower diversities (Fig. 3). A notable exception is Florida, which contains among the oldest established of the introduced populations and has been colonised by multiple introductions from across the native range (Kolbe et al. 2004).

Fig. 2
figure 2

Bayesian phylogram of the NADH2 region sequence data, N = 342 haplotypes. Shaded clade is expanded in inset. Support for nodes are presented in the order Bayesian posterior probability value and maximum-parsimony bootstrap percentage. MtDNA haplotype data from Kolbe et al. (2004). Italics indicate colonised regions outside the native range

Fig. 3
figure 3

Histogram of within-population nucleotide diversity (π) for 1,194 bp within the mtDNA NADH2 region for all native and invasive populations included in the study, mtDNA haplotype data from Kolbe et al. (2004). Populations are grouped by country/state. Populations with no bar contain a single haplotype (4 populations from Cuba; 2 from Grand Cayman; Belize; 11 from Florida; 1 from Texas; 2 from Jamaica and St Vincent)

Discussion

Identical mtDNA haplotypes from St Vincent and Tampa II, FL supports a Tampa origin for the St Vincent population. Additionally, both MP and Bayesian tree topologies converge and group St Vincent into a well-supported clade with three haplotypes from Tampa and St Petersburg, FL, localities within small geographic distances (20 km). Network analyses group St Vincent, St Petersburg and Tampa together, separated from the next closest Jamaican haplotype by five steps, further supporting Tampa, FL as the source area for the St Vincent population. Commercial container imports have been routinely received at the Campden Park dock, St Vincent from the Northern Florida area during the period when the introduction occurred (F. Providence, personnel communication) while we are not aware of imports from any other of the native or introduced A. sagrei ranges, providing additional evidence for a Florida origin for introduced St Vincent A. sagrei.

The absence of the St Vincent/Tampa II haplotype from the native range haplotype set despite the use of data from a comprehensive study of intensive sampling effort (257 haplotypes from 74 populations from Kolbe et al. 2004), highlights a recurring problem when attempting to elucidate the origin of a new population of a widespread species. It is improbable that the haplotype would have been lost in the native range though drift, or formed in the invasive range through mutation since introduction to Florida in the late 1800’s (Williams 1969). The most probable explanation for the situation is that the haplotype exists in the native range but is absent from the sample set. This absence may be explained by the high genetic diversities within native range populations (Fig. 3; Table 1). Even thorough sampling will detect only a subset of the most frequently occurring haplotypes in these populations, with those at lower frequencies being overlooked. Following this logic, it is plausible, though perhaps less probable that the St Vincent population may originate from a source population in the native range which contains the St Vincent/Tampa II haplotype at a low frequency.

Table 1 Within-population nucleotide diversity (π) for each native and introduced range population of Anolis sagrei in this study that were found to contain more than a single haplotype

A recent study has also recognised that native population genetic diversity and sampling efficiency can substantially affect the accuracy of putative source population identification (Muirhead et al. 2008). In a simulation using published data sets, both the degree of genetic differentiation between source populations, and the number of source populations included in the study were found to significantly impact the accuracy of source population assignment. Furthermore, small sample sizes affected population assignment accuracy most when introduced populations comprised of dominant or fixed haplotypes that were at low frequencies in source populations (Muirhead et al. 2008). For this study, the sample size in the introduced St Vincent range was high (36 individuals in a single locality), and a large number of potential source populations were sampled (74 native populations and 29 populations across the introduced range). However, in accordance with Muirhead et al. (2008), because the haplotype was fixed in the introduced range and genetic differentiation between potential source populations was high (Fig. 3; Table 1), the estimation of the putative source population should be treated with some prudence.

Assuming a Tampa origin for A. sagrei in St Vincent, this recent importation represents a secondary introduction event, i.e. originating from an introduced rather than a native population. Other recent secondary introductions originating from Florida A. sagrei have been shown to retain a proportion of the genetic variation found within Floridian populations (Kolbe et al. 2004). In contrast we demonstrate that despite high within-population variation in the source population, Tampa (7.1% within-population nucleotide diversity, Table 1), the introduced population of A. sagrei in St Vincent, which contains a single mtDNA haplotype has experienced the most severe genetic depauparation of any of the documented global A. sagrei introductions. Such a complete absence of genetic diversity has rarely been documented in other introduced populations (although see Puillandre et al. 2008), although variation in more sensitive population genetic markers may represent severe founder effects (e.g. microstatellites, Hawley et al. 2006). For A. sagrei in St Vincent, the implied genetic founder effect is likely to be a result of colonisation history processes, directly via a demographic bottleneck, and augmented by the early stage of the St Vincent introduction, which was sampled before any subsequent, additional introductions may have had an opportunity to occur.

This study highlights the importance of determining source populations so that valid comparisons may be made with native populations. Such comparisons may be valuable in elucidating adaptive and plastic responses to environmental pressures, the influence that habitat preferences may have for population dynamics, potential impacts on native species and importantly, possible short- and long-term management strategies. For example, recent investigations have demonstrated that newly founded populations containing high levels of genetic diversity have shown rapid evolutionary responses to local conditions (Costa-Pierce 2003). In such cases, diversity often arises from the admixture of haplotypes originating from multiple introductions. Conversely, when a strong founder effect is combined with no admixture, the minimal amounts of available variance present are unlikely to enable adaptive genetic responses (Allendorf and Lundquist 2003), although some additive genetic variance may still be available to selection following demographic bottlenecks (e.g. via epistasis, Carson 1990).

The data presented here demonstrate, in contrast to previous genetic characterisation of introduced A. sagrei populations (Kolbe et al. 2004), that founder populations can exhibit extremely low haplotype diversity despite high source population genetic diversity, probably as a result of colonisation history. This study identifies a single mtDNA haplotype in a recent anthropogenically imported population of A. sagrei on St Vincent, identical to a haplotype from Tampa, FL, suggesting that Tampa is the geographical source for the introduction. However, it is also shown that multiple haplotypes with high nucleotide diversities within native range populations, combined with low frequencies (rareness) of introduced haplotypes in native ranges can impair attempts to identify source populations, even despite intensive sampling effort. Thus, both phylogenetic structure and within-population genetic diversities must be appreciated to identify the likely geographical source for introduced populations. Together with Muirhead et al. (2008) we recommend careful consideration of sampling design to avoid inaccurate assignment to putative source populations due to sampling errors.