Introduction

Many phylogeographic studies of marine organisms with larval dispersal have detected little structure along stretches of coastlines that span hundreds of kilometres or more (e.g. Nephrops norvegicus, Stamatis et al. 2004; Homarus gammarus, Triantafyllidis et al. 2005; Petromyzon marinus, Almada et al. 2008; Sprattus sprattus Debes et al. 2008; Lipophrys pholis, Francisco et al. 2011). This is assumed to result from the presence of planktonic eggs and/or larvae capable of dispersing over distances of kilometres or hundreds of kilometres per generation (however, see Ayre et al. 2009). Most studies assume that the Pleistocene and specially the Last Glacial Maximum (LGM) were critical times when populations were absent from the frozen north and suffered severe bottlenecks in their refugial habitats. In temperate areas like the west coast of Portugal and most of the south-west European shore, however, the conditions seem to have been more complicated. Although sea surface temperatures (SSTs) during the LGM along the coast of Portugal were not significantly different from what they are today (Abreu et al. 2003), several cold episodes towards the end of the Last glaciation considerably lowered mean summer SST (as much as 5 °C).

During the Holocene the oscillations in SSTs continued to be a salient feature of this geographic area, although their amplitudes were smaller. Some cold pulses such as the 8,400 years ago cold event and the Little Ice Age (LIA) caused temperature drops of one or a few degrees (Casper 2010) and may have affected the distributions of fish populations. A temperature anomaly need not to last very long to extirpate a population, it suffices that the population does not breed during a period that surpasses the lifespan of all its members. So, even short pulses that last for a few fish generations can cause a very substantial local effect. Even in present conditions small interannual oscillations can substantially affect the fish fauna, at least for the populations that are at or near the higher latitude limits of the distribution of species (Thomson and Lehner 1976; Stenseth et al. 2002; Henriques et al. 2007).

The Lusitanian toadfish Halobatrachus didactylus (Batrachoididae) (Bloch and Schneider 1801) offers a very interesting model to investigate the structure of populations that are at the extreme north of the species distribution comparing them with others further to the south, thus more sheltered from SSTs oscillations. The Lusitanian toadfish is a thermophilic demersal species that lives in shallow waters down to 50 m depth, often found buried in muddy and soft sand bottoms or concealed in rock crevices (Roux 1986; Bauchot 1987). It inhabits coastal and estuarine areas and its distribution ranges from the tropical Western African coast to the Mediterranean and the Atlantic shore of the Iberian Peninsula (Bauchot 1987) being virtually absent north of the Tagus River (Costa et al. 2003). At the Portuguese western coast, populations occur only in estuaries, while both coastal and estuarine populations can be found in the south (Costa 2004). It is considered a sedentary species (Roux 1986), but some individuals have been found to perform important displacements of up to 17 km (Costa 2004; Campos et al. 2008). The breeding season in the south of Portugal lasts from late spring to early–mid summer (e.g. from May to August in Algarve) (Modesto and Canário 2003) when water temperature reaches 18 °C, which is apparently the minimum required for breeding to take place (Amorim et al. 2006). H. didactylus is a polygynous species, characterized by low fecundity and high survival of embryos and juveniles (Palazón-Fernández et al. 2001). Males defend nests under rocks in shallow water, where they vocalize to attract females to mate with (Vasconcelos et al. 2011). Each female spawns a single batch of eggs annually, but males can guard the eggs of several females (Modesto and Canário 2003; Amorim et al. 2010). The eggs, that are rich in yolk, are guarded by the males until yolk sac absorption and juveniles venture for themselves when they are about 18 mm (Costa 2004). Thus, development is direct and no planktonic stage exists (Costa 2004).

The absence of a planktonic stage in H. didactylus reduces the likelihood that the signature of a bottleneck is quickly wiped out by migrants from other populations, a condition that is to be expected in marine fish with very large population sizes and high dispersal capabilities (e.g. Debes et al. 2008; Francisco et al. 2011).

In the Tagus estuary, summers with a very limited number of days with SSTs above 18 °C are not uncommon leading to a very limited production of young. This observation that unusual cold summers impact reproduction negatively led us to hypothesize that LIA, 8,400 and other cold events of smaller intensity may have caused local extinction of the population.

In this paper, we compare the genetic structure of the Tagus population with a series of other populations south to Algarve, an area with higher SSTs and probably less exposed to the extremes of cold events. The analysis is based on two markers: a putative fragment of the mitochondrial control region and the first intron of the S7 ribosomal protein gene.

Methods

Sampling

Samples of H. didactylus were collected by beach seining from 5 locations: Morocco (Salé), Algarve (Ria Formosa Estuary and Guadiana) and the estuaries of the Rivers Mira, Sado and Tagus (Fig. 1). A total of 117 individuals were obtained, and the number of individuals collected per site/per fragment is listed in Table 1.

Fig. 1
figure 1

Map with the sampling locations, haplotypes and haplotype network for Halobatrachus didactylus. Left distribution range of H. didactylus. Right sampling locations with a representation of the haplotypes presented and the haplotype network. Coloured circles represent the relative proportion of individuals. In the haplotype network the colours correspond to the sampling locations

Table 1 Sampling locations, numbers, haplotypes and diversity measures for Halobatrachus didactylus putative CR and S7 sequences

DNA extraction, amplification and sequencing

Total genomic DNA was extracted from fin or muscle samples preserved in 96 % ethanol with the REDExtract-N-Amp kit (Sigma-Aldrich) following the manufacturer’s instructions. Voucher specimens are deposited in ISPA (ethanol preserved tissues). PCR amplification of CR and the first intron of the nuclear S7 ribosomal protein gene (S7) were performed with the following pairs of primers: CR—LPro1 (5′-ACTCT CACCC CTAGC TCCCA AAG-3′) and HDL1 (5′-CCTGA AGTAG GAACC AGATG CCAG-3′) (Ostellari et al. 1996) and the first intron of the S7 ribosomal protein gene—S7RPEX1F (5′-TGG CCT CTT CCT TGG CCG TC-3′) and S7RPEX2R (5′-AAC TCG TCT GGC TTT TCG CC-3′) (Chow and Hazama 1998). A search in GenBank yielded no CR sequences for Batrachoididae. In spite of this, the pair of primers used in this study typically amplifies a portion of the CR of other species (e.g. Symphodus melops, Robalo et al. 2012; Salaria pavo, Almada et al. 2009) and succeeded in a considerable proportion of our samples. Because of this and as a note of caution, we refer to this fragment as a putative control region fragment.

PCR amplifications were performed in a 20 μl total reaction volume with 10 μl of REDExtract-N-ampl PCR mix (Sigma-Aldrich), 0.8 μl of each primer (10 μM), 4.4 μl of Sigma water and 4 μl of template DNA. An initial denaturation at 94 °C for 7 min was followed by 35/30 cycles (denaturation at 94 °C for 30/45 s, annealing at 55 °C for 30/45 s and extension at 72 °C for 1 min) and a final extension at 72 °C for 7 min on a BioRad Mycycler thermal cycler (values CR/S7, respectively). The same primers were used for the sequencing reaction, and the PCR products were purified and sequenced by STABVIDA (http://www.stabvida.net/).

DNA analysis

All sequences were edited with Codon Code Aligner (http://www.codoncode.com/index.htm) and aligned using Clustal X (Thompson et al. 1997).

ARLEQUIN software package V.3.5 (Excoffier and Lischer 2010) was used to estimate the genetic diversity within each sample, to access potential population differentiation and to perform neutrality tests. It was also used to perform analysis of molecular variance (AMOVA) (Excoffier et al. 1992) to compute pairwise F STs and corrected pairwise differences between populations. Significance levels of all multiple statistical tests were corrected using the false discovery rate (FDR) approach (Benjamini and Hochberg 2005) implemented in QVALUE package of software R (Dabney et al. 2006). In the case of the S7 intron, the analyses were also run in ARLEQUIN, after allowing the programme to reconstruct the haplotypes present, using the ELB algorithm (Excoffier et al. 2003). The number of migrants was also calculated for each population pair using the same software.

For the control region, the spatial analysis of molecular variance (SAMOVA 1.0) (Dupanloup et al. 2002) was used to identify groups of sampling locations that are geographically and genetically homogeneous and maximally differentiated from each other. This approach relies on a technique of AMOVA (Excoffier et al. 1992). However, in contrast to conventional AMOVA, SAMOVA does not require that the groups’ constitution is defined a priori, allowing instead the groups to emerge from the data. The most likely number of groups was identify by running SAMOVA with two to four groups and choosing the partition scheme with the highest F CT value. For each of the groups of locations identified by SAMOVA, Fu’s Fs (Fu 1997) and Tajima’s D (Tajima 1983) tests were performed to test for population expansion.

A haplotype network was constructed for the putative CR data using Network v4.5 (Bandelt et al. 1999). Past population demography of H. didactylus was inferred using the linear Bayesian skyline plot (BSP) (Drummond et al. 2005) model as implemented in BEAST v.1.6 (Drummond and Rambaut 2007) employing the Bayesian MCMC coalescent method, a HKY + I + G model of substitution, and a strict clock. The Bayesian distribution was generated using results from five independent runs of 150 million MCMC steps obtaining effective samples sizes (ESS) of parameter estimates of over 200, with a burn-in of 10 %. The time to most recent common ancestor (t MRCA) and the median and corresponding credibility intervals of the BSP were depicted using Tracer v.1.4 (Rambaut and Drummond 2007). MtDNA control region mutation rates in fish are widely variable (e.g. 2.2–4.5 %/MY between lineages for East African cichlids—Sato et al. 2003, to 15–20 %/MY for Indo-Pacific sardines—Bowen and Grant 1997). In the absence of a clock calibration for the CR of H. didactylus, we address the uncertainty by tentatively assuming a within lineage mutation rate of 5 %/MY. This value is within the range of values found by (Bowen et al. 2006) after a review of CR molecular clock calibrations for several tropical Atlantic fish species. Nevertheless, these estimations of divergence times must be interpreted with great caution.

Results

The putative control region (CR) data set consisted of a total of 377 characters (92 individuals, Genbank Accession Numbers JQ278604-95). The amplification of the first intron of the S7 gene (S7) yielded a fragment of 713 characters, with 15 polymorphic sites (117 individuals, Genbank Accession Numbers JQ310526-659). The putative CR haplotype network shows only 4 haplotypes (Fig. 1). The most abundant haplotype contains 66 fish from all localities. The locations of Tagus and Sado yielded only this haplotype. The only fish from Morocco included in the CR analysis also displayed this haplotype. The second haplotype (connected to the first by one mutational step) contains 16 fish only from Algarve and the Mira estuary. The same holds for the third haplotype, which contains 9 fish from the same locations. The fourth haplotype is a singleton from Algarve.

For both gene fragments, the genetic diversity indices for each population are summarized in Table 1. When we take together Fig. 1 and Table 1, we conclude that for the putative CR, the location of Algarve showed a strikingly high diversity when compared with all other locations. All haplotypes found in the remaining locations are present in Algarve. The haplotypes of Mira are shared with some fish from Algarve, yielding moderate diversity indices. However, the number of sequences obtained is low, and this finding must be taken with caution. The locations of Tagus and Sado display an impressive null diversity with a single haplotype present in all fish and shared with Algarve and Morocco. We suspect that there may be one or more haplotypes that failed amplification in fish from Algarve, Mira and Morocco. This situation is easily explained if one or more mutations hit the site of primer attachment. Our suspicion is based on the fact that some fish from Algarve, Mira and Morocco repeatedly failed in PCR of the putative CR while yielding good S7 sequences. We believe, however, that the main results of this study were not affected because, if anything, the diversity of Algarve, Mira and Morocco is underestimated. In Tagus and Sado all 49 fish yielded good PCR amplifications of the putative CR and the S7 intron, so that within the limits of our sampling size, no hidden haplotypes could be present as no fish failed amplification. For the S7 intron, no consistent trend of declining diversity from Algarve to Tagus/Sado was detected, which is to be expected if the processes that gave rise to these populations are so recent that a gene with a fourfold number of copies (compared to CR) had not sufficient time to undergo advanced or complete lineage sorting.

The pairwise F STs and corrected average pairwise differences are presented in Tables 2 (putative CR) and 3 (S7). For the putative CR, a very clear pattern emerges that was anticipated by the haplotype network. Tagus and Sado show zero differences, and both are closer to Algarve than to Mira, which in turn shows relatively small differences from Algarve. These results make full sense if the population of Algarve gave rise independently to a population in Mira, on one hand, and another clade that possibly founded the Sado population and later that of the Tagus. For the S7 intron, the smallest F ST and corrected pairwise differences are between Sado and Mira, the pair that also shows the highest number of migrants (28.605). Migration values for the remaining pairs are >1 only for Sado/Algarve = 2.282; Tagus/Algarve = 2.271; Tagus/Morocco = 1.067; Mira/Algarve = 1.238 and Algarve/Morocco = 1.045. Except for the Moroccan fish, Tagus and Sado show the largest differences among them in FST and in average corrected pairwise differences.

Table 2 Pairwise F ST and corrected average pairwise differences among locations for putative CR of Halobatrachus didactylus
Table 3 Pairwise F ST and corrected average pairwise differences among locations for S7 intron of Halobatrachus didactylus

SAMOVA (putative CR dataset) did not yield significant FCT for any of the groups analysed (data not shown). In view of this, we rejected the hypothesis that Portuguese populations cluster in well-defined groups. None of the sample localities tested displayed signs of expansion as Tajima’s D and Fu’s Fs tests never yielded significant negative values (results not shown). An AMOVA for the putative CR performed with the 4 locations showed that there is significant interpopulation variation: percentage of variation among populations = 47.17; percentage of variation within populations = 52.83 (overall F ST = 0.471, p < 0.001). A similar result was obtained when we performed an AMOVA for the S7 data (including also individuals from Morocco): percentage of variation among populations = 36.24; percentage of variation within populations = 63.76 (overall F ST = 0.362, p < 0.001).

The BSP (resulting from BEAST) for the putative CR data set of H. didactylus (Fig. 2) reveals that the Lusitanian toadfish, as a whole, experienced a mild growth in the past 40–60 years, reaching a Nef of ~6,000 fish in the present day, the t MRCA being 2,910 years ago (confidence interval: 0.05–32.11 kya, median: 2.91 kya).

Fig. 2
figure 2

BSP for the CR data set of Halobatrachus didactylus. Bayesian Skyline Plot for the mitochondrial control region of Halobatrachus didactylus

Discussion

The results presented above are striking because of the very low number of haplotypes that make up the Portuguese populations and the fact that the populations that are at the species northern limit are extremely impoverished concerning the putative control region. We suggest that H. didactylus, being a warm water fish, may be in a very unstable demographic and genetic situation in Portuguese waters, especially in Tagus. As stated in the introduction, the species does not breed below 18 °C and in cool summers, when temperature drops below this point, male mate attraction calls are suppressed and nests with eggs are scarce (Amorim et al. 2006). Thus, if a cold event, even of moderate intensity, takes place, the populations of Tagus and Sado, and eventually that of Mira, may be extirpated. At Algarve SST is more frequently above 18 °C in May and June than in the Tagus area (NOAA, http://www.nodc.noaa.gov/dsdt/oisst/). Thus, at Algarve the fish are more likely protected from small amplitude lowering of SSTs than at Tagus or Sado. We propose that repeated extinctions in the extreme north of marine species distributions may be more common than usually assumed and likely continue to occur in interglacial times. An interannual survey of 10 years concluded that about 20 % of the species of the inshore fish assemblage at Arrábida (Central Portugal) could change from 1 year to the next, when SST changed from cold to warm years and vice versa (Henriques et al. 2007). When two successive warm years took place, some warm water new comers were even able to breed but disappeared completely from the censuses when a new cold year occurred. Similar results have been widely documented in the literature. For example, Thomson and Lehner (1976) found that for the intertidal fish communities in Baja California, cold water events could, in some years, affect drastically the tropical and subtropical components of the fish fauna. Additionally, Stenseth et al. (2002) reported the same results in a variety of groups of marine animals around Western Europe, after a transition from several mild winters to a cold winter in 1996. These episodes of change in the composition of fish communities are probably underestimated for two reasons: (1) long-term surveys are still insufficient; and (2) most fish have dispersive propagules that allow rapid recolonizations involving large numbers from nearby populations, so that the track of each colonization may be quickly erased.

The absence of a pelagic stage and the sedentary nature of H. didactylus would mean that recolonizations of the north would likely be achieved by a few migrants per generation, helping to make the dynamics of local extinction and recolonizations in the north easier to detect with genetic tools. We propose that the putative CR data are compatible with the following scenario: locations at Algarve (or closer to the entrance of the Mediterranean) could have operated as refugia where toadfish could survive many cold events. Populations more to the north would likely go extinct. When conditions improved, fish from Algarve or more southern locations expanded northwards, colonizing Mira, on one hand, and Sado on the other. From Sado some fish would be able to reach Tagus. All these migration routes will be used by very small numbers of fish, giving rise to strong drift that culminated in the Sado/Tagus clade. How to reconcile this data with those for S7? The significant values of most pairwise F STs for S7 point to a reduced connectivity among populations. Mira and Sado showed the smallest differentiation and the highest number of migrants for S7. For all populations, the diversity indices for S7 were higher than those for the putative CR. This result is to be expected because a marker with 4 times the number of copies (compared to CR) suffers the effects of drift with less intensity and, if the time is short, the process of lineage sorting will tend to occur less markedly. Moreover, if the diversity of the S7 marker was sufficiently high in the source populations, a random process of migration to the north could leave different alleles of S7 in different estuaries. Another interesting possibility is that the males disperse more than females. This would explain the possibility of migration between Sado and Mira, suggested by S7 but apparently contradicted by the putative CR. As males may fight for territories and there are sneakers (small males without territories that try to fertilize eggs in others males nests) described for this species (Modesto and Canário 2003), the possibility of a higher dispersal rate for males seems a possible explanation to the present findings.

How old is the recolonization of Sado and Tagus? We know that in the Holocene, several cold events did take place that could trigger a process like the one hypothesized in this paper. Aside from the recent Little Ice Age (only a few centuries from the present), other cold events were recorded after the end of the Last glaciation (e.g. Casper 2010). With the lack of solid information on mutation rates, it makes no sense to try to ascribe the demographic bottlenecks and probable recolonization of the Tagus and Sado to a specific event. In any case, it seems reasonable to consider that we are dealing with processes that took place after the Last glaciation, probably only a few hundreds or thousands of years from the present. The fact that the populations of Mira, Sado and Tagus acquired few or no mutations of their own and the sharp contrast in diversity between CR and S7 provide evidence compatible with very recent phenomena that allowed no time for population recovery. On a longer timescale even the population of Algarve is not very diverse genetically, suggesting possible marks of older events. However, these older processes must be studied on a broader geographic framework. The Portuguese populations are likely too eroded genetically to preserve an older demographic record.

The present paper addresses events that were probable at the extreme north of the species range and does not intend to replace a full phylogeographic analysis of the species. Indeed, the main caveat of this study is the lack of samples covering a larger part of the geographic distribution of the species, namely tropical and subtropical Africa where samples are very hard to obtain. We hope that, in the future, it would be possible to examine deeper phylogeographic processes affecting this species through international collaboration.

One major conclusion of the present study is the need to combine molecular genetic studies with ecological surveys that allow us to detect and study advances and retreats of species on a rapid basis. In addition, we suggest that more attention must be given to populations that are at the limits of their species distributions, that have reduced or no planktonic stages or have limited dispersal capabilities for any other reason. In general, repeated local extinctions and recolonizations may be a frequent feature of the history of many marine fish populations as it was proposed several years ago (Grant and Bowen 2006).