Introduction

Lakes that have persisted for at least 100 ka, i.e. ancient or long-lived lakes (Gorthner 1994; Cohen, 2012; Wilke et al., 2016) tend to harbour biota characterized by high levels of species diversity and endemicity (Brooks, 1950; Gorthner, 1994; Martens et al., 1994). The Great Lakes of the East African Rift Valley (Victoria, Tanganyika, and Malawi) are prime examples of such ancient lakes and are well known for harbouring species flocks of cichlid fishes that most likely evolved sympatrically after the formation of each of these lakes (McKaye et al., 1984; Seehausen, 2000; Lande et al., 2001; Seehausen, 2006). This tremendous diversity of teleost fishes, however, is usually not mirrored by similarly high levels of invertebrate diversity (Michel, 1994; Cumberlidge et al., 1999; Martens & Schön, 1999; Schön et al., 2001; Marijnissen et al., 2006; Genner et al., 2007; Salzburger et al., 2014). While ostracods and gastropods show higher levels of species diversity within the East African Great Lakes compared to the surrounding areas (Martens & Schön, 1999; Schön et al., 2001; Genner et al., 2007), brachyuran crabs do not show a comparable level of elevated intra-lacustrine diversity (Cumberlidge & Meyer, 2011). This is surprising, because African freshwater crabs likely face ample ecological opportunities for adaptive diversification in ancient lakes because they can occupy a range of trophic niches such as shredding (Hill & O’Keeffe, 1992; Dobson et al., 2002; Masese et al., 2014) and predation (West & Cohen, 1994; Weigand et al., 2014). In Lake Tanganyika, primary freshwater crabs of the family Potamonautidae have formed a small endemic species flock comprising nine phylogenetically closely related, but morphologically and ecologically differentiated Platythelphusa species (Cumberlidge et al. 1999; Marijnissen et al., 2004, 2006, 2008, 2009). Nevertheless, their species diversity lags far behind the c. 250 cichlid species in this lake (Turner et al., 2001). Even more extreme is the situation in Lake Malawi, which harbours about 700 cichlid species (Meyer, 1993; Turner et al., 2001), while only one potamonautid crab, the Malawi blue crab has been reported (Daniels & Bayliss, 2012). Comparably low freshwater crab diversity is also found in Lake Victoria (one species, not endemic) and Lake Kivu (four species, two of which are endemic; Cumberlidge & Meyer, 2011).

Comparing species numbers between cichlids and freshwater crabs in the East African Great Lakes supports the view that behavioural components of sympatric speciation, which are thought to be major drivers of accelerated speciation rates in cichlids (Danley & Kocher, 2001), play only a minor role in freshwater crabs. For example, unlike cichlids, East African freshwater crabs do not display sexually dimorphic colour patterns. These colour ornaments evolved through sexual selection and serve (beside other functions) mate attraction (Lande et al., 2001). While ‘sensory drive’—divergent adaptation of sensory and signalling systems in a heterogeneous environment that may cause premating isolation—plays a role in intra-lacustrine cichlid speciation (Seehausen et al., 2008), it has never been proposed to drive speciation in freshwater crabs. Also, small effective population sizes as a consequence of mouth-breeding have been proposed to drive fixation of rare alleles in cichlids (Salzburger et al., 2005), but small population sizes do not apply to the Malawi blue crab which has large populations in Lake Malawi, even if it is a direct developer with comparatively small broods (Yeo et al., 2008).

Nevertheless, the much lower freshwater crab diversity of Lake Malawi compared to Lake Tanganyika is puzzling. The estimated age of the Lake Malawi basin is between 8 and 4 Ma (Delvaux, 1996) which means that Lake Malawi formed roughly at the same time as Lake Tanganyika (8–5 Ma; Lezzar et al., 1996; Cohen et al., 1997). One potential explanation for different evolutionary diversification rates between these two ancient lakes is attributed to the Pleistocene drying of the major lakes of the East African Rift Valley (Van Damme & Gautier, 2013), i.e. Lake Victoria (Johnson et al., 1996), Lake Tanganyika (Cohen et al., 1997) including Lake Malawi (Cohen et al., 2007; Scholz et al., 2011). The respective African megadroughts (Cohen et al., 2007; Lyons et al. 2015; Ivory et al. 2016) have most likely affected the biota of Lake Malawi (maximum depth ca. 700 m; Eccles, 1974) more severely than Lake Tanganyika (ca. 1470 m deep; Felton et al., 2007) (Van Damme & Gautier, 2013). Alternatively, it is possible that if the Malawi blue crab has inhabited Lake Malawi for a long time it might harbour cryptic diversity of morphologically similar but divergently evolving lineages, as has been demonstrated for species of freshwater crabs on the Seychelles (Daniels 2011; Cumberlidge & Daniels, 2014) and in Europe (Jesse et al., 2013). Lake Malawi’s long north–south extension of 570 km (and its relatively narrow east–west extension of only about 70 km) could produce at least some degree of genetic isolation-by-distance along the lake’s north–south axis, and/or (in absence of accidental drift of individuals) genetic differentiation between eastern and western shores. Indeed, freshwater crabs have not been reported from below 200 m depth in Lake Malawi (Cunnington, 1907; Cumberlidge & Meyer, 2011) which is considered anoxic in Lake Malawi (Vollmer et al., 2002).

The Malawi blue crab is currently assigned to the species Potamonautes lirrangensis (Rathbun, 1904), which is not endemic to Lake Malawi, but has a more widespread occurrence, reaching Lake Kivu in the north and the Congo River drainage in the Democratic Republic of the Congo in the west (Reed & Cumberlidge, 2006; Cumberlidge & Meyer, 2011; Daniels & Bayliss, 2012). The name ‘Malawi blue crab’ originates from the aquarium trade and does not indicate that populations of this species within Lake Malawi are morphologically different from populations outside the lake. However, the type locality of P. lirrangensis at Lirranga is at the confluence of Congo and Ubangi Rivers, about 2000 km from Lake Malawi (Rathbun, 1905). Phylogenetic divergence between Lake Malawi and the Upper Congo system was inferred for viviparid freshwater snails at ~4.5 Ma (Schultheiss et al., 2014), and this could well reflect a general biotic pattern. Molecular data indeed indicate considerable genetic divergence across the distribution range of P. lirrangensis (Marijnissen et al., 2006), raising the question of whether Potamonautes orbitospinus (Cunnington, 1907)—whose type locality is Lake Malawi—is actually a valid species. In the following, before more detailed molecular data become available, we will refer to the Malawi blue crab as P. lirrangensis, in line with the latest revisions (Reed & Cumberlidge, 2006; Cumberlidge & Meyer, 2011).

We investigated genetic differentiation of the Malawi blue crab within Lake Malawi based on mitochondrial sequence markers with the following aims: (1) we asked if there is evidence for morphologically cryptic diversity (Pfenninger & Schwenk, 2007; Jesse et al., 2013), (2) we reconstructed the past population demography of the lake’s population(s) to test if the observed patterns confirm hypotheses about the lake’s palaeogeography (Cohen et al., 2007; Scholz et al., 2011; Ivory et al., 2016), and (3) we evaluated genetic similarity/differentiation between populations along the western and eastern lake shores and tested for a potential signature of isolation-by-distance along the shore line.

Materials and methods

Sampling and acquisition of sequence data

Freshwater crabs were sampled at seven sites along the shores of Lake Malawi during 2012 (N = 60; Fig. 1A) and fixed in 70% ethanol. Crabs were hand caught in shallow water close to the beach—with the exception of the sites Mbamba bay and Chikale at the eastern shores that were sampled in slightly deeper water by local fishermen. We extracted DNA using the CTAB method (Del Sal et al., 1989) and amplified the partial mitochondrial NADH dehydrogenase subunit 1 (ND1, primers 16L11/NDH5, see Jesse et al., 2011; including the partial 16S rRNA gene and the tRNA Leu) and cytochrome b genes (Cytb, primers Cytb-MVZ15/Cytb2; Klaus et al., 2013). PCR conditions were 35 cycles; denaturing at 94°C for 45 s; annealing at 48°C for 1 min; extension at 72°C for 45 s. Sequencing was outsourced to GATC Biotech AG (Konstanz, Germany). Assembly of forward and reverse strands and sequence quality control was done in SEQUENCHER 5.2.3 (GeneCodes Corporation, Ann Arbor, U.S.A.). We submitted all new sequences to the European Nucleotide Archive (ENA; accession numbers LT837525–LT837584 for Cytb; and LT837856–LT837915 for ND1). We aligned the sequences manually, as there were no ambiguous sites, and checked for stop codons to minimize the risk of including pseudogenes. The final total alignment length was 507 bp for ND1 and 291 bp for Cytb.

Fig. 1
figure 1

A Location of the seven sampling sites of Potamonautes lirrangensis in Lake Malawi. Bathymetry is based on Scholz et al. (2011); dashed lines indicate the 500 m depth contour below the present lake level. B Pairwise ΦST-values between sampling sites (significant values indicated by an asterisk), and average number of pairwise nucleotide differences within sampling sites (diagonal values) based on combined mitochondrial Cytb and ND1 genes. Comparisons between and within the three southern sites are highlighted

Data analysis

Unless stated, otherwise, all population genetic analyses were conducted in ARLEQUIN version 3.5.1.2 (Excoffier et al., 2005). We estimated genetic heterogeneity of the total sample (as a proxy for the total lake population) and within each sampling site, namely haplotype diversity (h), which indicates the probability that two randomly drawn haplotypes are different, and nucleotide diversity (π), which indicates the mean number of differences between all pairs of haplotypes in a population, expressed as a percentage value (Nei, 1987).

Pairwise fixation indices (Wright, 1951) were computed based on genetic distances between sampling sites under a Tajima-Nei model (Tajima & Nei, 1984). Significance of fixation indices (ΦST) under the null hypothesis of no differences among populations was tested using a non-parametric permutation approach (Excoffier et al., 1992) with 10,000 permutations of haplotypes among populations. The population genetic structure within the Lake was explored with a series of AMOVAs (Excoffier et al., 1992) to identify the grouping scheme that results in maximal between- and minimal within-group variation. Significance of the results was estimated with 10,000 permutations.

To investigate if genetic similarity/differentiation between population pairs can be explained by linear distances (indicating migration through open waters) and/or shoreline distance, we tested for a correlation between linear as well as shoreline distance matrices (Table 1) and ΦST-values between all sampling sites (Fig. 1B) using a Mantel test with 1000 permutations (Mantel, 1967; Smouse et al., 1986). Geographic distances were measured in Google Earth (Alphabet Inc., Mountain View, U.S.A.).

Table 1 Matrix of distances between sampling sites (km) as used in the test for isolation-by-distance (Mantel test)

We asked if the demographic history of P. lirrangensis in Lake Malawi was subject to fluctuation in effective population size. To this end, we calculated a set of neutrality tests, testing for equilibrium between the loss of genetic variation through random drift and the accumulation of molecular diversity through mutation. We calculated Tajima’s D (Tajima, 1989) and Fay and Wu’s H statistic (Fay & Wu, 2000), whereby the latter was computed in DNASP 5.10.01 (Librado & Rozas, 2009). Those indices use the mutation frequency for assessing potential deviations from equilibrium while another index, Fu’s FS (Fu, 1997), is based on haplotype frequencies. The significance of D and FS was estimated by generating random samples under the null hypothesis of selective neutrality and population equilibrium using a coalescent simulation (Hudson, 1990). While D is more sensitive to population expansion, H is largely influenced by purifying selection, thus potentially allowing discrimination between both processes (Fay & Wu, 2000).

Subsequently, we applied a mismatch distribution analysis on the frequency distribution of observed numbers of differences between pairs of haplotypes using a generalized least-squares approach (Schneider & Excoffier, 1999). Populations at equilibrium are expected to show a multimodal distribution, while populations that have recently experienced either a demographic or range expansion are predicted to have a unimodal distribution (Rogers & Harpending, 1992). As an indicator of the smoothness of the distribution, we inferred the raggedness index, rg (Harpending et al., 1993; Harpending, 1994), and tested its significance with a parametric bootstrap approach (10,000 replicates) under the null hypothesis of population expansion.

Moreover, past demographic processes were inferred using a Bayesian skyline plot model (Drummond et al., 2005) as implemented in BEAST 1.8.2 (Drummond et al., 2012) and visualised in TRACER 1.5. This individual-based coalescent approach uses Bayesian MCMC sampling procedures to estimate a posterior distribution of gene genealogies and population parameters. We applied a HKY+G model of sequence evolution and a log-normal relaxed molecular clock model (Drummond et al., 2006) for all gene partitions, and ran the analyses for 507 iterations, sampling every 10,000th iteration, and discarding the first 106 as burn-in. We used an external substitution rate of 2% per Ma for all mitochondrial partitions with a standard deviation of 0.6% per Ma as inferred for mitochondrial genes of the freshwater crab genus Potamon based on the cessation of the Messinian Salinity Crisis, assuming that this event triggered divergence between Potamon fluviatile and Potamon algeriense (see Jesse et al., 2009). We set the number of groups (m) to 10. The resulting posterior distributions were then used to generate credibility intervals that represent phylogenetic and coalescent uncertainty in TRACER 1.6.0 (Fig. 2B). As we used mitochondrial sequence markers, population size estimates reflect the maternal effective population size.

Fig. 2
figure 2

Historical demography of the Malawi blue crab Potamonautes lirrangensis. A Mismatch distribution with identical expected distributions under both spatial and demographic expansion models, with only slightly differing 90% confidence intervals (outer interval boundaries: demographic expansion). B Bayesian Skyline plot indicating past changes in effective population size (median and 95% credibility intervals)

Results

Highest genetic differentiation in the northern lake basin

Overall, we found moderate haplotype diversity but only low levels of nucleotide diversity for the two mitochondrial loci (Table 2), supporting the view that no morphologically cryptic lineages of freshwater crabs exist in Lake Malawi. Irrespective of the grouping scheme of sampling sites, most of the genetic variation was due to diversity within sampling sites (Table 3). However, comparison of different AMOVAs revealed that grouping the three southern sites (Chikale, Nkotakhota bay, Cape McLear) versus the remaining sites maximizes between-group variation (Table 3). The three southern sites were characterized by moderate haplotype (h = 0.51–0.63) and low nucleotide diversity (π = 0.000698–0.001321; Table 2), low mean pairwise nucleotide differences within sites (0.60–1.10; Fig. 1B), and pairwise ΦST-values close to zero (Fig. 1B). In contrast, the northern sites harboured most of the genetic diversity of P. lirrangensis within Lake Malawi (h = 0.79–1.00; π = 0.001606–0.003774; mean pairwise nucleotide differences = 1.30–3.00; pairwise ΦST = 0.00–0.13; Table 2; Fig. 1B). A Mantel test found linear distances between sites to correlate significantly with genetic distances (i.e., pairwise ΦST; r = 0.442, P = 0.02) and to explain 21% of the genetic variance between sites. By contrast, shoreline distances did not correlate with genetic distances (r = 0.071, P = 0.36; < 0.001% variance explained).

Table 2 Sample sizes (N), haplotype code names detected at each site, haplotype (h) and nucleotide diversity (π) for the combined Cytb and ND1 genes, and museum voucher numbers (SMF, Senckenberg Forschungsinstitut und Naturmuseum, Frankfurt am Main) for each sampling site
Table 3 AMOVA results, demonstrating that grouping into northern and southern populations (grouping scheme A) maximised the among-group variation

Evidence for a recent population expansion

Negative values of Tajima’s D (−2.312; P < 0.001) and Fu’s FS (−24.760; P < 0.001) argue in favour of a recent (demographic or spatial) population expansion, or a scenario of purifying selection during a population bottleneck. The high value of Fay and Wu’s H (0.936), however, suggests that purifying selection most likely plays only a minor role compared to the contribution of a recent population expansion. Mismatch analysis revealed that the observed number of pairwise differences did not differ significantly from the expected distribution for both demographic and spatial expansion models (spatial expansion model: P-value of sum of squared deviations (SSD) = 0.45, rg = 0.062, P-value of rg = 0.44; demographic expansion model: P-value of SSD = 0.30, rg = 0.062, P-value of rg = 0.43; Fig. 2A). The Bayesian Skyline plot yielded particularly small estimates for the maternal effective population size (partly < 0 on a logarithmic scale; Fig. 2B)—most likely reflecting low overall genetic diversity—which can at most be interpreted as an approximation of changes in relative population size. Under this premise and the applied external substitution rate, the Bayesian Skyline plot suggests that population size increased between 70,000 and 30,000 years ago before present (Fig. 2B).

Discussion

We found no evidence for independently evolving—and thus, potentially sympatrically speciating—divergent mitochondrial lineages in the Malawi blue crab P. lirrangensis within Lake Malawi. This contrasts with the extraordinary species richness of the Lake Malawi cichlids (Danley & Kocher, 2001), and the occurrence of several divergent lineages of the gastropod genera Bellamya (Viviparidae; Schultheiss et al., 2011), Lanistes (Ampullariidae; Schultheiss et al., 2009) and Melanoides (Thiaridae; Genner et al., 2007; Schultheiss et al., 2009; Van Bocxlaer et al., 2015). Given the low genetic diversity as observed in our present study, the lake’s population of freshwater crabs most likely originated from a single colonization event, as has also been inferred for gastropods of the genus Lanistes (see Schultheiss et al., 2009); but not for thiarid gastropods (Genner et al., 2007) and haplochromine cichlids (Joyce et al., 2015). At this point, based on the available mitochondrial sequence data, we cannot exclude that there is very recent, ongoing genetic differentiation or incipient speciation in these freshwater crabs. Such a process would possibly go undetected in this study due to incomplete lineage sorting and/or slow evolution of the mitochondrial genes when compared, for example, to faster-evolving nuclear markers such as microsatellite loci or single nucleotide polymorphisms. Nevertheless, we can clearly reject the hypothesis of a completely panmictic population within Lake Malawi—as inferred for some pelagic cichlid species (Shaw et al., 2000)—on the basis of the genetic differentiation between populations from northern and southern sites. The recent population expansion of P. lirrangensis between 70,000 and 30,000 years ago, along with the extremely low genetic diversity at our southern sampling sites, argues in favour of the potential influence of the last East African megadrought. During that period (117–85 ka), Lake Malawi supposedly desiccated to approximately 2% of its original size in a then semiarid environment, and only the northern, deeper parts of the lake remained filled with water (Scholz et al., 2011). After a period with fluctuating lake levels (85–71 ka) and another low-stand at 62 ka (−200 m; Scholz et al., 2011), the lake level rose again (Ivory et al., 2016). The shallow southern parts of Lake Malawi were supposedly flooded last, thus explaining the low genetic diversity of the southern populations. Alternatively, higher levels of gene flow in the southern portion of the lake, along with stronger isolation between populations in the northern parts of the lake (assuming that freshwater crabs do not disperse through the deeper, anoxic waters in the northern parts of the lake), could result in a similar pattern. However, we could show that linear distances are a better predictor of genetic differentiation between sampling sites than shoreline distances. This indicates that gene flow does not strictly follow a stepping-stone model along the coastline (i.e., each population receives migrants mainly from neighbouring populations as, e.g. in the Lake Malawi cichlid Melanochromis auratus, see Albertson et al., 1999; Markert et al., 1999; or in the stone loach of Lake Constance, see Barluenga & Meyer, 2005), but most likely allows for erratic dispersal across the open water throughout the lake. We propose that the results of our present study should be confirmed based on a denser sampling of P. lirrangensis along the lake’s shores.

Our hypothesis that the Malawi blue crab was severely affected by the Pleistocene East African megadrought is congruent with the higher species diversity of Lake Tanganyika’s freshwater crabs (Marijnissen et al., 2004, 2006). The Lake Tanganyika basin is twice as deep as Lake Malawi, and most likely retained several isolated water bodies during times of maximum lake level low-stands (Cohen et al., 1997) that enabled the persistence of ancestral lineages (see Van Damme & Gautier, 2013) and might have triggered allopatric speciation in the isolated water bodies. A similar pattern applies for cichlid lineages, as Lake Malawi, despite its high species number, harbours only cichlids of the tribe Haplochromini, while as many as 14 tribes occur in Lake Tanganyika (Meyer et al., 2015). Also, much smaller Southeast Asian ancient lakes, under more stable palaeoclimatic conditions, harbour a variety of primary freshwater crab species belonging to different trophic guilds (Schubart et al., 2008; Poettinger & Schubart, 2014). Therefore, at least for freshwater crabs, Lake Malawi is apparently not all that ancient, with the time elapsed since the last Pleistocene megadrought being too short for sympatric speciation to occur.