Introduction

One of the main goals of studies of evolutionary research is to identify which forces were responsible for shaping the evolutionary history of extant taxa (Cowman and Bellwood 2013). Evolutionary mechanisms that act in marine environments and on marine species differ from those affecting terrestrial environments and species (Mayr 1954). The relative importance of population isolation versus interconnectivity is a central point in this discussion, alongside the premise that allopatric barriers comprise the primary speciation process initiation point and, in many cases, the promoting factor (Bowen et al. 2013). Connectivity between populations plays a key role in the dynamics of species that inhabit marine environments, where population genetic diversity and differentiation are directly affected by three fundamental evolutionary forces: natural selection, genetic drift, and gene flow (Slatkin 1987).

It has long been thought that marine species typically exhibit broad geographic distributions with continuously interconnected populations (Cox and Moore 2005). This view, however, has changed in recent years, with the observation of spatially disjunct populations resulting from speciation events associated with past vicariant events (Cowman and Bellwood 2013; Sales et al. 2017; Bowen et al. 2020). Thus, oceanographic regions presenting high environmental heterogeneity (e.g., in currents and temperature and salinity gradients) can represent substantial barriers to population connectivity and exert strong speciation pressure, most notably on species with wide distributions across large latitudinal ranges (Hare et al. 2005; Galarza et al. 2009). The role of oceanographic heterogeneity as a speciation-promoting factor has not, however, been clearly investigated for some marine biogeographic regions (Cowman et al. 2017; Delic et al. 2020). In this sense, species with the ability to migrate or move throughout the water column would not be directly affected by soft barriers (Floeter et al. 2008).

Loliginidae (Lesueur 1821) comprises several commercially important squid species present in neritic and coastal regions worldwide, inhabiting both tropical and subtropical waters and present on the continental shelf, seagrasses, and coral reefs (Vecchione et al. 2005; Jereb and Roper 2010). Due to the high environmental heterogeneity of coastal and shelf habitats under distinct hydrological and oceanographic regimes, which directly influence the dispersal capacity of both adults and the paralarva stage (Boyle and Rodhouse 2005), loliginids have played a significant role as organisms capable of indicating past biogeographic events. For example, the absence of hard marine substrates for egg mass deposition has been shown to be a determinant in promoting speciation within some species (Sales et al. 2017; Costa et al. 2021; Jesus et al. 2021) as well as biological traits (Ibañez et al. 2023).

In recent years, molecular revisions have aided in the molecular delimitation and phylogenetic positioning of rare species or those with questionable taxonomic status (Sales et al. 2019; Anderson and Marian 2020; Avendaño et al. 2020; Pardo-Gandarillas et al. 2021). Molecular techniques have also demonstrated the presence of a high number of cryptic lineages within cephalopod species previously indicated as distributed across wide regions. For example, in the East Atlantic, the Strait of Gibraltar was found to separate lineages within Sepiolidae Leach, 1817 (Fernández-Álvarez et al. 2021), while in the southwestern Atlantic, ecological speciation linked to the utilization of new habitats has been proposed for Pickfordiateuthis G. L. Voss, 1953 (Anderson and Marian 2020), and the formation of the Caribbean and associated changes in suitable areas for reproduction has been proposed for Lolliguncula brevis (Blainville 1823) (Sales et al. 2014; Costa et al. 2021) and Doryteuthis pleii (Blainville 1823) (Sales et al. 2013; 2017). The latter genus comprises species of high economic importance, especially Doryteuthis pleii, Doryteuthis gahi (d’Orbigny, 1835), Doryteuthis opalescens (S. S. Berry, 1911), and Doryteuthis pealeii (Lesueur 1821) (Jereb and Roper 2010).

D. pealeii is widely distributed across shelf regions of the northwestern Atlantic Ocean and Caribbean Sea, from Newfoundland (47°N) to the Orinoco Delta (10°N), and is usually found far from the coast in colder regions (Jereb and Roper 2010). The species is similar in appearance, sympatric throughout the majority of the northwestern Atlantic Ocean, Gulf of Mexico, and the Caribbean Sea, with D. pleii (Sánchez et al. 1996; Jereb and Roper 2010). However, there is some evidence of distinct subpopulations within D. pealeii (Herke and Foltz 2002), and recent studies have indicated that the distribution of this species extends to the southwestern Atlantic, possibly representing a cryptic lineage (Sales et al. 2013). The main aims of the present study are to use a broader sampling to verify the presence and determine the degree of divergence of cryptic lineages within D. pealeii utilizing mitochondrial and nuclear markers, and discuss the biogeographic events that could have been responsible for causing diversification in these lineages.

Materials and methods

Sampling and molecular procedures

D. pealeii samples were collected in the Western Atlantic from 2009 to 2014, totaling 39 individuals from the southwestern Atlantic (33), the Gulf of Mexico (4), and the Western North Atlantic (2) (Fig. 1A, Table S1). All samples were obtained under a full license for biological material collection (Brazilian biodiversity collection authorization license: Permanent License SISBIO 5857–1) and SisGen research registration (AD53401). Two datasets were created, the first a population analysis database composed only of cytochrome C oxidase subunit I (cox1) sequences and the second, a divergence time inference database composed of sequences of two mitochondrial fragments, the mitochondrial large ribosomal subunit (rrnL, also known as 16S, and cox1) and one nuclear fragment (Rhodopsin). All molecular procedures (DNA extraction, PCR and sequencing of rrnL, cox1 and Rhodopsin gene fragments) were based on the protocols described by Sales et al. (2013, 2017).

Fig. 1
figure 1

A Sampling places and sequence origin utilized in the present study. Numbers on the shorelines of Northern Brazil, Gulf of Mexico, and the US represent the number of sequences sampled from each point. Colors represent marine ecoregions according to Spalding et al. (2007). Dark blue: Virginian; Green: Northern Gulf of Mexico; light blue: Southern Gulf of Mexico and cream: Amazonia; B species tree generated in *BEAST for Doryteuthis based on three genes (rrnL, coxI, and Rhod); the ML and BI trees have a similar topology. Only support values greater than 80% (0.8) are presented. The support values correspond to ML and BI, respectively. Bars on the right represent the species delimitation methods results for each species/lineage; C haplotype network of coxI sequence data using the maximum-likelihood method (GTR + I + G model). Map projection on WGS 84

Sequence alignment, phylogenetic methods, and species delimitation

Sequences from Sales et al. (2013) and Costa et al. (2021) (rrnL, cox1, and Rhod) were implemented in addition to these in our concatenated dataset employed for divergence estimation, maximum likelihood and species tree inference, and reconstruction of ancestral states. For the population inference dataset (haplotype networks, species delimitation methods, and populational demography analyses), cox1 sequences from Herke and Foltz (2002) and Díaz-Santana-Iturrios et al. (2019) were added to the dataset generated herein, generating a final total of 85 sequences (Table S1).

For the rrnL sequences generated herein, the hypervariable regions containing indels were excluded by eye from the final pool to avoid any homologous alignment errors. The sequences were then aligned using the Clustal W tool (Thompson et al. 1997) available in the BioEdit v.7.2.6.1 program (Hall 1999). The most appropriate evolutionary models for each of the datasets for each marker were estimated using jModelTest 2 (Darriba et al. 2012) where the AIC criterion was used to determine the evolutionary model to be used for phylogenetic inferences employing the maximum-likelihood (ML) technique. The ML tree with the concatenated dataset of three markers (rrnL + cox1 + Rhod) was built using PhyML 3.0 (Guidon et al. 2010), employing 1000 bootstrap pseudo-replicates (Felsenstein 1985) for node support values under HKY + G for rrnL, GTR + I + G for cox1, and GTR + I for Rhod evolutionary models selected by jModelTest 2.

The StarBEAST3 procedure (Douglas et al. 2022) was run in BEAST 2.7 (Bouckaert et al. 2019) where the definition of each species was made based on the results obtained in the previously inferred ML tree, where only lineages with more than 80% branch support values in the ML tree were selected as separate species a priori. The substitution models of each data partition were then unlinked in BEAUTi and the models previously selected by jModelTest 2 (TN93 + I for rrnL, GTR + I + G for cox1, and TN93 for Rhod) were implemented. We then applied a Strict clock model and a Yule Process as species tree prior (Heled et al. 2011) to perform four simultaneous analyses using Markov Chain Monte Carlo (MCMC) simulations for 80 million generations, with one sample taken every 40,000 generations. Tracer 1.5 (Rambaut and Drummond 2009) was used to check ESS values; only runs with ESS values greater than 200 for all marginal parameters were used. The first 10% of the trees were discarded as burn-in during the production of a maximum clade credibility (MCC) tree using TreeAnnotator v. 2.5 (Bouckaert et al. 2019).

We performed three different species delimitation methods using the cox1 dataset: single- and multiple-threshold Generalized Mixed Yule Coalescent (s-GMYC and m-GMYC, Fujisawa and Barraclough 2013; Pons et al. 2006), Bayesian Poisson tree process (bPTP) (Zhang et al. 2013), and Automatic Barcode Gap Discovery (ABGD) (Puillandre et al. 2012). For the delimitation of mtDNA lineages using GMYC, we built an ultrametric tree. For this, the outgroups were removed and only one sequence per haplotype was maintained to avoid the accumulation of short branches on the tree. For the selection of unique haplotypes, we used the ElimDupes portal (https://www.hiv.lanl.gov/content/sequence/elimdupesv2/elimdupes.html). With this haplotype dataset, we estimated an ultrameric tree in BEAST v. 2.5 (Bouckaert et al. 2019) using the Yule model as a tree prior and a strict molecular clock. The analysis was performed with two independent runs of 2 × 107 MCMC generations each, sampling every 1,000 generations. Convergence of the chains and ESS values were verified in Tracer 1.5 (Rambaut and Drummond 2009). An MCC tree was obtained using TreeAnnotator (Bouckaert et al. 2019) to be used as an input tree for GMYC analyses. In the case of bPTP, an ML tree was constructed in PhyML 3.0 (Guidon et al. 2010) under the TrN + I model (-InL = 1233.6799) estimated in jModelTest 2 for the unique haplotype dataset. This tree was used as a guide tree. Both analyses (GMYC and bPTP) were then performed using the Species delimitation web server [Species delimitation server (h-its.org)], and for bPTP, the tree was reconstructed based on the GTRGAMMA model and 1,000 bootstrap iterations.

Finally, for ABGD, analyses were performed through the ABGD web server (https://bioinfo.mnhn.fr/abi/public/abgd/abgdweb.html) using three distinct distance options (Jukes-Cantor, Kimura 2-parameter and Simple), intraspecific prior ranges from 0.001 to 0.25 in ten steps and a relative gap width of 1.5 following the default parameters of ABGD (Puillandre et al. 2012).

Divergence time estimation and historical biogeographic analyses

The divergence times between D. pealeii lineages were estimated using BEAST 2.5 (Bouckaert et al. 2019) employing the partitions of the three genes. The HKY + I + G (rrnL, cox1) and TrNef + I + G (Rhod) models were indicated as optimal, but the model used for the Rhodopsin dataset was GTR + I + G; TrNef is not available in BEAUti. The Yule speciation prior was used for the tree prior, modeled with a relaxed log-normal clock (Drummond et al. 2006). Relaxed log-normal distributions were employed for all calibration priors. We used the same calibration points as Sales et al (2017) and included the ones estimated for Lolliguncula in Costa et al. (2021), as follows: (a) origin of Vampyromorpha (~ 162 million years ago) (Fischer and Riou 2002; Strugnell et al. 2006), (b) estimated origin of Doryteuthis (41.2–47.8 mya) (Neige et al. 2016), (c) probable origin of Loligo Lamarck, 1798 (~ 44 mya) based on loliginid statoliths found from mid-Eocene (Clarke and Maddock 1988; Strugnell et al. 2006), (d) the split between Pacific and Atlantic Lolliguncula lineages (~ 18.4 mya), (e) the middle Miocene orogeny of the Caribbean plate (~ 15.8 mya), (f) the separation of the northern and southern L. brevis lineages (~ 13.7 mya), and (g) a ~ 3 mya statolith fossil identified as belonging to D. opalescens (Clarke and Fitch 1979). Four independent MCMC runs with 200 million interactions were used, with samples taken every 1000 generations. The MCMC log files were combined in Tracer 1.5 (Rambaut and Drummond 2009) to summarize posterior divergences times with 95% highest posterior density limits; only values equal to or higher than 200 for all marginal parameters were used after discarding 10% of the first trees as burn in.

RASP 4 (Yu et al. 2020) was utilized for biogeographic ancestral area reconstruction employing the Bayesian Binary Method (BBM). The TMRCA tree generated previously for the concatenated data set was used as a guide tree along with the sampling locations of the analyzed species summarized in Spalding et al (2007), employing ecoregion levels for reconstructions. The BBM analysis was run for seven million cycles, using ten chains, and one sampling every 100 cycles. The temperature was set at 0.1 and a fixed Jukes–Cantor model was used. The maximum number of areas for all nodes was set to four. All information was then summarized and plotted as a pie chart.

Demographic history

We used the cox1 dataset to estimate the genetic diversity indices of haplotype number (Hap), haplotype diversity (H), and nucleotide diversity (π) using Arlequin 3.01 (Excoffier et al. 2005), because it was more extensive. A neutrality test was performed to detect population expansion signals, using Tajima’s D (Tajima 1989) and Fu’s Fs (Fu 1997) with their respective P values based on 1000 coalescent simulations. The pairwise FST fixation index (Weir and Cockerham 1984) and an Analysis of Molecular Variance (AMOVA; Excoffier et al. 1992) were also applied to test for significant genetic differentiation among sampling areas. Genealogical haplotype relationships were obtained by creating a haplotype network using the maximum-likelihood algorithm estimated for the cox1 dataset in jModelTest 2 (Darriba et al. 2012) using the Haploviewer software (Salzburger et al. 2011).

The genetic structure of D. pealeii was inferred using GENELAND v4.0.3 (Guillot et al. 2005a, 2005b) implemented in R software (v.4.0.2), using Rstudio (R Core Team 2020). For these simulations, we used two datasets: cox1 and concatenated (rrnL + cox1 + Rhod) for estimating the similarity of populations across the sampling areas. Five independent runs were performed with 5,000,000 Monte Carlo Markov Chain (MCMC) iterations, sampling every 100 iterations, and discard of the first 10% of samples as a ‘‘burn-in” phase. The best run was selected using the greatest mean posterior probability using the post-processing function, and the ‘‘Map of population membership” and ‘‘Map of probability of population membership” produced based on populations identified under Hardy–Weinberg equilibrium with linkage equilibrium between loci (HWLE). Nucleotide distance matrix, based on uncorrected p distances, was generated in MEGA 11 (Tamura et al. 2021) for cox1 dataset.

Results

Phylogeographical inferences

Both phylogenetic reconstructions supported the separation of the South Atlantic D. pealeii lineage from the North Atlantic/Gulf of Mexico lineages, with high statistical support values (100/1 for ML/BI, respectively, Fig. 1B) and with 22 mutations separating South Atlantic populations in relation to the North Atlantic/Gulf of Mexico haplotype clusters in cox1 (Fig. 1C). A second structuring, indicated by high Bayesian posterior probability and moderate ML support was detected within the North Atlantic/the Gulf of Mexico lineage, indicating the possible presence of another cryptic D. pealeii lineage in the region (Fig. 1B). The s-GMYC and m-GMYC species delimitation analyses separated D. pealeii from Gulf of Mexico, Gulf of Mexico (North Atlantic) and North Atlantic, but there was no genetic separation between these sampling locations under the bPTP and ABGD analyses (Fig. 1B).

According to the TMRCA analysis, the time of separation between Lolliguncula and Doryteuthis was estimated at ~ 49.80 mya (Fig. 2, node I, Table 1). Separation of the clade containing the ancestors of most Atlantic lineages from the D. gahi/D. sanpaulensis/D. opalescens ancestor occurred ~ 40.22 mya (Fig. 2, node II, Table 1) followed by separation between D. gahi and the D. sanpaulensis/D. opalescens ancestor, at ~ 35.92 mya (Fig. 2, node III, Table 1). The separation between the ancestors of the D. pleii + D. surinamensis/D. pealeii species complex occurred ~ 34 mya (Fig. 2, node IV, Table 1), followed by the separation between D. sanpaulensis and D. opalescens (~ 30.76 mya, Fig. 2, node V, Table 1). The South Atlantic lineage of D. pleii (including samples from around the type locality from Isla Margarita) split from D. surinamensis at ~ 12 mya (Fig. 2, node VI, Table 1). The separation between the southern Atlantic D. pealeii lineages and the northern D. pealeii lineage and Gulf of Mexico D. pealeii lineage ancestor occurred ~ 8.5 mya (Fig. 2, node VII, Table 1), finally followed by the separation of these latter two D. pealeii lineages at approximately 2.11 mya (Fig. 2, node VIII, Table 1).

Fig. 2
figure 2

Estimate of TMRCA of Doryteuthis inferred from BEAST 2, using a relaxed log-normal clock and the concatenated three-gene dataset. Pie charts on the nodes represent the ancestral area reconstruction estimated with RASP4 (BBM). Red numbers represent nodes detailed in Table 1

Table 1 Summary of RASP 4 (BBM) ancestral area reconstructions

The RASP 4 analysis indicated that the ancestral area for Doryteuthis was the Tropical Northwestern Atlantic region, with a high association with the Southern Gulf of Mexico ecoregion (42.77%) (Fig. 2, node I, Table 1). The Tropical Northwestern Atlantic region remains the primary ancestral area for the Lolliguncula + Doryteuthis ancestor, and the most recent common ancestor of Doryteuthis probably also lived in the Tropical Northwestern Atlantic (55.88%) (Fig. 2, node II, Table 1). The Malvinas/Falkland ecoregion was inferred as the ancestral area for the most common ancestor of D. gahi, D. sanpaulensis, and D. opalescens (62.63%) (Fig. 2, node III, Table 1). The ancestor of the remaining Doryteuthis lineages (D. pealeii, D. plei, and D. surinamensis) appears to have originated in the Southern Gulf of Mexico (82.90%) (Fig. 2, node IV, Table 1). This was followed by later dispersal of D. sanpaulensis to the Warm Temperate Southwestern Atlantic region, occupying the Southeastern Brazil ecoregion (43.27%), while D. opalescens became restricted to the Warm Temperate Northeast Pacific region, in the Southern California Bight ecoregion (27.50%) (Fig. 2, node V, Table 1). Later speciation processes within the genus resulted in distributions of the remaining lineages associated with the Guianan (D. pleii Southwestern + D. surinamensis) (26.68%) and Amazonian (D. pealeii South Atlantic) ecoregions (VI and VII nodes, respectively), but always with the predominant origins associated with the Southern Gulf of Mexico ecoregion (51.05% and 78.35%, respectively), especially for the two D. pealeii lineages present in the North Atlantic and Gulf of Mexico (Fig. 2, node VIII, Table 1).

Overall, the non-corrected nucleotide distances (p) estimated for the cox1 dataset show that the intraspecific average level of divergence estimated between D. pealeii from North Atlantic/Caribbean in relation to the South Atlantic D. pealeii lineage was 0.051 (~ 5.1%) corresponding to the lowest nucleotide diversity value of Doryteuthis (Table 2), but very close to other p distances associated with cryptic species in other loliginid genera.

Table 2 Genetic divergences (P) estimated for coxI for Doryteuthis species

Population and demographic analyses

Two distinct grouping schemes for D. pealeii, one based simply on geography and the other based on geography and phylogenetic history, were simulated. For the first one, AMOVA shows no structuring detected between the North Atlantic, Gulf of Mexico (USA) and the Gulf of Mexico. In the second, the North Atlantic/Gulf of Mexico localities formed one group and the South Atlantic another group (Table 3).

Table 3 Indices of genetic differentiation (Fst, lower diagonal) based on the cox1 gene, obtained for populations of D. pealeii used in the present study

The second AMOVA results reinforced differentiation between these groups, with the highest differentiation index noted between groups (North Atlantic/Gulf of Mexico x South Atlantic, 88.28%) compared to within groups (11.51%) (Фst: 0.88 P > 0.05). The same pattern was noted when comparing the Fst indices, where the South Atlantic populations exhibited high and significant differentiation indices compared to the North Atlantic, Gulf of Mexico (North Atlantic), and Gulf of Mexico populations (Table 4). Following the AMOVA structure recovered in our study, GENELAND analysis supported two genetically distinct groups (K = 2) with high posteriori probability (bars on Fig. 3a and b). All samples from the South Atlantic had isolated from the Gulf of Mexico/Gulf of Mexico (North Atlantic) and North Atlantic groups with a posterior attribution probability of 90%, whereas the other five sampling locations had posterior probabilities of attribution to that group of only 10% (Fig. 3A). The opposite pattern [isolation of Gulf of Mexico/Gulf of Mexico (North Atlantic) and North Atlantic with 90% posterior probabilities in relation to South Atlantic] was recovered (Fig. 3B).

Table 4 Values of Fst and AMOVA for populations of Doryteuthis pealeii surveyed in this study
Fig. 3
figure 3

Population structure of Doryteuthis pealeii. The highest probability is indicated in the bar graphs in each figure. A Spatial posterior probability of belonging to northwestern Atlantic D. pealeii populations; B spatial posterior probability of belonging to south Atlantic D. pealeii populations. Map projection on WGS 84. Colors representing each population are as described for Fig. 1

A total of 39 haplotypes were defined by variation in the cox1 fragment sequenced here and showing high haplotypic and low nucleotide diversity (Table 5). Individually, the North Atlantic and Gulf of Mexico populations displayed the highest haplotypic diversity rates, with the latter displaying the highest number of unique haplotypes. In the case of the South Atlantic population, haplotypic diversity was almost half that estimated for the North Atlantic and Gulf of Mexico populations, even though a higher number of sampled individuals was assessed (Table 5). Values of Fu’s Fs for all populations were negative and significant. Values of Tajima’s D were negative (also indicative of population expansions) but not significant for all populations (Fig. 2).

Table 5 Indices of genetic diversity obtained for cox1 for populations of Doryteuthis pealeii used in this study

Discussion

The cryptic history of Doryteuthis diversity

Sales et al. (2013) performed the first molecular phylogenetic analyses including D. pleii and D. pealeii samples from the South Atlantic. The results clearly indicated that these species each comprise two genetically distinct clades, one occupying the Gulf of Mexico and North Atlantic and the second, the northern portion of the South Atlantic. This pattern was later confirmed for D. pleii, where, even following an extremely broad sampling of the South Atlantic lineage (Venezuela to Southern Brazil), the two lineages (North Atlantic/Gulf of Mexico and South Atlantic) were very well defined (Sales et al. 2017). In this study, we observed three different lineages of D. pealeii in the North Atlantic, the Gulf of Mexico and the South Atlantic by analyzing regions of two mitochondrial genes and one nuclear gene.

The cox1 genetic p distances inferred here are also concordant with previous distances inferred for other cryptic loliginid lineages (Cheng et al. 2014; Sales et al. 2013, 2014, 2017) as well between valid species of Loligo (5.7% between Loligo reynaudii d'Orbigny [in Ferussac & d'Orbigny], 1839–1841 and Loligo. vulgaris Lamarck, 1798) where it reflects a very recent divergence (Costa et al. 2021). The divergence between populations of D. pealeii from the North Atlantic and South Atlantic is therefore comparable to species-level divergence observed in other loliginid squid. The Fs and D values suggest a historical population expansion, and it seems that D. pealeii lineages most likely followed a similar process as that proposed for D. pleii lineages (Sales et al. 2017) as the data show the same patterns (higher haplotype diversity in the northern lineage and lower haplotype diversity in the southern lineage).

A genetic break between D. pealeii populations in the Gulf of Mexico and the North Atlantic has been previously reported (Herke and Foltz 2002), although no sign of cryptic speciation was noted by the authors. Their use of only cox1 may have been responsible for this proposition as no separation between the North Atlantic and Gulf of Mexico lineages was observed in the present study when analyzing the datasets for individual genes, reinforcing the need for phylogenetic investigations employing multiple markers reflecting different evolutionary rates, as well as adequate phylogenetic reconstruction methods that minimize the effect of incomplete lineage sorting (ILS) (Maddison and Knowles 2006), as well as sampling along the entire known area of occurrence of the target species, to increase the phylogenetic signal of the reconstructions (Zwickl and Hillis 2002).

The utility of these approaches has been shown in previous species assessments within Decapodiformes (Anderson et al. 2011; Fernández-Álvarez et al. 2021; Sanchez et al. 2021) and especially for loliginids (Anderson et al. 2008; Sales et al. 2013; 2014; Cheng et al. 2014; Anderson and Marian 2020; Costa et al. 2021). The presence of two additional lineages within D. pealeii raises some additional questions. Cohen (1976) described two species of Doryteuthis (Loligo at the time) from the West Atlantic: one similar to D. pealeii, but apparently limited to the Caribbean Sea, from the edge of the Grand Bahamas Bank near Punta Alegre in Cuba (Doryteuthis ocula Cohen 1976) and another, similar to D. pleii, with a wider distribution, occurring in both the Caribbean Sea and in the Gulf of Mexico, with the type specimen from Bimini Island, Bahamas (Doryteuthis roperi) (Cohen 1976; Jereb and Roper 2010). However, these latter two species are the subject of recent debates about their validity (Sales et al. 2013; Díaz-Santana-Iturrios et al. 2019).

Doryteuthis ocula has been confused with D. pealeii and D. roperi with D. pleii (Cohen 1976; Jereb and Roper 2010; Díaz-Santana-Iturrios et al. 2019). For the latter, Díaz-Santana-Iturrios et al. (2019) performed morphological, morphometric, and genetic revisions of loliginid species present in the Gulf of Mexico, utilizing museum specimens of D. roperi. The authors pointed out several similarities between D. pleii and D. roperi, and noted that it was impossible to separate females of the two species due to overlap of the described morphological characteristics (Blainville 1823; Cohen 1976). For the males, no consistent differences were observed in hectocotylus sucker counts or in the shape of the pedicels of the hectocotylized suckers; however, the authors state that the full sexual maturity of all the examined males was warranted. The authors concluded that although the lack of consistent differences suggests that D. pleii and D. roperi represent the same species, molecular evidence was necessary to test this hypothesis. Based on this, the authors suggest that these characters (sucker counts and pedicel shape) must not be considered for identification for Doryteuthis species (Díaz-Santana-Iturrios et al. 2019), with similar suggestions also made for Lolliguncula (Sales et al. 2014). However, the results of Díaz-Santana-Iturrios et al. (2019) must be interpreted with caution, because the authors do not clearly state whether the individuals analyzed in their study were mature or not (they considered the “size-at-maturity” of these squids and determined that every squid larger than that size was mature). Size at maturity can be very variable for squid species (Juanicó 1983; Forsythe and Van Heukelem 1987; Hatfield and Cadrin 2002).

Only three species of Doryteuthis (D. pleii, D. pealeii, and D. ocula) have been reported for the Gulf of Mexico (Judkins et al. 2009; Jereb and Roper 2010). We speculate that D. ocula and the lineage of D. pealeii reported for the Gulf of Mexico in the present study could be the same species, a question that can only be clarified with further morphological and molecular comparisons. Three Doryteuthis species have been previously listed for the South Atlantic coastal waters: Doryteuthis surinamensis (G. L. Voss, 1974), D. pleii, and Doryteuthis sanpaulensis (Brackoniecki, 1984). The latter was described by Brackoniecki (1984) after he considered Loligo brasiliensis Blainville 1823 a nomen dubium, that was also used for other related species in Blainville's original description (Brakoniecki 1984).

Until recently, the southern limit of the distribution of D. pealeii was the Gulf of Venezuela in the Caribbean Sea (Jereb and Roper 2010). However, Sales et al (2013) identified D pealeii collected from the Northern Coast of Brazil based on their phylogenetic proximity with the North Atlantic lineage, for which the type specimen from South Carolina was described by LeSueur (1821). Further comparative morphological and molecular studies are necessary to establish the taxonomic status of the southern lineage of D. pealeii to clarify the present diversity of Doryteuthis from the Western Atlantic.

Biogeographic implications

Vicariant events in Caribbean biogeography have impacted the evolution of several marine species (Bellwood et al. 2004; Baums et al. 2005; Rocha et al. 2005; Sales et al. 2017; Costa et al. 2021). The separation time between Doryteuthis and Lolliguncula estimated in the present study is within the confidence interval proposed by Costa et al. (2021) for the separation between these genera and possibly the formation of shallower waters associated with the beginning of the formation of the Isthmus of Panama (~ 40 mya) (O’Dea et al. 2007) affecting ancestors belonging to both genera. Although Doryteuthis was not the focus of Costa et al. (2021), some of the biogeographic patterns inferred by those authors were also recovered here (i.e., dispersion routes of Doryteuthis and Lolliguncula) and the timing of cladogenetic events was concordant between both studies. However, there are some differences between our results and the results of Ulloa et al. (2017). After diverging from the Lolliguncula lineage, two major Doryteuthis clades were formed followed by an extinction event in the Southern Gulf of Mexico ecoregion (Table 1, node II). One clade comprised the ancestor of the western Atlantic Doryteuthis species, and another comprised the D. gahi lineage and the ancestor of D. sanpaulensis and D. opalescens. In our study, the D. gahi lineage arose ~ 35.9 mya in the Malvinas/Falkland ecoregion, resulting in reconstruction of the southwestern Atlantic as the ancestral area of Doryteuthis, as proposed by Ulloa et al. (2017).

By that time, the Caribbean plate was compressed as it was forced between the westward-drifting North American and South American plates, resulting in uplift of the Aves Ridge (Karig 1972) to form an ostensibly continuous land bridge linking the Puerto Rico region to northern South America (Ali and Hedges 2021) establishing the Greater Antilles and Aves Ridge land bridge (GAARlandia) (MacPhee and Iturralde-Vinent 1994; Iturralde-Vinent and MacPhee 1999; Philippon et al. 2020; Garrocq et al. 2021). However, unlike the Pacific Lolliguncula species, the Tropical Northwestern Atlantic region and the Southern Gulf of Mexico ecoregion were the predominant ancestral areas for Doryteuthis. During this period, the D. gahi/sanpaulensis/opalescens ancestor separated from the ancestor of the other Atlantic lineages and dispersal took place toward colder waters to the North American Pacific, as well toward colder waters of southern South America.

During this period, the planet underwent severe climatic transitions, changing from warm, high-sea-level conditions during the Paleocene and early Eocene to a colder climate in the Oligocene, with rapid formation of the Antarctic glaciation at ~ 34 mya (Zachos et al. 2001; Miller et al. 2005). At this time, ocean cooling and opening of the Drake Passage around the southern tip of South American created a mixed region of cold waters rich in nutrients, which, alongside pulses of warm waters and winds, produced an upwelling region with increasing primary productivity, especially in the Tropical and Subtropical Atlantic (Zachos et al. 1996; Suto et al. 2012). This process may have shaped the distribution of D. gahi, as well as the distribution of D. sanpaulensis in the subtropical Atlantic region, as reflected in the TMRCA estimated here and the routes estimated with RASP 4 (Malvinas/Falkland ecoregion).

Additionally, the decreasing oceanic temperatures of this period generated several sharp primary productivity reductions, leading to changes in the abundance of both nektonic and planktonic organisms, which generated regions containing lower concentrations of marine resources (Shackleton et al. 1984; Rea et al. 1991; Zachos et al. 1996). Thermohaline and atmospheric circulation also intensified, leading to high oceanic turnover rates (Mackensen and Ehrmann 1992; Diester-Haass 1992), thus moving primary productivity zones previously associated with coastal zones to shelf areas (Sarnthein et al. 1987). These changes in primary productivity coupled with decreasing marine temperatures may have been responsible for shaping the dispersal of the ancestor of the D. pleii/D. surinamensis species complex to the southwestern Atlantic, where this environment was more suitable to maintaining populations of these two species and making them more associated with coastal regions, which was different from the D. pealeii species complex ancestor. These temperature and primary productivity changes may have also influenced the vicariance and subsequent dispersal of D. sanpaulensis to the Southeastern Brazil ecoregion and D. opalescens to the Southern California Bight (27.50%) via the South Atlantic/Western Pacific, as indicated by RASP 4. Also, at that time (~ 30 mya), there was no land connection between North and South America, meaning that it is possible that this dispersal occurred directly from the NW Atlantic to the NE Pacific.

The two D. pleii lineages appear to have separated from each other during the formation of the Lesser Antilles during the Middle Miocene (~ 16 mya) as part of the tectonic processes that formed the Caribbean plate (the orogeny of the western portion of the Andes), which modified the plate between the Caribbean and South America in a convergent fault, intensifying local volcanism (James 2005). Additionally, a series of changes to local ocean circulation currents caused by global ocean circulation pattern alterations (De Schepper et al. 2015; Cárcamo-Tejer et al. 2021) resulted in sea floor topography alterations, perhaps affecting substrate availability for egg laying by squids.

The continuity of the Andes uplift, especially during the late Miocene (~ 10 mya), altered the course of the Amazon River (Figueiredo et al. 2009; Hoorn et al. 2017), creating a less saline and more turbid environment in the Western South Atlantic. At this time, significant sea level changes took place globally, decreasing by more than 70 m compared to today, and more than 100 m in relation to the previous sea level (Haq et al. 1988). This probably constituted the combination of events responsible for the vicariance between D. pleii and D. surinamensis in the region, since this phenomenon has been already associated with speciation in other marine taxa in the region (Tourinho el al. 2012; Rodrigues-Filho et al. 2023). Doryteuthis surinamensis became more associated with the Amazon plume-influenced region (Guianan ecoregion, 26.68%) in contrast with D. pleii, which had recently separated from the North Atlantic/Gulf of Mexico lineage and may have been undergoing the same vicariance process as D. pealeii.

However, while D. pleii is distributed in the western South Atlantic (north and south of the Amazon River plume), D. pealeii is not found in the same region (Isla Margarita, Venezuela) where Sales et al (2017) sampled D. pleii (João Braullio Sales, pers. comm.). The Northwestern Atlantic D. pealeii lineage may have dispersed to the Southwestern Atlantic (Amazonia ecoregion), and the divergence time estimated in the present study between these lineages coincides with the beginning of the establishment in the Western South Atlantic Amazon plume (Figueiredo et al. 2009; Hoorn et al. 2017). The significant temperature fluctuations caused by intensification of Northern Hemisphere glaciations (NHG) at ~ 2.75 mya (Ravelo et al. 2004) also affected global sea levels (Pillans et al. 1998; Miller et al. 2005), which play a significant role in marine and coastal taxa (Floeter et al. 2008; Pardo-Gandarillas et al. 2018; Costa et al. 2021).

These oscillations had a major impact on the distribution of marine and coastal species in various regions, such as the Gulf of Mexico and the North Atlantic. Marine cooling may have forced populations southward, which may have influenced contact between Atlantic and Gulf populations in Mexico in the region now known as South Florida (Avise 1992), where Northern Hemisphere Glaciation (NHG) caused sea level drops of over 150 m, exposing tremendous expanses of the Florida and Yucatan peninsulas, where Florida was bordered by some intermediate-salinity estuaries and saltmarshes, favoring coastal species. This glacial advance may have contributed to the separation of some coastal populations from the Atlantic and Gulf through the establishment of isolated estuarine habitats in the western portion of the Gulf of Mexico, thus favoring speciation between the North Atlantic and Gulf of Mexico D. pealeii lineages (Avise 2006) present in these areas at the time, as has been inferred for other marine species in the same region (Gold and Richardson 1998; Anderson 2007; Anderson et al. 2012).

The present study provides new evidence of potential cryptic speciation within Doryteuthis around the American continent, and suggests that historical biogeographic processes were responsible for shaping the current species composition and distributions within the genus, as seen in another Doryteuthis species (such as D. pleii), as well as within Lolliguncula. The separation between D. pealeii lineages between the Gulf of Mexico and the North Atlantic must, however, be analyzed cautiously, as there are previous proposals of multiple populations (Buresch et al. 2006; Gerlach et al. 2012) as a single stock (Shaw et al. 2012). The recent implementation of next-generation sequencing methods has helped resolve several taxonomic problems for cephalopod species (Anderson and Lindgren 2021; Sanchez et al. 2021) and may aid in elucidating the speciation between the North Atlantic and the Caribbean. Additionally, a broad morphological revision must be conducted for D. pleii and D. pealeii from North Atlantic and the Caribbean to clarify the status of D. ocula and D. roperi. The lineage of D. pealeii detected in the Western South Atlantic represents a new species for the region which reinforces the high cryptic diversity of cephalopods in the Atlantic.