Abstract
Mito-nuclear phylogenetic discordance in Bivalvia is well known. In particular, the monophyly of Amarsipobranchia (Heterodonta + Pteriomorphia), retrieved from mitochondrial markers, contrasts with the monophyly of Heteroconchia (Heterodonta + Palaeoheterodonta), retrieved from nuclear markers. However, since oxidative phosphorylation nuclear markers support the Amarsipobranchia hypothesis instead of the Heteroconchia one, interacting subunits of the mitochondrial complexes ought to share the same phylogenetic signal notwithstanding the genomic source, which is different from the signal obtained from other nuclear markers. This may be a clue of coevolution between nuclear and mitochondrial genes. In this work we inferred the phylogenetic signal from mitochondrial and nuclear oxidative phosphorylation markers exploiting different phylogenetic approaches and added two more datasets for comparison: genes of the glycolytic pathway and genes related to the biogenesis of regulative small noncoding RNAs. All trees inferred from mitochondrial and nuclear subunits of the mitochondrial complexes support the monophyly of Amarsipobranchia, regardless of the phylogenetic pipeline. However, not every single marker agrees with this topology: this is clearly visible in nuclear subunits that do not directly interact with the mitochondrial counterparts. Overall, our data support the hypothesis of a coevolution between nuclear and mitochondrial genes for the oxidative phosphorylation. Moreover, we suggest a relationship between mitochondrial topology and different nucleotide composition between clades, which could be associated to the highly variable gene arrangement in Bivalvia.
Similar content being viewed by others
Introduction
Deep bivalve phylogeny: state-of-art
Bivalves are an extremely diverse group with about 50,000 living species1. Deep evolutionary relationships among major clades within the molluscan class Bivalvia are only recently coming to a shared figure. The class is split into two main subgroups, Protobranchia and Autobranchia, whose origins root deep in the middle Ordovician periods2,3,4,5,6. Most likely, extant protobranchs resemble the Cambrian forerunners the most, for many molluscan symplesiomorphies are present, like a well-developed foot and true molluscan ctenidia devoted to gas exchange7,8; moreover, food is brought to the mouth by palp proboscides. Two sister groups are usually acknowledged within Protobranchia, Nuculida and Solemyida, which are given an ordinal status2,9,10,11,12,13; analyses mainly based on molecular markers proposed to exclude the protobranch superfamily Nuculanoidea from Protobranchia and to better place it within Autobranchia14,15,16,17; the name Opponobranchia was proposed for remaining protobranchs18. On the other hand, the clade Protobranchia has been recovered by most of large-scale datasets19,20, but with some exceptions20. Therefore, the monophyly of this clade still needs to be assessed.
The way of feeding is radically different in Autobranchia (= Autolamellibranchiata sensu18), whose common ancestor developed a feeding gill, one of the main drivers of the Ordovician bivalve radiation3 and led most groups to the key ecological shift towards infaunalization4,5,21. Autobranchia is comprised by three major clades (subclasses22): Heterodonta (clams, cockles, razor clams, and their kin), Palaeoheterodonta (freshwater mussels and their kin), and Pteriomorphia (mytilids, oysters, scallops, and their kin)19,23,24. Moreover, the former subclass Anomalodesmata22,25,26,27 has been found to be nested within Heterodonta14,15,18,20,28,29,30,31. Currently, Archiheterodonta (order Carditida) are considered sister group to other Euheterodonta, which are further split into Anomalodesmata itself and Imparidentia15,20,23,26,27,31,32,33.
Relationships among the main bivalve sub-lineages remained unresolved or uncertain until recently. With minor issues linked to the position of Nuculanida and Anomalodesmata, two main hypotheses have been put forward: the Heteroconchia hypothesis, which involves a sister group relationship between Heterodonta and Palaeoheterodonta (Fig. 1A), and the Amarsipobranchia hypothesis, which involves the sister group relationship between Heterodonta and Pteriomorphia instead (Fig. 1B).
The traditional taxonomic view and morphological analyses of Autobranchia heralded the Heteroconchia hypothesis4,11,15,26,33,34; however, a closer relationship between Heterodonta and Pteriomorphia has been suggested following palaeontological evidence35,36,37,38,39,40. The Amarsipobranchia hypothesis was also highly supported by molecular phylogenetics, using mitochondrial markers15,17,24,41,42,43. Contrastingly, the Heteroconchia hypothesis is always supported when nuclear markers are used (either combined with morphological data or not), as well as by means of transcriptomics15,19,20,33,44,45,46. This is a clear example of mito-nuclear phylogenetic discordance47.
The OXPHOS genes and mito-nuclear coevolution
The massive ATP production of aerobic respiration in eukaryotes is mostly made possible through the oxidative phosphorylation (OXPHOS) pathway, which takes place across the inner mitochondrial membrane. OXPHOS pathway is carried out by five enzymatic complexes (CI-V). The genes encoding for the subunits are mostly located in the nuclear genome (around 70 genes), while 13 genes are typically harbored in the mitochondrial genome (mtDNA), at least in most bilaterians. All the complexes but Complex II (CII) have cooperating subunits that are encoded by genes that are located on two different genomes, which show different mutation rate, population size and way of inheritance48.
In particular, the low recombination rate of mtDNA leads to the accumulation of slightly deleterious mutations49. This process would affect the efficiency of OXPHOS, but slightly negative mutations can be counterbalanced by compensatory mutations in the nuclear genes50 or even by new nuclear subunits added to the OXPHOS complexes51. According to this model of mito-nuclear coevolution, the process is driven by the accumulation of slightly deleterious mitochondrial mutations, which affects the selective pressure on the interacting nuclear subunits. Indeed, a correlation between the amino acid substitution rate of mitochondrial genes and their interacting nuclear counterparts was shown52,53,54. The evolutionary rate correlation (ERC)55 analysis is considered highly reliable to detect signals of mito-nuclear coevolution56 and bivalves are among the clades where a positive ERC has been identified56,57,58.
Quite surprisingly, the Amarsipobranchia clade is also supported by nuclear genes encoding for the OXPHOS subunits57. Moreover, nuclear and mitochondrial OXPHOS genes show significant ERC and a similar dN/dS ratio57 (the ratio between nonsynonymous substitution rate and the synonymous substitution rate59).
The mtDNA of bivalves has a highly variable architecture, showing features that are unique among metazoans. Gene order is not conserved inside the class and the high frequency of rearrangements prevents to infer an ancestral gene order for Autobranchia60. Among Protobranchia, in the mitochondrial sequence of Solemya velum the leading strand, which is also the AC-rich one, harbors the genes co1, co2, co3, nadh1, nadh2, nadh4, nadh4L and nadh5, whereas the other strand harbors the genes atp8, atp6, cytb, nadh1 and nadh643. Among Bivalvia, this is likely the most ancestral gene arrangement43.
In Palaeoheterodonta the genome organization is highly conserved, and notable rearrangements were never detected within this subclass. Most of the protein coding genes are retained on the GT-rich strand (atp6, atp8, co1, co2, co3, nadh3, nadh4, nadh4L and nadh5), whereas the other strand harbors cytb, nadh1, nadh2 and nadh661.
Heterodonta and Pteriomorphia show a high degree of rearrangement. Few blocks of genes are shared between different orders of the same clade, and sometimes even among the same family62. It is however worth noting that in Heterodonta and Pteriomorphia all genes are retained on one strand, which is rich in G + T. As a matter of fact, the unidirectional replication of the mitochondrial genome leads to an asymmetric nucleotide composition of the two strands, increasing the G + T content in the heavy strand63. Most metazoans harbor most of the genes on the light strand, which is rich in A + C, but mollusks show an inverted pattern, in that in these species most of the genes are located on the GT-rich strand64,65. The position of mitochondrial genes on different strands has already been reported as a source of phylogenetic artifacts64,66. Thus, there could be a relationship between the diverging phylogenetic signal of the mitochondrial markers and the location of some genes in Palaeoheterodonta compared to Heterodonta and Pteriomorphia.
In this study, we performed a phylogenetic analysis using mitochondrial (mt-OXPHOS) and nuclear OXPHOS (nu-OXPHOS) markers, exploiting different phylogenetic approaches. For the sake of comparison, we added two more datasets: genes related to the glycolytic pathway and the genes related to the biogenesis of regulative small noncoding RNAs (sncRNAs). We also analyzed different features of markers selected for phylogenies: how the phylogenetic signal is distributed along the genes, codon usage, amino acid composition and strand location of the markers. We tested possible relationships between these features and the retrieved phylogenetic signals.
Regardless of the phylogenetic method, the Amarsipobranchia are supported only by the OXPHOS markers, both nuclear and mitochondrial. This phylogenetic signal is mostly retained in the organellar markers; among nuclear genes, subunits in direct contact with the mitochondrial counterparts lend most support to this topology. Moreover, we report an unbalanced nucleotide and amino acid composition between Amarsipobranchia and the Palaeoheterodonta, with a higher guanine and thymine content in the latter clade. We suggest that this pattern might be related to a different transcriptional mechanism, which has driven the mitochondrial phylogenetic signal to support Amarsipobranchia.
Results
The phylogenetic analysis on the four datasets
The datasets were comprised by 35 species, for four species two mitochondrial haplotypes were sampled (i.e., the female and male mitochondrial haplotypes; see below) (Table 1). All four datasets were incomplete, glycolysis being the most incomplete matrix (Table S1). Conversely, the mt-OXPHOS dataset was the most complete. Species showed a different range of completeness as well: Myzuhopecten yessoensis was the most complete species, while the outgroup Graptacme eborea was the least complete species (Fig. S1). After the masking step, the mt-OXPHOS dataset was the shortest but also that with the lowest number of discarded sites. The longest dataset was the glycolysis one; the sncRNAs dataset was that with the highest number of discarded sites (Table S1).
The three maximum-likelihood (ML) trees and the single Bayesian tree inferred from the mt-OXPHOS dataset were never significantly different and did not show any alternative resolution of major clades (Fig. 2a, Fig. S2 and Table S2). Protobranchia were basal, exception made for Aequiyoldia eightsii (Nuculanida), which clusters within Amarsipobranchia. Autobranchia were fully supported by all four trees. Then, the tree was divided into Amarsipobranchia and Palaeoheterodonta, both fully supported. The Amarsipobranchia were divided into Heterodonta and a clade comprised by A. eightsii and Pteriomorphia. Within this clade a polytomy between A. eightsii, Mytilida (Perna viridis, Bathymodiolus azoricus, Mytilus edulis) and the other pteriomorphians was recovered. Heterodonta were split into Imparidentia and Anomalodesmata, both fully supported.
The ML and Bayesian trees inferred from the nu-OXPHOS dataset were never significantly different and did not show any alternative resolution of major clades (Fig. 2b, Fig. S3 and Table S2). Protobranchia were basal, but monophyletic in the MrBayes tree only (Fig. 2b); according to the other trees this group was not monophyletic or not robustly supported (Fig. S3). As for the mt-OXPHOS dataset, Autobranchia were split into Palaeoheterodonta and monophyletic Amarsipobranchia. Amarsipobranchia were divided into Pteriomorphia and Heterodonta, and the latter clade was split into Anomalodesmata and Imparidentia; all these clades were fully supported. Within Pteriomorphia, Mytilida are the sister group of remaining OTUs.
The ML and Bayesian trees inferred from the sncRNAs dataset were never significantly different and did not show any alternative resolution of the main clades (Fig. 3a, Fig. S4 and Table S2). Overall, several phylogenetic relationships were not resolved and some species were placed in unexpected major clades. After the separation of Ennucula tenuis, there was a polytomy with 6 branches: Heteroconchia; Mytilida + Ostreida, exception made for Pinna atropurpurea; Pectinida; A. eightsii + P. atropurpurea; Tegillarca granosa; Solemya velum (Fig. 3a). Heteroconchia were divided into Palaeoheterodonta and Heterodonta. Heterodonta were split into Anomalodesmata and Imparidentia, even if within the latter clade the palaeoheterodont Margaritifera margaritifera was recovered, which does belong to freshwater mussels.
The ML and Bayesian trees inferred from the glycolysis dataset were never significantly different and did not show any alternative resolution of major clades (Fig. 3b, Fig. S5 and Table S2). A long branch led to the Bivalvia node, which further separated Pteriomorphia from other bivalves, leading to the paraphyly of Autobranchia. Namely, Protobranchia and Heteroconchia clustered into a monophyletic group that was supported by all four trees. Heteroconchia were split into Palaeoheterodonta and Heterodonta. The latter clade was divided in Anomalodesmata and Imparidentia; all these clades were fully supported. Within major clades all relationships were resolved and supported and the pteriomorphian and imparidentian species clustered in the expected orders.
Concluding, notwithstanding some issues with the major clade of Protobranchia which blurred the comparison and the substantial overlapping of all phylogenetic trees, the Amarsipobranchia hypothesis was supported in both OXPHOS datasets, while the Heteroconchia hypothesis was supported in the glycolysis and sncRNAs datasets. Henceforth, we will use mt-topology to refer to the Amarsipobranchia hypothesis and nuc-topology for the Heteroconchia hypothesis.
Phylogenetic signal and its distribution across markers and complexes
Markers belonging to the same dataset may support a different phylogenetic signal. Gene concordance factor (gCF), site concordance factor (sCF)67 and ultrafast bootstrap approximation68 (UFBoot) were calculated for the Heteroconchia and Amarsipobranchia (which represent alternative resolutions of a node). The mt-OXPHOS dataset showed high support for Amarsipobranchia according to each value (UFBoot = 100; gCF = 30.8; sCF = 48.6), and low support for the Heteroconchia (UFBoot = 0; gCF = 0; sCF = 25.5). Despite a non-zero gCF suggests more markers concordant with the nuc-topology than with the mt-topology, the nu-OXPHOS dataset similarly favors mt-topology (UFBoot = 87; gCF = 3.57; sCF = 37.2) against nuc-topology (UFBoot = 12; gCF = 5.36; sCF = 32.5). Regarding the sncRNAs and glycolysis datasets, markers are more concordant with Heteroconchia, since the UFBoot, gCF and sCF calculated for this topology are considerably higher (Table S3).
For the two OXPHOS datasets we clustered the markers according to the OXPHOS complexes; the sCF for each complex was computed; moreover, it was compared to the sitewise log-likelihood score (SLS) calculated for both topologies. The difference between the mt-topology sitewise log-likelihood score and the nuc-topology sitewise log-likelihood score (ΔSLS) can tell which topology is favored by each site: sites with ΔSLS > 0 supports the mt-topology; sites with ΔSLS < 0 support the nuc-topology. Moreover, by summing all the ΔSLS within a complex we obtained a complexwise log-likelihood score69,70 (ΔCLS; Table 2). Since the summed ΔCLS highly depends on the number of sites within each complex, we divided the ΔCLS for the number of sites of each complex (average ΔCLS). For the mitochondrial markers that belong to CI we made a distinction between those nadh genes that in Palaeoheterodonta are on the plus strand (CI-ps) from those nadh genes located on the minus strand (CI-ms), since we were willing to test if the mt-topology is mostly supported in the genes that are in different strands in Palaeoheterodonta and Amarsipobranchia (i.e. cytb, nadh1,2,6; see “Introduction”).
All the mitochondrial groups (Table 2) show a positive ΔCLS; a positive average ΔCLS; more sites that strongly support the mt-topology; more sites in the alignment that agree with the mt-topology. The only exception is CI-ps, where the ΔCLS and average ΔCLS are negative, although the other statistics follow the pattern of the other groups.
Complexes III to V of the nu-OXPHOS dataset (Table 2) support Amarsipobranchia; sites that strongly support the mt-topology (with ΔSLS > 0.5) are more than those supporting the nuc-topology and most sites in the alignment agree with the mt-topology. Contrastingly, CI and CII do not support Amarsipobranchia. In CII there is an equal number of sites for either topology, while in the CI those with a ΔSLS > 0.5 are more. The sCF calculated on the nuc-topology is higher in CII and almost equal in CI with respect to the sCF calculated on the mt-topology.
Overall, in all complexes ΔCLS, average ΔCLS and sCF variate together; statistics related to the strongly supporting sites do not always follow the same pattern, since CI shows a negative ΔCLS but a higher number of sites with ΔSLS > 0.5.
To test whether the mt-topology phylogenetic signal is mostly retained in the nu-OXPHOS subunits that interact with the mitochondrial subunits, we calculated the sCF referred to each marker and we split the markers into two groups: those that are in direct contact with the mitochondrial counterparts and those that are not. The sCF values of the “contact” nu-OXPHOS markers are significantly higher than the values of “non-contact” nu-OXPHOS makers (p value = 0.006363; Fig. 4).
In the sncRNAs and glycolysis datasets the ΔSLS was calculated on the whole matrix. In both datasets the average ΔSLS is negative and there are more sites strongly supporting Heteroconchia (Table S3).
Nucleotide composition and mitochondrial topology
We placed attention on the nucleotide asymmetry between the two mitochondrial strands, which can be assessed calculating the AT skew and GC skew71. In Bivalvia the plus strand is richer in guanines and thymines than the minus strand64,72. Thus, we also analyzed possible dissimilarities in the guanine and thymine content (G + T content) between markers that in Pteriomorphia, Imparidentia, Anomalodesmata and Palaeoheterodonta are on the same strand (i.e., atp6,8, cox1-3, nadh3-5; Table S4).
For each mt-OXPHOS marker of each species we calculated the AT skew, the GT content, the frequency of codons with guanines or thymines at the first and the second position (GT-rich codons) and the GT content at the third position of four-fold degenerated codons (Fig. 5). Among Imparidentia, Anomalodesmata, Pteriomorphia and Palaeoheterodonta the markers show an AT skew < 0 and a GT content > 0.5. On average, Palaeoheterodonta show the highest values in all the statistics but the AT skew (Fig. 5a). Indeed, Palaeoheterodonta are always significantly different from the other groups, with the only exception of Anomalodesmata in GT-rich codons (Fig. 5b). On the other hand, the comparisons between Pteriomorphia and Imparidentia are never significant. Regarding the outgroups and Protobranchia values, data show a high standard deviation in most of the cases. The only exception is A. eightsii, whose statistics are in line with the values of Imparidentia and Pteriomorphia.
Finally, we studied if the nucleotide compositional patterns outlined in the protein coding regions were extended to the unassigned regions (URs): we downloaded the mitochondrial genomes available on NCBI of all the species that belong to Imparidentia, Anomalodesmata, Palaeoheterodonta, Pteriomorphia and Protobranchia. Then, we calculated the GT content in the URs of the genomes. The GT content of URs calculated on 92 Palaeoheterodonta entries is significantly higher than the one calculated on 77 Pteriomorphia entries, 70 Imparidentia entries and 4 Protobranchia entries. Conversely, it is not significantly higher than the one calculated on 6 Anomalodesmata entries (Fig. S6). For what concerns the other comparisons, no clade is significantly different from any other.
Overall, the nucleotide composition of Palaeoheterodonta mt-OXPHOS markers is most of the times significantly different from the one of other major clades. In particular, we detected a higher GT content. This pattern is reflected in all codon positions as well as in the URs. On the other hand, statistics are overlapping between Imparidentia and Pteriomorphia.
Discussion
For all four datasets, the more recent nodes were resolved and highly supported. In Imparidentia and Pteriomorphia the OTUs were always placed in the expected orders and major clades (i.e. Pteriomorphia, Anomalodesmata, Imparidentia, Palaeoheterodonta and Protobranchia). Only few exceptions were detected, i.e. the position of A. eightsii in the sncRNAs and mt-OXPHOS trees and the position of M. margaritifera in the sncRNAs trees. The latter was likely a long branch attraction bias73, since the final branches of the OTU and its sister species were the longest in the tree (Fig. 3a). Generally speaking, we regard to these misplacements as minor phylogenetic issues in the broader figure of deep evolutionary relationships among bivalves, which do not significantly blur the topology connecting major clades.
Major clades were retrieved with higher support and with better resolution from the OXPHOS datasets with respect to the glycolysis and sncRNAs dataset. Overall, OXPHOS genes are known to be more conservative, therefore these markers might be more informative in the resolutions of cladogenetic events dating to the Ordovician, approximately 470–480 million years ago (Mya)36,38,39.
The mt-OXPHOS trees were mostly coherent with the previous mitochondrial phylogenetic analyses, exception made for the monophyly of the Heterodonta with Anomalodesmata inferred from our analysis17,24,41,42,57,64,74. Since all the nu-OXPHOS trees supported the Amarsipobranchia clade (Fig. 1b), our data confirmed that the mt and nu-OXPHOS markers share the same phylogenetic signal, which is different from that inferred from transcriptome-wide analyses or other nuclear markers14,19,33,44,45.
Among the interacting sites of coevolving proteins there are epistatic interactions, which lead the sites of both proteins to evolve at the same rate75. Bivalvia OXPHOS subunits show a positive ERC, which is the most solid clue of protein coevolution55,56. Our data enforce the hypothesis of mito-nuclear coevolution in bivalves, depicting a clear relationship between the phylogenetic signal of interacting subunits. Moreover, they provide an overview on how the phylogenetic signal of OXPHOS subunits may be biased under this type of interaction. The CF and ΔSSL analyses suggested that OXPHOS markers did not equally support the Amarsipobranchia, yet the two dataset were largely coherent with each other: CIII-V markers from both OXPHOS datasets largely supported the mt-topology; contrastingly, CI did not show a clear pattern, and the nuclear-only Complex II favors the nuc-topology. Moreover, the nu-OXPHOS subunits in contact with the mitochondrial counterparts were significantly more concordant with the mt-topology than the subunits that are not directly in contact. Accordingly, previous analyses reported that the CII is the only complex that shows uncorrelated rates of evolution compared to the other subunits57.
Finally, the support and concordance statistics (BP, PP, UFBoot, gCF and sCF) calculated for the mt-OXPHOS dataset on the Amarsipobranchia node were always equal to or higher than those calculated for the nu-OXPHOS dataset. Thus, the mt-topology in the first dataset was more consistent: more sites and markers agreed with this topology and the signal was less susceptible to resampling.
The mito-nuclear coevolution is expected to be mainly driven by slightly deleterious mitochondrial mutations that are compensated by the nuclear genome48,50. Even if previous data did not show signal of nuclear compensation57, it is tempting to conclude that the mitochondrial genome acquired the mutations leading to the mt-topology at first, and then the phylogenetic signal has been traced by interacting sites in the nuclear markers through nuclear compensation.
Pteriomorphia and Imparidentia share some unique mitochondrial features: their gene order is highly rearranged, but all genes are on the same strand. Contrastingly, Palaeoheterodonta show a highly conserved gene order, with a set of genes on the minus strand (nadh1,2,6 and cytb; see Introduction for further details). According to the nucleotide composition analyses, the Palaeoheterodonta mt-OXPHOS markers were significantly GT-richer in each codon position as well as in URs, while Pteriomorphia and Imparidentia did not show any significant difference (Fig. 5, Fig. S6). Thus, this pattern was accounted for either synonymous and non-synonymous substitutions and it was extended also to URs. Mito-nuclear coevolution largely explains why the nu and mt-OXPHOS markers support a common topology.
Mito-nuclear discordance is a quite common phenomenon and a multitude of processes can cause it76. The introgression of mitochondrial lines from a phylogenetically distant population is widely used to explain mito-nuclear discordance47,76. In some cases it has been hypothesized that a set of nuclear genes might cointrogress to avoid mito-nuclear incompatibilities77. The mito-nuclear cointrogression would explain very well our data, since the phylogenetic artifact is mostly supported by the nuc-OXPHOS markers that directly interact with the mt-OXPHOS markers, whereas almost all mt-OXPHOS markers support the mt-topology. In this case, the use of a single mitochondrial strand and other features would be apomorphies arisen along a single branch and subsequently acquired by the other branch through introgression. Having said that, other evidences of cointrogression are limited and only restricted to populations within the same genus77,78,79. In our case the discordance mainly resides in the resolution of deep nodes, which originated around 480 Mya5,36,80, between clades that already evolved quite different life habits5. Under this scenario, the mito-nuclear cointrogression might not be the most likely hypothesis.
Another source of mito-nuclear discordance can be found in how markers are located on the two mitochondrial strands66. In mollusks, whose mitochondrial genome is highly rearranged, the nucleotide bias is also reflected in amino acid bias64. Our results showed that the signal supporting the mt-topology (the Amarsipobranchia clade) is not only retained in the set of markers that in Palaeoheterodonta are on the minus strand. Instead, the mt-OXPHOS CIV-V markers on the plus strand clearly favor the Amarsipobranchia hypothesis (Table 2). The higher GT content in Palaeoheterodonta is consistent throughout different parts of the mitochondrial genome, from coding to unassigned regions. Therefore, the nucleotide substitutions that have led to this pattern are likely to be produced by a process that act on the whole genome. Possible candidates might be the mitochondrial transcription and replication, which are indeed notable source of deamination63,81. Moreover, mitochondrial replication constitutes the main source of mitochondrial point mutations, at least in humans82.
The position of all genes on the same strand is probably linked to the fact that the two clades do not show any significant difference in GT content. It is tempting to hypothesize that transcription involves the coding strand only for these mtDNAs and, thus, the aforementioned deamination effect may be less pronounced. Indeed, even those sncRNAs that were recently described in the imparidentian R. philippinarum were annotated on the same coding strand83, thus corroborating the idea that only one strand is transcribed in these clades.
The use of a single strand seems also linked to the mitochondrial architecture: among most of the metazoan taxon that share this feature it has been detected a higher mitochondrial rearrangement rate24,84,85. Likewise, Pteriomorphia and Imparidentia show highly rearranged mitochondrial genomes62. An additional clue is the behavior of A. eigthsii: the protobranch species cluster with Pteriomorphia and shows similar nucleotide composition features (Fig. 2a, Fig. 5a). Indeed, although no mitochondrial genome has been annotated from the order Nuculanida, it is possible that A. eigthsii mitogenome harbors all the genes on the heavy strand, since all its mitochondrial genes show AT skew < 0 and GC skew > 0 (Table S5).
If the hypothesis holds true, it is reasonable to consider the different transcriptional patterns among Bivalvia as the most likely source that has led the mitochondrial genome to support a different phylogenetic signal, namely a biased one. Since we demonstrated that also non-synonymous mutations have shaped the GT content pattern, the modifications of the amino acid sequences could have altered the epistatic interactions between nuclear and mitochondrial OXPHOS subunits, leading the OXPHOS markers to support the same phylogenetic artifact.
Conclusions
The results obtained from the phylogenetic analysis of Piccinini and colleagues57 has been confirmed by our work, since markers of both OXPHOS datasets support the same biased topology, regardless of the phylogenetic pipeline used. Moreover, we depicted how the coevolution process affected the phylogenetic signal in different set of OXPHOS markers, concluding that the artifactual topology is mainly supported by the OXPHOS subunits that interact more directly.
Considering that the phylogenetic signal is more stable and stronger in the mt-OXPHOS markers, we suggest that the biased topology arose for these markers at first, then it has been acquired also by the nu-OXPHOS markers through the coevolution of interacting subunits. This model agrees with the pattern of evolution hypothesized for the mito-nuclear coevolution. That is, the mito-nuclear coevolution is mainly driven by slightly deleterious mitochondrial mutations that are compensated by the nuclear genome48,50.
Our data suggest a relationship between the mt-topology supporting Amarsipobranchia and the gene rearrangements in the Bivalvia mitochondrial genome. The clades that harbor all the mitochondrial genes on a single strand and show a similar nucleotide composition (Pteriomorphia, Heterodonta, and possibly A. eightsii) are grouped together in a monophyletic clade. On the other side, Palaeoheterodonta show a peculiar nucleotide composition, which is not only due to the genes located on the minus strand. Indeed, genes such cox1-3, atp6,8, nadh3-5, even if they are located on the plus strand, show a higher GT content compared to the Amarsipobranchia ones. Overall, the difference in GT content between OTUs may be a source of possible phylogenetic artifacts. Further analyses will be focused on understanding how the nuclear subunits compensated differently during the evolution of Palaeoheterodonta, Pteriomorphia and Heterodonta.
Finally, according to the data, the reliability of the Amarsipobranchia clade should be reconsidered. At the state of the art, although many mitochondrial phylogenies confirmed the Amarsipobranchia clade24,57,64,74, no phylogeny supports Amaripobranchia when based on nuclear markers (exception made for the nu-OXPHOS markers57; Fig. 2b, Fig. S2). On the other side, the Heteroconchia clade has been retrieved by genome-wide, transcriptomic, and morphological analyses19,33,44,45. If the evolutionary scenario depicted in our discussion is correct, then the taxon Amarsipobranchia cannot be supported anymore and has to be considered a phylogenetic artifact: the Heteroconchia clade should be regarded as a more reliable hypothesis instead.
Materials and methods
The datasets
Our phylogenetic analyses were performed on four datasets: mt and nu-OXPHOS genes, glycolytic pathway genes, and genes related to the biogenesis of sncRNAs. All markers were retrieved from the transcriptomes used by Piccinini and colleagues57: the transcriptomes of 35 molluscan species were assembled (Table 1). When available, the mt-OXPHOS markers from both sexes were retrieved for those species that show mitochondrial Doubly Uniparental Inheritance (DUI86,87,88).
Information about the assembly of transcriptomes is detailed in the aforementioned paper57. Briefly, the annotation of transcripts was performed using BLASTx89 against a user-defined database and HMMER90 against the Pfam database 30.091; the user-defined database contains sequences of all genes of that dataset available for Bivalvia on NCBI.
Clam homologs for the first three datasets were extracted following the gene lists available in the Kyoto Encyclopedia of Genes and Genomes (KEGG92,93,94,95; https://www.kegg.jp), which provides a curated database of enzymes involved in specific biochemical pathways: namely, the Oxidative phosphorylation pathway (KEGG entry: map00190) and the Glycolysis/Gluconeogenesis pathway (KEGG entry: map00010). Regarding genes for the fourth dataset, i.e. genes related to the biogenesis of sncRNAs, we identified a set of genes shared across Metazoa96,97. Entries available on NCBI and UniProt98 were included in the database (Table S6). Annotation was performed using BLASTp89.
Paralogs were recurrent among the markers associated to glycolysis. Therefore, we devised a method to conservatively distinguish paralogs from orthologs. We inferred the ML tree from each single marker putting orthologs together, which was obtained using IQ-TREE1.799 with mixture model as model of evolution, 1000 UFBoot68 replicates, and constraining the Bivalvia clade. Through the analysis of topologies, more than one group of clear monophyletic orthologs were detected in some cases, namely in the markers with KEGG ID K00002, K00128, K00129, K00149, K00627, K00844, K01596, K01623, K01689, K01785, K01895, K03103, K08074, and K13953 (Table S7). In these cases, groups of orthologs were split and considered as single markers. Aiming to ensure that the phylogenetic signal supported by the glycolysis matrix after this scrutiny was coherent, we retained two different datasets associated to glycolysis genes: a larger dataset with all markers obtained in this way (total-glyco) and a dataset with markers that showed no evidence of paralogs (partial-glyco). All subsequent analyses were carried out independently for both datasets; since differences in results were negligible, we are confident that we identified paralogs correctly, thus in the results we mean the total-glyco dataset only when referring to the “glycolysis” dataset.
Phylogenetic reconstruction
We performed the phylogenetic analysis using amino acid sequences, since we were more interested in deep relationships and nucleic sequences are bound to saturate along long branches. First, we aligned sets of homologous markers with PSI-Coffee100. Then, to remove the uninformative or misleading sites for the analysis, we used and combined the results of five different masking algorithms24: BMGE101, Aliscore102, Gblocks103, ZORRO104 and Noisy105. This step was performed by masking_package 1.1, downloaded from GitHub and available at https://github.com/mozoo/masking_package. To include the indels in the phylogenetic reconstruction we ran GapCoder106 on every alignment.
To assign the best-fitting evolutionary model to each marker of the matrix we used PartitionFinderProtein107. All markers belonging to the same dataset were concatenated together. For each dataset we obtained four trees. (1) One tree was obtained through IQ-TREE 1.7 with the dataset partitioned according to the PartitionFinder results. (2) One tree was obtained through IQ-TREE 1.7 with the mixture model as model of evolution99. (3) One tree was obtained through RAxML version 8.2.11108 with the dataset partitioned according to the PartitionFinder results, using the CAT model instead of the Gamma model109. 1000 bootstrap replicates were executed for each run, to test the robustness of the nodes, and the UFBoot approximation was chosen for IQ-TREE. (4) The fourth tree is based on the Bayesian inference, obtained through MrBayes110 with the dataset partitioned according to the PartitionFinder results. Number of generations was set to 10,000,000; the convergence between runs were manually checked to set the burn-in value. To set this value, we looked at the standard deviation of average split frequency over generations; moreover, we took the Potential Scale Reduction Factor (PSRF111) into consideration. In each analysis the monophyly of Bivalvia was constrained and in the Bayesian analysis the outgroup was set to be the polyplacophoran Acanthochitona crinita (Table 1).
Analyses on topologies and markers
At the end of phylogenetic analysis, four trees were obtained for each dataset through four different pipelines, as described above. To test whether the trees obtained from the same dataset are significantly different or not, we performed the Shimodaira-Hasegawa test (SH-test112), exploiting the RAxML option “-f H”.
The support of each site for the Amarsipobranchia hypothesis (“mt-topology”) and the Heteroconchia hypothesis (“nuc-topology”) was calculated through the ΔSLS. Sites with ΔSLS > 0.5 or ΔSLS < − 0.5 were retained as sites with strong support for either hypothesis69,70. To calculate the sitewise log-likelihood we exploited the RAxML option “-f g” providing the RAxML ML tree when the sitewise log-likelihood was calculated on the mt-topology. A tree with the nuc-topology was obtained by running the phylogenetic analysis with the same settings, but constraining the Heteroconchia clade (as suggested by Shen and colleagues70), and the resulting ML tree was used to calculate the nuc-topology sitewise log-likelihood.
The sCF and the gCF were calculated through IQ-TREE 1.7 (again with 1000 UF-bootstrap replications) with the option “–cf-verbose” to study phylogenetic signal between and within partitions67. Each dataset was partitioned into single markers in order to calculate the sCF per marker and the gCF. Then, the matrices were partitioned according to complexes to obtain sCF per complex. The nu-OXPHOS subunits in direct contact with the mitochondrial counterparts were defined according to the list of Piccinini and colleagues57.
Custom-tailored python and R113 scripts were used to analyze and plot the nucleotide and amino acid composition, using Biopython114 and ggplot2. Since mitochondrial URs are missing from transcriptomes, their nucleotide composition was calculated for a list of NCBI indexes obtained through the alMighto database65: a single entry was selected for each species in the database belonging to Palaeoheterodonta, Imparidentia, Anomalodesmata, Pteriomorphia or Protobranchia.
For DUI species the mtDNA of both sexes was selected. The guanine and thymine content in URs was obtained through a customized version of the HERMES tool115. The significance of the comparisons was calculated through the Kruskal and Wallis test116, followed by the Dunn’s test117 with Bonferroni’s correction.
Data availability
The data underlying this article are available in the GenBank Nucleotide Database at https://www.ncbi.nlm.nih.gov/ and in the SwissProt database at https://www.uniprot.org/, and can be accessed with the accession numbers provided in the article and in the supplementary materials.
References
Gosling, E. M. Bivalve Molluscs: Biology, Ecology and Culture (2003).
Morton, B. The evolutionary history of the Bivalvia. In Origin and Evolutionary Radiation of the Mollusca (ed. Taylor, J. D.) 337–359 (Oxford University Press, 1996).
Cope, J. C. W. & Babin, C. Diversification of bivalves in the Ordovician. Geobios 32, 175–185 (1999).
Cope, J. C. W. Diversification and biogeography of bivalves during the Ordovician period. In Palaeobiogeography and Biodiversity Change: The Ordovician and Mesozoic-Cenozoic Radiations (eds Crame, J. A. & Owen, A. W.) 25–52 (Geological Society of London, 2002).
Fang, Z. J. An introduction to Ordovician bivalves of southern China, with a discussion of the early evolution of the Bivalvia. Geol. J. 41, 303–328 (2006).
Tsubaki, R., Kameda, Y. & Kato, M. Pattern and process of diversification in an ecologically diverse epifaunal bivalve group Pterioidea (Pteriomorphia, Bivalvia). Mol. Phylogenet. Evol. 58, 97–104 (2011).
Yonge, M. The protobranchiate mollusca; A functional interpretation of their structure and evolution. Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci. 230, 79–147 (1939).
Stasek, C. R. Synopsis and discussion of the association of ctenidia and labial palps in the bivalve Mollusca. Veliger 6, 91–97 (1963).
Starobogatov, Y. I. Morphological basis for phylogeny and classification of Bivalvia. Ruthenica 2, 1–25 (1992).
von Salvini-Plawen, L. & Steiner, G. Synapomorphies and plesiomorphies in higher classification of Mollusca. In Origin and Evolutionary Radiation of the Mollusca (ed. Taylor, J. D.) 29–51 (Oxford University Press, 1996).
Waller, T. R. Origin of the molluscan class Bivalvia and a phylogeny of major groups. In Bivalves: An Eon of Evolution (eds Johnston, P. A. & Haggart, J. W.) 1–45 (University of Calgary Press, 1998).
Steiner, G. & Hammer, S. Molecular phylogeny of the Bivalvia inferred from 18S rDNA sequences with particular reference to the Pteriomorphia. Geol. Soc. Lond. Spec. Publ. 177, 11–29 (2000).
Passamaneck, Y. J., Schander, C. & Halanych, K. M. Investigation of molluscan phylogeny using large-subunit and small-subunit nuclear rRNA sequences. Mol. Phylogenet. Evol. 32, 25–38 (2004).
Giribet, G. & Wheeler, W. On bivalve phylogeny: A high-level analysis of the Bivalvia (Mollusca) based on combined morphology and DNA sequence data. Invertebr. Biol. 121, 271–324 (2002).
Giribet, G. & Distel, D. L. Bivalve phylogeny and molecular data. In Molecular Systematics and Phylogeography of Mollusks (eds Lydeard, C. & Lindberg, D.) 45–90 (Smithsonian Institution Press, 2003).
Bieler, R. & Mikkelsen, P. M. Bivalvia—A look at the branches. Zool. J. Linn. Soc. 148, 223–235 (2006).
Plazzi, F. & Passamonti, M. Towards a molecular phylogeny of Mollusks: Bivalves’ early evolution as revealed by mitochondrial genes. Mol. Phylogenet. Evol. 57, 641–657 (2010).
Giribet, G. Bivalvia. In Phylogeny and Evolution of the Mollusca (eds Ponder, W. F. & Lindberg, D. R.) 105–142 (University of California Press, 2008).
González, V. L. et al. A phylogenetic backbone for Bivalvia: An RNA-seq approach. Proc. R. Soc. B Biol. Sci. 282, 20142332 (2015).
Lemer, S., Bieler, R. & Giribet, G. Resolving the relationships of clams and cockles: Dense transcriptome sampling drastically improves the bivalve tree of life. Proc. R. Soc. B Biol. Sci. 286, 20182684 (2019).
Plazzi, F., Puccio, G. & Passamonti, M. Burrowers from the past: Mitochondrial signatures of ordovician bivalve infaunalization. Genome Biol. Evol. 9, 956–967 (2017).
Newell, N. D. Classification of the Bivalvia. Am. Mus. Novit. 2206, 1–25 (1965).
Combosch, D. J. et al. A family-level tree of life for bivalves based on a Sanger-sequencing approach. Mol. Phylogenet. Evol. 107, 191–208 (2017).
Plazzi, F., Puccio, G. & Passamonti, M. Comparative large-scale mitogenomics evidences clade-specific evolutionary trends in mitochondrial DNAs of Bivalvia. Genome Biol. Evol. 8, 2544–2564 (2016).
Myra, K. A. Marine Molluscan Genera of Western North America: An Illustrated Key (Stanford University Press, 1963).
Carter, J., Altaba, C., Anderson, L. & Yancei, T. A synoptical classification of the Bivalvia (mollusca). Paleontol. Contrib. https://doi.org/10.17161/PC.1808.8287 (2011).
Morton, B. & Machado, F. M. Chapter one—Predatory marine bivalves: A review. Adv. Mar. Biol. 84, 1–98 (2019).
Harper, E. M., Hide, E. A. & Morton, B. Relationships between the extant Anomalodesmata: A cladistic test. In The Evolutionary Biology of the Bivalvia (eds Haper, E. M. et al.) 129–143 (The Geological Society of London, 2000).
Dreyer, H., Steiner, G. & Harper, E. M. Molecular phylogeny of Anomalodesmata (Mollusca: Bivalvia) inferred from 18S rRNA sequences. Zool. J. Linn. Soc. 139, 229–246 (2003).
Harper, E. M., Dreyer, H. & Steiner, G. Reconstructing the Anomalodesmata (Mollusca: Bivalvia): Morphology and molecules. Zool. J. Linn. Soc. 148, 395–420 (2006).
Taylor, J. D., Williams, S. T., Glover, E. A. & Dyal, P. A molecular phylogeny of heterodont bivalves (Mollusca: Bivalvia: Heterodonta): New analyses of 18S and 28S rRNA genes. Zool. Scr. 36, 587–606 (2007).
Taylor, J. D., Williams, S. T. & Glover, E. A. Evolutionary relationships of the bivalve family Thyasiridae (Mollusca: Bivalvia), monophyly and superfamily status. J. Mar. Biol. Assoc. U.K. 87, 565–574 (2007).
Bieler, R. et al. Investigating the Bivalve tree of life—An exemplar-based approach combining molecular and novel morphological characters. Invertebr. Syst. 28, 32 (2014).
Waller, T. R. The evolution of ligament systems in the Bivalvia. In The Bivalvia (ed. Morton, B.) 49–71 (Hong Kong University Press, 1990).
Morris, N. J. A new Lower Ordovician bivalve family, the Thoraliidae (? Nuculoida), intepreted as actinodont deposit feeders. Bull. Br. Mus. Nat. Hist. (Geol.) 34, 265–272 (1980).
Cope, J. C. W. The early evolution of the Bivalvia. In Origin and Evolutionary Radiation of the Mollusca (ed. Taylor, J. D.) 361–370 (Oxford University Press, 1996).
Sánchez, T. M. & Babin, C. Distribution paléogéographique des mollusques bivalves durant l’Ordovicien. Géodiversitas 25, 243–259 (2003).
Sánchez, T. M. Taxonomic position and phylogenetic relationships of the bivalve Goniophorina Isberg and related genera from the early Ordovician of northwestern Argentina. Ameghiniana 43, 113–122 (2006).
Fang, Z. & Sanchez, T. M. Treatise Online no. 43: Part N, Revised, Volume 1, Chapter 16: Origin and early evolution of the Bivalvia. In Treatise Online (2012).
Cope, J. C. W. & Kříž, J. The Lower Palaeozoic palaeobiogeography of Bivalvia. In Geological Society, London, Memoirs Vol. 38 (eds Harper, D. A. T. & Servais, T.) 221–241 (Geological Society of London, 2013).
Doucet-Beaupré, H. et al. Mitochondrial phylogenomics of the Bivalvia (Mollusca): searching for the origin and mitogenomic correlates of doubly uniparental inheritance of mtDNA. BMC Evol. Biol. 10, 50 (2010).
Plazzi, F., Ceregato, A., Taviani, M. & Passamonti, M. A molecular phylogeny of bivalve mollusks: Ancient radiations and divergences as revealed by mitochondrial genes. PLoS ONE 6, e27147 (2011).
Plazzi, F., Ribani, A. & Passamonti, M. The complete mitochondrial genome of Solemya velum (Mollusca: Bivalvia) and its relationships with Conchifera. BMC Genom. 14, 409 (2013).
Kocot, K. M. et al. Phylogenomics reveals deep molluscan relationships. Nature 477, 452–456 (2011).
Smith, S. A. et al. Resolving the evolutionary relationships of molluscs with phylogenomic tools. Nature 480, 364–367 (2011).
Sharma, P. P. et al. Phylogenetic analysis of four nuclear protein-encoding genes largely corroborates the traditional classification of Bivalvia (Mollusca). Mol. Phylogenet. Evol. 65, 64–74 (2012).
Toews, D. P. L. & Brelsford, A. The biogeography of mitochondrial and nuclear discordance in animals. Mol. Ecol. 21, 3907–3930 (2012).
Sloan, D. B. et al. Cytonuclear integration and co-evolution. Nat. Rev. Genet. 19, 635–648 (2018).
Lynch, M. Mutation accumulation in transfer RNAs: Molecular evidence for Muller’s ratchet in mitochondrial genomes. Mol. Biol. Evol. 13, 209–220 (1996).
Osada, N. & Akashi, H. Mitochondrial–nuclear interactions and accelerated compensatory evolution: Evidence from the primate cytochrome c oxidase complex. Mol. Biol. Evol. 29, 337–346 (2012).
van der Sluis, E. O. et al. Parallel structural evolution of mitochondrial ribosomes and OXPHOS complexes. Genome Biol. Evol. 7, 1235–1251 (2015).
Havird, J. C., Whitehill, N. S., Snow, C. D. & Sloan, D. B. Conservative and compensatory evolution in oxidative phosphorylation complexes of angiosperms with highly divergent rates of mitochondrial genome evolution. Evolution (N Y) 69, 3069–3081 (2015).
Weng, M.-L., Ruhlman, T. A. & Jansen, R. K. Plastid–nuclear interaction and accelerated coevolution in plastid ribosomal genes in geraniaceae. Genome Biol. Evol. 8, 1824–1838 (2016).
Rockenbach, K. et al. Positive selection in rapidly evolving plastid-nuclear enzyme complexes. Genetics 204, 1507–1522 (2016).
Wolfe, N. W. & Clark, N. L. ERC analysis: Web-based inference of gene function via evolutionary rate covariation. Bioinformatics https://doi.org/10.1093/bioinformatics/btv454 (2015).
Yan, Z., Ye, G. & Werren, J. H. Evolutionary rate correlation between mitochondrial-encoded and mitochondria-associated nuclear-encoded proteins in insects. Mol. Biol. Evol. 36, 1022–1036 (2019).
Piccinini, G. et al. Mitonuclear coevolution, but not nuclear compensation, drives evolution of OXPHOS complexes in bivalves. Mol. Biol. Evol. 38, 2597–2614 (2021).
Forsythe, E. S., Williams, A. M. & Sloan, D. B. Genome-wide signatures of plastid-nuclear coevolution point to repeated perturbations of plastid proteostasis systems across angiosperms. Plant Cell 33, 980–997 (2021).
Nielsen, R. Molecular signatures of natural selection. Annu. Rev. Genet. 39, 197–218 (2005).
Ghiselli, F. et al. Molluscan mitochondrial genomes break the rules. Philos. Trans. R. Soc. B Biol. Sci. 376, 20200159 (2021).
Guerra, D. et al. Evolution of sex-dependent mtDNA transmission in freshwater mussels (Bivalvia: Unionida). Sci. Rep. 7, 1551 (2017).
Ren, J., Liu, X., Jiang, F., Guo, X. & Liu, B. Unusual conservation of mitochondrial gene order in Crassostreaoysters: Evidence for recent speciation in Asia. BMC Evol. Biol. 10, 394 (2010).
Saccone, C., de Giorgi, C., Gissi, C., Pesole, G. & Reyes, A. Evolutionary genomics in Metazoa: The mitochondrial DNA as a model system. Gene 238, 195–209 (1999).
Sun, S., Li, Q., Kong, L. & Yu, H. Multiple reversals of strand asymmetry in molluscs mitochondrial genomes, and consequences for phylogenetic inferences. Mol. Phylogenet. Evol. 118, 222–231 (2018).
Formaggioni, A., Luchetti, A. & Plazzi, F. Mitochondrial genomic landscape: A portrait of the mitochondrial genome 40 years after the first complete sequence. Life 11, 663 (2021).
Hassanin, A., Léger, N. & Deutsch, J. Evidence for multiple reversals of asymmetric mutational constraints during the evolution of the mitochondrial genome of Metazoa, and consequences for phylogenetic inferences. Syst. Biol. 54, 277–298 (2005).
Minh, B. Q., Hahn, M. W. & Lanfear, R. New methods to calculate concordance factors for phylogenomic datasets. Mol. Biol. Evol. 37, 2727–2733 (2020).
Hoang, D. T., Chernomor, O., von Haeseler, A., Minh, B. Q. & Vinh, L. S. UFBoot2: Improving the ultrafast bootstrap approximation. Mol. Biol. Evol. 35, 518–522 (2018).
Castoe, T. A. et al. Evidence for an ancient adaptive episode of convergent molecular evolution. Proc. Natl. Acad. Sci. 106, 8986–8991 (2009).
Shen, X.-X., Hittinger, C. T. & Rokas, A. Contentious relationships in phylogenomic studies can be driven by a handful of genes. Nat. Ecol. Evol. 1, 0126 (2017).
Reyes, A., Gissi, C., Pesole, G. & Saccone, C. Asymmetrical directional mutation pressure in the mitochondrial genome of mammals. Mol. Biol. Evol. 15, 957–966 (1998).
Yu, H. & Li, Q. Mutation and selection on the wobble nucleotide in tRNA anticodons in marine bivalve mitochondrial genomes. PLoS ONE 6, e16147 (2011).
Felsenstein, J. Cases in which parsimony or compatibility methods will be positively misleading. Syst. Zool. 27, 401 (1978).
Stöger, I. & Schrödl, M. Mitogenomics does not resolve deep molluscan relationships (yet?). Mol. Phylogenet. Evol. 69, 376–392 (2013).
Avila-Herrera, A. & Pollard, K. S. Coevolutionary analyses require phylogenetically deep alignments and better null models to accurately detect inter-protein contacts within and between species. BMC Bioinform. 16, 268 (2015).
Funk, D. J. & Omland, K. E. Species-level paraphyly and polyphyly: Frequency, causes, and consequences, with insights from animal mitochondrial DNA. Annu. Rev. Ecol. Evol. Syst. 34, 397–423 (2003).
Sloan, D. B., Havird, J. C. & Sharbrough, J. The on-again, off-again relationship between mitochondrial genomes and species boundaries. Mol. Ecol. 26, 2212–2236 (2017).
Beck, E. A., Thompson, A. C., Sharbrough, J., Brud, E. & Llopart, A. Gene flow between Drosophila yakuba and Drosophila santomea in subunit V of cytochrome c oxidase: A potential case of cytonuclear cointrogression. Evolution (N Y) 69, 1973–1986 (2015).
Morales, H. E. et al. Concordant divergence of mitogenomes and a mitonuclear gene cluster in bird lineages inhabiting different climates. Nat. Ecol. Evol. 2, 1258–1267 (2018).
Sánchez, T. M. The early bivalve radiation in the Ordovician Gondwanan basins of Argentina. Alcheringa Aust. J. Palaeontol. 32, 223–246 (2008).
Lawless, C., Greaves, L., Reeve, A. K., Turnbull, D. M. & Vincent, A. E. The rise and rise of mitochondrial DNA mutations. Open Biol. 10, 200061 (2020).
Zheng, W., Khrapko, K., Coller, H. A., Thilly, W. G. & Copeland, W. C. Origins of human mitochondrial point mutations as DNA polymerase γ-mediated errors. Mutat. Res. Fundam. Mol. Mech. Mutagen. 599, 11–20 (2006).
Pozzi, A., Plazzi, F., Milani, L., Ghiselli, F. & Passamonti, M. SmithRNAs: Could mitochondria “bend” nuclear regulation?. Mol. Biol. Evol. 34, 1960–1973 (2017).
Gissi, C., Iannelli, F. & Pesole, G. Evolution of the mitochondrial genome of Metazoa as exemplified by comparison of congeneric species. Heredity (Edinb) 101, 301–320 (2008).
Malkócs, T. et al. Complex mitogenomic rearrangements within the Pectinidae (Mollusca: Bivalvia). BMC Ecol. Evol. 22, 29 (2022).
Breton, S., Beaupré, H. D., Stewart, D. T., Hoeh, W. R. & Blier, P. U. The unusual system of doubly uniparental inheritance of mtDNA: Isn’t one enough?. Trends Genet. 23, 465–474 (2007).
Passamonti, M. & Plazzi, F. Doubly uniparental inheritance and beyond: The contribution of the Manila clam Ruditapes philippinarum. J. Zool. Syst. Evol. Res. 58, 529–540 (2020).
Zouros, E. & Rodakis, G. C. Doubly uniparental inheritance of mtDNA: An unappreciated defiance of a general rule. Adv. Anat. Embryol. Cell Biol. 231, 25–49 (2019).
Camacho, C. et al. BLAST+: Architecture and applications. BMC Bioinform. 10, 421 (2009).
Mistry, J., Finn, R. D., Eddy, S. R., Bateman, A. & Punta, M. Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions. Nucleic Acids Res. 41, e121–e121 (2013).
El-Gebali, S. et al. The Pfam protein families database in 2019. Nucleic Acids Res. 47, D427–D432 (2019).
Kanehisa, M. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
Kanehisa, M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 28, 1947–1951 (2019).
Kanehisa, M., Furumichi, M., Sato, Y., Ishiguro-Watanabe, M. & Tanabe, M. KEGG: integrating viruses and cellular organisms. Nucleic Acids Res. 49, D545–D551 (2021).
Kanehisa, M., Sato, Y. & Kawashima, M. <scp>KEGG</scp> mapping tools for uncovering hidden features in biological data. Protein Sci. 31, 47–53 (2022).
Lewis, S. H., Salmela, H. & Obbard, D. J. Duplication and diversification of Dipteran Argonaute genes, and the evolutionary divergence of Piwi and Aubergine. Genome Biol. Evol. 8, 507–518 (2016).
Ha, M. & Kim, V. N. Regulation of microRNA biogenesis. Nat. Rev. Mol. Cell Biol. 15, 509–524 (2014).
Bateman, A. et al. UniProt: The universal protein knowledgebase in 2021. Nucleic Acids Res. 49, D480–D489 (2021).
Nguyen, L.-T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Chang, J.-M., di Tommaso, P., Taly, J.-F. & Notredame, C. Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee. BMC Bioinform. 13, S1 (2012).
Criscuolo, A. & Gribaldo, S. BMGE (Block Mapping and Gathering with Entropy): A new software for selection of phylogenetic informative regions from multiple sequence alignments. BMC Evol. Biol. 10, 210 (2010).
Kück, P. et al. Parametric and non-parametric masking of randomness in sequence alignments can be improved and leads to better resolved trees. Front. Zool. 7, 10 (2010).
Castresana, J. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol. Biol. Evol. 17, 540–552 (2000).
Wu, M., Chatterji, S. & Eisen, J. A. Accounting for alignment uncertainty in phylogenomics. PLoS ONE 7, e30288 (2012).
Dress, A. W. et al. Noisy: Identification of problematic columns in multiple sequence alignments. Algorithms Mol. Biol. 3, 7 (2008).
Young, N. D. & Healy, J. GapCoder automates the use of indel characters in phylogenetic analysis. BMC Bioinform. 4, 6 (2003).
Lanfear, R., Calcott, B., Ho, S. Y. W. & Guindon, S. PartitionFinder: Combined selection of partitioning schemes and substitution models for phylogenetic analyses. Mol. Biol. Evol. 29, 1695–1701 (2012).
Stamatakis, A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Stamatakis, A. Phylogenetic models of rate heterogeneity: A high performance computing perspective. In Proceedings 20th IEEE International Parallel & Distributed Processing Symposium 8 pp. (IEEE, 2006). https://doi.org/10.1109/IPDPS.2006.1639535.
Ronquist, F. & Huelsenbeck, J. P. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19, 1572–1574 (2003).
Gelman, A. & Rubin, D. B. Inference from iterative simulation using multiple sequences. Stat. Sci. https://doi.org/10.1214/ss/1177011136 (1992).
Shimodaira, H. & Hasegawa, M. Multiple comparisons of log-likelihoods with applications to phylogenetic inference. Mol. Biol. Evol. 16, 1114–1116 (1999).
R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2021).
Cock, P. J. A. et al. Biopython: Freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics 25, 1422–1423 (2009).
Plazzi, F., Puccio, G. & Passamonti, M. HERMES: An improved method to test mitochondrial genome molecular synapomorphies among clades. Mitochondrion 58, 285–295 (2021).
Kruskal, W. H. & Wallis, W. A. Use of ranks in one-criterion variance analysis. J. Am. Stat. Assoc. 47, 583 (1952).
Dinno, A. dunn.test: Dunn’s Test of Multiple Comparisons Using Rank Sums (2017).
Carter, J. G. et al. A synoptical classification of the Bivalvia (Mollusca). Paleontol. Contrib. https://doi.org/10.17161/PC.1808.8287 (2011).
WoRMS Editorial Board. World Register of Marine Species (2022).
Acknowledgements
We are grateful to Dr Marco Calderone for retrieving published sequences of proteins associated to sncRNA biogenesis in public databases. We are also deeply indebted to Dr Giovanni Piccinini and Dr Mariangela Iannello who prompted us to investigate deeper in bivalve phylogenetic incongruencies: the present work would not have been possible without their insightful discussions. The original manuscript was also improved thanks to two anonymous reviewers. This research was funded by the “Canziani Bequest” fund.
Author information
Authors and Affiliations
Contributions
Conceptualization, F.P and M.P.; methodology, F.P. and A.F.; investigation, A.F.; formal analysis, A.F.; supervision, F.P. and M.P.; writing—original draft, A.F.; writing—review and editing, F.P. and M.P. All authors read and reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Formaggioni, A., Plazzi, F. & Passamonti, M. Mito-nuclear coevolution and phylogenetic artifacts: the case of bivalve mollusks. Sci Rep 12, 11040 (2022). https://doi.org/10.1038/s41598-022-15076-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-022-15076-y
- Springer Nature Limited