Abstract
Eukaryotic organelle genomes are generally of conserved size and gene content within phylogenetic groups. However, significant variation in genome structure may occur. Here, we report that the Stylonematophyceae red algae contain multipartite circular mitochondrial genomes (i.e., minicircles) which encode one or two genes bounded by a specific cassette and a conserved constant region. These minicircles are visualized using fluorescence microscope and scanning electron microscope, proving the circularity. Mitochondrial gene sets are reduced in these highly divergent mitogenomes. Newly generated chromosome-level nuclear genome assembly of Rhodosorus marinus reveals that most mitochondrial ribosomal subunit genes are transferred to the nuclear genome. Hetero-concatemers that resulted from recombination between minicircles and unique gene inventory that is responsible for mitochondrial genome stability may explain how the transition from typical mitochondrial genome to minicircles occurs. Our results offer inspiration on minicircular organelle genome formation and highlight an extreme case of mitochondrial gene inventory reduction.
Similar content being viewed by others
Introduction
Organelle genomes (e.g., mitogenomes and plastomes) that are derived from bacteria typically consist of single, circular DNA molecules. However, significant variation in genome size, structure and gene content exists among different eukaryote lineages1,2,3,4, including algae in the Archaeplastida5. Among these departures from the norm are multipartite or fragmented genomes whereby either several different circular (i.e., minicircles) or linear DNAs encode the organelle gene inventory. Examples are the Euglenozoa, where three distinctive variants exist: maxi- and minicircles in Kinetoplastea6,7, minicircles in Diplonemea8, and linearly fragmented mitogenomes in Euglenida9. Minicircular mitogenomes have been reported in many other lineages10,11,12,13,14,15. In the case of plastids, peridinin-containing dinoflagellates contain minicircular genomes16. In contrast, multiple linear ptDNA molecules are found in the green algal order, Cladophorales17.
Several extraordinary organelle genomes exist in the Archaeplastida with genome expansion having occurred both in green and red algae, primarily through the proliferation of repeats and transposable elements, including group II introns18,19,20,21,22,23,24. Multipartite plastomes occur in Cladophorales (Ulvophyceae) species17 and in Koshicola spirodelophila (Chaetopeltidales, Chlorophyceae)12. The plastome of K. spirodelophila is tripartite with >10 genes encoded on each partition, whereas the Cladophorales contain linear hairpin chromosomes that encode 1–3 genes on each fragment, but 1–5 fragmented coding regions within the same or different chromosomes. These plastid genes are highly diverged when compared to the expected sequence divergence rate and gene content is greatly reduced relative to other green algal lineages. Unlike the diverse organelle genome types present in the green lineage, fragmented organelle genomes have not yet been reported from the red algae (Rhodophyta).
Here, we provide the evidence of minicircular mitogenomes in the red algal class Stylonematophyceae, comprising 50 species that are either unicellular or pseudofilamentous with thick mucilaginous cell walls (https://www.algaebase.org/). Mitochondria in the Stylonematophyceae have tubular cristae25 and along with the Compsopogonophyceae and Rhodellophyceae, an endoplasmic reticulum-mitochondria-Golgi association doest not occur25,26,27. Using long-read sequencing data (i.e., Oxford Nanopore and PacBio HiFi), we assembled complete minicircular mitogenomes in Rhodosorus marinus CCMP1338 and Chroodactylon ornatum JAW4256, in addition to a pseudo-chromosomal nuclear genome assembly from R. marinus. Using short-read sequencing data from five additional Stylonematophyceae species (Tsunamia transpacifica JAW4874, Rufusia pilicola O7031, Stylonema alsidii JAW4424, Chroothece mobilis SAG104.79, and Bangiopsis subsimplex UTEX LB2854), we identified that Stylonematophyceae contain minicircular mitogenomes. Each mitogenome consists of a gene- and species-specific cassette and a species-specific constant region. These minicircles were directly visualized using fluorescence microscope (FM) and scanning electron microscope (SEM), providing conclusive optical evidence of minicircle and its circularity. Minicircle GC content has dramatically increased, not only in coding sequences (CDSs) but also in non-coding regions (NCRs). We demonstrate the Stylonematophyceae has unique gene inventory controlling organelle genome stability that is different from other red algae. The formation of hetero-concatemers through recombination may explain the origin and evolution of minicircles, not only in the Stylonematophyceae, but generally in other eukaryotes.
Results and discussion
Minicircular mitogenomes in the Stylonematophyceae
Results of agarose gel electrophoresis using total genomic DNA (gDNA) from seven Stylonematophyceae species (Fig. 1; for species information, see Supplementary Fig. 1 and Supplementary Data 1) demonstrates the presence of multiple low-molecular-weight (LMW) DNA fragments. In addition, complete mitogenomes were not assembled using short-read sequencing data, which is unusual for red algae28,29,30,31. Instead, partial linear contigs were identified in the genome assemblies (Supplementary Fig. 2a). Each of these contigs encodes one or two (mostly one) mitochondrial gene(s) and has large shared regions. To avoid assembly artifacts that can be caused by repetitive sequences, long-read sequencing data were also generated for R. marinus and C. ornatum. These new mitogenome assemblies comprised ~2–6 kb and ~6–17 kb circular sequences, respectively (Fig. 2a and Supplementary Fig. 2b). Most of the raw reads from Nanopore or HiFi sequencing associated with mitochondrial gene peaked roughly at the same length of the assembled minicircles that were distinct from the other nuclear and plastid gene-containing reads (Supplementary Fig. 3). This result excludes the possibility of assembly artifacts and linear concatemers (Supplementary Note 1).
Minicircles of the Stylonematophyceae share a conserved structure that includes a CDS with a short and gene-specific NCR; and a NCR whose sequence is shared among many minicircles within a species but not between species (Fig. 2b) (hereafter, we will refer to the CDS-flanking region as cassette and conserved NCR as constant region32). Minicircles without genes were not detected. Generally, constant regions of minicircles that encode genes ubiquitous in the Stylonematophyceae show high sequence similarity, with a small number of single nucleotide polymorphisms (SNPs) and repeat variation in the constant region (Fig. 2c, d, and Supplementary Fig. 4). There are however some exceptions. For example, for species such as R. pilicola, T. transpacifica, and S. alsidii, sequences between atp6 minicircles and other gene minicircles could be partially aligned, whereas in the other species, the sequences of atp6 minicircles were highly diverged from other minicircles. Although nearly all constant regions contain tandem repeats, we found species-specific structural variation in the constant regions (Fig. 2c, d, Supplementary Figs. 4, and 5).
Furthermore, we confirmed the presence of minicircles using Southern hybridization (Fig. 3a, Supplementary Figs. 2b, and 6; see primer information in Supplementary Data 2). Comparison of banding patterns resolved in undigested and restriction enzyme-digested gDNA showed the former to comprise three types of conformation (supercoiled, linear, and open circular), whereas only a linear conformation of expected size is found in the lane with SpeI-digested gDNA. Some restriction enzymes (e.g., BamHI) did not produce the expected banding DNA pattern, however, in the comparison of digested vs. undigested DNA in these cases, only the middle band is variable, which represent the increase in amount of linearized DNA of the predicted size via digestion of supercoiled DNA. Therefore, multiple bands do not necessarily weaken the evidence for a circular topology of mtDNA in Stylonematophyceae (Supplementary Fig. 6 and Supplementary Note 2). This result supports the idea that these red algal mitogenomes are composed of minicircles.
Evidence of minicircular mitogenomes by DNA molecule visualization with microscopy images
A more direct line of evidence comes from microscopy (Fig. 3). Here, extracted LMW DNA was stained with fluorescent protein-DNA-binding peptides or proteins (FP-DBP) and imaged using FM (Fig. 3b). LMW DNA appears as dots in the image of undigested LMW DNA whereas short lines rather than dots are observed when LMW DNA is treated with SacI restriction enzyme. Similarly, LMW DNA images generated by SEM frequently exhibit open circular and supercoiled DNA molecules when SacI is not treated, whereas linear DNA molecules are apparent after SacI treatment of LMW DNA (Fig. 3c, d and Supplementary Note 3). Consequently, the results provide a conclusive and direct optical evidence of minicircles. Moreover, the images of minicircles demonstrates a powerful advantage of the high resolution of SEM that FM cannot achieve. Although there have been many DNA images obtained using TEM, taking DNA images using TEM is not a simple process. Our approach of visualizing DNA under SEM is simple and straightforward, and thus, it can be a promising tool to confirm DNA structures with nanometer resolutions, as shown in Fig. 3c, d.
Mitogenomes with divergent gene sequences and reduced gene contents
A distinctive feature of Stylonematophyceae mitogenomes is their unexpectedly high GC content and increased rate of non-synonymous substitutions (Fig. 4a, Supplementary Fig. 7, and Supplementary Note 4). Consequently, mitochondrial genes in this lineage are greatly diverged when compared to other red algae that have typical (i.e., ancestral) mitogenome structure and gene content (Fig. 4b and Supplementary Fig. 8). In addition, most organelle genomes have an elevated adenine plus thymine (AT) content due to AT-biased mutations, which is accelerated by stressors such as reactive oxygen species (ROS)33,34,35,36. Various selection-based hypotheses such as resource availability37,38, environmental condition39, DNA stability40, regulation of gene expression41,42, and GC-biased gene conversion (gBGC)43 have been proposed, but none of these convincingly explain high GC content. Thus, the driver of high GC content in these genomes is also unknown.
A total of 13 mitochondrial-encoded proteins were identified in seven Stylonematophyceae species (Fig. 5a and Supplementary Fig. 2). These species have lost all the genes involved in cytochrome c biogenesis and those encoding ribosomal proteins and transporters. Gene sets for mitochondrial complex I, II, and V are partially present, because nad6, sdhC, sdhD, atp4, and atp8 were lost. In some species, nad3, nad4L, and sdhB are also absent (Fig. 5a). An interesting feature is that all of the remaining minicircle genes have functions related to proton influx (Fig. 5b), although these translocation mechanisms are not thoroughly understood44,45. Among seven subunits (nad1–6 and nad4L) in mitochondrial complex I, four subunits (nad1, nad2, nad4, and nad5) are involved in the proton input pathway45,46 and only these four genes remain in all studied Stylonematophyceae. In complex V, atp4, atp6, and atp9 comprise the F0 system with the accessory subunit atp847. Here, protons pass through subunit-a (atp6)48 or between subunit-a and the subunit-c ring (atp6 and atp9)44. Subunits in complex III and IV (cob and cox1–3) also are involved in proton pumping49,50,51. In this sense, loss of sdhC and sdhD, or even sdhB in mitogenomes may not be lethal because complex II does not play a role in the proton gradient52,53. The TCA cycle would not be affected, because sdhA is the key player in this process54 and this gene has been lost in all red algal mitogenomes (Supplementary Note 5). The extant mitogenome gene inventories fit well with the hypothesis termed Co-location for Redox Regulation of gene expression (CoRR)55,56, whereby genes required for redox regulation are maintained by selection in organelle (mitochondrial or plastid) genomes. Instead of outright loss, the other genes are likely to be transferred and retain their function in the nuclear genome without notable effects on fitness.
Other red algal classes such as the Rhodellophyceae22, Compsopogonophyceae18,21, and Porphyridiophyceae23 contain organelle DNAs that are interrupted by introns that encode ORFs. As an example, Kim, et al.23 classified intron-encoded proteins (IEPs) in group II introns based on IEP amino acid sequences. However, in the Stylonematophyceae, only a handful of introns are present in mitogenomes and no ORFs were found using BLASTx (Supplementary Fig. 2). We also found evidence of trans-splicing, as previously described in Diplonemea32. Notably, the gene nad5 is fragmented into two pieces (Supplementary Fig. 2), which are encoded on different minicircles (hereafter, nad5-f for the N-terminal half and nad5-s for the C-terminal half). Analysis of a transcriptome assembly uncovered two RNA transcripts that support the presence of trans-splicing in R. marinus (mean coverage = 8.5; minimum coverage = 2). However, we were unable to obtain PCR validation using cDNA. How nad5 is transcribed and translated requires further inspection. In the case of C. mobilis, we were not able to find nad5-s. Given that the read coverage of organelle genomes is extremely high, absence of nad5-s in C. mobilis is unlikely to be a result of sequencing or assembly artifacts. In some species, one of the two gene fragment is found in nuclear genome after the split57,58,59,60. Additional long-read sequencing is needed to determine if nad5-s has been transferred to the nuclear genome or lost.
We also identified the small and large subunit ribosomal RNAs (SSU and LSU rRNA) in R. marinus and C. ornatum (Fig. 2a and Supplementary Fig. 2b). In the case of tRNA, only a few were encoded on existing minicircles (Supplementary Fig. 2).
For species containing a multipartite mitochondrial genome, segregation of individual minicircles to preserve the gene repertoire in progeny should be under strong selection61,62. A simulation study suggested that if copy numbers are not sufficiently high, then selection favors gene linkage63 under stochastic DNA segregation. Even though minicircle segregation is well controlled in Kinetoplastea61, minicircle loss in laboratory culture were observed64,65. Thus, a prerequisite for minicircular genome maintenance is high copy number to prevent loss of function15. In this regard, a massive number of minicircles mitigates segregation issues through stochastic assortment in Diplonemea66. Similar to Diplonemea, our qPCR analysis demonstrates that mtDNA molecules present in high copy numbers. Specifically, the Stylonematophyceae has a 357.1 ± 152.2-fold copy number difference between mitogenome and nuclear genome, whereas the differences are 3.9-fold and 25.1-fold in the typical (single and circular) mitogenome-containing red algae: P. purpureum (Porphyridiophyceae) and C. merolae (Cyanidiophyceae), respectively (Supplementary Data 3 and Supplementary Fig. 9). The high mitogenome copy number may allow minicircle gene repertoire maintenance, even under a random segregation model.
Nuclear genome assembly of R. marinus and endosymbiotic gene transfer
We assembled the nuclear genome of Rhodosorus marinus using long-read sequencing data and investigated genes derived via endosymbiotic gene transfer (EGT) and genes that regulate mitochondrial recombination. The assembled nuclear genome comprises 13 large (>1 Mb) and four small contigs (<40 kb), totaling 31,189,410 bp (Fig. 6). Twelve contigs have at least one telomere and six are assembled telomere-to-telomere. Repeats comprise 15% of genome content (Supplementary Fig. 10). The number of genes and introns are 8392 and 24,821, respectively. The BUSCO67 value for core eukaryote gene content is 94.0% (S:91.7%, D:2.3%, F:2.6%, M:3.4%, n:303; eukaryote_odb9) and the RNA-seq mapping rate is 98.6%, supporting high completeness of the newly generated genome.
In the extremophilic red alga, Galdieria sulphuraria, nuclear-encoded mitochondrial sdhB, rpl20, and rps12 are EGT-derived31. We searched for EGT-derived mitochondrial genes in the R. marinus genome assembly and in eight publicly available red algal datasets (see Supplementary Data 1) that include two Stylonematophyceae species (Bangiopsis sp. CCMP1999 and C. ornatum). Nine out of eleven ribosomal protein genes were found to be nuclear-encoded, however sdhB was the only non-ribosomal protein gene that was identified (Supplementary Fig. 11 and Supplementary Data 4). These EGT-derived genes are scattered throughout the genome (arrowheads in Fig. 6). Because some of these ribosomal subunit genes are clustered (Supplementary Data 5), whereas others are not, there may have been independent gene transfers or translocations of genes, post-EGT, of the operon. In the nine species that were investigated, we identified rpl6, rpl14, rpl16, rps11, rps12, and rps14; whereas rpl20, rps4, and rps8 were absent only in a few species. Absence of these ribosomal subunit genes could arise from the high number of scaffolds (i.e., low assembly quality of reference genome). For example, assembly of the C. crispus genome that appears to lack rps8 comprises 1266 scaffolds, suggesting that many fragmented genes may remain to be identified. Mitochondrial transit peptides were rarely found in proteins encoded by EGT-derived genes (Supplementary Data 6) and the identified N-terminal extensions were not specific to certain proteins. This is not surprising because a cleavable presequence is not necessarily required for mitochondria targeted proteins68.
To understand the impact of EGT, we compared the GC content of transferred genes with that of mitochondrial- and other nuclear-encoded genes (Supplementary Fig. 12). Overall gene GC content and 3rd codon GC content (GC3) of EGT-derived genes were statistically different from mitochondrial genes, whereas 1st codon GC content (GC1) showed a significant difference between EGT-derived and nuclear genes, but not between nuclear genes and early EGT-derived genes (i.e., nuclear-encoded mitochondrial ribosomal subunit genes that are shared among all red algal classes; rpl1-4, rpl6-7, rpl10-11, rpl13-17, rpl21-22, rpl24, rpl46-47, rps1-2, rps4-5, rps7-9, rps11, and rps15). Thus, once a gene is transferred to the nuclear genome, its overall GC content is adjusted to that of nuclear genes via an elevated GC3. However, GC1 change appears to be a relatively slow process.
We could not identify missing genes other than ribosomal protein genes (and one sdhB) in the mitogenomes. It is possible that some of the targeted sequences may have diverged too rapidly to be identified using an in silico approach. If so, mutations would have accumulated only on a specific set of genes, because the identified EGT-derived genes do not exhibit high sequence divergence. In contrast, gene function may have been replaced by a duplicated copy of another gene. In plant mitogenomes, several ribosomal gene losses have occurred, followed by functional replacement by other genes. For instance, in plants, either transferred mitochondrial rps19 or chloroplast rps13 substitute the function of lost mitochondrial rps1369, a diverged copy of cytosolic rps15A takes the place of mitochondrial rps870, or nuclear-encoded chloroplast rps10 is dual targeted to mitochondria and chloroplasts71. However, reports have primarily concerned ribosomal protein genes. Although considered unlikely, in the most extreme case, missing genes could be outright lost, as in the case of the parasitic plant lineage Viscum that has lost all mitochondrial complex I genes72. This leads to decreased mitochondrial ATP production and increased glycolysis73. Alternative oxidases and NAD(P)H dehydrogenases are also assumed to be crucial for electron transfer in Viscum (note that these genes are not specifically present in Viscum). Using three Arabidopsis alternative NAD(P)H dehydrogenase subfamilies (NDA, NDB, and NDC)74 as queries, Stylonematophyceae was confirmed to contain more putative alternative NAD(P)H dehydrogenase genes which may contribute to respiration (Supplementary Fig. 13). Nonetheless, because none of the mitochondrial complex was completely lost in the Stylonematophyceae, generating a proton gradient through mitochondrial complexes is likely to remain a major pathway for respiration.
Coding sequences are not the only DNA regions that are transferred to the nuclear genome. Nuclear mitochondrial DNAs (NUMTs)75 that primarily comprise non-coding sequences are widespread in the nuclear genomes of diverse eukaryotes76,77. Because NUMT insertion progresses via nonhomologous DNA end-joining (NHEJ)77, the high copy number and small size of minicircular mitogenomes may favor their incorporation into nuclear DNA. Consistent with this idea, we find that NUMTs are abundant in R. marinus (Stylonematophyceae). R. marinus has the third largest NUMTs and the proportion of NUMT size to nuclear genome size is the highest among 14 red algae from five classes (Supplementary Fig. 14 and Supplementary Data 7). However, additional long-read based assemblies are needed to allow robust comparisons to be made. In contrast, NUMTs are apparently absent in the P. purpureum (Porphyridiophyceae) genome, a species that has a significantly expanded but group II intron-rich mitogenome23. Group II introns do not interrupt the nuclear genome of eukaryotes78 (Supplementary Note 6).
Origin of minicircles
We identified concatemers, in which more than two minicircles merged, in long-read data from R. marinus. Among 6982 collected (pairwise identity of ≥80% to mitochondrial genes) raw minicircle reads from R. marinus, 67 concatemers were present (Fig. 7a and Supplementary Fig. 15). Because the mode of replication is unknown (Supplementary Note 7), there is a possibility that concatemer of the same minicircles (homo-concatemers) may originate via rolling circle replication. To exclude this possibility, we differentiated homo- and hetero-concatemers (which more than two different minicircles merged into) and found 31 high confidence hetero-concatemers. Concatemers have also been reported in lineages containing minicircular organelle genomes, providing evidence of recombination-based merging of two minicircles (Fig. 7b)79,80. We assumed that concatemers are not the result of sequencing artifacts for three reasons. First, no concatemers harboring atp6 and LSU were found (Fig. 7a, c). atp6 and LSU minicircles do not share a constant region with other gene minicircles. Therefore, concatemers would not be produced from artifacts during ligation. Second, nanopore sequencing allows ssDNA to pass through the pore. Neither template switch during DNA synthesis nor multiple passes of the same ssDNA is permitted. Last, we further verified the presence of hetero-concatemers in vivo using PCR with different primer combinations (Supplementary Fig. 16). Thus, hetero-concatemers provide strong evidence of recombination, likely through homologous recombination.
Why do most minicircles contain nearly identical conserved regions? Although not always81,82,83, this conservation could be essential for minicircle persistence and is therefore found in many independent cases of minicircle formation10,11,12,14,15,16,32,84,85,86,87. Concerted evolution is believed to be primarily responsible for the near-identity of dinoflagellate minicircles88. For a minicircle to survive, a functional NCR such as the origin of replication is needed. Because minicircles originated from a single typical mitogenome, the replication origin sequence they retained would have been generally similar with each other. Resulting homologous recombination would ensure that minicircles become more similar over time.
In plants, knock-out of DNA replication, repair, and recombination (DNA-RRR) genes that control organelle genome stability leads to recurrent recombination89. The origin of minicircular genomes could be linked to changes in these regulatory genes that disturb mitogenome stability and consequently, allow tandem repeats in the mitogenomes to act as recombination hotspots. Among a list of DNA-RRR genes that were proven to target organelle genome, two candidate genes were identified (Supplementary Data 8 and Supplementary Note 8): RAD52 and MSH1. RAD52 has diverse roles in nuclear genome and mitogenome maintenance through homologous recombination (HR)90,91,92, thereby preventing microhomology-mediated recombination (MHMR). In plants, the RAD52 homolog RAD52-like protein or organellar DNA-binding protein 1 (ODB1) is localized in both the nucleus and mitochondria depending on its C-terminal region93 (primarily in mitochondria94). Mitochondria in Arabidopsis odb1 mutants experience frequent MHMR under stressful conditions such as oxidative stress94. Surprisingly, RAD52 is absent only in the Stylonematophyceae (Supplementary Fig. 17 and Supplementary Data 8), which is consistent with the idea that recombination drives minicircle formation. MSH1 is dual-targeted to both the mitochondrion and the plastid in plants and suppresses recombination between repeats95,96. The impact of MSH1 on minicircle formation is speculative because it is present in all red algae. However, there is a clear discrepancy between the Stylonematophyceae and other red algae (Supplementary Fig. 18). Stylonematophyceae MSH1 genes are grouped with other eukaryotes in the MSH1 clade that is hypothesized as HGT-derived gene from giant viruses97,98 and distinctively carries GIY-YIG domain. Although currently lacking evidence, it would be interesting if the timing of minicircle formation and MSH1 replacement in the Stylonematophyceae were to be coincident. For these two genes, we also inspected data in seven red algal genomes from Florideophyceae and Bangiophyceae (those that do not have annotation information; Supplementary Data 7 and Supplementary Note 9). Generally, the result was consistent with the expectation of the presence of RAD52 and differentially grouped MSH1 in these taxa. The only exception was the absence of RAD52 in Agarophyton vermiculophyllum (Gracilaria vermiculophylla). A unique DNA-RRR gene inventory could therefore stimulate the formation of minicircular mitogenomes by allowing frequent MHMR (Fig. 7d).
Given that concatemers can originate from recombination involving two non-homologous minicircles and a specific DNA-RRR gene inventory, the transition from a typical single circular DNA to multiple minicircles could be mediated by recombination, as previously proposed9,15,79,99. Microhomology between repeats and other sequences might stimulate recombination and could have led to minicircle formation (Fig. 7d). Among novel minicircles, those that contain a functional NCR would predominate over those lacking it, as well as the ancestral mitogenome. These minicircles may have proliferated because replication time was reduced for individual mtDNA molecule15. This would lead to a lowered reliance on the ancestral mitogenome, that would eventually be lost, as we find in the Stylonematophyceae.
In Rhopalocnemis, WHY2 and OSB2/3/4 were listed as the genes responsible for minicircular mitogenomes10, whereas in Selaginellaceae, which has a multipartite plastome, plastid-targeted RECA1 was lost100. In all these cases including the Stylonematophyceae, genes related to homologous recombination are absent, suggesting that illegitimate recombination may drive minicircle formation. In contrast, some louse species lack mtSSB101 that primarily plays a role in replication102. It has been postulated that in the absence of mtSSB, minicircle formation might be favored to deal with a stalled replication fork (RF), because mtSSB prevents replication slippage (copy-choice recombination) in a dose-dependent manner101,103. This reasoning is interesting because in the cases described above, RAD52, MSH1, and RECA genes as well as homologous recombination are involved with a stalled RF96,104,105,106. MSH1, in particular the clade that has GIY-YIG domain was postulated to suppress recombination between short repeats during the repair of stalled or collapsed RF96. Whether WHY2 and OSB2/3/4 act on stalled RF is unknown but given that these genes are ssDNA-binding proteins107 like mtSSB102 and RAD52108, the likelihood is high. Thus, a stalled RF could be another important component of minicircle formation. Taken together, the formation of minicircles appears to be explained by the loss of genes involved in both homologous recombination and repairing stalled RF, thus trigerring recombination to resolve stalled RF. However, because very few studies dealing with minicircles have discussed DNA-RRR genes, it may be premature to reach a final conclusion and more comprehensive studies are needed.
Red algae are at least a billion years old and have undergone significant remodeling, both with respect to their nuclear and organelle genomes109. The Rhodellophyceae contain the largest plastomes known22, whereas the Porphyridiophyceae contain expanded mitogenomes23. The Compsopogonophyceae have both large mitogenome and plastomes18,21. Adding to this list of eccentric features, we demonstrate that the Stylonematophyceae contain minicircular mitogenomes with reduced gene contents and highly diverged coding regions. These minicircular mitogenomes occur in high numbers (at least 100-fold more than nuclear genome; Supplementary Fig. 9) and may locate within mitochondria containing normal tubular cristae25. The Stylonematophyceae are as far as we know the first case in Rhodophyta in which every mitochondrial gene is encoded on a different DNA molecule. Our results expand the knowledge base of organelle genome evolution by providing evidence from long-read data and the assembly of nuclear genomes that can be used to infer evolutionary processes. We highlight the extent to which this gene inventory can be reduced and its content fragmented in a lineage that is nonetheless ecologically highly successful. Analysis of gene regulation and functional complementation of missing genes are important next steps in the analysis of the Stylonematophyceae minicircular mitogenomes.
Methods
Sample preparation
Culture strains of Tsunamia transpacifica JAW4874, Rufusia pilicola O7031, Stylonema alsidii JAW4424, Chroodactylon ornatum JAW4256, Chroothece mobilis SAG104.79, Rhodosorus marinus CCMP1338, and Bangiopsis subsimplex UTEX LB2854 were obtained from J.A. West (School of Biosciences 2, University of Melbourne, Parkville, Victoria 3010, Australia), F.D. Ott (905 NE Hilltop Drive, Topeka, Kansas 66617, USA), The Culture Collection of Algae at Göttingen University, Germany (SAG), The National Center for Marine Algae and Microbiota (NCMA), and the Culture Collection of Algae at The University of Texas at Austin, USA (UTEX), respectively. DY-V medium (added sea salt to 5 ppt) was used for culturing Rufusia pilicola. Chroothece mobilis and Chroodactylon ornatum were cultured in L1 + DY-V (1:1 ratio) medium. The other samples were cultured in L1 medium. Culture flasks were kept under a white LED lamp (7.76 µmol photon m−2 s−1) at 20 °C in a 12:12 light-dark cycle.
DNA and RNA extraction
Samples were either collected from 10 µm membrane filters or by centrifugation (30 min, 7197 rcf). Harvested cells were frozen in liquid nitrogen before grinding. Genomic DNAs for Illumina short-read sequencing were extracted using the Exgene Plant SV Kit (General Biosystems, Seoul, Korea) and cleaned up using DNeasy® PowerClean® Pro Cleanup Kit (QIAGEN, Hilden, Germany). For long-read sequencing, genomic DNA was extracted using the manual CTAB protocol with a customized lysis buffer110. Harvested samples were placed in a 2 ml tube with bullet and frozen in liquid nitrogen. Then samples were machine ground. After grinding, samples were resuspended by adding 600 µl of CTAB isolation buffer (1% 2-mercaptoethanol added right before usage) and incubated at 65 °C for 20 min. When samples were completely thawed, bullets were removed from the tubes and 6 µl of RNase A was added. After incubation, we centrifuged tubes at 20,817 rcf at 4 °C for 20 min. While not disturbing the pellets, samples were placed into another 2 ml tube and mixed with one volume of phenol:chloroform:isoamyl alcohol (25:24:1, v/v) before centrifugation at 20,817 rcf for 20 min. The aqueous phase was then mixed with one volume of chloroform in a new 2 ml tube and centrifuged at 20,817 rcf at 4 °C for 15 min. After centrifugation, one volume of 100% isopropanol was added and incubated at −20 °C for 30 min. Samples were then centrifuged at 20,817 rcf at 4 °C for 20 min. Precipitated DNA was washed with 70% ethanol and centrifuged again at 20,817 rcf at 4 °C to remove ethanol. Finally, DNA was air-dried and dissolved in 50 µl AE buffer from Exgene Plant SV Kit. Total RNA of R. marinus was extracted using RNeasy® Plant Mini Kit (QIAGEN, Hilden, Germany).
Whole genome sequencing and genome assembly
Library preparation and whole genome sequencing for both short-read and long-read sequencing were carried out by DNA Link Inc. (Seoul, Korea). For short-read sequencing, libraries were prepared using the Truseq Nano DNA Prep Kit (550 bp Protocol) and sequencing was done with the Illumina HiSeq2500 platform according to the protocol using 100 bp paired-end reagents. Long-read sequencing was carried out with Oxford Nanopore platform (ONT GridION) for R. marinus (6 kb size selection) and the Pacific Biosciences (PacBio) High-Fidelity (HiFi) sequencing platform for C. ornatum (no size selection). RNA-seq for R. marinus was done with the Illumina NovaSeq600 platform. The raw data from short-read sequencing were assembled using SPAdes 3.14.1111 with —careful pipeline option and those from long-read sequencing were assembled using NextDenovo 2.5.0 (https://github.com/Nextomics/NextDenovo) for nuclear genome of R. marinus. Assembled NextDenovo contigs were polished 3 times with Pilon 1.22112 using short-read mapping data generated by bowtie2 2.3.5.1113. For mitogenome assemblies using long-read data, reads that have BLAST hits to mitochondrial CDS were used. The program miniasm 0.3 (r179)114 was used to identify the R. marinus mitogenome and IPA 1.3.1 (https://github.com/PacificBiosciences/pbipa) was used for C. ornatum. In addition, reads that had BLAST hits to the NCR were used to search for empty minicircle reads that do not contain a CDS, however, no contigs were assembled, meaning the collected reads are just fragments of CDS-containing reads. Because minicircles share long conserved region that short-reads cannot discriminate, we used long-read data and NextPolish 1.4.0 (https://github.com/Nextomics/NextPolish) to polish the miniasm-derived contigs. We did not perform polishing on IPA contigs, because HiFi sequencing generates extremely accurate reads. Remaining SNPs and ambiguities were manually corrected using mapping data of long-reads containing CDS. For C. ornatum, each sequence from step 10 (10-assemble/p_ctg.fasta) was considered as a minicircle sequence, because the following step of the IPA assembler (polish and purge dups) did not function correctly.
For the short-read data, sorted and verified mitochondrial genes (see below) were used as seeds for NOVOplasty 4.2115. Using Geneious (Biomatters, Auckland, New Zealand), generated NOVOplasty contigs were then de novo assembled. Assembled contig that codes any of mitochondrial genes was considered as part of mitochondrial genome. Those contigs were polished (SNP & Indel) with Pilon 1.22112, using short-read mapping data generated by bowtie2 2.3.5.1113. Trinity 2.11.0116 was used to assemble RNA-seq data.
Sorting and verifying mitochondrial contigs
BLAST 2.2.31+117 was used to search for mitochondrial genes. Because mitochondrial gene sequences of the Stylonematophyceae were absent in the National Center for Biotechnology Information (NCBI) database, mitochondrial protein sequences from several red algal species were searched against assembled SPAdes contigs with e-value 1e-05 using tBLASTn. All the matched sequences were translated (Genetic code 4118) and aligned against NCBI protein database (nr). Sequences that have eukaryotic taxa in top 100 matches were considered as candidate genes. Those that only had prokaryotic taxa in top 100 matches with significantly low identity or query coverage were also selected as possible candidates.
To exclude bacterial contigs from possible mitochondrial contigs, genomic features such as GC content, read coverage and tBLASTn result (top match and identity) of the contig were used as criteria for selection. CDSs of each contig were compared against nr database using default parameters. These candidate contigs were verified manually using phylogenetic analysis. Using translated CDS in candidate contigs as queries, protein sequences from nr database were searched by MMSeqs2119 (Version: 330ea3684fd3f985d0127ffe8ca5b3f13053c619) with maximum sensitivity and e-value 1e-05.
Nuclear gene prediction
RNA-seq reads were mapped against the assembled nuclear genome of R. marinus using hisat2 (2.2.1)120 and STAR 2.7.7a121 (--outFilterScoreMinOverLread 0.45 --outFilterMatchNminOverLread 0.45). Mapping information was used as training set of ab initio gene models, performed using BRAKER 2.1.5122. Completeness was measured using BUSCO 3.0.2 with the eukaryote_odb9 database67, following Cho, et al.123. RAD52 was not found in the gene model of C. crispus and contaminant assemblies were found in the transcriptome assembly of C. ornatum. Therefore, we chose to generate a transcriptome assembly and perform gene modeling using the available RNA-seq data (see Supplementary Data 1). We used Trinity 2.11.0116 to obtain the transcriptome assembly. Cd-hit 4.8.1124,125 was used to cluster sequences with similarity over 95% and proteins were predicted using Transdecoder 5.5.0 (https://github.com/TransDecoder/TransDecoder). BUSCO67 values were: C. crispus, C:97.4% [S:27.4%, D:70.0%], F:2.6%, M:0.0%, n:303; and C. ornatum, C:96.7% [S:20.5%, D:76.2%], F:1.3%, M:2.0%, n:303. Consequently, we predicted several novel genes that are not present in existing red algal data.
Comparative analysis of CDSs
The mitochondrial genomes of 23 red algae representing Cyanidiophyceae, Compsopogonophyceae, Porphyridiophyceae, Rhodellophyceae, Bangiophyceae, and Florideophyceae were downloaded from NCBI nucleotide database (nt) and used for the comparison (Supplementary Data 1). Translated sequences of 11 CDSs (atp6, atp9, cob, cox1, cox2, cox3, nad1, nad2, nad4, nad5-f and nad5-s) were aligned by MAFFT 7.310126 and concatenated. From the concatenated alignment, amino acid similarity were calculated by Geneious 10.2.3 using blosum62 matrix127 with threshold 1 as well as nucleotide identity. Maximum likelihood phylogenetic tree was built using IQ-TREE 1.6.8128. Optimal evolutionary model was automatically chosen after model selection129. Pairwise dN/dS calculation was performed by ParaAT 2.0130 and KaKs Calculator 2.0131.
For species identification of seven Stylonematophyceae, we downloaded rbcL sequences of 16 Stylonematophyceae and one Compsopogonophyceae from NCBI and aligned with those of our samples using MAFFT 7.310126. IQ-TREE 1.6.8128 was used to construct a maximum likelihood phylogenetic tree. Optimal evolutionary model was automatically chosen after model selection129.
Detecting mitochondrial tRNA, rRNA, and other sequences in the nuclear genome
We searched for tRNA genes using ARAGORN 1.2.38132. Ribosomal RNAs were initially searched in long-read assemblies of R. marinus and C. ornatum using barrnap 0.9 (https://github.com/tseemann/barrnap) and RNAmmer 1.2133 but no matches were found. We then looked for raw reads reporting BLASTn hits on collected red algal rRNA sequences under diverse e-values, up to 100 but the results were not useful. Next, we looked for RNA transcripts of R. marinus. Long-reads mapped on each of assembled transcripts were collected, and assembled using minimap2 (2.17-r941)134,135 and miniasm 0.3 (r179)114. Transcripts whose assembled contig has circular topology were aligned to red algal rRNA sequences and manually inspected. LSU rRNA of C. ornatum was detected using BLASTn with LSU rRNA of R. marinus as a query but not in the case of SSU rRNA. SSU rRNA minicircle has a constant region that is shared among most of other minicircle in R. marinus. Thus, we collect reads that have BLASTn match against constant regions and filtered out reads that have BLASTn match against coding sequences. After the assembly (described above), only one contig that has a circular topology left, a SSU rRNA minicircle of C. ornatum. Polishing was performed using the procedure described above.
For coding genes, we downloaded gene sets of six red algae and the transcriptome assembly of two red algae (Supplementary Data 1). As described above, we predicted gene sets for C. crispus, Bangiopsis sp. CCMP1999, and C. ornatum. We aligned all the mitochondrial genes from 30 species against gene sets of each red alga using DIAMOND 0.9.36.137136 to find EGT-derived genes. Then we aligned all the gene sets against the nr database. Hit queries and subjects were aligned using MAFFT 7.310126 and a phylogenetic tree was constructed using IQ-TREE 1.6.8128.
Genes that control genome stability were selected based on existing data137,138,139,140,141. Proteins were downloaded from the NCBI database and aligned against the red algal gene set using BLASTp (e-value 1e-03). Hit identity, alignment length, query length, and subject taxon from BLASTp results against NCBI database, as well as alignment and phylogenetic tree were taken into account to determine the presence of a gene. Specifically, for RAD52, RAD52 and its paralog/homolog protein RAD59, RTI1, and RAD22, as well as some other related proteins MGM101 and RDM1 were collected from the NCBI database for query and an e-value with a maximum of 10 was used. Seven additional red genomes were used for identification of RAD52 and MSH1 (Supplementary Data 8 and Supplementary Note 9). Interproscan 5.52–86.0142 was used for domain prediction.
Transit peptides were identified using targetP2.0143 (-org pl). Statistical tests were performed using the Wilcoxon rank sum test in R 4.0.3. Complete mitogenome sequences were used as BLASTn queries against nuclear genome to find NUMTs. Hits with e-value under 1e-04 were considered as NUMTs144,145.
Raw long-read length distribution
Raw long-reads of R. marinus, C. ornatum, P. purpureum, and G. chorda were aligned against mitochondrial, plastid, and nuclear CDSs of each species (Supplementary Data 1). Plastid CDSs for R. marinus and C. ornatum were manually annotated. For P. purpureum (NC_023133.1) and G. chorda (NC_031149.1), plastid genomes were downloaded from the nt database. Reads with hit length over 80% of CDS length and identity over 90% were used. In case of intron-rich genes (e.g., R. marinus nuclear genes), hit length over 200 bp was used as criteria. Finally, only CDS that has considerable coverage (differs genome by genome) was used for analysis to reduce noise signal.
Polymerase chain reaction (PCR) and quantitative PCR (qPCR)
R. marinus total DNA was used to confirm hetero-concatemers and primers were designed to target ends of CDSs and toward the NCR, so that NCRs were amplified. R. marinus cDNA was used to confirm trans-spliced nad5 transcript and primers were designed to target 3' end of nad5-f and 5' end of nad5-s. R. marinus cDNA synthesis was performed using First Strand cDNA Synthesis kit (random hexamer primer; Thermo Scientific, Massachusetts, USA). PCR was performed using AccuPower® PCR PreMix kit (BIONEER, Daejeon, Korea). All primer designs were done using a modified version of Primer3 (2.3.7) built in Geneious 10.2.3. PCR conditions consisted of initial denaturation at 95 °C for 3 min, followed by 35 cycles of denaturation at 95 °C for 30 sec, annealing at 55 °C for 30 sec, extension at 72 °C, and a final 7 min extension step at 72 °C. Extension steps take 4.5 min for the former and 1 min for the latter. PCR products were purified with LaboPassTM PCR kit (Cosmo Genetech, Seoul, Korea). Purified PCR products were sequenced using Sanger method by Macrogen Inc. (Seoul, Korea).
The SsoFastTM EvaGreen® Supermix (Bio-Rad, California, USA) was used for the qPCR assays (two replicates) that were run on a CFX96TM system (Bio-Rad, California, USA). Primers were designed to amplify gene-specific 150 bp fragments and were tested in advance to check for primer-dimer formation in no-template control (NTC). Each tube contained 5 µl of supermix, 0.2 µl of forward and reverse primers, 3.6 µl of nuclease free deionized water, and 1 µl of template DNA (final volume 10 µl). Probes for Southern hybridization were used as template DNAs of standard (see Supplementary Data 2 for more information). Starting from concentration of 1 ng/µl, seven 10-fold serial dilution series were prepared. Starting concentration of standard sample of sdhB was 0.01 ng/µl because sdhB is a nuclear-encoded gene in R. marinus. For target samples, 0.06 ng of gDNA extracted using the CTAB method were used. Concentration of gDNA was measured using Qubit® 2.0 Fluorometer and QubitTM dsDNA BR Assay Kit (Invitrogen, Massachusetts, USA) and all samples underwent the same treatment. Quantitation cycle (Cq) values were calculated in Bio-Rad CFX Manager 2.1 (Cq determination mode = Single Threshold). Copy number was calculated using equation \(\tfrac{x\times N_A}{l\times 660\times {10}^{9}}\), where x is amount of DNA (ng), NA is Avogadro number, and l is length of DNA (bp)146. PCR efficiency were calculated using equation \({10}^{-\tfrac{1}{k}}-1\), where k is the slope of a standard curve147. qPCR conditions consisted of initial denaturation at 95 °C for 3 min, followed by 50 cycles of denaturation at 95 °C for 5 sec, annealing and extension at 60 °C for 15 sec.
Probe synthesis
All fragments for the minicircle DNA genes (atp6, atp9, cob, cox1, cox2, cox3, nad1, nad5-s, and nad5-f), LSU rDNA gene, and the sdhB nuclear gene sequences were prepared from gDNA with specific primers (Supplementary Data 2) by using PCR and were purified using the LaboPassTM PCR kit (Cosmo Genetech, Seoul, Korea) prior to labeling. The digoxigenin (DIG)-labeled probes for Southern blot were synthesized using the DIG-High Prime DNA Labeling and Detection Starter Kit I (Roche Diagnostics, Mannheim, Germany), according to the manufacturer’s instructions.
Southern blot analysis
For the Southern blot analysis, 1 µg of gDNA from of R. marinus was either undigested or digested with each minicircle-suitable restriction enzyme that has only one restriction sites outside the targeted region (Supplementary Fig. 2b). The digestion products were separated using 1% agarose gel electrophoresis in TAE buffer and transferred overnight to a positively charged nylon membrane (Cat. No. 11209299001, Sigma-Aldrich, Missouri, USA) through capillary blotting with 10X Saline-sodium citrate (SSC) buffer. After transfer, the membrane was auto-crosslinked using the Stratagene UV-Stratalinker. The crosslinked membrane was prehybridized, hybridized with the DIG-labeled probes, and then washed. Finally, the hybridized DNA probes were immunodetected with anti-digoxigenin-AP (Fab fragments) and visualized with the colorimetric substrates NBT/BCIP using the DIG-High Prime DNA Labeling and Detection Starter Kit I (Roche Diagnostics, Mannheim, Germany) according to the manufacturer’s instructions. Blot images were stored by photocopying the wet filters.
Microchannel and positive surface preparation
Polydimethylsiloxane (PDMS) devices and positively charged surfaces were prepared as previously described148. In particular, microchannel template was utilized to create two layers on a silicon wafer through repeated photolithography procedures, following the protocol specified in the Kayaku Advanced Materials SU-8 2000 datasheet. First, a silicon wafer was spin-coated with a 20 µm layer of SU-8 2015 photoresist (Kayaku Advanced Materials, Massachusetts, USA) using a spin coater (Midas System SPIN-1200D, Daejeon, Korea). Subsequently, the spin-coated wafer was exposed to 350 nm radiation with an aligner (Midas System MDA-400LJ, Daejeon, Korea) through a mask and developed using an SU-8 developer (Kayaku Advanced Materials, Massachusetts, USA). Next, SU-8 TF 6002 was spin-coated as a second layer on top of the first layer. Since SmartPrint (SmartForce Technologies, La Tronche, France) is compatible with g-line photoresists, SU-8 TF 6002 was used. Following fabrication of the template wafer, the outlet port was created by attaching a tube to the peak area of the triangular section. At last, microchannel template was placed onto a silicon wafer. Then, a mixture (10:1 wt ratio) of PDMS pre-polymer and curing agent (K1 solution, Gwangmyeong, Korea) was poured onto it and incubated at 65 °C for 12 hr. The resulting PDMS layer was peeled from the wafer, and a chamber was created by physically punching a channel into it. The PDMS microchannel underwent oxidation for 30 sec at 100 W in an air plasma generator (Femto Science Cute Basic, Korea). Finally, the PDMS device was washed and stored in deionized water.
Silicon wafers (Wafer market, Yongin, Korea) were purchased with a 30 nm SiO2 layer on top. To eliminate the polymer coating, oxidized silicon wafers and glass coverslips were arranged in a Teflon rack and soaked in piranha etching solution (30:70 v/v H2O2/H2SO4) for 3 hr. The wafers and coverslips were washed thoroughly with deionized water. Neutral pH of 7 was achieved and confirmed by pH paper. Subsequently, the wafers and coverslips were sonicated in deionized water for 30 min, followed by another round of rinsing with deionized water, to uncover the piranha surface. In the end, solutions with a concentration of 1.1 mM were prepared by adding 150 µl of Q-siloxane in 50% methanol to 250 ml of deionized water. Wafers and coverslips were incubated at 65 °C and 100 rpm for 16 hr. Finally, they were rinsed three times with 99.9% ethanol and stored in 99.9% ethanol.
DNA molecule visualization under FM and SEM
DNA molecules pre-mixed with FP-DBP149,150 were stained with 5% polyvinylpyrrolidone (PVP, molecular weight (MW): 40,000) solution. Stained DNA molecules were elongated and immobilized on a positively charged surface using a PDMS microfluidic device. DNA molecules were imaged under a FM. The microscopy system consisted of an inverted microscope (Olympus IX70, Japan) equipped with 100× Olympus UPlanSApo oil immersion objectives and an illuminated LED light source (SOLA SM 2 light engine, Lumencor, OR). Fluorescence images were captured using a scientific complementary metal-oxide semiconductor (sCMOS) camera (PRIME; Photometrics, AZ) and stored in a 16-bit TIFF format generated by Micro-manager software. In addition, DNA molecules were imaged using field emission SEM (FE-SEM; JSM-7100F, JEOL, Japan). Circular and supercoiled DNA molecules that appear as dots under the FM were confirmed under the SEM.
The length of circular and supercoiled DNA molecules was manually measured using ImageJ151. Length of 1 bp is commonly known to be 0.34 nm, however observed length depends on stretching of DNA molecules148,152 and may need case-specific conversion factor. For example, Kosar, et al.153 used conversion factor of 0.36 nm/bp which was calculated from internal standard. For DNA molecules <10 kb, fractional extensions are <80%148. Therefore, we used plasmid with known length (5.2 kb) for correct measurement. Average length of the plasmid was 1233.3 ± 150.6 nm (n = 37), which in turn tells that 1 bp is ~0.24 nm for ~5 kb DNA molecules (70.6% fractional extension).
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
Short-read data generated in this study have been deposited in the NCBI Sequence Read Archive under BioProject PRJNA778797. Mitogenomes for two species and CDSs for other five species generated in this study have been deposited in the NCBI database under accession numbers OK643888-OK643971, ON716284, ON716285, OP177696, and OP146131. Whole Genome Shotgun project of R. marinus generated in this study has been deposited at GenBank under the accession number JAMWBK000000000.1 (GCA_029953675.1) and National Marine Genome Information Center of Korea (http://www.magic.re.kr) under the accession number MA00405. Source data are provided as a Source Data file. The other data generated in this study are deposited in DRYAD database154 (https://doi.org/10.5061/dryad.tqjq2bw0w). Accession numbers for published genetic data used in the study are provided in the Supplementary Data 1 and 7. Source data are provided with this paper.
Code availability
This study does not report custom codes or mathematical algorithms.
References
Williamson, D. The curious history of yeast mitochondrial DNA. Nat. Rev. Genet. 3, 475–481 (2002).
Oldenburg D. J. & Bendich A. J. DNA maintenance in plastids and mitochondria of plants. Front. Plant Sci. 6, 883 (2015).
Bendich, A. J. The end of the circle for yeast mitochondrial DNA. Mol. Cell 39, 831–832 (2010).
Bendich, A. J. The size and form of chromosomes are constant in the nucleus, but highly variable in bacteria, mitochondria and chloroplasts. BioEssays 29, 474–483 (2007).
Smith, D. R. & Keeling, P. J. Mitochondrial and plastid genome architecture: reoccurring themes, but significant differences at the extremes. Proc. Natl. Acad. Sci. USA 112, 10177–10184 (2015).
Stuart K. & Feagin J. E. Mitochondrial DNA of Kinetoplastids. Int. Rev. Cytol. 141, 65–88 (1992).
Shapiro, T. A. & Englund, P. T. The structure and replication of kinetoplast DNA. Annu. Rev. Microbiol. 49, 117–143 (1995).
Marande, W., Lukeš, J. & Burger, G. Unique mitochondrial genome structure in Diplonemids, the sister group of Kinetoplastids. Eukaryot. Cell 4, 1137–1146 (2005).
Spencer, D. F. & Gray, M. W. Ribosomal RNA genes in Euglena gracilis mitochondrial DNA: fragmented genes in a seemingly fragmented genome. Mol. Genet. genomics: MGG 285, 19–31 (2011).
Yu, R. et al. The minicircular and extremely heteroplasmic mitogenome of the holoparasitic plant Rhopalocnemis phalloides. Curr. Biol. 32, 470–479.e475 (2022).
Yahalomi, D. et al. The multipartite mitochondrial genome of Enteromyxum leei (Myxozoa): eight fast-evolving megacircles. Mol. Biol. Evol. 34, 1551–1556 (2017).
Watanabe, S., Fučíková, K., Lewis, L. A. & Lewis, P. O. Hiding in plain sight: Koshicola spirodelophila gen. et sp. nov. (Chaetopeltidales, Chlorophyceae), a novel green alga associated with the aquatic angiosperm Spirodela polyrhiza. Am. J. Bot. 103, 865–875 (2016).
Watanabe, K. I., Bessho, Y., Kawasaki, M. & Hori, H. Mitochondrial genes are found on minicircle DNA molecules in the mesozoan animal Dicyema. J. Mol. Biol. 286, 645–650 (1999).
Suga, K., Mark Welch, D. B., Tanaka, Y., Sakakura, Y. & Hagiwara, A. Two circular chromosomes of unequal copy number make up the mitochondrial genome of the rotifer Brachionus plicatilis. Mol. Biol. Evol. 25, 1129–1137 (2008).
Shao, R., Kirkness, E. F. & Barker, S. C. The single mitochondrial chromosome typical of animals has evolved into 18 minichromosomes in the human body louse, Pediculus humanus. Genome Res. 19, 904–912 (2009).
Zhang, Z., Green, B. R. & Cavalier-Smith, T. Single gene circles in dinoflagellate chloroplast genomes. Nature 400, 155–159 (1999).
Del Cortona, A. et al. The plastid genome in Cladophorales green algae is encoded by hairpin chromosomes. Curr. Biol. 27, 3771–3782.e3776 (2017).
van Beveren, F., Eme, L., López-García, P., Ciobanu, M. & Moreira, D. Independent size expansions and intron proliferation in red algal plastid and mitochondrial genomes. Genome Biol. Evol. 14, evac037 (2022).
Tymms, M. J. & Schweiger, H. G. Tandemly repeated nonribosomal DNA sequences in the chloroplast genome of an Acetabularia mediterranea strain. Proc. Natl. Acad. Sci. USA 82, 1706–1710 (1985).
Smith, D. R. & Lee, R. W. The mitochondrial and plastid genomes of Volvox carteri: bloated molecules rich in repetitive DNA. BMC Genom. 10, 132 (2009).
Preuss M., Verbruggen H., West J. A., Zuccarello G. C. Divergence times and plastid phylogenomics within the intron-rich order Erythropeltales (Compsopogonophyceae, Rhodophyta). J. Phycol. 57, 1035–1044 (2021).
Muñoz-Gómez, S. A. et al. The new red algal subphylum Proteorhodophytina comprises the largest and most divergent plastid genomes known. Curr. Biol. 27, 1677–1684.e1674 (2017).
Kim, D. et al. Group II intron and repeat-rich red algal mitochondrial genomes demonstrate the dynamic recent history of autocatalytic RNAs. BMC Biol. 20, 2 (2022).
Brouard, J.-S., Otis, C., Lemieux, C. & Turmel, M. The exceptionally large chloroplast genome of the green alga Floydiella terrestris illuminates the evolutionary history of the Chlorophyceae. Genome Biol. Evol. 2, 240–256 (2010).
West, J. A., Zuccarello, G. C., Scott, J. L., West, K. A. & Karsten, U. Rhodaphanes brevistipitata gen. et sp. nov., a new member of the Stylonematophyceae (Rhodophyta). Phycologia 46, 440–449 (2007).
West, J. A., Zuccarello, G. C., Scott, J., Pickett-Heaps, J. & Kim, G. H. Observations on Purpureofilum apyrenoidigerum gen. et sp. nov. from Australia and Bangiopsis subsimplex from India (Stylonematales, Bangiophyceae, Rhodophyta). Phycol. Res. 53, 49–66 (2005).
Scott, J., Baca, B., Ott, F. & West, J. Light and electron microscopic observations on Erythrolobus coxiae gen.et sp. nov. (Porphyridiophyceae, Rhodophyta) from Texas U.S.A. Algae 21, 407–416 (2006).
Yang, E. C. et al. Highly conserved mitochondrial genomes among multicellular red algae of the Florideophyceae. Genome Biol. Evol. 7, 2394–2406 (2015).
DePriest, M. S., Bhattacharya, D. & López-Bautista, J. M. The mitochondrial genome of Grateloupia taiwanensis (Halymeniaceae, Rhodophyta) and comparative mitochondrial genomics of red algae. Biol. Bull. 227, 191–200 (2014).
Nan, F. et al. Origin and evolutionary history of freshwater Rhodophyta: further insights based on phylogenomic evidence. Sci. Rep. 7, 2934 (2017).
Cho, C. H. et al. Potential causes and consequences of rapid mitochondrial genome evolution in thermoacidophilic Galdieria (Rhodophyta). BMC Evol. Biol. 20, 112 (2020).
Vlcek, C., Marande, W., Teijeiro, S., Lukes, J. & Burger, G. Systematically fragmented genes in a multipartite mitochondrial genome. Nucleic Acids Res. 39, 979–988 (2011).
Martin, A. P. Metabolic rate and directional nucleotide substitution in animal mitochondrial DNA. Mol. Biol. Evol.12, 1124–1131 (1995).
Asada, K. Production and scavenging of reactive oxygen species in chloroplasts and their functions. Plant Physiol. 141, 391–396 (2006).
Murphy Michael, P. How mitochondria produce reactive oxygen species. Biochem. J. 417, 1–13 (2008).
Shokolenko, I., Venediktova, N., Bochkareva, A., Wilson, G. L. & Alexeyev, M. F. Oxidative stress induces degradation of mitochondrial DNA. Nucleic Acids Res. 37, 2539–2548 (2009).
McEwan, C. E. A., Gatherer, D. & McEwan, N. R. Nitrogen-fixing aerobic bacteria have higher genomic GC content than non-fixing species within the same genus. Hereditas 128, 173–178 (1998).
Naya, H., Romero, H., Zavala, A., Alvarez, B. & Musto, H. Aerobiosis increases the genomic guanine plus cytosine content (GC%) in prokaryotes. J. Mol. Evol. 55, 260–264 (2002).
Foerstner, K. U., von Mering, C., Hooper, S. D. & Bork, P. Environments shape the nucleotide composition of genomes. EMBO Rep. 6, 1208–1213 (2005).
Bernardi, G. The vertebrate genome: isochores and evolution. Mol. Biol. Evol. 10, 186–204 (1993).
Vinogradov, A. E. Isochores and tissue‐specificity. Nucleic Acids Res. 31, 5212–5220 (2003).
Vinogradov, A. E. Noncoding DNA, isochores and gene expression: nucleosome formation potential. Nucleic Acids Res. 33, 559–563 (2005).
Duret, L. & Galtier, N. Biased gene conversion and the evolution of mammalian genomic landscapes. Annu. Rev. Genom. Hum. Genet. 10, 285–311 (2009).
Bai, C. & Warshel, A. Revisiting the protomotive vectorial motion of F0-ATPase. Proc. Natl. Acad. Sci. USA 116, 19484–19489 (2019).
Kampjut, D. & Sazanov, L. A. Structure of respiratory complex I—an emerging blueprint for the mechanism. Curr. Opin. Struct. Biol. 74, 102350 (2022).
Baradaran, R., Berrisford, J. M., Minhas, G. S. & Sazanov, L. A. Crystal structure of the entire respiratory complex I. Nature 494, 443–448 (2013).
Jonckheere, A. I., Smeitink, J. A. M., Rodenburg, R. J. T. & Mitochondrial, A. T. P. synthase: architecture, function and pathology. J. Inherit. Metab. Dis. 35, 211–225 (2012).
Hahn, A., Vonck, J., Mills, D. J., Meier, T. & Kühlbrandt, W. Structure, mechanism, and regulation of the chloroplast ATP synthase. Science 360, eaat4318 (2018).
Meunier B., Fisher N., Ransac S., Mazat J-P, Brasseur G. Respiratory complex III dysfunction in humans and the use of yeast as a model organism to study mitochondrial myopathy and associated diseases. Biochim. Biophys. Acta 1827, 1346–1361 (2012).
Wikström, M., Sharma, V., Kaila, V. R. I., Hosler, J. P. & Hummer, G. New perspectives on proton pumping in cellular respiration. Chem. Rev. 115, 2196–2221 (2015).
Mulkidjanian, A. Y. Activated Q-cycle as a common mechanism for cytochrome bc1 and cytochrome b6f complexes. Biochim. Biophys. Acta 1797, 1858–1868 (2010).
Fullerton M., McFarland R., Taylor R. & Alston C. The genetic basis of isolated mitochondrial complex II deficiency. Mol. Genet. Metab. 131, 53–65 (2020).
Kühlbrandt, W. Structure and function of mitochondrial membrane protein complexes. BMC Biol. 13, 89 (2015).
Yankovskaya, V. et al. Architecture of succinate dehydrogenase and reactive oxygen species generation. Science 299, 700–704 (2003).
Allen, J. F. Control of gene expression by redox potential and the requirement for chloroplast and mitochondrial genomes. J. Theor. Biol. 165, 609–631 (1993).
Allen, J. F., Raven, J. A. & Allen, J. F. The function of genomes in bioenergetic organelles. Philos. Trans. R. Soc. Lond. Ser. B: Biol. Sci. 358, 19–38 (2003).
Adams, K. L., Ong, H. C. & Palmer, J. D. Mitochondrial gene transfer in pieces: fission of the ribosomal protein gene rpl2 and partial or complete gene transfer to the nucleus. Mol. Biol. Evol. 18, 2289–2297 (2001).
Funes, S. et al. A green algal apicoplast ancestor. Science 298, 2155–2155 (2002).
Kück, U., Jekosch, K. & Holzamer, P. DNA sequence analysis of the complete mitochondrial genome of the green alga Scenedesmus obliquus: evidence for UAG being a leucine and UCA being a non-sense codon. Gene 253, 13–18 (2000).
Nedelcu, A. M., Lee, R. W., Lemieux, C., Gray, M. W. & Burger, G. The complete mitochondrial DNA sequence of Scenedesmus obliquus reflects an intermediate stage in the evolution of the green algal mitochondrial genome. Genome Res. 10, 819–831 (2000).
Jensen, R. E. & Englund, P. T. Network news: the replication of kinetoplast DNA. Annu. Rev. Microbiol. 66, 473–491 (2012).
Borst, P. Why kinetoplast DNA networks? Trends Genet. 7, 139–141 (1991).
Smith, J. M. & Szathmáry, E. The origin of chromosomes I. selection for linkage. J. Theor. Biol. 164, 437–446 (1993).
Thiemann, O. H., Maslov, D. A. & Simpson, L. Disruption of RNA editing in Leishmania tarentolae by the loss of minicircle-encoded guide RNA genes. EMBO J. 13, 5689–5700 (1994).
Simpson, L. The genomic organization of guide RNA genes in kinetoplastid protozoa: several conundrums and their solutions. Mol. Biochem. Parasitol. 86, 133–141 (1997).
Valach, M., Moreira, S., Hoffmann, S., Stadler, P. F. & Burger, G. Keeping it complicated: mitochondrial genome plasticity across diplonemids. Sci. Rep. 7, 14166 (2017).
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Chacinska, A., Koehler, C. M., Milenkovic, D., Lithgow, T. & Pfanner, N. Importing mitochondrial proteins: machineries and mechanisms. Cell 138, 628–644 (2009).
Sánchez, H., Fester, T., Kloska, S., Schröder, W. & Schuster, W. Transfer of rps19 to the nucleus involves the gain of an RNP-binding motif which may functionally replace RPS13 in Arabidopsis mitochondria. EMBO J. 15, 2138–2149 (1996).
Adams, K. L., Daley, D. O., Whelan, J. & Palmer, J. D. Genes for two mitochondrial ribosomal proteins in flowering plants are derived from their chloroplast or cytosolic counterparts. Plant Cell 14, 931–943 (2002).
Kubo, N. & Arimura, S.-I. Discovery of the rpl10 gene in diverse plant mitochondrial genomes and its probable replacement by the nuclear gene for chloroplast RPL10 in two lineages of Angiosperms. DNA Res. 17, 1–9 (2010).
Petersen, G. et al. Genes from oxidative phosphorylation complexes II-V and two dual-function subunits of complex I are transcribed in Viscum album despite absence of the entire mitochondrial holo-complex I. Mitochondrion 62, 1–12 (2022).
Maclean, A. E. et al. Absence of complex I is associated with diminished respiratory chain function in European mistletoe. Curr. Biol. 28, 1614–1619.e1613 (2018).
Michalecka, A. M. et al. Arabidopsis genes encoding mitochondrial type II NAD(P)H dehydrogenases have different evolutionary origin and show distinct responses to light. Plant Physiol. 133, 642–652 (2003).
Lopez, J. V., Yuhki, N., Masuda, R., Modi, W. & O’Brien, S. J. Numt, a recent transfer and tandem amplification of mitochondrial DNA to the nuclear genome of the domestic cat. J. Mol. Evol. 39, 174–190 (1994).
Ku, C. et al. Endosymbiotic gene transfer from prokaryotic pangenomes: Inherited chimerism in eukaryotes. Proc. Natl. Acad. Sci. USA 112, 10139–10146 (2015).
Kleine, T., Maier, U. G. & Leister, D. DNA transfer from organelles to the nucleus: the idiosyncratic genetics of endosymbiosis. Annu. Rev. Plant Biol. 60, 115–138 (2009).
Novikova, O. & Belfort, M. Mobile group II introns as ancestral eukaryotic elements. Trends Genet. 33, 773–783 (2017).
Zhang, Z., Cavalier-Smith, T. & Green, B. R. A family of selfish minicircular chromosomes with jumbled chloroplast gene fragments from a dinoflagellate. Mol. Biol. Evol. 18, 1558–1565 (2001).
Shao, R. & Barker, S. C. Chimeric mitochondrial minichromosomes of the human body louse, Pediculus humanus: evidence for homologous and non-homologous recombination. Gene 473, 36–43 (2011).
Sanchez-Puerta, M. V. et al. Genome-scale transfer of mitochondrial DNA from legume hosts to the holoparasite Lophophytum mirabile (Balanophoraceae). Mol. Phylogenet. Evol. 132, 243–250 (2019).
Roulet, M. E. et al. Multichromosomal structure and foreign tracts in the Ombrophytum subterraneum (Balanophoraceae) mitochondrial genome. Plant Mol. Biol. 103, 623–638 (2020).
Sanchez-Puerta, M. V., García, L. E., Wohlfeiler, J. & Ceriotti, L. F. Unparalleled replacement of native mitochondrial genes by foreign homologs in a holoparasitic plant. New Phytol. 214, 376–387 (2017).
Hunt, V. L. et al. The genomic basis of parasitism in the Strongyloides clade of nematodes. Nat. Genet. 48, 299–307 (2016).
Phillips, W. S. et al. The mitochondrial genome of Globodera ellingtonae is composed of two circles with segregated gene content and differential copy numbers. BMC Genom. 17, 706 (2016).
Guo, W., Zhu, A., Fan, W. & Mower, J. P. Complete mitochondrial genomes from the ferns Ophioglossum californicum and Psilotum nudum are highly repetitive with the largest organellar introns. New Phytol. 213, 391–403 (2017).
Zverkov O. A. et al. Dicyemida and orthonectida: two stories of body plan simplification. Front. Genet. 10, 443(2019).
Zhang, Z., Cavalier-Smith, T. & Green, B. R. Evolution of dinoflagellate unigenic minicircles and the partially concerted divergence of their putative replicon origins. Mol. Biol. Evol. 19, 489–500 (2002).
Odahara, M., Nakamura, K., Sekine, Y. & Oshima, T. Ultra-deep sequencing reveals dramatic alteration of organellar genomes in Physcomitrella patens due to biased asymmetric recombination. Commun. Biol. 4, 633 (2021).
Sallmyr, A. & Tomkinson, A. E. Repair of DNA double-strand breaks by mammalian alternative end-joining pathways. J. Biol. Chem. 293, 10536–10546 (2018).
Jalan M., Olsen K. S. & Powell S. N. Emerging roles of RAD52 in genome maintenance. Cancers 11, 1038 (2019).
Stein, A., Kalifa, L. & Sia, E. A. Members of the RAD52 epistasis group contribute to mitochondrial homologous recombination and double-strand break repair in Saccharomyces cerevisiae. PLOS Genet. 11, e1005664 (2015).
Samach, A., Melamed-Bessudo, C., Avivi-Ragolski, N., Pietrokovski, S. & Levy, A. A. Identification of plant RAD52 homologs and characterization of the Arabidopsis thaliana RAD52-like genes. Plant Cell 23, 4266–4279 (2011).
Janicka, S. et al. A RAD52-like single-stranded DNA binding protein affects mitochondrial DNA repair by recombination. Plant J. 72, 423–435 (2012).
Fukui, K. et al. The GIY-YIG endonuclease domain of Arabidopsis MutS homolog 1 specifically binds to branched DNA structures. FEBS Lett. 592, 4066–4077 (2018).
Odahara, M., Kishita, Y. & Sekine, Y. MSH1 maintains organelle genome stability and genetically interacts with RECA and RECG in the moss Physcomitrella patens. Plant J. 91, 455–465 (2017).
Muthye, V. & Lavrov, D. V. Multiple losses of MSH1, gain of mtMutS, and other changes in the MutS family of DNA repair proteins in animals. Genome Biol. Evol. 13, evab191 (2021).
Wu, Z., Waneka, G., Broz, A. K., King, C. R. & Sloan, D. B. MSH1 is required for maintenance of the low mutation rates in plant mitochondrial and plastid genomes. Proc. Natl. Acad. Sci. USA 117, 16448–16455 (2020).
Burger, G., Moreira, S. & Valach, M. Genes in hiding. Trends Genet. 32, 553–565 (2016).
Kang, J.-S. et al. Distinctive evolutionary pattern of organelle genomes linked to the nuclear genome in Selaginellaceae. Plant J. 104, 1657–1672 (2020).
Cameron, S. L., Yoshizawa, K., Mizukoshi, A., Whiting, M. F. & Johnson, K. P. Mitochondrial genome deletions and minicircles are common in lice (Insecta: Phthiraptera). BMC Genom. 12, 394 (2011).
Ruhanen, H. et al. Mitochondrial single-stranded DNA binding protein is required for maintenance of mitochondrial DNA and 7S DNA but is not required for mitochondrial nucleoid organisation. Biochim. Biophys. Acta 1803, 931–939 (2010).
Viguera, E., Canceill, D. & Ehrlich, S. D. Replication slippage involves DNA polymerase pausing and dissociation. EMBO J. 20, 2587–2595 (2001).
Ait Saada, A. et al. Unprotected replication forks are converted into mitotic sister chromatid bridges. Mol. Cell 66, 398–410.e394 (2017).
Pardo, B., Moriel-Carretero, M., Vicat, T., Aguilera, A. & Pasero, P. Homologous recombination and Mus81 promote replication completion in response to replication fork blockage. EMBO Rep. 21, e49367 (2020).
Robu, M. E., Inman, R. B. & Cox, M. M. RecA protein promotes the regression of stalled replication forks in vitro. Proc. Natl. Acad. Sci. USA 98, 8211–8218 (2001).
García-Medel, P. L. et al. Plant organellar DNA polymerases repair double-stranded breaks by microhomology-mediated end-joining. Nucleic Acids Res. 47, 3028–3044 (2019).
Sugiyama, T., New, J. H. & Kowalczykowski, S. C. DNA annealing by Rad52 protein is stimulated by specific interaction with the complex of replication protein A and single-stranded DNA. Proc. Natl. Acad. Sci. USA 95, 6049–6054 (1998).
Bhattacharya, D. et al. When less is more: red algae as models for studying gene loss and genome evolution in eukaryotes. Crit. Rev. Plant Sci. 37, 81–99 (2018).
Ahn, J.-S. et al. Optimization of RNA purification method from Ecklonia cava Kjellman (Laminariales, Phaeophyceae). Algae 19, 123–127 (2004).
Nurk, S. et al. Assembling single-cell genomes and mini-metagenomes from chimeric MDA products. J. Comput. Biol. 20, 714–737 (2013).
Walker, B. J. et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLOS One 9, e112963 (2014).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Li, H. Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences. Bioinformatics 32, 2103–2110 (2016).
Dierckxsens, N., Mardulyn, P. & Smits, G. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 45, e18–e18 (2017).
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011).
Boratyn, G. M. et al. BLAST: a more efficient report with usability improvements. Nucleic Acids Res. 41, W29–W33 (2013).
Boyen, C., Leblanc, C., Bonnard, G., Grienenberger, J. M. & Kloareg, B. Nucleotide sequence of the cox3 gene from Chondrus crispus: evidence that UGA encodes tryptophan and evolutionary implications. Nucleic Acids Res. 22, 1400–1403 (1994).
Steinegger, M. & Söding, J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat. Biotechnol. 35, 1026–1028 (2017).
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Hoff, K., Lange, S., Lomsadze, A., Borodovsky, M. & Stanke, M. BRAKER1: unsupervised RNA-Seq-based genome annotation with GeneMark-ET and AUGUSTUS. Bioinformatics 32, 767–769 (2015).
Cho, C. H. et al. Genome-wide signatures of adaptation to extreme environments in red algae. Nat. Commun. 14, 10 (2023).
Li, W. & Godzik, A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658–1659 (2006).
Fu, L., Niu, B., Zhu, Z., Wu, S. & Li, W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 28, 3150–3152 (2012).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Henikoff, S. & Henikoff, J. G. Amino acid substitution matrices from protein blocks. Proc. Natl. Acad. Sci. USA 89, 10915 (1992).
Nguyen, L.-T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods 14, 587–589 (2017).
Zhang, Z. et al. ParaAT: a parallel tool for constructing multiple protein-coding DNA alignments. Biochem. Biophys. Res. Commun. 419, 779–781 (2012).
Zhang, Z. et al. KaKs_Calculator: calculating Ka and Ks through model selection and model averaging. Genom. Proteom. Bioinform. 4, 259–263 (2006).
Laslett, D. & Canback, B. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res. 32, 11–16 (2004).
Lagesen, K. et al. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35, 3100–3108 (2007).
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Li, H. New strategies to improve minimap2 alignment accuracy. Bioinformatics 37, 4572–4574 (2021).
Buchfink, B., Reuter, K. & Drost, H.-G. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat. Methods 18, 366–368 (2021).
Gualberto, J. M. & Newton, K. J. Plant mitochondrial genomes: dynamics and mechanisms of mutation. Annu. Rev. Plant Biol. 68, 225–252 (2017).
Odahara M. Factors affecting organelle genome stability in Physcomitrella patens. Plants 9, 145 (2020).
Mishra, A., Saxena, S., Kaushal, A. & Nagaraju, G. RAD51C/XRCC3 facilitates mitochondrial DNA replication and maintains integrity of the mitochondrial genome. Mol. Cell. Biol. 38, e00489–00417 (2018).
Maréchal, A. & Brisson, N. Recombination and the maintenance of plant organelle genome stability. New Phytol. 186, 299–317 (2010).
Pannunzio, N. R., Watanabe, G., Lieber, M. R. & Nonhomologous, D. N. A. end-joining for repair of DNA double-strand breaks. J. Biol. Chem. 293, 10512–10523 (2018).
Blum, M. et al. The InterPro protein families and domains database: 20 years on. Nucleic Acids Res. 49, D344–D354 (2021).
Almagro Armenteros, J. J. et al. Detecting sequence signals in targeting peptides using deep learning. Life Sci. Alliance 2, e201900429 (2019).
Richly, E. & Leister, D. NUMTs in sequenced eukaryotic genomes. Mol. Biol. Evol. 21, 1081–1084 (2004).
Pamilo, P., Viljakainen, L. & Vihavainen, A. Exceptionally high density of NUMTs in the honeybee genome. Mol. Biol. Evol. 24, 1340–1346 (2007).
Whelan, J. A., Russell, N. B. & Whelan, M. A. A method for the absolute quantification of cDNA using real-time PCR. J. Immunol. Methods 278, 261–269 (2003).
Kubista, M. et al. The real-time polymerase chain reaction. Mol. Asp. Med. 27, 95–125 (2006).
Kim, T. et al. Counting DNA molecules on a microchannel surface for quantitative analysis. Talanta 252, 123826 (2023).
Lee, S. et al. DNA binding fluorescent proteins for the direct visualization of large DNA molecules. Nucleic Acids Res 44, e6 (2016).
Park, J. et al. Single-molecule DNA visualization using AT-specific red and non-specific green DNA-binding fluorescent proteins. Analyst 144, 921–927 (2019).
Schneider, C. A., Rasband, W. S. & Eliceiri, K. W. NIH Image to ImageJ: 25 years of image analysis. Nat. Methods 9, 671–675 (2012).
Lee, S. et al. Investigation of various fluorescent protein–DNA binding peptides for effectively visualizing large DNA molecules. RSC Adv. 6, 46291–46298 (2016).
Kosar, M., Piccini, D., Foiani, M. & Giannattasio, M. A rapid method to visualize humanmitochondrial DNA replication through rotary shadowing and transmission electronmicroscopy. Nucleic Acids Res. 49, e121 (2021).
Lee, Y. et al. Origin of minicircular mitochondrial genomes in red algae. Dryad Digit. Reposit. https://doi.org/10.5061/dryad.tqjq2bw0w (2022).
Acknowledgements
We are grateful to our collaborator the late Franklyn D. Ott, who provided the culture strain of Rufusia pilicola O7031. We thank to Woongghi Shin and Minseok Jeong (Chungnam National University) for technical supports on light microscopy images of Stylonematophyceae species and Jeremy Wideman (Arizona State University) for valuable comments. We also acknowledge Hyun Ju Jung (Yonsei University) for helpful comments about southern blot. This work was supported by the Collaborative Genome Program of the Korea Institute of Marine Science and Technology Promotion (KIMST) funded to the Ministry of Oceans and Fisheries (MOF) (grant numbers 20180430 and 20210469), the National Research Foundation of Korea (grant number NRF-2020R1I1A1A01074092, 2022R1A2B5B03002312, and 2022R1A5A1031361), and the National Institute of Food and Agriculture and United States Department of Agriculture (grant number NJ01180).
Author information
Authors and Affiliations
Contributions
H.S.Y., Y.L., and C.H.C. designed the study and Y.L. and C.H.C. analyzed the data. Y.L., Y.M.L., S.I.P., and J.H.Y. did wet lab experiments. Y.L., S.I.P., Y.M.L., and J.A.W. provided cultured strains. H.S.Y. and K.J. supervised the study. C.N. and K.J. generated fluorescence and scanning electron microscopy images. Y.L., C.H.C., D.B., and H.S.Y. wrote the manuscript. All authors have read and edited the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks M. Virginia Sanchez-Puerta, Claudio H. Slamovits and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Lee, Y., Cho, C.H., Noh, C. et al. Origin of minicircular mitochondrial genomes in red algae. Nat Commun 14, 3363 (2023). https://doi.org/10.1038/s41467-023-39084-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-023-39084-2
- Springer Nature Limited
This article is cited by
-
Mitogenome features and phylogenetic analysis of red algae, Grateloupia cornea (Rhodophyta, Halymeniales)
Journal of Genetics (2024)