Introduction

Plastids, unique organelles in photosynthetic eukaryotes, are capable of oxygenic photosynthesis and have their own genomes and genetic systems. Endosymbiotic cyanobacteria are widely accepted to be the origin of plastids. There are two types of plastids; one is a direct descendent of the primary cyanobacterium endosymbiont (i.e., “primary” plastids) and the other was acquired secondarily from primary plastids, that is, endosymbiosis of plastid-bearing eukaryotes by another eukaryotic host (i.e., “secondary” or “tertiary” plastids) (Kim and Archibald 2009). Primary plastids are found in Archaeplastida, including three phyla, Glaucophyta, Rhodophyta and Viridiplantae.

Rhodophyta (red alga) is divided into two subphyla with seven classes (Yoon et al. 2006): the subphylum Cyanidiophytina contains one class (Cyanidiophyceae) and the subphylum Rhodophytina contains six classes (Florideophyceae, Bangiophyceae, Compsopogonophyceae, Stylonematophyceae, Rhodellophyceae and Porphyridiophyceae). The complete plastid genomes of Rhodophyta were reported in only three classes: two species of Cyanidiophyceae (Glöckner et al. 2000; Ohta et al. 2003), four species of Florideophyceae (DePriest et al. 2013; Hagopian et al. 2004; Janouškovec et al. 2013) and three species of Bangiophyceae (Reith and Munholland 1995; Wang et al. 2013). The detailed phylogenetic relationships of Rhodophyta remain unidentified except in these three classes.

The plastid of Chromalveolata (stramenopiles, Alveolata, Cryptophyta and Haptophyta) originates from Rhodophyta (Cavalier-Smith 1998), but a more accurate phylogenetic relationship remains to be determined, that is, whether the plastid of Chromalveolata originated from the ancestor of Rhodophytina or the ancestor of (Florideophyceae + Bangiophyceae), because the only three classes for the complete plastid genome of Rhodophyta were used in the phylogenetic analyses (Wang et al. 2013). Recently, a draft genome of Porphyridium purpureum CCMP 1328 was reported (Bhattacharya et al. 2013), but the complete plastid genome of this species has not been published.

Here, we report the sequence of the plastid genome of Porphyridium purpureum strain NIES 2140. This strain is a unicellular marine rhodophyte belonging to the unsequenced class Porphyridiophyceae. We analyzed its genome structure and gene content. In addition, using concatenated amino acid alignments of plastid-encoded proteins, we determined the phylogenetic relationship of P. purpureum and the origin of the Chromalveolata plastid.

Materials and methods

DNA sources

P. purpureum strain NIES 2140 was obtained from the collection of the National Institute for Environmental Studies (Tsukuba, Japan). Cells of P. purpureum were cultured in a modified medium similar to enriched sea water medium (ESM) (http://mcc.nies.go.jp/02medium.html#esm) (Okaichi et al. 1982) without soil extract, but including 35 g l−1 of an artificial seawater called Viesalt (MARINETECH, Tokyo, Japan) instead of seawater. The preculture was statically grown at 22 °C under white light (12 h light: 12 h dark; 20 μmol m−2 s−1) for about a month. For DNA isolation, the cells were grown at 30 °C with aeration by 1 % CO2 under white light at 30 μmol m−2 s−1 for 6 days. The cells were harvested by centrifugation, and disrupted with glass beads. DNA was released by treatment with cetyltrimethylammonium bromide (CTAB) and proteinase K, extracted with phenol/chloroform, and purified by CsCl ultracentrifugation.

DNA sequencing

Genomic DNA (5.5 μg) was sheared by ultrasonic treatment and sequenced with a genome sequencer FLX instrument (Roche Diagnostics, Indianapolis, IN, USA) according to the manufacturer’s protocol (this is usually referred to as ‘454 sequencing’). Total genome data of P. purpureum strain NIES 2140 were assembled with Newbler Matrics version 2.5.3 into 3,355 contigs (average length = 5,775 bp). The contigs had a peak depth of 27.0 and totaled 19.4 Mbp. To find its genomic origin, namely, nuclear genome or organelle genomes, each contig was analyzed by blastn software version 2.2.18 against the sequences of the organelles in algae and plants as well as the nuclear genome. The sequenced reads corresponding to plastid sequences were assembled into 13 contigs. PCR experiments were performed to fill the gaps. This plastid genome has a pair of nearly identical inverted repeat sequences. To validate the sequence differences, each of the repeat sequences was separately amplified by long PCR. PCR products were sequenced by conventional Sanger sequencing, using the sequencing services of FASMAC Co. Ltd. (Atsugi, Japan). The complete plastid genome is available under the GenBank Accession Number AP012987.

Genome annotations and data analyses

To detect the open reading frames (ORFs), we used the MetaGeneAnnotator program (Noguchi et al. 2008). Then, we used the tblastn program, which was run against the cluster data of CyanoClust (Sasaki and Sato 2010) prepared by the Gclust software (Sato 2009). Homology search using tblastn was also run against the plastid genomes of seven species of Heterokontophyta: Ectocarpus siliculosus, Fistulifera sp. JPCC DA0580, Fucus vesiculosus, Heterosigma akashiwo, Synedra acus, Thalassiosira oceanica and Vaucheria litorea (Supplementary material 1). Finally, we searched for ORFs that were longer than 30 codons and started with ATG or GTG. Homology search of ORFs with orthologs in other plastid genomes of algae and plants, and cluster analysis of proteins were performed using the described database (Supplementary material 1). To detect the RNA genes, we used the blastn program against the plastid genomes of Rhodophyta and Glaucophyta (Supplementary material 1). To search for tRNA genes, we used the tRNAscan-SE program (Lowe and Eddy 1997). To detect introns, we used the RNAweasel program (Lang et al. 2007), EST-data of related species (Chan et al. 2011) and the result of alignment analysis. We compared the rRNA operon structure of P. purpureum with that of sequenced algae and plants in the GOBASE database (O’Brien et al. 2009). In addition, the plastid genomes of the following species were used as references: Paulinella chromatophora, seven species of Heterokontophyta (as described above), and four species of Rhodophyta (Pyropia haitanensis, Calliarthron tuberculosum, Chondrus crispus, and Grateloupia taiwanensis) (Supplementary material 1). Processing of DNA and protein sequences was performed with the SISEQ software version 1.59 (Sato 2000). Sequence alignment of tRNA(CAU) genes was constructed with ClustalX software version 2.0.9 (Larkin et al. 2007). Other sequence alignments were constructed with ClustalW/ClustalX software version 1.8.3 (Thompson et al. 1994) or Muscle software version 3.6 (Edgar 2004). The genomic sequence was manipulated using the Artemis software version 13.0 (Rutherford et al. 2000).

Phylogenetic analyses of Rhodophyta and Chromalveolata

For phylogenetic analyses of Rhodophyta and Chromalveolata, four types of sequence alignments of proteins were constructed: three types used sequences that are conserved in six species of Rhodophyta, 13 species of Chromalveolata and 17 species of cyanobacteria, and one type used sequences that are conserved in six species of Rhodophyta, 13 species of Chromalveolata, 17 species of cyanobacteria, six species of Viridiplantae and one species of Glaucophyta. The analyzed proteins are listed in Supplementary material 2. The four alignments of concatenated protein data are described in Supplementary material 3a–d, respectively. We used maximum likelihood (ML), neighbor-joining (NJ) and Bayesian inference (BI) methods for phylogenetic inference. ML analysis was performed using TREEFINDER software version March 2011 (Jobb et al. 2004). The evolutionary model was selected with the ProtTest program version 2.4 (Abascal et al. 2005). The LG model was selected, but because it was not implemented in MEGA and MrBayes, the best available model was used for each respective analysis. NJ analysis was carried out using MEGA software version 5.05 (Tamura et al. 2011) with the JTT model. BI analysis was performed using MrBayes version 3.2.1 (Ronquist and Huelsenbeck 2003) with the WAG model. Two chains were run; trees were sampled at every 200 for 1,000,000 generations and the first 2,000 trees were discarded as burn-in.

Phylogenetic analyses of introns

For phylogenetic analysis of P. purpureum maturases, homologous proteins were obtained from the dataset “Cyanoclust4” (Cluster No. 5 and Cluster No. 4454) of the Gclust database (Sato 2009). We used sequences that were more than 350 amino acids in length. The phylogenetic relationship was analyzed by the ML method using the LG model. The alignment of maturases is provided in Supplementary material 3e.

Phylogenetic analysis of group II introns was performed by the ML method with the GTR + G + I model. The alignment of group II introns is provided in Supplementary material 3f.

Results

Structure of the plastid genome of P. purpureum

We sequenced the plastid genome of P. purpureum strain NIES 2140 using Roche 454 GS FLX. The sequenced reads were assembled into 13 contigs, and gaps were filled completely by Sanger sequencing. The plastid genome of P. purpureum is a circular molecule composed of 217,694 bp, with a GC content of 30.3 % (Table 1). It is larger than the plastid genomes of other species of Rhodophyta due to the presence of many introns (total length: 30,433 bp) in P. purpureum. The plastid genome of P. purpureum encodes one non-coding RNA (ncRNA) (rnpB), six rRNAs (two rrs, two rrl, and two rrf), 29 tRNAs and 224 proteins, including unidentified ORFs (Table S1, Fig. 1a).

Table 1 Major features of the plastid genomes of Rhodophyta reported to date
Fig. 1
figure 1

Structure of the plastid genome of P. purpureum. a A gene map of the plastid genome of P. purpureum. The two internal sectors represent RNA genes, whereas the two external sectors represent protein-coding genes. The outer rings and inner rings of each set represent genes on the clockwise and counterclockwise strands, respectively. Protein-coding genes are color-coded by their function, as shown at the bottom left corner. The belts in protein-coding regions indicate intron regions. b rRNA operon structure of the plastid genome. The region between the two IRs is called the small single copy (SSC) region. IR1 is shown to the right. Each single capital letter indicates the tRNA gene corresponding to the amino acid abbreviation

Generally, plastid genomes have an inverted repeat (IR) structure consisting of a pair of identical sequences containing rRNA operons. The rRNA operons of plastid genomes of most algae contain three rRNA genes (rrs, rrl, and rrf) and two tRNA genes, namely, tRNAIle(GAU) and tRNAAla(UGC). The tRNAIle(GAU) and the tRNAAla(UGC) genes are located between the rrs (encoding 16S rRNA) and the rrl (encoding 23S rRNA) genes (Fig. 1b, Fig. S1). rRNA operons of plastid genomes in Bangiophyceae are a pair of direct repeat structures. In Cyanidiophyceae and Florideophyceae, the plastid genome has only one rRNA operon.

In the plastid genome of P. purpureum, the rRNA genes exist as a pair of nearly identical IRs. One of them (IR1, right) has a tRNAAla(UGC) gene between the rrs and the rrl genes, while another (IR2, left) has a tRNAIle(GAU) gene (Fig. 1a). This was confirmed by sequencing of the two IRs, which were amplified separately. According to the EST-data (Chan et al. 2011), the plastid genome of a related species (Porphyridium purpureum CCMP 1328) holds a similar structure of nonidentical IRs. This type of IR architecture involving nonidentical rRNA operons is unique to the class of Porphyridiophyceae among all sequenced plastid genomes (Fig. 1b, Fig. S1).

Comparison of protein-coding genes

We conducted cluster analysis of all proteins in the plastid genome of P. purpureum (Fig. S2). We performed all-against-all blastp by using a data set including all of the plastid-encoded proteins of P. purpureum, five species of Rhodophyta, 13 species of photosynthetic Chromalveolata, a Glaucophyta (Cyanophora paradoxa), 40 species of Viridiplantae, and all the proteins of 41 species of cyanobacteria (Supplementary material 1). 37 protein genes of P. purpureum do not have a homolog in any other analyzed genome. Five genes (mat1a, mat1b, mat1c, mat1d and mat1e) are homologs of known maturase genes that are conserved in Rhodomonas salina and some cyanobacteria. Except for these and ycf12 and ycf80, all protein genes encoded by the plastid genome of P. purpureum are conserved in cyanobacteria. The 57 genes, which are conserved in all phototrophs, include genes encoding RNA polymerases, ribosomal proteins, cytochrome complexes, ATP synthases and photosystems I and II. The genes for subunits of allophycocyanin (apcA, apcB and apcD), phycocyanin (cpcA and cpcB) and phycobilisome (apcE and apcF) are conserved in Rhodophyta and C. paradoxa, but not in Chromalveolata. The genes for ribonucleases (rne and ycf56), phycoerythrin-related proteins (cpeA and cpeB) and tRNA synthetases (syfB and syh) are conserved in Rhodophytina. The homologs of manganese ABC transporter genes (mntA and mntB) are conserved in C. paradoxa and P. purpureum.

We analyzed the loss of plastid protein-coding genes during the evolution of Rhodophyta (Fig. S3). Nineteen genes are conserved only in Cyanidiophytina. The genes for protochlorophyllide reductase proteins (chlB, chlL and chlN) are conserved only in Bangiophyceae. The genes for the subunits of 3-isopropylmalate dehydratase (leuC and leuD) were considered to be acquired by horizontal gene transfer into the plastid genome only in Gracilaria tenuistipitata (Janouškovec et al. 2013). The genes for acetyl-CoA carboxylase (ACCase) (accA, accB and accD), acetylglutamate kinase (argB) and ycf29 are conserved in the plastid genomes of Rhodophyta, except P. purpureum. P. purpureum has only the homolog of the nuclear-encoded single-subunit eukaryotic form of ACCase in the nuclear genome (data not shown), and lacks the prokaryotic form that is commonly encoded in the plastid genomes, as reported in Gramineae (Konishi et al. 1996) and Chromalveolata (Livne and Sukenik 1990; Roessler and Ohlrogge 1993).

Introns and maturase genes

We detected introns by various means, namely, using the RNAweasel program (Lang et al. 2007), comparison with EST-data of related species (Chan et al. 2011) and alignment analysis of putative protein sequences. Among the 224 protein-coding genes, 29 genes contain one or multiple intron(s) (Table S2). A group I intron was found in rpl28. This intron has a secondary structure typical for group I introns (pseudoknot structure) (Cech 1988; Lang et al. 2007) (Fig. S4). Others were group II introns. Introns in dnaK, infC, gltB and rpoC2 encode a maturase. In total, the P. purpureum plastid genome has five maturase genes (Fig. S5). mat1a is located in the intergenic region between atpE and orf44. All maturases contain RT domain and X domain, and only Mat1d contains a Zn domain (nuclease domain). Mat1a and Mat1b lost the YxDD motif (polymerase active site, in domain 5) (Zimmerly et al. 2001). The intron within the dnaK gene, which is written as int.b (dnaK) in the present article, contains mat1b, but this intron was not detected using the RNAweasel program. Almost all land plants have intronic maturase genes (matK) within the plastid trnK genes (Neuhaus and Link 1987), but the maturases of P. purpureum were not homologous to MatK. According to phylogenetic analysis, two types of maturases are encoded by the plastid genome of P. purpureum (Fig. S6). One type contains Mat1a, Mat1c, Mat1d and Mat1e. Mat1a is closely related to the plastid-encoded maturase of Rhodomonas salina. Mat1b differs from these. Mat1d seems to retain mobility, and the introns encoding this type of maturase may have been present as several copies inserted at difference sites. According to the phylogenic analysis of introns (Fig. S7), int (mntA), int.b (rpoC2) and int.b (atpI) are closely related to the intron encoding the mobile maturase [int (gltB)], to which other introns are distantly related. The total number of base pairs of the introns is 30,433 bp, which corresponds to about 14 % of the entire P. purpureum plastid genome.

Identification of the three tRNA species containing the CAU anticodon

Usually, at most one tRNA species exists per anticodon, except for tRNA genes in the repeat region. The CAU anticodon, however, has three tRNA species: tRNAMet(CAU), tRNAfMet(CAU) and tRNAIle(CAU), which is also the case in the plastid genome of P. purpureum (Table S3).

tRNAfMet(CAU) functions as the initiator tRNA. In tRNAfMet(CAU), AU base pairs at positions 11 and 25, GC base pairs at positions 12 and 24 in the D-loop stem, and two or three GC base pairs in the anticodon stem are conserved in plastids and cyanobacteria (Alkatib et al. 2012). tRNA(CAU) in region 126,118–126,045 was identified as tRNAfMet(CAU).

tRNAIle(CAU) contains a lysidine, a modified cytidine, at the first anticodon by the action of TilS (tRNAIle-lysidine synthetase) (Soma et al. 2003), which is encoded by the ycf62 gene universally conserved in the plastid genomes of Rhodophyta. This modification in tRNAIle(CAU) switches the codon-recognition from AUG to AUA, which corresponds to the change in amino acid specificity from methionine to isoleucine. TilS recognizes C33 and A39 of tRNAIle(CAU) (Nakanishi et al. 2009). Curiously, the other two tRNA(CAU) genes also have identical nucleotides at these positions. The RNA sequence of a spinach chloroplast tRNAIle(CAU) was confirmed experimentally (Francis and Dudock 1982). The alignment analysis using spinach sequences as the reference indicated that tRNA(CAU) in region 114,922–114,837 is tRNAIle(CAU). The remaining tRNA(CAU) in region 19,649–19,577 is tRNAMet(CAU). We also determined and corrected the identity of some tRNA(CAU)s of other Rhodophyta and C. paradoxa plastid genomes that were misannotated (Fig. S8).

Phylogenetic analyses of Rhodophyta and Chromalveolata

For phylogenetic analyses of Rhodophyta and Chromalveolata, we constructed an amino acid sequence alignment of 76 concatenated proteins (20,088 sites) that are conserved in Rhodophyta, Chromalveolata and cyanobacteria (Fig. S9a). Qiu et al. (2012) suggested that serially duplicated genes might provide a misleading phylogenetic signal due to the concerted evolution and differential gene loss that accompanies genome reduction. In addition, they suggested that short sequence alignments lack a phylogenetic signal. Therefore, sequences of <75 amino acids in alignment length were removed. Accordingly, an alignment of 65 concatenated proteins was constructed (18,905 sites) (Fig. 2). Additionally, we constructed an amino acid sequence alignment of 34 concatenated proteins (10,570 sites) that are conserved in Rhodophyta, Chromalveolata and cyanobacteria (Fig. S9b). We also constructed an amino acid sequence alignment of the same concatenated proteins (10,524 sites), including Viridiplantae/Glaucophyta OTUs (Fig. S9c). We used maximum likelihood (ML), neighbor-joining (NJ) and Bayesian inference (BI) methods. The phylogenetic relationships of Rhodophyta, as shown in Fig. 2 and Fig. S9, were supported in all data sets by the three analyses. These relationships are consistent with the results of Yoon et al. (2006) (see also Verbruggen et al. 2010).

Fig. 2
figure 2

Phylogenetic tree of plastids and cyanobacteria based on concatenated sequences of 65 conserved proteins. The bar above the tree shows the scale for the branch length (substitutions per site). The numbers at each branch indicate the confidence level, bootstrap value and posterior probability, provided by ML (first value), NJ (second value) and BI (third value) analyses, respectively. Each “-” indicates that this branch is not supported by NJ analysis. The tree topology as shown here was identical in both ML and BI analyses. The thick lines represent the branches that were supported by the three analyses

The results of ML and BI analyses in the all datasets suggest that the plastids of Chromalveolata are sister groups of Rhodophytina, but the result of NJ analysis suggests that Cryptophyta is separated from the rest of Chromalveolata, and is a sister group of Rhodophyta as a whole. The differences in the results of the NJ analysis could result from the known problem that NJ anlyses tend to be affected by long-branch attraction, especially in multigene phylogenetic analyses. The ML tree of 76 concatenated proteins (Fig. S9a) did not support the monophyletic relationship of Chromalveolata, which was supported by other trees (Fig. 2, Fig. S9b, c). The sequences of the proteins of serially duplicated genes (psbA and psbD) were suggested to contribute to this difference. The tree of 34 concatenated proteins, including Viridiplantae/Glaucophyta OTUs (Fig. S9c), supported the monophyletic relationship of Cryptophyta and Haptophyta, which was not supported by the trees for the 65 concatenated proteins (Fig. 2) and 34 concatenated proteins without Viridiplantae/Glaucophyta OTUs (Fig. S9b).

Analysis of the gene cluster for ribosomal proteins in the plastid genome

We compared the gene cluster for ribosomal proteins in the plastid genome of P. purpureum with those in the genomes of Synechocystis sp. PCC 6803 and Anabaena sp. PCC 7120, as well as the plastid genomes of Glaucophyta, other Rhodophyta and Chromalveolata (Fig. S10). dnaK is located upstream of rpl3 and oriented in the reverse direction with respect to other genes in Rhodophyta and Chromalveolata, except Phaeophyceae. The Haptophyta (Emiliania huxleyi) plastid genome lacked some genes in the gene cluster for ribosomal proteins, namely rpl4, rpl29, rpl24, rpl18 and rpl13. The gene called ycf88, which was only detected in diatoms, is located between rps19 and rpl22.

Despite these differences in gene content of the gene cluster for ribosomal proteins, the order of genes is highly conserved in Rhodophyta and Chromalveolata, in which the rps12rps7 segment is located downstream of the rpl36rpl31 segment, a result of rhodophyte-specific translocation (Ohta et al. 1997). This is not the case in C. paradoxa and cyanobacteria.

In P. purpureum, the gene cluster for ribosomal proteins is divided into four parts, possibly by further translocation: the sites of disjunction are shown by arrowheads in Fig. S10 and are found between rps5 and secY, between secY and rpl36, and between rps7 and tufA. secY is isolated from the cluster and is located between orf57c and rpoC2. secY, rps10 and tufA are oriented in the reverse direction with respect to the other genes. The disjunction between secY and rpl36 was also found in the plastid genome of C. paradoxa but not in the genomes of cyanobacteria (Stirewalt et al. 1995). These data suggest that the plastid genome of P. purpureum underwent large-scale rearrangement after separation of this species from other red algae.

Discussion

We sequenced the complete plastid genome of P. purpureum and found some unique features in this plastid genome. First, the P. purpureum plastid genome has many introns. Second, it has a unique nonidentical structure of rRNA operons. Third, it has a highly rearranged gene order in the ribosomal protein gene cluster.

Introns have been reported in several plastid genes of Rhodophyta (Bernard et al. 1992; Janouškovec et al. 2013; Yoon et al. 2006), but in these cases, only one or two introns existed in the entire plastid genome. Some species of Rhodellophyceae, Porphyridiophyceae and Stylonematophyceae have an intron in the psaA gene (Yoon et al. 2006). In P. purpureum, the psaA gene has an intron, but the insertion position of the intron is different. In Florideophyceae, an intron encodes a reverse transcriptase in tRNA(CAU) (Janouškovec et al. 2013).

In Chromalveolata, which has secondary plastids originating from Rhodophyta, the rRNA operons exist as a pair of identical IRs, but this symmetric structure has not been found in any Rhodophyta analyzed so far. The plastid rRNA operons of P. purpureum are a pair of nonidentical IRs, each containing a different tRNA gene between the rrs and the rrl genes, respectively.

A previous report described the phylogenetic relationship of plastids in Rhodophyta and Chromalveolata (Wang et al. 2013). Unfortunately, only three classes of Rhodophyta were used in the analysis. Whether the plastids of Chromalveolata originated from the ancestor of Rhodophytina or the ancestor of (Florideophyceae + Bangiophyceae) remains to be determined.

We added a new species to the analysis. Our phylogenetic analyses included Porphyridiophyceae, which is a different clade from Florideophyceae and Bangiophyceae (Verbruggen et al. 2010). The results suggest that the plastids of Chromalveolata originated from Rhodophytina. Previous reports showed that the monophyly of the red algae-derived secondary/tertiary plastids (Harper and Keeling 2003; Yoon et al. 2002) and our phylogenetic analyses using concatenated proteins without the proteins of serially duplicated genes support this hypothesis. Our phylogenetic analyses suggest that the phylogenetic position of Haptophyta was affected by taxon sampling (presence or absence of Viridiplantae/Glaucophyta OTUs). Adding more Haptophyta taxa or new lineages with potentially intermediate positions will be helpful in resolving the phylogenetic relationships of Chromalveolata (Burki et al. 2012). Data from a larger number of rhodophytan taxa will be helpful in resolving a more detailed and accurate phylogenetic relationship between Rhodophyta and secondary/tertiary plastids.

The results obtained in the present study imply the following evolutionary scenario regarding the structure of the plastid genome. First, the ancestor of Rhodophyta had two inverted rRNA operon copies. One of the copies was lost in Cyanidiophyceae after its separation from Rhodophytina. Then secondary endosymbiosis events occurred one or several times in Rhodophytina; the secondary plastids conserved two copies of inverted rRNA operons. In Bangiophyceae, the rRNA operons were rearranged to become direct repeats and one of the copies was lost in Florideophyceae. In Porphyridiophyceae, large-scale genome rearrangement occurred and the two rRNA operons became nonidentical, perhaps due to asymmetric selective loss of tRNA genes after separation from other classes. The rRNA operon structure of the other three classes (Compsopogonophyceae, Stylonematophyceae and Rhodellophyceae) remains to be determined. Elucidation of the complete plastid genome in all classes of Rhodophyta will be helpful in understanding the evolution of Rhodophyta and Chromalveolata.