Abstract
The structure of a chromosomal DNA molecule may influence the way in which it is replicated and inherited. For decades plastid DNA (ptDNA) was believed to be circular, with breakage invoked to explain linear forms found upon extraction from the cell. Recent evidence indicates that ptDNA in vivo consists of linear molecules with discrete termini, although these ends were not characterized. We report the sequences of two terminal regions, End1 and End2, for maize (Zea mays L.) ptDNA. We describe structural features of these terminal regions and similarities found in other plant ptDNAs. The terminal sequences are within inverted repeat regions (leading to four genomic isomers) and adjacent to origins of replication. Conceptually, stem-loop structures may be formed following melting of the double-stranded DNA ends. Exonuclease digestion indicates that the ends in maize are unobstructed, but tobacco (Nicotiana tabacum L.) ends may have a 5′-protein. If the terminal structure of ptDNA molecules influences the retention of ptDNA, the unprotected molecular ends in mature leaves of maize may be more susceptible to degradation in vivo than the protected ends in tobacco. The terminal sequences and cumulative GC skew profiles are nearly identical for maize, wheat (Triticum aestivum L.) and rice (Oryza sativa L.), with less similarity among other plants. The linear structure is now confirmed for maize ptDNA and inferred for other plants and suggests a virus-like recombination-dependent replication mechanism for ptDNA. Plastid transformation vectors containing the terminal sequences may increase the chances of success in generating transplastomic cereals.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Chromosomal DNA in the eukaryotic nucleus is comprised of linear molecules with specific end sequences (telomeres) that protect against degradation by exonucleases and the potential loss of DNA at the 5′ end with each round of replication (Watson 1972). Whereas the telomerase mechanism is employed in the eukaryotic nucleus, alternative strategies are used to protect and maintain the ends of linear chromosomes for viruses, bacteria, and mitochondria, including covalently bound proteins, hairpin structures, and terminal inverted repeats (Chaconas and Kobryn 2010; Nosek et al. 2006; Smith and Keeling 2013). The conceptual “ends problem” does not apply to circular DNA forms, such as bacterial plasmids and mammalian mitochondrial DNA (mtDNA) (Pohjoismaki and Goffart 2011). Until recently, a circular form was also assumed for plastid DNA (ptDNA), and the linear forms that were observed were dismissed as artifacts from breakage of circular molecules (Bendich 2004). Our objective was to determine the sequences at the ends of ptDNA molecules for two reasons. First, this information may be used to elucidate the replication, inheritance, and maintenance (or lack thereof) of ptDNA throughout leaf and chloroplast development (Bendich 2004, 2007; Oldenburg et al. 2014). And second, the form of chromosomal DNA molecules within plastids may influence the design of plastid transformation vectors (Oldenburg and Bendich 2009) so as to improve plastid transformation in cereals.
Plastid genomes typically vary from 115–165 kb and in most, but not all, green plants there are four defined regions: long single copy (LSC), short single copy (SSC) and two inverted repeats (IRa and IRb) (Ravi et al. 2008). The convention for sequenced plastid genomes is to define the first nucleotide (nt = 1) as the beginning of the LSC or its equivalent in genomes without IRs. The ptDNA origins of replication (oris) have been mapped for several plant species and two closely spaced oris are often found. For maize ptDNA three origins had been identified: oriA and oriB by sequence similarity to Oenothera ptDNA; and the third, designated here as oriC, by similarity to Chlamydomonas reinhardtii (Heinhorst and Cannon 1993; Kunnimalaiyaan and Nielsen 1997; Oldenburg and Bendich 2004a). The size of the plastid genome in maize, Zea mays L., is 140,384 bp (Maier et al. 1995), although the extracted ptDNA is found in various structural forms, including branched-linear multigenomic molecules, unit-genome-sized linear isomers and head-to-tail concatemers, and less-than-genome-sized fragments (Oldenburg and Bendich 2004a). The relative abundance of these forms changes during development and depending on growth conditions: branched-linear and monomer ptDNA molecules are prominent at the base of the stalk in green maize seedlings, whereas the ptDNA from a green leaf blade is mostly or entirely degraded to subgenomic fragments (Oldenburg and Bendich 2004b; Oldenburg et al. 2006). In dark-grown maize the genomic monomer, dimer, and higher concatemers are found (Oldenburg et al. 2006), and these forms are abundant in green leaves for tobacco, Nicotiana tabacum L. (Lilly et al. 2001), barrel medic, Medicago truncatula Gaertn. (Shaver et al. 2006), and other dicots (Deng et al. 1989; Rowan et al. 2010).
We previously identified discrete termini (ends) for maize ptDNA by restriction enzyme digestion and blot hybridization (Oldenburg and Bendich 2004a). This method showed that there are two major genomic isomers with ends located at nucleotides (nts) 88,000 and 100,000, with a standard deviation of ±5000 bp, and three minor end isomers; the approximate location of only one of these (nt 78,000) was determined (Oldenburg and Bendich 2009). Using the same methods, ends have also been identified for ptDNA of tobacco (Scharff and Koop 2006, 2007), and barrel medic, a species without an IR (Shaver et al. 2008). This method, however, only provided the approximate terminal locations within the plastid genome and no information about the terminal structure.
We have now determined the precise location for two terminal regions of maize ptDNA by direct sequencing. We find that the ends are near oris and may form stem-loop structures. Cumulative GC skew analysis may be used to identify ends and oris in ptDNA from other species. The structure and mechanism of replication for maize ptDNA molecules are discussed with reference to the linear DNA molecules of herpes simplex virus (HSV).
Materials and methods
Seeds of Zea mays L. inbred line B73 and commercial hybrid Mycogen 2722 were soaked in tap water overnight, sown in soil (Sunshine mix #4) and grown in controlled temperature chambers at 24 °C with 16/8 h light/dark cycles under fluorescent lights or in continuous darkness. Juvenile maize seedlings (8–12 days old) were harvested and washed for ~3 min in 0.5 % sarkosyl (w/v), then rinsed exhaustively in water to minimize microbial contamination. The entire seedling or the first and second leaf blades (L1 and L2) were used for plastid and ptDNA preparations (Oldenburg and Bendich 2004a; Shaver et al. 2006). For plastid isolation, we used high-salt buffer [HSB; 1.25 M NaCl, 40 mM HEPES pH 7.6; 2 mM EDTA, 0.1 % BSA (w/v)] and dilution buffer [DB; 0.33 M D-sorbitol, 20 mM HEPES pH 7.6, 2 mM EDTA, 1 mM MgCl2, 0.1 % BSA (w/v)]. In the initial grinding step, 0.1 % ß-mercaptoethanol (v/v) was added to HSB, and the tissue was homogenized in HSB (plus 0.5–1 mL of antifoam) using a Waring blender and 3 × 5-s bursts. The homogenate was filtered through 1–3 layers of Miracloth and plastids were pelleted by centrifugation at 10,000×g for 5 min. The plastids were then washed once each with HSB and DB. The plastids were layered onto a cushion of 80 % Percoll in DB and centrifuged at 14,000×g for 10 min. Any residual nuclei and starch remaining after differential centrifugation pellets to the bottom of the microfuge tube. The intact plastids, which remain at the top of the Percoll layer, were recovered using a wide-bore pipet and washed 2 or 3 times with DB. The final plastid pellet was resuspended in a small volume of DB.
Plastids were embedded in 0.7 % low-melt agarose (w/v) and lysed overnight at 48 °C in 40 mM EDTA pH 8, 1 % sarkosyl (w/v), 200 µg/mL proteinase K. Phenylmethyl-sulfonyl fluoride (PMSF) was used to inactivate the proteinase K. In-gel ptDNA samples were washed extensively in TE (10 mM Tris pH 8, 1 mM EDTA) and stored at 4 °C in TE. In-gel tobacco ptDNA was prepared by the same procedure, using the leaves of greenhouse-grown plants. Alternative lysis conditions were used for in-gel ptDNAs intended for exonuclease digestion (given in Table 3), and the digestions were performed as described for liverwort mtDNA (Oldenburg and Bendich 2001). Note that the in-gel ptDNAs used for end-cloning (Fig. 1) were prepared using 40 mM EDTA, rather than the more commonly used 470 mM EDTA. We found that lysis conditions using either high EDTA or low EDTA plus 1 M NaCl lead to increased DNA migration from the well (see Online Resource 3). However, these high-salt concentrations can also induce the formation of G-quartet structures at the ends of double-stranded DNA (Bochman et al. 2012; Sharma 2011) that would probably interfere with the end-cloning process. In fact, in silico analysis using QGRS Mapper showed that the End1 (but not End2) sequence could form G-quartet structures (Kikin et al. 2006). It is for this reason that we avoided high ionic strength in the buffer used to lyse the plastids in-gel.
The ptDNA ends were ligated to a linearized plasmid, either prior to or after fractionation of in-gel preparations of intact maize ptDNA molecules by pulsed-field gel electrophoresis (PFGE). The in-gel ptDNA was treated with end repair mixture (T4 DNA polymerase Klenow fragment and dNTPs only or with the addition of T4 polynucleotide kinase and ATP) and then ligated to a blunt-end linearized plasmid (pBluescriptKSII+ digested with EcoRV). The in-gel plasmid-ligated ptDNA was fractionated by PFGE, and the well-bound fraction (multigenomic, branched-linear form) and linear monomer fraction were excised from the gel. The next step was to digest the plasmid-ligated-ptDNA with a restriction enzyme (BamHI or SacI) to give ends compatible within the plasmid polylinker and the ptDNA, followed by ligation of the compatible ends. The plasmid-ptDNA ligation mixture was used for Escherichia coli transformation followed by blue/white colony screening, selection of white colonies and isolation of plasmids. Plasmid preparations were assessed for the ptDNA End insert by restriction enzyme digestion and agarose gel electrophoresis. Sequencing of the ptDNA End insert from both directions was performed using universal primers [M13(-20) forward and M13 reverse] (GeneWiz, Seattle, Washington USA). This end-cloning process was performed using ptDNA from several different maize ptDNA preparations and on both branched-linear (well-bound) and linear monomer molecules. The terminal sequences were identified by alignment to the maize plastid genome sequence [GenBank: NC_001666] using MacVector.
The similarity to the maize End sequences was assessed using Align-to-Reference and ClustalW features of MacVector for the plastid genomes given in Fig. 2 and in Figs. S1–S4 and Tables S1–S4 of Online Resource 1. For Align-to-reference the settings were: residue scoring match = 2, mismatch = −3, ambiguous match = 0, gap penalty = 4 and alignment parameters hash value = 4, sensitivity = 4, score threshold = 50, X dropoff = 15. For ClustalW the settings were: gap penalty = 5, window = 4, open gap penalty = 10, extend gap penalty = 5, delay divergent = 40 %, and weighed transitions. The Align-to-Reference tool, used to compare the cloned-end sequencing reads to the maize plastid genome reference sequence, provides directional information about the ptDNA End insert indicating that all five End1 inserts were from IRb, whereas one of the End2 inserts was from IRb and the other two were from IRa.
The cumulative GC skew (Grigoriev 1998) profiles were generated with window size 500 and step size 50, using http://gcat.davidson.edu/dgpb/gc_skew/. Both the GC for each 500-nt window [(G−C)/(G+C)], and the cumulative GC [sum of (G−C)/(G+C)] are shown in each skew diagram. The GC skew profiles are shown in Fig. 5 for maize and in Online Resource 2 for other species. The secondary stem-loop structures were produced using Mfold for DNA (Zuker 2003), http://www.bioinfo.rpi.edu/applications/Mfold. The predicted secondary structures and the folding energies (dG values) are shown in Online Resource 4.
GenBank accession nos. for plastid genomes: Arabidopsis thaliana L. NC_000932; Chlamydomonas reinhardtii P.A.Dang. BK0000554; Euglena gracilis NC_001603; Marchantia polymorpha L. (liverwort); X04465; Medicago truncatula Gaertn. (barrel medic) NC_003119; Mesostigma viride Lauterborn. NC_002186; Nicotiana tabacum L. (tobacco) NC_001879; Oryza nivana S.D.Sharma and Shastry. (wild rice) AP0062728; Oryza sativa L. indica (Indica rice) AY522329; Oryza sativa L. japonica (Japonica rice) X15901; Pisum sativum L. (pea) NC_014057; Spinacia oleracea L. (spinach) NC_002202; Triticum aestivum L. (wheat) NC_002762; Zea mays L. (maize) NC_001666. GenBank accession no. for viral genome of Herpes simplex Z896099.
Results and discussion
Total ptDNA of maize is found in three major molecular forms (multigenomic branched complexes, linear unit-genome-sized monomers, and linear fragments of subgenomic size) that can be separated by pulsed-field gel electrophoresis (PFGE) (Fig. 1). We previously reported that both the complex forms (well-bound fraction) and monomers have the same defined ends as determined by restriction digestion and PFGE (Oldenburg and Bendich 2004a), whereas the subgenomic fragments end at any point along the 140-kb genome sequence. Furthermore, simple, linear molecules comprise >90 % of all ptDNA molecules from light-grown maize seedling, most of which are subgenomic fragments (Lanes 3 and 5 in Fig. 1 and references Oldenburg and Bendich 2004a, b). Therefore, in a preparation of total ptDNA, the number of moles of random ends greatly outnumbers the moles of defined ends found in the complex and monomer molecules. To ensure recovery and identification of the terminal regions, we used an end-cloning procedure.
In-gel prepared total maize ptDNA was blunt-end ligated to a linearized plasmid followed by PFGE fractionation and excision of the multigenomic complex (well-bound ptDNA) and genomic monomer fractions (Fig. 1). Following this end-cloning process (Fig. 2), nine plasmids with ptDNA inserts were recovered and sequenced (Table 1). Two major terminal regions, End1 and End2, were identified by alignment to the maize plastid genome. An additional minor terminal region was also indicated, but needs further characterization.
The ptDNA end-cloning process outlined in Fig. 2 may not be very efficient as indicated by the low number (9) of plasmids with end sequences compared to the total (85) analyzed by restriction digestion. Two factors may contribute to this inefficiency. (1) It is necessary to perform all of the end-cloning steps in-gel. The ptDNA end ligation (Step 3 in Fig. 2) requires diffusion of the EcoRV-linearized pBluescript plasmid into the gel to encounter a ptDNA end. Although excess non-end-ligated plasmid would be removed in the next step (PFGE fractionation), some plasmid could be re-ligated around the gel matrix and become trapped in the well-bound fraction. (2) The end-ligation step was performed using total in-gel ptDNA, leading to competition between plasmid ligation to the defined ends of the monomer and branched complex forms and ligation to the more numerous random ends of the degraded, subgenomic ptDNA fragments. Nonetheless, the ptDNA end-cloning process yielded eight plasmids containing the End1 and End2 inserts and one plasmid with an End3 insert (Table 1).
The locations of both End1 and End2 are within the IRs, leading to four genomic isomers (Fig. 3). The position of End1 was determined as nt 94,973 in IRb and 127,764 in IRa, and is located in the intergenic region between the trnV and 16S rRNA genes on the maize map, and in fact aligns with the 16S rRNA promoter. In addition, End1 is located 5′-upstream of the origin of replication oriB2 in IRb. The position of End2 was determined as nt 93,859 in IRb and 128,877 in IRa; it is located in the intergenic region between the rps12 and orf85 genes and is 3′-downstream of oriA2 in IRb. Previously, restriction fragment and blot hybridization results suggested that maize ptDNA consists of a population of molecular isomers with five different terminal regions, and mapping predicted two major isomers with ends at approximately 88,000 and 100,000 (map d and map c, respectively, Fig. 1 in reference Oldenburg and Bendich 2004a). In addition, the amount of each isomer appeared to vary in relative abundance based on the ethidium-DNA fluorescence intensity of the different restriction fragments. To determine whether the ends sequenced here correspond to map c or map d, an in silico analysis of the predicted fragments and hybridization patterns for End1 and End2 was performed (Table 2). For both End1 and End2, the smaller AscI fragment would be expected to hybridize to the rbcL and 3′rps12/rps7 probes, therefore correlating with map c (for map d, the smaller fragment hybridizes only to rbcL). This finding suggests that the End1 and End2 AscI fragments would co-migrate to the same position following PFGE fractionation and likely accounts for the higher ethidium-DNA fluorescence intensity found with the map c fragments. Although the end-cloning and sequencing process resulted in identification of the End1 and End2 isoforms, it is likely there are other end isomers (such as corresponding to map d) that were not found. Surprisingly, we were unable to clone the ends of tobacco ptDNA, despite starting with a much larger amount of the tobacco genomic monomer ptDNA (Lane 1 in Fig. 1). As described below, the different terminal structures for maize and tobacco ptDNAs probably account for our inability to ligate the linearized plasmid to the termini of tobacco ptDNA.
Sequence comparisons for the End1 and End2 regions were performed to assess similarity to other plastid genomes: four cereals, five dicots, one bryophyte, and two algal species. For each end, two sequences of 120 nts were used for comparison and these correspond to the regions located 3′-upstream and 5′-downstream of nt 94,973 for End1 and nt 93,859 for End2. These sequences are shown in Fig. 4, along with rice and wheat ptDNA alignments. Sequence alignments and similarities for all the species comparisons are shown in Online Resource 1. For the cereals, a high degree of similarity (≥97 % identity) was found with three of the maize plastid end sequences. Less similarity (51–57 %) was found with the fourth sequence (End2-5′IRb), although the terminal ~90 nts were highly similar. Three of the dicots (Arabidopsis, spinach, and tobacco) showed high sequence identity to the End1 regions (80–93 %) and End2-3′IRb (83–94 %), but again low similarity was found with End2-5′IRb (27–46 %). Slightly lower similarity was found in the two legumes (pea and barrel medic) for End1 (61–77 %) and End2-3′IRb (68–70 %). The lower similarity to End1-5′IRb is not surprising since a divergence in the 16S rRNA promoter sequence has been described for some legumes relative to other higher plants (Suzuki et al. 2003). For End2-5′IRb, 46–47 % similarity was found with pea and barrel medic. For liverwort, similarity was greatest with End1-3′IRb (74 %), with lower similarity (40–53 %) for the other three regions. For the algae, similarity was only 38–56 % for all four sequences. These results suggest that the termini of linear molecules are highly conserved among cereals. Although similarity to the maize End sequences was found for the five dicots, it is unclear from sequence comparisons alone whether these truly represent terminal regions.
DNA strand asymmetry exists when the leading strand of DNA replication contains more G than C as compared to the lagging strand. Such asymmetry has been analyzed using GC skew algorithms to provide information about DNA replication mechanisms for bacterial and organellar genomes (Gerhold et al. 2014; Grigoriev 1998; Xia 2012). We therefore generated GC skew plots for the plastid genomes of maize (Fig. 5a, b shows IRb) and other plant species and HSV DNA, previously considered as a model for ptDNA replication (Oldenburg and Bendich 2004a) (see Online Resource 2).
Cumulative GC skew plots were originally used to predict ori and termination (ter) sites for circular bacterial chromosomes, showing the ori at a single minimum, the ter at a single maximum, and assuming bidirectional replication (Grigoriev 1998; Xia 2012). Subsequently, more complex GC plots (with multiple peaks and valleys) were found for archaebacterial chromosomes where multiple origins were identified (Xia 2012). In addition, for the linear mitochondrial genome of the yeast Candida parapsilosis, the GC skew profile showed local minima near both ends, suggesting the ends function as oris for a recombination-dependent replication (RDR) mechanism (Gerhold et al. 2014). The GC skew plots for the plastid genomes of the protist Euglena gracilis (Morton 1999) (see Online Resource 2) and two green algal species (Belanger et al. 2006; Brouard et al. 2011) conform to the patterns found for E. coli, Bacillus subtilis and other circular-mapping bacterial genomes. Thus, a similar profile would be expected if maize ptDNA existed as a circular chromosome and employed a D-loop-to-bidirectional replication mechanism (theta or Cairns model) as initially proposed (Kolodner and Tewari 1975). The maize ptDNA GC plot, however, is complex with multiple minima and maxima, and the oris are at slight inflection points (bumps) with the two Ends at GC minima between oriA and oriB. The GC skew profiles for ptDNAs of rice and wheat are very similar to that of maize, with maize-like Ends at local minima between the two origins (see Online Resource 2). The same pattern is also found with the GC skew profiles for tobacco and barrel medic (see Online Resource 2). These findings refute the Cairns model in favor of the RDR mechanism proposed for linear ptDNA molecules (Bendich 2004; Marechal and Brisson 2010; Oldenburg and Bendich 2009). Furthermore, the GC plot for HSV DNA is also complex (see Online Resource 2), and this viral genome has other features similar to maize ptDNA: a linear molecule ~150-kb in size with single-copy regions flanked by inverted repeats, three oris, and the ends are found in the IR regions (Oldenburg and Bendich 2004a; Weller and Sawitzke 2014). A specific type of RDR mechanism, single-strand annealing (SSA), was proposed for HSV DNA replication (Weller and Sawitzke 2014). SSA may initiate replication, produce concatemers, and generate branched-linear multigenomic forms—all features shared by HSV and ptDNA. We propose that ptDNA replication involves activation of an internal ori within a linear genomic monomer and strand-invasion of an end to generate a replication fork (Bendich 2004; Oldenburg and Bendich 2009).
The terminal regions were assessed for sequence or structural properties that could be important for ptDNA function and integrity. The susceptibility of linear DNAs to exonuclease digestion can be used to infer whether the ends are “open” or “blocked”, such as with a 5′-protein or hairpin loop. Resistance to exonuclease digestion may be affected by the methods used for preparing in-gel DNA, but this property can also yield information about the terminal structure (Oldenburg and Bendich 2001). Maize ptDNA was prepared in-gel using four different lysis conditions: low or high EDTA and with or without proteinase K before treatment with λ exonuclease or Exonuclease III and fractionation by PFGE (see Online Resource 3). Maize ptDNA was digested by both exonucleases under all four lysis conditions (Table 3), indicating that the ends are open. In contrast, for ptDNA from tobacco, exonuclease sensitivity was dependent on the lysis conditions, suggesting a protein at the 5′-end and an open 3′-end, as previously proposed for liverwort mitochondrial DNA (Oldenburg and Bendich 2001). The amount of ptDNA first increases and then decreases during leaf development, although the magnitude of the decline is greater for maize than tobacco (Oldenburg et al. 2014; Shaver et al. 2006). The greater stability and integrity of tobacco ptDNA in mature leaf tissue may, in part, be due to protection at the ends of the linear molecules, whereas the unprotected maize ptDNA would be more susceptible to degradation in vivo. This blockage at the ends of tobacco ptDNA would also account for our inability to clone and sequence these ends using the same methods as used here for maize ptDNA.
Secondary DNA structures, including stem-loop and cruciform, have been associated with DNA replication by recruitment of replication proteins and activation of oris (Boulikas 1996; Pearson et al. 1996), with HSV DNA as a relevant example (Muylaert et al. 2011; Weller and Sawitzke 2014). To determine whether the ends of maize ptDNA exhibit such structural motifs, we assessed the potential to form secondary structures within a single strand of DNA at the terminal regions using Mfold for DNA (Zuker 2003) (see Online Resource 4). For End1-5′IRb, four stem-loops were predicted, with the ‘−35’ and ‘−10’ elements of the 16S rRNA promoter (Suzuki et al. 2003) located at the base of two of the stem-loops. The complex structure of End1-3′IRb corresponds to that expected for folding of the trnV gene. Three small stem-loops were predicted for End2-5′IRb, and for End2-3′IRb one large and three small stem-loops were predicted. Similar patterns of stem-loops were also predicted for the analogous end sequences in ptDNA of rice and wheat (see Online Resource 4). Although the types of predicted stem-loop structures differed between the maize ptDNA 120-nt sequences of End1 and End2, such terminal structures may function in ori activation and/or RDR. The propensity for stem-loop formation by short IR sequences, close proximity to rRNA genes, and AT-richness are all features associated with DNA replication origins in viruses, bacteria, and the eukaryotic nucleus (Aslani et al. 2000; Boulikas 1996; Pearson et al. 1996), and these features are also found in plastid genomes (Heinhorst and Cannon 1993). Interestingly, maize End1-5′IRb is adjacent to the 16S rRNA gene, and End2-5′IRb is AT-rich (A+T = 82 %, whereas it is ~50 % for the other three 120-nt end sequences). For HSV, the UL9 origin-binding protein (OBP) association with the stem-loop promotes opening of its AT-rich segment to allow access for other DNA replication proteins (Aslani et al. 2000; Muylaert et al. 2011; Weller and Coen 2012). One of these, ICP8, is a single-strand binding protein (SSBP) that has been implicated in the SSA replication process that generates branched, multigenomic molecules (Weller and Sawitzke 2014). Proteins involved in ptDNA (and plant mtDNA) replication and recombination include TWINKLE helicase, SSBPs WHY and RPA, and an OBP (Deng et al. 2015; Lassen et al. 2011; Majeran et al. 2012; Marechal and Brisson 2010; Moriyama and Sato 2014; Sakaguchi et al. 2009). A replication mechanism analogous to that of HSV may operate for maize ptDNA, whereby an OBP interacts with an internal or end-located ori to facilitate DNA unwinding and binding of SSBP to promote SSA replication (Oldenburg and Bendich 2015).
Conclusions
For years the plastid chromosome was modeled as a unit-genome-sized circle and proposed to replicate by D-loop-to-theta and rolling-circle mechanisms. Similarly, a circular replication model was initially suggested for the linear genome of HSV, although its branched-linear replicative forms are more likely the product of the recently proposed single-strand-annealing RDR mechanism (Weller and Sawitzke 2014). For maize ptDNA, a linear structure is now established by sequencing and characterization of two terminal regions, and an end-mediated recombination-dependent mode of ptDNA replication is proposed.
Nevertheless, our understanding of ptDNA replication and maintenance is still minimal (Marechal and Brisson 2010; Oldenburg and Bendich 2015) and progress toward both objectives could be facilitated by focusing on the ends of ptDNA molecules, end-binding proteins (Majeran et al. 2012), and the roles of recombination and strand-invasion in ptDNA replication. The introduction of DNA into the plastid genome has led to agronomic improvement for some crop plants, although plastid transformation for cereals has not yet been successful (Clarke et al. 2011; Hanson et al. 2013; Maliga and Bock 2011). Designing transformation vectors with maize ptDNA terminal regions may increase the chances for success (Bendich and Oldenburg 2013; Oldenburg and Bendich 2009).
Abbreviations
- DB:
-
Dilution buffer
- HSB:
-
High salt buffer
- HSV:
-
Herpes simplex virus
- IR:
-
Inverted repeat
- LSC:
-
Long single copy
- mtDNA:
-
Mitochondrial DNA
- OBP:
-
Origin binding protein
- ori :
-
Origin of replication
- PFGE:
-
Pulsed-field gel electrophoresis
- PK:
-
Proteinase K
- ptDNA:
-
Plastid DNA
- RDR:
-
Recombination-dependent replication
- SSBP:
-
Single-strand binding protein
- SSC:
-
Short single copy
- SSA:
-
Single-strand annealing
References
Aslani A, Simonsson S, Elias P (2000) A novel conformation of the herpes simplex virus origin of DNA replication recognized by the origin binding protein. J Biol Chem 275:5880–5887
Belanger AS, Brouard JS, Charlebois P, Otis C, Lemieux C, Turmel M (2006) Distinctive architecture of the chloroplast genome in the chlorophycean green alga Stigeoclonium helveticum. Mol Genet Genomics 276:464–477
Bendich AJ (1996) Structural analysis of mitochondrial DNA molecules from fungi and plants using moving pictures and pulsed-field gel electrophoresis. J Mol Biol 255:564–588
Bendich AJ (2004) Circular chloroplast chromosomes: the grand illusion. Plant Cell 16:1661–1666
Bendich AJ (2007) The size and form of chromosomes are constant in the nucleus, but highly variable in bacteria, mitochondria and chloroplasts. Bioessays 29:474–483
Bendich AJ, Oldenburg DJ (2013) Plastid transformation using linear DNA vectors. I. B. World Intellectual Property Organization, USA, pp 1–59
Bendich AJ, Smith SB (1990) Moving pictures and pulsed-field gel electrophoresis show linear DNA molecules from chloroplasts and mitochondria. Curr Genet 17:421–425
Bochman ML, Paeschke K, Zakian VA (2012) DNA secondary structures: stability and function of G-quadruplex structures. Nat Rev Genet 13:770–780
Boulikas T (1996) Common structural features of replication origins in all life forms. J Cell Biochem 60:297–316
Brouard JS, Otis C, Lemieux C, Turmel M (2011) The chloroplast genome of the green alga Schizomeris leibleinii (Chlorophyceae) provides evidence for bidirectional DNA replication from a single origin in the chaetophorales. Genome Biol Evol 3:505–515
Chaconas G, Kobryn K (2010) Structure, function, and evolution of linear replicons in Borrelia. Annu Rev Microbiol 64:185–202
Clarke JL, Daniell H, Nugent JM (2011) Chloroplast biotechnology, genomics and evolution: current status, challenges and future directions. Plant Mol Biol 76:207–209
Deng X-W, Wing RA, Gruissem W (1989) The chloroplast genome exists in multimeric forms. Proc Natl Acad Sci U S A 86:4156–4160
Deng SK, Chen H, Symington LS (2015) Replication protein A prevents promiscuous annealing between short sequence homologies: implications for genome integrity. Bioessays 37:305–313
Gerhold JM, Sedman T, Visacka K, Slezakova J, Tomaska L, Nosek J, Sedman J (2014) Replication intermediates of the linear mitochondrial DNA of Candida parapsilosis suggest a common recombination based mechanism for yeast mitochondria. J Biol Chem 289:22659–22670
Grigoriev A (1998) Analyzing genomes with cumulative skew diagrams. Nucleic Acids Res 26:2286–2290
Hanson MR, Gray BN, Ahner BA (2013) Chloroplast transformation for engineering of photosynthesis. J Exp Bot 64:731–742
Heinhorst S, Cannon GC (1993) DNA replication in chloroplasts. J Cell Sci 104:1–9
Kikin O, D’Antonio L, Bagga PS (2006) QGRS Mapper: a web-based server for predicting G-quadruplexes in nucleotide sequences. Nucl Acids Res 34(suppl 2):W676–W682
Kolodner RD, Tewari KK (1975) Chloroplast DNA from higher plants replicates by both the Cairns and the rolling circle mechanism. Nature 256:708–711
Kunnimalaiyaan M, Nielsen BL (1997) Chloroplast DNA replication: mechanism, enzymes and replication origins. J Plant Biochem Biotechnol 6:1–7
Lassen MG, Kochhar S, Nielsen BL (2011) Identification of a soybean chloroplast DNA replication origin-binding protein. Plant Mol Biol 76:463–471
Lilly JW, Havey MJ, Jackson SA, Jiang J (2001) Cytogenomic analyses reveal the structural plasticity of the chloroplast genome in higher plants. Plant Cell 13:245–254
Maier RM, Neckerman K, Igloi GL, Kössel H (1995) Complete sequence of the maize chloroplast genome: gene content, hotspots of divergence and fine tuning of genetic information by transcript editing. J Mol Biol 251:614–628
Majeran W, Friso G, Asakura Y, Qu X, Huang M, Ponnala L, Watkins KP, Barkan A, van Wijk KJ (2012) Nucleoid-enriched proteomes in developing plastids and chloroplasts from maize leaves: a new conceptual framework for nucleoid functions. Plant Physiol 158:156–189
Maliga P, Bock R (2011) Plastid biotechnology: food, fuel, and medicine for the 21st century. Plant Physiol 155:1501–1510
Marechal A, Brisson N (2010) Recombination and the maintenance of plant organelle genome stability. New Phytol 186:299–317
Moriyama T, Sato N (2014) Enzymes involved in organellar DNA replication in photosynthetic eukaryotes. Front Plant Sci 5:480
Morton BR (1999) Strand asymmetry and codon usage bias in the chloroplast genome of Euglena gracilis. Proc Natl Acad Sci U S A 96:5123–5128
Muylaert I, Tang KW, Elias P (2011) Replication and recombination of herpes simplex virus DNA. J Biol Chem 286:15619–15624
Nosek J, Kosa P, Tomaska L (2006) On the origin of telomeres: a glimpse at the pre-telomerase world. Bioessays 28:182–190
Oldenburg DJ, Bendich AJ (2001) Mitochondrial DNA from the liverwort Marchantia polymorpha: circularly permuted linear molecules, head-to-tail concatemers, and a 5′ protein. J Mol Biol 310:549–562
Oldenburg DJ, Bendich AJ (2004a) Most chloroplast DNA of maize seedlings in linear molecules with defined ends and branched forms. J Mol Biol 335:953–970
Oldenburg DJ, Bendich AJ (2004b) Changes in the structure of DNA molecules and the amount of DNA per plastid during chloroplast development in maize. J Mol Biol 344:1311–1330
Oldenburg DJ, Bendich AJ (2009) Chloroplasts. In: Kriz AL, Larkins BA (eds) Molecular Genetic Approaches to Maize Improvement Biotechnology in agriculture and forestry, vol 63. Springer, Berlin, Heidelberg, pp 325–343
Oldenburg DJ, Bendich AJ (2015) DNA maintenance in plastids and mitochondria of plants. Front Plant Sci 6:883. doi:10.3389/fpls.2015.00883
Oldenburg DJ, Rowan BA, Zhao L, Walcher CL, Schleh M, Bendich AJ (2006) Loss or retention of chloroplast DNA in maize seedlings is affected by both light and genotype. Planta 225:41–55
Oldenburg DJ, Rowan BA, Kumar RA, Bendich AJ (2014) On the fate of plastid DNA molecules during leaf development: response to the Golczyk et al. Commentary. Plant Cell 26:855–861
Pearson CE, Zorbas H, Price GB, Zannis-Hadjopoulos M (1996) Inverted repeats, stem-loops, and cruciforms: significance for initiation of DNA replication. J Cell Biochem 63:1–22
Pohjoismaki JL, Goffart S (2011) Of circles, forks and humanity: topological organisation and replication of mammalian mitochondrial DNA. Bioessays 33:290–299
Ravi V, Khuranan JP, Tyagi AK, Khurnana P (2008) An update on chloroplast genomes. Plant Syst Evol 271:101–122
Rowan BA, Oldenburg DJ, Bendich AJ (2004) The demise of chloroplast DNA in Arabidopsis. Curr Genet 46:176–181
Rowan BA, Oldenburg DJ, Bendich AJ (2010) RecA maintains the integrity of chloroplast DNA molecules in Arabidopsis. J Exp Bot 61:2575–2588
Sakaguchi K, Ishibashi T, Uchiyama Y, Iwabata K (2009) The multi-replication protein A (RPA) system—a new perspective. FEBS J 276:943–963
Scharff LB, Koop HU (2006) Linear molecules of tobacco ptDNA end at known replication origins and additional loci. Plant Mol Biol 62:611–621
Scharff LB, Koop HU (2007) Targeted inactivation of the tobacco plastome origins of replication A and B. Plant J 50:782–794
Sharma S (2011) Non-B DNA secondary structures and their resolution by RecQ helicases. J Nucleic Acids 2011:724215
Shaver JM, Oldenburg DJ, Bendich AJ (2006) Changes in chloroplast DNA during development in tobacco, Medicago truncatula, pea, and maize. Planta 224:72–82
Shaver JM, Oldenburg DJ, Bendich AJ (2008) The structure of chloroplast DNA molecules and the effects of light on the amount of chloroplast DNA during development in Medicago truncatula. Plant Physiol 146:1064–1074
Smith DR, Keeling PJ (2013) Gene conversion shapes linear mitochondrial genome architecture. Genome Biol Evol 5:905–912
Suzuki JY, Sriraman P, Svab Z, Maliga P (2003) Unique architecture of the plastid ribosomal RNA operon promoter recognized by the multisubunit RNA polymerase in tobacco and other higher plants. Plant Cell 15:195–205
Watson JD (1972) Origin of concatemeric T7 DNA. Nat New Biol 239:197–201
Weller SK, Coen DM (2012) Herpes simplex viruses: mechanisms of DNA replication. Cold Spring Harb Perspect Biol 4:a013011
Weller SK, Sawitzke JA (2014) Recombination promoted by DNA viruses: phage lambda to herpes simplex virus. Annu Rev Microbiol 68:237–258
Xia X (2012) DNA replication and strand asymmetry in prokaryotic and mitochondrial genomes. Curr Genomics 13:16–27
Zuker M (2003) Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res 31:3406–3415
Acknowledgments
This research was funded by the Junat Fund (a private charitable fund).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors DO and AB are co-inventers for the patent application: “Plastid Transformation Using Linear DNA Vectors”; application nos. PCT/US2013/030775, WO2014/065857A1. The linear vectors described in this patent are based on the terminal ptDNA sequences reported in this manuscript.
Additional information
Communicated by M. Kupiec.
Electronic supplementary material
Below is the link to the electronic supplementary material.
294_2015_548_MOESM1_ESM.pdf
Supplementary material 1 (PDF 1370 kb) Online Resource 1: Comparisons of maize ptDNA end sequences to sequences from other ptDNAs; sequence alignments and amount of similarity are shown
294_2015_548_MOESM2_ESM.pdf
Supplementary material 2 (PDF 365 kb) Online Resource 2: GC skew plots of the plastid genomes from several land plants and the protist Euglena gracilis and viral genome of herpes simplex
294_2015_548_MOESM4_ESM.pdf
Supplementary material 4 (PDF 383 kb) Online Resource 4: Secondary structure prediction of the terminal sequences from maize, rice, and wheat
Rights and permissions
About this article
Cite this article
Oldenburg, D.J., Bendich, A.J. The linear plastid chromosomes of maize: terminal sequences, structures, and implications for DNA replication. Curr Genet 62, 431–442 (2016). https://doi.org/10.1007/s00294-015-0548-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00294-015-0548-0