Introduction

The replication and transcription processes in prokaryotes are dependent on DNA gyrase enzyme, a type II topoisomerase (type II indicates the ability of DNA gyrase to transiently break both strands of DNA) (Wang 1996; Champoux 2001). The enzyme, an A2B2 tetramer encoded by the gyrA and gyrB genes, catalyzes negative supercoils in DNA during replication and transcription, which is a critical process for survival of the cells (Cozzarelli 1980; Gellert 1981). Eukaryotic nuclear genomes do not require gyrase activity as the negative supercoiling in DNA is achieved by wrapping up of DNA molecules around the histone proteins, and the relaxation of DNA molecules is carried out by gyrase-like type I and type II topoisomerases. On the other hand, organellar genomes, i.e., of chloroplast and mitochondria do not possess histones and essentially require the presence of DNA gyrase enzyme to supercoil and relax their DNA. Previous studies and sequencing data available in databases confirm that DNA gyrase genes are a part of nuclear genome of eukaryotes and the proteins encoded by them are targeted to respective organelles for carrying out negative supercoiling of chloroplastic and mitochondrial DNA. Following endosymbiotic organelle acquisitions, many genes were transferred from the organelles to nucleus which contributed to organelles becoming dependent on host eukaryotic cells (Martin and Harrmann 1998). Phylogenetic analysis of AtGyrA (gyrase A gene of Arabidopsis thaliana) suggested that the gyrase A genes were assimilated through the acquisition of endosymbiotic bacteria by the plant (Wall et al. 2004).

Most of the eukaryotic genes present in nuclear genomes, including gyrase, possess introns. These introns are invariably spliceosomal introns, which are the segments of noncoding sequences excised from pre-mRNA by spliceosomal complex (Gilbert 1978; Grabowski et al. 1985; Moore and Sharp 1993; Nilsen 2003; Rogozin et al. 2012). There are two alternate hypotheses explaining the origin of spliceosomal introns, viz. “introns early model” and “introns late model.” The “introns early” model postulates that introns had always been an integral part of the genes, i.e., genes originated as interrupted structures and those presently without introns, must have lost them in the course of evolution (Gilbert and Glynias 1993). On the other hand, the “introns late” model proposes that the ancestral protein-coding genes were uninterrupted and that the introns were subsequently inserted into them (Cech 1986). The rationality of both the hypotheses is supported with various arguments. The “introns early” model has been found advantageous to explain the evolution of novel ancient genes through the process of recombination of different polypeptide units. The early cell is believed to have contained certain different protein-coding genes, and the evolution of many more and novel ones became possible through reshuffling of different polypeptide units to construct the newer ones, and it has been proposed that this reshuffling via recombination of protein-coding gene fragments was made possible by the assistance of introns (Blake 1985; Holland and Blake 1987). In support of this hypothesis, there are evidences from different genes, which were found to contain related exons, suggesting a pattern of gene assembling by the process of exon shuffling (Patthy 1999; Kolkman and Stemmer 2001). It might have also been possible that exon(s) from one gene translocated in the intronic region of another gene, thereby resulting in evolution of a newer protein. Since in many organisms, the introns are often much larger in size compared to the exons, it is likely to suggest the insertion of an exon or a sequence including an exon within an intron and be flanked by functional 5′ and 3′ splice junctions (Krebs et al. 2013). Introns have been discovered in several chloroplast genes (Plant and Gray 1988; Sheveleva and Hallick 2004), including some that are homologous to bacterial genes, suggesting that endosymbiotic event occurred before introns were lost from the prokaryotic lineage (Krebs et al. 2013). Although these examples emphasize that introns were part of genes since the onset of evolution, there are certain examples which contradict this view and rather support the “introns late” model. For instance, in case of tRNAs, all of which have same general conformation, it is unlikely that the two regions of the gene evolved independently because the two regions base pair to fold the molecule into the functional shape. In case of the intron-bearing tRNA genes of eukaryotes (Lowe and Chan 2011; Yoshihisa 2014), introns are thought to become inserted into a continuous gene (Krebs et al. 2013). Ahmedinejad et al. (2010) compared exon–intron structures of 64 mitochondria-derived genes and found that intron density of recently transferred genes was similar to that of genes originated by most ancient transfers, which led to the conclusion that spliceosomal introns evolved following endosymbiotic gene transfer. Further, it has been discovered that the genes of yeast and mammalian mitochondria encode virtually identical proteins. But, vertebrate mitochondrial genomes are very small and extremely compact (Wolstenholme 1992; Taanman 1999), whereas yeast mitochondrial genomes are much larger and have some complex interrupted genes (Langkjaer et al. 2003). It appears that, since yeast mitochondrial introns (and certain other introns) can be mobile, they might have arisen by insertions thus supporting the “introns late” hypothesis (Krebs et al. 2013).

To understand the origin of introns in gyrase A gene (which is thought to be the counterpart of bacterial gyrase A subunit) of plants, we, in the present study, analyzed the gyrase A gene sequences of different prokaryotes and plant species for intron characteristics and identified that they appeared in the gyrase A genes following the endosymbiotic event. We also found that the positions of the homologous introns are highly conserved in diverse green plant lineages.

Materials and methods

Sequence retrieval and determination of intron positions

The coding sequences of gyrase A belonging to various organisms (listed in Table 1) were retrieved from NCBI (https://www.ncbi.nlm.nih.gov/) or Uniprot (http://www.uniprot.org/) database. These CDS sequences were either subjected to BLASTN (https://blast.ncbi.nlm.nih.gov) against genomic DNA database to identify the corresponding genomic DNA sequences or the corresponding genomic DNA sequences were directly retrieved from NCBI/Uniprot database. Each CDS sequence was again subjected to BLASTN to identify the presence of possible paralogs of gyrase A. All organisms except Brassca napus were found to possess single copy of gyrase A genes in their genomes. NCBI database indicated that Brassica napus genome had three paralogs of gyrase A (accession numbers: XM_013880667.1, XM_013792701.1, XM_013829267.1) belonging to three different chromosomes; A3, A7, and C3, respectively. The CDS and corresponding genomic DNA sequence of each gyrase A were then aligned by using CLUSTALW software available at http://www.genome.jp/tools/clustalw/, and the gaps resulted in the CDS sequences were identified as introns. The genomic DNA clone of gyrase A gene of Lotus japonica was retrieved from the NCBI database (AP009637.1), and its corresponding CDS sequence of the gene was manually identified by employing MAC vector software. The CDS and the genomic DNA sequences were further aligned to find out the positions of the introns as described earlier. The gene lengths of each gyrase A sequences are depicted in Fig. 1a, relative positions of introns in the CDS of gyrase A are shown in Figs. 1b and 3, and the lengths of introns are also shown in Fig. 4.

Table 1 Gyrase A protein characteristics, gene length, and number of introns present in the gyrase A genes belonging to different organisms
Fig. 1
figure 1

a The gyrase A gene sizes belonging to different organisms. The lengths of intron-containing gyrase A genes are more as compared to intron-less gyrase A genes belonging to bacteria, archaea, and green alga Bathycoccus. The difference in sizes of intron-containing genes is due to varying lengths of introns possessed by them; b schematic representation of distribution of introns in CDSs of gyrase A belonging to different organisms. Black bars represent the homologous introns, red bars indicate the extra introns that were missing in rest of the plant species, and blue bars represent nonconserved intron positions

Multiple sequence alignment and analysis of phylogeny

Multiple sequence alignment of Gyrase A protein sequences was carried out by employing T-COFFEE protein alignment server available at http://tcoffee.crg.cat/apps/tcoffee/do:expresso, and Phylogeny software available at http://www.phylogeny.fr/ was used to construct the phylogenetic tree (Fig. 2). The “Structural alignments (Expresso)” option of the T-COFFEE protein alignment server with default parameters was chosen to align the Gyrase A protein sequences. The “One Click” option with default parameters was chosen for construction of phylogenetic tree that employs MUSCLE (MUltiple Sequence Comparison by Log-Expectation) for multiple alignment, Gblocks for automatic alignment curation, PhyML for tree building, and TreeDyn for tree drawing (Dereeper et al. 2008).

Fig. 2
figure 2

a Evolutionary relationship between Gyrase A of different organisms. The phylogenetic tree (generated by Phylogeny software) showing grouping of Gyrase A proteins of various species based on their evolutionary relationship; b simplified tree of life showing the algal/plant lineages and their evolutionary relationships (information obtained from Coates et al. 2015)

Identification of the positions of homologous introns

Identification of positions of homologous introns in the CDSs of intron-containing gyrase A genes of green alga Klebsormidium nitens, bryophyte Physcomitrella patens, lycopodiophyte Selaginella moellendorffii, and angiosperm species was carried out by aligning the sequences in CLUSTALW software (http://www.genome.jp/tools/clustalw/). About 10 bp upstream exon sequence of each intron (exon–intron junction sequences were found by carrying out pairwise alignment of CDS with genomic DNA sequences of respective plant species) was marked in the aligned sequence manually in all the gyrase A CDSs of the plant species. Schematic representation of distribution of introns in CDSs of gyrase A (Fig. 1b) was made by using same length bars as the aligned CDSs of various organisms, and the introns were represented by rectangular solid boxes. The approximate distances between each pair of introns were maintained by correlating the number of nucleotides with the lengths between them.

Construction of heat map to depict intron length variation

The heat map (Fig. 4) was constructed to depict intron length variation using the “Conditional Formatting” option of the “Home” section of Microsoft Excel (Windows 10). All the cells in Excel sheet containing the intron length values were selected first, and then the “Color Scales” option was chosen under the “Conditional Formatting” option, and further, under the “More Rules” option, the “2-Color Scale” format style option was chosen (one color representing the lowest value and another signifying the highest value) to generate the heat map.

Prediction of protein localization

Subcellular targeting of Gyrase A proteins was predicted by using the Web-based TargetP signal peptide predicting software programs (http://www.cbs.dtu.dk/services/TargetP/) (Emanuelsson et al. 2000). The “Plant” option under the “Organism group” section and “specificity > 0.95” option under the “Cutoffs” section were chosen to predict the subcellular location of the Gyrase A proteins.

Protein molecular weight and isoelectric point determination

The molecular weight and isoelectric point (pI) of Gyrase A proteins were determined by using ExPASy Compute pI/Mw tool (http://web.expasy.org/compute_pi/) with default parameters.

Dot plot analysis

The dot matrices (Fig. 5) were prepared by employing multi-zPicture software available at http://zpicture.dcode.org/ using all parameters by default (Ovcharenko et al. 2004). To construct the dot matrices, for each event, either two CDS sequences of gyrase A belonging to two different organisms or genomic DNA sequence of gyrase A of two different species was aligned by putting the two sequences under “SEQUENCE 1” and “SEQUENCE 2” columns, respectively.

Results

Sequence analyses of gyrase A genes from various organisms

A total of thirty-one different gyrase A gene sequences belonging to eubacteria (seven), archaea (one), red alga (one), brown alga (one), green algae (four), bryophyte (one), lycopodiophyte (one), and angiosperm (fifteen) (Table 1) were analyzed in the present study. We could not find any gyrase A gene sequences from gymnosperms in the available databases. The gyrase A genes belonging to red alga, brown alga, green algae (except Bathycoccus prasinos), bryophyte, lycopodiophyte, and angiosperms were found to possess introns while these were missing in the prokaryotes (bacteria and archaea) and the green alga Bathycoccus prasinos. Among all the intron-bearing gyrase A genes studied, only Brassica napus had three paralogs of gyrase A, while rest of the gyrase A genes were present as single copy in their respective plant genomes. The gene and protein sequences of two paralogs (XM_013792701.1 and XM_013880667.1) of gyrase A of Brassica napus were identical while the third paralog (XM_013829267.1) had slightly different gene and protein sequences (Online Resource 1). Introns are the characteristics of eukaryotic genes, which result in increased size of the genes. Due to intron accumulation, the size of the genes became bigger (Fig. 1a); however, their size varied as they possessed introns of varying lengths (Fig. 4). In the present study, the protein sequences of different gyrase A gene were found to be highly conserved across these genera of prokaryotes and eukaryotes with high degree of similarity percentage (Online Resource 2). The phylogenetic studies established a close evolutionary relationship between the Gyrase A proteins of various organisms. Six distinct clades were observed among prokaryotic and eukaryotic Gyrase A (Fig. 2). The eubacterial and archaeal Gyrase A proteins grouped together in one clade, and those belonging to red alga (Galdieria sulphuraria) formed a clade with brown alga (Ectocarpus siliculosus). The Gyrase A from three species of green algae, namely Chlorella variabilis, Bathycoccus prasinos, and Chlamydomonas reinhardtii, formed another separate clade while the one from Klebsormidium nitens was the sole member of another clade in close association with Gyrase A of bryophyte, lycopodiophyte, and the angiosperms. The Gyrase A proteins from the bryophyte (Physcomitrella patens) formed a clade with lycopodiophyte (Selaginella moellendorffii), and those belonging to all the angiosperm species grouped together in separate clade (Fig. 2).

Protein localization of Gyrase A

The localization of Gyrase A was predicted for all the sequences taken under the study, and almost all the proteins were found targeted putatively to either chloroplast or mitochondria. Gyrase A proteins belonging to most of the angiosperms except Oryza sativa were found to localize in the chloroplast, and it was found to be localized in mitochondria in Oryza sativa. The organeller localization of Gyrase A proteins was not obtained for Arachis duranensis, Setaria italica, and Zea mays by the TargetP signal peptide predicting software at the desired level of specificity (indicated in the Materials and Methods section). Gyrase A belonging to green algae Bathycoccus prasinos and Chlorella variabilis were localized in chloroplast, and for rest of the green, brown, and red algae, bryophyte and lycopodiophyte, the protein localization targets were not found (Table 1) The localization of Gyrase A belonging to moss (bryophyte) Physcomitrella patens was intentionally not predicted since some amino acids from the N terminus were found to be missing from the protein (source: NCBI database).

Locating the position of homologous introns in gyrase A genes

The analysis of intron sequences of gyrase A genes revealed the presence of splicing junctions with the sequences GT–AG except some variations at a few places. The introns defined in this way start with dinucleotide GT and ends with dinucleotide AG, following the GU–AG rule, which is a characteristic feature of most of the spliceosomal introns (Rogozin et al. 2012). The number of introns in gyrase A genes of various organisms is indicated in Table 1. The introns were invariably located at similar conserved positions in respective gyrase A genes of green alga Klebsormidium nitens, bryophyte, lycopodiophyte, and angiosperms (Figs. 1b and 3). Barring certain places (Fig. 3), the intron positions were well conserved in the gene sequences. While some homologous introns were found to be missing in gyrase A genes of Klebsormidium nitens (the green alga), Physcomitrella patens (the moss), Selaginella moellendorffii (the lycopodiophyte), Arabidopsis thaliana, Brassica napus, and Oryza sativa at various locations (Figs. 1b and 3), extra introns in nonhomologous positions were found in Klebsormidium nitens, Selaginella moellendorffii, Oryza sativa, and Sesamum indicum gyrase A gene sequences (Figs. 1b and 3). Intron positions in three paralogs of gyrase A belonging to Brassica napus were found to be conserved (Online Resource 1).

Fig. 3
figure 3

Location of introns in the gyrase A genes. The figure depicts approximately 10 bp upstream sequences (exonic regions) of various homologous introns present in the gyrase A genes belonging to different plant species. The sequences represent snapshot view of aligned coding sequences (CDSs) of the gyrase A genes. Extra exons belonging to Selaginella, Klebsormidium, Oryza, and Sesamum at the respective positions are marked with colored squares

Identifying the level of conservation of intron sequences

Although the intron positions were well conserved in gyrase A throughout the species taken under study, the introns at homologous locations had different lengths (Fig. 4). To know the level of conservation in the sequence composition of these introns, dot matrices were plotted involving two gyrase A genomic DNA sequences from different plant species and the results were compared with other dot matrices which involved respective CDS sequences of both the species. The analysis showed a high degree of sequence conservation in CDSs while multiple gaps were present in the aligned dot matrices of their genomic DNA sequences (Fig. 5), wherein these gaps certainly arose due to the nonconservation of intron sequences.

Fig. 4
figure 4

Heat map and hierarchical cluster displaying the varying lengths of homologous introns belonging to gyrase A gene of different plant species. The lengths of homologous introns of gyrase A gene are not conserved throughout the evolution for being under lesser selection pressure

Fig. 5
figure 5

Similarity level in coding and genomic DNA sequences of gyrase A gene. Dot matrices showing the levels of conservation in the paired genomic DNA or coding sequences (CDSs) of gyrase A genes. The dot matrices corresponding to genomic DNA sequences contain gaps indicating nonconservation of intron sequences. On the contrary, CDSs are highly conserved as evident from near absence of gaps in the aligned CDSs

Discussion

The DNA gyrase enzyme performs a crucial function of inducing negative supercoils in the DNA of bacterial and organellar genomes (Drlica 1978). The functional gyrase protein has two subunits: Gyrase A and Gyrase B encoded by gyrase A and gyrase B genes, respectively. The A and B subunits together bind to DNA, hydrolyze ATP, and introduce negative supercoils. The A subunit carries out nicking of DNA, B subunit introduces negative supercoils, and finally, the A subunit reseals the nicked strands (Cozzarelli 1980; Gellert 1981). Any abrupt mutation in these genes would impair their functions and might lead to the death of organism due to a nonfunctional chloroplast or mitochondria. Considering above facts, it is extremely important for bacterial or plant species to conserve the sequence of this gene throughout the course of evolution, which made us curious to dissect evolutionary path of gyrase A gene. In the present study, a total of thirty-one gyrase A gene sequences from distantly related organisms were analyzed to evaluate the conservation of exon and intron sequence compositions and their respective positions. Out of thirty-one gyrase A gene sequences, nine have no introns, among which eight belong to prokaryotes and one to eukaryotic green alga Bathycoccus prasinos. In rest of the twenty-two gyrase A genes, eighteen gene sequences (one from green alga (Klebsormidium nitens), one from bryophyte (Physcomitrella patens), one from lycopodiophyte (Selaginella moellendorffii), and fifteen from various angiosperms) have introns at the homologous positions while rest of the four gyrase A genes (from red, green, and brown algae) possess introns without any positional conservation. Previous studies also identified interruptions at homologous positions relative to the coding sequence of all active globin genes in mammals, birds, and frogs (Hardison 2012). In our study, we found a high level of positional conservation of homologous introns in gyrase A genes of distantly related plant species. There was a loss of introns in gyrase A genes of Klebsormidium, Physcomitrella, Selaginella, Oryza, Arabidopsis, and Brassica and a gain of introns in gyrase A genes of Klebsormidium, Selaginella, Oryza, and Sesamum at various homologous positions (Figs. 1b and 4). The actual function of loss and gain of introns is not clear. In all gyrase A gene sequences, exons are well conserved, whereas introns are not only of varying lengths but also not conserved across the various plant species, which supports the view that introns evolve more rapidly than their neighboring exons for not being under higher selection pressure (Morello and Breviario 2008).

Earlier studies have confirmed DNA gyrase activity in the chloroplasts and mitochondria of plants. The inhibitors of bacterial DNA gyrase, novobiocin, and nalidixic acid were found to inhibit chloroplast DNA synthesis in higher plants (Heinhorst et al. 1985). Also, Arabidopsis thaliana genome analysis revealed that Arabidopsis DNA gyrase A and B subunits were targeted to organelles (Elo et al. 2003). In our study, all the eukaryotic Gyrase A proteins, except a few, were found to be localized in either chloroplast or mitochondria. It is worthwhile to mention here that the gyrase A genes of eukaryotes do not belong to the organeller genomes but are a part of the nuclear genomes, and most of these genes contain introns. Since Gyrase A is a prokaryotic protein, the crucial question arises whether it acquired introns in eukaryotes before the endosymbiotic gene transfer or the introns appeared in the gene after they migrated from organelles to the nucleus following the endosymbiotic event.

There are mainly two models to explain the process of evolution of introns in eukaryotic genes: “introns early” and “introns late” model. The “introns early” model proposes that introns have always been an integral part of the genes and intron-less genes evolved by losing introns during the course of evolution (Gilbert and Glynias 1993). It has been postulated that the introns (flanked by exons) had favored genetic recombination among the exons, leading to evolution of newer proteins in the ancient prokaryotic cell (Blake 1985; Holland and Blake 1987). But the “introns early” model fails to explain why introns are absent in prokaryotic genome. Although the “genome streamlining model” (according to this model, introns were eliminated by prokaryotic cells as the main pressure in the evolution of prokaryotes had been maximization of the replication rate) explains why introns are missing in the prokaryotic genome, it does not have any concrete evidence to support the validity of the hypothesis (Gilbert 1987; Roy 2003). The “introns late” model proposes that the ancestral genes were intron-less and those with introns gained them during the course of evolution (Cech 1986). The hypothesis emphasizes that the introns are a eukaryotic innovation and the intron gain has been a continuous process during the evolution of eukaryotes (Doolittle and Stoltzfus 1993; Mattick 1994; Stoltzfus et al. 1994). As opposed to the “intron early” model, the “introns late” model argues that exon shuffling through recombination is a recent phenomenon important for protein variability in complex eukaryotes only (Patthy 1991). We considered both “introns early” and “introns late” models and found that the “introns late” model suitably explains the evolution of introns in gyrase A gene of plants.

In the present study, conservation of position of homologous introns in most of the plant gyrase A genes makes the “introns early” model lucrative to explain the origin of introns in gyrase A because it might appear that the progenitor ancient gyrase A gene contained introns and the gene got passed onto higher order eukaryotic plant species resulting in positional conservation of homologous introns. However, this model fails to explain loss of introns in the green alga Bathycoccus prasinos, where intron loss cannot be explained through “genome streamlining” hypothesis meant for prokaryotes. Also, nonconservation of intron positions in four algal gyrase A genes refutes the “introns early” model as there should have been no difference in intron positions in the genes if introns were the integral part of the gene since the time of its genesis. On the other hand, the “introns late” model successfully explained the evolution of introns based on acquired results. We propose that the ancient gyrase A gene was intron-less and acquired introns after endosymbiotic gene transfer and subsequently, the introns diverged with variations in both length and sequence compositions in different plant species. This seems a logical proposal as the prokaryotic bacteria and eukaryotic green alga Bathycoccus prasinos did not contain any intron in their gyrase A genes (gyrase A genes of algae are the part of nuclear genome), which indicates the absence of introns in ancestral gyrase A gene. The Gyrase A protein sequences belonging to seven different species of bacteria, namely Escherichia coli, Mycobacterium tuberculosis, Micrococcus luteus, Streptomyces coelicolor, Aquifex aeolicus, Thermus thermophilus, and Salmonella typhimurium, showed 86% similarity with the Gyrase A protein sequences from diversely related plant species (Online Resource 2) indicating a fairly good level of conservation in exonic regions. Hence, prokaryotic gyrase A gene should have been the progenitor of eukaryotic counterparts. Further, since prokaryotic gyrase A genes are intron-less, intron gain must have occurred after the endosymbiotic gene transfer. In our study, we found that in red, brown, and two green algae, the positions of homologous introns are not conserved. This might be due to independent acquisition of introns by various algal gyrase A genes. Since brown, red, and green algae evolved through separate lineages from a common eukaryotic ancestor (Coates et al. 2015), it might have led to independent acquisition of introns, which further resulted in nonconservation of intron positions in gyrase A genes of these algal species. We also believe that the gyrase A gene of green alga Bathycoccus prasinos did not acquire introns during the course of evolution. Interestingly, one species of green algae (Klebsormidium nitens) showed similarity in positions of homologous introns when compared to gyrase A genes of higher order plant species. Green algae are known to share significant structural and biochemical features with embryophytes (Shaw et al. 2011). Biochemical, molecular, ultrastructural, and phylogenetic studies support close relationship between charophycean algae and embryophytes (Turmel et al. 2006), and hence, green algae have been identified to be the ancestor of the bryophytes (Shaw et al. 2011). The green alga Klebsormidium nitens in the present study belongs to division charophyta, and the positions of homologous introns in its gyrase A gene are similar to bryophyte Physcomitrella mitens, which in turn depicts possibility of bryophyte to acquire gyrase A gene from an ancestral species of green algae possibly belonging to division charophyta. Since the positions of the homologous introns remained more or less conserved without further re-arrangement throughout the evolutionary timescale starting with the green algae, we hypothesize that the bryophytes acquired intron-bearing gyrase A from a common ancestral green algal species and thereafter, during speciation, the gyrase A gene containing introns moved to different plant species belonging to lycopodiophytes and angiosperms, and this consequently resulted in conservation of homologous intron positions throughout the course of evolution of green plant species. The conservation of intron positions in all the three paralogs of gyrase A gene of Brassica napus showed no effect of gene duplication on the positions of the homologous introns. It was not possible to analyze intron positions in gyrase A of gymnosperms due to the absence of sequence information in available databases. However, we believe that the homologous introns in those species will have the same conserved intron positions as in algae, bryophytes, lycopodiophytes, and angiosperms. Thus, on the basis of our observations, we propose the “introns late” model for the evolution of introns in gyrase A genes in plant species (Fig. 6).

Fig. 6
figure 6

Evolution of introns in gyrase A gene of plants. Schematic diagram representing probable pathway for origin of introns in gyrase A genes of the plants. EGT denotes endosymbiotic gene transfer

Conclusions

To the best of our knowledge, this is the first report to discover that gyrase A genes in plants have well-conserved exons and possess introns at homologous positions with characteristic variations in their nucleotide sequence compositions and lengths. Further, we conclude that these spliceosomal introns evolved following the endosymbiotic gene transfer of the gyrase A to the nuclear genome of an ancestral green plant and the higher order green plants, i.e., bryophytes, lycopodiophytes, and angiosperms, acquired intron-bearing gyrase A gene from a common ancestral green algal species, which resulted in positional conservation of introns in the gyrase A genes. We believe that our approach can be suitably utilized in other genes too for studying intron and exon characteristics and their possible evolutionary pathway. Furthermore, more sequencing data of plant genomes will definitely be an important asset to analyze and consolidate our views regarding the “introns early” or “introns late” model.