Genomic characterization of Rim2 / Hipa elements reveals a CACTA-like transposon superfamily with unique features in the rice genome

Wang, G.-D.; Tian, P.-F.; Cheng, Z.-K.; Wu, G.; Jiang, J.-M.; Li, D.-B.; Li, Q.; He, Z.-H.

doi:10.1007/s00438-003-0918-z

Genomic characterization of Rim2 / Hipa elements reveals a CACTA-like transposon superfamily with unique features in the rice genome

Original Paper
Published: 26 September 2003

Volume 270, pages 234–242, (2003)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

Molecular Genetics and Genomics Aims and scope Submit manuscript

Genomic characterization of Rim2 / Hipa elements reveals a CACTA-like transposon superfamily with unique features in the rice genome

Download PDF

G.-D. Wang^1,3,
P.-F. Tian²,
Z.-K. Cheng⁴,
G. Wu¹,
J.-M. Jiang⁴,
D.-B. Li²,
Q. Li¹ &
…
Z.-H. He^1,2

423 Accesses
30 Citations
Explore all metrics

Abstract

The availability of huge amounts of rice genome sequence now permits large-scale analysis of the structure and molecular characteristics of the previously identified transposase-encoding Rim2 (also called Hipa) element, which is transcriptionally activated by infection with the fungal pathogen Magnaporthe grisea and by treatment with the corresponding fungal elicitor. Based on genomic cloning and data mining from 230 Mb of rice genome sequence, 347 Rim2 elements, with an average size of 5.8 kb, were identified. This indicates that an estimated total of 600–700 Rim2 elements are present in the whole genome. Rim2 insertions occur non-randomly on the chromosomes, as visualized by fluorescence in situ hybridization. The elements harbor 16-bp terminal inverted repeats with the core sequence CACTG, 16-bp sub-terminal repeats, internal variable regions, 3-bp target sequence duplications in the flanking regions, and genes coding for Rim2 proteins (the putative transposase) and hydroxyproline-rich glycoproteins. High levels of insertion into genic regions are observed for members of this family, and the transposition history of the family can be deduced from the high level of shared sequences and analysis of repeat target sites of the elements. Phylogenetic analysis indicates that the putative RIM2 proteins fall into a subgroup distinct from the TNP2-like subgroup of transposases. Southern hybridization with genomic DNA from monocotyledonous and dicotyledonous plants demonstrates that the RIM2-coding sequence is unique to the Oryza genome. Our results demonstrate that the Rim2 elements from rice belong to a distinct superfamily of CACTA-like elements with evolutionary diversity.

Transposable Element Dynamics in Rice and Its Wild Relatives

Analysis of Tc1-Mariner elements in Sclerotinia sclerotiorum suggests recent activity and flexible transposases

Article Open access 03 October 2014

Genome-wide comparison of Asian and African rice reveals high recent activity of DNA transposons

Article Open access 28 April 2015

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Transposable elements (TEs) are fundamental components of most, if not all, plant genomes, which play important roles in the structure, variation and adaptive evolution of genomes (McClintock 1984; McDonald 1995; Kidwell and Lisch 1997). TEs are divided into two classes. Class 1 comprises the RNA elements (retrotransposons), which transpose via an RNA intermediate by a mechanism that is dependent on reverse transcription (Kumar and Bennetzen 1999). Class 2 consists of the DNA elements, which usually contain terminal inverted repeats (TIRs) and transpose via a DNA intermediate (Kunze et al. 1997). Class 2 elements can be further divided into two groups. Autonomous elements, such as the CACTA elements (also called the En/Spm family) which have the core sequence CACTA in their TIRs, possess two ORFs encoding a TNPA/TNP1 DNA-binding protein and a TNPD/TNP2 (TNP2-like) transposase (TPases), both of which are essential for their transposition. This group includes the elements En/Spm from maize (Gierl 1996) and Tam1 from snapdragon ( Antirrhinum majus) (Nacken et al. 1991). The non-autonomous elements, such as the maize dSpm (deletion derivatives of Spm), on the other hand, require the corresponding autonomous element (in this case, Spm) for transposition (Gierl 1996).

It is known that TEs contribute significantly to genomic plasticity in the face of diverse environmental conditions. During the past decade, a number of TEs have been found to be activated when plants or cells were subjected to stresses (Hirochika 1993; Arnault and Dufournel 1994; Wessler 1996). For example, transcription of the tobacco retrotransposon Tnt1 could be induced by pathogens and microbial elicitors, as well as by abiotic factors, indicating that activation of Tnt1 might be a local and early plant response to microbial stress (Pouteau et al. 1994; Mhiri et al. 1997; Vernhettes et al. 1997). Moreover, mobility of Tnt1 correlated with its transcriptional activation by fungal elicitation (Melayah et al. 2001), and Tnt1 insertion could change host gene splicing (Leprinc et al. 2001). Similarly, we isolated a cDNA ( Rim2) derived from a transcript that was strongly induced by the rice fungal pathogen Magnaporthe grisea and the corresponding elicitor. Rim2 harbored an ORF encoding a putative protein that shows weak similarity to the TNP2-like TPases encoded by CACTA TEs (He et al. 2000). This report was the starting point for the study of class 2 elements involved in stress responses in the rice genome. Subsequently Panaud et al. (2002) isolated four genomic clones of the Rim2 family using representational difference analysis (RDA), and named these Hipa elements. These authors suggested that Hipa is an ancient component of the Oryza genome. However, we still lack information on the organization of the entire Rim2/Hipa family in the rice genome (for consistency, we will refer to this family as Rim2 in the following).

DNA sequencing and data mining have demonstrated that the rice genome is rich in TEs (Bureau et al. 1996; Hirochika et al. 1996; Song et al. 1998; Ohtsubo et al. 1999; Mao et al. 2000; Jiang and Wessler 2001; Turcotte et al. 2001; Jiang et al. 2002; Panaud et al. 2002). Most recently, the rice class 2 element mPing was shown to transpose actively in rice cell cultures using a data mining and transposon display approach (Jiang et al. 2003). Similarly, the availability of genomic Rim2 clones and vast amounts of rice genomic sequence (http://www.ncbi.nlm.nih.gov) now allow us to address the issue of the structure and genomic organization of the Rim2 family. In this paper, we report that rice Rim2 elements constitute a CACTA-like superfamily with at least 600 members per haploid genome. Of particular interest is the finding that members of the Rim2 family carry a unique CACTG core sequence in their TIRs.

Materials and methods

Rim2 DNA cloning and sequencing

A 786-bp cDNA which codes for the conserved Rim2 TPase-was used as probe to screen a rice bacterial artificial chromosome (BAC) library containing 1.07×10⁴ clones (He et al. 2000). From positive BACs, three Rim2 subclones, Rim2-228 , Rim2-248 and Rim2-553, were sequenced on an ABI 3700 sequencer (Perkin-Elmer, Foster City, Calif.). The sequences have been deposited in the GenBank database under the Accession Nos. AY090462, AY090463 and AY090464, respectively.

Sequence mining and identification of Rim2 elements

Publicly available genomic sequence data from rice (the japonica cultivar Nipponbare) obtained by the International Rice Genome Program were accessed through the GenBank database (http://www.ncbi.nlm.nih.gov) and subjected to Blast searches (Altschul et al. 1990), using the Rim2 cDNA and the conserved sequences of Rim2-228, Rim2-248 , Rim2-553 as queries. The conserved TIR sequence 5′-CACTGGTGGAGAAACC-3′ and target site duplications (TSDs) were excluded from the query sequences. More than 340 elements were identified among the rice sequences available as of 5 September, 2002. The Rim2 content of the genome was calculated by dividing the total length of the Rim2 sequences detected by the total length of sequence searched. Copy numbers were estimated assuming that the haploid genome size is 420–466 Mb (Goff et al. 2002; Yu et al. 2002).

DNA and protein sequence analysis

Initial sequence analysis was performed using GCG (Genetics Computer Group, Madison, Wis.) programs and the EMBOSS program (European Molecular Biology Open Software Suite) accessed through the Shanghai Institutes for Biological Sciences (SIBS). Sub-terminal repeat sequences were analyzed with Lasergene GeneQuest (DNAstar, Madison, Wis.). Multiple alignments were generated with ClustalX (Thompson et al. 1997), then refined and displayed using GeneDoc (http://www.psc.edu/biomed/genedoc). Sequence similarity between proteins was estimated based on the BLOSUM62 matrix (Henikoff and Henikoff 1993). AT/GC contents of regions around insertion sites were calculated and compared to the values derived from 7.07 Mb of randomly chosen sequences as a control. Insertions of annotated elements into intergenic and genic regions were predicted based on GenBank searches. To document element mobility in the past, sequences immediately flanking Rim2 inserts were used as queries to search for target sequences without inserts, an approach also called 'related to empty sites' (RESites) (Le et al. 2000). High levels of shared nucleotide sequence similarity between members were taken to indicate recent activity of the Rim2 family, as has been documented for other TEs.

Phylogenetic analysis

A phylogenetic tree was constructed using the distance-based neighbor-joining method MEGA2.1 (Kumar et al. 2001), based on amino acid sequences of TPases encoded by representative CACTA-like elements, including Rim2-228, Rim2-248 and Rim2-553 (this study), the Rim2 -cDNA (He et al. 2000), the maize TNPD (Pereira et al. 1986), DOPD (Bercury et al. 2001) and Sho sequences (Panavas et al. 1999), the snapdragon TNP2 (Nacken et al. 1991), and the Arabidopsis ATENSPM6 and ATENSPM9 sequences (Jiang and Wessler, http://www.girinst.org), the Tdc1-ORF from carrot (Ozeki et al. 1997), the soybean Tgm5-ORF (Rhodes and Vodkin 1988), and the Tnp2L from sorghum (GenBank AAD27566). Bootstrap analysis was performed with 1000 replicates to test the significance of nodes.

Fluorescence in situ hybridization (FISH) on pachytene chromosomes

The FISH procedure essentially followed the published protocol using young panicles of a japonica rice variety, Nipponbare (Jiang et al. 1995). A 2.8-kb Eco RI cDNA fragment (Accession No. AF121139) and a 12-kb Rim2-248 fragment were labeled with biotin-16-dUTP by nick translation. Hybridized slides were examined under an Olympus BX60 fluorescence microscope. Chromosomes and FISH signal images were captured using a SenSys charge-coupled device camera (Photometrics, Tucson, Ariz.).

Distribution of Rim2-coding sequences in monocotyledonous and dicotyledonous plants

Genomic DNAs of members of the family Poaceae, including cultivated rice ( Oryza sativa L.), wild rice ( O. rufipogon), wheat, barley, maize, sorghum, millet and Arabidopsis were digested with Eco RI, and Southern analysis was performed as described by He et al. (2000), using the conserved 786-bp Rim2 fragment as probe, and washing at 65°C. The blots were autoradiographed for only 3 h to avoid overexposure of the lanes containing rice DNA.

Results

Molecular structure of the Rim2 elements

Even though the Rim2 genomic sequences could be deduced by bioinformatic analysis of the sequenced rice genome, we nevertheless isolated and sequenced some Rim2 clones for direct determination of their sequences and for use as FISH probes. Three Rim2 subclones, Rim2-228 , Rim2-248 , Rim2-533 , were isolated from BACs, and subjected to detailed sequence analysis. These elements ( Rim2-228,10,917 bp; Rim2-248, 14,873 bp; and Rim2-533, 14,293 bp) consist of highly structured termini made up of three motifs (TSD, TIR and sub-terminal direct/inverted repeats), coding regions, and large internal variable regions (Fig. 1A and B). All three elements have perfect 16-bp CACTGGTG/AGAGAAACC TIRs. A unique feature is the presence of the 5-bp core sequence CACTG in the TIRs, instead of the core sequence CACTA found in the TIRs of other CACTA elements. Upon insertion, 3-bp TSDs, GAA ( Rim2-228), TAA ( Rim2-248) and TTA ( Rim2-553), respectively, were generated. In the sub-terminal regions, near identical 16- to 17-bp direct and inverted repeats with the 16-bp consensus sequence ATCTTTAGTCCCGGTT were identified (Fig. 1A and B); these repeats were also present in the central region (Panaud et al. 2002). Rim2 shares this feature with other CACTA elements, which contain sub-terminal repeats of 9–12 bp that differ in their consensus sequences. Interestingly, there are 105-bp tetra-repeats immediately downstream of the RIM2 coding region in Rim2-228 (Fig. 1A and B), which are almost the same as the repeats found in the Rim2 cDNA clone, suggesting this element might be the source of the Rim2 transcript previously isolated (He et al. 2000). Similar tandem repeats, albeit of variable length, are also found in Rim2-248 and Rim2-533. The role of these repeats remains unknown.

ORF prediction revealed that Rim2-228, Rim2-248 and Rim2-553 all contain two genes. Gene 1 encodes RIM2 proteins (which show more than 90% sequence identity to the product of the ORF in the Rim2 cDNA and to each other) that show fragmentary homology to TNP2/TNPD-like proteins encoded by the CACTA family (see below). The predicted product of Gene 2 is a putative hydroxyproline-rich glycoprotein (HRGP; Accession No. AU090537), and shows no similarity to the TNP1/TNPA DNA-binding protein encoded by other elements of the CACTA family or other putative proteins coded for by Cs1, another CACTA-like element from sorghum (Chopra et al. 1999). The significance of the HRGP ORF in this TE family is not known. No ORF for a TNP1/TNPA protein could be detected in the Rim2 elements in spite of detailed screening. Based on these features, a model structure for Rim2 elements can be derived (Fig. 1C).

Genome-wide mining and sub-grouping of the Rim2 elements

Based on the structural features of the Rim2 elements described above, we conducted a genome-wide search for Rim2 elements using the publicly available rice genome sequence ( japonica accession). As a result, 344 individual Rim2 elements were retrieved from a total sequence of about 230 Mb, including the full-length sequences of chromosomes 1, 4 and 10, that were accessible by 5 September, 2002 (see Electronic Supplementary Material). Their average size is 5.8 kb (their TIRs were delimited on the basis of the structured motifs described in Fig. 1). Extrapolating from this result, it is estimated that about 600–700 Rim2 elements are present in the haploid rice genome of 420–466 Mb (Goff et al. 2002; Yu et al. 2002). Rim2 elements therefore account for about 0.88% of the total rice genome.

Most of the mined elements have conserved 16-bp TIRs and sub-terminal repeats, internal variable regions, as well as 3-bp TSDs (see Electronic Supplementary Material). Intriguingly, 49 members contain imperfect TIRs with CACTA and CACTG sequences at their 5′ and 3′ ends (CACTA-CACTG), respectively. The 347 Rim2 elements can be divided into three subgroups depending on their coding capacity: RIM2-coding (84), RIM2-pseudogene (50) and non-coding (213) subgroups (Table 1). The members of the RIM2-coding subgroup have ORFs near their 5′ ends for putative RIM2 proteins with 416–1110 amino acids. Most of the RIM2-coding elements also harbor ORFs near the 3′ ends for putative HRGPs with 521–2635 amino acids. Eight elements ( Rim2-M90, Rim2-M93 , Rim2-M272 , Rim2-M328 , Rim2-M333 , Rim2-M341 , Rim2-M342 and Rim2-M344) have ORFs that encode novel putative proteins which show only weak similarity to the RIM2 proteins, and all of these, except Rim2-M333, harbor ORFs for putative HRGPs. There are 50 Rim2 members containing pseudogenes; i.e., their ORFs are disrupted by several stop codons or frameshifts. These data support the postulate that high-copy-number TPase genes usually evolve into pseudogenes due to rapid accumulation of substitutions, insertions and deletions (Feschotte et al. 2002). These results also demonstrate that the Rim2 family, although structurally conserved, has been subject to dramatic evolutionary variation, indicative of long-term evolution of this TE family in the rice genome. Similar patterns of divergence are also observed in CACTA elements (Kunze and Weil 2002).

Table 1. Subgroups of sequenced/mined Rim2 element

Full size table

Insertion preference of the Rim2 elements

Since the whole japonica genome has not been completely annotated with sequenced BACs, we could not make a complete inventory of Rim2 elements on all chromosomes. Instead, we performed FISH analysis on pachytene chromosomes using the Rim2 cDNA and Rim2-248 DNA as probes. The FISH images obtained with the cDNA and genomic DNA are quite similar, and provide direct evidence that Rim2 elements are distributed widely across the whole genome (Fig. 2). They also indicate that Rim2 elements are not evenly dispersed on chromosomes; some regions show higher element densities than others.

The observation that Rim2 elements are unequally distributed along chromosomes raises the question whether Rim2 inserts preferentially at certain types of sites. We conducted a survey of the AT/GC contents of the regions adjacent to all Rim2 inserts (Fig. 3). The result indicates that Rim2 elements appear to show a preference for AT-rich regions: the 5′ and 3′ flanking regions lying within 100–500 bp from TSDs show average AT contents of 59.8–60.2% and 60.7–61.6%, respectively, compared to the average 56.3% AT content of a large sample of randomly chosen rice sequences. This preference for AT-rich regions has also been observed for some other transposons (Le et al. 2000).

In contrast to the observation that TEs insert randomly into rice genes (Mao et al. 2000), a probable preference for insertion into genic regions including exons, introns and the immediate 5′ and 3′ non-coding termini of predicted genes, was observed for this TE family. Of the 344 elements mined, 204 have been annotated in sequences. We surveyed the distribution patterns of these 204 elements (Fig. 4A). There are 44 elements (21.6%) inserted in coding regions of predicted genes, while 101 (49.5%) have inserted into non-coding regions including introns and 5′ and 3′ non-coding termini of predicted genes. Together, these two sets account for 71.1% of all the annotated elements; only 59 (28.9%) are found in predicted intergenic sequences. For example, Rim2-M6 is integrated in the promoter of a gene encoding a hypothetical protein, Rim2-M8 is inserted in a putative gene to constitute its intron 1, exon 2 and intron 2, and a putative peroxidase gene contains Rim2-M25 in intron 3 (Fig. 4B). It has previously been reported that the element is present in the Xa21 gene cluster in rice (He et al. 2000). These results suggest that the Rim2 elements, like many other TEs, can affect gene structure and expression by transposing into transcriptionally active sections of the chromosomes.

The Rim2 elements belong to a distinct CACTA-like superfamily

The previously isolated Rim2 cDNA was initially considered to be derived from a possible member of the CACTA family that includes the En / Spm element from maize and the Tam1 element from snapdragon, based on the fact that the putative RIM2 protein showed weak similarity to the TNP2/TNPD proteins encoded by the CACTA transposons (He et al. 2000). The genomic organization of the Rim2 elements also shows similarity to that of the CACTA elements. However, the Rim2 elements differ in several important respects from to the known CACTA elements: they exhibit neither the TNP1/TNPA protein-coding region nor the consensus CACTA motifs in the TIRs, and harbor a unique HRGP-encoding region. Moreover, the Rim2 elements are much more abundant than the CACTA elements (Nacken et al. 1991; Gierl 1996; Snowden and Napoli 1998). In order to gain insight into the phylogenetic relationship between Rim2 and other CACTA elements, we constructed a multiple alignment based on the sequences of representative RIM2 TPases and other CACTA-encoded TPases, and derived a phylogenetic tree for these proteins. This clearly shows that there is only fragmentary homology between the different kinds of proteins even in their most highly conserved regions (Fig. 5A). The phylogenetic tree indicates that the proteins encoded by the CACTA-like elements can be divided into two subgroups, the TNP2 and the RIM2 subgroups (Fig. 5B). Our results provide evidence that Rim2 elements constitute a CACTA-like (CACTA-related) superfamily that is distinct from other CACTA elements, and has several unique features: CACTG TIRs, conserved TPase- and HRGP-coding regions, and a high degree of amplification in the rice genome.

Rim2 elements were recently active

Despite variation in size and coding region, the high level of shared nucleotide sequence similarity (>90% identity) over the entire sequences of Rim2 members suggests that at least some of the elements were recently active, as has been deduced for other TEs (Le et al. 2000). We also tried to reconstruct the evolutionary history of element mobilization by identifying repeat target sequences with and without Rim2 insertions (also called the RESite approach; Le et al. 2000). Several sequences were found to correspond to certain mined members such as Rim2-M3, Rim2-M39 and Rim2-M45 (Fig. 6), clearly attesting to the past transposition/insertion of the Rim2 elements. Thus active transposition is undoubtedly responsible for the high copy number of these TEs in the rice genome. Circumstantial evidence from studies of MITEs and retrotransposons has strongly suggested the involvement of high-copy-number TEs in the recent restructuring of plant genomes (Witte et al. 2001; Feschotte et al. 2002). Our observations on the transposition history of Rim2 elements implies a similar function for the Rim2 family in restructuring the rice genome, although no current transposition activity of the family has been detected because it is difficult to identify new inserts against the background of numerous older copies.

The RIM2-coding sequence is specific for the Oryza genome

Southern hybridization was performed with genomic DNA from several monocot and dicot species (Fig. 7). The result showed that the RIM2-coding sequence only hybridized to the genomes of cultivated and wild rice, and did not cross-hybridize to the genomes of other cereal crops or Arabidopsis. In agreement with this observation, no homologous sequence from other plants was discovered in the databases by Blast searches with the Rim2 sequence queries. We also detected the Rim2 sequence in other Oryza species (data not shown). Thus, the RIM2-coding sequence is most probably specific to Oryza genomes.

Discussion

Rice is certainly a TE-rich species and, as the amount of sequence information for rice continues to increase, a complete picture of the TEs in the genome will ultimately emerge (Mao et al. 2000; Turcotte et al. 2001; Jiang and Wessler 2001). Here, we report that the previously isolated, pathogen-inducible, rice Rim2 elements belong to a distinct CACTA-like superfamily in the rice genome, which shows some novel features. Our study suggests that Rim2 elements might have originated from an autonomous CACTA ancestor that has yet to be identified. Indeed, rice elements with typical CACTA TIRs have been found, such as Tnr3 (Motohashi et al. 1996), Tnr12 (Han et al. 2000), and CACTA-E, F, G (Jiang and Wessler; www.girinst.org), which supports this hypothesis. However, all of these CACTA elements are small and harbor neither a TPase-encoding region nor sequence homology to Rim2 elements. We also could not find CACTA elements with both protein-coding regions and the conserved CACTA TIRs among the available rice sequences. The CACTA elements in rice probably represent the remnants of the CACTA ancestor, forming the residue of its evolution in the rice genome. Furthermore, some rice CACTA-like elements, such as those reported by Tarchini et al. (2000) and Turcotte et al. (2001), are in fact Rim2 elements with their CACTG TIRs. With respect to these observations, we suggest that the CACTA ancestor could have developed into two versions, CACTA in other species and its divergent counterpart, Rim2 , in rice. These results also suggest that the CACTA-related superfamily including Rim2 and the other CACTA ( En/Spm) groups is highly diverse in the plant kingdom.

The ORFs of the Rim2 elements have coding capacity for putative RIM2 proteins that structurally resemble the TNP2/TNPD proteins encoded by autonomous CACTA elements such as En / Spm and Tam1 (Fig. 5). TNPD, and probably TNP2 as well, is known to possess TPase activity (Kunze and Weil 2002). It is postulated that the TNP2/TNPD TPases function together with the TNP1/TNPA DNA-binding proteins as trans-acting factors, which bind to the TIRs and the 9- and 12-bp sub-terminal repeats that serve as cis-acting determinants, in order to mediate transposon excision. Furthermore, activation of the autonomous CACTA elements may be regulated by the sub-terminal repeats that are subject to methylation (Miura et al. 2001). The presence of TPase-like RIM2 proteins, 16-bp sub-terminal repeats and highly structured TIRs in Rim2 elements (Fig. 1) implies that a similar mechanism of trans / cis interaction-dependent transposition might have functioned in Rim2 amplification/transposition.

Interestingly, the other gene in the Rim2 elements encodes a putative HRGP instead of a DNA-binding protein, a unique feature not found in other class 2 TEs. It will be interesting to investigate the biological significance of this protein for TE activity.

It has long been known that defective elements can lose the ability to transpose actively during evolution and come to depend on the functions of autonomous elements, as proposed in the cases of the Sorghum element Cs1 which lacks a gene for the TNP2-like TPase (Chopra et al. 1999), the maize Dop which cannot synthesize functional products due to several mutations (Bercury et al. 2001), and the Arabidopsis element Atenspm which has suffered large internal deletions (Kapitonov and Jurka 1999). Whether members of the Rim2 family, which lack a coding sequence for the TNP1/TNPA DNA-binding protein, are defective or non-autonomous is still an open question. If so, the mobility of the Rim2 element(s) would depend on functions of an autonomous member of the Rim2 family that has not yet been identified. For genome stabilization, non-autonomous elements could derive an evolutionary advantage by preventing deleterious mutations from accumulating in the host organism, which is consistent with the observation that high-copy-number TEs seem to be transpositionally inactive (Feschotte et al. 2002). However, since Rim2 transcription is strongly induced under biotic stress (He et al. 2000), the pattern of transposition induced by stress-dependent activation will be worth investigating.

Sequence information for rice has increased dramatically as a result of the genome project (Goff et al. 2002; Yu et al. 2002). On the other hand, computer-aided data mining has proven to be the only viable approach for identifying vast groups of TEs (Le et al. 2000; Mao et al. 2000; Turcotte et al. 2001; Jiang et al. 2002). Certainly, more Rim2 members are expected as rice genome projects progress, making this family one of the biggest class 2 families in the rice genome, considering both element size and copy number. To the best of our knowledge, the Rim2 data presented here makes this the first class 2 family reported to be so highly amplified in the rice genome. So far, only class 1 elements have been shown to attain such copy numbers, as a consequence of their copy-paste mode of transposition. It is interesting to speculate that a similar copy-paste mechanism might function in Rim2 transposition. Moreover, Rim2 elements appear to preferentially insert into genic regions (Fig. 4), indicating the importance of the Rim2 family for rice gene structure and expression. Furthermore, because of their ubiquitous distribution and insertion polymorphism in different rice lines (Figs. 2 and 3, and unpublished data), the Rim2 elements can be exploited as novel genetic markers.

References

Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215:403–410
Article CAS PubMed Google Scholar
Arnault C, Dufournel I (1994) Genome and stresses: reactions against aggressions, behavior of transposable elements. Genetica 93:149–160
CAS PubMed Google Scholar
Bercury SD, Panvas T, Irenze K, Walker EL (2001) Molecular analysis of the Doppia transposable element of maize. Plant Mol Biol 47:341–351
Article CAS PubMed Google Scholar
Bureau TE, Ronald PC, Wessler SR (1996) A computer-based systematic survey reveals the predominance of small inverted-repeat elements in wild-type rice genes. Proc Natl Acad Sci USA 93:8524–8529
CAS PubMed Google Scholar
Chopra S, Brendel V, Zhang J, Axtell JD, Peterson T (1999) Molecular characterization of a mutable pigmentation phenotype and isolation of the first active transposable element from Sorghum bicolor. Proc Natl Acad Sci USA 96:15330–15335
CAS PubMed Google Scholar
Feschotte C, Jiang N, Wessler SR (2002) Plant transposable elements: where genetics meets genomics. Nat Rev Genet 3:329–341
Google Scholar
Gierl A (1996) The En/Spm transposable element of maize. Curr Topics Microbiol Immunol 204:145–159
CAS Google Scholar
Goff SA, et al (2002) A draft sequence of the rice genome ( Oryza sativa L. ssp. japonica). Science 296:92–100
Article CAS Google Scholar
Han CG, Frank MJ, Ohtsubo H, Ohtsubo E (2000) New transposable elements identified as insertions in rice transposon Tnr1. Genes Genet Syst 75:69–77
Article CAS PubMed Google Scholar
He ZH, Dong HT, Dong JX, Li DB, Ronald PC (2000) The rice Rim2 transcript accumulates in response to Magnaporthe grisea and its predicted protein product shares similarity with TNP2-like proteins encoded by CACTA transposons. Mol Gen Genet 264:2–10
Article CAS PubMed Google Scholar
Henikoff S, Henikoff JG (1993) Performance evaluation of amino acid substitution matrices. Proteins 17:49–61
CAS PubMed Google Scholar
Hirochika H (1993) Activation of tobacco retransposons during tissue culture. EMBO J 12:2521–2528
CAS PubMed Google Scholar
Hirochika H, Sugimito K, Otsuki Y, Tsugawa H, Kanda M (1996) Retrotransposon of rice involved in mutations induced by tissue culture. Proc Natl Acad Sci USA 93:7783–7788
CAS PubMed Google Scholar
Jiang J, Gill BS, Wang GL, Ronald PC, Ward DC (1995) Metaphase and interphase fluorescence in situ hybridization mapping of the rice genome with bacterial artificial chromosomes. Proc Natl Acad Sci USA 92:4487–4491
CAS PubMed Google Scholar
Jiang N, Wessler SR (2001) Insertion preference of maize and rice miniature inverted repeat transposable elements as revealed by the analysis of nested elements. Plant Cell 13:2553–2564
CAS PubMed Google Scholar
Jiang N, Bao Z, Temnykh S, Cheng Z, Jiang J, Wing RA, McCouch SR, Wessler SR (2002) Dasheng. A recently amplified nonautonomous long terminal repeat element that is a major component of pericentromeric regions in rice. Genetics 161:1293–1305
CAS PubMed Google Scholar
Jiang N, Bao ZR, Zhang XY, Hirochika H, Eddy SR, McCouch S, Wessler S (2003) An active DNA transposon family in rice. Nature 421:163–167
Article CAS PubMed Google Scholar
Kapitonov VV, Jurka J (1999) Molecular paleontology of transposable elements from Arabidopsis thaliana. Genetica 107:27–37
Article CAS PubMed Google Scholar
Kidwell MG, Lisch D (1997) Transposable elements as sources of variation in animals and plants. Proc Natl Acad Sci USA 94:7704–7711
PubMed Google Scholar
Kumar A, Bennetzen JL (1999) Plant retrotransposons. Annu Rev Genet 33:479–532
CAS PubMed Google Scholar
Kumar S, Tamura K, Jakobsen IB, Nei M (2001) MEGA2: molecular evolutionary genetics analysis software. Bioinformatics 17:1244–1245
CAS PubMed Google Scholar
Kunze R, Weil CF (2002) The hAT and CACTA superfamilies of plant transposons. In: Craig NL, Craigie R, Gellert M, Lambowitz AM (eds) Mobile DNA II. ASM Press, Washington, pp 565–609
Kunze R, Saedler H, Lönnig WE (1997) Plant transposable elements. Adv Bot Res 27:331–470
CAS Google Scholar
Le QH, Wright S, Yu Z, Bureau T (2000) Transposon diversity in Arabidopsis thaliana. Proc Natl Acad Sci USA 97:7376–7381
CAS PubMed Google Scholar
Leprinc AS, Grandbastien MA, Christian M (2001) Retrotransposons of the Tnt1B family are mobile in Nicotiana plumbaginifolia and can induce alternative splicing of the host gene upon insertion. Plant Mol Biol 47:533–541
Article CAS PubMed Google Scholar
Mao L, Wood TC, Yu Y, Budiman MA, Tomkins J, Woo S, Sasinowski M, Presting G, Frisch D, Goff S, Dean RA, Wing RA (2000) Rice transposable elements: a survey of 73,000 sequence-tagged connectors. Genome Res 10:982–990
CAS PubMed Google Scholar
McClintock B (1984) The significance of responses of the genome to challenge. Science 226:792–801
Google Scholar
McDonald JF (1995) Transposable elements:-possible catalysts of organismic evolution. Trends Ecol Evol 10:123–126
Google Scholar
Melayah D, Bonnivard E, Chalhoub B, Audeon C, Grandbastien MA (2001) The mobility of the tobacco Tnt1 retrotransposon correlates with its transcriptional activation by fungal factors. Plant J 28:159–168
Article CAS PubMed Google Scholar
Mhiri C, Morel JB, Vernhettes S, Casacuberta JM, Lucas H, Grandbastien MA (1997) The promoter of the tobacco Tnt1 retrotransposon is induced by wounding and by abiotic stress. Plant Mol Biol 33:257–266
CAS PubMed Google Scholar
Miura A, Yonebayashi S, Watanabe K, Toyama T, Shimada H, Kakutani T (2001) Mobilization of transposons by a mutation abolishing full DNA methylation in Arabidopsis. Nature 411:212–214
Article CAS PubMed Google Scholar
Motohashi R, Ohtsubo E, Ohtsubo H (1996) Identification of Tnr3, a Suppressor-Mutator/Enhancer -like transposable element from rice. Mol Gen Genet 250:148–152
Article CAS PubMed Google Scholar
Nacken WKF, Piotrowiak R, Saedler H, Sommer H (1991) The transposable element Tam1 from Antirrhinum majus shows structural homology to the maize transposon En/Spm and has no sequence specificity of insertion. Mol Gen Genet 228:201–208
CAS PubMed Google Scholar
Ohtsubo H, Kumekawa N, Ohtsubo E (1999) RIRE2, a novel gypsy-type retrotransposon from rice. Genes Genet Syst 74:83–91
CAS PubMed Google Scholar
Ozeki Y, Davies E, Takeda J (1997) Somatic variation during long-term subculturing of plant cells caused by insertion of a transposable element in a phenylalanine ammonia-lyase (PAL) gene. Mol Gen Genet 254:407–416
Article CAS PubMed Google Scholar
Panaud O, Vitte C, Hivert J, Muzlak S, Talag J, Brar D, Sarr A (2002) Characterization of transposable elements in the genome of rice ( Oryza sativa L.) using representational difference analysis (RDA). Mol Genet Genomics 268:113–121
Article CAS PubMed Google Scholar
Panavas T, Weir J, Walker EL (1999) The structure and paramutagenicity of the R-marbled haplotype of Zea mays. Genetics 153:979–991
CAS PubMed Google Scholar
Pereira A, Cuyoers H, Gierl A, Schwarz-Sommer Z, Saedler H (1986) Molecular analysis of the En/Spm element system of Zea mays. EMBO J 5:835–841
CAS Google Scholar
Pouteau S, Boccara M, Grandbastien MA (1994) Microbial elicitors of plant defense responses activate transcription of a retrotransposon. Plant J 5:535–542
Article CAS Google Scholar
Rhodes PR, Vodkin LO (1988) Organization of the Tgm family of transposable elements in soybean. Genetics 120:597–604
CAS PubMed Google Scholar
Snowden KC, Napoli A (1998) PsI: a novel Spm -like transposable element from Petunia hybrida. Plant J 14:43–54
Article CAS PubMed Google Scholar
Song WY, Pi LY, Bureau TE, Ronald PC (1998) Identification and characterization of 14 transposon-like elements in the non-coding regions of the members of the Xa21 family of disease resistance genes in rice. Mol Gen Genet 258:449–456
CAS PubMed Google Scholar
Tarchini R, Biddle P, Wineland R, Tingey S, Rafalski A (2000) The complete sequence of 340 kb of DNA around the rice Adh1-Adh2 region reveals interrupted colinearity with maize chromosome 4. Plant Cell 12:381–391
CAS PubMed Google Scholar
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG (1997) The ClustalX Windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Research 24:4876–4882
Article Google Scholar
Turcotte K, Srinivasan S, Bureau T (2001) Survey of transposable elements from rice genomic sequences. Plant J 25:169–179
CAS PubMed Google Scholar
Vernhettes S, Grandbastien MA, Casacuberta JM (1997) In vivo characterization of transcriptional regulatory sequences involved in the defence-associated expression of the tobacco retrotransposon Tnt1. Plant Mol Biol 35:673–679
Article CAS PubMed Google Scholar
Wessler SR (1996) Plant retrotransposons: turned on by stress. Curr Biol 6:959–961
CAS PubMed Google Scholar
Witte CP, Le QH, Bureau T, Kumar A (2001) Terminal-repeat retrotransposons in miniature (TRIM) are involved in restructuring plant genomes. Proc Natl Acad Sci USA 98:13778–13783
CAS PubMed Google Scholar
Yu J, et al (2002) A draft sequence of the rice genome ( Oryza sativa L. ssp. indica). Science 296:79–92
Google Scholar

Download references

Acknowledgements

We thank Pamela Ronald for providing the rice BACs, and Susan R. Wessler, Thomas Bureau and Antoni Rafalski for useful comments on this work. This work was funded by NSFC grants (30125030, 90208010), a CAS grant (KSCX2-SW-301-02) and a MOST of China grant (2001AA222321) to Z.H.

Author information

Authors and Affiliations

SHARF and National Laboratory of Plant Molecular Genetics, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fenglin Road, 200032, Shanghai, China
G.-D. Wang, G. Wu, Q. Li & Z.-H. He
Biotechnology Institute, Zhejiang University, 310029, Hangzhou, China
P.-F. Tian, D.-B. Li & Z.-H. He
Department of Biology, East China Normal University, 200062, Shanghai, China
G.-D. Wang
Department of Horticulture, University of Wisconsin, Madison, WI 53706, USA
Z.-K. Cheng & J.-M. Jiang

Authors

G.-D. Wang
View author publications
You can also search for this author in PubMed Google Scholar
P.-F. Tian
View author publications
You can also search for this author in PubMed Google Scholar
Z.-K. Cheng
View author publications
You can also search for this author in PubMed Google Scholar
G. Wu
View author publications
You can also search for this author in PubMed Google Scholar
J.-M. Jiang
View author publications
You can also search for this author in PubMed Google Scholar
D.-B. Li
View author publications
You can also search for this author in PubMed Google Scholar
Q. Li
View author publications
You can also search for this author in PubMed Google Scholar
Z.-H. He
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Z.-H. He.

Additional information

Communicated by M.-A. Grandbastien

The first two authors contributed equally to this work

Electronic Supplementary Material

(PDF 65 KB)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, GD., Tian, PF., Cheng, ZK. et al. Genomic characterization of Rim2 / Hipa elements reveals a CACTA-like transposon superfamily with unique features in the rice genome. Mol Genet Genomics 270, 234–242 (2003). https://doi.org/10.1007/s00438-003-0918-z

Download citation

Received: 20 February 2003
Accepted: 12 August 2003
Published: 26 September 2003
Issue Date: November 2003
DOI: https://doi.org/10.1007/s00438-003-0918-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Genomic characterization of Rim2 / Hipa elements reveals a CACTA-like transposon superfamily with unique features in the rice genome

Abstract

Similar content being viewed by others

Transposable Element Dynamics in Rice and Its Wild Relatives

Analysis of Tc1-Mariner elements in Sclerotinia sclerotiorum suggests recent activity and flexible transposases

Genome-wide comparison of Asian and African rice reveals high recent activity of DNA transposons

Introduction