Introduction

MITEs are short (~ 100 to 800 bp) non-autonomous truncated versions of autonomous transposable elements belonging to the class-II type of TEs. MITEs possess a pair of conserved terminal inverted repeats (TIRs ≥ 10 bp) that act as receptor sites for the transposase and a pair of short target site duplications (TSDs 2~10 bp) that are located towards the outer end of each TIRs [1]. MITEs initially discovered by Bureau and Wessler [2], during 1992 in maize belonged to Tourist family and another MITE family called Stowaway was identified in Sorghum during 1994. MITEs were later found in several species of plants and animals. Based on sequence similarity, MITEs are primarily classified into families, further grouped into superfamilies with almost identical TSD and TIR sequences shared between MITEs and autonomous transposons capable of encoding transposase enzyme [1, 3]. Through genome sequence analysis in Arabidopsis, the first evidence for such association was reported between a plant MITE family Emigrant with Lemi1 (larger emigrant) transposon which potentially encodes a pogo like transposase was connected to Tc1/mariner transposon superfamily. Apart from being transposed by autonomous DNA transposons belonging to same family, MITEs can also be cross mobilized by DNA transposons belonging distant families [4,5,6]. There are currently seven superfamilies recognized in plant genomes: Tcl/mariner, Pif/Harbinger, hAT, Mutator, CACTA, P-element, and Novosib [1, 7, 8] (Table 1). After discovering MITEs in maize and sorghum genomes, they were reported and characterized in more than 41 plant species including arabidopsis, rice, wheat and groundnut [2, 9,10,11].

Table 1 Details of MITE superfamilies with their initially associated MITE families

During most of host genome evolution period, MITEs remained quiescent. MITEs get activated under natural circumstances when host genome is exposed to severe environmental conditions during domestication. Supporting the theory of "genome shock" proposed by McClintock, that cryptic transposons activate in host genomes upon exposure to stress, as a response of genome to mitigate the danger [12]. Hence, MITEs can amplify and produce more than 1000 copies in plant genomes exposed to extreme genome shocks, as evident in one of the rice cultivars Gimbozu adapted to waterlogging stress through preferential mPing MITEs amplification [13]. Present-day human efforts in plant breeding can also mobilize MITEs through wide hybridization, plant micropropagation techniques such as tissue culture and anther culture and plant chemical mutagen like ethyl methane sulphonate (EMS) and physical mutagen like gamma-ray irradiation [4, 14,15,16,17]. All the above breeding activities seeming to be posing the host plant to experience genomic shock resulting in the MITEs amplification. These examples also indicate that the MITEs in plants are already helping us in generating variability and evolution of host genomes with time and conscious utilization of these elements can further serve us better in crop improvement programmes.

MITEs are exceptionally high in copy number per genome compared to their related autonomous class II transposons and class I transposons [6, 13, 18, 19]. The success of MITEs in reaching such a high copy number in the host genome is because of the generation of stable or neutral mutations due to specifically inserting into the non-translating regions (promoter, intron and untranslated regions), avoiding the exonic region of genes [20]. The short size of MITEs and their ability to escape from being silenced by host surveillance is also considered to be an important reason for a successful transposon burst [6]. Upon their mobilisation by trans-acting TEs, MITEs produce allelic diversity at several loci through insertional polymorphism [21, 22]. MITEs distribution and their abundance in plant genomes play an important role in biosynthetic gene cluster formation through MITEs mediated genomic rearrangement by providing homologous sequences that enable illegitimate recombination and gene relocation [2, 13, 23,24,25]. Therefore, MITE insertional polymorphism can be used as a potential molecular marker system in plant genetics and genomics studies [16, 22, 26,27,28,29,30].

Role of MITEs on regulation of plant genes

The role of MITEs in creating trait variations in plants has been shown by the ample number of studies [31] (Table 2). The wrinkled-seed character, one of the seven traits described by the Mendel to establish fundamental laws of genetics was caused by a transposon-like insertion in a gene encoding starch-branching enzyme (SBEI) by a MITE called hAT element [32]. Other trait variations include, change in the flowering time of maize [33, 34], AltSB, a gene variant in sorghum is responsible for aluminium tolerance [35], ahFAD2B allele is associated with MITE insertion resulted in high oleic acid trait in groundnut [10], change of tuber skin colour in potato [36] and petals colour variation in gentian flowers [37]. A MITE insertion into the gene governing the structure of the glume that encloses and determines rice grain shape has led to slender glume mutant character upon gamma-ray irradiation of Gimbozu rice variety seeds [17, 38]. Enhanced expression of the gene TaHSP16.9-3A due to the insertion of a MITE in its 3′ UTR is attributed for heat tolerance in a wheat cultivar (TAM107) [39]. In the case of maize, MITE insertion in the promotor of the NAC gene (ZmNAC111) is associated with seedling drought tolerance [40]. MITE mediated translational repression was reported for an agronomically important gene Ghd2, that govern grain number, plant height and heading date in rice [41].

Table 2 MITEs derived trait variations in plants

MITEs owing to their high copy number and genic preference for insertion [42], have a greater role in the modification of key genes controlling important agricultural traits. The abundance of MITEs in the rice genome and insertional polymorphisms at more than 7000 loci between Oryza sativa subspecies japonica and indica, was attributed for the divergence in gene expression patterns between these subspecies [43]. Recent epigenetic studies in rice reported MITE mediated epigenetic regulation of gene expression is responsible for changes in leaf angle, seed size and subspecies-specific resistance to different pathogens [44, 45]. MITEs have also shown their ability to regulate and control genes involved in abiotic stress responses. These regulatory mechanisms of MITEs will be very useful in understanding and identifying genes involved in abiotic stress response, particularly heat and drought stresses which are essential for the present climate changing situation [20, 39, 40, 45,46,47]. The insertion of TEs into intronic regions of genes can result in mutations that generate new exons. Recently, it is reported that the intronic transposed MITEs in the mulberry genome are frequently associated with alternative splicing due to exonization [48]. The above examples of trait variations associated with MITEs are due to the reprogramming of gene expression mainly by MITE insertional gene inactivation, repression of a nearby gene through methylation, upregulation of the gene by introducing new regulatory information and also involving MITEs mediated miRNA and siRNA pathways. Hence, MITEs can upregulate or downregulate the adjacent genes and the resultant phenotypic change may be associated with domestication and improvement of crop plant species [49].

MITEs based DNA marker systems in plants: principles and methodology

Distinctive characteristics of MITEs such as their abundance, short sequence size (~ 100 to 800 bp), non-autonomous, high frequency of insertion polymorphism and most importantly, preferential insertion of MITEs into the genic region can render them appropriate as an excellent marker system [21, 22, 26,27,28,29,30]. MITE-Display (MD), Inter-MITE polymorphism (IMP) and MITE insertional polymorphism (MIP) are the three prominent MITE based marker systems developed and being used for various molecular plant breeding studies (Table 3). MD is a MITE locus and a restriction site dependent marker system and MIP marker system uses only flanking regions of a MITE for the development of primers to detect presence or absence of a MITE at predetermined genomic region in the host genomes, while TIRs of MITEs are used for designing degenerate primers to amplify the sequence between two MITEs in IMP marker system [50,51,52]. All the three MITE marker systems use MITE locus/loci for detecting sequence variation and generate sequence polymorphism information upon comparing a set of genotypes for that locus.

Table 3 Comparison among prominent MITEs based DNA marker systems

MITE-display (MD)

MD is a restriction site and polymerase chain reaction (PCR) based marker system. It is similar to amplified fragment length polymorphism (AFLP), MD detects and provides precise information on a large number of MITE insertions in a given plant genotype. MITE insertion at a particular locus can be identified by a ligation-mediated PCR that uses part of the MITE sequence as a template and continues to amplify part of flanking sequence up to a specific restriction site. The amplified PCR product can be analysed in a polyacrylamide gel system. If a MITE insertion is correlated with a particular phenotype and such a co-segregation PCR product can be identified in gel, reamplified, cloned, sequenced and used as a probe to further isolate the tagged gene.

As described in rice [50], the important steps involved in MITE-display are restriction digestion (MseI) of genomic DNA and ligation of AFLP adaptors to the compatible ends. The MITE-containing fragments are then amplified using one AFLP primer and another primer complementary to an internal sequence of the Heartbreak (Hbr) MITE element. As in AFLP, several DNA fragments are then analyzed simultaneously in denaturing polyacrylamide gels (Fig. 1).

Fig. 1
figure 1

Steps involved in MITE-Display marker system

Identification of suitable MITEs for MITE display

MITEs showing high sequence homogeneity within a family indicate that these MITEs are still active or recently active. Such activity by MITEs results in significant insertional polymorphism at several loci within the members of a single plant species. High sequence homogeneity also helps in designing the primers with little or no degeneracy, permitting the use of stringent PCR protocols and leading to reproducible results.

Selection of MITE primers Nested PCR can be used with two sets of primers that recognize conserved regions adjacent to the TIRs. The internal primers confirm that the amplified fragments are truly anchored to the targeted MITE family.

Adapters are made by annealing two oligonucleotides i.e., a restriction enzyme specific sequence with complementary sequences. The sequence of the adapter and the adjacent restriction site serve as primer binding sites for subsequent amplification of the restriction fragments.

PCR conditions The touchdown protocol was used for the very first MITE-Display (Hbr-display) as this protocol increases the specificity of amplification when the primers have different melting temperatures (Tm > 5 °C) [50]. However, a general PCR program can be used based on the length and GC content of the primers and the length of the expected PCR product for any given primer pair.

Pre-selective amplification PCRs are done with the following cycling conditions using a primer complementary to the adapters (MseI + 0) and another primer complementary to an Hbr internal element. A general protocol is programmed to run one cycle of pre-denaturation (94 °C for 5 min), 24 cycles of denaturation (94 °C for 30 s), annealing (59 °C for 30 s) and extension (72 °C for 1 min). Followed by, one cycle of final elongation (72 °C for 5 min).

Selective amplification The 20-fold diluted pre selective amplified reaction is used for selective amplification. The internal primer pair comprising of a primer complementary to the adapter with an overhang (MseI + N) and another primer which is complementary to internal Hbr element is labelled with 33P, designed to anneal within the pre selective amplified product. The following cycling parameters are used for selective amplification [50]. The touch down PCR protocol is programmed to run one cycle of pre-denaturation (94 °C for 5 min), 10 cycles of denaturation (94 °C for 30 s), annealing (starting from 70 °C for 30 s with a decrease of 1 °C/cycle) and extension (72 °C for 1 min). Followed by 27 cycles of denaturation (94 °C for 30 s), annealing (61 °C for 30 s) and extension (72 °C for 1 min). One cycle of final elongation (72 °C for 5 min) was included. The resultant final selective amplified reactions are then analysed in denaturing polyacrylamide gels.

Sometimes, MITE display marker analysis based on denaturing polyacrylamide gels may fail to capture the complete information on MITE insertions when the restriction fragments sequences containing insertions are long enough to produce overlapping bands in the gel profile. To prevent such loss of information and to capture every individual MITE insertion within a genome, the PCR amplified products may be subjected for Next Generation Sequencing (NGS). As the NGS approach gives more precise information compared to the analysis made on denaturing polyacrylamide gels [13]. Recently NGS based MD was attempted to develop high-quality MITEs based marker in Brassica rapa. NGS was integrated with transposon display (TD) analysis to uncover the insertion polymorphism and to develop large-scale MIP markers mediated by recent insertion polymorphism of a high-copy MITE family, BraSto-2 (Bs2) members (IP-Bs2 markers) among Brassica accessions. In this method a degenerate primer developed from the conserved region of Bs2 was used along with the MseI adaptor specific primer to find the insertion polymorphism of Bs2 members among Brassica accessions. Pooled, purified and bar-coded PCR products of Brassica accessions containing MITE flanking sequences were subjected for sequencing with Illumina HiSeq2000 paired-end sequencing platform. Further, the sequences were subjected for in silico mapping on reference genome and accession specific MITE insertion polymorphic sites were identified by eliminating common sites, identified manually based on physical position information. Finally, as compared to conventional gel-based TD analysis NGS based MITE display is more advantageous. As it saves time by preventing multiple rounds of selective amplification in order to amplify most of the MITE family like Bs2 which cannot be amplified/visualized by a single gel analysis. It provides more precise information on MITE insertion polymorphism and it also gives an opportunity to develop large number of MIP markers [53].

Inter-MITE polymorphism (IMP)

IMP is a PCR based marker system. It was developed and utilized in barley for genome mapping and DNA fingerprinting purposes. IMP involves amplification of DNA sequence between two adjacent MITEs as these transposable elements are available in high copy number with insertion preference to the genic region [51]. TIRs of MITEs are most conserved hence these sequences are used for designing degenerate primers (Fig. 2.).

Fig. 2
figure 2

Steps involved in Inter-MITE polymorphism marker system

Identification of suitable MITEs for IMP and primer design The TIR sequences of MITEs families can be aligned to generate a consensus sequence for each family to develop degenerate primers. Outwardly directed oligonucleotide primers can be designed according to the consensus sequences of MITE family TIRs. Such primers are expected to amplify sequence between the two adjacent MITEs within amplifiable distances.

PCR conditions: As per Chang et al. [51], PCR- amplifications were carried out in a final volume of 20 μl containing 2.5 mM of MgCl2, 0.4 mM of dNTPs, 1.0 μM (non-labelled) or 0.1 μM (labelled) of primer and 0.5 units of AmpliTaq DNA polymerase (Perkin-Elmer). The pre-amplification and amplification phases were carried out by setting the conditions for one cycle of pre-denaturation (1 min 30 s at 94 °C), 35 cycles of denaturation (30 s at 94 °C), annealing (45 s at 58 °C or 56 °C), and extension (1 min at 72 °C). In two phases, PCR-amplifications were carried out. Pre-amplification was the first phase and it was done with nonlabelled primers for 35 cycles with 100 ng of DNA template. For the second phase of amplification, an aliquot of 2 μl of the preamplification product was used. The first pre-amplification phase done with nonlabeled primers was meant for the amplification of target sites. The product of pre-amplification was used for the second step of amplification with labelled primers in order to improve the visualization.

MITE insertional polymorphism (MIP)

MITEs by the virtue of their transpositions in host genomes preferably insert into genic regions producing allelic variations for the genes across the genome [18, 54]. This genome-wide insertional polymorphism can be of great importance making them suitable for genome-wide studies, genome analysis, mapping and tagging gene of interest in plants. MIP as a marker system has been developed and utilized in several plant species. In this marker system, the PCR primers are designed flanking to each MITE insertion site across the genome (Fig. 3). These DNA markers were developed based on transposon display, MITEs enriched libraries, in silico analysis and whole genome re-sequencing techniques [16, 29, 55,56,57].

Fig. 3
figure 3

Steps involved in MITE insertional polymorphism marker system

The efficiency of the MIP system over MD and IMP marker systems in capturing sequence variation

MD and IMP marker systems are dependent on a restriction site and a MITE insertion locus respectively in the vicinity of the MITE locus for the successful amplification of the PCR product specific to that MITE locus. Such dependency of MD and IMP marker systems may miss some important MITE insertions but MIP system which is solely dependent upon the individual MITE locus can be more efficient in considering and capturing sequence variation in the host genome created by every individual MITE insertion. As evident by the cases of conversion of MD markers into MIP markers because of ease of marker development, handling and co-dominant nature, MIP system proved to be better suited for QTL mapping studies and marker-assisted selection (MAS) over MD and IMP systems [29, 34]. Furthermore, MITE-display and inter-MITE marker systems being DNA profiling strategies can be effectively employed for rapid fingerprinting and assessment of genetic diversity in a set of genotypes. Unlike MIP, the IMP and MD marker systems do not require host genome sequence information to survey MITEs in plant genomes. It is because of the variations in the approach of primers development in these three marker systems. IMP requires only the sequence information of MITEs for the development of primers. Similarly, MD uses sequence information of a MITE and AFLP adapter sequence for primers development. Whereas MIP marker system utilizes the flanking sequences of MITEs for primers development hence this necessitates the sequence information of host plant genome.

Bioinformatic tools and resources for plant MITEs discovery and annotation

With the advent of sequencing technology and advancement in the computational methods, there is a leap in the genomic sequence databases. In order to discover and annotate MITEs in the plant genomes, several MITEs annotation programs and databases have been developed. Once, MITEs are discovered and characterized in a plant genome, the information on TEs can be catalogued into three types of repositories: TE-centric, genome-centric and polymorphism-centric. These databases can be further used to detect TEs in assembled genomes using ab initio methods [58] (Table 4). MIPS repeat element databases (mips-REdat), a plant repeat database in conjunction with catalogue (mips-REcat) resources are widely used for characterizing and comparative transposon annotation. Currently, the public version mipsREdat_v9.0p consists of 42,000 non-redundant transposon sequences from 44 species covering 20 different genera is available [59].

Table 4 Repositories for exploring TE and MITE in plants

RepBase Update is used worldwide as a reference collection for masking and annotation of repetitive DNA in genome sequencing projects. It is the most popular TE-centric repository that contains more than 38,000 consensus sequences of TE family and subfamily collected from about 834 species [60]. Dfam is the similar database as that of RepBase, has been used alongside RepeatMasker, which is a widely used database query tool for TEs annotations in eukaryotic genomes [61, 62].

Genomic-centric databases accommodate all the individual TE instances annotated in the reference genomes. P-MITE is one such repository for plant MITEs contains more than 2 Mio. MITE-related sequences of 3527 MITE families, identified from 41 sequenced plant genomes. P-MITE can be used to study the origin and dynamics of MITEs, MITE-derived small RNAs and roles of MITEs on gene and genome evolution [63].

TE insertions of individuals diverging from the annotated reference genome of a particular species are contained in polymorphism-centric repositories. BrassicaTED (Brassica Transposable Elements Database) caters the information on MITEs insertion polymorphism. It has B. rapa and B. oleracea datasets for 6026 members from 20 MITE families apart from information on other miniature transposable elements such as short interspersed elements (SINEs), terminal repeat retrotransposons in miniature (TRIMs) and large retrotransposon derivatives (LARDs) which also do not have transposase coding capacity. BrassicaTED offers to browse structural and positional characteristics for every MITE family. In addition, it has data on 289 MITE insertion polymorphisms from a survey of seven Brassica relatives. It also includes a tool, K BLAST (Karyotype BLAST), for clear visualization of the locations for each member in the B. rapa and B. oleracea pseudo-genome sequences [64].

MITEs are mainly discovered through homology-based and structure-based approaches in the plant genome. Structure-based MITE discovery utilizes defined structure of known MITEs to find putative MITEs, where as sequence similarities are used to identify MITEs in the given plant genome for detecting MITEs with homology-based MITE discovery (Table 5).

Table 5 Computational tools for TE and MITE discovery and annotation in plants

For unassembled genomes, raw reads can be used for TE detection with de novo annotation approaches. However, de novo annotation approaches can also be used for assembled genomes for discovering novel TE families that were missing in the TE databases. Following are the popular programs intended to annotate MITEs, detectMITE, MITE Digger, MITE Tracker, MITE-Hunter and MiteFinder. These programs work on different algorithms and operating systems to discover MITEs in a given genome sequence. Each program has its own advantage and disadvantages with respect to the sensitivity, speed and extent of filtering false positives [65,66,67,68,69]. MiteFinder was quick, accurate and more memory efficient compared to detectMITE, MITE Digger and MITE-Hunter remaining while analyzing their relative performance to detect MITEs in the rice genome [67]. However, in a similar test, the MITE Tracker, which is one of the latest programs developed to identify MITEs in plant genomes performed significantly better in terms of processing time and false positive scoring and was the only program able to run with the larger genomes like bread wheat genome as input and discovered 6013 MITE families and allowed the first structural exploration of MITEs in the complete bread wheat genome [65].

Programs or computational tools that allow detection and isolation of polymorphic MITEs can be very useful for efficient DNA marker development for their use in plant molecular breeding. NGS TE Mapper, TE-locate, RelocaTE, TEMP, T-Lex2, PoPoolationTE2, McClintock and PTEMD are the tools that allow detection of TE insertion polymorphisms (TIPs) for a given set of plant genotypes [70,71,72,73,74,75,76]. Recently, around 2957 AhMITE1 specific markers from the whole genome re-sequencing data (WGRS) have been developed using 33 diverse peanut genotypes. This high number of marker development has been credited to the PTEMD software that identified 3546 TIPs from a set of 33 peanut genotypes [55].

Applications and utilization of MITE specific markers

Genetic variability within and between species is key for crop improvement. Accessing molecular genetic diversity may help in understanding evolution, geographical distribution, population structure and genetic relationship in a plant species [77, 78]. Furthermore, the knowledge of the genetic relationship between the inbred lines provides information for the selection of parents and to predict heterosis levels [79,80,81]. Different molecular markers have already been employed in the assessment of plant molecular genetic diversity. But the most commonly used molecular markers like simple sequence repeats (SSR) and single nucleotide polymorphism (SNP) may not sometimes completely cover the diversity existing at the DNA level, mainly due to the transposon mediated intraspecific violation of genetic colinearity and gene structural variations existing in plant species like maize [82,83,84]. In this regard, molecular markers based on TEs can be more efficient in capturing molecular genetic diversity as these elements are involved in evolution and diversification of host genome through their activities like genome expansion by amplification bursts, restructuring of chromosomal regions due to transpositions and creating allelic diversity by insertional polymorphism [85,86,87,88,89,90,91,92,93].

MITE based markers especially, MITE-display and inter MITE marker systems have been used for assessing molecular genetic diversity, relationships and structure of the population in maize, rice, barley, sugarcane, wheat, groundnut, Antirrhinum and Agrostis. The level of polymorphism for MITE based markers in these studies was high and the estimated diversity was similar and congruent with known relationships among the genotypes studied, suggesting that MITE specific markers can be useful for plant molecular genetic diversity assessment [11, 34, 94,95,96,97,98,99,100,101]. Hence, these markers were proposed to use along with other molecular markers for genotyping and relationships studies in maize [102]. Recently, Transposon-based genetic diversity assessment was made in wild and cultivated barley using both class I and class II Transposons with three types of marker systems: IMP (Inter-MITE Polymorphism) and IRAP (Inter-Retrotransposon Amplified Polymorphism). The mean polymorphic information content (PIC) and discrimination power (D) values for IMP and IRAP markers was found to be similar in wild and cultivated species of barley [90].

True hybrid identification by MIP marker system

MIP, a PCR based co-dominant marker system has the power to discriminate heterozygote by generating DNA amplicons from both insertion and deletion for a given locus. This ability of hybrid discrimination was used for identifying true hybrids upon crossing two inbreds or diverse parental genotypes. About 15 MITEs based markers discriminated two laboratory inbred lines 165E and Sippe 50 and twelve commercial varieties of snapdragon [101]. However, DcMaster transposon display markers were also used for hybrid seed purity testing in carrot but the heterozygote discriminatory capacity was confined for only a subset of total markers [98]. Hence, co-dominant MIP marker system can be comparatively better true hybrid identification marker system. Although MIP is a co-dominant marker system, sometimes it may fail to identify true hybrids particularly in polyploid plant species like groundnut due to the presence of homeologus genomic regions and in such cases, MIP markers in homeologus genomic regions may be avoided for true hybrid identification [56].

MITE markers for trait mapping

MITE markers have been used for mapping and saturating the previously developed genetic maps in several plant species. Heartbreak (Hbr) MITE-Display was employed in maize to map 100 recombinant inbred lines derived from a cross between the maize inbreds B73 × Mo17. HBr markers when added to the maize RFLP genetic map, the total length was 1092 cM and map length was increased by 150 cM and also reduced the overall distance between markers [26]. Barfly MITE based IMP markers system was used to saturate RFLP map by mapping a total of 88 IMP markers in barley using doubled-haploid segregating population between Hordeum vulgare and Hordeum spontaneum [50].

As MITE-Display and Inter-MITE markers systems being tedious techniques compared to MIP marker systems, hence, some workers converted these markers into simple PCR markers to exploit MIP. MIP marker system can be more efficient because of its simplicity of marker development, ease of use and co-dominant nature, it has been used for mapping and trait association studies in several plant species. In rice, mPing-SCAR markers were developed from MITE-Display data and they were used for QTL mapping using RIL population derived from a japonica intraspecific cross between Nipponbare X Gimbozu EG4. As compared to SSR markers (7.00%) mPing-SCAR markers (82.30%) showed significantly high polymorphism and it was possible to get better genomic coverage [29]. In another study, 33 MIP PCR based markers were developed from MITE-Display generated information. Furthermore, these markers were used for association mapping in a panel of 367 maize lines to understand MITEs contribution to the phenotypic variation resulting in the identification of a marker ZmV1-9, highly significantly associated with the delayed male flowering [34].

A MIP marker system called Arachis hypogaea transposable element (AhTE) markers developed based on one of the MITE families, i.e., AhMITE1 in groundnut genome, it is being used extensively in groundnut evolutionary studies and construction of linkage maps as a sole marker system as well as in conjunction with other markers systems [16, 55, 56, 103,104,105,106,107,108]. The utility of AhTE Markers for genetic and genomic studies in groundnut (Arachis hypogaea L.) was recently analysed by estimating population genetic parameters among mutants and recombinant inbred lines (RILs). Association analysis and trait mapping with AhTE markers in different groundnut populations have identified MITE markers significantly associated with traits like productivity, foliar disease resistance and oil quality. Furthermore, upon validation, these markers may be advocated for utilization in groundnut improvement through MAS [27, 85, 109,110,111].

All the three MITE based marker systems (MD, IMP and MIP) are being used in mapping and marker-trait association studies as a sole marker system or conjunction with other marker systems and these markers not only saturated the maps by adding additional markers to the previously developed maps but also they were able to map and associate new traits with MITE specific markers.

Conclusion

Transposition activities of MITEs in the plant genomes can produce a wide spectrum of variation in plants both at genotypic and phenotypic levels, which can be useful for crop plant adaptation to the environment or any trait variation that improve yield and quality traits of crop species for sustainable production in this climate changing era. As already witnessed by the utilities of well-known MITEs based marker systems in different plant species like mPing-rice, Heartbreaker-maize, Barfly-barley and AhMITE-groundnut and so far, MITEs have been discovered and characterized in about more than 41 plant species. MITEs by their virtue of abundance, genic preference, short size and gene regulation mechanisms make them suitable for using them as a genetic molecular marker system. Hence, MITEs based molecular markers can be of choice for overall crop improvement through molecular plant breeding. The wide range of phenotypic or trait variations generated by MITEs indicates their ability to be an excellent molecular marker system for genetic and molecular studies in plants. The latest trends in molecular plant breeding are the MAS and genomics, which are based on the development of different molecular marker systems and using them in mapping and tagging several important genes in plants [112, 113]. Molecular markers developed based on MITEs are emerging as a potential marker system in plant species, as these markers are user-friendly, informative and highly reproducible compared to other commonly used SSR-based marker systems. The utility of MITE specific markers in genetics and genomic studies have already been shown in major crop plant species like maize, rice, wheat and groundnut [11, 16, 26, 27, 29].

With the availability of genome sequences of new species, we understand that the plant genome is made up of TEs and particularly MITEs that are widely distributed in the genome and also enriched in the genic region with high copy numbers [20, 24, 114]. Available TE databases and bioinformatics resources for plant MITEs annotation are handy in discovery, identification and classification of MITEs and this helps in the development of existing type (MD, IMP and MIP) of MITE based markers in newly genome sequenced plant species or already existing but MITEs unexploited plant species. Further, there is also scope for integrating MITEs with other marker systems/sequences of any genes to evolve new kind of MITE based marker systems, as a couple of examples cited here. MITE/RGA is a modified MITE Display marker system. It uses a combination of primers specific to MITEs and primers based on resistant gene analogs (RGAs). This marker system provided a unique opportunity for mapping and identification of candidate genes conferring resistance to grey leaf spot in maize. Compared to MITE display, this technique was more cost-effective, less time consuming as it does not require restriction digestion of plant genomic DNA and ligation of primer adaptors [115]. Another marker system called MITE based intron length polymorphism, which is also a simple and cost-effective PCR marker system was developed for its utilization in carrot molecular breeding [57]. Recently, the bibliometric analysis of plant molecular markers showed the predominance of SNPs and SSRs marker systems and also identified US, China, India, France, and Germany as the top countries to use DNA markers for molecular studies [116]. Like SSR markers, which are labelled as markers of choice, MITE based MIP marker system is also co-dominant, highly polymorphic and easily visualized in agarose gel electrophoresis compared to SSR [29]. With these added advantages, MIP markers may be integrated with SSRs or they can be used as a sole marker system for the routine needs of plant molecular studies. MITEs in rice can create a significantly very high rate of new insertions i.e., ~ 40 per plant per segregating generation as compared to a relatively low rate of new SNPs formation per generation per plant [13, 117]. Hence, this high rate of insertional polymorphisms created by dynamic MITEs activity resulting in beneficial adaptive trait variations arising due to exposure of plant genome to the present climate-changing situation allows us to utilize them as effective marker systems in plant molecular breeding.