Keywords

1.1 Introduction

Food supply is a requirement for food security, and is an important factor to reduce hunger. The agriculture industry has shown significant improvements over the past 50 years with respect to production and productivity enhancement, dominated by the cereals which form the major staples across countries. The twentieth century has seen tremendous increase in food production with the introduction of high yielding crop varieties, especially since the first ‘green revolution’ thus saving millions of people from starvation (Rajaram 2005). This has been achieved mainly because of development and adoption of high yielding cultivars, increased area under irrigation, and widespread use of fertilisers. However, the present crop yield is not sufficient enough to feed the 9 billion of global population by 2050 (Ray et al. 2013). In addition, global climate change has further worsened this scenario with altered rainfall patterns, extreme weather events, and changing patterns of pathogens and pests in terms of severity and distribution (Abberton et al. 2016). With population rising gradually and natural capital shrinking, the production of food must be improved that can be accomplished with the use of modern analytical methods and technology to increase the quality and quantitative improvement of crop plants (Agrawal et al. 2013; Nelson et al. 2014; Haddeland et al. 2014).

Cultural change relies on current or created genetic variation, and the genetic benefit rate in breeding systems will increase either by expanding the quantity of variation available for selection or by speeding up the selection process so that varieties can be developed more quickly. During the last few decades, the development and use of molecular markers in crop genetics led to the identification and mapping of genes and QTLs controlling key traits. The marker system initially started with the use of restriction fragment length polymorphisms (RFLPs) followed by RAPDs, SSRs, AFLPs and culminated with the SNP markers based on genome sequencing (Tanksley et al. 1989; Varshney et al. 2009). The SSR markers have been used extensively for linkage map construction, tagging of new genes/QTLs due to their abundance in the crop genomes as well as because of their high polymorphism. (Somers et al. 2004; McCouch et al. 2002; Varshney et al. 2007; Smith et al. 1997; Sharma et al. 2013). Use of SSR markers has few limitations like SSRs are not uniformly distributed in genomes, gel based assay and cannot be integrated across platforms, which limit its potential to be exploited in plant breeding programmes.

With the development of low-cost sequencing technologies and genotypic platforms, there is an exponential increase of plant sequences in databases and enhanced our understanding of molecular and physiological roles of genes. Plant genomics includes both structural and functional genomics, which covers both basic and applied aspects. Given the rapid evolution of novel technologies, especially the advent of bioinformatics, the understanding from gene to genome level has increased. The new developments in the ‘omics’ technologies have brought revolution in plant science research due to the next-generation sequencing (NGS) 454, Solexa Illumina, Applied Biosystems (ABI) SOLiD, and high-throughput marker genotyping technologies. Consequently, the availability of complete genome sequences for different crop plants holds great promise for research targeted towards crop improvement.

Many different functional genomics approaches have been used to identify genes that linked to the key biological processes. The crop traits are generally complex quantitative traits, which are controlled by multiple genes involving complex genic interactions (Core et al. 2008). Omics is a collaborative, broad-based field that focuses on the study of bio informative interactions obtained from genome profiling, transcriptomes, proteomes, metabolomes, and a variety of other related omes. The three main omic technologies for the analyses of genomics, proteomics, and metabolomics are the genetic, proteome and metabolism analyses of organisms’ cells and tissues (Varshney et al. 2013). In this chapter we will discuss these technologies in detail and how these can be integrated in breeding programmes for better understanding of the functioning of key genes.

1.2 Genome and Genomics

The term genome refers to the entire genome compliment contained in a haploid chromosome set of any organism and the term was coined by German botanist Hans Winkler in 1920 at the University of Hamburg (Vukmirovic and Tilghman 2000). Genomics is the study of complete DNA sequence of organisms. It has been proven as one of the most transformative impacts on biological studies. Genomics study involves recombinant DNA technology, DNA sequencing, assembly, and annotation of genome to decipher the structure, function, and evolution of genes.

The understanding of genome sequences of organisms is crucial for unravelling the roles and networks of genes, for defining evolutionary relationships and processes, and to discover previously unaware control pathways that coordinate gene operation. Genomics has become one of the most transformative forces in biological sciences, the study of the whole organism’s DNA sequence. Genomics is the genetic characterisation and cloning technique of entire genomes to explain gene structure, function, and evolution. The genetic material in eukaryotic organism is deoxyribosenucleic acid (DNA), which is a double stranded molecule composed of nucleotide bases, viz., adenine (A), guanine (G), cytosine (C), and thymine (T). The sequence of nucleotides in a gene on a DNA strand determines the sequence of amino acids that make up a protein.

For protein synthesis, the DNA in a gene is first transcribed to messenger RNA, followed by its translation into a sequence of amino acids through ribosomes in the cytoplasm of the cell. The proteins and products are fundamentally responsible for all cellular mechanisms. Finding out the pattern of arrangement of nucleotide bases in the entire genome is called genome sequencing. Time line for key discoveries in molecular biology has been depicted in Table 1.1 that led to the enormous data generation and formed a basis for better understanding of the key functions of the genomes. In 1972, gene sequence for bacteriophage MS2 coat protein was first determined by Walter Fiers and his team at the Laboratory of Molecular Biology of the University of Ghent, Belgium. The first DNA-based genome of bacteriophage Φ-X174; (5368 bp) was sequenced by Frederick Sanger (Sanger 1977). Since then genomes are being sequenced at a rapid pace ranging from viruses, human, bacteria, fungus, and many crop plants. Scientists, through a variety of functional genomics approaches, are characterising the genes that control key processes. The crop genome sequences are valuable for the discovery and insight into genetic diversity by resequencing of various genomes. The positional cloning and seed breeding promote sequence dependent markers linked to the rare elite alleles.

Table 1.1 Timeline showing key discoveries in DNA Sequencing

Affymetrix (Santa Clara, CA, USA; www.affimetrix.com) and the Illumina (San Diego, CA, USA; www.illumina.com) BeadArrayTM technology are commonly used for microarrays from GeneChipTM (GeneChipTM) and chip related technologies. Some recently built commercial platforms such as Affymetrix ® EurekaTM, and Illumina’s Infinium also rely on SNP markers of high density. Furthermore, plant breeding and agricultural biotechnology have been revolutionised by the micro-array-based characterisation of plant genomes.

Ganal et al. (2011) reported a uniform large-scale SNP genotype array of over 8,00,000 SNPs, which were spread evenly throughout genome of maize. In addition, competitive PCR (KASP), TaqMan real-time PCR, and STARP experiments use low-cost PCRs to detect SNPs and provide versatility with regard to the number of samples to be analysed (He et al. 2014; Semagn et al. 2014; Long et al. 2017). Diagnostic markers may be used without further confirmation, developed on target gene sequences, to improve application of marker-assisted selection (MAS) in breeding. The production of diagnostic markers is rapidly assisted by genome sequencing/resequencing. Such markers allow plant breeders to precisely produce new crops with targeted characteristics in future to satisfy different food requirements.

Most significant agronomic and economic characteristics in culture write traits in cultural breeding methods are several QTL regulated quantitative characteristics. Therefore, it is of considerable significance for marker-assisted breeding to explore QTL or even underlying causal genes/alleles. Linkage mapping is a standard approach for the genetic dissection of the quantitative trait loci genetic foundation (Emebiri et al. 2017; Liu et al. 2017; Zhang et al. 2017). To date, this approach has detected a significant number of QTLs. Moreover, the meta-QTL study was developed by combining QTL published to classify reliable QTL for crop enhancement with the achievement of biological informatics and large genetic knowledge (Van and McHale 2017). The extensive mapping of genomes in large populations makes it easier to map and clone target genes in crops.

Due to the available high-density SNP markers, the GWAS (Genome-Wide Association Studies) analysis, which includes natural populations, opened a path. The GWAS analysis will solve a range of traditional mapping shortcomings and deliver a powerful alternate technique for the dissection of complex features. By bringing together high-performance phenotypic and genotypic results, GWAS provides inspections of maize’s complex characteristic genetic architecture, particularly in view of the rapid loss of corn imbalance (Yan et al. 2011).

Genome selection (GS) estimates the genomic estimated breeding values (GEBVs) of lines by a study of traits and high-density marker values in an artificially generated whole-genome population (Meuwissen et al. 2001; Crossa et al. 2017). GS has shown as important breeding method to develop dynamic characteristics easily. Closer relationships, including polygenic features, between the genomic estimate and true breeding values have been found (Jia and Jannink 2012). Even if also expensive, the number of QTLs that regulate a trait has proven superior to the recurring collection of markings for changes in complex characteristics in crops, so it can efficiently prevent problems.

1.3 Transcriptomics

Transcriptome is the study of total RNA, which gets edited and becomes mRNA, and finally translated into proteins. It involve the analysis of expression profiling of mRNAs in a given cell population.

The transcriptome is complex, since it is mainly a representation of the genes actively expressed at any given time under different circumstances. It assesses the changes in gene expression pattern due to inner and outer factors such as biotic and abiotic stress. It is an important method for understanding responses of biological system under different circumstances. Transcriptome research offers a crucial forum for the review of the connexions between genotype and phenotype, which provides a deeper understanding of the fundamental pathways and mechanisms involved in the regulation of the cell fate (Ruan et al. 2004).

Transcriptomic approaches such as next-generation sequencing (NGS) facilitate the comprehension of functional components of genome. From past two decades, a number of agricultural researchers have made use of advances NGS techniques that enabling them to study high-resolution linkage between gene variants and traits of interest resulting in an upsurge in the scope of transcriptomic studies, not only in number but also in the range of agricultural systems that are being studied (Imadi et al. 2015).

Microarrays, serial analysis of gene expression (SAGE), massive parallel signature sequence (MPSS), and next-generation sequencing platforms (NGSPs) are typical technologies applied for genome wide or high-throughput genetic expression analysis (Costa et al. 2010; Harbers et al. 2005; Mardis et al. 2008). High-performance, quantitative PCR is used to confirm the expression of a broad variety of genes that are elucidated by tissue omics and choice care (Czechowski et al. 2004).

1.3.1 Differential Display

Differential Display technology is a PCR based technique for analysing gene expression using an Oligo-dT (3′) and an arbitrary 5′-end oligonucleotide primers (Liang and Pardee 1992). In order to achieve good visualisation, all RNA fingerprint variants need a low annealing temperature during PCR amplification. Therefore, not only the initial concentration of the cDNA depends on the quantity of the actual amplification components, but also the consistency of a precise match between primer and prototype (McClelland et al. 1995).

Throughout the course of PCR amplification, plentiful cDNAs with bad matches to the primers used would possibly outperform rare species with perfect matching. Despite this drawback, the classification of genes expressed distinctly by plants is recorded in many ways. This include the cloning of the last (1-aminocyclopropane-1-carboxylate-oxidase) enzyme involved in the synthesis of ethylene, a hormone which is involved in tomato ripening (Barry et al. 1996); and senescence, particular genes involved in the synthesis of flavonoids (Saito et al. 1999; Yamazaki et al. 1999); genes expressed differentially during carpel growth (Yung et al. 1999) or floral transformation (Yu and Goh 2000); module such as plant/environmental interaction based nitrate conveyors (Filleur, and Daniel-Vedele 1999); light photoreceptor-regulated genes, phytochrome (Kuno et al. 2000), and abiotic stress-response proteins (Brosche and Strid 1999; Baldi et al. 1999; Kim et al. 2000). This method is also used to differentiate within a species between individuals (Lapopin et al. 1999; Ni et al. 2000).

1.3.2 cDNA-AFLP

This fingerprinting approach depends on the sub-set of DNA molecules from a diverse pool being selectively amplified. The creation of an amplified polymorphic fragment length (AFLP) gives a consistent means with which two similar individuals can distinguish the few differences (Vos et al. 1995). The method is based on highly stringent conditions that are supported by the insertion of dual strand adapters at the ends of limit fragments that act as primary sites during amplification. Selective fragment enhancement is obtained by adding one or two foundations to the PCR primers and is only successfully improved by a fragment flanked by the limiting site, minimising the amount of visualised band.

More than 260 studies presented the advantages of this technique for exposing DNA fingerprinting. As cDNA was used to amplify transcripts in plants due to the benefit of this approach in producing a good individual polymorphism. As a tool for producing differently expressed products, the use of any pair of four and six simple restricting enzymes was suggested and successfully used as a combination of Pst I/Mse I and Asen I/Taq I (Bachem et al. 1996; Money et al. 1996). This technique seems more effective than that to recognise the individual RNA fingerprint and discern it; it is possible to quickly clone and subsequently characterise a certain band, particularly if silver stain is used to establish the fingerprint pattern. There have been accounts of definitions of many ESTs (Suarez et al. 2000; Durrant et al. 2000; Bachem et al. 2000).

1.3.3 DNA Microarray

A high efficiency, most frequently used technique for gene expression analysis, and a central factor in today‘s practical genomics research (Aharoni and Vorst 2001) is the DNA microarray (also referred to as a DNA chip or microarray). In short, this procedure involves the extraction, first of all by means of a laser-capture micro-dissection (LCM) technology of individual cells from two or more biological samples, and then the extraction from cells captured of complete RNA. The copies of each RNA that has been expressed are multiplied. Thus generated RNAs (and hybridised into a DNA chip under machine control be radioactively labelled or labelled with fluorescent colours) are engineered to visualise which genes are transcribed into the RNA in the cell when the sample is taken.

For any colouring position of the chip, scanners are used to read signals and fluorescence measurements are made. For data retrieval and interpretation (http://www.parisdevelopment.com), advanced applications and data collection systems are then used. A variety of plant procedures including the circulation clock, plant protection, environmentally stressful responses, fruit maturing, phytochrome signalling, seed growth, or nitrate assimilation have been investigated using microarray technology (Aharoni and Vorst 2001).

1.3.4 Serial Analysis of Gene Expression (SAGE)

The key difference between DNA hybridisation and SAGE approaches is that the latter does not need previous sequence knowledge to be studied, as SAGE is a gene expression profiling technique focused on sequencing (Velculescu et al. 2000). SAGE can be used to achieve full transcriptional profiles of expressed genes, although unknown, for species with poorly defined and expressed sequences. A new modification of SAGE known as Long SAGE makes it possible to use the derived transcriptome in the annotation of genetic expression in the genome (Saha et al. 2002). SAGE is a genuinely global and neutral technique for gene expression. The SAGE procedure uses several enzyme steps, amplifying and cleaning PCR and cloning. The SAGE protocol begins with mRNA cleaning connected with solid-phase magnetic oligo(dT) beads. The cDNA is synthesised directly on the oligo(dT) bead and then digested with the anchoring enzyme NlaIII (AE) to reveal the 3′ most restriction site on the oligo(dT) bead. The 4-bp site-detection enzyme NlaIII is expected to take place per 256 bp and thus to be present on most mRNA organisms in most SAGE experiments. However, a second SAGE library can be used for the identification of transcripts without a NlaIII site with a separate anchoring enzyme and also for the reconfirmation of a transcript identity in both sites. This could decrease the work relevant to research substantially, but the marginal importance of this method remains to be seen. Next, the sample is also divided into two separate tubes and attached to two separate connectors, A and B. The BsmFI, a type IIS restrictive enzyme that reduces 10 bp 3′ from the position of anchoring enzyme recognition, is recognised by both linked instruments. A special oligonucleotide known as the SAGE tag, hence known as the Tagging Enzyme (TE) is produced by BsmFI. The SAGE tags released from the beads of oligo (dT) are then removed and binding to each other for the purpose of making the ditag was amplified by the help of an electronic sequencer (Patino et al. 2002) and released from the ligators. The gel is purified, serially ligated, clone and sequenced.

1.4 Proteomics

Proteomics includes the recognition and characterisation of the full collection of genome proteins (Wilkins et al. 1996) sufficient for the preservation of the structure and maintenance of important regulatory functions (Whitelegge 2002). Proteins are huge amino acid molecules, each with a special genetically-defined series that specify the basic shape and role of the proteins in various ways, to form tens of thousands of proteins. Proteomics analyses the sequence of amino acids and various post-translation modifications to determine their relative abundance (Brygos and Joyards 2004). A cell, tissue, or organism's protein is the complete complement of proteins expressed by its genome at any given time (Renaut et al. 2006).

It includes systematic study of proteins expressed by a genome, from their amino acid sequence to relative quantities assessment, post-translocation transition state and the interplay with other proteins or molecules of various forms to protein function and structure classification (Brygoo and Joyard 2004; Rhee et al. 2006). It starts with functionally modified proteins and the corresponding gene sequences, as opposed to genomics.

Proteomics is simply a method for plant functional genomics. The proteomics research leads to understanding diverse biological mechanisms and cellular response to environmental stress. It has emerged as an effective method to understand the role of proteins that establish homeostasis in cells, participate in cell signals, and are needed to maintain the structure. Protein-level studies are thus important to demonstrate molecular mechanisms that underlie plant growth, production, and environmental interactions (Chen and Harmon 2006). These are particularly compatible with crops because they can provide insights, not only in terms of nutritional worth, but also in terms of yield and the impact of adverse conditions on these factors (Solekdeh and Komatsu 2007).

Many emerging proteome approaches are available, each seeking to increase isolation, resolution, and automation in conjunction with various experimental objectives and with the chemical and physical properties of the target proteins. The significant technology used during research is the identifying and characterisation of 2-D polyacrylamide gel electrophoresis of separated proteins (Klose 1975; Fenn et al. 1989; Kara et al. 1989) and Mass Spectroscopy (Fenn et al. 1989; Kara et al. 1989) for the identification and characterisation of separated proteins. 2-DE is based on isoelectric concentration, where the proteins are segregated in polyacrylamide gels pH gradient (first dimension) and in the SDS (sulphate sodium dodecyl) PAGE according to their pI (second dimension) (Klose 1975; Klose and Kobalz 1995) and are separated according to their molecular weight.

The visualisation is accomplished with the use of staining methods of isolated protein spots. Electron-spray-ionisation (ESI) is carried out to transform peptides into ions by transmitting them through high-voltage columns. Proteins within the points of interest are first detected by digesting them into peptides usually with trypsin and then analysed by the mass spectrometry (MS). Flight Time (TOF) is an operation that analyses the mass of peptide ions in mass spectrometry. The MALDI TOF (Matrix-assisted laser desorption and ionisation time-of-flight) technique is the most widely used MS technique. These methods are used to trace peptides to classify protein and its complex interaction (Kersten et al. 2002; Henzel et al. 1993).

An alternative approach for analysis of protein products directly by MS, without isolation of gel, has been developed that enables the automatic analyses of peptide mixtures that are formed from complex protein samples, to be carried out using a capillary, high-performance liquid chromatography pair (HPLC), MS or MS/MS (Appella et al. 1995; Washburn et al. 2001, in French). In addition, the ground breaking reagent known as isotope encoded affinity tag (ICAT) in the LC-MS/MS method is made possible for quantitative proteomics. Yeast two-hybrid (Y2H) assay helps in detection of the protein–protein interactions (Fields and Song 1989). The large-scale protein-interaction map was developed using Y2H systems, which were used in the study of signalling complexes (Chen and Harmon 2006). These maps are also available. This genetic technique enables the speedup detection and separation of the relationship between in vivo protein–protein interactions of the related nucleic acid sequences. In holding the intracellular balance, the relationship between protein and protein is important (Ozbabacan et al. 2011). The study of proteomics is therefore important for understanding the dynamics of complex biological processes. Proteomic analytics are becoming a good method for tracking growth and for obtaining insight into the molecular workings of plants. Attempts were made to improve the photosynthesis and abiotic stress resistance in crops. C4 plants were found to possess two forms of chloroplasts, making them more energy-efficient. A comparative proteomic analysis of C3 and C4 plant chloroplast was carried out to determine the proteins that cause more effective light fastening (Zhao et al. 2013).

In Arabidopsis the application of proteome analysis resulted in better understanding of the complex cellular events while analysing the role of gibberellic acid (GA) during the initial stages of seed germination and the effect of scarification on seed germination. During the production of food, after proteome review, a number of previously unknown, novel enzyme encoding genes were found in maize. The genetic diversity at the level of expressed proteins has been extensively tested with proteomics (Canovas et al. 2004). Genetically heterogeneity at the level of the expressed proteins has been commonly applied to proteomics (Canovas et al. 2004). The male sterility proteins fundamentally required for the selection of hybrids have been identified by Wang et al. (2013). The conditions of heat stress influence crop plant growth and yield. The organ-specific proteomic trial was conducted in various crop plants (Hossain and Komatsu 2013) to determine proteins directly acquired during abiotic stress. The proteomic information has been extended to the development of desired transgenic plants (Gong and Wong 2013).

1.5 Metabolomics

The quality and derivative products of crop plants depend directly on their metabolites. The taste, fragrance, colour and texture of crops, their storage properties, and their field efficiency are determined by metabolite concentration (Memelink 2005). A plant’s metabolite material is its metabolome. It is a technology for comprehensive analysis which identifies and quantifies all the metabolites of an organism at a particular time (Fiehn 2002). Building on their existence at the developmental stage, metabolites are categorised as primary and secondary.

The metabolome is very diverse: lipid soluble chemical usually present in membranes, acidic and simple ions, acidic and basic chemicals for aqueous cell elements, stable oxidisation systems and structures (Maloney 2004). This metabolome is very diverse. The plant kingdom contains an estimated 100,000–200,000 metabolites (Oksman-Caldentey and Inze 2004). A broad view of the biochemical status, which can then be used to track and determine gene expression (Fiehn et al. 2000), is provided by quantitative and qualitative measurements of this vast number of cellular metabolites. In several plant resistance and stress reactions the metabolites are stated to be working (Bino et al. 2004).

The organism’s biochemical reaction to conditional disruption is characterised by the impact it has on differential aggregation of each metabolite (Raamsdonk et al. 2001). As metabolites reflect catabolic and anabolic behaviour by proteins at a given time (Maloney 2004), macro-molecular processes such as feedback inhibition and signalising molecules (Dixon et al. 2006) are increasingly understood as modulating (metabolite). It originated as a practical genome approach to help us understand the diverse molecular dynamics in biological processes (Hall et al. 2002).

In contrast with genes and proteins, metabolites are much more variable in the order for atoms and subgroups. The invention of new methods such as mass spectrometry (MS), nuclear resonance magnetism (NMR), and four-transform infrared spectroscopy (FTIR) makes the study of metabolites much simpler. The study of metabolomics relies on the sample being extracted at a certain point in time (Hall 2006). The techniques of metabolomics give us an insight into the diverse metabolism and diversity of the organic reactions involved in plant growth, differentiation, stress tolerance, and pathogens defence mechanisms. The performance of the metabolome analysis depends on a few important factors, such as biological material/sample production and the processing of samples/metabolite detection (Hall 2006).

Bino et al. (2004) proposed a metabolomics research (MIAMET) protocol that includes experimental design; sampling; planning, extraction, and deriving of metabolites; design and design of metabolite profiles; and calculation of metabolites and parameters of metabolites. Because of complicated metabolic data for complex analytics, various analytical methods have been developed. These methods include target analysis, profiling of metabolites, metabolomics, and metabolism. Metabolite target analysis requires a variety of approaches for the preparation and analysis of samples for one or a few compounds from diverse mixtures. Samples based on genotype, phenotype, or biologic significance are tested by metabolic fingerprinting on a variety of metabolites (Shanks 2005).

Metabolomics in the strict sense is the calculation of all metabolites in a given organism. It is not yet theoretically feasible, and would possibly require platform of complementary technologies, since no single methodology is systematic, selective, and efficient enough to quantify them all (Weckwerth 2003). Technology for the study of target compounds and metabolic profiling is such as gas chromatography (GC), high-performance liquid chromatography (HPLC), and NMR. These methods focus on chromatographic isolation, often paired with sophisticated analytical calibrations. Samples are studied using NMR, direct injection mass spectrometry (MS), or FTIR spectroscopy for metabolic fingerprinting as crude extracts without separation. Often paired with multivariate analyses, these fingerprinting methods get the best out of the results.

In order to produce metabolic profile for study of different stress, economically significant characteristics, and heredity results, studies in metabolomics were carried out in the agricultural sector. Seasonal variations, geographical area, and natural variation have all been successfully used to determine the effect of metabolic profiles. The metabolic profiles of transgenic plants have been studied in order to characterize them. Metabolomics allows researchers to classify and distinguish genotypes and phenotypes based on metabolic levels in a non-biased way. This information will be important for tracking crop quality characteristics (Hall et al. 2005). The detection and enhancement of trait production in farm and biological refining products (Reid et al. 2004) will identify possible biological markers for commodity contaminations and adulteration (Dixon et al. 2006).

Plant properties can be enhanced in different ways, for example, through raising metabolic fluxes by metabolic engineering to useful biochemical routes (e.g. increasing food’s nutritional value, reducing pesticidal or fertiliser requirements, etc.) or through utilising the required routes for pharmaceutical development in plants (Giddings et al. 2000). In the same way, the incorporation of a foreign group of enzymes that contribute to the creation of desired finished products from other or more upstream precursors can lead to metabolistic shortcuts, and the synthesis of new metabolites may also generate foreign enzymes (Bino et al. 2004; Memelink 2005). Metabolic fluid analyses helped to define the similar catalytic properties of orthologous enzymes. Metabolistic experiments to classify the growth profile, production, and chemical taxis research were also conducted. Biochemical network construction has been carried out by creating relative metabolite profiles (Mendes 2002). The incorporation of metabolome and transcriptome evidence indicates that regulatory networks are being recognised and genetic material is correlated with phenotypic characteristics (Urano et al. 2009).

1.6 Bioinformatics

Over the past 15 years, bioinformatics has steadily developed and has become a modern major field in biology. A vast range of genetically and genome dependent results have been created using next-generation sequence technology to produce massive genetic and genome data, too detailed to be processed by the human brain. Bioinformatics includes one discipline: genetics, computer science, and IT (Rhee et al. 2006; Vassilev et al. 2005). It uses specialised tools and computers to search and interpret datasets from gene sequences and other sources. The National Science Foundation (NSF) began a project following the genome sequence of Arabidopsis thaliana to determine the function of 25,000 genes of Arabidopsis (Ruperao et al. 2014). Rice was the first genome-sequenced crop (International Rice Genome Sequencing Project 2005), and subsequently a variety of large crop genomes were sequenced. A vast number of data were published from all these sequencing initiatives. A series of bioinformatics instruments have been developed to organise and interpret these data that have significantly contributed to the drawing of significant biological conclusions, gene function predictions, etc. In addition, it allows researchers to define quantitative trait loci characteristics (QTLs) in the production of unorthodox mapping communities and online marker tools.

For the DNA sequencing research, transcriptomic, proteomics and metabolomics, multiple bioinformatics instruments and databases were developed and different researchers have given a detailed list of the same for separate purposes (Bino et al. 2004; Varshney et al. 2005; Rhee et al. 2006). These databases formed the foundations for generating theories, developing tests, and inferring knowledge of a single organism. Furthermore, various species have used datasets and omic tools to evaluate the ‘omic’ properties of species which allow for further analysis of preserved genes and evolutionary relationships. The role of bioinformatics in gluing fundamental science to applied research will continue to be important, and biotechnology is crucial in solving the pressing issues of society such as sustainable energy production, reduction of world malnutrition, hunger and protection of the environment (Rhee et al. 2006).

1.7 Conclusion

The genomics, proteomics, and metabolomics technology platform is extremely successful. They significantly increase the number of proteins/genes which can be concurrently identified and are able to connect complex mixtures in gene/protein expression profiles with complex results. Constitute infrastructure for the implementation of genomics information and techniques (and other ‘omics’) was considerably advanced to allow alterations in crop plant characteristics to boost real and potential production, increase resource use efficiency, and improve health in the crop system. The need for developing new crop varieties with higher yield, heat resistance, and lower use of pesticides is to meet the demands of the growing population, improvements in climate trends, and environmental stress.

High-dimensional biological data have been produced by advanced technologies, analysed using statistical methods to detect interrelationships within omics studies. In order to grow new resistant varieties, the study of metabolic responses of cultivars against different biotic and abiotic stresses will be helpful. However, over time, stage, and environmental factors the transcriptome, proteome, and metabolomic data have become extremely variable and thus the processing and interpretation of samples needs to be conducted with great caution. The understanding of omics also enhanced the resolution of biological experiments, which led to analyses of metabolic pathways and biological system research. Disciplines including structural genomics, transcriptomics, proteomics, and metabolomics are also required for the production of strategy for problem-oriented and process-oriented processes in plant physiology, bio-methods, and plant breeding.