Introduction

Tea [Camellia sinensis (L.) O. Kuntze] is a perennial tree and source of the popular beverage produced from processed young leaves of the plant (Mukhopadhyay et al. 2016). Cultivated tea plants generally belong to two major varieties, namely, Camellia sinensis (China type tea) and Camellia assamica (Assam type tea). The Assam type tea, one of the two cultivated tea plants, is processed from the indigenous Camellia sinensis var. assamica (Masters) (Meegahakumbura et al. 2018). The Assam tea is a distinctive black tea known for its malty sweetness and earthy flavor with several reported potential health benefits (Baruah 2017).

Due to the monocropping nature of cultivation covering large areas, the tea plantations are vulnerable to attacks from a large number of pests and diseases. The recurrent episodes of pest attacks necessitate their management through the application of agrochemicals, many of which find their way into tea brew (Zhang et al. 2017). Tea is a dynamic “bioactive” beverage perceived as health value and a majority of consumers prefer organically grown tea over conventionally cultivated or processed teas (de Godoy et al. 2013). Organic tea is usually considered as a value-added product and the recent trend in the global market shows that such teas command a premium of 30–40% over conventionally grown teas (Singh et al. 2016a). In addition, organically grown tea has longer shelf-life and better keeping quality than conventionally grown teas (Yugandhar et al. 2018). Exclusion of chemical inputs, including fungicides, herbicides, and pesticides, is an important aspect of organic tea cultivation which inevitably exposes the plantation to various soil- and air-borne tea diseases, such as blister blight, black rot (leaf diseases), dieback or seed decay (stem diseases), charcoal stump rot, and brown rot (root diseases) (Doanh et al. 2018). Climate change-induced erratic precipitation patterns coupled with nutrient leaching are other contributing factors leading to significant losses in crop productivity and quality of processed tea (Jayasinghe and Kumar 2020).

A comprehensive knowledge of microbial community structure associated with the Camellia sinensis var. assamica root system, including the root endosphere and the rhizosphere regions, was expected not only to provide a better understanding of their influence on the plants’ health and productivity but also to help in harnessing beneficial microorganisms with plant growth promoting traits (Fernández-González et al. 2019). Plant-associated beneficial microbes can help implant ecologically compatible and sustainable organic tea cultivation practices for improving productivity and stimulating production of enhanced bioactive compounds (Hu et al. 2006; Nath et al. 2013). Among the plethora of beneficial microorganisms, the endosphere and rhizosphere components (particularly, fungi, and bacteria) draw attention due to their plant growth promoting traits, such as secretion of indole acetic acid, gibberellin, cytokinin, siderophores, phosphate-solubilizing enzymes, and 1-aminocyclopropane-1-carboxylic acid (ACC) deaminase (Xie et al. 2020). The ability of the plant microbiome to colonize and endure plant endosphere aids in the plant’s ability to withstand biotic and abiotic stress conditions and enhance the fitness of the host plant (Meena et al. 2017; Hassani et al. 2018; Xie et al. 2020). We have previously reported culture-dependent studies that revealed the microbial diversity and their functional aspects prevalent in tea garden soils of Assam (Nath et al. 2013, 2015; Goswami et al. 2017). However, our current understanding of the Camellia sinensis var. assamica-associated microbiota still remains sketchy. Therefore, the present high-throughput sequencing-based study was undertaken to unravel the microbial community structure of belowground rhizosphere and endosphere, and their relevant functional attributes were associated with an organically grown Camellia population of Assam, India. The following objectives were pursued: (a) to elucidate the overall rhizosphere as well as belowground endosphere microbial community structure of Camellia sinensis var. assamica; (b) to unveil distinctive functional signatures between the rhizosphere and endosphere communities.

Methods and materials

Sites description and sample collection

This study was conducted at the organically cultivated area of the Experimental Garden for Plantation Crops (26° 72′ 12.83″ N, 94° 19′ 72.30″ E, elevation 88 m), Assam Agricultural University, Jorhat district of Assam, India (sample collection date: 25-07-2020). The total area of the organically maintained plot is 0.26 ha cultivating an indigenous large leaf cultivar, Betjan, planted in the period of 1951–1957. The tea plantation management system has a biofertilization regime of vermicompost and cow dung manure applied at the rate of 2 tons/ha/year and 8 tons/ha/year, respectively. Additionally, the mature tea bushes receive a foliar spray of Azolla water and vermiwash (10% v/v) six times a month. The tea garden has a history of high infestations of tea mosquito bugs (mainly, Helopeltis theivora), and low occurrences of red spider mite (Oligonychus coffeae) and bunch caterpillar (Andraca Bipunctata). To counteract such pest infestations, an aqueous mixture of leaf extracts from Azadirachta indica (neem) and Pteridium aquilinum L. (vernacular name: Bih-dhekia) at 5% (w/v) concentration is applied three times a week during incidences. A pruning cycle of 4 years along with intermittent skiffing (light trimming) is maintained to achieve a uniform plant height.

For the collection of the rhizosphere and endosphere samples, three disease-free healthy mature tea plants were selected, and their lateral roots were partially uprooted. These lateral roots were vigorously shaken to eliminate loose soils and tertiary fine roots. Finally the lateral roots were washed with 100 ml of 10 mM NaCl solution. Wash solutions from the three lateral root samples were mixed proportionately and collected into 50-ml tubes to constitute the rhizosphere sample. For root endosphere sampling, samples of lateral roots were thoroughly rinsed with distilled water and then immediately transported to the laboratory for surface sterilization. In the laboratory, the surface sterilization of the roots started with washing the samples with double-distilled H2O for three times followed by serially immersing in H2O2 (3%) and absolute ethanol for 30 s each, NaOCl 6.15% (in the presence of 2–3 drops of Tween 20 per 100 ml) for 3 min followed by a wiping with H2O2 (3%) again for 30 s. The roots were given a final washing with sterile dd. H2O and stored at − 80 °C for future use. Characterization of different soil edaphic factors was performed at the Laboratory of Microbial Biotechnology, Assam Agricultural University. The characteristics of the Camellia trees (diameter, height, etc.) and soil parameters (texture, percent C, percent N, etc.) are presented in Table 1. All the observations and experiments related the characteristics of the Camellia trees (diameter, height, etc.) and soil parameters (texture, percent C, percent N, etc.) were performed in triplicate and the variations (analysis of variance) in the data were evaluated using the XLSTAT package program.

Table 1 Tree and soil characteristics from the sampling site

Metagenomic DNA extraction

In case of the rhizosphere soil sample, 1 g of the soil slurry centrifuged at 2500g for 2 min was considered for DNA extraction following the manufacturer’s protocol mentioned in the Environmental gDNA isolation kit (Xcelgen, India). While dealing with DNA extraction from the endosphere sample, the same gDNA isolation kit was used with powdered surface-sterilized roots (using liquid nitrogen). The lysis buffer was amended with 50 µl of 10% cetyltrimethylammonium bromide to enhance the lysis process. The extracted gDNA samples were quantified using Qubit® 2 fluorometer as per manufacturer’s protocol.

Library preparation and sequencing

The quality pair-end sequencing library was prepared using Illumina TruSeq DNA Library Preparation Kit. For library preparation, 1.0 µg gDNA was fragmented followed by paired-end adapter ligation. The ligated product was purified using 1X ampure beads. The purified ligated product was subjected to size selection on E-gel at ~ 400–650 bp. The size-selected product was PCR amplified as described in the kit protocol. The amplified library was analyzed in Bioanalyzer 2100 (Agilent Technologies) using High Sensitivity (HS) DNA chip as per manufacturer's instructions. After obtaining the Qubit concentration for the library and the mean peak size from Bioanalyzer profile, 10 pM of the library was loaded into the reagent cartridge of Miseq V2 Reagent Kit 500 cycles PE. The library fragments were captured on a lawn of oligos (complementary to the adapters linked to the fragments) coated on the surface of flow cells. Later, each immobilized fragment was amplified through bridge amplification to develop into a clonal cluster. These clonal clusters were finally subjected to a reversible terminator-based sequencing method. Paired-end sequencing approach allowed the bidirectional (forward and reverse) sequencing of template fragments.

Metagenome assembly and ORF prediction

Prior to the assembly, the quality of the sequencing data, overall GC content, repeat abundance, or the proportion of duplicated reads were assessed using FastQC that provides summary statistics. The high-quality metagenome reads were then screened for contamination with the human genome using the CLC Genomics Workbench 6.0. After removal of host plant-related sequences and adapter sequences from both the endosphere and rhizosphere samples, the resulting clean reads were assembled using the same workbench with default parameters (minimum contig length: 200, mismatch cost: 2, insertion cost: 3, deletion cost: 3, length fraction: 0.5, similarity fraction: 0.8). Total rRNAs were predicted for mining residue samples by aligning the assembled scaffolds against in-house RNA databases using BlastN. The Open Reading Frames were predicted for the mining residue sample using Prodigal (v2.6.1) (Hyatt et al. 2010). The predicted ORFs along with their rRNAs containing scaffolds were then taken for Taxonomic and Functional Annotation. The 16S regions were identified using webMGA (Wu et al. 2011).

Selection and assembly of sequencing reads

The raw reads were first refined by trimming adapter as well as low-quality score sequences (minimum-quality score: 20; minimum read length: 50 bp). The SOAPdenovo was used to assemble the quality reads with Kmer range of 39–47. Scaffolds with minimum 500 bp length were then broken into continuous contigs which were later used for functional and taxonomic predictions. For quality assurance of SOAPdenovo-based assembly, all the sequencing reads were remapped to the contigs using the Burrows-Wheeler Aligner (http://bio-bwa.sourceforge.net/) BWA tool (Li and Durbin 2009).

Gene prediction, taxonomy, and functional assignment

The publically available MetaGene Annotator was used to mine the contigs for ORFs and annotations (Ma et al. 2013). Only ORFs with a minimum 100 bp of length were considered for translations through NCBI ORF finder. The translated sequences were tested for BLASTP (BLAST Version 2.2.28+) against the NCBI-nr database (E value threshold: 1e−5). For taxonomic classification, the ORFs containing the small subunit (SSU) rRNA gene tags were aligned with the SILVA database and Ribosomal database project (E value threshold: 1e−5, alignment length: 100 bp, and cutoff: 80%) (Cole et al. 2014). The difference in microbial abundance between the two metagenomes was examined by the two-tailed Wilcoxon rank-sum test, performed using the base R package “stats” (v. 3.4.1) wilcox.test function. Alpha (within sample) diversity indices (Species richness, Shannon diversity, Simpson’s index, Inverse Simpson and Shannon evenness index) were estimated for each of the samples using the R package “vegan.”

For functional analysis, the eggNOG (evolutionary genealogy of genes: Non-supervised Orthologous Groups) database version 4.5 was used to align the ORFs and produce corresponding Cluster of Orthologous groups of protein (COG) (Huerta-Cepas et al. 2016). This annotation was accomplished through BLASTP (hosted under the latest BLAST + package version 2.2.28+) against the NCBI-nr database with optimized E value threshold of 1e−5. The web server-based KOBAS version 3.0 (KEGG Orthology Based Annotation System) was used for KEGG GENES (Kyoto Encyclopedia of Genes and Genomes database)-based functional annotation (default E value threshold: 1e−5) (Xie et al. 2011). The KEGG database was used to investigate a few microbial genes linked to plant growth-promoting traits (PGPTs), viz., production of siderophore, 1-aminocyclopropane-1-carboxylic acid (ACC) deaminase, indole acetic acid (IAA), mineral (phosphate) solubilization, N2 fixation, abiotic stress alleviation (betaine utilization), soil health improvement (lipopolysaccharide production), as well as plant disease suppression (chitinase and endoglucanase). The frequency of the genes was calculated as follows: frequency = (number of sequence reads for a particular gene/total length of the gene) × (100,000/Total number of sequence reads). Finally, the number of the particular gene was normalized by the gene length (kbp) (Tsurumaru et al. 2015).

Results

The average diameter of the three disease-free trees at their breast height was 73 ± 4.58 cm, while crown height and crown diameter were 86.67 ± 1.15 cm and 417 ± 11.36 cm, respectively. Rhizosphere soil samples from these trees were analyzed for various physical and biochemical parameters, including total carbon and nitrogen, soil organic matter, and available ammonium and total nitrate ions. All of the soil characteristics did not vary among the three samples (Table 1). The soil samples were moderately acidic in nature with pH values less than 4.50 (range 4.3–4.5). The average soil organic matter (SOM) ranged from 4.85 to 5.46%, while TC and TN ranged from 1.66 to 1.92 and 0.183–0.223, respectively.

Overview of metagenomic sequencing

The assembled tea root endosphere metagenome contained 24,231 contigs (total 7,771,089 base pairs with an average length of 321 bps), while the tea rhizosphere soil metagenome contained 261,965 contigs (total number of bases: 230,537,174 bp, average length: 846 bps) which corresponded to ~ 2.8–3.1 GB data. The metagenomes were deposited in the Sequence Read Archive hosted at the NCBI (accession number: PRJNA698063). For this study, the metagenomes were further trimmed and filtered by using WebMGA, a web server in order to remove the adaptors and low-quality reads. After this quality control step, the root endosphere metagenome finally contained 21,721 and the rhizosphere metagenome contained 241,280 high-quality reads, respectively. The summary of metagenome sequencing and annotations, and the rarefaction curves are presented in the additional files (Table S1, Figs. S1 and S2).

Taxonomic category hits distribution

Abundance and functional groups estimations are critical in metagenomic studies. A two-tailed Wilcoxon rank-sum test was performed between the microbial abundances of rhizosphere and root endosphere and it was found that there was a significant difference (p value < 0.00001) between the two metagenomes. The rhizosphere metagenome contained a total of 1592 (bacteria: 1344, fungi: 84, Archaea: 85) microbial species, while the endosphere metagenome harbored 223 species (bacteria: 176, fungi: 26, Archaea: 5). The Shannon diversity index (H) and Simpson's Index of Diversity (1-D) for the rhizosphere metagenome were calculated to be 5.648 and 0.986, respectively; these values were higher than that of the endosphere metagenome (H: 4.053; 1-D: 0.926) which confirmed the presence of higher microbial diversity and abundance in the rhizosphere as compared to the endosphere. A higher Shannon evenness index of the rhizosphere metagenome (0.766) than that of the endosphere one (0.750) implied that the endosphere was over-represented by dominant microbial taxa. Information on various alpha diversity indices is available in the additional file (Table S2).

Class-level taxonomic hits distribution showed that the rhizosphere was highly populated by Proteobacteria (relative abundance 40.98%), Acidobacteria (19.74%), Firmicutes (14.43%), Actinobacteria (10.95%), Bacteroidetes (4.70%), Verrucomicrobia (3.05%), and Cyanobacteria (1.9%), while the root endosphere was inhabited by Proteobacteria (89.17%), Actinobacteria (5.08%), Bacteroidetes (1.52%), Firmicutes, and Synergistetes (both with 0.85%). Order-level taxonomic hits distribution revealed the dominance of Bacillales (12.57%), Rhizobiales (11.03%), Actinomycetales (10.22%), Solibacterales (6.35%), Burkholderiales (7.73%), Enterobacteriales (6.55%), Acidobacteriales (5.37%), and Pseudomonadales (4.14%), in the rhizosphere with approximately 10.25% remained as undefined. Bacterial orders, such as Enterobacteriales (57.36%), dominated the root endosphere microflora followed by Rhizobiales (14.04%), Burkholderiales (5.41%), and Actinomycetales (4.57%). The majority of genus-level hits belonged to Bacillus (10.35%), Candidatus Solibacter (6.36%), Burkholderia (5.19%), Pseudomonas (3.9%), Streptomyces (3.52%), and Bradyrhizobium (2.77%) in the rhizosphere, while in case of the endosphere, most of the hits were occupied by bacterial genera, viz., Serratia (46.64%), Methylobacterium (8.02%), Yersinia (5.97%), Burkholderia (2.05%), etc. The presence of few agronomically important bacterial genera, such as Bradyrhizobium, Rhizobium (0.93% each), Sinorhizobium (0.37%), Azorhizobium, and Flavobacterium (0.17% each), was also detected in the root endosphere (Fig. 1).

Fig. 1
figure 1

Donut chart showing the bacterial abundance at genus level

In case of the fungal population, Sordariomycetes (49.06%), Eurotiomycetes (26.19%), Saccharomycetes (14.76%), Agaricomycetes (12.79%), and Leotiomycetes (2.47%) were the major taxa in the rhizosphere, while the endosphere was dominated by class Eurotiomycetes (20.75%), followed by Sordariomycetes (13.81%), Agaricomycetes, and Leotiomycetes (each with 5.66%). Order-level taxonomic distribution hits demonstrated that fungal orders, such as Eurotiales (endosphere: 9.43%; rhizosphere: 21.15%), Saccharomycetales (endosphere: 1.89%; rhizosphere: 14.76%), Agaricales (endosphere: 5.66%; rhizosphere: 11.57%), Sordariales (endosphere: 15.09%; rhizosphere: 5.40%), and Hypocreales (endosphere: 33.96%; rhizosphere: 5.51%), were prevalent in both the environments. Fungi belonging to the genera, viz., Ustilago (8.36%), Schizosaccharomyces (8.25%), Neosartorya (8.07%), Filobasidiella (6.27%), Aspergillus (5.92%), Gibberella (4.41%), Penicillium (4.26%), Laccaria (3.95%), Coprinopsis (3.8%), Neurospora (3.35%), Yarrowia (2.86%), and Schizophyllum (2.81%), were abundant in the rhizosphere, and genera, such as Gibberella (18.18%), Phaeosphaeria (14.55%), Podospora, Hypocrea (both 10.91%), Aspergillus (5.45%), Coprinopsis, Neurospora, Coccidioides, and Botryotinia (each with 3.64% relative abundance), were prevalent in the root endosphere. Fungal populations, viz., Chaetomium, Coniothyrium, Cladosporium, Gliocladium, Trichoderma, Penicillium, Fusarium, Verticillium, Lecanicillium, Phoma, Alternaria, Aspergillus, Metarhizium, Phomopsis, Curvularia, Colletotrichum, Xylaria, Nigrospora, Humicola, Rhizoctonia, and Tilletiopsis, were prevalent in both the environment (i.e., rhizosphere and endosphere) (Fig. 2). The fungi could be broadly categorized into three groups: (A) ubiquitous soil inhabitant with no- or latent pathogenicity; (B) phytopathogens responsible for various plant diseases; and (C) ecologically important fungi with potential implications as plant growth promoters. Species of the genera Tilletiopsis, Humicola, Chaetomium, Alternaria, Curvularia, and Xylaria are ubiquitous in various environmental matrices, such as dead or living plant materials and decomposing soils. The second group consisted of members belonging to genera Nigrospora, Cladosporium, Fusarium, Lecanicillium, Phoma, Phomopsis, Colletotrichum, and Rhizoctonia which are essentially phytopathogens responsible for plant diseases, such as leaf spots, scab, rot, and wilt. The third group consisting of ecologically important fungi exhibits diverse functionality, such as biocontrol activities (mycoparasitic activity: Coniothyrium, Gliocladium, Trichoderma, and Penicillium; entomopathogenic activity: Beauveria, Metarhizium, and Lecanicillium) and plant growth-promoting activities (Aspergillus and Trichoderma).

Fig. 2
figure 2

Donut chart showing the abundance of the fungal microflora at genus level

COG annotation and analysis

The functional analysis through COG analysis showed that the associated genes could be deciphered into four categories, viz., (1) metabolism, (2) cellular processes and signaling, (3) information storage and processing, and (4) poorly characterized. The category for “metabolism” was most prevalent in both the metagenomes (rhizosphere: 43.5%; endosphere 33.9%) followed by “Cellular processing” (rhizosphere: 21.3%; endosphere: 19.93%). In case of “Information storage and processing,” the endosphere metagenome harbored more genes (26.5%) than its rhizosphere counterpart (18.0%). A significant number of genes (rhizosphere: 17.23%; endosphere: 19.6%) could not be assigned to any function and remained in the “poorly characterized” category. This COG-based functional analysis indicated that the endosphere environment promoted information storage and processing of the microorganisms which is integral to any microbial colonization (War Nongkhlaw and Joshi 2017). Functional categories, viz., “general function prediction only,” “amino acid transport and metabolism,” “energy production and conversion,” “carbohydrate transport and metabolism,” “replication, recombination and repair,” “translation,” “ribosomal structure and biogenesis,” “cell wall/membrane/envelope biogenesis,” and “inorganic ion transport and metabolism,” dominated the COG functional annotations in both the metagenomes. The relative abundance of reads assigned to cytoskeleton, cell motility, RNA processing and modification, chromatin structure and dynamics, etc., were low (≤ 1.0%) in both the environments (Fig. 3).

Fig. 3
figure 3

Distribution of predicted reads in the COG classification

KEGG function annotation and analysis

KEGG-based functional annotations showed that the rhizosphere metagenome contained more genes, pathways, and enzymes as compared to the endosphere metagenome. We defined dominant pathways as those with a relative abundance greater than 0.5% in any of the metagenomes and as such 55 various pathways were mined with this criterion (Table S3). The most prominent pathways in this group linked to cysteine and methionine metabolism, RNA degradation, plant–microbe interaction, RNA transport, RNA polymerase, cell cycle, oxidative phosphorylation, spliceosome, peroxisome, pyrimidine metabolism, protein processing in endoplasmic reticulum, pyruvate metabolism, phenylpropanoid biosynthesis, etc. The pathway ko02010 assigned to “ABC transporter” was most abundant in the two environments (endosphere: 4.8%; rhizosphere: 6.4%) followed by the pathway “Two-component system-ko02020” (3.7% in both). Among the 55 dominant pathways, the relative abundance of 22 pathways was more in the endosphere metagenome than in the rhizosphere one. Few pathways associated with plant–microbe interaction, such as bacterial chemotaxis (ko02030), vitamin B6 metabolism (ko00750), folate biosynthesis (ko00790), steroid biosynthesis (ko00100), flagellar assembly (ko02040), calcium signaling pathway (ko04020), and glutathione metabolism (ko00480), were significantly more abundant in the endosphere than in the rhizosphere (Fig. 4). A total of 233 functional pathways determined in the three metagenomes are presented in the additional file.

Fig. 4
figure 4

Distribution of predicted reads in the KEGG classification

The KEGG-based classification data were further mined for the frequency detection of few functional genes liked to plant growth promoting traits (PGPTs) (Table 2). Among the categories of PGPTs, Cytochrome o ubiquinol oxidase subunit I (cyoB) and Glycine betaine/proline transport system (proV) linked to betaine utilization were most frequently detected (12.46 and 11.37 per 105 reads, respectively), followed by ACC deaminase, Betaine-aldehyde dehydrogenase (betB, gbsA), Glycine betaine/proline transport system (proW), D-glycero-D-manno-heptose 1,7-bisphosphate phosphatase (gmhB), and non-ribosomal peptide synthetase (dhbF) (11.14, 10.42, 9.20, 8.96, and 7.45 per 105 reads, respectively) (Table 2). Apart from this, other important PGPTs frequently detected in the rhizosphere were lipopolysaccharide production, N2 fixation, plant disease suppression, phosphate solubilization, ACC deaminase production and IAA production (20.52, 13.28, 12.00, 11.97, 11.14, and 9.15 per 105 reads, respectively). In the case of the endosphere metagenome, the most frequently assigned PGPTs were Chitinase, β-1,3-glucanase, ACC deaminase, and YUCCA family monooxygenase (3.14, 2.64, 2.15, and 2.14 per 105 reads, respectively). Meanwhile, siderophore production-related sequences (vibF, dhbF, entB, entA, entF) were detected at a low frequency (0.36, 1.14, 0.14, 0.11, 1.17, respectively, per 105 reads). Overall, the endosphere metagenome harbored genes for all the PGPTs except for lipopolysaccharide production.

Table 2 Detection frequency of functional genes liked to plant growth promoting traits (PGPTs) prevalent in the belowground tea microbial communities

Discussion

Based on alpha diversity indices, it was evident that the microbial richness and diversity was greater in the rhizosphere than the endosphere which could definitely be attributed to the rhizosphere being the primary and dynamic interface of plant–microbe interactions (Mohanram and Kumar 2019). These interacting microbes produce an array of hormones and metabolites which are known to signal and promote plant growth, antagonize pathogens, and increase availability of essential micro- and macronutrients for plant uptake (Singh et al. 2019). In this study, we observed a high abundance of Proteobacteria both in the rhizosphere (relative abundance 40.98%) as well as the endosphere (89.17%). Our previous study using culture-dependent approach had revealed the dominance of Firmicutes (more than 70%) in the tea garden bulk soils (Goswami et al. 2017) which indicated substantial differences in the two interfaces, i.e., bulk–rhizosphere and rhizosphere–endosphere. Environmental conditions, such as pH, water, oxygen, and nutrient content, function as drivers for such differential microbial pattern between these dynamic interfaces (Vieira et al. 2020).

The tea rhizosphere harbored a diverse array of the ubiquitous soil microbiota, while the endosphere harbored functionally and strategically important subset of the rhizosphere microbiota, viz., symbiotic Bradyrhizobium, Mesorhizobium, and Rhizobium, and non-symbiotic rhizobacteria, viz., Azotobacter, Azospirillum, Azomonas, Bacillus, Pseudomonas, etc. Microorganisms, such as rhizobia, mycorrhizal fungi, actinomycetes, and diazotrophic, are known to be aggressive colonizers which mobilize and allocate nutrients through symbiotic and non-symbiotic associations with plant roots (Barea et al. 2005). Healthy stature of the tea bushes indicated the rhizosphere–microbial taxa playing a key role in disease suppression, nutrient cycling, and healthy status of the soil. Most of the endosphere bacteria mined through this study have been reported to carry promising plant growth promoting traits, such as solubilization of minerals (zinc and phosphate), production of siderophore and indole acetic acid, and exhibition of biocontrol activities (Aeron et al. 2011; Fernández-González et al. 2017; Verma et al. 2019). These functionally important traits have been proven to increase tea productivity right from tea seed germination to seedling growth in nursery and subsequent field establishment (Nath et al. 2013; Bhattacharyya and Sarmah 2018). The potentials of free-living N2-fixing bacteria, such as Azotobacter (aerobic) and Azospirillum (anaerobic, microaerobic), that play an important role in nitrogen geocycle have been realized in the form of organic amendments in tea soils (Gebrewold 2018). Both Azotobacter and Azospirillum are suitable candidates for biofertilizer development targeted for better tea productivity, while Azotobacter is suited for aerated, light-textured soil and Azospirillum is indicated for waterlogged hard-textured soils (Aquilanti et al. 2004). Microbial pesticides intended against a range of phytopathogens, including tea pests, are an important component of integrated pest and disease management systems in tea cultivation (Idris et al. 2020). Several microbial species revealed in this metagenomics-based study, viz., Bacillus spp., Pseudomonas spp., Aspergillus sp., Beauveria bassiana, Gliocladium sp., Metarhizium anisopliae, Paecilomyces sp., Penicillium sp., Trichoderma spp., Lecanicillium, and Streptomyces sp., have been reported to show biopesticidal activities and as such their introduction as biocontrol agents has the potential to elevate tea productivity (Krauss et al. 2004; Scheepmaker and Butt 2010; Sandhu et al. 2012). Coniothyrium minitans (a coelomycete) was detected in both the rhizosphere and endosphere regions. Coniothyrium is a potential biocontrol candidate with mycoparasitic activity against Sclerotinia sclerotiorum (Zhao et al. 2020). Many studies have reported that C. minitans can inhibit mycelial growth and sclerotia formation of Sclerotinia ascospores, thereby suppressing leaf blight infection (Smolińska and Kowalska 2018). Gliocladium virens is another important mycoparasitic fungus which exhibits broad-spectrum biocontrol activity (van Tilburg and Thomas 1993). This common fungus exhibited strong hyperparasitism against plant–pathogens, viz., Rhizoctonia solani, Sclerotium rolfsii, and Pythium aphanidermatum (Sreenivasaprasad and Manibhushanrao 1990). The presence of enzymes, viz., 1-glucanase, N-acetylglucosaminidase, lipase, and proteinase, makes it a successful wide-spectrum biocontrol agent (van Tilburg and Thomas 1993). In terms of plant growth-promoting endofungal agents, it has been generally observed that fungus belonging to Trichoderma genus interact with plants by inducing the plant’s defense system and promotes the plant growth (Zhang et al. 2016). The endosphere ability of Trichoderma has been reported in many plants, like maize, cucumber, cotton, and tomato (Yuan et al. 2017; Contreras-Cornejo et al. 2018; Harman et al. 2019). Trichoderma colonizes in the intracellular spaces, as well as the spaces between the plasma membrane and the cell wall of the root tissue (Ramírez-Valdespino et al. 2019). Once in the endosphere, the fungus can promote seed germination, development of root, shoot, and biomass, and increase tiller number and overall crop yield (Hajieghrari and Mohammadi 2016). In addition, Trichoderma can exhibit antagonistic activities against its adversaries through competition for space and nutrients, antibiosis and mycoparasitism (Chen et al. 2016).

The relative abundance of microbial metabolic genes, pathways, and orthologies were more abundant in rhizosphere metagenome compared to the endosphere counterpart. An increase in the relative abundance of few agriculturally important traits (genes and metabolic pathways) in the endosphere suggested that the tea plant system can selectively permit the penetration and establishment of an elite group of symbionts which help the plant in overcoming various stress conditions. For instance, KEGG-based analogies showed that abundance of several previously reported genes related to vitamin B6 and B9 metabolism (ko00750, ko00790), steroid biosynthesis (ko00100), glutathione metabolism (ko00480), etc., was relatively higher in the endosphere than in the rhizosphere. These pathways and metabolisms improve plant biomass production and empower the plant defense mechanism against various biotic and abiotic stress conditions (Zhu et al. 1999; del Barrio-Duque et al. 2019). In the presence of phytopathogens in soils, the plant has to modulate or reprogram its defense signaling processes and activates suitable yet costly pathogenesis-related pathways. To subsidize this metabolic cost, the resilient nature of the plant defense mechanism partly engineers (active recruitment) the beneficial rhizosphere soil microbiome and depends on it in a semi-intrinsic manner (Singh et al. 2016b). Therefore, it was assumed that the presence of potential phytopathogenic agents, such as Gibberella, Aspergillus, and Ustilago, induced the plant to actively recruit a few beneficial endospheres as a step of disease priming. Metabolic pathway genes linked to “ABC transporter” was most abundant in the two environments (endosphere: 4.8%; rhizosphere: 6.4%) followed by “Two-component system” (3.7% in both). In bacteria, the ATP binding cassette (ABC) superfamily comprises mainly ATP-dependent pump proteins dedicated toward active mobilization of metabolites (Lewis et al. 2012). These superfamily proteins mobilize hormones, lipids, peptides, and primary as well as secondary metabolites, etc., and therefore, have immense significance in plant growth and adaptations (Vasiliou et al. 2009). A few members also have intrinsic channel properties with functions related to drug resistance, heavy metal detoxification, etc. (Lewis et al. 2012). In addition, these proteins also have defense-related implications as evidenced by the presence of an early defense gene in rice which is induced in the presence of salicylic acid and phytopathogenic Magnaporthe grisea (Lee et al. 2005). The two-component system consists of a first component protein with transmembrane domain and a second component protein with phosphotransfer histidine kinase property (Zschiedrich et al. 2016). These systems play an important role in plant–microbe interaction by recognizing the active metabolites present in the root exudates and thereby facilitate chemotaxic movement of the microbial population toward the root surface. The GacS/GacA present in pseudomonads and enteric bacteria exemplifies such systems that are responsible for microbial root colonization (Heeb and Haas 2001). Further, we analyzed potential functional genes linked with plant growth promotion activities (ACC deaminase production, IAA production, N2 fixation, phosphate solubilization, siderophore production), abiotic stress alleviation (betaine utilization), plant defense mechanism (lipopolysaccharide production), and plant disease suppression (chitinase and β-1,3-glucanase) prevalent in the belowground tea microbial community. The rhizosphere microbiota exhibited a significant prevalence of PGPTs related to betaine utilization; this important trait boosts up osmolyte concentration and antioxidant activity in plants under various abiotic stresses (Malekzadeh 2015). The second most frequently detected PGPT in the rhizosphere was related to siderophore production. Siderophores sequester iron through chelation, thereby playing a crucial role in increasing bioavailability and internalization of the ions to the plant (Hider and Kong 2010). The rhizosphere metagenome harbored important PGPTs related to lipopolysaccharide production (toward soil health improvement), N2 fixation, plant disease suppression, phosphate solubilization, ACC deaminase production, and IAA production. Among these many traits, production of lipopolysaccharide, a ubiquitous component of Gram-negative bacteria holds special ground for the induction of systemic resistance in plants against various phytopathogens (Erbs and Newman 2003). However, the endosphere metagenome did not harbor any detectable frequency of lipopolysaccharide genes probably due to high abundance of Proteobacteria (89.17%) and Actinobacteria (5.08%) and low levels of Gram-negative Bacteroidetes (1.52%) in the metagenome. Nonetheless, the endosphere microbiota contained some important PGPTs related to plant disease suppression (Chitinase, β-1,3-glucanase), N2 fixation, betaine utilization, IAA production, siderophore production, phosphate, and ACC deaminase production (5.78, 5.15, 4.81, 3.65, 2.92, 2.68, 2.15 per 105 reads, respectively). Tea plants require a large amount of N fertilizer (90 kg N/10 acres/year) for theanine accumulation in the first flush shoots; therefore, the amount of applied biofertilizer may have modulated the diversity and functionality of the tea microbiome especially in the endosphere.

Conclusion

Our study using culture-independent approach provided a comparative insight into the microbial diversity and function associated with an organically grown tea ecosystem of Assam. EggNOG-based COG analysis of the metagenomes revealed that microbial metabolism was most prevalent in both the rhizosphere and root endosphere environments. The root endosphere metagenome carried a higher relative abundance of genes linked to “information storage and processing” which is integral to any kind of plant–microbe interaction (mutualism or antagonism). KOBAS-mediated KEGG Orthology analysis of the metagenomes showed that the “ABC transporter pathway” genes were highly abundant followed by the “Two-component system pathway”-related genes in both the environments. As compared to the rhizosphere, the root endosphere contained higher abundances of PMI-important genes linked to bacterial chemotaxis, vitamin B6 and folate metabolism, glutathione metabolism, etc. The Shannon diversity indices confirmed higher bacterial diversity in the rhizosphere compared to the endosphere. Bacterial species, such as Serratia, Methylobacterium, Yersinia, and Burkholderia, populated the root endosphere and also had minor representations of few agriculturally important symbiotic and non-symbiotic taxa, such as Bradyrhizobium, Mesorhizobium, Azotobacter, Azospirillum, Azomonas, Bacillus, and Pseudomonas. Overall, a vast array of agriculturally important microbes linked to phytohormone secretion, nitrogen fixation, phosphate and potash solubilization, biotic–abiotic stress tolerance and biocontrol (entomopathogenesis) activity were detected. The exposition of microbial diversity, function, and indicators revealed in this study was supported by our earlier reports of the PGP activities of some of these isolates. This study will pave a way for including natural enemies prevalent in tea plantations for protecting the plants from various pests and diseases. Further studies on the cross-talks of a tripartite interaction encompassing tea plants, bioinoculum, and natural enemies can be considered for the promotion of a sustainable and healthy tea ecosystem.