Introduction

Living in an extremely cold and oligotrophic environment, Antarctic microorganisms have formed unique physiological and biochemical properties in the long-term natural selection evolution (Niederberger et al. 2008). Many strains have genetic machinery to degrade multiple compounds as a source of nutrients and can produce low-temperature (cold-active) enzymes, antibacterial and anti-cancer active substances (Zhang et al. 2004), which are valuable in many fields including environmental engineering, agriculture biotechnology, pharmaceutical industry and enzyme industry. Accordingly, we carried out a programme to explore potential sources of psychrotolerant species from Antarctic soil, during which a putatively novel strain (3D7T) of the genus Microvirga was isolated. The genus Microvirga was proposed by Kanso and Patel (2003), with Microvirga subterranea as the type species. It belongs to the family Methylobacteriaceae of the order Rhizobiales. At the time of writing, the genus Microvirga contains 18 species listed on LPSN (List of Prokaryotic Names with Standing in Nomenclature: www.bacterio.net) with validly published names. They are distributed widely in various ecological habitats, such as air (Weon et al. 2010), natural, domestic, and contaminated soils (Dahal et al. 2017; Tapase et al. 2017; Zhang et al. 2009; Zhang et al. 2019a, b), geothermal water (Kanso et al. 2003), Tibet hot spring sediments (Liu et al. 2020), human stool (Caputo et al. 2016), nodules of native legumes and cowpea (Ardley et al. 2012; Radl et al. 2017; Safronova et al. 2017) and roots of rapeseed plants (Jiménez-Gómez et al. 2019). Most members of the genus Microvirga are moderately thermophilic. Some studies reported the genetic potential of strains classified within the Microvirga genus for arsenic oxidation (Tapase et al. 2017) and the production of pigments, amylolytic enzymes (Radl et al. 2017), phosphatases and exopolysaccharides (Jiménez-Gómez et al. 2019). Based on polyphasic taxonomic characterisation, we propose the description of Microvirga antarctica sp. nov., classified as a novel psychrotolerant member of the genus Microvirga with phosphatase and lipase activities. Moreover, an analysis of the sequenced genome of strain M. antarctica 3D7T, showed genes encoding proteins with potential biotechnological or industrial applications.

Materials and methods

Isolation and culture conditions

Strain 3D7T was isolated from a soil sample collected from the surface of Deception Island (62° 55′ 09" S, 60° 34′ 46" W), Antarctica. The collected soil sample (0.9 g) was suspended in 8.1 mL sterile water and stirred for 30 min as a 10–1 dilution solution, then diluted it to 10–2 by gradient. 150 µL of the 10–2 sample dilution was spread on Reasoner’s 2A agar (R2A; AOBOX) medium (pH 7.5). After 7 days of incubation at 15 °C, representative colonies were picked and purified by steaking repeatedly. A pink-coloured isolate, designated strain 3D7T, was picked up and subsequently purified by plate streaking. The purified strain was stored in 20% (v/v) glycerol suspensions at − 80 °C.

DNA amplification and determination of 16S rRNA gene sequence

The genomic DNA of strain 3D7T was extracted using TIANamp Bacteria DNA kits (TianGen) according to the manufacturer’s instructions. The 16S rRNA gene was amplified by PCR with the bacterial universal forward primer 27F and reverse primer 1525R (Li et al. 2006). The products above were purified and sequenced by BGI (The Beijing Genomics Institute). After sequencing, the 16S rRNA gene sequence of strain 3D7T was obtained and similarity searches were performed by using the EzBioCloud server (www.ezbiocloud.net/identify) (Yoon et al. 2017a, b). The phylogenetic tree was constructed according to the neighbour-joining (NJ) algorithm (Saitou et al. 1987) and supported by the minimum-evolution (ME) (Rzhetsky et al. 1992) and maximum-likelihood (ML) algorithms (Felsenstein 1981) in the MEGA X program (Kumar et al. 2018). Kimura’s two-parameter model was used to calculate the evolutionary distances (Kimura 1980). Bootstrap values were determined based on 1000 replications (Felsenstein 1985).

Genome sequencing, assembly and function analysis

The genome of strain 3D7T was sequenced using the Illumina HiSeq systems with paired-end sequencing technology. The sequencing data was filtered to remove the sequences containing the adaptor and the low quality data, and the obtained clean data was used for subsequent analysis. The genome assembly was performed by SOAPdenovo (version 2.04) (Li et al. 2010, 2008). The assembly results were submitted to the NCBI (www.ncbi.nlm.nih.gov). The function of coding genes in the assembled genome were annotated by Gene Ontology (GO) (Ashburner et al. 2000), Clusters of Orthologous Groups (COG) (Galperin et al. 2015) and Kyoto Encyclopedia of Genes and Genomes (KEGG) (Minoru et al. 2016). The carbohydrate-active enzymes (CAZymes) were analyzed using HMMER annotation (Zhang et al. 2018), and the analysis of gene clusters related to secondary metabolites production was performed using antiSMASH 5.0 webserver (Blin et al. 2019).

DNA-DNA hybridization and genome-based phylogenetic analysis

The genomic information of related strains of the same genus or neighbouring genera was obtained from the EzTaxon and NCBI databases. Phylogenetic tree based on the whole genome sequences of strain 3D7T and related species was constructed using the Composition Vector (CV) approach. Average nucleotide identity (ANI) based on the BLAST algorithm (ANIb) and the MUMmer ultra-rapid aligning tool (ANIm), as well as the correlation indexes of tetranucleotide signatures (Tetra) were calculated through the website of JSpeciesWS (http://jspecies.ribohost.com/jspeciesws/) (Richter et al. 2016). The orthoANIu values were estimated using the EzBioCloud web service (www.ezbiocloud.net/tools/ani) as described by Yoon et al. (2017a, b). Digital DNA–DNA hybridization (dDDH) was conducted using the Genome-to-Genome Distance Calculator (GGDC; version 2.1) under the recommended Formula 2 (http://ggdc.dsmz.de/distcalc2.php) provided by the DSMZ website (Meier-Kolthoff et al. 2013).

Morphology, physiological and biochemical analysis

Growth tests were performed on Reasoner’s 2A, tryptic soy agar (TSA; AOBOX), nutrient agar (NA; AOBOX) and Ancylobacter–Spirosoma Medium (ASM; glucose 1 g, peptone 1 g, yeast extract 1 g, agar 15 g). Growth at different temperatures (0, 4, 10, 15, 20, 25, 28, 30, 32, 35 and 40 °C) was observed on R2A for 7 days. Tolerance to different NaCl concentrations (0–6%, at intervals of 0.5%, w/v) and pH (4.0–12.0, at intervals of 0.5 unit) were tested in R2A broth. The pH of the basal medium was adjusted using the buffer system: pH 4.0–5.0: 0.1 M citric acid/0.1 M sodium citrate; pH 6.0–8.0: 0.1 M KH2PO4/0.1 M NaOH; pH 9.0–10.0: 0.1 M NaHCO3/0.1 M Na2CO3; pH 11.0: 0.05 M Na2HPO4/0.1 M NaOH; pH 12.0: 0.2 M KCl/0.2 M NaOH (Xu et al. 2005). Cell morphology was observed by light microscope (BH-2; Olympus) and transmission electron microscope (JEM-1400, JEDL) after 3 days growth on R2A medium at 28 °C. Gram stain reaction was performed according to the method described by Dong and Cai (2001). Anaerobic growth test (Zhang et al. 2019a, b) was performed on R2A medium with 1 g pyrogallic acid and 2 ml 10% (w/v) NaOH in the plate, which was then sealed with Vaseline and growth detected for up to 7 days. Motility was tested in R2A medium containing 0.4% (w/v) agar and using the hanging-drop technique as described by Bernardet et al. (2002). Catalase activity was determined by assessing bubble production in 3% (v/v) H2O2, and oxidase activity was determined using 1% (w/v) tetramethyl-p-phenylenediamine (Ohta et al. 1983). Sensitivity and resistance to antibiotics were performed on R2A plates using filter-paper discs containing different antibiotics (Hangzhou Microbial Reagent). Hydrolysis of starch, gelatin and Tween 20, 40, 60 and 80 was determined on R2A for incubation at 28 °C for up to 7 days as described by Tindall et al. (2007). Other physiological properties and enzyme activities were determined using the API 50CH, API 20NE and API ZYM systems (bioMérieux) according to the manufacturers’ instructions.

Chemotaxonomic analyses

Cellular fatty acids of strain 3D7T and its reference strains were analysed by using colonies grown on R2A medium at 28 °C for 3 days. The fatty acid methyl ester mixtures were separated and analysed using the standard protocol of the Sherlock Microbial Identification System (MIDI Sherlock software package, version 6.0) (Kämpfer et al. 1996; Sasser 1990). For analyses of quinones and polar lipids, cells were collected at the exponential phase by centrifugation, then washed three times with sterilized water and freeze-dried. Isoprenoid quinones were extracted and purified by the methods of Collins (1985) and then analysed by reversed-phase HPLC. Polar lipids were extracted from freeze-dried cells and loaded onto thin-layer silica gel 60 plates (Merck). Two-dimensional migration was performed on each plate using chloroform–methanol–water (65:25:4, by vol.) as the first solvent and chloroform–acetic acid–methanol–water (80:15:12:4, by vol.) as the second one (Minni et al. 1979; Collins et al. 1980). Total polar lipids were detected by spraying with phosphomolybdic acid solution followed by heating at 110 °C for 10 min. Aminolipids were detected by spraying the plate with a 0.4% (w/v) solution of ninhydrin in butanol saturated with water followed by heating at 105 °C for 10 min. Phospholipids were detected by spraying with the reagent of Dittmer and Lester.

Results and discussion

Phylogenetic characteristics

The nearly complete 16S rRNA gene sequence of strain 3D7T (1396 bp) was determined and compared with the corresponding sequences. It shared the highest 16S rRNA gene similarity to ‘Microvirga brassicacearum’ CDVBN77T (98.3%), followed by Microvirga subterranea DSM 14364 T (96.8%), Microvirga guangxiensis 25BT (96.5%), and Microvirga aerophila DSM 21344 T (96.5%). The NJ analyses (Fig. 1) showed that strain 3D7T shared a branching node with ‘M. brassicacearum’ CDVBN77T, which was highly consistent with ME tree (Fig. S1) and ML tree (Fig. S2). It was clear that strain 3D7T was a member of the genus Microvirga.

Fig. 1
figure 1

Neighbor-joining phylogenetic tree based on 16S rRNA gene sequences showing the relationships between strain 3D7T and the closely related species of the genus Microvirga. Numbers at nodes represent bootstrap percentages (> 50%) based on 1000 replicates. Phyllobacterium salinisoli LLAN61T was chosen as an outgroup. Bar represents 0.01 substitutions per nucleotide position

Genome composition and DNA-DNA hybridisation

The draft genome sequence of strain 3D7T was 4,457,992 bp in length with 31 contigs. The coverage, N50 and DNA G + C content were 410 × , 431,466 bp and 63.5 mol%. The genome data met the proposed minimal standards for the use of genome data for the taxonomy of prokaryotes (Chun et al. 2018). The genome had 4321 protein-coding genes and 49 RNAs (Table 1). Genomic analyses showed that strain 3D7T and ‘M. brassicacearum’ CDVBN77T yielded ANIb and dDDH values of 77.5% and 22.2%, respectively. The ANI values between strain 3D7T and other species of the genus Microvirga were detailed in Table 1, which were all below standard criteria for classifying strains as different species (95–96%) (Kim et al. 2014). The dDDH values between strain 3D7T and other species of the genus Microvirga were also detailed in Table 1, and they were far below the 70% cut-off value generally recommended for species differentiation (Wayne et al. 1987). A genome-based phylogenetic tree was included in Fig. S3, which showed that strain 3D7T was affiliated to the genus Microvirga. These data confirmed that strain 3D7T represented a novel species of the genus Microvirga.

Table 1 Genome-based comparisons of strain 3D7T and the closely related type strains of the genus Microvirga

Genome features and function prediction

Gene Ontology database analysis results of strain 3D7T reflected a complex metabolic and regulatory network: 1022 genes were related to biological processes, accounting for about 23.7%; 981 genes were related to cell components, accounting for about 22.7%; 2,427 genes were related to molecular functions, accounting for about 56.2% (Fig. S4). Among the 20 general COG functional categories, the detailed distribution of genes was as follows: Amino acid transport and metabolism, 497 genes; Inorganic ion transport and metabolism, 293 genes; Energy production and conversion, 204 genes; Transcription, 202 genes; Carbohydrate transport and metabolism, 200 genes. Detailed information of the COG functional categories was presented in Fig. S5. KEGG metabolic pathways were classified according to the relationship between KO (KEGG ORTHOLOGY) and Pathway. Functional annotation of genes by comparisons against the manually curated KEGG GENEs database revealed that there were 62 genes related to the biosynthesis of other secondary metabolites, 122 genes related to the biodegradation and metabolism of xenobiotics, 374 genes related to the metabolism of carbohydrates and 45 genes related to the metabolism of terpenoids and polyketides (Fig. S6). The genome annotations showed genes encoding for proteins with phosphatase activity, such as the enzymes alkaline phosphatase (EC 3.1.3.1), acid phosphatase (EC 3.1.3.2), inorganic triphosphatase (EC 3.6.1.25) and pyrophosphatase (EC 3.6.1.1). Some enzymes involved in the production of triglyceride lipases, including lysophospholipase (EC 3.1.1.5) and unidentified phospholipase were also observed. These capabilities were tentatively proven in physiological tests, with potential applications in the agriculture biotechnology, washing industry and low-temperature environment remediation.

Analysis of the genome sequence of strain 3D7T showed 132 genes encoding different CAZymes in six different classes: glycoside hydrolases (GHs), enzymes that catalyze the hydrolysis of glycosidic linkage of glucoside—27 gene counts; glycosyltransferases (GTs), involved in the formation of glycosidic bonds—47 gene counts; carbohydrate esterases (CEs), which hydrolyze carbohydrate esters—35 gene counts; auxiliary activities (AAs), redox enzymes that act in conjunction with CAZymes—20 gene counts; polysaccharide lyases (PLs), which perform non-hydrolytic cleavage of glycosidic bonds—2 gene counts and carbohydrate-binding modules(CBMs)—1 gene count (Table S1). AntiSMASH output revealed four biosynthetic gene clusters (BGCs) involved in the secondary metabolism of the bacterium. One of those clusters encodes terpene BGC, which is related to the synthesis of isoindolinomycin. Other clusters encode an arylpolyene, a homoserine lactone and a terpene BGC that are not described for the production of an already known molecule. These genetic characteristics indicated that strain 3D7T may have biotechnological potential for the degradation of biomass and the pharmaceutical industry.

Phenotypic characteristics

Strain 3D7T grew well on R2A agar and ASM agar, but grew weakly on TSA and NA. Colonies on R2A agar plate were light-pink, semi-transparent, smooth and round. Cells were Gram-stain-negative, aerobic, non-motile, rod-shaped, 1.0–2.2 µm long and 0.7–0.9 µm wide (Fig. S7). It was able to grow at 4–32 °C(optimum, 25–28 °C), pH 7.0–10.0 (optimum, 7.0–7.5) and in the presence of 0–3% (w/v) NaCl (optimum without NaCl). These characteristics markedly differentiated strain 3D7T from the first related strain ‘M. brassicacearum’ CDVBN77T. Strain 3D7T assimilated L-arabinose, D-xylose, D-ribose and D-cellobiose, weakly assimilated D-fucose, D-glucose and D-mannose, but negative for assimilation of D-fructose, D-mannitol, aesculin ferric citrate, maltose, melibiose, D-sucrose, erythritol, D-arabinose, L-xylose, D-mannose, N-acetyl-glucosamine, D-mannose, D-galactose, inositol, glycerol, malic acid, butryric acid and acetoacetic acid. Sensitive to penicillin (10 U), ampicillin (10 µg), chloramphenicol (30 µg), tetracycline (30 µg), streptomycin (10 µg) and neomycin (30 µg), but resistant to polymyxin B (300 IU), vancomycin (30 µg) and bacitracin (0.04 U). Strain 3D7T hydrolysed Tween 20, 40, 60, and weakly hydrolysed Tween 80. It can hydrolyse aesculin, but not gelatin and tyrosine. Positive reaction for alkaline phosphatase, valine arylamidase and naphthol-AS-BI-phosphohydrolase, weakly positive for lipase (C14) and trypsin. These characteristics differentiated strain 3D7T from ‘M. brassicacearum’ CDVBN77T and other closely related reference strains. More differential characteristics between strain 3D7T and its closely related species in the genus Microvirga were given in Table 2, and the other detailed physiological and biochemical characteristics are present in the species description.

Table 2 Differential characteristics between strain 3D7T and other closely related species of the genus Microvirga

Chemotaxonomic characteristics

The major cellular fatty acids of strain 3D7T (> 10%) were summed feature 8 (C18:1ω7c and/or C18:1ω6c) (36.2%) and C19:0 cyclo ω8c (21.7%), which was similar to that of closely related species of the genus Microvirga. Minor qualitative and quantitative differences could be used to distinguish strain 3D7T from the closest relatives of the genus Microvirga. Compared with ‘M. brassicacearum’ CDVBN77T, strain 3D7T possessed higher amounts of C16:0, summed feature 2 (C14:0 3-OH and/or iso-C16:1 I) and summed feature 3 (C16:1ω6c and/or C16:1ω7c), and lower amounts of C18:0, C18:0 3-OH, C18:1ω7c 11-methyl and feature 8 (C18:1ω7c and/or C18:1ω6c). C14:0 and C17:0 cyclo were detected in strain 3D7T, but not detected in ‘M. brassicacearum’ CDVBN77T (Table 3). The predominant respiratory quinone of strain 3D7T was Q-10, which was in good agreement with other species of the genus Microvirga. The polar lipids of strain 3D7T consisted of phosphatidylcholine and phosphatidylethanolamine as the major component, plus one unidentified aminophospholipid, two unidentified amino lipids and three unidentified lipids (Fig. S8). Strain 3D7T shared the same major polar lipids with most of the described species of the genus Microvirga.

Table 3 Fatty acid profiles of strain 3D7T and the closely related species of the genus Microvirga

In conclusion, all phenotypic, chemotaxonomic, phylogenetic and genomic analyses suggested that strain 3D7T should be considered to represent a novel species of the genus Microvirga, for which the name Microvirga antarctica sp. nov. was proposed.

Description of Microvirga antarctica sp. nov.

Microvirga antarctica (ant.arc'ti.ca. L. fem. adj. antarctica southern, pertaining to the Antarctica, where the type strain was isolated).

Cells are Gram-stain-negative, aerobic, non-motile and rod-shaped (0.7–0.9 × 1.0–2.2 µm). Growth occurs on R2A agar and ASM agar, weakly on NA and TSA. Colonies are light-pink, semi-transparent, smooth, round and smaller than 1.0 mm in diameter after 3 days at 28 °C. Growth occurs at a range of 4–32 °C (optimum, 25–28 °C) and pH 7.0–10.0 (optimum, 7.0–7.5) and in the presence of 0–3% (w/v) NaCl (optimum without NaCl). Oxidase, catalase and nitrate reduction are positive, but glucose fermentation, arginine dihydrolase, indole production and urease are negative. Hydrolyses aesculin, Tween 20, 40 and 60, weakly hydrolyses Tween 80, but not starch. Positive for alkaline phosphatase, esterase (C4), esterase lipase (C8), leucine arylamidase, valine arylamidase, acid phosphatase and naphthol-AS-BI-phosphohydrolase, weakly positive for lipase (C14), cystine arylamidase and trypsin. The major polar lipids are phosphatidylcholine and phosphatidylethanolamine. The predominant quinone is Q-10 and the major fatty acids are summed feature 8 (C18:1ω7c and/or C18:1ω6c) and C19:0 cyclo ω8c. The genomic DNA G + C content of the type strain is 63.5 mol%.

The type strain, 3D7T (= CGMCC 1.13821 T = KCTC 72465 T), was isolated from a soil sample collected from Deception Island, Antarctica. The GenBank/EMBL/DDBJ accession number for the 16S rRNA gene sequence of strain 3D7T is MH561859. This Whole Genome Shotgun project of strain 3D7T has been deposited at DDBJ/ENA/GenBank under the accession number JAGEMM000000000.