Abstract
Qipengyuania sediminis CGMCC 1.12928T, a family member of Erythrobacteraceae, the class of Alphaproteobacteria, was isolated from a borehole sediment sample collected from Qiangtang Basin in Qinghai-Tibetan Plateau, the largest permafrost in China. Understanding bacterial molecular feature may shed light on the ecological strategy in the extreme environment. Here we describe the complete genome sequence and annotation of strain CGMCC 1.12928T, including the complete genome sequence and annotation. The genome of strain CGMCC 1.12928T consist of a single-circular chromosome, comprises 2,416,000 bp with an average G + C content of 66.7 mol%, and contains 2414 genes; including 2367 CDSs, 44 tRNA genes, as well as one operon of 16S-23S-5S rRNA genes. Genomic properties indicated that strain CGMCC 1.12928T has a relatively smaller genome size and higher G + C content within the family Erythrobacteraceae. In addition, genomic analysis revealed its genome contains multiple function genes responsible for nitrogen, sulfur and phosphorus cycles and explained the cold adaption mechanism. Thus, this strain plays an active role in the biogeochemical cycle in cold niche. The whole-genome of this isolate will widen our understanding of the ecological role of the genus Qipengyuania in permafrost.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
The Qinghai-Tibetan Plateau is the largest permafrost in China [1] and seasonal fluctuation by freezing and melting [2]. The light and precipitation are abundant in summer (from May to September) and the temperature is low in winter with low decomposition and ecosystem respiration, which has created a unique and complex ecosystem [3]. Microorganism, as producers and decomposers, is one of the pivotal compositions of microbial food chains and plays a significant role in ecology and biogeochemistry of permafrost ecosystem. Many key life processes, including nitrogen, sulfur, and phosphorus metabolism in the soil, may be regulated by the local bacteria. Understanding bacterial molecular profile will be helpful for explicating the vital physiological function in certain ecosystem.
The genus Qipengyuania, belonging to the family Erythrobacteraceae and the class Alphaproteobacteria, contain only one species Qipengyuania sediminis which is also the type species of the genus [1]. The type strain was isolated from a borehole sediment sample collected from Qiangtang Basin in Qinghai-Tibetan Plateau, China [1]. The study with respect to the ecological roles of bacteria belonging to the genus of Qipengyuania has not been reported previously.
Currently, strain CGMCC 1.12928T is the only strain affiliated to the species Q. sediminis (with 16S rRNA gene similarity > 97%) via searching the NCBI database. However, some candidate Q. sediminis strains from different habitats, such as soil, dolomite rock, coal bed, air, sediment, drinking water [4], and gastrointestinal specimens [5], were found via cultured-independent approach. Interestingly, these strains were from harsh environments, such as biological soil crust of copper mine tailings wastelands, soil contaminated with anthracene [6], and heavy metal contaminated estuarine sediment. In this study, the genome of Q. sediminis strain CGMCC 1.12928T, the type strains of the type species of the genus Qipengyuania is reported and its ecological roles was analyzed.
Organism Information
Strain CGMCC 1.12928T was purchased from the China General Microbiological Culture Collection Center (CGMCC). Cells are Gram-reaction-negative, facultatively aerobic, and chemoheterotrophic [1]. Strain CGMCC 1.12928T is cultivated routinely in Marine Broth 2216 (MB, Difco) at 37 °C. By convention, the strain could be preserved at −80 °C in MB supplemented with 30% (v/v) glycerol. General features of strain CGMCC 1.12928T are summarized in Table 1.
Genome Sequencing, Assembly, and Annotation
Strain CGMCC 1.12928T was cultured in R2A at 30 °C. When bacteria reached the late exponential phase (about 48 h), the cells were collected by centrifuging at 10,000×g for 10 min. High-quality of genomic DNA was extracted using AxyPrepTM Bacterial Genomic DNA Miniprep Kit (Axygen®, Corning). The quality and concentration of the total DNA was measured by Nanodrop (ND-2000, Thermo Scientific™). The draft genome was sequenced using paired-end sequencing technology with Illumina HiSeq-PE150 platform (Novogene Bioinformatics Technology Co. Ltd, Beijing). The sequencing generated approximate 1.0 G bytes clean data with genome coverage of 280X. The de novo assembly of the reads was implemented by ABySS 1.5.2 [7] and one contig of the genome was obtained. PCR amplification were carried out with a designed primer pair (Qse-F: ACTATGAGACCAGGACGC, Qse-R: TTCGGCAGGTGAGTTGT) and PrimeSTAR GXL DNA polymerase (TaKaRa, Dalian, PR China). The PCR product was sequenced via Sanger sequencing and the obtained sequence could overlap the front and rear of the initial contig.
The ribosome RNA (rRNA) genes and transfer RNA (tRNA) genes were identified by RNAmmer version 1.2 server (http://www.cbs.dtu.dk/services/RNAmmer/) [8] and tRNAscan-SE version 1.3 (http://lowelab.ucsc.edu/tRNAscan-SE/) [9], respectively. The open reading frames (ORFs) and functional annotation of translated ORFS were performed using Rapid Annotation using Subsystem Technology (RAST) server online [10]. Functional genes of interest were categorized via SEED subsystems. Gene functional categories were predicted using Clusters of Orthologous Groups of proteins (COG) database [11] to elaborate their putative function. The number of genes with signal peptides were performed using SignalP version 4.1 server (http://www.cbs.dtu.dk/services/SignalP/) [12]. The number of genes with transmembrane helices were identified by TMHMM server v.2.0 (http://www.cbs.dtu.dk/services/TMHMM/) [13]. Metabolic pathway were predicted by Kyoto Encyclopedia of Genes and Genomes (KEGG) automatic annotation server with BBH method [14]. CRISPR (clustered regularly interspaced short palindromic) repeats structures were searched by CRISPRfinder (http://crispr.i2bc.paris-saclay.fr/Server/) [15]. Circular map of strain CGMCC 1.12928T was visualized using CGView [16].
Results and Discussion
The complete genome of strain CGMCC 1.12928T consist of a single-circular chromosome (Fig. 1), comprising 2,416,000 bp with an average G + C content of 66.7 mol%, which is a bit lower than the value reported before (73.7 mol%) [1]. The complete genome of strain CGMCC 1.12928T contain 2414 genes, including 2367 CDSs, 44 tRNA genes, as well as one operon of 16S-23S-5S rRNA genes. According to the result of RAST server online annotation, 1927 CDSs (79.4% of the total CDSs) could be allocated to COG database. The main COG categories of strain CGMCC 1.12928T were category E (11.2%), category C (10.7%), category J (10.2%), category L (9.9%), category K (9.6%), category G (9.5%), and category P (8.8%). In addition, 1198 CDSs (50.6%) were allocated to KEGG database, and metabolic pathways were the predominant pathways. The genome features and COG classification are summarized in Tables 2 and 3, respectively.
Currently, 27 genomes of the Erythrobacteraece species are available. Here, we compare the genome of strain CGMCC 1.12928T with other Erythrobacteraceae strains (Table S1). The genome size of strain CGMCC 1.12928T (2.4 Mb) is smaller than that of other Erythrobacteraceae strains (2.59–4.11 Mb). In addition, the G + C content of the strain CGMCC 1.12928T (66.7 mol%) is higher than that of other Erythrobacteraceae strains except for Altererythrobacter sp. NS1 (67.0 mol %), Erythrobacter sp. HL-111 (68.1 mol%) and Porphyrobacter sp. CACIAM 03H1 (67.6 mol%).
Biogeochemical relevant genes were found in the genome of strain CGMCC 1.12928T leading to the understanding of its role in matter-cycle, especially the utilizing of nitrogen, sulfur, and phosphorus.
-
(i)
Nitrogen Metabolism. The ammonium, nitrate, or nitrite transporters were not detected in the genome of strain CGMCC 1.12928T, indicating it could not transport ammonium, nitrate, and nitrite from extracellular to intracellular. However, the presence of amino acids permeases and ABC transporters (leucine/isoleucine/valine) indicated that strain CGMCC 1.12928T may utilize amino acids as a source of organic nitrogen. The nitrate reductase and the nitrite reductase genes in nitrate/nitrite assimilation were not detected, which is consistent with the previous report that strain CGMCC 1.12928T is negative for nitrate reduction [1]. The urease, encoded by ureA, ureB, and ureC genes, is absent in the genome of strain CGMCC 1.12928T, indicating that it is unable to utilize urea as a source of inorganic nitrogen [17].
-
(ii)
Sulfur Metabolism. Genes encoding assimilatory sulfate reduction and cysteine biosynthesis are annotated in the genome of strain CGMCC 1.12928T, including sulfate adenylyltransferase (cysND), adenylylsulfate kinase (cysC), phosphoadenosine phosphosulfate reductase (cysH), NADPH‐dependent sulfite reductase (cysJI), and cysteine synthase (cysK), indicating the incorporation of inorganic sulfur into organic compound (cysteine). Oppositely, genome of strain CGMCC 1.12928T lack the transporter genes responsible for alkanesulfonates acquisition from extracellular and the genes involved in alkanesulfonate assimilation, indicating its unable to utilize organic sulfur.
-
(iii)
Phosphorus Metabolism. The positive activities of alkaline phosphatase were confirmed in strain CGMCC 1.12928T by using API ZYM (bioMérieux) [1]. One alkaline phosphatase gene was also annotated. The genome of strain CGMCC 1.12928T possesses a high-affinity phosphate acquisition system (pstSCAB) and regulatory system (phoUBR) related to the phosphorus cycle, mediating inorganic phosphate acquisition to ensure the sufficient inorganic phosphate supply [18] under reduced phosphate conditions [19]. The genome of strain CGMCC 1.12928T harbors genes encoding the polyphosphate kinase (pkk) with the ability of synthesis and utilization of inorganic polyphosphate [20]. The type of polyphosphate kinase is PKK2, using polyphosphate as a substrate to generate GTP from GDP with higher polyphosphate utilization activity. The concentration of phosphorus source could be regulated by the PKK by synthesizing or catalyzing polyphosphate. However, the genome lacks of genes for transport (phnCDE) and cleavage (phnGHIJKLN) of organic phosphate [21]. These features indicated strain CGMCC 1.12928T may prefer to utilize inorganic phosphate than organic phosphate [22, 23].
Beside the element metabolism, the genome of strain CGMCC 1.12928T was mined to analyze potential genes involved in its cold adaptation, which including three functional genes. The two-component regulatory system, comprising sensor histidine kinases and response regulator proteins, play a vital role in cold adaptations [24] by detecting and adapting to their extra- or the intracellular environment changes [25]. Temperature stimuli induce differential expression of two-component regulatory systems in bacteria. The genome of strain CGMCC 1.12928T contained four copies of two-component system sensor histidine kinase and one copy of response regulator protein. In addition, seven copies of histidine kinase were predicted in the genome. Previous studies reported the pigment role in improving bacterial survival at cold environment [26]. Strain CGMCC 1.12928T does not have violacein-encoding gene cluster (vioABCDE) which leading to the lack of violet pigment. Oppositely, strain CGMCC 1.12928T possess the carotenoid-like pigments (yellow) pigment [1] and the relevant genes involved in carotenoid biosynthesis pathway were detected, including one copy of phytoene synthase gene, phytoene desaturase and lycopene beta-cyclase were observed. The presence of these genes may maintain the homeostasis and promote the adaptability when temperature changed [27]. The genome of strain CGMCC 1.12928T harbor genes responsible for DNA repair genes (recN, recO, radA, mutS, and deoxyribodipyrimidine photolyase) and chaperone genes (cbpA, dnaK, dnaJ, clpB, ecmE, htrA), which could be induced in bacteria associated with rapid decrease of growth temperature [25]. It has been reported that low temperatures could promote the expression of the groEL gene in cyanobacterial strain [28]. Similarly, strain CGMCC 1.12928T harbors one copy of molecular chaperone (groEL gene) and cochaperonin protein complex (groES gene) which may participate in the cold adaption.
Conclusions
The complete genome of strain CGMCC 1.12928T contains a circular chromosome. Genomic properties indicated that strain CGMCC 1.12928T has small genome size and a high G + C content within the family Erythrobacteraceae. In addition, genomic analysis reveals that strain CGMCC 1.12928T contains multiple function genes responsible for nitrogen, sulfur, and phosphorus cycles. These findings will improve our understanding of the ecological adaption and response behavior of the genus Qipengyuania. In the future, the complete genome sequence of this strain could capacitate further research of the molecular mechanisms of its survive strategy, and may improve the comprehending of this bacterium biogeochemical role in high-attitude environment.
Nucleotide Sequence Accession Number
The complete genome sequence of Q. sediminis CGMCC 1.12928T was deposited in DDBJ/EMBL/GenBank under the accession numbers CP037948.
References
Feng XM, Mo YX, Han L, Nogi Y, Zhu YH, Lv J (2015) Qipengyuania sediminis gen. nov., sp nov., a member of the family Erythrobacteraceae isolated from subterrestrial sediment. Int J Syst Evolut Microbiol 65:3658–3665
Zhao L, Li YN, Xu SX, Zhou HK, Gu S, Yu GR, Zhao XQ (2006) Diurnal, seasonal and annual variation in net ecosystem CO2 exchange of an alpine shrubland on Qinghai-Tibetan plateau. Globa Change Biol 12(10):1940–1953
Kato T, Tang YH, Gu S, Cui XY, Hirota M, Du MY, Li YN, Zhao ZQ, Oikawa T (2004) Carbon dioxide exchange between the atmosphere and an alpine meadow ecosystem on the Qinghai-Tibetan Plateau. Chin Agric For Meteorol 124(1–2):121–134
Lin WF, Yu ZS, Chen X, Liu RY, Zhang HX (2013) Molecular characterization of natural biofilms from household taps with different materials: PVC, stainless steel, and cast iron in drinking water distribution system. Appl Microbiol Biot 97(18):8393–8401
Li E, Hamm CM, Gulati AS, Sartor RB, Chen HY, Wu X, Zhang TY, Rohlf FJ, Zhu W, Gu C, Robertson CE, Pace NR, Boedeker EC, Harpaz N, Yuan J, Weinstock GM, Sodergren E, Frank DN (2012) Inflammatory bowel diseases phenotype, C. difficile and NOD2 genotype are associated with shifts in human ileum associated microbial composition. PLoS ONE 7(6):e26284
Nunez EV, Valenzuela-Encinas C, Alcantara-Hernandez RJ, Navarro-Noya YE, Luna-Guido M, Marsch R, Dendooven L (2012) Modifications of bacterial populations in anthracene contaminated soil. Appl Soil Ecol 61:113–126
Simpson JT, Wong K, Jackman SD, Schein JE, Jones SJM, Birol I (2009) ABySS: a parallel assembler for short read sequence data. Genome Res 19(6):1117–1123
Lagesen K, Hallin P, Rodland EA, Staerfeldt HH, Rognes T, Ussery DW (2007) RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res 35(9):3100–3108
Lowe TM, Chan PP (2016) tRNAscan-SE On-line: integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res 44(W1):W54–W57
Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, Edwards RA, Gerdes S, Parrello B, Shukla M, Vonstein V, Wattam AR, Xia FF, Stevens R (2014) The SEED and the rapid annotation of microbial genomes using subsystems technology (RAST). Nucleic Acids Res 42(D1):D206–D214
Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, Krylov DM, Mazumder R, Mekhedov SL, Nikolskaya AN, Rao BS, Smirnov S, Sverdlov AV, Vasudevan S, Wolf YI, Yin JJ, Natale DA (2003) The COG database: an updated version includes eukaryotes. BMC Bioinformatics 4:41
Petersen TN, Brunak S, von Heijne G, Nielsen H (2011) SignalP 40: discriminating signal peptides from transmembrane regions. Nat Methods 8(10):785–786
Krogh A, Larsson B, von Heijne G, Sonnhammer EL (2001) Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol 305(3):567–580
Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M (2004) The KEGG resource for deciphering the genome. Nucleic Acids Res 32:D277–D280
Grissa I, Vergnaud G, Pourcel C (2007) CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res 35:W52–W57
Stothard P, Wishart DS (2005) Circular genome visualization and exploration using CGView. Bioinformatics 21(4):537–539
Kanamori T, Kanou N, Atomi H, Imanaka T (2004) Enzymatic characterization of a prokaryotic urea carboxylase. J Bacteriol 186(9):2532–2539
Liu Q, Wu YH, Cheng H, Xu L, Wang CS, Xu XW (2017) Complete genome sequence of bacteriochlorophyll-synthesizing bacterium Porphyrobacter neustonensis DSM 9434. Stand Genom Sci 12:32
O’May GA, Jacobsen SM, Longwell M, Stoodley P, Mobley HL, Shirtliff ME (2009) The high-affinity phosphate transporter Pst in Proteus mirabilis HI4320 and its importance in biofilm formation. Microbiology 155(5):1523–1535
Zhong CQ, Fu JF, Jiang TY, Zhang CM, Cao GX (2018) Polyphosphate metabolic gene expression analyses reveal mechanisms of phosphorus accumulation and release in Microlunatus phosphovorus strain JN459. FEMS Microbiol Lett 365(6):fny034
Moran MA, Belas R, Schell MA, Gonzalez JM, Sun F, Sun S, Binder BJ, Edmonds J, Ye W, Orcutt B, Howard EC, Meile C, Palefsky W, Goesmann A, Ren Q, Paulsen I, Ulrich LE, Thompson LS, Saunders E, Buchan A (2007) Ecological genomics of marine Roseobacters. Appl Environ Microbiol 73(14):4559–4569
Satinsky BM, Crump BC, Smith CB, Sharma S, Zielinski BL, Doherty M, Meng J, Sun SL, Medeiros PM, Paul JH, Coles VJ, Yager PL, Moran MA (2014) Microspatial gene expression patterns in the Amazon River Plume. Proc Natl Acad Sci USA 111(30):11085–11090
Luo HW, Benner R, Long RA, Hu JJ (2009) Subcellular localization of marine bacterial alkaline phosphatases. Proc Natl Acad Sci USA 106(50):21219–21223
Pontinen A, Markkula A, Lindstrom M, Korkeala H (2015) Two-component-system histidine kinases involved in growth of Listeria monocytogenes EGD-e at low temperatures. Appl Environ Microbiol 81(12):3994–4004
Kumar R, Acharya V, Singh D, Kumar S (2018) Strategies for high-altitude adaptation revealed from high-quality draft genome of non-violacein producing Janthinobacterium lividum ERGS5:01. Stand Genomic Sci 13:11
Qin QL, Xie BB, Yu Y, Shu YL, Rong JC, Zhang YJ, Zhao DL, Chen XL, Zhang XY, Chen B, Zhou BC, Zhang YZ (2014) Comparative genomics of the marine bacterial genus Glaciecola reveals the high degree of genomic diversity and genomic characteristic for cold adaptation. Environ Microbiol 16(6):1642–1653
Kumar R, Singh D, Swarnkar MK, Singh AK, Kumar S (2016) Complete genome sequence of Arthrobacter alpinus ERGS4:06, a yellow pigmented bacterium tolerant to cold and radiations isolated from Sikkim Himalaya. J Biotechnol 220:86–87
Hossain MM, Nakamoto H (2002) HtpG plays a role in cold acclimation in cyanobacteria. Curr Microbiol 44(4):291–296
Acknowledgements
This work was supported by China Ocean Mineral Resources R & D Association (COMRA) Special Foundation (DY135-B2-10), the Scientific Research Fund of the Second Institute of Oceanography, MNR (JT1702), and the National Natural Science Foundation of China (41876182 & 41806204).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there are no conflicts of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Gao, Y., Wu, YH., Xu, L. et al. Complete Genome Sequence of Qipengyuania sediminis CGMCC 1.12928T, Shed Light on Its Role in Matter-Cycle and Cold Adaption Mechanism of the Genus Qipengyuania. Curr Microbiol 76, 988–994 (2019). https://doi.org/10.1007/s00284-019-01712-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00284-019-01712-w