Introduction

Members of Sphingomonas within Alphaproteobacteria have been isolated from a variety of environments, including human-associated niches, water (fresh and sea), endophytes, terrestrial habitats, sediment (river and subsurface), and rhizosphere soil (Asaf et al. 2020; Aylward et al. 2013). Many isolates possess multifaceted functions ranging from remediation of environmental contaminations to synthetizing highly beneficial phytohormones, such as polycyclic aromatic hydrocarbon (PAH) degradation and sphingan producing. Some species of the genus have also been noted to improve plant-growth during stress conditions such as drought, salinity, and heavy metals in agricultural soil, because of their potential to produce plant growth hormones e.g. gibberellins and indole acetic acid (Asaf et al. 2020; Khan et al. 2014; Yang et al. 2014)

Currently (June, 2020), the genus Sphingomonas includes 131 species with validly published and correct names according to the List of Prokaryotic names with Standing in Nomenclature (LPSN) (Parte 2018). A punctiform, orange strain designated HHU CXWT was isolated from the hair of a healthy 21-year-oldfemale student in class of microbiology experiment, Hohai University, Nanjing, China. The 16S rRNA gene analysis indicated strain HHU CXWT possibly represent a novel species in genus Sphingomonas, and polyphasic taxonomy was performed on this strain subsequently.

Materials and methods

Isolation of the bacterial strain and culture conditions

The hair was collected from a female student in class, and was cut into pieces (< 1 cm in length) before being bespattered on the trypticase soy agar (TSA) medium (BD Diagnostics, Maryland, USA). The plate was incubated at 28 °C for 5 days, and single colonies were selected and cultivated on TSA plates. The strain HHU CXWT was maintained on TSA medium and stored as aqueous glycerol suspensions (20%, v/v) at  − 80 °C.

Phenotypic and biochemical characterization

Colony properties of strain HHU CXWT were observed on TSA medium. Cell morphology was examined using optical microscopy (Axio Vert A1, Zeiss, German) after 2-day-incubation in trypticase soy broth (TSB) (BD Diagnostics, Maryland, USA) at 28 °C with vibration (180 rpm). Gram staining was determined by using a Gram Stain kit (G1060, Solarbio, China) according to the manufacturer’s instructions. Growth was tested at 4, 10, 20, 28, 35 and 37 °C on TSA medium. The pH range for growth was determined by measuring the optical densities (at 600 nm) of TSB cultures after 2 days. The pH was adjusted to pH 4–10 (at intervals of 1.0 pH unit) in a vertical flow clean bench after sterilization, using NaOH and HCl solutions. NaCl tolerance was determined on TSA plates adjusted to 0%, 0.5%, 1%, 2%, 3%, 4% and 5% concentrations (w/v). Motility was determined by observing growth of cells in test tubes containing semisolid TSA medium with 0.5% agar after 3 days of incubation at 28 °C (Cowan and Steel 1996). Additional physiological and biochemical characterization were performed using the Biolog GEN III microtest system (Biolog, USA), API 20NE and API ZYM systems (bioMérieux, France) according to the manufacturer’s instructions.

Chemotaxonomic characterisation

The biomass used for analysis of cellular fatty acids, polar lipids, and quinones, were obtained from cultures grown in TSB medium for 2 days at 28 °C. Cellular fatty acids were extracted, methylated and analyzed by using the Sherlock Microbial Identification System (MIDI) according to previous method (Sasser 1990) and the manufacturer’s instructions. Quinones were extracted (Collins et al. 1977) and detected by HPLC (Tamaoka 1986). Polar lipids were determined according to published procedures (Collins and Jones 1980; Minnikin et al. 1979).

Phylogenetic and genotypic analysis

Genomic DNA was isolated by using the Ezup Column Bacteria Genomic DNA Purification Kit (Sangon Biotech, China), and PCR amplification of the 16S rRNA gene sequence were performed with the universal primers 27F and 1492R (Lane 1991). The obtained 16S rRNA gene sequence was analyzed on the EzBioCloud server (https://www.ezbiocloud.net/) (Yoon et al. 2017). Phylogenetic analysis was carried out based on the neighbor-joining (Saitou and Nei 1987), maximum-likelihood (Felsenstein 1981) and maximum-parsimony (Fitch 1971) methods by using the MEGA X software package (Kumar et al. 2018) after multiple alignments of sequences using Clustal version 2.1 program (Larkin et al. 2007). Evolutionary distance matrices of phylogenetic trees were calculated according to Kimura’s two-parameter model (Kimura 1980). Bootstrap analysis was performed with 1000 replications (Felsenstein 1985).

Whole-genome sequencing of HHU CXWT was performed using paired-end sequencing method with Hiseq X platform (Illumina) at Magigene Company, Guangzhou, PR China. Reads of each data set were filtered, and high-quality paired-end reads were assembled using SPAdes version 3.13.0 (Nurk et al. 2013) in UGENE software package (Okonechnikov et al. 2012). Sequences with coverage < 5 or length < 500 bp were excluded from the output genome sequences. The average nucleotide identity (ANI) based on BLAST (ANIb) was determined using JSpecies version 1.2.1 (Richter and Rossello-Mora 2009) while the digital DNA-DNA hybridization (dDDH) values were calculated using the Genome-to-Genome Distance Calculator (GGDC) available at https://ggdc.dsmz.de (Meier-Kolthoff et al. 2013). Formula 2 was applied for the dDDH analysis. Genome annotation was performed by the NCBI Prokaryotic Genome Annotation Pipeline (Tatusova et al. 2016) and eggNOD online server (https://eggnog-mapper.embl.de/) (Huerta-Cepas et al. 2017). The DNA G + C content of the genome was calculate based on the genome sequences.

A gene set called bac120 was used to infer a super phylogeny based on multilocus sequence analysis, which includes 120 universal single-copy genes among domain Bacteria (Parks et al. 2018). This genome-based tree was referred as ‘bac120 tree’. Generally, the protein sequences of bac120 from a S. ginsenosidimutans strain (Assembly No. GCF_002374835.1) were retrieve from the Genome Taxonomy Database (https://gtdb.ecogenomic.org/); local tblastn program in blast-2.9.0 + software package (Altschul et al. 1997) was used to search homologs in the Sphingomonas genomes of interest with 1e-5 evalue and 50% identity as cutoffs; the gene sequences of each gene cluster were aligned by using Clustal Omega version 1.2.1 (Sievers et al. 2011); a maximum-likelihood tree was inferred by applying FastTree 2.1 (Price et al. 2009) to the common genes of the bac120 set among all the genomes of interest after alignment trimming and concatenation. The bac120 tree was constructed on a set of genomes as input data, which was implement by using an in-house Perl script under Ubuntu operation system.

Results and discussion

Phylogenetic and genotypic characteristics

Based on the results from EzBioCloud server, HHU CXWT shared the highest 16S rRNA gene sequence similarity (99.1%) with two non-type strains Sphingomonas sp. JS21-1 (FNZZ01000011) and Sphingomonas sp. Mn802worker (AORY01000018) (Aylward et al. 2013), while S. aquatilis JSS7T (AF131295) from the valid published species of Sphingomona showed the highest similarity (97.6%) with strain HHU CXWT. All three phylogenetic trees using neighbor-joining, maximum-likelihood and maximum-parsimony algorithms, indicated that HHU CXWT clustered with Sphingomonas sp. JS21-1 and Sphingomonas sp. Mn802worker, forming a distinct but not well supported clade in the genus Sphingomona (Fig. 1, S1 and S2). Based on the 16S rRNA gene analysis, S. aquatilis KCTC 2881T (= JSS7T) was used as a reference for the subsequent tests.

Fig. 1
figure 1

Neighbour-joining phylogenetic tree based on 16S rRNA gene sequences showing the position of strain HHU CXWT. Bootstrap values (expressed as percentages of 1000 replications) of above 70% are shown at the branch nodes. The GenBank accession numbers are indicated in the brackets at the end of the tip labels. The black dots denote nodes that were also recovered using the maximum-likelihood and maximum-parsimony methods. Erythrobacter longus DSM 6997T is used as out group. Bar, 0.02 substitutions per nucleotide position

Whole Genome Shotgun project of HHU CXWT has been deposited at GenBank/EMBL/DDBJ under the accession number JABULH000000000. The draft genome consists of 36 contigs with a total size of 3,558,003 bp and genomic DNA G + C content 67.1%. A total of 3345 coding genes was predicted, of which 736, 1904 and 2802 genes were assigned to Gene Ontology (GO) (The Gene Ontology Consortium 2019), Kyoto Encyclopedia of Genes and Genomes (KEGG) (https://www.kegg.jp/) and Cluster of Orthologous Groups (COG) (Galperin et al. 2015) database, respectively. Future analysis of the KEGG annotation results from eggNOD server suggested that there were 28 genes associated with flagellar assembly (ko02040), 35 with biofilm formation (ko05111, ko02025 and ko02026), 7 with carotenoid biosynthesis (ko00361), respectively. Dramatically, strain HHU CXWT harbors a crtX gene (locus tag HRV97_09920), which could encode an enzyme catalyzing zeaxanthin to produce zeaxanthin diglucoside, and is rare present among Sphingomonadales (Fig. 2) (Siddaramappa et al. 2018).

Fig. 2
figure 2

Maximum-likelihood phylogenetic tree based on 69 common bac120 gene sequences showing the position of strain HHU CXWT. Bootstrap values of 100% are marked with black dots at the branch nodes. Tip labels followed by a asterisk indicated a crtX homolog was found in the genome, and the GenBank accession number or whole genome shotgun (WGS) project number is indicated in the bracket. Erythrobacter longus DSM 6997 T is used as out group. Bar, 0.05 substitutions per nucleotide position

According to the taxa of 16S rRNA gene tree, 15 Sphingomonas genomes together with genomes of strain HHU CXWT and Erythobacter longus DSM 6997T (JMIW01), were used to infer a robust tree based on the bac120 gene set, which was named bac120 tree in this study. A total of 69 genes from bac120 gene set was detected in all the 17 genomes and used in further analysis. The bac120 tree was well supported on all the branches (Fig. 2). The same as shown in the 16S rRNA gene tree, strains HHU CXWT is close related to isolates Sphingomonas sp. JS21-1 and Sphingomonas sp. Mn802worker. Additionally, the bac120 tree clarified that S. phyllosphaerae (Huang et al. 2012; Rivas et al. 2004) and S. adhaesiva (Feng et al. 2018) were the currently known valid species sharing the most recent ancestor with strain HHU CXWT.

The ANIb values between HHU CXWT and the close related strains Sphingomonas sp. JS21-1 (FNZZ01), Sphingomonas sp. Mn802worker (AORY01) and S. aquatilis NBRC 16722T (BJXI01) were all below 85%, while the dDDH values were 28.5%, 27.9%, and 21.1%, respectively (Table 1). These values are lower than the proposed and generally accepted species boundaries of 70% for dDDH and 95–96% for ANI (Goris et al. 2007; Richter and Rossello-Mora 2009), which suggest that strain HHU CXWT represents a novel species of the genus Sphingomona. Furthermore, Sphingomonas sp. JS21-1 and Sphingomonas sp. Mn802workers should represent two novel species of Sphingomonas, too, according to the ANIb and dDDH analysis (Table 1).

Table 1 The average nucleotide identity based on BLAST (ANIb) and digital DNA-DNA hybridization (dDDH) analysis among HHU CXWT and closely related strains

Phenotypic characteristics

Cells of HHU CXWT were Gram-staining negative, motile, aerobic and rod shaped. The colonies on TSA medium were orange, circular, smooth, and opaque. Growth of HHU CXWT was observed at 4–35 °C (optimum temperature, 28 °C) and at pH 6–8 (optimum, pH 7). The tolerance to NaCl was up to 2% (w/v) with optimum growth in the presence of 0.5–1.0% NaCl. HHU CXWT was catalase-positive and oxidase-negative. In the API ZYM tests, HHU CXWT was positive for acid phosphatase, alkaline phosphatase, cystine arylamidase, esterase (C4), esterase lipase (C8), leucine arylamidase, naphthol-AS-BI-phosphohydrolase, trypsin, valine arylamidase, and α-glucosidase, but negative for lipase (C14), N-acetyl-β-glucosaminidase, α-chymotrypsin, α-fucosidase, α-galactosidase, α-mannosidase, β-galactosidase, β-glucuronidase, and β-glucosidase. In the API 20NE tests, HHU CXWT was positive for hydrolysis of esculin and nitrate reduction, but negative for arginine dihydrolase, d-glucose fermentation, indole production, and hydrolysis of 4-nitrophenyl β-D-galactopyranoside, gelatin, and urea. Additionally, HHU CXWT was negative for assimilation of adipic acid, capric acid, d-glucose, d-maltose, d-mannitol, d-mannose, l-arabinose, malic acid, N-acetylglucosamine, phenylacetic acid, potassium gluconate, and trisodium citrate. In the BIOLOG GEN III tests, HHU CXWT was negative for all the tests, while the reference strain S. aquatilis KCTC 2881T was positive for utilization of α-d-glucose, glycyl-l-prolin, d-glucuronic acid, and l-glutamic acid, and was resistant to pH 6, 4% NaCl, lincomycin, and tetrazolium blue.

Chemotaxonomic characteristics

The major fatty acids (> 5%) of strain HHU CXWT were C16:0, C17:1ω6c, C18:1ω7c11-methyl, summed feature 3 (C16:1ω7c and/or C16:1ω6c), and summed feature 8 (C18:1ω7c and/or C18:1ω6c) (Table S1). The predominant isoprenoid quinone detected in strain HHU CXWT was ubiquinone-10, which is the typical quinone reported for the genus Sphingomonas (Li et al. 2019). The polar lipids of strain HHU CXWT mainly consisted of diphosphatidylglycerol (DPG), phosphatidylethanolamine (PE), phosphatidylglycerol (PG), sphingoglycolipid (SGL), and phosphatidylinositol mannosides (PIM), which is similar to that of S. aquatilis KCTC 2881T (Fig. S3). Minor amount of phosphatidylcholine (PC) and an unidentified polar lipid (L) were also detected.

Taxonomic conclusions

Strain HHU CXWT shared < 98% similarities of 16S rRNA gene with valid described species and formed a distinct clade in the genus Sphingomonas (Fig. 1, S1, and S2), and the ANIb and dDDH analysis between HHU CXWT and closely related species were all below the generally accepted species boundaries 95–96% and 70%, respectively (Table 1). Furthermore, strain HHU CXWT showed differences from closely related type strains S. aquatilis KCTC 2881 T and S. phyllosphaerae FA2T on colony color, nitrate reduction, assimilation of several carbon resource, diagnostic phospholipids and major fatty acids (Table 2). Based on the genotypic, phenotypic and chemotaxonomic features, strain HHU CXWT can be considered as a novel species of the genus Sphingomonas, for which the name Sphingomonas hominis sp. nov. is proposed.

Table 2 Differential phenotypic and chemotaxonomic characteristics of strains HHU CXWT and closely related type strains

Description of Sphingomonas hominis sp. nov.

Sphingomonas hominis (ho’mi.nis L. gen. masc. n. hominis, of a human being, named for the host from whose hair this species is found).

Cells are Gram-staining negative, aerobic, rod shaped, and motile. Colonies are orange, circular, smooth and opaque on TSA medium. Growth occurs at 4–35 °C (optimum temperature, 28 °C) and at pH 6–8 (optimum, pH 7). The tolerance to NaCl was up to 2% (w/v) with optimum concentration 0.5–1.0%. Catalase-positive and oxidase-negative. In the API ZYM tests, positive for acid phosphatase, alkaline phosphatase, cystine arylamidase, esterase (C4), esterase lipase (C8), leucine arylamidase, naphthol-AS-BI-phosphohydrolase, trypsin, valine arylamidase, and α-glucosidase; but negative for lipase (C14), N-acetyl-β-glucosaminidase, α-chymotrypsin, α-fucosidase, α-galactosidase, α-mannosidase, β-galactosidase, β-glucosidase, and β-glucuronidase. In the API 20NE tests: positive for nitrate reduction and hydrolysis of esculin; negative for arginine dihydrolase, d-glucose fermentation, indole production, and hydrolysis of 4-nitrophenyl β-D-galactopyranosidegelatin, and urea; negative for assimilation of adipic acid, capric acid, d-glucose, d-maltose, d-mannitol, d-mannose, l-arabinose, malic acid, N-acetylglucosamine, phenylacetic acid, potassium gluconate, and trisodium citrate. The major fatty acids are C16:0, C17:1ω6c, C18:1ω7c11-methyl, summed feature 3 (C16:1ω7c and/or C16:1ω6c), and summed feature 8 (C18:1ω7c and/or C18:1ω6c). The predominant isoprenoid quinone is ubiquinone-10. The polar lipids are diphosphatidylglycerol (DPG), phosphatidylethanolamine (PE), phosphatidylglycerol (PG), sphingoglycolipid (SGL), phosphatidylinositol mannosides (PIM), and an unidentified glycolipid (GL).

The type strain, HHU CXWT (= KCTC 72946T = CGMCC 1.17504T = MCCC 1K04223T), was isolated from the hair of a 21-year old female student of Hohai University, Nanjing, China. The GenBank/EMBL/DDBJ accession numbers for the genome sequence is JABULH000000000. The DNA G + C content is 67.1%.