Introduction

Obesity is a major public health problem that increases the risk of several diseases such as metabolic diseases (type II diabetes) and cardiovascular diseases. According to Ng et al. (2014), the world rate of obesity has increased between 1980 and 2013. Obesity results from an imbalance between feed quantity intake and energy expenditure. This process involves both genetic and environmental factors (Frayling et al. 2007; Cecil et al. 2008). More recently, obesity has been associated with an imbalance of the gut microbiota composition (Ley et al. 2006; Turnbaugh et al. 2006; Million et al. 2012).

Several methods of treating obesity have been proposed, mainly dietary and lifestyle measures, medical and surgical treatments. Bariatric surgery is one of the most effective treatments for obesity. It is also associated with an increase in microbial diversity (Zhang et al. 2009; Kong et al. 2013).

For the purpose of assessing gut microbiota dynamics from obese people before and after bariatric surgery, we studied the gut microbiota of stool samples from obese subjects before and after bariatric surgery by culturomics (Lagier et al. 2015). During this study, we isolated a new anaerobic, Gram-negative coccus strain Marseille-P2082T, from a stool sample of a 33 year-old French woman living in Marseille after bariatric surgery. This new bacterium has been previously reported as a new species announcement without a thorough description and was provisionally named “Negativicoccus massiliensis” strain AT7 (Togo et al. 2016c).

Here, we describe the phenotypic characteristics of strain Marseille-P2082T (formerly AT7) together with the description of its complete genome sequencing and annotation.

Material and methods

Sample collection, strain Marseille-P2082T isolation and identification

A stool sample was collected in November 2011 from a 33 year-old obese French woman with a body mass index of 38.6 kg/m2. Written and informed consent was obtained from the patient at the Nutrition, Metabolic disease and Endocrinology service, in la Timone Hospital, Marseille, France. The study and the assent procedure were approved by the local ethics committee of IFR 48, under number 09-022. The stool sample was stored at −80 °C after collection and studied by a microbial culturomics approach.

The strain was isolated under anaerobic conditions. To isolate this new strain, 1 g of stool sample was injected into an anaerobic blood culture vial (BACTEC Lytic/10 vials of anaerobic culture/F), enriched with 4 ml (5%) of rumen juice sterilised by filtration and 4 ml (5%) of sheep blood, then incubated at 37 °C. A one-month follow-up was conducted as described elsewhere (Togo et al. 2017). Emerged colonies were cultivated under the same conditions for isolation and then identified by MALDI-TOF as described elsewhere (Seng et al. 2009).

Negativicoccus succinicivorans strain ADV 07/08/06-B-1388T (Marchandin et al. 2010) was generously provided by Professor Hélène Marchandin (HydroSciences Montpellier, CNRS, IRD, University of Montpellier, Department of Microbiology, Nîmes University Hospital, France) and cultured under comparable conditions.

Phylogenetic analysis

The 16S rRNA gene sequencing of the strain Marseille-P2082 was performed as previously reported (Drancourt et al. 2000) using the fD1-rP2 primers, a GeneAmp PCR System 2720 thermal cycler (Applied Bio systems, Bedford, MA, USA) and an ABI Prism 3130-XL capillary sequencer (Applied Biosciences, Saint Aubin, France).

For taxonomic assignment, CodonCode Aligner (101 Victoria Street Centerville, MA 02632, USA) was used to correct sequences and BLASTn searches was performed in the NCBI (National Centre for Biotechnology Information) web server at http://blast.ncbi.nlm.nih.gov.gate1.inist.fr/Blast.cgi using the 16S ribosomal RNA sequences (Bacteria and Archaea) database and excluding all sequences from uncultured species. Only reference sequences from the type strains of the closely related validly named species were considered. Sequences alignment was performed by a multiple sequence alignment program, ClustalW (Thompson et al. 2002). The Maximum Likelihood method based on the Kimura 2-parameter model was used to infer the tree. Rapid bootstrapping in conjunction with the autoMRE bootstopping criterion (Pattengale et al. 2010) and subsequent search for the best tree was used; for Maximum Parsimony analysis, 1000 bootstrapping replicates were used in conjunction with tree-bisection-and-reconnection branch swapping and ten random sequence addition replicates. Reliability of the nodes was estimated as to the percentages of bootstrap values obtained by repeating the analysis 1000 times to generate a consensus tree. The sequences were checked for a compositional bias using the Χ2 test as implemented in PAUP* (Swofford 2002).

Phenotypic and chemical characteristic analysis

Colony morphology and pigmentation were observed after cultivation of the strain on Columbia agar (bioMérieux, Marcy l’Etoile, France) at 37 °C for 48 h. Phenotypic characteristics of the strain was determined as previously described (Togo et al. 2016a, 2017). API strips: (API® 20A, API® Rapid ID 32A, API® ZYM, API® and API®50 CH) were used according to the manufacturer’s instruction (bioMérieux). Growth temperature range from 25, 28, 37, 45 and 55 °C was tested in aerobic, anaerobic and microaerophilic conditions. The growth of strain Marseille-P2082 was attempted at various pH values (6, 6.5, 7 and 8.5). The salt tolerance of the strain was also tested using various NaCl concentrations (5, 10, 50, 75 and 100 g/l) on Schaedler agar enriched with 5% Sheep Blood (bioMérieux) in an anaerobic atmosphere at 37 °C. Anaerobic conditions were generated by incubating the culture in an anaerobic jar using the GENbag anaer system (bioMérieux). Cellular fatty acid methyl ester analysis was performed by Gas Chromatography/Mass Spectrometry (GC/MS) as described elsewhere (Togo et al. 2017).

Genome sequencing and assembly

Genomic DNA (gDNA) of strain Marseille-P2082 and N. succinicivorans strain ADV 07/08/06-B-1388T was sequenced using the MiSeq Technology (Illumina Inc, San Diego, CA, USA) with the Mate Pair strategy and assembled as previously described (Togo et al. 2016b).

Genome annotation and comparison

For the genome annotation of strain Marseille-P2082, Open Reading Frames (ORFs) were predicted using Prodigal (Hyatt et al. 2010) and the predicted protein sequences were searched against the GenBank and Clusters of Orthologous Groups (COGs) databases by BLASTP as previously described (Togo et al. 2017). The draft genome of N. succinicivorans strain ADV 07/08/06-B-1388T was obtained in this study. For comparison, genomes were automatically retrieved for closely related species in the 16S rRNA tree: Dialister micraerophilus strain DSM 19965 (NZ_AFBB00000000), Dialister pneumosintes strain F0677 (NZ_CP017037), Dialister succinatiphilus strain YIT 11850 (NZ_ADLT00000000), Dialister invisus strain DSM 15470 (NZ_ACIM00000000) and Selenomonas bovis strain DSM 23594 (NZ_ARLB01000000) were selected. For each selected genome, complete genome sequence, proteomes sequences and Orfeomes sequences were retrieved from the FTP of NCBI. As a part of the taxonogenomics description, all proteomes were analysed with proteinOrtho (Lechner et al. 2011). Then for each pair of genomes, a similarity score was computed. This score is the mean value of nucleotide similarity between all pairs of orthologues between the two genomes studied (Ramasamy et al. 2014). An annotation of the entire proteome was performed to define the distribution of functional classes of predicted genes according to the clusters of orthologous groups of proteins. The genome of strain Marseille-P2082 was locally aligned 2-by-2 using BLSAT algorithm (Kent 2002; Auch et al. 2010) against each of the selected previously cited genomes and digital DNA-DNA hybridization (dDDH) values were estimated from a generalised model (Meier-Kolthoff et al. 2013), and the Orthologous Average Nucleotide Identity (OrthoANI) were calculated using Orthologous Average Nucleotide Identity Tool (OAT) software (Lee et al. 2016).

Results and discussion

MALDI-TOF analysis

The reference spectrum generated from isolated colonies was unable to be matched with any of those in the Bruker database (Fig. 1). A gel view was performed in order to detect the spectral differences of strain Marseille-P2082 and closely related species (Fig. 2).

Fig. 1
figure 1

Reference mass spectrum from strain Marseille-P2082T. Spectra from 12 individual colonies were compared and a reference spectrum was generated

Fig. 2
figure 2

Gel view comparing strain Marseille-P2082T to other closely related species. The gel view displays the raw spectra of loaded spectrum files arranged in a pseudo-gel like look. The x-axis records the m/z value. The left y-axis displays the running spectrum number originating from subsequent spectra loading. The peak intensity is expressed by a Grey scale scheme code. The colour bar and the right y-axis indicate the relation between the colour a peak is displayed with and the peak intensity in arbitrary units. Displayed species are indicated on the left

Phylogenetic analysis

The 16S rRNA phylogenetic analysis showed that strain Marseille-P2082T exhibits a 98. 65% sequence similarity with N. succinicivorans (Marchandin et al. 2010), which is classified in the family Veillonellaceae (Rogosa 1971). A Maximum Likelihood phylogenetic tree (Fig. 3) based on 16S rRNA gene sequences showed that strain Marseille-P2082 is closely related to N. succinicivorans. The 16S rRNA sequence of strain Marseille-P2082 has been deposited in EMBL-EBI under accession number LN876651.

Fig. 3
figure 3

Maximum Likelihood phylogenetic tree based on 16S rRNA sequence comparison highlighting the position of strain Marseille-P2082T compared to other closely related species. The corresponding GenBank accession numbers for 16S rRNA genes of each strain are indicated in brackets. Sequences were aligned using ClustalW with default parameters and Phylogenies were inferred by the GGDC web server available at http://ggdc.dsmz.de/ using the DSMZ phylogenomics pipeline. The scale bar represents a 3% nucleotide sequence divergence

Phenotypic, biochemical and chemotaxonomic characterisation

Strain Marseille-P2082 was observed to be a Gram-stain negative, non-motile, coccoid bacterium. The strain exhibits no catalase nor oxidase activity. Growth was found to occur between 28 and 55 °C, but optimal growth was observed at 37 °C after 72 h of incubation in both anaerobic and microaerophilic atmospheres on 5% sheep blood Columbia agar (bioMérieux). No growth of the strain was observed in an aerobic atmosphere. No growth of the bacterium was observed using NaCl concentration of 10 g/l or more on 5% sheep blood Schaedler agar (bioMérieux). Strain Marseille-P2082 was found to grow at pH values ranging from 6 to 8.5 but the optimal pH was 7.5. Colonies were observed to be circular, translucent and very small, with a mean diameter less than 0.5 mm. No haemolysis was seen on Columbia agar (bioMérieux) after 72 h of incubation in anaerobic condition at 37 °C. Cells were found to have a diameter ranging from 0.2 to 0.5 µm by electron microscopy (Supplementary Fig. 1).

Using rapid ID 32A strips, arginine dihydrolase, arginine arylaminidase, histidine arylamidase, leucyl glycine arylamidase, phenylalanine arylamidase and tyrosine arylamidase activities were found to be positive. Urease and indole are not produced, and nitrate is not reduced. Mannose and raffinose are not fermented. Alkaline phosphatase, alanine arylamidase, glycine arylamidase, glutamyl glutamic acid arylamidase, leucine arylamidase, proline arylamidase, pyroglutamic acid arylamidase, serine arylamidase, glutamic acid decarboxylase, n-acetyl-β-glucosaminidase, α-arabinose, α-galactosidase, β-galactosidase, β-galactosidase 6 phosphate, α-glucosidase, β-glucosidase, β-glucuronidase, and α-fucosidase activities were found to be negative.

Using API® Zym strips, positive reactions were observed for esterase, esterase lipase, valine arylamidase, acid phosphatase, and naphtol-AS-BI-phosphohydrolase, but alkaline phosphatase, lipase, leucine arylamidase, cystine arylamidase, N-acetyl-β-glucosaminidase, trypsin, α-chymotrypsin, α-galactosidase, β-galactosidase, β-glucuronidase, α-glucosidase, β-glucosidase, α-mannosidase and α-fuctosidase activities were negative.

Using API® 20A strips, positive reactions were observed with d-glucose, d-lactose, d-maltose, d-mannose, d-melezitose, d-saccharose, and glycerol. Indole production, esculin hydrolyse, urease, d-cellulose, d-mannitol, d-raffinose, d-sorbitol d-trehalose, d-xylose, gelatine, salicin, l-arabinose and l-rhamnose were found to be negative.

Using API 50 CH strips, positive fermentation reactions were observed with aesculin, d-galactose, d-glucose, d-fructose, d-lactose, d-maltose, d-mannose, d-melezitose, d-ribose, d-saccharose, d-tagatose, d-turanose, glycerol, N-acethylglucosamine, potassium 5-cetogluconate and starch. Negative reactions were observed for fermentation of adonitol, amygdaline, arbutine, d-arabinose, d-arabitol, d-cellobiose, d-fucose, d-lyxose, d-mannitol, d-melibiose, d-raffinose, d-sorbitol, d-trehalose, d-xylose, dulcitol, erythritol, gentiobiose, glycogen, inositol, inulin, l-arabinose, l-arabitol, l-fucose, l-rhamnose, l-Sorbose, l-xylose, methyl-α-d-glucopyranoside, methyl-α-d-mannopyranoside, methyl-β-d-xylopyranoside, potassium gluconate, potassium 2-ceto gluconate, salicin, and xylitol. The strain Marseille-P2082 differs from N. succinicivorans in its ability to ferment 14 sugars whereas N. succinicivorans is assaccharolytic. The differential characteristics of strain Marseille-P2082T compared with its close relatives are detailed in Table 1.

Table 1 Differential characteristics of strain Marseille-P2082

The most abundant cellular fatty acids of strain Marseille-P2082T were identified as C18:1n9 (37.2%), C18:0 (34.7%), C16:0 (21.3%), C18:2n6 (6.3%) and a minor amount of C14:0 (< 1%).

Genome properties

The draft genome from strain Marseille-P2082 has been deposited in EMBL-EBI under accession number LT700188. The draft genome is 1, 360, 589 bp long with a 51.1% G+C content (Fig. 4). It is composed of 1 scaffold (1 contig). Of the 1353 predicted genes, 1276 are protein-coding genes, 68 are RNAs genes (12 rRNA, 52 tRNA and 4 other RNA) and 9 were pseudogenes. A total of 995 genes (77.9%) were assigned a putative function (by COGs or by NR blast), 23 genes (1.8%) were identified as ORFans, and 5 genes were associated with polyketide synthase (PKS) or non-ribosomal peptide synthetase (NRPS) (Conway and Boddy 2013). The remaining 253 protein coding genes (19.8%) were annotated as hypothetical proteins. Using ARG-ANNOT (Gupta et al. 2014), no antibiotic resistance genes were found. The distribution of genes into COGs functional categories is presented in Table 2.

Fig. 4
figure 4

Graphical circular map of the genome of strain Marseille-P2082T. From outside to the centre: Contigs (red/grey), COG category of genes on the forward strand (three circles), genes on forward strand (blue circle), genes on the reverse strand (red circle), COG category on the reverse strand (three circles), GC content. (Color figure online)

Table 2 Number of genes associated with the 26 general COG functional categories of strain Marseille-P2082T

Genome comparison

The draft genome (1.36 Mb) of the strain Marseille-P2082 is larger than those of D. succinatiphilus, D. pneumosintes and D. micraerophilus (1.25, 1.27 and 1.28 Mb respectively) but smaller than those of N. succinicivorans, D. invisus and S. bovis (1.48, 1.89 and 2.69 Mb). The G+C content of strain Marseille-P2082T (51.1%) is higher than those of D. pneumosintes, D. micraerophilus, D. invisus and D. succinatiphilus (35.2%, 35.4%, 45.5% and 50.4% respectively), similar to that of N. succinicivorans, but smaller than that of S. bovis (59.2%). The gene content of strain Marseille-P2082 (1353) is smaller than those of N. succinicivorans, D. invisus, D. succinatiphilus and S. bovis (1454, 1829, 2280, and 2569 respectively) but higher than those of D. micraerophilus and D. pneumosintes (1237 and 1263 respectively). However, the gene distribution into COGs categories was similar among all compared genomes (Table 2, Fig. 5).

Fig. 5
figure 5

Distribution of functional classes of predicted genes according to the COGs of strain Marseille-P2082T compared with closely related species. (1) Strain Marseille-P2082; (2) N. succinicivorans strain ADV; (3) Dialister micraerophilus; (4) Dialister invisus; (5) Dialister succinatiphilus; (6) Dialister pneumosintes; (7) Selenomonas bovis

The dDDH values of strain Marseille-P2082 ranged from 23.2% ± 4.8% with S. bovis to 53.7 ± 5.3% with N. succinicivorans strain ADV (Table 3). The OrthoANI values ranged from 63.35% between strain Marseille-P2082 and D. pneumosintes to 93.78% with N. succinicivorans. These values are lower than the threshold value (95–96%) recommended to delineate prokaryotic species (Konstantinidis and Tiedje 2005; Rodriguez-R and Konstantinidis 2014). The OrthoANI values between strain Marseille-P2082 and closely related species are shown in Table 4 and Fig. 6.

Table 3 Pairwise comparison of strain Marseille-P2082 with other species using GGDC, formula 2 (dDDH estimates based on identities/HSP length)*
Table 4 OrthoANI values calculated from the OAT software of strain Marseille-P2082 compared with those of its phylogenetically close neighbours
Fig. 6
figure 6

Heatmap generated with OrthoANI values calculated using the OAT software between strain Marseille-P2082 and other closely related species with standing in nomenclature

Based on phenotypic, chemotaxonomic, genomic and phylogenetic characteristics, strain Marseille-P2082T (= CSUR P2082T, DSM 100853T) is designated as the type strain of a novel species, here named Negativicoccus massiliensis.

Description of “Negativicoccus massiliensis” sp. nov

Negativicoccus massiliensis (mas.si.li.en’sis. L. masc. adj. massiliensis, of Massilia, the Latin name of Marseille, where the type strain was isolated).

Cells are Gram-negative, non-motile, asporogenous and coccoid, with a mean diameter of 0.5 µm. Columbia agar-grown colonies are circular, flat, translucent with a mean diameter smaller than 0.5 mm after 72 h of incubation in anaerobic conditions. Catalase and oxidase activities are absent. The optimum growth temperature is 37 °C. Arginine hydrolase activity is positive. Not able to reduce nitrate to nitrite. Indole and urease are not produced, gelatin is not liquefied and aesculin is not hydrolysed. The G+C content of the draft genome of the type strain is 51.1%.

The type strain Marseille-P2082T (= CSUR P2082T, = DSM 100853T) was isolated from a stool sample of a morbidly obese French woman. The habitat of this microorganism is the human gut. The 16S rRNA gene sequence and whole-genome shotgun sequence of strain Marseille-P2082T have been deposited in EMBL-EBI under accession numbers LN876651 and LT700188, respectively.