Introduction

The genus Leuconostoc consists of heterofermentative, Gram-positive, non-motile, non-sporulating, and facultative anaerobic lactic acid bacteria (LAB) that produce lactic acid as one of the key fermentation products from carbohydrate metabolism (Axelsson 2004; Hayek and Ibrahim 2013). At present, nearly 400 species of LAB have been recognized (Zhang et al. 2011) and the genus Leuconostoc is comprised of 24 different species (Kot et al. 2014). The genus Leuconostoc includes Leuconostoc mesenteroides (with four subspecies: mesenteroides, dextranicum, cremoris, and suionicum) and 23 other species, i.e., L. amelibiosum, L. argentinum, L. cremoris, L. citreum, L. carnosum, L. dextranicum, L. durionis, L. fallax, L. ficulneum, L. holzapfelii, L. pseudoficulneum, L. fructosum, L. gasicomitatum, L. gelidum, L. inhae, L. kimchii, L. lactis, L. pseudomesenteroides, L. miyukkimchii, L. oeni, L. palmae, L. paramesenteroides, and the recently reported L. rapi (www.bacterio.net). Leuconostocs can be found in various natural niches such as green vegetation and plant roots (Hemme and Foucaud-Scheunemann 2004). In addition, the genus Leuconostoc has also been found to be associated with beverages, meat, vegetables, and dairy and bakery products (Wassie and Wassie 2016). The amazing fermentative capacity of these bacteria helps to improve organoleptic attributes and safety of the food products (Gemechu 2015). It has also been reported that Leuconostoc helps in augmentation of various nutrients in the food products, including ample health benefits for humans (Steele et al. 2013). Furthermore, leuconostocs are utilized as starter cultures for various industrial processes and also as a flavoring and texturizing agents (Dan et al. 2014).

Several typing methods have been used for the identification, characterization, and utilization of LAB (Sabat et al. 2013; Tanigawa and Watanabe 2011). Generally, typing of bacteria is related to phenotypic and genotypic characteristics. Phenotyping includes traditional methods such as serotypes, biotypes, phage-types, and antibiograms (Dan et al. 2014). In contrast, genotypic methods are found to be more advantageous as they are rapid, generally applicable powerful tools to differentiate isolates on the basis of genetic variations (Unemo and Dillon 2011). Therefore, molecular approaches have been used for the proper identification and characterization of bacterial species (Botina et al. 2006). Currently, several molecular typing methods, such as random amplified polymorphic DNA PCR (RAPD-PCR), amplified fragment length polymorphism (AFLP), pulsed-field gel electrophoresis (PFGE), restriction fragment length polymorphism (RFLP), repetitive element palindromic PCR (Rep-PCR), protein fingerprinting, and multilocus sequence typing (MLST) have been utilized to characterize Leuconostoc spp. (Alegría et al. 2013; Björkroth et al. 2000; Cibik et al. 2000; Kunene et al. 2000; Nieto-Arribas et al. 2010; Pérez et al. 2002; Sánchez et al. 2005; Vihavainen and Björkroth 2009; Zhang et al. 2015). Multilocus sequence typing method was built on the procedure of multilocus enzyme electrophoresis (MLEE) by targeting the variation present at multiple housekeeping loci (Sabat et al. 2013). However, in MLST, identification of this variation was achieved by nucleotide sequencing of the housekeeping loci to differentiate between isolates of the same microbial species (Bain et al. 2007). In the last few decades, the MLST technique has been recognized to study the phylogenetic structure of many bacterial species by considering six to eight housekeeping genes (Feil and Enright 2004; Maiden et al. 1998; Oh et al. 2010). According to Konstantinidis et al. (2006), the minimum number of genes needed for multilocus differentiation between species is three because of potential events of horizontal gene transfer or recombination. Originally, the technique was used to identify different virulent lineages of 107 isolates of Neisseria meningitidis (Maiden et al. 1998). Soon after that, the approach was utilized to analyze other bacterial species, including LAB (Dingle et al. 2001; Farfán et al. 2002; Helgason et al. 2004). LAB strains that have been typed by MLST technique include Oenococcus oeni (Bilhere et al. 2009), Lactobacillus casei (Cai et al. 2007), Pediococcus (Calmin et al. 2008), Lactobacillus plantarum (de Las Rivas et al. 2006; Tanganurat et al. 2009), Lactococcus lactis (Passerini et al. 2010), Lactobacillus sanfranciscensis (Picozzi et al. 2010), Lactobacillus delbrueckii (Tanigawa and Watanabe 2011), Lactobacillus sakei (Chaillou et al. 2013), L. lactis (Dan et al. 2014), Lactobacillus fermentum (Dan et al. 2015), Carnobacterium maltaromaticum (Rahman et al. 2014), L. mesenteroides (Zhang et al. 2015), and recently Lactobacillus delbrueckii subsp. bulgaricus (Song et al. 2016).

The aim of the present study was to develop an MLST scheme for the characterization of 35 L. mesenteroides strains from South Korea. The relatedness of the selected strains was displayed as dendrogram and minimum spanning tree by means of various software using differences between their allelic profiles.

Materials and methods

Bacterial strains

In this study, a total of 35 strains, deposited in the Korean Culture Collection of Probiotics (KCCP, Seongnam, South Korea), were identified as L. mesenteroides based on sequence analysis of the 16S rRNA gene (Kaur et al. 2017). These strains were isolated from various food sources, including Kimchi (11 isolate), radish (10), Chinese cabbage kimchi (4), radish kimchi (2), young radish (4), salted clam (2), salted fish (1), and salted small octopus (1) from South Korea as described in previous study (Kaur et al. 2017). The strains were grown in MRS broth (Difco Laboratories, Detroit, MI, USA) and incubated overnight at 37 °C. Stock cultures were stored in 20% glycerol (Duksan, South Korea) at − 80 °C.

DNA extraction

Total genomic DNA was extracted and purified from 3-mL overnight cultures using the AccuPrep Genomic DNA Extraction Kit (Bioneer). The bacterial cells grown in MRS broth (Difco) at 37 °C were collected by centrifugation (16,000×g, 1 min), and DNA was extracted from the cell pellets of L. mesenteroides strains as described previously (Kaur et al. 2017; Lee and Lee 2008).

PCR amplification and sequencing of amplicons

To study the population structure of L. mesenteroides, seven housekeeping genes were selected: atpA (ATP synthase subunit alpha), gyrB (gyrase subunit B), groEL (molecular chaperonin), pheS (Phenylalanyl-tRNA synthetase), pyrG (CTP synthase), rpoA (RNA polymerase), and uvrC (Excinuclease ABC, subunit C). Primers were designed using a primer blast tool from the NCBI website (www.ncbi.nlm.nih.gov) and synthesized by Macrogen (Macrogen Inc., Seoul, South Korea). For polymerase chain reaction (PCR) amplification, 1 μL purified genomic DNA of each strain, 2 μL (10 pmoles/μL) of each primer (forward and reverse), and 17 μL of distilled water were added to the premix kit (Bioneer, Daejeon, South Korea) to make a final volume of 20 μL. The PCR procedure for the amplification of the seven housekeeping genes was done in an automatic thermocycler (Bio-Rad Gel Doc XR+, Bio-Rad, CA, USA) under the following conditions: 94 °C for 2 min, 35 cycles of denaturation at 95 °C for 20 s each, annealing for 30 s at a different temperature for each gene (atpA, 51 °C; groEL, 60 °C; gyrB, 49.2 °C; pheS, 46.3 °C; pyrG, 53.7 °C; rpoA, 50.5 °C; and uvrC, 58.3 °C), extension at 72 °C for 30 s, followed by a final elongation at 72 °C for 7 min. The PCR products were stored at 4 °C then 5 μL of each was assessed by electrophoresis on a 1% (w/v) agarose gel (Seakem, Lonza, USA), and the images were recorded by using a gel documentation system (Bio-Rad Gel Doc XR+). A 100-bp ladder (Bioneer, Daejeon, South Korea) and a 1-kb ladder (Takara, Japan) were used as size markers. The confirmed PCR products were purified with the Wizard SV Gel and PCR Clean-Up system kit (Promega, USA), and sequencing was performed by Humanizing Genomics MACROGEN (Macrogen Inc., Seoul, South Korea) and Bioneer (Bioneer, Daejeon, South Korea).

Data analysis

For MLST data analysis, the forward and reverse sequences of each gene were trimmed and analyzed with Bioedit Sequence Alignment Editor v 7.2.5 (Hall 1999), and further aligned with the ClustalX v 1.83 software. The guanine-cytosine (GC) content and the polymorphism data were calculated by DnaSP v 5.10 (Librado and Rozas 2009). The ratios of the synonymous substitutions (dS) to non-synonymous substitutions (dN) were studied by using an online software, SNAP (www.hiv.lanl.gov) (Korber 2000). The allelic profiles and the sequence types (STs) were deduced by using Bionumerics v 7.6 (Applied-Maths, Sint Martens-Latem, Belgium) software. The evolution of the population was studied with the help of minimum spanning tree (MST) using the Bionumerics software. Moreover, the relatedness of the STs were evaluated by constructing a dendrogram by the unweighted pair group method with arithmetic averages (UPGMA) using the START 2 v 0.9.0 beta software (Jolley et al. 2001). To assess the degree of tree-like structure, split decomposition analysis was performed with SPLITSTREE v 4.14 (Huson and Bryant 2006).

Nucleotide sequence accession numbers

The partial sequences of the seven MLST loci for 35 L. mesenteroides strains used in this study have been deposited in GenBank/EMBL databases under accession numbers KY224760-KY224969 and KY111323-KY111357.

Results

Sequence diversity and assignment of sequence types

The sequences of the seven chosen loci (atpA, groEL, gyrB, pheS, pyrG, rpoA, and uvrC) were determined for the 35 strains of L. mesenteroides. However, only a few genes have given desired clean sequences, while the others generated noisy sequence signals. Therefore, the latter PCR products were repurified and sequenced again. The number of alleles, polymorphic sites, guanine-cytosine content, and ratios of dN/dS were calculated. The fragment sizes of the seven housekeeping loci ranged in length from 366 to 1414 bp (Table 1). The MLST scheme defined alleles between 316 bp (uvrC) and 769 bp (gyrB) in length. The allelic combinations of the selected gene loci were determined, and a unique sequence type number was assigned for each isolate. Strains could be divided into 27 STs (ST-1 to ST-27) using combined data from the seven loci, giving a clear indication of variability among 35 strains (Table 2). Among these, ST-3 and ST-5 were the most observed STs, each was presented by five different strains, and the remaining 25 STs were represented by a single strain only. All the STs differed at various loci, except for ST-5, ST-8, ST-3; ST-13, ST-23; ST-26, ST-27; and ST-11, ST-24, differed only in the atpA locus; ST-3, ST-6 which showed differences at the pyrG locus; and ST-5, ST-23; ST-16, ST-17 with variation at the pheS locus whereas ST-3 and ST-22 showed variation at rpoA locus (Table 2). The allelic profiles of the seven loci for each strain are described in Table 2. The number of alleles per gene fragment varied from 7 for each of groEL and gyrB to 12 for rpoA. All the seven loci were found to be polymorphic (each is represented by more than five polymorphic sites), and the number of polymorphic sites ranged from 9 for groEL to 48 for pyrG (Table 3), giving a total of 220 SNPs.

Table 1 Genes and sequencing primers used
Table 2 Sequence types for 35 Leuconostoc mesenteroides strains
Table 3 Allelic variation in seven housekeeping genes

Variation and recombination at MLST loci

The GC content of the partial sequence of the seven gene fragments was ranged from 37.2% (pyrG) to 42.9% (rpoA) with a mean value of 40.05% (Table 3). The dN/dS ratio for the seven gene fragments varied between 0.0245 and 1.6417 (Table 3). Except for gyrB, pyrG, and uvrC, the dN/dS ratio was found to be < 1. The allele frequencies showed that for most of the loci, allele 3 (atpA, pheS, pyrG, rpoA, and uvrC) and allele 1 (atpA, groEL, gyrB) (Fig. 1 and Table 2) were dominant in the population. In addition, the frequency of allele 2 was found to be more mainly for groEL locus. For the measurement of intragenic recombination, a split decomposition analysis was performed on each locus (atpA, gyrB, groEL, pheS, pyrG, rpoA, and uvrC) separately. As shown in split graphs (Fig. 2), the 35 L. mesenteroides strains revealed different structures in the split graphs for all the seven gene segments. In the split graphs, a parallelogram-shaped structure indicative of intergenic recombination in the evolutionary history of the gene was detected in the pheS, pyrG, rpoA, and uvrC loci. However, split graphs for atpA, gyrB, and groEL loci were appeared as tree-like structures, suggesting an absence of recombination (Fig. 2).

Fig. 1
figure 1

Polymorphic nucleotide sites in the seven Leuconostoc mesenteroides MLST genes. Variable sites are shown. The number of strains possessing the allele is indicated in parentheses

Fig. 2
figure 2

Split decomposition analysis of alleles obtained from 35 Leuconostoc mesenteroides strains for seven loci (atpA, gyrB, groEL, pheS, pyrG, rpoA, and uvrC). Numbering in the figure refers to allele numbers

UPGMA tree and clustering analysis based on MLST data

An UPGMA (Unweighted Pair Group Method with Arithmetic Mean) tree showing the genetic relatedness among the L. mesenteroides strains was displayed in Fig. 3. As deduced from the tree, 35 strains were clustered within two major groups, A and B. Group A was composed of 28 strains, included three subgroups (A1, A2, and A3). Subgroups A1, A2, and A3 were consisted of 6, 3, and 19 (11 STs) strains respectively. Group B was investigated into 7 strains that included four subgroups (B1, B2, B3, and B4). Subgroups B1 and B3 consisted of 3 and 2 strains respectively, whereas groups B2 and B4 were comprised of 1 strain individually. ST-3, ST-5, and ST-8 were closely related and clustered together in group A (subgroup A3). Among these, ST-3 and ST-5 were consisted of five different strains, whereas ST-8 was represented by a single strain only. Similarly, ST-13 and ST-23 were clustered in subgroup A3 and were found to be closely related. Phylogenetic analysis of 35 different L. mesenteroides strains based on the allelic profile data was performed using minimum spanning tree algorithm (Fig. 4). In this representation, each colored circle of the MST corresponds to an ST and the size of the circle was proportional to the number of strains with the same allelic profile. Therefore, two large circles (red and green) out of 27, depicting two STs, i.e., ST-3 (red) and ST-5 (green), were consisted of 10 strains out of 35. ST-3 was represented by 11204, 11251, 11357, 11359, 11396 strains and ST-5 was consisted of 11386, 11360, 11364, 11362, 11289 strains. In addition, the remaining 25 colored circles of similar size were represented by a single strain only, thereby completing the 35 strains in total, used in the present study (Table 2). Different circles were connected to each other by different lines representing the degree of relationship among L. mesenteroides strains. If we compare MST and UPGMA tree, the results obtained were supporting each other. However, in the present study, we could not find an appropriate association between ST and the sources from which the strains were isolated (Table 2).

Fig. 3
figure 3

UPGMA dendrogram showing the genetic relationship between 27 STs found in Leuconostoc mesenteroides via MLST typing. The phylogenetic tree was produced using START 2.0 software. Two major groups were designated as A and B

Fig. 4
figure 4

Minimum spanning tree analysis of the 35 Leuconostoc mesenteroides strains from South Korea. Each circle corresponds to a sequence type (ST), and the circle size denotes the number of strains sharing the same ST. The type of line between isolates indicates the strength of the genetic relationship between these isolates (bold—strong relationship, solid—intermediate relationship, and dotted line—weak relationship). The lines (solid, bold, and dashed) are related to genetic similarity, i.e., the number of common alleles between profiles

Discussion

In this study, MLST scheme based on an analysis of seven gene fragments from L. mesenteroides strains was described. The results of the study showed that 27 different genotypes were reported from 35 strains of South Korea. In population genetic studies, PFGE and MLST were considered as the methods of choice (Glaeser and Kämpfer 2014; Weng et al. 2013). MLST is the best method for typing the DNA sequences of selected housekeeping genes of bacterial isolates (Urwin and Maiden 2003).

The represented MLST schemes showed high discriminatory power as it was able to differentiate between highly similar strains. This was represented by strains clustered in ST-3 and ST-5, and other STs which were differed only in the one locus, out of seven used in the study. Only 10 strains were clustered in two different STs, and the rest of the 25 strains were represented by single ST. Similar results were observed in an MLST study of Carnobacterium maltaromaticum (LAB) where 5 STs were characterized by at least two strains while approximately 80% of the STs were represented by one strain only (Rahman et al. 2014). The number of alleles of the seven housekeeping loci ranged from 7 to 12 (Table 2). Our results validated the genetic heterogeneity of this species in a previous study, where the number of alleles varied in between 6 and 11 among the nine studied genes of L. mesenteroides (Zhang et al. 2015). The observed results for GC content corroborated the previously described results of the mean GC content of L. mesenteroides (Zhang et al. 2015). In support, the published GC content for the genus Leuconostoc can vary from 38 to 44% (de Bruyne et al. 2007). Analysis of synonymous and non-synonymous changes in the allele sequences of the loci can be used to determine whether they are subject to positive selection; therefore, a dN/dS ratio > 1 implies selection for the amino acid changes (Tanigawa and Watanabe 2011). By comparison, in an earlier study, the dN/dS ratio obtained from L. mesenteroides for nine loci ranged from 0.0219 to 2.4379 (Zhang et al. 2015). In contrast to earlier studies (Zhang et al. 2015), the ratio for pyrG and uvrC was found to be more than 1 in the present study. Nevertheless, the groEL, pyrG, and uvrC loci were used in the molecular typing of L. mesenteroides (Zhang et al. 2015). Although in most of the studies related to MLST, dN/dS ratio was used to detect the selection pressure. But an assumed gene evolution model was studied from MLST data by computing the dN/dS ratio (Kryazhimskiy and Plotkin 2008). However, it has been verified that the dN/dS ratio is not suitable for deducing selection pressures from single bacterial populations, in which most differences between sequences signify segregating polymorphisms rather than fixed substitutions (Kryazhimskiy and Plotkin 2008). Recombination in the sequences of the housekeeping genes can be evaluated by split decomposition analysis. This provides evidence of intraspecies recombination events which may generate diversity among isolates. Parallelogram-shaped structures have been observed in the split graphs of four (rpoA, pheS, pyrG, and uvrC) housekeeping loci, illustrating recombination had occurred in these loci. Similar parallelogram-shaped structures were also reported in murC, pheS, pyrG, and uvrC split graphs of L. lactis isolates (Dan et al. 2014), and also in the dnaA and murC loci of L. mesenteroides isolates (Zhang et al. 2015). Previously, it has been reported that the recombination could occur in Leuconostoc species because of the presence of mobile elements such as bacteriophages, genomic islands, and transposable elements in their genome sequences (Meslier et al. 2012). Furthermore, plasmids from L. mesenteroides have also been identified (Jeong et al. 2007). From the UPGMA tree analysis based on the MLST data (Fig. 3), we could not find a strong relationship between the ST obtained and the isolation source of the bacteria. Similar results have been reported for L. lactis, Lactococcus lactis, and Lactobacillus sanfranciscensis, where no significant associations between STs and the various sources of the isolates could be found (Dan et al. 2014; Passerini et al. 2010; Picozzi et al. 2010). The absence of an association between STs and the isolation source may be because of the genetic diversity of individual L. mesenteroides strains. The applicability of this technique lies in the evolutionary and population analyses for the estimation of recombination and mutation rates among bacterial population. In addition, the technique can also be utilized to investigate the evolutionary relationships among bacteria belonging to the same genus.

In conclusion, the MLST scheme presented here will be a useful tool for the precise characterization of L. mesenteroides strains, and appears to have sufficient discriminatory power for molecular typing. Phylogenetic analysis divided the 35 strains into two major groups, but it has also been concluded that the evolution of the 27 STs is not related to their origin from food sources. In the future, work will be focused on having more samples of L. mesenteroides to obtain better insight into the evolution of this bacterial species in South Korea.