Introduction

The order Xanthomonadales is an early diverging group of bacteria within the class Gammaproteobacteria (Cutino-Jimenez et al. 2010; Williams et al. 2010; Naushad and Gupta 2013). The order Xanthomonadales currently contains 5 families (viz. Algiphilaceae, Nevskiaceae, Sinobacteraceae, Solimonadaceae, and Xanthomonadaceae) which contain 30 genera encompassing a large number of species that possess a diverse range of phenotypic and biochemical characteristics (Saddler and Bradbury 2005a; Parte 2013). The members of this order include a number of major plant pathogens that have significant economic and agricultural impact. Members of the genera Xylella and Xanthomonas, in particular, are major phytopathogens which cause a wide variety of serious diseases in more than 400 agriculturally important plants including tomatoes, bananas, citrus plants, rice, and coffee plants (da Silva et al. 2002; Van Sluys et al. 2003; Lee et al. 2005; Chatterjee et al. 2008; Salzberg et al. 2008; Ryan et al. 2011). The order also contains the genus Stenotrophomonas which harbours a number of increasingly important multidrug resistant opportunistic pathogens that are responsible for hospital-acquired infections in immunodeficient patients (Crossman et al. 2008; Looney et al. 2009). Despite the important plant and human pathogens present within this order, the taxonomy of this group is highly problematic and no comprehensive phylogenetic studies have been completed that focus specifically on the interrelationships of the different members within the order Xanthomonadales (Gao et al. 2009; Cutino-Jimenez et al. 2010; Williams et al. 2010; Naushad and Gupta 2013; Tindall 2014b).

The current taxonomy of the order Xanthomonadales is largely based on 16S rRNA sequence analysis (Saddler and Bradbury 2005a; Gutierrez et al. 2012; Losey et al. 2013). However, the 16S rRNA gene sequence has shown limited ability to resolve the branching and relationships of organisms within the order Xanthomonadales (Zhou et al. 2008; Cutino-Jimenez et al. 2010; Yilmaz et al. 2013). Phylogenetic trees based on the 16S rRNA gene sequence often do not resolve a monophyletic cluster of all Xanthomonadales; the most divergent members of the order often branch separately from the majority of the species within the group (Yilmaz et al. 2013). Apart from the 16S rRNA sequence, no biochemical, morphological or physiological characteristics are known which distinguish the order Xanthomonadales from all other bacteria or the families and major phylogenetic clusters within the order from each other (Saddler and Bradbury 2005a; Gutierrez et al. 2012; Losey et al. 2013). Thus, it is of interest to identify shared characteristics that can clearly elucidate the evolutionary relationships within this highly diverse group of organisms and form the basis for a coherent taxonomic framework of the order.

Whole genome sequences for members of the order Xanthomonadales provide a rich resource for the discovery of molecular characteristics which are unique to evolutionarily related organisms (Gao et al. 2009; Cutino-Jimenez et al. 2010; Naushad and Gupta 2013). One useful type of shared molecular characteristic that has been a focus of recent research are Conserved Signature Indels (CSIs), which are insertions/deletions uniquely present in protein sequences from a group of evolutionarily related organisms (Gupta 2010; Gao and Gupta 2012b; Gupta and Lali 2013; Gupta et al. 2013; Gupta 2014). Due to the specificity of CSIs for particular groups of bacteria, they represent molecular synapomorphies (markers of common evolutionary decent) which can be used to identify and demarcate specific bacterial groups in clear molecular terms (Gupta 1998, 2010). We have previously carried out comparative genomic analysis of a limited number of members from the order Xanthomonadales in which we identified a large number of CSIs in diverse proteins that were uniquely present in all analyzed members of the order or a subgroup of the Xanthomonadales (Naushad and Gupta 2013). In this work, we have extended these studies, by carrying out detailed phylogenomic and comparative genomic analyses on a greatly expanded dataset on members of the order Xanthomonadales which includes 43 genomes from the NCBI, JGI, and EzBioCloud genome databases and 5 additional Xanthomonadales genomes, which we have sequenced de novo, representing 2 families, 20 genera, and 42 named species. Our analyses have identified no phylogenetic support for a monophyletic grouping of all sequenced members the order Xanthomonadales and no CSIs were identified which are uniquely shared by all sequenced species within the order Xanthomonadales suggesting that the order Xanthomonadales does not represent a single monophyletic lineage. Additionally, we have identified 31 CSIs which are either specific to the family Xanthomonadaceae or to one of its subgroups which demarcate these groups in molecular terms. On the basis of the identified CSIs and the results of phylogenomic analyses, we propose a new taxonomic framework for the order Xanthomonadales. In this proposal, the families Algiphilaceae and Solimonadaceae (Nevskiaceae), which do not branch with the other members of the order Xanthomonadales, are transferred into the order Nevskiales ord. nov, along with their closest evolutionary neighbour, the family Salinisphaeraceae fam. nov. The remaining members of the order Xanthomonadales are divided into two families: the family Xanthomonadaceae, containing the genus Xanthomonas and its closest relatives, and a new family, Rhodanobacteraceae fam. nov., containing the genus Rhodanobacter and its closest relatives. Lastly, the descriptions of the order Lysobacterales, the family Lysobacteraceae, and the family Nevskiaceae are emended to indicate that they are earlier synonyms of the order Xanthomonadales, the family Xanthomonadaceae, and the families Solimonadaceae and Sinobacteraceae, respectively.

Methods

DNA extraction and genome sequencing

Five Xanthomonadales isolates were sequenced de novo in this study; Dyella japonica DSM 16301T (Genbank accession number JPLA00000000), Luteibacter rhizovicinus DSM 16549T (JPLB00000000), Thermomonas brevis DSM 15422T (JPLC00000000), Xanthomonas hyacinthi DSM 19077T (JPLD00000000), and Xanthomonas pisi DSM 18956T (JPLE00000000). The isolates were obtained from the German Collection of Microorganisms and Cell Cultures (Leibniz-Institut DSMZ). The isolates were grown for 24 h under the growth conditions described in Supplemental Table 1. Genomic DNA was extracted using a CTAB based DNA extraction methodology (Wilson 1987) with specific modifications for Xanthomonadales described by Jaufeerally-Fakim and Dookun (2000). The DNA samples were diluted to 0.2 ng/µl and standard Illumina multiplex libraries were generated using the Nextera XT DNA Sample Prep Kit. The fragment size distribution of each library was verified using the Agilent High Sensitivity DNA Kit. Sequencing was performed using an Illumina HiSeq 2000 and 150 bp paired end reads were generated. Genomes were assembled using CLC Genomics Workbench 7.0.4 with default de novo assembly parameters and trimmed for contamination using the UniVec vector database (Build 8.0) (Table 1).

Table 1 Characteristics of the Xanthomonadales genomes used for phylogenetic analysis

Phylogenetic sequence analysis

Phylogenetic analysis was performed on a concatenated sequence alignment of 15 highly conserved housekeeping proteins (viz. dimethyladenosine transferase, alanyl-tRNA synthetase, arginyl-tRNA synthetase, chaperone protein DnaK, signal recognition particle-docking protein FtsY, chaperonin GroL, DNA gyrase subunit A, DNA gyrase subunit B, ATP-dependent DNA helicase UvrD, valyl-tRNA synthetase, Isoleucyl-tRNA synthetase, DNA polymerase I, SecA, RpoB, and RpoC) which have been widely used for phylogenetic analysis (Kyrpides et al. 1999; Charlebois and Doolittle 2004; Ciccarelli et al. 2006). Sequences for these proteins were obtained from the NCBI and JGI-IMG genome databases for strains of all Xanthomonadales and a representative selection of outgroup Gammaproteobacteria (which included members from the orders Aeromonadales, Alteromonadales, Cardiobacteriales, Chromatiales, “Enterobacteriales”, Legionellales, Methylococcales, Oceanospirillales, Pasteurellales, Pseudomonadales, “Salinisphaerales”, Thiotrichales, and “Vibrionales”) and Betaproteobacteria. Sequences for these proteins were also obtained from the five Xanthomonadales genomes which we have sequenced in this work and the genome of Riemerella anatipestifer, which was used to root the tree. Multiple sequence alignments for these proteins were created using Clustal_X 1.83 (Jeanmougin et al. 1998) and concatenated into a single alignment file. Poorly aligned regions from this alignment file were removed using Gblocks 0.92 (Castresana 2000). The resulting alignment, which contained 6995 aligned amino acids, was used for phylogenetic analysis. The maximum-likelihood tree based on 100 bootstrap replicates of this alignment was constructed using MEGA 5.2 (Tamura et al. 2011) employing the Whelan and Goldman substitution model.

A 16S rRNA gene sequence based phylogenetic tree was also created based on 197 sequences that included representative strains of all cultured Xanthomonadales genera. 16S rRNA gene sequences larger than 1,200 bp were obtained for all strains used in our concatenated protein based phylogenetic tree and all type strains classified under the order Xanthomonadales in the Ribosomal Database Project (Cole et al. 2014). A maximum-likelihood tree based on these sequences was created using 100 bootstrap replicates of the 16S rRNA sequence alignments in MEGA 5.2 (Tamura et al. 2011) employing the General Time-Reversible (Tavaré 1986) substitution model.

Identification and assessment of specificity of conserved signature indels

Identification of CSIs that are commonly shared by members of the Xanthomonadaceae was carried out as described by Naushad and Gupta (2013). Briefly, for the identification of CSIs, BLASTp searches were performed on each protein in the genome of Rhodanobacter fulvus Jip2. These searches were performed using the default BLAST parameters against all available sequences in the GenBank non-redundant database. For those proteins for which high scoring homologs (E values < 1e−20) were present in other species from the Xanthomonadales, multiple sequence alignments were created using the Clustal_X 1.83 program (Jeanmougin et al. 1998). These alignments were visually inspected for the presence of insertions or deletions that were flanked on both sides by at least 5–6 conserved amino acid residues in the neighbouring 30–40 amino acids. Indels that were not flanked by conserved regions were not further considered, as they do not provide useful molecular markers. To assess the specificity of the indels we identified here and to reassess the specificity of the indels identified in our previous work, we carried out detailed BLASTp and tBLASTn searches against both the NCBI and JGI-IMG genome databases using as query short sequence segments containing the indel and the flanking conserved regions (60–100 amino acids long). Local tBLASTn searches were also completed on the indel containing regions for genomes of Xanthomonadales organisms missing from the NCBI and JGI-IMG genome databases. To ensure that the identified signatures are only present in Xanthomonadales homologues, the 250 BLAST hits with the highest similarity to the query sequence were examined for the presence or absence of these CSIs. Signature files were created and formatted using the programs Sig_Create and Sig_Style (accessible from Gleans.net) as described by Gupta (2014). In this work, we report the results of CSIs that are specific for different groups within the Xanthomonadales and where similar CSIs were not observed in any other bacteria in the top 250 BLAST hits. Due to space constraints, the sequence alignment files presented here contain sequence information for a limited number of species within the order Xanthomonadales and a representative selection of outgroup species. However, in each case, all members of the order and outgroups exhibited similar sequence characteristics to the representatives.

Results

Phylogenetic analysis

The current understanding of the evolutionary relationships of the Xanthomonadales is based largely on analyses of the 16S rRNA gene (Saddler and Bradbury 2005a; Gutierrez et al. 2012; Losey et al. 2013). In past studies, the 16S rRNA gene sequence has shown limited ability to resolve some of the phylogenetic relationships of organisms within the order Xanthomonadales (Zhou et al. 2008; Yilmaz et al. 2013). Phylogenetic trees based on multiple conserved genes/proteins have been shown to provide greater resolving power than those based on any single gene or protein (Rokas et al. 2003; Wu et al. 2009). Thus, we have constructed a highly resolved phylogenetic tree of the Xanthomonadales based on a concatenated set of 15 housekeeping and ribosomal proteins (Fig. 1). In this concatenated protein based phylogenetic tree a majority of the members of the Xanthomonadales formed a well-supported monophyletic clade which branched as an outgroup of the other members of the Gammaproteobacteria. The members of the order Xanthomonadales formed two distinct and well-supported main monophyletic clades: one clade consisting of members of the family Xanthomonadaceae and another clade consisting of the family Solimonadaceae (including the genera Nevskia and Hydrocarboniphaga) and the species Salinisphaera shabanensis, a member of the family “Salinisphaeraceae”. The Xanthomonadaceae clade contained two smaller clades that were well-supported by bootstrap analysis. The first of these clades, contained the genera Xanthomonas, Xylella, Stenotrophomonas, Lutimonas, Lysobacter and their relatives (Clade 1) while the second clade contained the genera Rudaea, Dylella, Luteibacter, Rhodanobacter and Frateuria (Clade 2). Two members of the Xanthomonadales, Wohlfahrtiimonas chitiniclastica and Ignatzschineria larvae, branched separately from the rest of the order, in a well-supported clade with members of the order Cardiobacteriales, another early diverging group within the class Gammaproteobacteria.

Fig. 1
figure 1

A maximum-likelihood phylogenetic tree of the order Xanthomonadales, other Gammaproteobacteria, and Betaproteobacteria based on the concatenated amino acid sequences of 25 conserved proteins. Bootstrap values are shown at branch nodes. The major groups within the order Xanthomonadales as well as the related taxa, Salinisphaeraceae and Cardiobacteriales, are indicated

We have also produced a phylogenetic tree based on the 16S rRNA gene which contains representative species of Xanthomonadales that encompass all of the currently named genera (Fig. 2). The 16S rRNA based phylogenetic tree exhibited broadly similar branching to our concatenated protein based phylogenetic tree. In the 16S rRNA gene tree, the families Xanthomonadaceae and Solimonadaceae did not form a monophyletic clade and were separated by a large number of organisms. The family Xanthomonadaceae was divided into two well-supported clades which were analogous to the clades found in our concatenated protein based phylogenetic tree (Clades 1 and 2). In the 16S rRNA gene tree, the family Solimonadaceae (including the genera Nevskia, Hydrocarboniphaga, and Alkanibacter) branched with the family Algiphilaceae, another disparate group within the Xanthomonadales, and the genus Steroidobacter, which is currently recognized as a member of the Xanthomonadaceae. As in our concatenated protein based phylogenetic tree, Solimonadaceae and the other disparate members of the Xanthomonadales showed an association with the members of the genus Salinisphaera, the sole members of the family “Salinisphaeraceae”. Additionally, in the 16S rRNA based phylogenetic tree, the genera Wohlfahrtiimonas and Ignatzschineria, which branched with the order Cardiobacteriales in our concatenated protein based phylogenetic tree, formed a weakly supported monophyletic group with the other members of the family Xanthomonadaceae. However, Wohlfahrtiimonas and Ignatzschineria were well separated from the other Xanthomonadaceae by a long branch.

Fig. 2
figure 2

A maximum-likelihood tree based on the 16S rRNA gene sequences of representative strains of all named Xanthomonadales species. Bootstrap values are shown at branch nodes. The major groups within the order Xanthomonadales as well as the related taxon, Salinisphaeraceae, are indicated

Conserved signature indels

CSIs that are restricted to a group of related species are a novel class of molecular marker with high utility for evolutionary studies (Gupta 1998; Rokas and Holland 2000; Gupta 2010; Gao and Gupta 2012a; Gupta 2014). Recently, CSIs have been used to define novel taxonomic groups and to propose important taxonomic changes for various groups of bacteria (viz. Spirochaetes, Aquificae, Neisseriales, and Bacillus) at different taxonomic ranks (Adeolu and Gupta 2013; Bhandari et al. 2013; Gupta and Lali 2013; Gupta et al. 2013). We have recently reported a comparative genomic analysis on a limited number of members of the order Xanthomonadales in which we identified a large number of CSIs in diverse proteins that were uniquely present in all available members of the order or different phylogenetic groups within the order and absent in homologs from all other bacterial groups (Naushad and Gupta 2013). However, the genomes analyzed in our previous study were all from members of one family within the Xanthomonadales, the family Xanthomonadaceae, and did not include any of the more divergent species within the order whose phylogenetic placement is less clear. In this work, we have reassessed the specificity of these previously identified CSIs for a large number of additional Xanthomonadales, including five strains which we have sequenced, de novo, covering a vast majority of the diversity within the order and thereby have identified 31 CSIs which are either specific to the family Xanthomonadaceae or to one of its subgroups and absent in all other sequenced bacterial groups.

Of the 31 CSIs described in this work, none were present in all members of the order Xanthomonadales. All of the CSIs identified in our previous study of the Xanthomonadales (Naushad and Gupta 2013) were found to be specific to only the family Xanthomonadaceae or one of its subgroups. Of the 31 CSIs identified, 10 were uniquely found in all or most members of the Xanthomonadaceae, except Wohlfahrtiimonas and Ignatzschineria, and absent in organisms from all other sequenced bacterial groups. One example of a CSI uniquely present in members of the Xanthomonadace is shown in Fig. 3. In the example, an 18 aa insertion in a conserved region of glutaminyl t-RNA synthetase is uniquely present in all members of the Xanthomonadace, except Wohlfahrtiimonas and Ignatzschineria, but it is not found in sequences from any other bacterial group. Sequence information for the 9 other CSIs specific for all members of the Xanthomonadace, except Wohlfahrtiimonas and Ignatzschineria are presented in Supplemental Figs. 1–9 and a summary of all 10 Xanthomonadace specific CSIs is presented in Table 2A. Our analyses have also identified 10 CSIs which were found to be unique molecular characteristics of most members of Clade 1 of the Xanthomonadaceae. Two examples of such CSIs are presented in Fig. 4. One CSI, a 4 aa insert in DNA polymerase III subunit alpha, is present in all members of Clade 1 of the Xanthomonadaceae except Silanimonas lenta (Fig. 4a), while the other CSI, a 4 aa insert in the protein protoheme IX farnesyltransferase, is uniquely present in all members of Clade 1 of the Xanthomonadaceae except the early branching genera Arenimonas and Silanimonas (Fig. 4b). Sequence information for the other identified CSIs specific to the members of Clade 1 of the Xanthomonadaceae are presented in Supplemental Figs. 10–17 and summarized in Table 2B.

Fig. 3
figure 3

A partial sequence alignment of the protein Glutaminyl t-RNA synthetase, showing a CSI (boxed) that is uniquely present in all members of the order Xanthomonadales. Sequence information for only representative Xanthomonadales and a limited number other bacteria is shown here. However, unless otherwise indicated, similar CSIs were present in all members of the indicated group and not detected in any other bacterial species in the top 250 BLAST hits. The dashes (-) in the alignments indicate identity with the residue in the top sequence. GenBank identification (GI) numbers for each sequence are indicated in the second column. Sequence information for 10 other CSIs that are specific for all sequenced Xanthomonadales is provided in Supplemental Figs. 1–9 and Table 2A

Table 2 Conserved signature indels that are specific for different groups of Xanthomonadales
Fig. 4
figure 4

Partial sequence alignments of a DNA polymerase III subunit alpha showing a 4 amino acid insertion (boxed) identified in all members of Clade 1 of the Xanthomonadaceae except Silanimonas lenta b the protein Protoheme IX farnesyltransferase showing a 4 amino acid insertion (boxed) identified in all members of Clade 1 of the Xanthomonadaceae except the genera Arenimonas and Silanimonas. Due to space constraints, sequence information for only representative Xanthomonadales and a limited number other bacteria is shown here, but similar CSIs were present in all members of the indicated group and not detected in any other bacterial species in the top 250 BLAST hits. Sequence information for other CSIs showing similar group specificities are presented in Supplemental Figs. 10–17 and summarized in Tables 2B

Our analyses have also identified 11 CSIs that were specifically found in diverse proteins from members of Clade 2 of the Xanthomonadaceae, 7 of which were uniquely found in all members of Clade 2 of the Xanthomonadaceae except the early branching genus Rudaea. An example of a CSI specifically found in all members of Clade 2 of the Xanthomonadaceae is shown in Fig. 5a. In this CSI a 1 aa insert in the protein uridylyltransferase is shown to be found in all members of Clade 2 of the Xanthomonadaceae and absent in all other Xanthomonadales and all other bacterial groups. Another CSI, a 4 aa insert in the protein CDP-diacylglycerol–glycerol-3-phosphate 3-phosphatidyltransferase, specifically found in all members of Clade 2 of the Xanthomonadaceae except Rudaea cellulosilytica is shown in Fig. 5b. Sequence information for the other identified CSIs specific to the members of Clade 2 of the Xanthomonadaceae are presented in Supplemental Figs. 18–26 and summarized in Table 2C and D. Our analyses have not identified any CSIs uniquely found in all of the disparate members of the Xanthomonadales or uniquely shared by the genera Wohlfahrtiimonas and Ignatzschineria and the rest of the Xanthomonadaceae.

Fig. 5
figure 5

Partial sequence alignments of a the protein Uridylyltransferase showing a 1 amino acid insertion (boxed) identified in all members of Clade 2 of the Xanthomonadaceae b the protein CDP-diacylglycerol–glycerol-3-phosphate 3-phosphatidyltransferase showing a 4 amino acid insertion (boxed) identified in all members of Clade 2 of the Xanthomonadaceae except the early branching genus Rudaea. Sequence information for only representative Xanthomonadales and a limited number other bacteria is shown here, but similar CSIs were not detected in any other bacterial species in the top 250 BLAST hits. Sequence information for the other CSIs specific to the Clade 2 Xanthomonadaceae are presented in Supplemental Figs. 18–26 and summarized in Table 2C and D

Discussion

The current phylogeny of the order Xanthomonadales is based largely on the analysis of 16S rRNA gene sequences (Saddler and Bradbury 2005a; Gutierrez et al. 2012; Losey et al. 2013). However, the 16S rRNA gene based phylogenies exhibit limited support for a single monophyletic clade consisting of all members of the Xanthomonadales (Yilmaz et al. 2013; Fig. 2). The current taxonomy of the order Xanthomonadales is not concordant with 16S rRNA gene based phylogenies of the members of the order and the nomenclature of the order Xanthomonadales and a majority of the family names within this order are problematic and not in accordance with the International Code of Nomenclature of Bacteria (Oren 2010; Yilmaz et al. 2013; Tindall 2014a, b). However, apart from the 16S rRNA gene, no reliable morphological, biochemical, or molecular characteristics are known that are specifically shared by all members of this order or its distinct subgroups and can be used for their demarcation and classification (Saddler and Bradbury 2005a; Gutierrez et al. 2012; Losey et al. 2013). In this work, we have completed a robust phylogenetic analysis of the order Xanthomonadales and have utilized comparative genomic techniques to identify large numbers of novel molecular markers of common evolutionary decent (CSIs) shared by subgroups within the Xanthomonadales. The CSIs identified in this work both supplement gene based phylogenies and demarcate the groups within the Xanthomonadales in more definitive molecular terms. A summary diagram of the identified CSIs and the species in which they are found is shown in Fig. 6.

Fig. 6
figure 6

A summary of the evolutionary relationships of the Xanthomonadales genera based upon phylogenetic analyses and the identified CSIs. Genera with genome sequenced members are indicated with asterisks (*). The distribution of the identified CSIs and the proposed reclassification of taxonomic groups are indicated. The genera Wohlfahrtiimonas and Ignatzschineria do not branch with the members of the order Xanthomonadales and hence are regarded as order incertae sedis. The families Sinobacteraceae and Solimonadaceae are synonymous; however, only the name of the family Solimonadaceae is shown here (1). The placement of the genus Steroidobacter within the family Solimonadaceae (Nevskiaceae) is tentative until a more detailed phylogenetic analysis can be completed for this genus (2)

The results of phylogenetic analyses presented here do not support a monophyletic grouping of the members of the order Xanthomonadales and no CSI was identified that is uniquely shared by all members of this order. In phylogenetic trees, members of the order Xanthomonadales formed two main clades, one grouping together most of the members from the family Xanthomonadaceae, whereas the other clade was comprised of members from the families Algiphilaceae (containing the genus Algiphilus) and Solimonadaceae (containing the genera Fontimonas, Singularimonas and Solimonas) and genera related to these two families (viz. Alkanibacter, Hydrocarboniphaga, Nevskia and Steroidobacter). The lack of any identified CSIs or a consistent phylogenetic relationship between these two clades suggests that they may represent distinct evolutionary lineages within the Gammaproteobacteria. Additionally, in both our concatenated protein tree and the 16S rRNA gene tree, members of the clade containing Algiphilaceae and Solimonadaceae families consistently grouped with the members of the genus Salinisphaera, the sole members of the family “Salinisphaeraceae”, suggesting that the species from these groups may share a common ancestor exclusive of the Xanthomonadaceae and other Gammaproteobacteria. The genera Wohlfahrtiimonas and Ignatzschineria branch distinctly from the Xanthomonadales in a clade with members of the order Cardiobacteriales in our concatenated protein based phylogenetic tree and show limited phylogenetic association with the other members of the Xanthomonadales in our 16S rRNA tree. Due to this inconsistent branching, further research will be required to accurately assess the phylogenetic placement of the genera Wohlfahrtiimonas and Ignatzschineria, but the available data suggests that they do not belong to the order Xanthomonadales sensu stricto.

Our work has identified 10 CSIs that support a monophyletic grouping of a majority of the members of the order Xanthomonadales that are currently part of the family Xanthomonadaceae. These CSIs were initially identified in our earlier comparative genomic study (Naushad and Gupta 2013) and the sequence information for them was updated in the present work for a large number of additional Xanthomonadales, including 5 genomes which were sequenced, de novo, for this study. Our earlier work identified 13 CSIs which were specific to this group (Naushad and Gupta 2013). Of these, all but 3 CSIs were found to be still specific to the whole group, while the remaining three CSIs were found to be specific for subsets of this large group. The observed specificity of the previously identified CSIs for a distinct bacterial group despite a large increase in the number of analyzed genomes strongly indicate that they constitute reliable molecular characteristics with predictive ability for distinguishing and demarcation of evolutionarily related bacterial groups.

Our work also provides strong molecular and phylogenetic support for the existence of two distinct clades within the Xanthomonadaceae: One clade consists of the genera Xanthomonas, Xylella, Stenotrophomonas, Lutimonas, Lysobacter and their relatives (Clade 1), whereas the other clade groups together members of the genera Rudaea, Dylella, Lutibacter, Rhodanobacter and their relatives (Clade 2). The members of these monophyletic clades branch distinctly from each other with strong bootstrap support in both the concatenated protein tree as well as in the 16S rRNA gene trees. Importantly, Clade 1 and Clade 2 are also supported by 10 and 11 identified CSIs, respectively, which serve to clearly distinguish them from each other and every other bacterial group in molecular terms. This evidence suggests that the Clade 1 and Clade 2 represent two phylogenetically and molecularly distinguishable evolutionary lineages.

Taxonomic implications

Based on the branching of the members of the order Xanthomonadales in the concatenated protein and 16S rRNA gene trees and the large number of identified molecular markers (CSIs) that are specific for this group of bacteria, the following main inferences regarding the phylogeny of the Xanthomonadales can be derived.

  1. (1)

    The order presently designated as Xanthomonadales contains 2 highly divergent phylogenetic groups, one made up of the members of the family Xanthomonadaceae and the other made up of the members of the families Algiphilaceae, Solimonadaceae, and “Salinisphaeraceae”

  2. (2)

    The family presently designated as Xanthomonadaceae, which harbours a majority of the members from the order Xanthomonadales, also contains 2 distinct and distinguishable phylogenetic groups, one consisting of the genera Xanthomonas, Xylella, Stenotrophomonas, Lutimonas, Lysobacter and their relatives (Clade 1) and another clade consisting of the genera Rudaea, Dylella, Lutibacter, Rhodanobacter and their relatives (Clade 2)

Thus, the current taxonomy of the order Xanthomonadales does not accurately reflect the evolutionary histories of its members which exhibit enormous genetic diversity. In order to alleviate the taxonomic incongruences within the order Xanthomonadales, we propose that the families Algiphilaceae (containing the genus Algiphilus) and Solimonadaceae (or Nevskiaceae) (containing the genera Fontimonas, Singularimonas and Solimonas) and genera related to these two families (viz. Alkanibacter, Hydrocarboniphaga, Nevskia and Steroidobacter), which do not branch with the other members of the order Xanthomonadales, be placed within a novel order, Nevskiales ord. nov., along with their closest evolutionary relatives, the members of the family Salinisphaeraceae fam. nov. Further, to recognize the presence of two distinct groups within the family presently designated as Xanthomonadaceae, the members of this family should be divided into two families: the family Xanthomonadaceae (containing the genera Arenimonas, Luteimonas, Lysobacter, Metallibacterium, Panacagrimonas, Pseudoxanthomonas, Silanimonas, Stenotrophomonas, Thermomonas, Xanthomonas, and Xylella) and a novel family, Rhodanobacteraceae fam. nov. (containing the genera Aquimonas, Chiayiivirga, Dokdonella, Dyella, Frateuria, Fulvimonas, Luteibacter, Pseudofulvimonas, Rhodanobacter, and Rudaea). The remaining two genera, Wohlfahrtiimonas and Ignatzschineria, whose taxonomic affiliation to the above two orders is not supported should be regarded as order incertae sedis.

Additionally, the present proposal also serves to help rectify several problems associated with the nomenclature of the order Xanthomonadales, the family Xanthomonadaceae, and the family Solimonadaceae (Tindall 2014b). It has been noted previously (Oren 2010; Tindall 2014a, b) that the names of these taxa are later synonyms for the order Lysobacterales, the family Lysobacteraceae, and the family Nevskiaceae, respectively. In recognition of these nomenclatural concerns, we are providing emended descriptions of the order Lysobacterales, the family Lysobacteraceae, and the family Nevskiaceae, which indicate that they are earlier synonyms of the order Xanthomonadales, the family Xanthomonadaceae, and the family Solimonadaceae, respectively. Descriptions of Rhodanobacteraceae fam. nov., Nevskiales ord. nov., and Salinisphaeraceae fam. nov. and emended descriptions of the order Lysobacterales (Xanthomonadales), the family Lysobacteraceae (Xanthomonadaceae), and the family Nevskiaceae (Solimonadaceae) are provided below.

Emended description of the order Lysobacterales Christensen and Cook (1978) (Approved Lists 1980)

Synonym: Xanthomonadales Saddler and Bradbury (2005a, b).

The order contains two families, Lysobacteraceae and Rhodanobacteraceae. Organisms are rods, 0.2–1.8 µm in diameter and 0.8–70 µm in length. Cells are both motile and non-motile. Organisms are aerobic, or facultatively anaerobic. Organisms are chemoorganotrophic and non-spore-forming. Organisms within this order may be either positive or negative in both oxidase and catalase tests. The G + C content of the DNA is 42–75 (mol%).The type genus of the order is Lysobacter Christensen and Cook (1978) (Approved Lists 1980) (Skerman et al. 1980) emend. Park et al. (2008).

Organisms from this order are distinguished from all other bacteria examined to date by 10 conserved signature indels in Glutaminyl t-RNA synthetase, GTP-binding protein, Queuine tRNA-ribosyltransferase, Lipoyl synthase, Lysyl-tRNA synthetase, Dihydroorotate dehydrogenase, Carbamoyl phosphate synthase large subunit, Aspartate aminotransferase, DNA polymerase I, and DNA topoisomerase IV subunit B (Tables 2A).

Emended description of the family Lysobacteraceae Christensen and Cook (1978) (Approved Lists 1980)

Synonym: Xanthomonadaceae Saddler and Bradbury (2005a, b).

The family contains twelve genera, Arenimonas, Luteimonas, Lysobacter, Metallibacterium, Panacagrimonas, Pseudoxanthomonas, Silanimonas, Stenotrophomonas, Thermomonas, Xanthomonas and Xylella. Organisms are rods, 0.2–1.8 µm in diameter and 0.8–70 µm in length. Cells are both motile and non-motile. Organisms are aerobic, or facultatively anaerobic. Organisms are chemoorganotrophic and non-spore-forming. Organisms within this family may be either positive or negative in both oxidase and catalase tests. The G+C content of the DNA is 42–70 (mol%). The type genus of the family is Lysobacter Christensen and Cook (1978) (Approved Lists 1980) emend. Park et al. (2008).

Organisms from this order are distinguished from all other bacteria examined to date by 10 conserved signature indels in DNA polymerase III subunit alpha, Uroporphyrinogen decarboxylase, DNA polymerase I, Coproporphyrinogen III oxidase, tRNA isopentenyltransferase, Protoheme IX farnesyltransferase, Ribose-5-phosphate isomerase A, Aspartyl-tRNA synthetase, 2-oxoglutarate dehydrogenase E1, and Asparagine synthetase B (Tables 2B).

Description of Rhodanobacteraceae fam. nov

Rhodanobacteraceae (Rho.da.no.bac.ter.a,ce’ae N.L. masc. n. Rhodanobacter type genus of the family; -aceae ending to denote a family; N.L. fem. pl. n. Rhodanobacteraceae the family whose nomenclatural type is the genus Rhodanobacter).

The family contains nine genera, Aquimonas, Dokdonella, Dyella, Frateuria, Fulvimonas, Luteibacter, Pseudofulvimonas, Rhodanobacter and Rudaea. Organisms are rods, 0.3–0.5 µm in diameter and 1–4.5 µm in length. Cells are both motile and non-motile. Organisms are aerobic, chemoorganotrophic, and non-spore-forming. Organisms within this family may be either positive or negative in both oxidase and catalase tests. The G + C content of the DNA is 62–75 (mol%). The type genus of the family is Rhodanobacter Nalin et al. (1999).

Organisms from this order are distinguished from all other bacteria examined to date by 11 conserved signature indels in Uridylyltransferase, a xanthomonadin exporter protein, a signal peptidase, Tryptophan synthase subunit alpha, CDP-diacylglycerol–glycerol-3-phosphate 3-phosphatidyltransferase, Protease tldD, S-adenosylmethionine decarboxylase, DEAD/DEAH box helicase, F0F1 ATP synthase subunit gamma, Proline aminopeptidase P II, and Glycosyl transferase (Table 2C, D).

Description of Nevskiales ord. nov

Nevskiales (Nev.ski.a’les. N.L. fem. n. Nevskia type genus of the order; -ales ending to denote an order; N.L. fem. pl. n. Nevskiales the order whose nomenclatural type is the genus Nevskia).

The order contains three families, Algiphilaceae, Salinisphaeraceae, and Nevskiaceae. Organisms are rods and cocci, 0.6–1.3 µm in diameter and 0.4–2 µm in length. Cells are non-motile or motile by means of a one or more polar flagella. Organisms are aerobic, or facultatively anaerobic. Organisms are chemoorganotrophic and non-spore-forming. Oxidase and catalase positive. The G+C content of the DNA is 60–68 (mol%). The type genus of the order is Nevskia Famintzin 1892 (Approved Lists 1980).

Emended Description of the family Nevskiaceae Henrici and Johnson 1935 (Approved Lists 1980)

Synonyms: Sinobacteraceae Zhou et al. (2008), Solimonadaceae Losey et al. (2013).

The family contains six genera, Alkanibacter, Fontimonas, Hydrocarboniphaga, Nevskia, Solimonas and Steroidobacter.Footnote 1 Organisms are rods, 0.6–0.85 µm in diameter and 0.9–2 µm in length. Cells are non-motile or motile by means of a single polar flagellum. Organisms are aerobic, or facultatively anaerobic. Organisms are chemoorganotrophic and non-spore-forming. Oxidase and catalase positive. The G + C content of the DNA is 60–65 (mol%). The type genus of the family is Nevskia Famintzin 1892 (Approved Lists 1980).

Description of Salinisphaeraceae fam. nov

Salinisphaeraceae (Sa.li.ni.sphae.ra.ce’ae. N.L. fem. n. Salinisphaera type genus of the family; -aceae ending to denote a family; N.L. fem. pl. n. Salinisphaeraceae the family whose nomenclatural type is the genus Salinisphaera).

The family contains one genus, Salinisphaera, which is also the type genus of the family. The description of the family is the same as that of the genus Salinisphaera Antunes et al. (2003) emend. Shimane et al. (2013).