Introduction

Pantoea was described in the previous decade and has been proposed to include the genus Erwinia and some species of Enterobacter. Pantoea, a Gram-negative, rod-shaped bacterium, is non-encapsulated and produces dark yellow colonies; this bacterium is mobile and has been widely identified and isolated from animals, soils, aquatic environments [1]. Pantoea species grow at an optimum temperature range of 28–30 °C; the DNA GC-content ranges from 52.7 to 60.6% [2]. Pantoea identification is primarily based on phenotypic characters that include microanatomy as well as biochemical and genotypic assays [3]. Among the twenty-four species of Pantoea described, five species have been reported as phytopathogens. These phytopathogens impact at least 31 crop plants that could result in substantial global agricultural losses [4]. In addition, Pantoea species causes human infections [5], mainly in immunocompromised individuals, resulting in wound, blood or urinary-tract infections [6, 7]. Pantoea susceptibility to first-line antibiotics is variable and exhibits a high prevalence of multidrug resistance [8]. Growing bacterial anti-microbial resistance is a serious public health problem worldwide [9]. This pathogen resistance to antibiotics has motivated researchers to examine alternative treatments for bacterial infections.

Bacteriophages represent an effective therapy to overcome antibiotic-resistant bacteria; customized phages are now used not only for medical purposes [10], clinical trials [11], but also in aquaculture [12], the food processing industry, and agriculture [13]. Alternative therapies to antibiotics aim to find new tools with fewer risks in human, animals, and plants.

Bacteriophages can play an important role in plant and animal pathogens management. Phages have been shown effective in reducing pathogenic bacteria in wastewater effluents [14]. The efficiency of phage therapy is due to high specificity for targeted bacterial pathogens.

This research focused on the isolation, genome characterization, and comparative genomic analysis of a virulent phage Pd_C23, the first isolated bacteriophage infecting the bacteria P. dispersa.

Materials and Methods

Bacterial Strain Identification

P. dispersa was isolated in January 2014 from the wastewater treatment plant (WWTP) of Charguia, Tunis City (Tunisia). A 5-L sample of wastewater was collected in sterilized glass bottles. The physicochemical characteristics of the strain sample of P. dispersa are as follows: COD (185 ± 8.1 mg O2/L), BOD (62 ± 4.2 mg O2/L) and MES (14 ± 2.4 mg/L), PO4–P (16 ± 2.3 mg/L), NH4–N (1.51 ± 0. 2 mg/L), NO3–N (3.4 ± 1.1 mg/L) pH (7.3 ± 0.4). A monoclonal culture was produced from each colony after plating and incubating the wastewater sample for 24 h, at 37 °C. An observation with a JEOL transmission electron microscope allowed further characterization for the isolated strain to determine size. The "Boiling" method was adopted to extract the chromosomal DNA [15]. The obtained sequences were compared to the nucleotide sequences available on the National Center for Biotechnology Information (NCBI) databases (http://www.ncbi.nlm.nih.gov/) using the Basic Local Alignment Search Tool (BLAST) and Ribosomal Database Project (RDP): https://rdp.cme.msu.edu/seqmatch/seqmatch.

Phage Isolation and Purification

P. dispersa phage was isolated in 2014, from a sample of 5 L of secondary wastewater collected from the WWTP. The isolation of the phages followed the protocols recommended by [16]. 50 mL of wastewater and 1 mL of an overnight culture of the P. dispersa host strains were added to 50 mL of tryptic soy broth (TSB, Difco) and were incubated at 37 °C for 4 h to allow phage expression in its host cells, then 500 µL of chloroform was added, afterward the lysate was centrifuged at 5 000×g for 15 min, and the supernatant was filtered through 0.22 µm pore filters to extract the vB_Pd_C23 phage from the suspension. A volume of 100 μL of suspension was added to 5 mL of TSB medium previously inoculated with 100 μL of P. dsipersa overnight culture.

The phage was purified by soft agar overlay method [17]. The phage titer (i.e., infectious dose) was determined by plaque assay expressed as plaque-forming units PFU/mL, using spread plate method [18].

The method used for characterization was based on host cell interaction, plaque morphology, electron microscopy, and genome sequencing [19].

Characterization of Phage Pd_C23

Transmission Electron Microscopy

The methods used are described by [19]. Ten microliters of uranyl acetate (2%) were deposited on a Formvar carbon-coated grid (200 meshes, Pelco International) mixed with a drop of the phage particles. The grids were then examined using JEOL1230 transmission electron microscope with high contrast, 40–120 kV transmission electron microscope operated at 80 kV at the Platform of Molecular Imaging and Microscopy of the Laval University.

One Step Growth Curve

One-step growth curve was performed to determine the phage latent period and burst size as described by [20].

Phage DNA Extraction and Restriction of vB_Pd_C23

Phage DNA was extracted from 500 mL of lysate using the Qiagen Lambda Maxi DNA purification kit with modifications as described by [21]. The phage DNA was digested with 3 enzymes restriction (Sma1: recognizes the sequence 5'---CCCGGG---3'; Nae I: recognizes the sequence 5 '---GCCGGC---3'; Nar IV: recognizes the sequence 5'---GGCGCC---3'). The DNA fragments were separated by agarose gel (0.8%) electrophoresis with size marker of 1 kb suspended in 1X Tris-Acetate-EDTA buffer. A voltage of 100 V was considered during one hour (Thermo Fisher Scientific). The obtained restriction profiles were compared to those predicted in silico using GeneiousPrime2020 (www.geneious.com).

Sequencing and Genome Analysis of P. dispersa Phage vB_Pd_C23

DNA extraction was performed using titer vB_Pd_C23 phage of 9 × 107 performed by plasmid Kit Maxi (Qiagen) as described by [21]. The sequencing library was first set up with the Nextera XT DNA sample preparation kit (Illumina) according to the manufacturer’s instructions. The library was sequenced using a MiSeq reagent kit (v2) on a MiSeq system (Illumina) [21]. De novo assembly was achieved with the Ray assembler 2.2.0. [17, 22]. Coverage was calculated with Samtools. PCR product was sequenced with an ABI 3730xl sequencer. Each vB_Pd_C23 gene sequence was submitted to a homology test using Blastx on NCBI. The whole genome characteristics were compared by length, GC%, ORF number, and presence of genes. The open reading frames (ORFs) were identified using Geneious Prime 2020; NCBI ORF finder (http://www.ncbi.nlm.nih.gov/gorf/orfig.cgi), and gene prediction was carried out with GeneMark.hmm prokaryotic64 on http://exon.gatech. edu/GeneMark/gmhmmp.cgi. Genome annotation involving functions and products was performed using the FastaProtein, FastaNucleotide and the non-redundant protein database https://www.ncbi.nlm.nih.gov/protein.

Transeq and HHpred were used for further results on http://www.ebi.ac.uk/Tools/st/emboss_transeq, the tRNA placements were determined with http://lowelab.ucsc.edu/tRNAscan-SE/ [23].

Phylogenetic Analysis and Average Nucleotide Identity (ANI)

Whole vB_Pd_C23 genome was aligned in MUSCULE with five best matches Caudovirales complete genomes. Sequence alignments, phylogenetic tree reconstruction and visualization were performed using Molecular Evolutionary Genetics Analyses (MEGA.7) [24] via the Maximum Likelihood method (ML). In addition, the pairwise alignment with ClustalW of the whole genome was used to compare differences between vB_Pd_C23 and the closest 500 replicates Myovirus. Whole vB_Pd_C23 was furthermore aligned with ViPTree analysis, 2.0version developed by [25, 26]. The dendrogram of phage proteomic tree was generated from selected best fit genomes and gene markers to obtain a viral proteomic tree of vB_Pd_C23 examining the whole genomic nucleotide of 11 sequences of double-stranded DNA (dsDNA) viruses belonging to the family of Caudovirales by ML.

At last, to determine the phylogenetic relationship of vB_Pd_C23 with its close relatives, a phylogenetic relationship analysis between large terminase subunit and major capsid individual genes were identified via BLASTn. Sequences were aligned using MUSCLE [27, 28]; phylogenetic trees were generated using neighbor-joining tree analysis based on MLM alignment through PhyML software [29].

Nucleotide and amino acid identity between vB_Pd_C23 whole genome and best homologs were assessed using a Software package (RRID: SCR_001591), as multiple sequence alignment tool between multiple sequences performed in: http://www.ebi.ac.uk/Tools/msa/clustalo/. A Heatmap of the nucleotide and amino acid identities between the complete vB_Pd_C23 genome and Caudovirales sharing the highest genetic homologies and marker genes was constructed to detect the evolutionary relationship of the bacteriophage with the rest of the homologous selected phages.

Accession Number

Bacteriophage vB_Pd_C23 genome sequences were deposited to GenBank, under accession number: OL396571.1.

Results

Bacterial Identification

Transmission electron microscopy (TEM) showed a bacterium of 2 μm in size characterized by long peritrichous cilia (Fig. 1A). Sequence comparison of the isolated strain with existing data in the genomic database indicated that the strain is classified as strain Pant_SSU27 of bacteria P. dispersa; LMG2603; Genus Pantoea. The isolated complete genome strain is registered on Genbank, NCBI, under the accession Number: MN725743.

Fig. 1
figure 1

A Electron micrographs of P. dispersa strain seen by JEOL Transmission electron microscopy (MET). B Plaques formed by vB_Pd_C23 phage after 24 h of incubation at 37 °C on a lawn of Pantoea dispersa strain by Spot testing increasing dilutions estimated by plate-forming units (PFU)/mL. C Electron micrographs of vB_Pd_C23 phage by JEOL transmission electron microscopy (MET). D One-step growth curve of vB_Pd_C23 phage each data point is the mean of three independent experiments, and error bars indicate the means ± standard deviations

Phage Isolation and Purification

The phage suspension titer was estimated at 9×107 PFU/mL, as illustrated in Fig. 1B. Transmission electron microscopy revealed that the isolated phage is Myovirus presenting a long straight contractile tail with total size of 150 nm; a head size of 50 nm and a contractile tail of 100 nm×23 nm (Fig. 1C).

Characterization of Phage Pd_C23

One Step Growth Curve

One-step growth curve of the vB_Pd_C23 Myovirus was performed. The growth curve showed a 20-min latent period and a rise period estimated at 10 min, with the peak growth at 50 min with a total load of 136 PFU/mL (Fig. 1D).

Phage DNA Extraction and Restriction of vB_Pd_C23 DNA

Gel electrophoresis of vB_Pd_C23 DNA, revealed the presence of distinct bands suggesting the presence of putative dsDNA. vB_Pd_C23 DNA hydrolyzed by the enzymes NaeI, NarIV, and SmaI was cleaved into fragments, as presented in Fig. 2A; Results showed that vB_Pd_C23 genome was sensitive to all considered restriction enzymes NaeI, NarIV, and SmaI, as predicted in silico. Expected positions, and fragments sizes are shown in Supplementary Table 2. Data analyses were performed using Geneious software are summarized in Fig. 2B.

Fig. 2
figure 2

DNA electrophoresis of vB_Pd_C23 genome profile (REN) by restriction enzymes; NaeI, NarIV, and SmaI. A On the right is the in silico restriction profile and on the left is the in vitro restriction profile, molecular mass (MT indicates the size marker 1 Kb +) is indicated on the left of each profile (Bio-Rad). B Circular Genome map of vB_Pd_C23 phage indicating the positions and fragments size of the restriction enzymes NaeI, NarIV, and SmaI (Generated par Geneious 8.0 softwar)

Genome Analysis of P. dispersa Phage vB_Pd_C23

The restriction-digestion and analyses of the generated consensus sequence used for the analysis and mapped by denovo assembly of the new Tunisian phage vB_Pd_C23 revealed a linear double-stranded DNA, with a genome size of 44,714 bp. vB_Pd_C23 has 49.7% G-C content similar to other reported Pantoea phages infecting species than P. dispersa. Coding regions represent approximately 49% of the genome, and the longest noncoding region is 795 bp, occurring between ORF 40303pb and ORF41097pb (Fig. 3) (Supplementary Table 1). Transcription starts codon ATG for ribosome binding site (RBS) represent 80.3% of the total phage vB_Pd_C23 genes, and GTG represent only 12.5% of phage genes. Moreover, 19 ORFs out of 75 ORFs were reversed consecutively from the genomic position of 24735pb to the position 32429pb. From ORF32 to ORF50, the predicted proteins in this region are represented by 15 hypothetical proteins (ORF32-ORF40; ORF44; ORF46-ORF50), ORF41 was predicted to be a resistance-protein which function as a trigger to the antibiotic resistance response. HNH endonuclease (ORF42) function as an essential recombination protein trigger (ORF43) and zf-TFIIB was predicted to be a domain-containing protein (ORF45). An asymmetrical gene distribution was observed on the two DNA strands; 56 ORFs were predicted to be transcribed from the forward DNA strand, while 19 ORFs are located on the reverse complementary DNA strand.

Fig. 3
figure 3

Functional genome map of bacteriophage vB_Pd_C23. Numbers indicate open reading frame (ORF) position on the genome, functions are assigned according to the color code as follows: yellow—Proteins with no reliable identity; green— transcription, translation, nucleotide metabolism; pink ‐Bacterial origin; gray— DNA replication, recombination, repair, and packaging conserved; blue—structural proteins; red—Lysis, Orange: ORFs with unknown function (Color figure online)

Supplementary Table 1 grouped the annotation of predicted genes with putative functions. Only ORFs with a significant protein “best-match” with Myovirus sp. and Siphoviral sp. are presented in the Supplementary Table 1.

Fifty-three gene products of vB_Pd_C23 phage shared similar protein functions found in Siphoviral partial genomes phages. 46 ORFs had known functions, and the hypothetical proteins are similar to partial genomes of viruses isolated from human metagenomes revealing potential associations with chronic diseases [30] (Supplementary Table 1).

DNA Packaging Genes

The replication process requires genes for DNA packaging in the capsid, responsible for DNA recognition and initiation of DNA packaging. vB_Pd_C23 phage has a large terminase unit (ORF1) predicted to be located at the beginning of the genome and sharing 83% identity with the Klebsiella phage vB_Kpn_Chronis large unit terminase.

Structural Proteins

Four groups ORFs were determined to encode phage head and tail genes: (i) major capsid protein (ORF 6) shared 79% identity with Klebsiella phage vB_Kpn_Chronis, (ii) a putative baseplate J-like family protein (ORF21) also sharing 64% identity with Klebsiella phage, (iii) two tail fiber proteins (ORF23; ORF24), and (iv) one tail needle protein (ORF 25). All four ORFs were involved in triggering the cell infection by host recognition and promoting phage adsorption.

vB_Pd_C23 Phage Comparison with Klebsiella Phage vB_Kpn_Chronis

Pairwise alignment genomes of P. dispersa phage vB_Pd_C23 and K. pneumoniae phage (vB_KPn-chronis) were performed using Visual Interactive Simulation Tools and Applications for comparative analysis of genomic sequences (VISTA. a pairwise alignment program). Synteny test between Genomic organization of P. dispersa phage vB_Pd_C23 (OL396571) and vB_Kpn_Chronis (MN013086) was performed with blast alignment of the phages [31]. VISTA revealed the presence of similar 23 ORFs, respecting the same gene orientation from ORF 1 to ORF 23 (Supplementary Table 1).

Phylogeny and Average Nucleotide Identity (ANI)

Phylogeny was first constructed with the whole genome sequence of the vB_Pd_C23 (Fig. 4A).

Fig. 4
figure 4

vB_Pd_C23 phylogeny based on maximum likelihood method (MLM) of whole genomic nucleotide sequences of different species and genomic similarity relationships between all dsDNA viruses belonging to Caudoviricetes Class, Myoviruses, Siphoviruses, and Podoviruses mainly, with Gammaproteobacteria as host on the database. A Nucleotide sequences were aligned using Geneious.8 with 500 bootstrap replicates; model is WAG with gamma and invariant sites Bootstrap values (> 50%) are shown at each node; B vB_Pd_C23 Viral proteomic tree; Branch lengths are logarithmically scaled, the log scaled is represented by the numbers on top representing the SG values (normalized scores by blastx)

A second viral proteomic tree was constructed using VipTree, based on the ML method of the entire genomic nucleotide sequences of 11 selected dsDNA viruses among Caudovirales (Supplementary Table 3), the tree was generated based on the chronology of appearance between the closest related relationships (Fig. 4B), it revealed that vB_Pd_C23 genome embodies a distant branch, which according to the International Taxonomy Committee of the Bacterial Viruses Subcommittee and Archaeal indicates that the Monophyletic grouping generated meets the criteria for a new genus’.

A third phylogenetic tree based of individual genes (Fig. 5). The vB_Pd_C23 phylogenetic tree based on the major capsid homologous protein sequences included five phages (Fig. 5A). The phylogenetic tree, based on the large terminase unit protein sequence, included 19 phages protein sequences sharing the highest genetic similarities (Fig. 5B). (Supplementary Table 4).

Fig. 5
figure 5

Bioinformatic analyses of vB_Pd_C23 based on individual genes A Phylogenetic analyses based on large terminase large unit genes B Phylogenetic analysis based on major capsid genes. Sequences were aligned using MUSCLE, phylogenetic trees were generated using Neighbor-joining tree analysis based on MLM alignment through PhyML software (Tree percentage replicate in the bootstrap test is the value indicated next to the branches)

Average Nucleotide Identity (ANI)

Nucleotide and amino acid identity heat map constructed between vB_Pd_C23 whole genome and best hits whole genome homologs is presented on Supplementary Table 5. Overall, phylogenetic analysis revealed that vB_Pd_C23 did not significantly cluster with any known phage, and therefore represents a novel genus.

Discussion

TEM identification of the isolated strain along with the sequence comparison with existing genomic databases showed that the strain is classified as P. dispersa, LMG2603. Transmission electron microscopy of the isolated phage revealed that it belongs to the Myovirus phages. vB_Pd_C23 has a linear double-stranded DNA; with a genome size of 44,714 bp, and has a 49.7% G-C content similar to other Pantoea phages. For example, [32] showed that the phage Pantoea had a 49% G-C content. Similarly, vB_PagM_AAM22 phages (AAM22) were characterized by a 48.4% G C content [58]. However, related phages can display a GC content that can range from 52 to 55% [1].

Sequence analysis of the entire vB_Pd_C23 genome revealed the existence of 75 predicted ORF-encoding for proteins and identified one tRNA detected between the position 40706 and 40830pb, with a length of 124pb. The presence of tRNA gene in vB_Pd_C23 phage could be interpreted as a compensation for the disproportionate amount of amino acid codon between the virus and its hosts. This characteristic may offer an effective mechanism to extend the host range, as shown in research on related viruses [33]. Also, the presence of tRNA gene in vB_Pd_C23 phage confirms its lytic character as this gene is missing in lysogenic phages [34]. Bioinformatic analysis uncovered the lack of any integrase or other lysogeny-associated genes [35]. The genome start site is likely a large terminase unit gene. Predicted protein functions for vB_Pd_C23 were verified by searching for sequence similarity in the NCBI database by BLASTp. Bioinformatics analyses were based on identifying regions with similarities to gene products of tailed Caudovirales homologs. Only one protein DUF1367 family protein (ORF65) was identified to encode for a protein of Salmonella typhimurium. 35ORFs were similar with known genes of Myovirus and/or Siphoviruses associated proteins. The 40 remaining ORFs were divided between 19 ORFs that encode for hypothetical proteins of Myovirus and Siphoviruses with no reliable identity, and the other 21 ORFs had no reliable homologies of phages or non-phages genes in the NCBI database. The results suggest that there are 21 new viral proteins were associated with the phage vB_Pd_C23.

Among the vB_Pd_C23 gene products compared with detectable homologs, a total of 49 vB_Pd_C23 ORFs shared similarities with Myovirus proteins, with 22 of these ORFs sharing similarities with the complete genome of Klebsiella pneumoniae phage vB_Kpn_Chronis (MN013086.1). Similarities were verified from ORF1 to ORF 23 except for the ORF7. The best gene similarity match (83%) was registered for ORF1 with a large unit terminase of vB_Kpn_Chronis phage. The remaining ORFs matched with a Myovirus from various isolates of a human metagenome, with partial genome submitted from the Cellular Oncology Laboratory, National Cancer Institute, USA [30].

As observed for the Myoviruses genome organization, phage vB_Pd_C23 genome contains an early genomic region necessary for the phage life cycles; these genes are involved in the initiation of bacterial infection, which then triggers intermediate and late genes [36]. In addition, vB_Pd_C23 also has a small unit (ORF75) for DNA binding predicted to be at the genome end, sharing 54% of its identity with a Pseudomonas phage (DAQ82909.1). DNA packaging is known to engage the portal protein (ORF2) during viral packaging in the capsid and tail assembly. DNA packaging produces the pre-formed proheads that are needed in interactions of the portal protein with the terminase protein; this packaging is critical in the translocation of dsDNA for the final viral capsid assembly [37].

DNA Replication, Recombination, and Repair genes

vB_Pd_C23 replication process required the genes ORF 4 that produce Nudix hydrolase; this enzyme is found in viruses, bacteria and humans and plays an important role in breaking down toxic intracellular compounds as well as in creating the bacterial and viral pathogenesis [38, 39]. Additional roles may include the regulating the lytic transglycosylase (ORF15) which is associated with the initiation of bacterial cell infection. In the absence of the transglycosylase, the internalization of the phage genome is significantly delayed during infection [40]. Three ORFs (ORF42, ORF71, and ORF73) were predicted to be homing endonucleases, thus would promote DNA packaging, and act as mobile elements in genetic exchange and horizontal transfer and play a part in the dissemination of tRNA gene clusters among related organisms [41]. Restriction alleviation protein (ORF62) has endonuclease activity was predicted as the gene responsible for hindering the cleavage of the bacterial DNA restriction-modification system. The repair process for the vB_Pd_C23 genome is likely accomplished by the SOS-system (ORF56) which responds to DNA damage by increasing the gene expression required for DNA repair. The RecA (ORF66) is used to repair breaks into the bacterial chromosome [42].

Phage-Host Interaction and Lysis Genes

Six ORFs were associated with Phage-host interaction and lysis genes, involving both Holin (ORF28) and lysozyme (ORF27) along with adhesin (ORF30) and cytadhesin (ORF29) P30/P32. The lysis occurs by encoding endolysin and the protein holin. These peptidoglycan-degrading enzymes are synthesized during the late phase of gene expression [43], causing the degradation of the bacterial cell wall [17]. Holin creates small pores in the membrane facilitating the phage access to the host cell peptidoglycan, and allows the endolysins to reach their substrates. Endolysins have a high degree of biochemical variability, with higher bactericidal function than antibiotics; their associated antimicrobial activity can be exploited to reduce bacterial pathogenicity [44].

Annexin (ORF10) is known as a protein with immunomodulatory functions in viral infections and host responses [13], endosialidase (ORF5) works during the viral passage into cells and also in their release from infected cells, [45], PVL ORF-50-like family (ORF63) play a role in the pathogenesis of necrotizing pneumoniae [46] and is found in both bacteria and bacteriophages, encoded as a member of the leukocidin group of bacterial toxins that kill leukocytes by creating pores in the cell membrane [47]. The vB_Pd_C23 consensus genome is 50.6% similar to that of Klesbiella phage. The genome of vB_Pd_C23 is shorter by 988 bp, sequences encoding putative proteins made up 92.1%, this is compared to 96.7% for vB_KPn-chronis, The number of intragenic regions was 32 for vB_Pd_C23 compared to 26 for vB_KPn-chronis; the mean distances for these intragenic regions was 115 for vB_Pd_C23 and 65 bp Klesbiella phage. The genome organization of vB_Pd_C23 clearly confirms its relatedness to the Klesbiella phage.

vB_Pd_C23 phylogeny was grouped with pairwise alignment of phage vB_Kpn_Chronis along with four best matches blasted Caudovirales complete genomes. The genetic distances between the two Myovirus showed a genetic link between the two isolates, sharing a distant relationship. Also results indicate that the large terminase unit designated as the utmost commonly conserved gene sequence in phages thus exploited to build viral evolutionary relationships phylogeny between different phages is along with major Capsid protein carrying highly conserved domains [48]. As shown on Neighbor-joining tree analysis of the large unit and the major capsid protein, vB_Pd_C23 appears to embody a distinct branch in a position of aproximity with the vB_Kpn Chronis phage, confirming the previous results. This reveals vB_Pd_C23 represents a new genus, which shares a distant relationship with the vB_Kpn_Chronis phage [49]. vB_Pd_C23 demonstrated an exclusive organization regarding its clusters, supporting Hendrix theory promoting those dsDNA bacteriophages adapts to their living environment through horizontal gene exchanges, giving them the ability to recognize specifically host cells granting them the mosaic genome architecture. This suggests that vB_Pd_C23 could potentially be a mosaic of Klebsiella phage. Multiple sequence analysis revealed that although a connection was found between homologous phages, (the highest identity observed was estimated at 30.37% for Salmonella phage vB_Se_STGO_35_1 (NC_054648), proceed by 16.48% identity with vB_Kpn_Chronis phage percentage values remained lower than the nucleotide identity required between same genera, described as a cohesive group of viruses sharing a high degree of greater than 70% nucleotide identity over the entire length of the genome [50]. The new vB_Pd_C23 is associated within a new genus, since it cannot be classified in any of the currently phage recognized by the ICTV.

A submission of a taxonomic proposal for the new genus of vB_Pd_C23 (OL396571.1) as the first lytic phage specific to the P. dispersa strain was submitted to the Caudoviral Study Group. ICTV link to our proposal: https://talk.ictvonline.org/files/proposals/taxonomy_proposals_prokaryote1/m/bact01/13637

Conclusion

Myovirus phage vB_Pd_C23 was isolated from Tunisia wastewater treatment facilities. The fully sequenced genome revealed 21 ORFs with no reliable homology, indicating that vB_Pd_C23 has 21 novel viral proteins Analyses of the results indicated that the lytic phage may share a common ancestor with the unclassified K. pneumonia phage. Overall vB_Pd_C23 represents a novel bacteriophage within a new genus and presents the first lytic phage with a high specificity for its host P. dispersa.