Introduction

Infectious bursal disease has a great economic impact on the poultry industry worldwide [1]. Infectious bursal disease virus (IBDV) causes lymphoid depletion in the bursa of Fabricius, a primary lymphoid organ of chickens. Infected chickens are immunosuppressed, can be predisposed to secondary infections, and respond poorly to immunization [2].

Two serotypes of IBDV have been identified [3], however, only serotype 1 viruses cause disease in chickens. Strains within serotype 1 differ in their pathogenicity and antigenicity, and are generally classified as very virulent, classical, or antigenic variants [4, 5].

IBDV is a non-enveloped, bi-segmented, double-stranded RNA virus belonging to the Birnaviridae family [6]. Segment A is approximately 3,260 nucleotides (nts) and contains two open reading frames (ORFs). The ORF1 (438 nts) partially overlaps ORF2 (3,039 nts) and encodes the viral VP5 protein [7]. The monocistronic ORF2 encodes a precursor polyprotein VPX–VP4–VP3 and is auto-catalytically cleaved at amino acid (aa) position 512–513 for VPX–VP4 and at aa position 755–756 for VP4–VP3. The VPX protein (aa 1–512) is processed to VP2 (aa 1–441) protein [8]. Most of the research has been focused on the VP2 protein as it is the primary determinant for cell tropism [9] and induces neutralizing antibodies [10]. It has been demonstrated that changes within the VP2 antibody binding region, at aa 206–350, can lead to antigenic variants [5, 11]. The group-specific VP3 protein induces non-neutralizing antibodies [12] and is involved in virus replication and genome packaging [13]. VP4 is a cis-acting viral protease that processes the polyprotein [14].

Segment B is approximately 2827 nts and encodes for VP1, the RNA-dependent, RNA-polymerase (RdRp). The protein plays a role in viral encapsidation [15] and primes the viral RNA synthesis [16]. In addition, VP1 initiates protein synthesis by resolving 5′ unique terminal repeats and recruits the 40S ribosomal subunit [17]. The RdRp activity is 3′ non-coding region (NCR) dependent on segment A and B. The 3′ NCR has a signal recognition function for replication, transcription, and translation [18]. The replication efficiency and virulence factors have been linked to segment B [19, 20]. Although, the VP1 protein of the very virulent (vv) IBDV strains form a distinct cluster separation from the classical strains [21], chimeric IBDV viruses containing segment A and B from classical and very virulent strains, respectively do not show increase of the viral virulence or pathogenicity [22].

The level of attenuation and the system used for viral propagation affects the pathogenicity of related viruses. The Lukert strain is a further attenuated Edgar cell-culture-adapted strain [23]. The Edgar, bursa-derived strain is a pathogenic field isolate [24] and compared with both Edgar CEA and Lukert strains is the most invasive and pathogenic [25].

Most of the current molecular techniques used for IBDV typing utilize amplification of a representative genomic sequence [26] of the VP2 fragment by reverse transcriptase (RT) polymerase chain reaction (PCR) followed by restriction fragment length polymorphism (RFLP) analysis [27]. The most widely used typing method is the amplification of fragment of VP2 gene between nucleotides with position 701–1444 of ORF2 using RT-PCR followed by digestion with restriction enzymes. Different strains can be grouped into genotypes based on restriction fragment length polymorphism (RFLP) patterns [28]. Based on segment-reassortment and other approaches it has been previously suggested that VP2 is not the only virulent determinant and that viral pathogenicity can be determined by genomic regions beyond this protein [4, 19, 20, 22, 29]. In addition the hypervariable region of the VP2 protein is highly genetically unstable during viral propagation in vivo [30]

We report the molecular characterization of four IBDV strains with different pathogenicities: field isolate 9109, Lukert, Edgar CCA, and Edgar CEA strains. The full-length genomes of these strains were sequenced and analyzed. Analysis of the nucleotide and deduced amino acid sequences of segments A and B were compared with previously reported full-length sequences from serotype 1 very virulent, classical, antigenic variants, and serotype 2 IBDV strains. Our analysis revealed that although amino acid sequence of VPX may be useful, complete genomic sequence information for strain characterization is essential for thorough molecular typing as partial viral sequences may not designate a particular strain as very virulent, classical, or variant.

Materials and methods

Viruses and virus propagation

The field isolate 9109 and the classical attenuated Lukert strain were kindly provided by Dr. Pedro Villegas (University of Georgia, Athens, GA). The 9109 isolate was isolated at the Poultry Diagnostic and Research Center (PDRC), University of Georgia from immunosuppressed broiler flocks showing subclinical signs of IBDV such as bursal atrophy, and minimal mortality. Previous typing of 9109 identified a unique RFLP pattern when compared to the reference strains used for genotyping at PDRC. Edgar cell-culture-adapted (CCA) (cat. No. 120-ADV-2001) and Edgar chicken-embryo-adapted (CEA) (cat. No. 124-ADV-9501) challenge strains were purchased from National Veterinary Services Laboratory (Ames, IA). The 9109 isolate was propagated in 21 day of age (d.a.) specific pathogenic free (SPF) chickens. At 72 hours post infection, bursas were aseptically collected, snap frozen in liquid nitrogen and stored at −80°C until future use. Lukert and Edgar CCA were propagated in primary chicken embryo fibroblasts (CEF) derived from 9–11 d.a. SPF embryos as previously described [31].

RNA extraction

Total RNA was extracted directly from the Edgar CEA strain from the purchased virus without propagation using the RNeasy mini kit (Qiagen, Valencia, CA). Total RNA was extracted from Lukert or Edgar CCA infected chicken embryonic fibroblasts using the same kit. The cell culture supernatant was removed 48–72 hours post infection, the cell monolayer overlaid with 300 μl RLT buffer containing 0.143 M β-mercaptoethanol (Bio-Rad Laboratories, Hercules, CA) and scraped, as recommended by the manufacturer. Total RNA was extracted from homogenized bursas harvested from chickens infected with the 9109 isolate. All samples of extracted RNA were incubated with Proteinase K (Qiagen) at 55°C for 30 min prior to RNA column purification.

Reverse transcription-polymerase chain reaction (RT-PCR) of segment A and B

The first strand cDNA was synthesized using modified, previously reported segment-specific primers: segACEF5 or segACEF3 for segment A, and segBCEF3 for segment B [32]. Additional primers for PCR and sequencing were designed as needed using PrimerSelect software (DNAstar, Lasergene, v. 5, Inc., Madison, WI) and previously published sequences obtained from GenBank. Initially 4–5 μl total RNA and the segment-specific primer were denatured at 98°C for 5 min and cooled at 4°C for 10 min. The first-strand cDNA was synthesized at 50°C for 30 min in a 20 μl reaction containing 1x first-strand buffer, 0.01 M DTT, 0.5 μM segment-specific primer, 40 units RNase-out (Invitrogen, Life Tech., Carlsbad, CA), 200 units SuperScript II, and 0.5 mM deoxynucleoside triphosphates mix (dNTP) (Invitrogen). The RT mixture was then incubated at 70°C for 15 min. Two units of RNase H was added and incubated at 37°C for 20 min.

Segment A was amplified in two overlapping fragments with PCR primer pair segAVP25/segACEF3 and segAVP23/segACEF5 with an expected product size of 2691 and 1417 nts, respectively. Segment A positive RT-PCR controls were generated using the primer pair segAVP25/segAVP23 with an expected size of 848 nts. The last 129 nts at the 5′ end of segment A, Edgar CEA was amplified using the product from 5′ RACE and primer pair segAVP23/segACEF5. Segment B was amplified with a single PCR reaction using the primer pair segBCEF3/segBCEF5 annealing to the extreme 3′ and 5′ ends, respectively, and yielding an expected product of 2827 nts. The PCR reaction was synthesized in a 100 μl mixture containing 5 μl of the RT reaction, 1× PCR buffer, 0.2 mM dNTP, 1.5 mM MgCl2, primers at 0.5 μM each and 2.5 units of recombinant or Platinum Taq DNA polymerase (Invitrogen). The PCR cycle parameters were as follows: 94°C for 3 min followed by 35 cycles at 94°C for 15 s, 55°C for 15 s, 72°C for 5 min, and a final extension at 72°C for 30 min. The RT-PCR amplified products were electrophoresed on a 0.8% agarose gel and visualized with ethidium bromide as per manufacturer’s recommendations.

5′ RACE (rapid amplification of cDNA ends)

The sequences of the 5′ termini of the viral genome were amplified using the 5′ RACE system Version 2.0 (Invitrogen) as per the manufacturer’s recommendations. Segment A and B specific primers were constructed with the PrimerSelect software (DNAstar, Lasergene, v. 5) using the previously reported nucleotide sequence of CEF94 (GenBank accession AF194428 and AF194429, respectively) (The sequence of the primers is available upon request). First strand cDNA for segment A was synthesized using 4–5 μl total RNA using SuperScript II reverse transcriptase, segment A specific reverse primer segA676. The reaction was carried out at 98°C for 10 min, 4°C for 5 min, 50°C for 50 min (reverse transcription), 70°C for 15 min, and a final incubation with an RNase mix at 37°C for 30 min. The first strand was S.N.A.P. column purified and an oligo-dC tail was added to the 3′ end of the cDNA using terminal transferase and dCTP. The tailed cDNA was used as a template for polymerase chain reaction (PCR1) with abridged anchor (AAP) and segment A specific primer segA507. The second round of PCR (PCR2) was performed using 1 μl of PCR1 product with abridged universal amplification (AUAP) and segment A specific primer segA323. Similarly the first strand cDNA for segment B was synthesized using segment B specific reverse primer segB671. The oligo-dC tailed segment B cDNA was then used as a template for PCR1 with AAP and the segment B specific primer segB460. One microliter of the PCR1 product was used in PCR2 with AUAP and segment B specific primer segB344. All RACE-PCR reactions were carried out at 94°C for 3 min followed by 35 cycles at 94°C for 15 s, 55°C for 15 s, 72°C for 1 min, and final extension at 72°C for 30 min. The 5′ RACE products were electrophoresed on a 2.0% agarose gel. Gel purification was performed with the QIAEXII-gel extraction kit (Qiagen).

Cloning and sequencing of RT-PCR products

The purified DNA was cloned into the pCR4-TOPO or pCRII-TOPO vectors according to the manufacturer’s recommendations with maximum incubation times and transformed into competent E. coli, DH5α-T1R cells (Invitrogen). Transformants were selected on Luria-Bertani (LB) agar media (Q-BIOgene, CA, USA) containing 50 μg/ml kanamycin and screened by PCR with segment specific primers. Six clones from each PCR reaction were stored and three used for sequencing. Clones were expanded overnight at 37°C in 10 ml LB media containing 50 μg/ml kanamycin, centrifuged at 4°C for 10 min at 800 g, resuspended in 600 μl fresh LB media with 10% glycerol, and stored at −80°C until further use.

Plasmid DNA was purified using the Qiagen miniprep kit (Qiagen) according to manufacturer’s recommendations. Double-stranded DNA was sequenced by the dideoxy chain termination procedure using the automated ABI Prism 310, Genetic Analyzer (Applied Biosystems, ABI, Foster City, CA). Plasmid specific (M13 reverse and forward), as well as, IBDV segment specific primers were used for sequencing with primer walking. Clones obtained from 5′ RACE were sequenced with primers segA323 and segB344 for segment A and B, respectively. A consensus sequence was obtained using three clones which were sequenced at least three times with forward and reverse primers.

Multiple alignment and phylogenetic analysis

The nucleotide sequences obtained from the four IBDV strains in this report were deposited in GenBank with accession numbers for 9109 (AY462027, AY459321), Lukert (AY918948, AY918947), Edgar CCA (AY462026, AY459320), and Edgar CEA (AY918950, AY918949).

Previously published full-length nucleotide sequences of segment A and B used in our analysis have the following GenBank accession numbers: serotype 1 very virulent isolates D6948 (AF240686, AF240687), UPM97/61 (AF247006, AF527040), HK46 (AF092943, AF092944), UK661 (X92760, X92761), OKYM (D49706, D49707), serotype 1 classical isolates CEF94 (AF194428, AF194429), CEF (AF493979), P2 (X84034, X84035), wild type Cu1(wt) (AF362747, AF362748), Cu1 (D00867, AF362775), segment B Winterfield−2512 (AF083092) and 002/73 (M19336), serotype 1 Delaware, variant E isolate (AF133904, AF133905) and serotype 2 isolates OH (U30818, U20950) and 23/82 (AF362773, AF362774).

The nucleotide and deduced amino acid sequences were aligned with the previously published full-length IBDV sequences using ClustalW multiple sequence alignment program and the pair distance calculated using Lasergene, v. 5 (DNAstar). Unrooted cladograms were generated using maximum parsimony analysis with 1000 bootstrap replicates using the Phylogenetic Analysis Using Parsimony software (PAUP, v.10). The 5′ and 3′ NCR of segment A and B were analyzed using the Mfold software (v. 3) [33].

Results

There was a high nucleotide and amino acid similarity between IBDV strains used in this study. The amino acid sequences had higher similarities than nucleotide sequences. The similarity between both nucleotide and amino acid sequence of all strains analyzed for the VPX–VP4–VP3 polyprotein was representative for VP1, VPX, VP2, VP3, VP4, and VP5 genes (Table 1). The VPX–VP4–VP3 amino acid sequence similarity for the Edgar CCA and CEA was 96.6% and for Lukert and Edgar CEA and CCA was 96.4% and 96%, respectively. Lukert, 9109, and Edgar CEA were more similar to Cu1 strains 97.3%, 97.1%, and 96.5%, respectively than to Edgar CCA strain.

Table 1 Comparison of deduced amino acid and nucleotide sequences of the VPX–VP4–VP3 polyprotein

The similarity between amino acid and nucleotide sequences of Lukert and the classical strains Cu1, Cu1w, and P2 was higher within VP4, VP5, and the VP2 processing site at 100% and 98.6%, respectively. The VP2 processing site in the 9109 isolate had 100% amino acid similarity to the classical strains Cu1, Cu1wt, CEF94, and P2. Within the same site Edgar CEA was more similar to Lukert and very virulent strains at 90.4% compared to the other IBDV strains.

Segment A analysis

Segment A in all four strains was 3260 nts in length. ORF1, encoding the VP5 protein, contains 438 nts and encodes 145 amino acids in the four sequenced strains with the start codon at nucleotide position 97 and stop codon at 543 nt. The amino acid substitution at position 16 (D → A) is conserved between Lukert, Edgar CCA, CEA and serotype 2 OH and 23/82 strains. The second substitution in Edgar CEA is at amino acids at position 135 (H → R). VP5 in very virulent strains contains four additional amino acids at the start (MLSL) compared to the rest of the strains analyzed (Table 2, Fig. 1).

Table 2 Total amino acid substitutions in the four sequenced IBDV strains within segment A and B coding regions and comparison with the other serotype 1 and 2 strains used in this study
Fig. 1
figure 1

Multiple alignment of the predicted amino acid sequence of VP5 protein of IBDV Lukert, 9109, Edgar CCA and CEA strains with corresponding sequences of published IBDV serotype one very virulent (vv) UPM9761 and OKYM, classical P2, Cu1, Cu1wt, variant E, and serotype two, OH strain. The start site of the VP5 is at amino acid 1 from ORF1 of segment A and 145 is the last amino acid. The amino acids sequence identical to the 9109 field isolate are indicated with dots and non-identical amino acids are presented with letters. The dashes indicate amino acid deletion. The amino acid positions are based upon the 9109 isolate

The ORF2, encoding for the polyprotein, contains 3,039 nts and encodes 1012 amino acids. The polyprotein start codon is at nucleotide position 131 and stop codon at 3169 nt. Comparison of the polyprotein nucleotide sequences revealed that isolate 9109 has an insertion of C at position 2178 and deletion at 2209 nt. The VPX protein consists of 512 amino acids, VP4 consists of 243 amino acids, and VP3 consists of 257 amino acids in the four sequenced strains.

Multiple alignment of the predicted amino acid sequence of the VPX protein revealed several amino acid substitutions and potential antigenic regions (manuscript in preparation). Analysis of the VP3 protein revealed two unique substitutions 785 (I) and 993 (P) in Lukert, Edgar CCA and CEA (Table 2, Fig. 2). The 9109 isolate has unique substitution 1005 (A) also present in OKYM and vvUPM9761. Several other substitutions, 922 (Q) in Lukert and 858 (F), 922 (Q), and 948 (A) in Edgar CCA were also revealed.

Fig. 2
figure 2

Multiple alignment of the predicted amino acid sequence of VP3 protein of IBDV Lukert, 9109, Edgar CCA and CEA strains with corresponding sequences of published IBDV serotype one very virulent (vv) UPM9761 and OKYM, classical P2, Cu1, Cu1wt, variant E, and serotype two, OH strain. The start site of the VP3 is at amino acid (aa) 756 from ORF2 of segment A. The self- (aa 756–853), double stranded RNA (aa 977–1003), and VP1- (aa 1003–1012) domains are indicated as boxes. The amino acids sequence identical to the 9109 field isolate are indicated with dots and non-identical amino acids are presented with letters. The amino acid positions are based upon the 9109 isolate

Analysis of the VP4 protein revealed that the following unique amino acid substitutions: 14 in 9109, 7 in Edgar CEA, and 1 in Edgar CCA (Table 2, Fig. 3). The 9109 isolate has substitution at 541 (I), present in OKYM, variant E, and OH strains, 547 (D) within motif I, 680 (Y) present in vvUPM9761 and OKYM, and 686 (I) present in Edgar CCA and variant E, and a unique extensive (SQSTRLGQA) sequence with residue position 726–734). Edgar CEA has amino acid substitutions between serine-protease motifs I (aa 539–550) and II (aa 583–597) at position 576 (R), RIRPF with residue position 578–582 and in motif II at position 584 (E → G). No unique substitutions were found within motif III (aa 644–661), substrate-binding motif IV (aa 697–705), catalytic triad with aa positions 546 (H), 589 (D), 652 (S), and catalytic dyad with aa positions 652 (S) and 692 (K) [14, 33].

Fig. 3
figure 3

Multiple alignment of the predicted amino acid sequence of VP4 protein of IBDV Lukert, 9109, Edgar CCA and CEA strains with corresponding sequences of published IBDV serotype one very virulent (vv) UPM9761 and OKYM, classical P2, Cu1, Cu1wt, variant E, and serotype two, OH strain. The start site of the VP4 is at amino acid (aa) 513 from ORF2 of segment A. VP4 motifs I (aa 539–550), II (aa 583–597), III (aa 644–661), and IV (aa 697–705) are indicated with boxes. The amino acids sequence identical to the 9109 field isolate are indicated with dots and non-identical amino acids are presented with letters. The amino acid positions are based upon the 9109 isolate

Segment B analysis

Segment B in Lukert, 9109, Edgar CCA and CEA strains was 2827 nucleotides (nts) in length. Edgar CCA has a coding region of 2646 nts (112–2757) and encodes for VP1 protein containing 881 amino acids (aa). The 9109 isolate and Edgar CEA have coding regions of 2640 nts (112–2751) and encodes 879 amino acids. Comparison between IBDV strains revealed that 9109 and Edgar CEA have a (T → C) transition at nucleotide position 2638 and an insertion of six nucleotides CCATGA at positions 2641–2646. Several unique amino acid substitutions were identified in the VP1 protein (Tables 2, 3). Specifically, Edgar CEA and CCA have one substitution 46 (G) and 121 (T), respectively. Lukert has unique amino acid substitutions at 79 (P), 172 (G), 356 (V), 527 (S), and 720 (F). The 9109 isolate has one unique substitution at 809 (M) and amino acids 13 (T) and 546 (L) are conserved in P2 and Cu1wt, 682 (R) is conserved in Winterfield and variant E. In addition, at amino acid position 147 field isolate 9109 has (D) and very virulent strains have (N) compared to the other strains.

Table 3 Unique amino acid substitutions in VP1 protein

The 5′ and 3′ non-coding regions (NCR) sequences

The nucleotide sequences of the 5′ and 3′ NCR of the four sequenced IBDV strains were compared to the published NCR sequences of vvD6948, CEF94, and P2, segment A and vvD6948, CEF94, P2, Cu1, and CEF, segment B IBDV strains. The 5′ NCR of segment A in all strains is 97 and 131 nts for ORF1 and ORF2, respectively, while segment B has 112 nts. The 3′ NCR of segment A has 91 nts in all strains, however the length of segment B differs: 78 nts in P2, 75 nts in vvD6948, 9109, Edgar CEA, and CEF94, and 70 nts in Edgar CCA and Lukert. Segment A in Lukert has mutation at position 75 (U → C) and Edgar CEA at nucleotide position 3257 (C → A). The 18S rRNA binding motif in segment A [34] and the 5′ NCR of segment A ORF2 promoter region −131 to −100 nts [35] were highly conserved among all strains analyzed.

The unrooted phylogenetic analysis

Unrooted phylogenetic analysis using maximum parsimony analysis with 1000 bootstrap replicates was performed with the aligned segment A and B deduced amino acid sequences. Unrooted analysis allows identification of related descendants without explicitly designating a common ancestor. The analysis of the less divergent trees from VP1 (Fig. 4A) and RdRp motif (data not shown), VP4 (Fig. 5C), I–IV motifs (data not shown), and VP3 (Fig. 5D) demonstrated formation of two branches, very virulent in one branch and classical strains in the second branch. In the same trees, 9109 clustered with the classical strains. The phylogenetic analysis of the VP1, VP3, and VP4 identified a closer relationship between Lukert, Edgar CCA and CEA and classical strains with higher bootstrap probability. vvD6948 in VP3 clustered with classical strains (Fig. 3D). In the VP2 processing site, Lukert was similar to the classical strains and Edgar CCA and CEA were more similar to the very virulent strains (manuscript in preparation). In VP2, Edgar CCA and Lukert clustered together, Edgar CEA did not group with any of the clusters and formed a branch wherein it was sole the sole taxa and 9109 clustered with variant E (Fig. 4C). The 5′ and 3′ NCR of segment A 9109 isolate was found to be similar to vvD6948 (data not presented).

Fig. 4
figure 4

Phylogenetic relationship based on the amino acid (aa) sequences. The tree presented is representative of the independent phylogenetic analyses of the following segments: (A) VP1, (B) VP2 aa 202–451, VP2 aa 210–473, VPX aa 1–512, VP5–VPX–VP3–VP4–VP1, and (C) VP2 for Edgar chicken embryo adapted (CEA), Edgar cell culture adapted (CCA), Lukert, and 9109 (shaded boxes) with other IBDV strains. The cladograms were generated using maximum parsimony analysis with 1000 bootstrap replicates (confidence levels listed in parentheses) using the Phylogenetic Analysis Using Parsimony software (PAUP, v.10)

Fig. 5
figure 5

Phylogenetic relationship based on the amino acid (aa) sequences. The tree presented is representative of the independent phylogenetic analyses of the following segments: (A) VPX–VP4–VP3, (B) VP5, (C) VP4, I–IV motifs, and (D) VP3 proteins for Edgar chicken embryo adapted (CEA), Edgar cell culture adapted (CCA), Lukert, and 9109 (shaded boxes) with other IBDV strains. The cladograms were generated using maximum parsimony analysis with 1000 bootstrap replicates (confidence levels listed in parentheses) using the Phylogenetic Analysis Using Parsimony software (PAUP, v.10)

In the cladograms obtained from multiple alignments of the VPX–VP4–VP3, VP5 (Fig. 5A, B) and VP2, VP5–VPX–VP4–VP3–VP1, VPX the very virulent and serotype 2 strains (Fig. 4B, C) were represented in two distinct branches. In the same trees, 9109 and variant E clustered together with very high probability as predicted by a high bootstrap confidence level. In the same cladograms, Lukert, Edgar CCA, and CEA, and classical strains formed individual clusters. Edgar CEA and CCA were more similar to Lukert in VPX, VPX–VP4–VP3, VP5, and VP3 (Figs. 4B, 5A, B, D). The clustering pattern obtained from the VPX and amino acid sequences between amino acids at positions 202–451 and 210–473 cladograms represented the VP5–VPX–VP4–VP3–VP1 and VPX–VP4–VP3 (Figs. 4B, 5A).

Discussion

We describe the full-length sequence characterization of four IBDV strains with different pathogenicities. The nucleotide and deduced amino acid sequences of these strains were compared with previously reported full-length sequenced serotype 1 and 2 IBDV strains.

Substitutions at amino acid position 785 (L → I) and 993 (Q → P) in Lukert, Edgar CCA, and CEA are in the VP3- and double-stranded (ds)RNA-binding domains, respectively [36]. The 9109 isolate has (Q) at amino acid position 981 in the dsRNA- and (A) at position 1005 in VP1-binding domain. In addition, it was determined that 1005 (A) is a vvIBDV characteristic and may be significant for the pathogenicity [37], however, 9109 is not a very virulent strain. Previously, the VP3 carboxyl terminus, aa 857–1012, was found to be important for vvIBDV pathogenicity [38] and formation of the VP3–VP2 complex, which has a role in IBDV replication and genome packaging [13, 15].

Mutations in the VP4 protease may affect the polyprotein cleavage and may have a role in viral adaptation and pathogenicity. Substitutions in the VP4 protein at amino acid positions 541 (V → I) and 547 (N → D) in 9109 are within motif I and several substitutions in Edgar CEA are between serine-protease motifs I and II. The amino acid 680 (Y) observed in 9109 was previously proposed as very virulent marker [37].

Edgar CCA, CEA, and Lukert strains have substitutions within the VP5 protein at amino acid position 16 (D → A) indicating that this amino acid may not be important for viral pathogenicity since non-pathogenic serotype 2 strains have the same substitution. In addition Edgar CEA contains a second substitution at amino acid position 135 (H → R). The cytotoxic VP5 protein is an important for the viral release [39], induction of apoptosis [29], and pore formation in the cells [24]. Although previous analysis of VP5 did not reveal amino acids unique to the very virulent pathotype [40], amino acid substitutions could affect VP5 function. An IBDV VP5 knock-out mutant has been shown to replicate in the bursa without bursal lesions and does not induce apoptosis [41] as does the wild type.

The stem-loop structure, not the sequence of the 3′ NCR, is an important functional determinant [38] and plays a role in the protein-primed RNA synthesis [42] and RNA packaging [43]. The mutation at nucleotide position 3257, segment A in Edgar CEA is within the inverted terminal repeat 3255–3260 nts at the 3′ NCR and part of the stem-loop structure with energy level −3.1 vs. −11.7 kcal/mole in the other sequenced strains which changes the stability of the stem-loop. The mutation at position 75 (U → C) in Lukert within the 5′ NCR of segment A is in the putative 18S rRNA binding domain [34] and may affect the transcription and translation efficiency.

The viral protein VPX and amino acid sequences between aa at positions 202–451 and 210–473 of VP2 but not the entire VP2 protein are the best representatives of the entire IBDV genome. The greatest variability was found in the VP2 and 5′ NCR of segment B among IBDV strains. The RdRp within VP1 and the VP5 protein were highly conserved among isolates.

Close nucleotide and amino acid relatedness between Lukert, Edgar CCA and CEA was established. Although Lukert, Edgar CCA and CEA strains have as expected sequence characteristics of the classical strains when, using the VP2 processing site, Edgar CCA and CEA could be classified as very virulent strains as shown in this analysis.

Phylogenetic and sequence analysis of field isolate 9109 revealed unique features. The isolate has characteristics of classical, variant, and very virulent strains depending on the region analyzed. The 9109 isolate is more similar to the classical strains in the VP2 processing site, VP3, and VP4 proteins. Although we did not find the unique SspI restriction site within the VP2 of 9109 isolate [44], (manuscript in preparation) previously suggested as a very virulent marker [45], we have identified two amino acid substitutions 680 (Y) in VP4 and 1005 (A) in VP3 which were also reported as vvIBDV markers. In addition, the amino acid at position 147 in the VP1 protein was substituted only in 9109 and very virulent strains. Phylogenetic analysis of the 5′ and 3′ NCR of segment A revealed similarity between 9109 and the vvD6948 strain. There are also several similarities between variant E and 9109 in VPX, VP2, and VPX–VP4–VP3 polyprotein. The predicted antigenic regions for 9109 isolate demonstrated similarities between 9109 and variant E (manuscript in preparation).

The four sequenced strains were determined by the phylogenetic analysis of VP1 protein to be classical strains. The segment B was highly conserved among analyzed strains. The VP1 protein in Lukert and Edgar CCA contains two more amino acids compared to 9109 and Edgar CEA strains. Similar variations have been previously described for P2, Cu1, Cu1M, and CEF94 strains [21] but the significance is still unknown. Edgar CEA has a substitution at amino acid position 46 (S → G) within the Ser-phosphorylation region [46]. At amino acid position 147 only 9109 and virulent strains have substitutions. Amino acids previously suggested to play a role in cell-specific replication [47], virulence [20], phosphorylation, glycosylation, NTP-binding motifs [48], and the RNA-dependent, RNA-polymerase (RdRp) motifs [46, 49] were highly conserved between all strains analyzed.

While the molecular basis for IBDV pathogenicity has not been fully defined, full length sequence analysis of viruses with different pathogenicities is an important factor in identifying potential predicators of viral pathogenicity. Finally, genomic analysis of IBDV at both the nucleotide and amino acid level is necessary for typing new field isolates to provide complete pathogen data which can be correlated with virulence and pathogenicity.