Introduction

Newcastle disease is one of the most contagious avian diseases and is subsequently responsible for some of the greatest economic losses in the poultry industry. Newcastle disease is caused by the Newcastle disease virus (NDV), an enveloped RNA virus from the Paramyxoviridae family, genus Avulavirus [1]. The NDV genome consists of single-stranded, non-segmented, negative-sense, RNA. The genome contains the following six genes: nucleoprotein (NP), phosphoprotein (P), matrix (M), fusion (F), hemagglutinin–neuraminidase (HN), and an RNA-dependent RNA polymerase (L) [2].

Viruses are classified into three pathotypes according to their pathogenicity index, based on chicken embryo mortality: (1) lentogenic viruses that cause mild symptoms with no mortality, (2) velogenic viruses which approach 100 % mortality, and (3) mesogenic viruses which exhibit a moderate pathogenicity with mortality rates ranging from 30 to 50 % with pronounced symptoms; the last two are consider virulent [3]. The pathogenicity of NDV strains is determined primarily by the amino acid sequence at the cleavage site of cellular proteases within the fusion protein. A low proportion of basic amino acids in this site is less susceptible to cleavage by proteases and is therefore considered to be lentogenic. On the other hand, a high proportion of basic amino acids are characteristic of velogenic phenotypes [4]. However, there is wide genetic diversity within this serotype, and various methods have been proposed to classify viruses according to their genome features [58].

The most recent classification system, proposed by Czeglédi et al. [7], groups viruses into two major classes based on three genome-size categories, containing 15198, 15186, or 15192 nucleotides. Class I includes NDVs with genomes containing 15,198 nucleotides, while class II comprises two genome sizes, genotypes I, II, III, and IV containing 15,186 nucleotides and genotypes V, VI, VII, and VIII with 15,192 nucleotides.

It is widely accepted that vaccination using any NDV genotype protects against symptoms of the disease and mortality in the case of a viral outbreak. However, Miller et al. [8] showed that vaccinal protection does not prevent the excretion of viruses and subsequent infection when field strains contain large amino acid differences due to changes in the nucleotide sequence compared to vaccinal strains. Protection improves when the vaccinal strains are phylogenetically more closely related to the challenging strains [8].

Previous reports of isolated NDV strains in Mexico have determined that the most common occurrences belong to genotype V [911]. However, the strain most commonly used for vaccination in the last 40 years has been the LaSota strain, belonging to genotype II, followed by some genotype I strains. We isolated a NDV strain, APMV1/chicken/Mexico/P05/2005, henceforth called P05, from an outbreak at a hen farm that occurred despite vaccination and resulted in a significant drop in egg production. In this study, we report our findings on the complete genome characterization of this strain isolated in 2005.

Materials and methods

Isolation and propagation of virus

The strain P05 was isolated in May 2005 from a poultry farm located in the state of Puebla, Mexico. On this farm, an outbreak led to a significant drop in egg production in a hen population that had been vaccinated using the LaSota strain. The virus was isolated from internal organs (lung, trachea, and spleen) of a hen exhibiting severe symptoms of the disease. We used the isolation method extensively described by Alexander in 1989 [11]. Briefly, the organs were collected under sterile conditions, minced to 20 % w/v using Tryptose Phosphate Broth (TPB) as a diluent, centrifuged at 1,500 rpm for 15 min, and 100 μL of the supernatant of each minced organ were inoculated into the allantoic cavity of 9- to 11-day-old specific pathogen-free (SPF) chicken embryos. The embryos were incubated at 37 °C for 3 days; subsequently, a sample of allantoic fluid was extracted and identified by hemagglutination and specific antibody testing.

Characterization of NDV, HA, and HI testing

To identify the NDV-positive fluid samples, a hemagglutination assay with red cells from chicken embryos was performed. Positive samples were subjected to HI assay with anti-NDV antibodies to insure that hemagglutination resulted from the NDV HN protein by the method described by Alexander et al. in 1987 [12].

Determination of virulence

Pathogenicity of the P05 strain was determined according to the intracranial pathogenicity index (ICPI) and the mean death time (MDT) of the chicken embryos [11].

Isolation of viral RNA

For isolation of P05 viral RNA, we used allantoic fluid from SPF embryos previously (48 h) infected with the virus. For RNA purification, we used the QIAamp Viral RNA kit (Qiagen).

Oligonucleotide design and RT-PCR amplification

Our oligonucleotide design was based on the alignment of full-length sequences of previously reported NDV genomes. The design considered the non-variable regions in the genome of the virus with special attention to positions where the insertion of nucleotides has been reported in viral evolution studies. Table 1 shows the oligonucleotides used. The complete genome of the P05 virus was amplified in four overlapping segments using SuperScript® III One-Step RT-PCR System with Platinum® Taq High Fidelity (Invitrogen) to the following concentrations: viral RNA, 200 ng, 20 μM of each oligonucleotide, 1 μL of enzyme mixture, and 25 μL of 2× buffer. The amplification conditions of the thermocycler were as follows: a retrotranscription reaction at 50 °C for 30 min, a PCR at 94 °C for 5 min, 30 cycles of denaturation at 94 °C for 1 min, hybridization at 50–53 °C for 1 min, amplification at 68 °C for 1 min/kb, and a final 10 min cycle.

Table 1 Oligonucleotides used in this study

Sequencing and analysis

The PCR-generated products were sequenced directly in triplicate using the oligonucleotides shown in Table 1. Sequencing was performed by the Taq FS dye terminator cycle fluorescence-based sequencing method in a Perkin Elmer/Applied Biosystems, Model 3730. Analyses of the resulting genome sequences and genome assembly were performed by means of Geneious software (Biomatters Ltd). For the phylogenetic analysis, we compared the P05 genomic sequence with 76 complete and near-complete reference genome sequences (only sequences >15,180 nt), and of the variable region of gene F (nucleotides 47–420) of viral strains from class I and II (genotypes I–IX), available at GenBank. The evolutionary history was inferred by the Maximum Likelihood method based on the Kimura 2-parameter model. Evolutionary analyses were conducted in MEGA5 [13].

Results and discussion

Virus characterization

Pathogenicity testing showed that the isolated P05 virus can be classified as velogenic. The chicken embryo MDT was 36 h and the ICPI was 1.99, placing the virus in the highly infectious class. Furthermore, the virus is considered to have the highest pathogenicity among viruses previously reported in Mexico [911]. In addition, the deduced amino acid sequence at the cleavage site in the gene which encodes for the fusion protein confirmed a high proportion of basic amino acids, characteristic of highly virulent viruses [4].

Genome analysis and deduced proteins

We found that the complete genome of NDV P05 has a length of 15,192 bp (GenBank accession number HM117720). Similar to other avian serotype 1 paramyxoviruses (APMV1), it contains six genes: NP, P, M, F, HN, and L. The G + C % of the viral genome is 46.3 % and shows a decrease from NP to L. Characteristics of the six genes and the deduced protein sequences are shown in Table 2.

Table 2 Genomic characteristics and deduced amino acids

The 5′ and 3′ leader sequences consist of 114 and 55 nt, respectively. The nucleotide sequence of the gene start (GS), gene end (GE), and the length of the intergenic sequence (IG) in the genome of the NDV affect the efficiency of transcription of the six genes [14, 15]. In the P05 strain, similar to other NDV’s, the GS sequence of the NP–P–M–F and HN is UGCCCAUCUU with the only change shown in the GS of the L gene UGCCCAUCCU (reference). The GE sequence is relevant for efficient termination and the initiation of the downstream genes [15]. The GE sequences in NDVs differ only slightly between themselves; in the P05 strain, the GE sequences of the six genes show the sequence (U/A)A(U/A)UCUUUUUU. In all genes, the GE sequences of the P05 strain are identical to the Largo/71 strain; however, there is an extra U in the GE sequence of NP of Largo/71. The IG length between genes in P05 are 2 in NP–P, 1 in P–M, 1 in M–F, 31 in F–HN, and 47 in HN–L. Owing to the extra U in the GE of Largo/71, the intergenic sequence between NP and P is just one nt, while in P05 it is two nt.

The complete genome of P05 strain is similar to the previously reported Largo/71 strain [16]. Major similarities between the two strains were identified at the NP and L genes with 98.0 and 98.6 % similarity in the nucleotide sequence and 99.0 and 99.1 % in the deduced amino acid sequence, respectively. By comparing both strains, we found that the gene showing the greatest difference encodes for protein M (similarity percentages were 92 in nucleotide sequence and 87.7 in amino acid sequence) (Table 3).

Table 3 Similarity percentages between the P05 strain and representative strains of each genotype

In addition, the deduced amino acid sequence of protein F of the P05 strain had a 96.4 % similarity to the Largo/71 strain. However, it also showed similarity rates (96.6 and 96.2 %, respectively) with the Fontana/72 (genotype VI) and Herts/33 (genotype IV) strains (Table 3).

Comparisons of the deduced amino acid sequences of both antigenic proteins (F and HN) of P05 and other strains representative of the eight genotypes showed the lowest similarity percentages between the P05 and the LaSota strain (similarities P05/LaSota: 89.4 and 88.8 %) (Table 3). This difference is in accordance with the previous data reported by Miller et al. [17] who showed the greatest genetic diversity between genotypes II and V. This difference is a fact that should be taken into consideration in choice of vaccine seed.

On the other hand, it is well known that the sequence at the F protein cleavage site is the major determinant of pathogenicity in NDV although not the only one [4, 18, 19]. The P05 strain had a pattern of polybasic amino acids in positions 112 through 116 of the F protein (RRQKR↓F). This amino acid sequence is identical to Largo/71 and every NDV isolated in the last 15 years in Mexico [9, 10, 20].

Beyond the fusion protein cleavage site, it has been reported that five other genes in the viral genome may have an effect on viral pathogenicity [18, 19, 21, 22]. Analysis of the deduced amino acid sequence revealed that the P05 strain lacked the mutations previously reported in genes P and L that have been implicated in virulence [21, 22]. This suggests the existence of other not yet characterized regions which may contribute to increased virulence in the P05 strain.

Phylogenetic analysis

Phylogenetic analysis of 77 complete and partial genome nucleotide sequences of representative viruses belonging to class I and class II (genotypes I–IX) are shown in Fig. 1. The P05 strain is among the genotype V virus clearly more related to Largo/71. Non-significant changes can be seen in the distribution of the virus in the tree when analyzing the partial sequence of gene F (47–420 nt) (Electronic supplementary material 1). These results match the genotypes of strains isolated in commercial avian species in North America in the last 15 years [9, 20, 23].

Fig. 1
figure 1

Phylogenetic analysis of 77 complete and partial genome nucleotide sequences (only sequences >15,180 nt) of class I and class II (genotypes I–IX) of representative genomes. The evolutionary history was inferred by the maximum likelihood method based on the Kimura 2-parameter model. The phylogenetic tree is drawn to scale with the highest log-likelihood (−160892.14) with branch lengths measured in the number of substitutions per site (only shows the highest at 0.005). A discrete gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 0.4120)). Codon positions included were 1st + 2nd + 3rd + noncoding. evolutionary analyses were conducted in MEGA5 [13]

The full-length genome sequence of the isolated P05 strain showed a 97.4 % similarity to the Largo/71 strain isolated in the state of Florida in the United States during the initial outbreaks in the early 1970s. These outbreaks have affected various border states between Mexico and the USA [16], but seem to have originated from the import of exotic birds South America [24, 25].

In North America, in commercial poultry, the major genotype identified has been genotype V. For this reason, we made a particular analysis of the variable segment (47–420 nt) of the gene that encodes for the Fusion protein with sequences available in Genbank (majority of viruses isolated in Mexico) (Fig. 2). Our results are similar to those previously reported by Perozo et al. [9]. Figure 2 shows three lineages of viruses belonging to genotype V. Two of them, have been identified circulating in commercial poultry [9] and a third includes wild birds.

Fig. 2
figure 2

Phylogenetic analysis of 33 nucleotide sequences of the variable region of gene F (nucleotides 47–420). In the tree, strong emphasis is given to viruses belonging to genotype V. The evolutionary history was inferred by the maximum likelihood method based on the Kimura 2-parameter model. The phylogenetic tree is drawn to scale with the highest log-likelihood (−2012.8672) with branch lengths measured as the number of substitutions per site. A discrete gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 0.4120)). Evolutionary analyses were conducted in MEGA5 [13]

Particular analysis of the virus sequences isolated in Mexico show two lineages with similarities between each group of 92–95 % of the virus circulating in Mexico. The first group (Va) corresponds to isolates between 1998 and 2002 and includes the virus that caused the outbreak in Torreon in 2000. This lineage includes the chicken/Mexico(Morelos)/458/1988 virus and the Gamefowl/U.S.(CA)/211472-4/2002 strain, suggesting that the same variety is circulating in both countries and may have been circulating in Mexico for over 10 years.

The second group (Vb) corresponds to isolates made between 2004 and 2006 with similarities between them greater than 98 %. This group contains the P05 strain and is more closely related to the Largo/71 strain which is representative of the outbreak in North America in the early 1970s.

With these two lineages, the P05 strain shows high similarity (98.5–99.5 %) among strains isolated in Mexico between 2004 and 2006. However, the percentage of similarity of the P05 strain diminishes when compared to strains isolated in Mexico between 1998 and 2001 (93–94 %) [9].

With the information available on commercial chickens, we identified two evolutionary lineages of genotype V in recent years in North America. The Va lineage which includes the Gamefowl/U.S.(CA)/211472-4/2002 strain and the Vb lineage represented by P05. Probably, both lineages descended from the strain that caused the outbreak in the 1970s. The percentage of similarity between Gamefowl/U.S.(CA)/211472-4/2002 and Largo/71 is 94 %, while the percentage of similarity between P05 and Largo/71 is 97.4 %. This means that the Vb lineage accumulated fewer changes in relation to Largo/71, and the Va lineage has a higher mutation rate in relation to the same strain. These data explain the difference between these two recent strains.

There is little information about NDV isolates in North America from commercial poultry so it is unknown whether the Vb lineage continues to circulate. However, a sequence of the F gene from a virus isolated in 2008 in Belize (Turkey/Belize/4438-4/2008, Genbank access: JN942045) with high similarity to the strain isolated from the Torreon outbreak of 2000 (99 %), suggests that the Va lineage still continues to circulate.

In addition, the phylogenetic tree identifies a third lineage (Vc) which houses the wild virus isolated in the USA from cormorants and is poorly correlated with lineages Va and Vb with differences greater than 9 %. So far, no evidence has been reported that the wild bird strains are circulating in commercial poultry.

Currently, a permanent vaccination program for Newcastle disease is in effect in Mexico. During the last 40 years, vaccination of commercial birds has been carried out using the LaSota strain, categorized as genotype II. It is well known that all NDVs belong to a single serotype so there is protection against mortality in case of a highly virulent challenge. Furthermore, it has been reported that other APMV may confer cross or partial protection against highly virulent NDVs [26]. Despite the protection in mortality that may be offered by vaccination, some reports suggest that vaccination using a virus that resembles the challenge virus contributes to diminished viral shedding [8]. It is necessary, therefore, to know which virus phylogenies circulate in commercial poultry and try to design vaccines closely related to the field virus.

We isolated the P05 strain from a farm with hens vaccinated against Newcastle disease using the LaSota strain. Since the first report of an outbreak of ND in Mexico in 1946, numerous outbreaks have been well documented at the veterinary level, but little is known about the evolution of viral genomes. To our knowledge, this study is the first to provide a complete genome sequence of a viral strain of Newcastle disease isolated in Mexico. Data collected from this study should contribute considerably to the study of the genomic evolution of NDV in Latin America.