Introduction

Canine Distemper (CD) is a severe systemic disease caused by a virus classified into the genus Morbillivirus, family Paramyxoviridae, along with Measles virus, Peste des petits ruminants virus, Riderpest virus, Phocine distemper virus and Dolphin morbillivirus (Murphy et al. 1999).

This disease affects a great variety of animals, mainly terrestrial carnivores belonging to the Canidae, Felidae, Mustelidae, Procyonidae, Ursidae and Viverridae families. There are reports on the occurrence of the disease in collared peccaries and Japanese monkeys (Appel et al. 1991; Deem et al. 2000; Yoshikawa et al. 1989). In dogs, the disease is systemic with respiratory, nervous and digestive signs. Footpad hyperkeratosis and enamel hypoplasia have been described and are regarded as characteristic marks of this disease (Appel 1987).

Canine Distemper Virus (CDV) possess a negative sense, single-stranded RNA genome, encoding for 6 main proteins, the Nucleoprotein (N), Phosphoprotein (P), Matrix protein (M), Fusion protein (F), Haemagglutinin (H) and Large protein (L) ( Sidhu et al. 1993). The P gene codes in addition other two proteins, C and V, which have been associated with interferon inhibition and the control of infectivity, respectively (von Messling et al. 2005).

The H protein is involved in SLAM-mediated virus entry in the cells and elicits neutralizing protective antibodies (Jensen et al. 2009). The H gene is the most variable portion of CDV genome and has been used to investigate the genetic relationships among the various strains (Demeter et al. 2007; Haas et al. 1997; Martella et al. 2006). Although being more conserved, the N gene has also been used to gather information on the genetic diversity (Keawcharoen et al. 2005; Yoshida et al. 1998).

On the basis of the full-length sequence of the H gene of CDV strains identified globally, at least seven main geographic groups (genotypes) appear to circulate among the various susceptible hosts, namely America-1 (including the majority of vaccine strains), America-2 (CDVs circulating in Northern America), Asia-1 and Asia-2 (CDVs identified in the Asiatic continent), Europe (viruses of European origin), Arctic (viruses identified in the Arctic ecosystem and in Europe) (Bolt et al. 1997; Iwatsuki et al. 1997; Martella et al. 2007; Pardo et al. 2005). In addition, some CDV strains identified in Africa, Asia and Argentina (Gallo et al. 2007; Woma et al. 2010; Zhao et al. 2010) appear to diverge substantially and might represent separate geographic groups. Besides the vaccines grouped in America-1 genotype there are some independent strains used as vaccines which are related to Rockborn strain (Martella et al. 2011). Nearly 8–9 % aa variation is observable between America-1 CDVs (including the vaccine strains Onderstepoort and Lederle) and the other CDV genotypes in the full-length H protein but the impact of the observed genetic heterogeneity remains unclear.

Despite the availability of CDV vaccines in the market, in Mexico CD remains a major disease of dogs. Limited information is available of the epidemiological and molecular features of the CDV strains circulating locally (Simon-Martínez et al. 2007). In this work we analyzed a total of 14 samples from dogs with clinical signs of Canine Distemper, which were submitted to our laboratories, between the years 2007 and 2010. The viruses were analyzed using two different genomic regions, a fragment of the H gene and N gene.

Materials and methods

Clinical specimens

From 2007 to 2010, 127 samples (whole blood) of animals with suspect of CDV infection were sent to our laboratory for diagnostic confirmation and CDV RNA was identified only in 66 samples (52%) by RT-PCR. Out of 66 samples, 14 could be included in this study based on the amount of the sample and the amount of RNA extracts available for additional analysis. The samples were submitted to our laboratory from private veterinary hospitals in Mexico State (13 samples) and from Jalisco State (1 sample). The clinical signs, age, sex and breed of the patients are summarized in Table 1.

Table 1 Age, sex, breed and clinical signs of dogs used in this study

Extraction of nucleic acids

Viral RNA was extracted from leucocytes obtained by washing with nuclease-free water. Briefly, 500 μl of nuclease-free water were added to 500 μl of whole blood in a 1.5 microcentrifuge tube, and centrifuged 7 min at 5,000 g. A total of 500 μl of the supernatant was discarded, then 500 μl of nuclease-free water were added, repeating this step 3 times. Total RNA was extracted with TRI Reagent (Sigma®, USA), following the manufacturer’s instructions.

RT-PCR

A one step RT-PCR was made using the AccessQuick RT-PCR System (Promega®, USA). The final volume of the reaction was 51 μl, and contained 25 μl of AccesQuick Master Mix (2 X), 1 μl of each primer (10 mM), 1 μl of reverse-transcriptase (AMV-RT) (5 u/μl), 18 μl of nuclease-free water and 5 μl of RNA template.

Confirmation of the initial suspect of infection by CDV was accomplished with primers targeted to the N gene, designed by Shin et al. (2004), that amplify a fragment of 297 bp (primer forward 5’-GTAGACGAAGGGTCGAAAG-3’ and primer reverse 5’-GAATCGCCTCAAAGATAGG-3’, nucleotides 507 – 803 of the Onderstepoort strain, AF305419). Thermo-cycling conditions were 45 min at 45°C (reverse-transcription), 5 min at 94°C (initial denaturation), 30 cycles of 30 sec at 94°C, 30 sec at 55°C, 30 sec at 72°C, and a final extension of 10 min at 72°C.

The CDV-positive samples were tested with primers amplifying a 613 bp fragment at the 3’ end of the H gene. The primers were designed in conserved stretches of the H gene (DHIF 5’-TGGTTCACAAGATGGTATTC-3’ and DHIR 5’-CAACACCACTAAATTGGACT-3’). The thermo-cycling condition were as follows: 45 min at 45°C (reverse transcription), 5 min at 94°C (initial denaturation), 30 cycles of 30 sec at 94°C, 30 sec at 51°C, 30 sec at 72°C, and a final extension of 10 min at 72°C.

PCR products sequencing

The amplicons were gel-purified using the Wizard® SV Gel and PCR Clean-Up System (Promega®, USA) and subjected to direct sequencing in a Genetic Analyser 3100 (Applyed Biosystem®, USA) using the Big Dye Terminator v.3.1 Cylce Sequencing Kit (Applyed Biosystem®, USA).

Phylogenetic analysis

The nucleotide sequences were analyzed preliminary with cognate sequences available in GenBank using BLAST software (http://www.ncbi.nlm.nih.gov/). Multiple sequence alignment was made with ClustalW software (Thompson et al. 1994) included in MEGA 4.0 software (Tamura et al. 2007).

Concatenated sequences of the two gene fragments were aligned and a Neighbor Joining tree was created based on the uncorrected distance matrices, thus identifying the CDV variants circulating in Mexico (Fig. 1).

Fig. 1
figure 1

Phylogenetic tree with concatenated sequences of H and N gene of CDV

For the phylogenetic analysis, the genetic distances were computed using the Kimura two parameter algorithm and the tree was constructed with the Neighbor Joining method with a 1000 bootstrap repeats using the MEGA 4.0 Software (Tamura et al. 2007). Phylogenetic networks were constructed with Network 4.5.1.2 software by the Median Joining algorithm (Bandelt et al. 1999). One Measles strain (AY486084.1) was used as an outgroup in the Neighbor Joining trees.

Results

RT-PCR and sequence analysis

Amplification of fragments of the expected size was obtained with N and H gene and the sequences determined. After removing primer-derived sequences and some sequences with inconsistencies at the 5’ and 3’ ends, a 223 bp-long fragment of the N gene (nucleotides 537 – 759 of the Onderstepoort strain, AF305419) and a 509 bp-long fragment of the H gene (nucleotides 8078 – 8568 of the Onderstepoort strain, GenBank: AF305419) were obtained.

Neighbor Joining analysis with the uncorrected distances generated with the concatenated N/H sequences allowed us to identify at least 10 different genovariants (Fig. 1), EdoMex1(GenBank N gene: FJ490185, H gene: HM771709), EdoMex2 (GenBank N gene: FJ490186, H gene: HM771710), EdoMex4 (GenBank N gene: FJ490188, H gene: HM771711), EdoMex6 (GenBank N gene: FJ490190, H gene: HM771712), EdoMex7 (GenBank N gene: FJ490191, H gene: HM771713), EdoMex8 (GenBank N gene: HM771704, H gene: HM771714), EdoMex9 (GenBank N gene: HM771705, H gene: HM771715), EdoMex10 (GenBank N gene: HM771706, H gene: HM771716), EdoMex11 (GenBank N gene: HM771707, H gene: HM771717), and GDL1 (GenBank N gene: HM771708, H gene: HM771718).

The Mexican genovariants displayed 98.4%-99.9% nucleotide (nt) identity to each other, 91.3%-92.3% nt identity to strain Ondesterpoort and 96.0%-96.6% nt identity to strain A75/17. Deduced amino acid sequence of the H protein fragment was inferred and cysteine and proline amino acids in the Mexican strains were conserved with the majority of CDV strains. Three potential N-glycosilation sites (at position 391–393, 422–424 and 456–458) were found to be conserved as in almost all field CDV strains but strain Onderstepoort that lacks the N-glycosilation sites 391–393 and 456–458.

Phylogenetic analysis

The phylogenetic tree of the N gen grouped all the Mexican variants in the same cluster, distantly related to the vaccine strains and to other field CDV strains identified globally (Fig. 2). Some patterns of segregation were clearly distinguishable (Asian, American and vaccines strains). These segregation patterns were confirmed in the phylogenetic network of the same gen (Fig. 3).

Fig. 2
figure 2

Phylogenetic analysis of N gene of CDV, Neighbor Joining tree. GenBank Accession numbers are indicated in each strain. Bootstrap values lower than 50% are not shown

Fig. 3
figure 3

Phylogenetic analysis of N gene of CDV, Median Joining network

The neighbor joining tree and phylogenetic network generated on the 509 bp-long partial sequences of the H gene (amplified with primers DHIF-DHIR), evidenced geographic patterns (genotypes) resembling the patterns described previously using the full-length H gene (Asia1, Asia2, Europe, Arctic, America1 and America2). All the Mexican CDV variants formed a different group, also including the North American strain 19876, detected in Missouri, USA, 2004 (Pardo et al. 2005) (Figs. 4 and 5).

Fig. 4
figure 4

Phylogenetic analysis of H gene of CDV using Neighbor Joining tree. GenBank Accession numbers are indicated in each strain. Bootstrap values lower than 50% are not shown

Fig. 5
figure 5

Phylogenetic analysis of H gene of CDV, Median Joining network

Discussion

Sequence analysis of CDV strains identified worldwide has been accomplished using different genetic targets, such as the H, F, N or P gene (Bolt et al. 1997; Castilho et al. 2007; Demeter et al. 2007; Haas et al. 1997; Pardo et al. 2005). The use of different genetic targets and different size has generated a large data set of CDV sequences but it has hampered uniform comparison of the various CDV strains. Although the H gene appears to be more suitable for investigating the genetic relationships among CDV strains, haplotypes analysis can also be useful to identify CDV genetic variants (genovariants, i.e. epidemic strains spreading in a given time span or population) and also to identify recombinant viruses.

In this study partial sequences of the H and N genes of CDVs identified from a Mexico region were obtained and compared, using different phylogenetic approaches. Strains displaying 100% nt identity in the N and H sequences were regarded as a unique genovariant. Analysis of 14 CDV strains obtained between 2007–2010, identified 10 distinct genovariants. More genovariants were found to circulate in the same year in the same geographical setting (Edomex 1, 2, 4, 6, 7 in 2007, Edomex 9 and 10 in 2008, Edomex 7 and 11 in 2009). Also, genovariant Edomex7 was found to circulate in 2 distinct years (2007 and 2009) in the same geographical setting.

N gene-based neighbor joining phylogenetic tree and Median Joining network analysis using selected sequences retrieved from GenBank revealed geographic-related patterns of segregation. All the Mexican CDV genovariants were clustered in a distinct group, genetically unrelated to the vaccine CDV strains and to field strains of various geographical origin. A limited number of N gene sequences are available in the GenBank and they span different regions of the gene, thus not being comparable.

The H gene-based neighbor-joining phylogenetic tree and Median Joining network analysis was consistent with similar analysis reported in previous studies (Bolt et al. 1997; Iwatsuki et al. 1997; Martella et al. 2007; Pardo et al. 2005). All the Mexican CDV genovariants clustered in a unique, well-defined (bootstrap value 98%) group. Interestingly, a North American strain, 19876, was found to cluster along with the Mexican viruses. Identity of strain 19876 in the sequenced H gene fragment to the Mexican CDVs was 99.2–99.4 % nt and 99.4–100% aa. Strain 19876 was identified from a dog in Missouri, USA, 2004, and the dog had no history of recent travel outside USA (Pardo et al. 2005). By analysis of the full-length H gene, virus 19876 appeared as the most divergent of the North American CDV strains, as it was found to possess only 93.5–94.6 aa identity to other contemporaneous USA strains (Pardo et al. 2005). Accordingly, given the geographical continuity/proximity, it is safe to assume that 19876-like strains circulate in Mexico and in some geographical areas of USA.

It has been described that changes in N-glycosylation of the H protein may affect neutralization by antibodies and replication in vitro (Harder et al. 1996; Iwatsuki et al. 1997; Lan et al. 2007). On the basis of the partial H protein fragment, the Mexican strains differed from the vaccine strain Ondesterpoort, since they retained N-glycosylation potential sites that are conserved in the majority of field CDVs and that are missing in strain Ondesterpoort.

In conclusion, by phylogenetic analysis of CDV strains, we established a system to distinguish CDV genovariants. We applied this system to investigate the epidemiology of CDV in a Mexican region, individuating several CDV genovariants, highly related to each other, but yet distinguishable. The Mexican genovariants were found to group with a unique North-American CDV strain, genetically unrelated to other North-American viruses. Although the impact of CDV genetic/antigenic heterogeneity is not clear, gathering sequence information on CDVs is useful to understand the epidemiological patterns and global ecology of the virus.