Introduction

Bocavirus (BoV), a species from the genus Bocaparvovirus and the family Parvoviridae, is an approximately 26-nm nonenveloped capsid virus with icosahedral symmetry. It has a small linear single-stranded DNA genome comprising three open reading frames (ORFs) [1]. The first ORF encodes nonstructural proteins associated with viral gene replication (NS1), whereas the second ORF encodes two capsid proteins (VP1 and VP2); the VP2 capsid protein is the major capsid protein and plays an important role in the determination of the antigenicity and host range of BoVs [2,3,4,5]. The third ORF encodes a highly phosphorylated nonstructural protein (NP1) [6]. BoVs are found in humans and various animals, including chimpanzees, gorillas, pigs, cows, bats, dogs, cats, rats, California sea lions, and pine martens [7,8,9,10,11], and they cause gastrointestinal and respiratory diseases in their hosts. When hosts are infected with BoV, disease symptoms tend to be severe in young animals and infants, whereas subclinical infection is common in adults [8, 10]. Canine bocavirus type 2 infection is associated with massive enteritis in a litter of dogs with atrophied and fused villi, severe crypt regeneration, severe bone marrow, and lymphoid atrophy [10]. Respiratory tract symptoms and diarrhea have been reported in weanling piglets infected with porcine bocavirus [12, 13]. BoVs are related to DNA damage, which may promote viral DNA replication and mediate cell death [14].

Mink bocavirus 1 (MiBoV1), a novel species of BoV, was identified from the domestic minks in China in 2016 [15]. MiBoV1 was detected in healthy animals and animals suffering from diarrhea [15]; thus, the correlation between MiBoV1 and diarrhea requires further confirmation. Despite the availability of MiBoV sequences in clinical samples, the pathogenic role and prevalence of MiBoV1 in single or co-infection are unclear. Moreover, the genetic information, phylogenetic analysis, and codon usage pattern of this virus are limited.

In the present study, we collected 192 samples from the major mink industry province in China and detected the samples for the presence of MiBoV1 by PCR. Co-infection with MiBoV1 was also detected. Subsequently, the cloned VP2 sequences were compared between MiBoV and other known BoVs to investigate the variation of nucleotides and amino acids, and the genetic relationship of MiBoV1 to known BoV isolates was examined using phylogenetic analysis. In addition, the codon usage bias of the VP2 gene of MiBoV1 genome was measured by using bioinformatic methods.

Materials and methods

Sample collection

A total of 192 samples, including 26 sera from minks with septicemia, 39 diarrhea feces, and 127 tissues of dead animals, were collected from 23 farms in Liaoning, Jilin, Heilongjiang, Hebei, and Shandong provinces from May 2014 to September 2017. All samples originated from independent mink individuals aged 60–118 days, and the 127 tissues, including lungs, hearts, spleens, livers, kidneys, brains, etc., were screened from 127 dead animals. No duplicate samples were used (one tissue from one corpse). The details of the samples are listed in Supplementary Material Table S1.

Laboratory investigation

Genomic DNA was extracted from feces and tissues [16]. Approximately 0.1 g of each specimen was added into 500 µL of phosphate-buffered saline, homogenized, freeze thawed three times, and then centrifugated. Total genomic DNA was extracted from 200 µL of homogenate supernatant by using genomic DNA Kit (Axygen A Corning Brand, Suzhou, China) in accordance with the manufacturer’s protocol. Serum DNA was extracted by using the Serum/Plasma Circulating DNA Kit (TIANGEN BIOTECH CO., LTD.) in accordance with the manufacturer’s instructions.

According to the published protocol, BoV was detected from 192 total viral DNA by using the reference method [15]. To detect co-infection, MiBoV1-positive samples were also used to examine mink enteritis virus, mink circovirus (MiCV), and Aleutian mink disease virus (AMDV) by PCR, and the corresponding original samples were examined for canine distemper virus, canine adenovirus (CAV)-1, and CAV-2 using the colloidal gold immunochromatography strip.

Amplification and sequencing of complete VP2 gene

On the basis of the published VP2 gene sequence of the MiBoV strain (NC_030873.1), the nested primers were designed to perform PCR screening. Primers Boca-F1 (5′-AAATCGGCGTTTAGACT-3′) and Boca-R1 (5′-GGATAACAAGATATACATGAC-3′) were used for the first round of PCR, and Boca-F2 (5′-CAAACCGTAAGGGAGGAG-3′) and Boca-R2 (5′-GCTGGATATACATGACAGAAAT-3′) were used for the second round. All primers were purchased from Comate Bioscience, Changchun, China. For the nested PCR, Ex Taq DNA polymerase (TaKaRa, Japan) was used. The first round of PCR was conducted at 95 °C for 5 min, followed by 35 cycles at 95 °C for 45 s, annealing at 51.3 °C for 2 min, elongation at 72 °C for 1 min, and final extension at 72 °C for 8 min. For the second round of nested PCR, the annealing temperature was 52 °C under identical cycling conditions.

PCR products were purified using a gel extraction kit (AXYGEN, Hangzhou, China) in accordance with the manufacturer’s instructions. The purified DNA was cloned into a pMD18-T vector, the plasmids were introduced into Escherichia coli DH5α by using a standard transformation technique, and the positive clones were sent to Comate Bioscience (Changchun, China) for sequencing. At least three independent plasmid clones were analyzed and confirmed.

Genetic and phylogenetic analysis

The SeqMan program of DNAStar was used to edit and analyze the nucleotide sequences (DNAStar, Madison, WI, USA). The VP2 sequence of MiBoV1 obtained in this study had been deposited in GenBank (accession number: MK951953 to MK951962). The nucleotide and predicted amino acid sequences of complete VP2 sequences were multiple aligned between different species of BoV by Clustal W, MegAlign program of DNAStar (DNAStar, Madison, WI, USA). Subsequently, to improve our understanding of the genetic relationships between MiBoV1 and other BoVs, we constructed a phylogenetic and molecular evolution tree on the basis of the nucleotide sequences of all VP2 genes by MEGA 7.0 via the neighbor-joining method and Kimura 2-parameter model. The reliability of the phylogenetic tree obtained for the VP2 region was evaluated by running 1000 replicates in the bootstrap test (Fig. 1) [17, 18].

Fig. 1
figure 1

Phylogenetic analysis of the VP2 gene sequence of mink bocavirus and other bocavirus, using the neighbor-joining method, with 1000 bootstrap replicates (MEGA 7.0.14). Only bootstrap support values greater than 60% are shown. The bar indicates the genetic distance. Other sequences were obtained from GenBank; accession numbers of those sequences are included in the tree. The triangle represents 10 VP2 sequences in this study

Codon usage pattern analysis

The basic nucleotide composition (A%, T%, C%, and G%), AT and GC contents, nucleotide at the third position of synonymous codons (A3s%, T3s%, C3s%, and G3s%), GC content on the third synonymous codon position (GC3s), relative synonymous codon usage (RSCU), and effective number of codons (ENC) were calculated for the VP2 gene sequence of each MiBoV1 strain by using CodonW version 1.4.2 [19]. The codon adaptation index (CAI) values for MiBoV1 were calculated using Neovison vison codon usage as a reference set in the CAI calculator https://genomes.urv.es/CAIcal/ [20].

Results

Prevalence of MiBoV1

Among the 192 samples tested, 10 (5.2%) cases were positive for MiBoV1 by PCR. The positive minks aged 60–118 days were mostly characterized by vomiting, lethargy, anorexia, dehydration, diarrhea, and rough fur. Six of the 10 positive samples were from the sera of minks with septicemia. The other four samples were from feces collected from minks with diarrhea or hemorrhagic diarrhea. The prevalence of MiBoV1 varied significantly among different provinces: Hebei (10.81%) and Jilin (13.04%). No positive reaction was detected in samples from Heilongjiang, Liaoning, and Shandong provinces. The detection of co-infection indicated that five (50%) samples were positive for MiCV, and AMDV was positive in two cases (20%). In one case, MiBoV1, AMDV, and MiCV were detected simultaneously. The information of positive samples, including collection date, age, location, accession number, and other enteric pathogens detected, is listed in Table 1.

Table 1 The details about the positive samples from which the viruses were detected

Genetic characterization

To compare the mutation between nucleotides and amino acids, we analyzed 11 VP2 sequences of MiBoV1, including the reference strain (NC_030873.1), and 10 sequences with the Clustal W method using the MegAlign program (DNASTAR, Madison, WI, USA). As shown in Table 2, the nucleotide similarity varied from 96.7% to 99.9%. The lowest nucleotide sequence similarity of the MiBoV1 reference strain (NC_030873.1) and strain JL45 was 96.7%, whereas the similarity of nucleotides between strain JL25-1 and strain JL 14 was the highest (99.9%). The nucleotide sequence alignment was free of deletion or insertion. This finding suggested that the VP2 gene was highly conserved and had low viral diversity (Table 2).

Table 2 Comparison of nucleotide and amino acid sequence of the VP2 gene among bocavirus strains

The amino acid alignment revealed that some synonymous mutations were only seen in the MiBoV1 detected in this study compared with the MiBoV1 reference strain (NC_030873.1). For instance, a G to A mutation at nt positions 42, 57, 630, 669, 1125, and 1416; A to G mutation at nt positions 213, 252, 399, 441, 447, 702, and 705; T to C mutation at nt positions 141, 384, 627, 1014, 1074, and 1284; C to T mutation at nt positions 573, 651, 660, 960, 1008, 1161, 1338, and 1414; A to T mutation at nt position 1335; and T to G mutation at nt position 1458 were found in 10 MiBoV1 isolates. Four of the isolates (L58, JL35, L60, and JL45) had unique nt changes: C84G, A96G, T1038 C, and G1599A, respectively.

Table 3 shows that compared with the MiBoV1 reference strain (NC_030873.1), 11 nonsynonymous mutations, namely, A53G, G58T, G60A, T61A, A71G, G214T, G833A, T937G, G1024A, C1025G, and T1595G, led to nine amino acid mutations at positions 18 (Lys → Arg), 20 (Ala → Ser), 21 (Ser → Thr), 24 (Asn → Ser), 72 (Asp → Tyr), 278 (Ser → Asn), 313 (Ser → Ala), 342 (Ala → Arg), and 532 (Leu → Trp). These mutations were observed in all 10 MiBoV1 isolates described in this study. The strain n JL45 with the highest mutation rate possessed more amino acid changes at residues 177 (Pro → Arg), 548 (Leu → Pro), 565 (Glu → Asp), and 576 (Val → Ser). Strain L58 contained three nt changes, namely, C462A, T760C, and A1568G, which led to three amino acid mutations, that is, F154L, Y254H, and E523G. In addition, a unique A → G mutation was observed in VP2 gene at nt positions 530, 314, and 568 of strain JL45, JL35, and JL14 separately, which was different from other strains. The high variability was located at nt positions 53–214 and 833–1595.

Table 3 Comparison of VP2 nucleotide and amino acid mutations of MiBoV1 isolates reported in this study

Compared with other BoV strains, the 10 VP2 exhibited 51.9–63.7% nucleotide sequence identity. In comparing strains L60, JL14, and JL 25–1 with Canine bocavirus 2 (CBoV2), the nucleotide identity was the highest (63.7%) and that of strain JL45 and gorilla bocavirus (GBoV1) was the lowest (51.9%). The VP2 sequence of porcine bocavirus (PBoVsx and PBoV3) and four types of human bocaviruses (HBoV1–4) were compared individually with those of MiBoV1, and 54.1–60.2% nucleotide identity and 40.0–46.9% amino acid similarity were found. By contrast, MiBoV1 was most closely related to Feline bocavirus (FBoV, 63.0–63.4% nucleotide identity, and 58.6–59.1% amino acid similarity), followed by the three types of California sea lion bocaviruses (CslBoV1–3, 62.2–63.1% nucleotide identity, and 59.2–62.8% amino acid similarity) (Table S3, Supplementary Material).

Phylogenetic studies

To examine the phylogenetic relationships between MiBoV1, we constructed a phylogenetic tree on the basis of full-length VP2 gene sequences (Fig. 1). The phylogenetic analysis clearly demonstrated that MiBoV1 formed two clades. One clade included the reference sequence (NC_030873.1), and the other clade contained 10 isolates in this study. The virus showed a trend of evolutionary divergence. All MiBoV1 strains were distributed in a group with CslBoV, CBoV, and FBoV, which was consistent with the results of nucleotide comparison (Table 2), indicating that these viruses may share a common ancestor. These results were verified using the maximum likelihood and maximum parsimony methods, which yielded the same tree topology. Yang et al. performed a phylogenetic analysis on the basis of the NS1 amino acid sequences and found that MiBoV clustered with FBoV (JQ692585), CslBoV1 (JN420361), and Canine bocavirus 1 (CBoV1, JN648103) and formed a distinct clade. This result is the same with our phylogenetic tree based on the VP2 gene sequence.

Analysis of the codon usage pattern

Basic nucleotide composition analysis was conducted for the VP2 gene of MiBoV1. The mean values of A% (32.21 ± 0.09) and G% (24.06 ± 0.07) were the highest, followed by C% (22.54 ± 0.26) and T% (21.28 ± 0.08), indicating a strong compositional bias in favor of A or G. The mean values of AT% and GC% were 53.49 ± 0.12 and 46.51 ± 0.12, respectively, whereas the mean value of GC3s% was 39 ± 0.30. According to the nucleotide composition, MiBoV1 VP2 genes were AG rich. Therefore, A and G seemed to be more common than T and C at the wobble position of VP2 gene sequences. However, further analysis of third position wobble nucleotides revealed a significantly higher abundance of A3s (44 ± 0.38%) and T3s (33 ± 0.30%) than C3s (26 ± 0.23%) and G3s (24 ± 0.26%).

Among the codons of the VP2 gene of 11 MiBoV1 strains (Table 4), codons with RSCU value > 2, including CCA, AGA, and UAA, were over-represented codons. This finding showed that proline, arginine, and the terminating codon of MiBoV1 VP2 gene over-preferred CCA, AGA, and UAA, respectively. The codons UUU (Phe), GUC (Val), CAU (His), CAA (Gln), AAU (Asn), AAA (LYS), GAA (Glu), UCA (Ser), CCA (Pro), ACG (Thr), GCA (Ala), and AGA (Arg) showed similar preferences in all MiBoV1 strains. The codons preferred by each MiBoV1 strain individually were as follows. The codons solely preferred in strains JL19-1, JL36, JL45, L46, and L55 and the reference sequence (NC_030873.1) were UAU (Tyr), whereas strain L58 preferred UAC (Tyr). Codons solely preferred in MiBoV1 that originated from northern China were AUU (Ile), GAU (Asp), and UGC (Cys) for three of the amino acids, whereas the reference sequence (NC_030873.1) preferred AUC (Ile) and UGU (Cys), showing no preference for synonymous codon usage in Asp. No bias for codons encoding Tyr was detected in strains JL14, JL25-1, JL35, and L60. The RSCU values and end nucleotide composition showed that in the VP2 coding region, A/U-ended codons were strongly preferred in MiBoV1. GC-ending codons were seldom found preferred. Further analysis revealed that most amino acids in MiBoV1 VP2 protein were terminated by A/U-ending codons. For example, six different codons encoded three amino acids, namely, arginine, serine, and leucine; however, in MiBoV1 VP2 protein, all preferred synonymous codons were terminated by A-ended codons.

Table 4 RSCU analysis of mink bocavirus VP2 gene

The ENC value describes the degree to which codon usage deviates from random selection. It is inversely proportional to codon usage deviation and shows the bias from an equal use of all synonymous codons for a given amino acid. The ENC values of different MiBoV1 strains ranged from 51.64 to 52.77 with a mean value of 52.13 ± 0.4296, which indicated that high ENC values (> 40) [19] and minimal variation among different isolates suggested a moderate, but highly conserved codon usage bias in the VP2 gene of MiBoV1.

To compare the codon usage preference with respect to its host, we calculated the CAI values for MiBoV1 VP2 genes by using mink (Neovison vison) codon usage as the reference. The closer the CAI to 1, the higher the gene expression level. The CAI varied from 0.626 to 0.63, with an average of 0.63 ± 0.0016. These results indicated that the VP2 genes were slightly expressive in the host and had lower adaptability to minks.

Discussion

Despite scientific evidence proving that BoV exists in minks [15, 21], the prevalence, pathogenic role, and genetic characterizations of this virus are limited. In the present study, the detection results improve our understanding of the geographical distribution and tissue distribution of MiBoV1. The positive rates were 10.81% and 13.04% in the samples collected from Hebei and Jilin provinces, respectively. However, no positive samples were detected in the samples collected from Shandong, Liaoning, and Heilongjiang provinces. These results showed that the overall positive rate was low, and differences in the positive rate among different regions were found. In addition, MiBoV1 was detected for the first time in serum samples, not only in feces.

To further examine the tissue distribution of virus, we also tested other tissues corresponding to the positive serum samples. The tissue samples we tested included the liver, lung, spleen, heart, duodenum, jejunum, ileum, colon, brain, kidney, skeletal muscle, bone marrow, and mesenteric lymph nodes; however, BoV was not found in these tissues. This result suggested that viruses could enter the bloodstream through some mechanism to cause viremia, but why they cannot enter other tissues needs further study. Another finding showed that co-infection of MiBoV1 with Aleutian virus and circovirus existed in 60% of all MiBoV1-positive samples. These findings indicate that the virus may have only limited pathogenicity. However, considering that these samples show septicemia in clinics, the relationship between the virus and the occurrence of septicemia should be studied.

Pairwise comparative genetic variation analysis of MiBoV1 showed that the nucleotide sequence identities of VP2 between different MiBoV1 strains were 96.7–99.9%, the amino acid sequence similarity ranged from 97.7% to 99.8%, and the mutation rate was consistent with that of other parvoviruses [22,23,24]. The MiBoV1 sequence in this study had 96.7–97.2% identity with the reference strain (NC_030873.1) identified in China in 2016 [21] and had lower nucleotide (51.9–63.7%) and amino acid (40.0–62.8%) similarity compared with other BoVs of different species (Table S3, Supplementary Material). Yang et al. performed a phylogenetic analysis on the basis of the NS1 amino acid sequences and found that MiBoV clustered with FBoV (JQ692585), CslBoV1 (JN420361), and CBoV1 (JN648103) formed a distinct clade. This finding was consistent with the results of VP2 gene sequence analysis in this study.

The VP2 capsid protein of BoV plays an important role in viral pathogenicity and host range of BoVs and includes major antigenic determinants. Thus, it can be used as an antigen for detection methods and in designing new generation vaccines. Through amino acid comparison analysis, we found that MiBoV1 had two evident hypervariable regions. One located at amino acid position 278–532 was consistent with parvoviruses [18, 22, 23]. The other hypervariable region at amino acid position 18–72 was distinctive. In addition, some strains had unique amino acid point mutations. The strain JL45 had amino acid changes at residues 177 (Pro → Arg), 548 (Leu → Pro), 565 (Glu → Asp), and 576 (Val → Ser). Strain L58 exhibited three nt changes C462A, T760C, and A1568G, which resulted in three nucleotide mutations, namely, F154L, Y254H, and E523G. A unique A-G mutation at position 314 of JL35, position 530 of strain JL45, position 1568 of strain L58, and position 568 of JL14 was different from other strains. At present, the viral diversity was low, and the VP2 sequence of MiBoV1 was highly conservative. Therefore, the conservation of VP2 gene indicates that the MiBoV1 has maintained a certain genetic stability, which will lay a foundation for vaccine research.

The codon usage pattern in viruses can offer insights into improving our understanding of the evolution and pathogenesis of some viruses. The codon usage bias affects the structure and function of protein. In our study, we analyzed and compared the codon usage data of VP2 in MiBoV1. The results of codon usage analysis showed that MiBoV1 is AG rich, and most of the preferred codons terminated with either an A or U nucleotide. The unequal usage of A3s/T3s and G3s/C3s nucleotides in AG-rich VP2 genes in the present study indicated that the compositional patterns of the MiBoV1 VP2 genes were more complex than the commonly observed GT- and/or AG-rich compositions of most virus genes (Table S2, Supplementary Material). Our analysis also showed that although some differences exist in codon usage deviations among different isolates of MiBoV1, the average ENC, CAI, and GC3s values showed that codon usage bias in VP2 genes in MiBoV1 was less. The results were identical to the previously reported data on HBoV and other parvoviruses [25, 26]. Weak codon bias may be an adaptive strategy adopted by a great quantity of RNA and DNA viruses to not compete for limited tRNA resources and to maintain the efficient replication of viruses [27, 28].

To study the usage pattern of synonymous codons and the preferred degree of A/T termination codons, we calculated the RSCU of each codon in the VP2 gene of MiBoV1 (Table 4). Studying the use of synonymous codons in viruses can reveal the molecular evolution information of individual genes, which may help us understand the regulation of viral gene expression [27, 29]. In most studies, mutation bias is the major factor causing codon usage changes in genes with high A + T or G + C content [30,31,32,33]. A more comprehensive analysis is needed to reveal the true extent of codon usage bias variation within and among MiBoV1 isolates and what other factors are responsible, as well as the effects of factors such as cell tropism, principal host species, method of transmission, and viral genetic structure [26]. This information will enable us to more accurately determine the relative importance of mutation pressure and natural selection in determining the base composition and codon usage of these pathogens.

The relationship between MiBoV1 and diseases needs to be established. In this study, we only detected common viral pathogens, and one limitation is that no bacterial or other pathogens were tested in our experiment. Therefore, further studies are needed to identify the economic importance and pathogenicity of this emerging animal virus.

Conclusions

To our knowledge, this is the first study to find and characterize MiBoV1 from serum. The results indicated that MiBoV1 had a low prevalence in the farm in Jilin and Hebei provinces. In addition, our study found divergence over the evolution of MiBoV1, and nucleotide and amino acid sequence analysis of MiBoV showed overall stability and individual variation. The codon usage bias of the VP2 gene in MiBoV1 was determined by bioinformatics method, which showed that the MiBoV1 VP2 gene was more likely to use codons that ended with A or U, and no obvious codon usage bias was found. The RSCU values and end nucleotide composition showed that in the VP2 coding region, A/U-ended codons were strongly preferred in MiBoV1. However, the pathogenic role of MiBoV1 in single or polymicrobial infections must be determined, and the economic importance of this virus is yet to be elucidated. Additional detailed studies must be conducted.