Introduction

Hepatitis B virus (HBV) infection is still a serious public health problem, and more than 350 million people are chronic carriers of HBV [23]. In the Global Burden of Disease study in 2013, HBV infection caused 686,000 deaths per year [30]. Universal vaccination for HBV is currently the most cost-effective way to protect against HBV infection and can significantly decrease the prevalence of HBV infection and the occurrence of childhood HCC and fulminant hepatitis [22, 26].

Recent reports have indicated that escape mutations within the major hydrophilic region (MHR) of hepatitis B surface antigen (HBsAg) are related to detection failure, vaccine failure and hepatitis B hyperimmune globulin (HBIG) escape [5, 46, 53], leading to occult HBV infection (OBI) [4, 6, 31, 35, 36, 41, 47, 53] or the breakthrough of HBV in liver transplant patients and in children born to HBsAg carriers despite prophylaxis with hepatitis B vaccination and HBIG [9, 11, 43, 46, 51].

Continued monitoring is necessary to determine whether the prevalence of these escape mutants is increasing. However, information on the prevalence of point mutations in the MHR of HBV strains in southern China over the past decade is limited. Therefore, in this study, we investigated the prevalence and clinical characteristics of escape mutations within the MHR of hepatitis B virus in patients in Dongguan, southern China.

Materials and methods

Patients and serum samples

Between June 2015 and May 2016, 391 eligible serum samples were obtained from HBV patients in Dongguan Eighth People’s Hospital. Participants were included in the study if they met all of the following inclusion criteria: 1) subjects had HBsAg for at least six months; 2) patients were not infected with hepatitis A virus, hepatitis C virus, hepatitis D virus, syphilis or human immunodeficiency virus; and 3) all participating subjects or their guardians gave signed informed consent for their samples to be used in the study. The study protocol conformed to the Declaration of Helsinki, and it was approved by the Ethics Committee of Dongguan Eighth People’s Hospital.

Serologic test for HBV markers

HBV serological markers were detected using the chemiluminescent microparticle immunoassay (CMIA) technique with an ARCHITECT i2000 automatic analyzer (Abbot Laboratories, IL, USA). Commercially available kits were purchased from Abbott Laboratories. Samples with HBsAg >0.05 IU/ml and anti-HBs >10 mIU/ml were defined as positive. In the current study, all 391 samples were positive only for HBsAg. Serum alanine aminotransferase (ALT) levels were detected using a Beckman Coulter AU5800 analyzer (Beckman Coulter, CA, USA), using commercial kits purchased from Beckman Coulter.

HBV DNA quantification

Viral DNA was extracted and quantified using commercial kits (Biotgene, Wuhan, China) according to the manufacturer’s instructions.

PCR amplification and sequencing

PCR fragment amplification and sequencing of the full-length S gene were performed as follows: The primer sequences were synthesized by Sangon Biotech Company (Shanghai, China). PCR was carried out in a 50-μL reaction mixture containing 2 μL of HBV DNA template, 0.3 μM each primer, 0.4 mM each dNTP, 25 μL of PCR buffer and 1 μL of KOD FX polymerase (TOYOBO, Japan). The PCR reaction was performed at 94 °C for 2 min, followed by 35 cycles at 98 °C for 10 s, at 56 °C for 1 min, and at 68 °C for 1 min 45 s for outer primers or 50 s for inner primers, and incubation at 72 °C for 5 min. The outer upstream primer was 5’-GGGTCACCATATTCTTGGGAAC-3’ (nt 2814-2835), and the outer downstream primer was 5’-GGGGGTTGCGTCAGCAAACAC-3’ (nt 1180-1200). The inner upstream primer, which was also a sequencing primer, was 5’-ACTTTCCTGCTGGTGGCTCC-3’ (nt 51-70), and the downstream primer was 5’-CATATCCCATGAAGTTAAGG -3’ (nt 867–886). The PCR products were purified and sequenced directly on an ABI 3730xl DNA Analyzer (Applied Biosystems, Foster City, CA) after reaction with BigDye Terminator v3.1 (Applied Biosystems, Foster City, CA). Sequence analysis was conducted using MEGA6.0 [44]. HBV genotyping was conducted by phylogenetic analysis of the sequence and was confirmed by >96% similarity of the surface gene sequences with the reference sequences [2]. The HBV reference sequences in the study were D00330 for genotype B and AB033556 for genotype C. The escape mutations associated with detection failure, occult hepatitis B virus infection (OBI), or vaccination and hepatitis B hyperimmune globulin (HBIG) breakthrough were analyzed as described previously by Echevarria et al. [8], Zhu et al. [53] and Lazarevic [20]. HBV quasispecies were defined as a mixture of wild-type and variant strains in the viral population, resulting in two peaks, including “inner small peaks” in the sequencing results.

Statistical analysis

Statistical analysis was performed using SPSS 22.0 software (Chicago, IL, USA). χ2 or Fisher’s exact test was used to compare categorical data. Quantitative data were compared using Student’s t-test. All tests were two-tailed, and a P-value <0.05 was considered statistically significant.

Results

Characteristics of patients

Overall, 391 hepatitis B patients were enrolled in this study. The mean age of the patients was 44.1 ± 15.0 (range, 4 to 85) years, and 227 (58.1%) patients were male (Table 1). In total, 240 cases (61.4%) were grouped in genotype B, which indicated that genotype B was more prevalent in Dongguan, southern China. Interestingly, a significant difference in gender distribution was discovered when comparing genotypes B and C, and there were more male patients in the genotype B group (P = 0.025) (Table 1). For 12 samples, a negative qPCR result was obtained. HBV DNA was quantified in 379 cases (Table 1), and no difference was discovered between genotypes B and C (P = 0.792). The median serum ALT level of all patients was 36.0 (23.0-70.9) U/L in the study group, and there was no difference between genotypes B and C (P = 0.434).

Table 1 Demographic data and S gene variability of HBV in carriers infected with different genotypes

Genetic diversity within the MHR among genotype B and C strains

Among the 391 HBsAg-positive patients, 96 were infected with viruses with (24.6%) harbored aa substitutions within the MHR, and the prevalence of mutants within the “a” determinant (124-147) was 48 out of 391 (12.3%). No significant aa substitution diversity was observed within the MHR (99-169) of the surface gene between genotype B and C isolates (0.42 vs. 0.49 substitutions per 100 aa, the same as below, P = 0.345), within the “a” determinant (aa 124-147) (0.59 vs. 0.66, P = 0.665), in MHR3 (aa 124-137) (0.83 vs. 0.85, P = 0.943), in MHR4 (aa 138-147) (0.25 vs. 0.40, P = 0.607), or in MHR5 (aa 148-169) (0.25 vs. 0.12, P = 0.201) (Table 1). Mutations were discovered more frequently in MHR1 (aa 99-119) (0.24 vs. 0.76, P = 0.001) in genotype C strains. However, variants were more frequently found in MHR2 (aa 120-123) (1.25 vs. 0.17, P = 0.021) in genotype B isolates.

The most frequently mutated position was at s126 (17/391; 4.3%), followed by s100 (13/391; 3.3%), s101(11/391; 2.8%), s133 (11/391; 2.8%), s145 (9/391; 2.3%), s120 (8/391; 2.0%) and s129 (7/391; 1.8%) (Table 1). Mutations at position s120 were found only in genotype B strains (8/240; 3.3%; P = 0.026) (Table 1), and mutations at position s101 were found only in genotype C isolates (11/151; 7.3%; P < 0.001) (Table 2). This explains the significant difference within MHR1 and 2 between genotypes B and C.

Table 2 Prevalence of substitutions within the MHR in genotypes B and C

In addition, when considering the prevalence of specific point mutations in genotypes B and C, the mutations sY100C, sQ101R/K, sS114A, sP120T, sT/I126A/N/S, sQ129R, sM133L/T/S and sG145R/A were prevalent in at least one genotype, with a frequency higher than 1%, which indicated that these mutations were relatively common. Furthermore, sQ101K/R was found only in genotype C (P < 0.05), and sT/I126A was found only in genotype B (P = 0.047), which indicated that such mutations were genotype-associated mutations.

Interestingly, HBV quasispecies variants were found in two genotype B isolates (2/240; 0.8%) and three genotype C isolates (3/151; 2.0%). All of these sites are marked in yellow in Fig. 1. These strains contained variants in coexistence with the wild-type strains, resulting in two peaks including “inner small peaks” in the sequencing results (Fig. 2).

Fig. 1
figure 1

Amino acid substitutions within the MHR (aa 99-169) in CHB patients. The reference amino acid sequences of genotypes B and C are deduced from D00330 and AB033556, respectively. The capital letters represent amino acids, and letters highlighted in yellow represent variants in combination with the wild-type sequence. Letters highlighted in blue represent genotype-related amino acid variations, which are not considered mutations. Letters highlighted in red represent insertions (color figure online)

Fig. 2
figure 2

The sequencing results of HBV quasispecies strains harboring a mixture of wild-type and variant sequences. A, M133L; B, G145A; C, S114A; D, P120T; E, Y100C

Combinations of two or more mutations within the MHR were also found in genotype B (5.0%) and C (6.6%) strains (Table 1), with no significant difference (P = 0.498). In genotype B strains, sY100C/Q129R/G145A, sY100C/E164G, sY100S/M133L, sM103I/T123A, sS114K/T116N/S132Y, sP120T/M133L, sP120S/K160N, sP120T/A157G, sP120Q/T126A, sK122I/P127T/G130N and sD144A/A157G were found (Fig. 1). In genotype C isolates, sY100C/Q101K, sY100C/I126N/M133T, sL104F/ G145R, sL109P/A128V/Q129K, sS114A/G145A, sG130K/A166G, sT131N/Q101K/I126S/M133T, sT131N/sM133T/F134L and sM133S/G145A and sT131N/M133S were found (Fig. 1).

Additionally, we found two strains of HBV genotype B with an insertion of “GATCTA” between nucleotide positions 341 and 346, which affected the coding of all amino acids after s113 and were thus not counted in the number of point mutations.

Discussion

Escape mutations located in the major hydrophilic region (MHR) of HBsAg have been widely reported worldwide. Previous studies demonstrated that the prevalence of point mutations within the MHR among hepatitis B patients was 14.8% in Argentina [32], 15% in Morocco [17], 15.4% in Poland [12], 14%-32.8% in Iran [28, 29, 33], 22.6% in Serbia [19], 27.8% in France [37], and 44.1% in South Korea [21]. MHR mutations could lead to diagnostic problems, occult HBV infection, the emergence of vaccine-escape mutants, and failure of HBIG therapy [1, 10, 13, 20, 39].

In China, the overall frequency of MHR mutations among HBV genotype C strains was 46.6% in Beijing, northern China [40]. However, limited information on the prevalence of point mutations within the MHR has been reported in southern China over the past decade. We examined the sequences of HBsAg variants within the MHR from patients with hepatitis B in Dongguan, southern China.

In the current study, 24.6% and 12.3% of the strains harbored aa substitutions within the MHR and “a” determinant, respectively, which indicated that the presence of HBsAg escape mutations among HBV-infected patients in China should be monitored.

Our results demonstrated that the position s126T/I was the most frequent mutation site, with the amino acid replaced by Ala (A) (7/391; 1.8%), Ser (S) (7/391; 1.8%) and Asn (N) (3/391; 0.8%). Previous reports demonstrated that the binding reactivity to anti-HBs of HBsAg with s126I, s126T and s126A was comparable, but HBsAg with sT126S had a significantly lower antigenicity, which might lead to immune escape in chronic HBV infection [34], and result in the occurrence of OBI [53] or vaccine escape [20]. In addition, a previous study showed that sT126S impaired the secretion of subviral particles and virions, both in vitro and in vivo [14]. In the current study, sT/I126S mutations were prevalent in both genotype B (2.1%) and C (1.3%), indicating that this might be a hot spot for immune selection.

Mutations were also frequently found at position s100, including sY100C (10/391; 2.6%), sY100S (2/391; 0.5%) and sY100H (1/391; 0.3%). An in vitro study concluded that the sY100C substitution alone did not play a role in reducing the amounts of HBsAg or the HBsAg affinity in a commercial ELISA assay [27]. sY100S was reported to correlate with occult infection in genotype D strains [20, 42], but not in genotypes B and C [14, 20]. Furthermore, sY100S was relatively uncommon in genotypes B (0.4%) and C (0.7%). The clinical implications of the sY100H mutation and the impact on HBsAg antigenicity are unknown and require further investigation.

Another mutational hot spot was at s101, including sQ101R (7/391; 1.8%) and sQ101K (4/391; 1%), which was only discovered in genotype C strains. sQ101R has been reported to correlate with occult infection in genotype D strains [20, 42], but not in genotypes B and C [14, 20]. sQ101R was associated with lower reactivity in HBsAg assays when combined with some mutations [39]. As far as we know, the clinical implications of sQ101K are still unknown.

Mutations at positions s120 and s129 were mainly sP120T (5/391; 1.3%) and sQ129R (5/391; 1.3%), both of which were only discovered in genotype B strains and are considered “immune escape” mutations [20, 53]. Previous studies demonstrated that amino acids within the region between 120 and 123 are crucial for the antigenicity of HBsAg [45], and that sP120T mutants exhibit impaired binding to MAbs [14], thus indicating its association with immune escape. A previous study also showed that sQ129R decreased the secretion of surface proteins and virions [14]. This mutation was frequently discovered in chronic hepatitis B patients with coexistence of HBsAg and anti-HBs [7, 18, 24]. Therefore, sP120T and sQ129R are important escape mutations in genotype B strains in China.

The prevalence of mutations at position s133 among all patients was 2.8%, including sM133L (4/391; 1%), sM133T (5/391; 1.3%) and sM133S (2/391; 0.5%). Interestingly, sM133L was only discovered in genotype B strains (4/240; 1.7%), and sM133S was only found in genotype C strains (2/151; 2.0%). Various studies have shown that sM133L is associated with escape from vaccine-induced immunity [20, 46]. sM133T showed great clinical significance in the following two aspects. First, the sM133T mutation could create an additional glycosylation site, 131NST133, in the presence of the sT131N mutation, which contributes to immune escape [46, 52]. In the current study, we discovered one strain of genotype C harboring the sT131N/M133T double mutation. Second, the sM133T mutation could rescue virion secretion for a wide range of mutants (sI110M, sG145R, sN146Q, sN146S, sR169H, and, to a lesser extent, sG119E) [15]. In our study, combinations of mutations, sY100C/I126N/M133T, sI126S/M133T and sM133T/F134L, were discovered in genotype C strains. Thus, sM133L/T should be considered an important escape mutation and studied, especially when combined with other mutations. The clinical implications of sM133S, which is relatively uncommon and rarely reported, require further study.

The prevalence of mutations at position s145 was 2.3%, including sG145R (4/391; 1%) and sG145A (5/391; 1.3%). Both of these mutations were primarily discovered in genotype C variants. Over the past several decades, sG145R has become the most widely reported vaccine-escape mutation [3, 20, 46, 53] since it was first discovered in an infant infected with HBV who also had protective titers of anti-HBs [3]. The sG145R mutation has been shown to decrease virion and HBsAg detection significantly [14], and sG145R/A has been shown to exhibit various degrees of altered binding of HBsAg in different commercial assays [1, 46]. In addition, several studies of genotypes B and C strains have shown that sG145R/A was associated with the occurrence of OBI [20]. More importantly, sG145R was stable over time and could result in horizontal transmission [5, 20]. Thus, the sG145R/A mutation in the surface gene might pose a threat to the global vaccination program in the future, and it should be closely monitored, especially in genotype C strains.

The HBV population in the host consists of genetically heterologous variants that exist in the form of quasispecies [38]. A recent report from Indonesia confirmed the presence of quasispecies variants in the MHR of HBV strains using ultrahigh-throughput next-generation sequencing (NGS)[50]. Because the direct PCR sequencing method can only detect variants representing ≥20% of the total population [16, 25, 50], the prevalence of variants in coexistence with the wild-type strains is probably higher than that determined in the current study. Previous reports have demonstrated that the secretion of mutant virions could be rescued efficiently (i.e., sE2G, sC69*, sP127S and sG145R) [48, 49] or moderately efficiently (i.e., sW172*) by co-expression with the WT surface protein [48]. The coexistence of WT strains and defective mutants might be a general survival requirement for some variants, and the clinical implications of HBV quasispecies need to be studied further.

In conclusion, the prevalence of escape mutations within the MHR of hepatitis B virus in patients in Dongguan, China, was systematically studied. The prevalence of amino acid substitutions within the MHR varied according to the HBV genotype. Our results show that monitoring the mutations within MHR is essential, especially for hot-spot escape mutations related to detection failure, vaccination escape, or HBIG therapy breakthrough.