Introduction

About 257 million people are living with hepatitis B virus (HBV) infection worldwide, making this a global public health concern. HBV is a small, partially double-stranded DNA virus that causes acute and chronic hepatitis in humans. It has been classified phylogenetically into nine genotypes (A-I), with a putative tenth genotype, "J", isolated from a single individual. There are also several subgenotypes [1, 2].

HBV genotypes have a distinct geographical distribution: genotype A is predominant in Africa, Southeast Asia, Europe, the Americas, and Japan; genotypes B and C are common in the Pacific region of Asia; genotype D is predominant throughout the world, mostly in Africa, Europe, the Mediterranean region, and India; genotype E is restricted to West Africa; genotype F is found in South and Central America; genotype G has been found in France, Germany, and the Americas, mainly among men who have sex with men; and genotype H is detected in Central America and Mexico [1, 3, 4]. More recently, genotypes I and J were identified in Vietnam and Japan, respectively [5,6,7]. In Brazil, according to a recent study using samples from five different geographic regions of the country, seven HBV genotypes (A–G) were found circulating, but genotype A was the most prevalent (58.7 %), followed by genotypes D (23.4 %) and F (11.3 %) [8].

In Brazil, peginterferon alfa-2a and the first-line nucleos(t)ide analogues (NAs) entecavir and tenofovir are available for treatment. In rare cases, the long-term use of NAs may lead to the emergence of resistant strains, with increasing signs of liver injury, cirrhosis, and hepatocellular carcinoma [9].

Polymorphisms in the S gene, which partially overlaps with the viral polymerase gene, are frequently associated with escape from vaccine-induced immunity, impaired HBsAg detection, and a poor response to immunoglobulin therapy [10,11,12]. HBV DNA can be detected and quantified using oral fluid samples [13,14,15]. These samples are easier to collect, the procedure is less invasive and painless, and the risk of exposure to needles is eliminated [16, 17]. Therefore, oral fluid samples may be useful in molecular epidemiology studies in children, the elderly, and hard-to-reach populations, allowing broader population adherence to be achieved compared to blood sampling [18].

To our knowledge, no study has evaluated the applicability of using oral fluid for HBV DNA sequencing, detection of resistance/escape mutations, and phylogenetic analysis. In this study, we evaluated the usefulness of oral fluid samples for HBV genotyping, detection of S/polymerase mutations, and analysis of HBV subpopulation diversity compared to the gold standard, serum samples.

Material and methods

Study population

A total of 18 paired oral fluid and serum samples from chronically HBV-infected individuals (positive HBsAg in serum) were included in this study. They were recruited at public health centers in South/Southeast Brazil from July 2013 to December 2015. Informed consent was obtained from all participants prior to sample collection. The study was approved by the ethics committee (number CAAE 18281313.4.0000.5248). Information regarding treatment status and biochemical markers such as alanine aminotransferase (ALT), aspartate aminotransferase (AST), and gamma-glutamyltransferase (GGT) were obtained.

Sample collection

Blood samples were collected by venipuncture, and oral fluid samples were obtained using a Salivette collector (Sarstedt, Nümbrecht, Germany) as described previously [14, 19]. For oral fluid collection, patients were asked not to consume food or water for at least one hour beforehand to avoid sample contamination that could impair DNA amplification. All oral fluid samples were checked visually for blood contamination and excluded if it was observed.

Detection of biochemical and serological markers of HBV infection

A 12-h overnight fasting blood sample was drawn to determine serum levels of alanine aminotransferase (ALT), aspartate aminotransferase (AST), alkaline phosphatase (ALP), total bilirubin, and γ-glutamyltransferase (GGT), using an automated immunochemiluminometric assay (ICMA) (Liason, Diasorin, Varceli, Italy). Serum samples were tested using commercial enzyme immunoassays (ELISA) for the detection of HBsAg (BioElisa HBsAg 3.0, BioKIT, Spain), total anti-HBc (BioElisa anti-HBc, BioKIT, Spain), anti-HBc IgM (ETI CORE IgMK Plus, DiaSorin, Italy), anti- HBs antibodies (BioElisa anti-HB, BioKIT, Spain), HBeAg, and anti-HBe (e411 Cobas, Roche Diagnostics, Mannheim, Germany) according to the instructions from each manufacturer. Samples were also tested for antibodies against hepatitis C virus (anti- HCV) (Murex anti-HCV 4.0, DiaSorin, South Africa) and human immunodeficiency virus (anti-HIV) (DS-EIA-HIV-AGAB-SCREEN, RPC, Diagnostic System, Russia).

Amplification of HBV DNA from serum and oral fluid samples

HBV DNA in serum samples was quantified using an Abbott RealTime HBV assay (Abbott Laboratories, USA). HBV DNA was extracted from serum samples using a High Pure Viral Nucleic Acid Kit (Roche Diagnostics, Mannheim, Germany) and from oral fluid using an RTP® DNA/RNA Virus Mini Kit (Stratec, Germany) following the manufacturer’s instructions except that the oral fluid sample volume was increased by twofold (n = 400 µL) [19]. For HBV amplification, optimized PCR, using oligonucleotide primers specific for the S/polymerase gene, resulting in an amplicon of ~900 base pairs [20], was done as described by Portilho et al. [14].

Sequence analysis

Sequence analysis of positive serum and oral fluid samples was performed using PCR products after purifying them using a QIAquick Gel Extraction Kit (QIAGEN, Hilden, Germany). Direct nucleotide sequencing was done in both directions using a Big Dye Terminator Kit (version 3.1, Applied Biosystems, Foster City, CA, USA) and external and internal primers [20]. Sequencing reactions were analyzed on an ABI3730 automated sequencer (Applied Biosystems). MEGA 7.0 software was used to align and analyze nucleotide sequences and to construct a phylogenetic tree by the maximum-likelihood method. The reliability of the phylogenetic tree was assessed by bootstrap test (1000 replicates).

Consensus sequences of each HBV isolate (serum and oral fluid) were submitted to a web-based computer program for subtyping and prediction of phenotypic resistance mutations in the polymerase gene (RT mutation) and vaccine escape mutations in the S gene (Max-Planck-Institut für Informatik, Germany, at http://hbv.geno2pheno.org/index.php).

Statistical analysis

Descriptive statistical analysis was performed by calculating the mean and standard deviations. The influence of laboratory parameters on the probability of obtaining an HBV sequence was evaluated using an unpaired t-test with Welch correction for continuous variables and Fisher's exact test for categorical variables. Statistical analysis was determined using GraphPad InStat software.

Results

Study population

A total of 18 individuals were evaluated. Most were male (12/18; 66.7%), and the mean age was 42.72 ± 14.14 years. Four individuals reported previous HBV treatment (one patient was treated with tenofovir, two with entecavir, and one with tenofovir + lamivudine during pregnancy).

Biochemical and serological tests

None of the subjects had a high ALT serum level (based on reference value higher than 41 U/mL). The median ALT level was 18 U/mL, ranging from 7 to 42 U/mL. For AST levels, the median was 22.6 U/mL, ranging from 8 to 52 U/mL. For total bilirubin, the median was 0.17 U/mL (0.01 to 2.25 U/mL), for alkaline phosphatase, it was 47.5 U/L (22 to 321 U/mL), and for GGT, it was 25.5 IU/L (8 to 1059 U/mL). Regarding the serological results, all serum samples were anti-HBc total (+), one was anti-HBc IgM (+), 10 were HBeAg (+)/ anti- HBe (−), eight were anti-HBe (+), and none were anti-HBs (+). None of the samples were anti-HCV (+), and two were anti-HIV (+) (Table 1).

Table 1 Demographic, clinical, and laboratory characteristics of HBV serum samples

Molecular tests and genotyping

The mean HBV DNA viral load in serum was 6.1 ± 2.3 log IU HBV DNA/mL, and genotype distribution was as follows: A, 13/18; D, 2/18; E, 1/18; and F (2/18). Regarding subgenotype distribution, among the 13 patients with genotype A, 11 (84.6%) were classified as subgenotype A1 and two as subgenotype A2 (15.4%). The genotype D and F isolates were classified as D3 and F2, respectively. The HBV genotype distribution is shown in Table 1.

In the 13 patients with genotype A, the mean viral load was 5.59 ± 1.927 log UI HBV DNA/mL, and six of them were HBeAg(+)/anti-HBe(−). Among the patients with genotypes D (n = 2) and F (n = 2), one with each genotype was HBeAg(+)/anti-HBe(−). The mean viral load was of 7.87 ± 0.78 log UI HBV DNA/mL for genotype D and 6.35 ± 0.09 log UI HBV DNA/mL for genotype F. The HBV genotype E sample had a serum viral load of 8.67 log UI HBV DNA/mL and was HBeAg reactive. Demographic, laboratory, and clinical characteristics of the paired oral fluid and serum samples are shown in Table 1.

Of the 18 oral fluid samples with detectable HBV DNA, it was possible to sequence and analyze an HBV S/pol gene fragment from 10 samples. A comparison between the oral fluid samples that could and could not be sequenced is shown in Table 2. In the oral fluid samples that were successfully sequenced, genotype A was the most frequent (6/10; 60%), with four classified as subgenotype A1 and two as A2. The other oral fluid samples were classified as genotype D (subgenotype D3) (n = 2), E (n = 1), and F (subgenotype F2) (n = 2) (Fig. 1). All samples showed high sequence similarity to the corresponding serum samples, with pairwise genetic distances between 0.00 and 0.023 (Table 3) and 100% agreement in the genotype/subgenotype classification.

Table 2 Laboratory parameters of oral fluid samples according to HBV sequencing
Fig. 1
figure 1

Phylogenetic tree comparing serum and oral fluid sequences. The phylogenetic tree, constructed by the maximum-likelihood method, incorporates sequences from 10 paired serum and oral fluid samples from this study and 99 reference sequences retrieved from the GenBank database. Symbols in the tree indicate the serum and oral fluid pairs.

Table 3 Mutations in HBV polymerase and surface proteins found in paired serum and oral fluid samples

Mutations

It was possible to identify amino acid mutations in the polymerase and/or S gene in all 18 HBV sequences from serum and in all 10 from oral fluid. No primary antiviral resistance mutations were found. The self-compensatory V173L mutation was detected in one treatment-naïve patient. HBV polymerase/S gene polymorphisms, when present, were found in both the serum and oral fluid samples from the same patient (Table 3). However, amino acid substitutions were found in greater numbers in oral fluid samples.

The most frequently detected polymorphisms in the polymerase were 122H/Y, M129L, V163I, and I253V, which were observed in 12, 13, 11, and 9 serum samples, respectively, and in their paired oral fluid samples, when available.

One or more escape mutations were detected in the S gene of five serum samples and four paired oral fluid samples. Most of these isolates had amino acid substitutions within the ‘a’ determinant region (aa 124-147) of the major hydrophilic region (MHR). The mutation Y100C was observed in two subjects. Furthermore, the mutations 109V, 126N, M133T, 144A, and 145R were found in one isolate each.

Discussion

Previous studies have suggested that oral fluid samples are a safe, less-invasive, and economical alternative to serum samples for molecular surveillance of infectious diseases [18]. This study demonstrated the usefulness of oral fluid for sequencing purposes, for genotyping, and for identification of clinically relevant mutations in HBV S/Pol.

In this study, a concordance of 100% in genotype/subgenotype classification between paired oral fluid and serum samples was observed. HBV genotypes A, D, E, and F were detected. Phylogenetic analysis revealed that most of the samples (61.1%) were classified as subgenotype A1, clustering with other Brazilian samples in the HBV/A1 Asia-American clade [21]. HBV subgenotypes A2, D3, and F2 were also detected in a proportion of 11.1% each and were phylogenetically related to samples from Belgium, Italy, and Venezuela, respectively. Subgenotypes A1, A2, D3, and F2 are highly prevalent in Brazil and appear to be linked to the different ethnic groups (Africans, Europeans, and Amerindians) that contributed to the current Brazilian demographic composition [8, 22]. Genotype E (5.6%) was detected in one patient who was born in Angola (Africa) but had recently moved to Brazil. This genotype had been reported previously in Brazil, being mostly associated with immigrants or refugees from East Africa [8, 22], a region that has a high prevalence of genotype E strains [23]. The successful detection, not only of Brazilian endemic genotypes but also of an imported genotype, demonstrates that the use of oral fluid for sequencing applications may be effective for molecular analysis of genetically heterogeneous populations.

It has been suggested that intra-host HBV subpopulations may evolve into quasispecies according to the selective pressure imposed by the infected cellular reservoirs [24]. In this study, the sequences for paired serum and oral fluid samples were over 99.8% identical. These results suggest that the HBV subpopulations present in lymph nodes and oral mucosa (oral fluid samples) strongly reflect those circulating in the blood. However, molecular investigations using liver biopsies would be useful to determine if serum and oral fluid accurately represent the major viral subpopulation in the liver. To our knowledge, this is the first study comparing HBV subpopulation characteristics present in serum and oral fluid samples. Since the diversity of HBV quasispecies may fluctuate depending on the cellular reservoir, managing viral subpopulations at extrahepatic sites may be the key to a successful antiviral therapy.

It has been demonstrated that both host immune background and antiviral selective pressure can drive HBV evolution by boosting the emergence of mutations in the Pol/S gene [25]. In this study, HBV sequence analysis identified amino acid mutations in all 18 HBV serum samples and 10 oral fluid samples in both the polymerase and S genes. Most of the polymorphisms occurred in the MHR of HBsAg, which encompasses the major B and T cell epitopes and is an important region for binding of neutralizing antibodies [12, 26].

HBsAg is a target for immune system cells and acts as a template for vaccine-mediated immunity. Therefore, mutations in the MHR may seriously impact vaccine efficacy and facilitate immune evasion [10, 12]. In this study, HBV quasispecies with the MHR amino acid mutations Y100C, L109V, M133T, D144A, and G145R in the S gene were detected in eight serum/oral fluid sample pairs (Table 3). These mutations are frequently associated with vaccine escape, failure in HBsAg detection, and a poor response to immunoglobulin therapy [11, 12, 27]. Despite these mutations, HBsAg detection in the samples from this study using a commercial assay was not impaired. Similar results were observed by Avellon and colleagues [11] in a study of Spanish chronic HBV carriers. Despite the detection of escape mutants in 12.5% of the subjects, detection of HBsAg was not suppressed in the three commercial assays evaluated by the authors. As suggested previously, the impact of a specific mutation on reactivity in a diagnostic test might be affected by other polymorphisms present in the sample [11]. In this context, the existence of other mutations that may have secondary effects on HBsAg detection should be investigated further.

The impact of HBsAg mutations in the epidemiological setting is currently unclear. Although no influence on diagnostic assays was observed in this study, the effects of these polymorphisms, as reported by others [11, 27], may represent a challenge in HBV vaccine efficacy and in the outcome of immunotherapy. Although four individuals were undergoing antiviral therapy, no primary antiviral resistance mutations were detected. In contrast, the rtV173L mutation, which is a self-compensatory mutation associated with enhanced viral replication in lamivudine-resistant patients [28, 29], was found in a treatment-naïve patient. Studies have demonstrated that the use of ??tenofovir disoproxil fumarate?? (TDF) was strongly associated with undetectable/low HBV DNA levels [30]. However, the two patients from this study who were previously treated with TDF had surprisingly high viral loads (Table 1), and no relationship with HBe/anti-HBe seroconversion was observed.

This study has some limitations, such as the small sample size, the lack of data regarding HBV subpopulations in the liver, and the unavailability of cloning or next-generation sequencing data for the HBV subpopulations present in the serum and oral fluid samples that could shed light on the overall diversity of HBV quasispecies present in different cellular reservoirs. In addition, sequence data from oral fluid could not be obtained from the samples with the lowest viral loads (generally HBeAg negative), preventing inferences about sequencing accuracy in these samples.

In conclusion, this study demonstrates the usefulness of oral fluid samples in tracking HBV mutations, genotyping, and phylogenetic analysis compared to the gold standard, serum samples. Since collection of oral fluid is a safe, convenient, and non-invasive method, its use may be an important tool in molecular epidemiology studies, especially in remote areas and hard-to-reach populations.