Introduction

Chronic hepatitis B virus (HBV) infection affects approximately 250 million people worldwide [1]. The implementation of a universal hepatitis B immunization program in Taiwan in 1984 has successfully decreased the sero-prevalence of HBV surface antigen (HBsAg) in vaccinated birth cohorts from 13.7% to less than 1% [2, 3]. Nevertheless, approximately 10% of infants born to highly infectious mothers experience breakthrough HBV infections in spite of hepatitis B immunoglobulin (HBIG) within 24 h after birth and complete HBV vaccine in infancy [3]. Mother-to-infant transmission of HBV accounted for almost 40% of all new HBV infections before the HBV immunization program [4], and have accounted for 75–90% of all infections since then [5, 6]. Hepatitis e antigen (HBeAg) is a marker for active HBV replication and high infectivity [7]. HBeAg positivity [8] or high maternal viral load [9], vaccine-escape mutants [10], and intrauterine infections [11] are the main causes of HBV vaccine failure. Complete eradication of HBV infections requires identification of vaccine failure factors in infants born to highly infectious mothers.

The sodium taurocholate co-transporting polypeptide (NTCP), encoded by the solute carrier family 10 member 1 gene (SLC10A1), is the major hepatocellular sodium-dependent uptake system for conjugated bile acids in human livers [12]. It binds and transfers bile acids from the blood to liver parenchymal cells, and acts as a hepatocellular entry receptor for HBV by interacting with the N-terminus of the HBV large-envelope protein [13]. Its expression at 14–23 weeks of gestation tends to be low (2–4% of adults) [14, 15]. Nevertheless, about 12% of human fetus primary hepatocytes can be infected with HBV [16]. It remains unclear whether NTCP expression in fetal/neonatal livers determines the risk for mother-to-infant HBV transmission.

The association between single nucleotide polymorphisms (SNP) in the NTCP gene and HBV susceptibility and disease progression has been studied extensively. An association between the TT genotype of the SNP rs7154439 (c.-1956 C > T) at 5′ upstream non-coding region of the NTCP gene and HBV clearance has been reported in southwestern Chinese Han individuals [17], but not in central Chinese Han [18], Tibetans, or Uyghurs [19]. The SNP rs4646285 AA genotype (c.225 G > A, p.T75T) in exon 1 of the NTCP gene is more prevalent in individuals who spontaneously recovered from HBV infections [20]. High linkage disequilibrium (LD) between rs7154439 and rs4646285 has been observed in Chinese Han individuals [17]. It has also been reported that the A allele of the SNP rs4646287 (c.356 + 702 G > A) in intron 1 of the NTCP gene is common in East Asians (11.31% and 12.35% in Chinese and Japanese populations, respectively) but absent in Africans and Europeans [21]. A protective effect of the rs4646287 GA/AA genotype against HBV infections has been suggested in eastern Chinese Han [21]. Notably, a protective role of Asian-specific SNP rs2296651 (c.800 C > T, p.S267F) against chronic HBV infections, acute-on-chronic liver failure, and HCC has been suggested [22,23,24]. However, the role of NTCP and its genetic variants in HBV vaccine failure has not been addressed yet. In this study, we selected subjects born to HBsAg and HBeAg double-positive mothers, and investigated their genotypes for SLC10A1. We investigated the association between NTCP gene polymorphisms and HBV vaccine failure and the long-term consequences of chronic HBV infection.

Methods

Subjects

White blood cell (WBC) DNA samples were obtained from two cohorts to study NTCP genetic variants: 170 carrier children with chronic HBV infection and 138 non-carrier controls. Children in both cohorts were born to HBsAg and HBeAg double-positive mothers. Table 1 lists the characteristics of the studied subjects and the maternal HBV status. All of the children received HBIG at birth and three to four doses of HBV vaccine before reaching the age of 7 months. Chronic HBV-infected patients were followed up longitudinally every 6 months, or every 1–3 months in case of elevated serum levels of alanine aminotransferase (ALT) and aspartate aminotransferase (AST). The study protocol was approved by the Institutional Review Board of National Taiwan University Hospital (NTUH IRB 201512129INB). All of the specimens were collected with parental and/or patient consent.

Table 1 Characteristics of the non-carrier control and HBV-infected carrier children of HBsAg and HBeAg double-positive mothers

Specimens

For gene expression analyses, NTCP genetic variants were detected in adult WBC DNA samples from non-HCC patients with chronic HBV infection (N = 100) or non-HBV/non-hepatitis C virus (HCV) infection (NBNC) (N = 100). A total of 82 RNA samples, paired to the genotyped WBC DNA samples, were extracted from the non-tumor liver tissues of non-HCC patients (those with hemangiomas, focal nodular hyperplasia, angiomyolipomas, cirrhosis, or other liver diseases). WBC DNA and total RNA samples were provided by Taiwan Liver Cancer Network, National Health Research Institutes Biobank.

Selected SNPs and genotyping by restriction fragment length polymorphism (RFLP) and sequencing

Four SNPs of the NTCP gene, rs7154439 (in the 5′ upstream region), rs4646285 (in exon 1), rs4646287 (in intron 1), and rs2296651 (in exon 4), were selected for this study because a high minor allele frequency (MAF) among Asians and an association with HBV persistence have been reported. Genomic DNA was prepared from human peripheral blood cells using the Puregene Blood Kit (QIAGEN, GmbH, Hilden, Germany) and quantified on a NanoDrop 1000 spectrophotometer (NanoDrop, Wilmington, DE, USA).

Four selected SNPs were genotyped by PCR–RFLP. WBC DNA was amplified and digested with restriction enzymes BsmAI (for rs7154439 C > T, 218 bp), EagI (for rs4646285 G > A, 116 bp), Cac8I (for rs4646287 G > A, 118 bp), and HphI (for rs2296651 C > T, 120 + 84 bp). Agarose gel electrophoresis was used to separate the wild type, heterozygous, and homozygous genetic variants (Supplementary Figure S1). The RFLP-identified variants were all confirmed by direct sequencing. Primers and restriction enzymes (New England Biolabs, Ipswich, MA, USA) are shown in Supplementary Table S1.

Quantitative RT-PCR

Total RNAs were prepared from non-tumor liver tissues of non-HCC patients. Complementary DNA was synthesized using a High-Capacity cDNA Reverse Transcription kit (Applied Biosystems, Waltham, MA, USA). Real-time PCR was performed using the SYBR Green PCR Master Mix and ABI Prism 7500 Fast System (Applied Biosystems). The primers used were human glyceraldehyde 3-phosphate dehydrogenase forward 5′-GAAGGTGAAGGTCGGAGT-3′ and reverse 5′-CATGGGTGGAATCATATTGGAA-3′; human NTCP forward 5′-TGACCACCTGCTCCACCTTC-3′ and reverse 5′-GAATGAGAACCAGGACCAGTGAT-3′; human pan-interferon α (IFNα) forward 5′-CACACAGGCTTCCAGGCATTC-3′ and reverse 5′-TCTTCAGCACAAAGGACTCATCTG-3′; human interferon β (IFNβ) forward 5′-TGCTCTCCTGTTGTGCTTCTCCAC-3′ and reverse 5′-ATAGATGGTCAATGCGGCGTCC-3′; and IFITM3 forward 5′-ATGAATCACACTGTCCAAACCTTCT-3′ and reverse 5′-CTATCCATGAGGCCTGGAAGATCAG-3′. The relative standard curve method was used for the calculation of fold changes in gene expression [25].

Statistical analyses

Pearson’s chi-square (χ2) test was used to analyze differences in the genotype frequencies and the p value was adjusted by Bonferroni correction. Pairwise linkage disequilibrium (LD) was estimated between four SNPs of SLC10A1 based on D prime (D′) and R square (r2) values using Haploview software version 4.2. Odds ratios (ORs) and 95% confidence intervals (CIs) were calculated on the basis of conditional logistic regression analysis. The Haldane–Anscombe correction was applied to groups with zero individuals to calculate ORs. The Student’s t test was used to analyze ALT and AST activities and NTCP transcription levels. Kaplan–Meier survival curves were used to analyze HBeAg seroconversion ages. A two-tailed p < 0.05 was used as the criterion for statistical significance. One-way analysis of variance (ANOVA) and the Pearson correlation test were used to analyze the quantitative RT-PCR results.

Results

Baseline characteristics

The control cohort included 138 children born to HBsAg and HBeAg double-positive mothers with high viral loads. The median maternal viral load was 8.01 log10 IU/mL (interquartile range: 7.14–8.38, N = 125). The vaccine failure cohort included 170 chronic HBV-infected patients, born to HBsAg and HBeAg double-positive mothers (viral loads were mostly not determined or unavailable). The carrier children were followed up longitudinally for a median of 13.02 years (interquartile range: 7.22–17.52) (Table 1). The mothers had no records of receiving antiviral treatments or of liver diseases, except chronic hepatitis, during the pregnancy.

NTCP genetic variants and HBV vaccine failure (Table 2)

Table 2 Comparison of the NTCP genetic variants in non-carrier control and HBV vaccine failure carrier children of HBsAg and HBeAg double-positive mothers

The four selected SNPs of the NTCP gene were characterized by PCR–RFLP, and the identified variants were confirmed by direct sequencing. There was a significantly higher frequency of the rs4646287 GA/AA genotype in the non-carrier control children compared to chronic HBV carrier children (28.3% [39/138] vs. 15.3% [26/170], p = 0.006). The MAF of rs4646287-A was 14.1% (39/276) in controls compared to 7.9% (27/340) in carriers (p = 0.014). Genetic variant rs7154439 C > T coexisted with rs4646285 G > A, and had a trend of higher frequency in controls (15.2% [42/276] vs. 10.3% [35/340], p = 0.086). Pairwise LD analysis showed that four SNPs were in weak LD (r2 < 0.1), except the pair of rs7154439 C > T and rs4646285 G > A (Fig. 1). Significantly strong LD of rs7154439 C > T and rs4646285 G > A was found in carriers (D′ = 0.968, r2 = 0.937) and controls (D′ = 1, r2 = 0.972).

Fig. 1
figure 1

Linkage disequilibrium map for four SNPs of the NTCP gene in 170 chronic HBV carriers and 138 control individuals. The LD plot shows r2 values between each pair of SNPs; white squares indicate r2 < 0.1 and dark squares indicate r2 > 0.9

The frequency of the triple mutant variant carrying rs7154439 C > T, rs4646285 G > A, and rs4646287 G > A genotype was significantly higher in the control group (6.5% [9/138] vs. 0% [0/170], p < 0.001). Conditional logistic regression analysis showed that higher frequencies and lower risk of vaccine failure (OR: 0.06, 95% CI: 0.003 – 1.09, p = 0.009) of rs7154439 C > T and rs4646285 G > A were only found in controls bearing rs4646287 G > A (Table 3). The differences of rs7154439 C > T and rs4646285 G > A between controls and carriers disappeared when they coexisted with wild-type rs4646287-GG, indicating minor effect of rs7154439 C > T and rs4646285 G > A on HBV vaccine failure. The results further support a unique association of rs4646287 G > A with HBV vaccine failure. The prevalence of rs2296651 C > T was similar between the two groups, which suggests a minor association between this Asian-specific SNP and HBV vaccine failure.

Table 3 Association between the selected polymorphisms in NTCP gene and HBV vaccine failure in non-carrier control and chronic HBV carrier children separated by rs4646287 genotype

NTCP genetic variants and clinical parameters in chronic HBV carriers

The impact of the four SNPs of the NTCP gene on clinical progression was characterized. Peak levels of ALT and AST in carrier children who had achieved HBeAg seroconversion (N = 71) were analyzed (Fig. 2). Compared to wild-type rs4646287-GG, there was a significant correlation between rs4646287 G > A and higher peak ALT (588.6 ± 557.1 [N = 11] vs. 247.1 ± 202.8 [N = 60], p < 0.001) and AST (378.4 ± 339.4 [N = 11] vs. 160.9 ± 145.8 [N = 60], p < 0.001) levels. Among the subgroups divided according to sex and age, higher ALT levels stayed significantly associated with rs4646287 G > A (Supplementary Table S2). The other three SNPs (rs7154439, rs4646285, and rs2296651) showed no significant association with peak ALT and AST levels. Our results suggest that chronic HBV-infected children with the rs4646287 G > A genotype of the NTCP gene may have stronger immune responses to HBV infections.

Fig. 2
figure 2

Associations between NTCP genetic variants and clinical parameters in carrier children of chronic HBV. Peak alanine aminotransferase (ALT) (a) and aspartate aminotransferase (AST) (b) levels in carrier children with HBeAg seroconversion (N = 71) were compared between different NTCP variants. c The spontaneous HBeAg seroconversion ages of carrier children without antiviral treatments were compared between NTCP genetic variants and the wild-type genotype. p values were determined using Student’s t-test for peak ALT and AST levels, and Kaplan Meier analyses (log-rank test) for HBeAg seroconversion ages

Kaplan–Meier survival analyses were performed for the spontaneous HBeAg seroconversion ages in chronic HBV carrier children who did not receive antiviral treatment. Carrier children with wild-type NTCP gene (no genetic variance at the 4 SNPs, N = 82) were compared to those with variants rs7154439 C > T/rs4646285 G > A (N = 31), rs2296651 C > T (N = 15), and rs4646287 G > A (N = 22). The Kaplan Meier survival curves for HBeAg seroconversion separated HBV carriers with wild-type NTCP gene from those with NTCP genetic variants (p = 0.029). Compared to the wild-type NTCP (14.6 ± 8.7 years), rs4646287 G > A was significantly associated with an earlier HBeAg seroconversion age (10.8 ± 8.4 years, p = 0.003) (Fig. 2c). The impact of rs7154439 C > T/rs4646285 G > A (13.07 ± 9.2 years, p = 0.423) and rs2296651 C > T (11.2 ± 7.8 years, p = 0.151) on spontaneous HBeAg seroconversion was not significant.

Correlation between the NTCP rs4646287 genotype and innate immunity

To further characterize the protective role of rs4646287 G > A, we investigated hepatic expression of NTCP and genes related to innate immunity. The four NTCP SNPs were genotyped by PCR–RFLP in WBC DNAs from 100 HBV-infected non-HCC patients and 100 NBNC non-HCC patients. The MAFs of rs4646287-A allele were 15% and 24% in HBV-infected and NBNC patients, respectively. A total of 82 non-tumor liver RNA samples from those genotyped non-HCC patients were further analyzed and classified into four groups: HBV infection with the rs4646287 G > A (N = 9) or -GG (N = 25) genotype; NBNC with the rs4646287 G > A (N = 19) or -GG (N = 29) genotype.

Quantitative RT-PCR was used to detect the transcription levels of NTCP, type I interferons, and IFITM3. ANOVA indicated differences in NTCP mRNA levels between the four groups (Supplementary Figure S2). NBNC liver tissues with rs4646287 G > A had higher NTCP levels compared to the -GG genotype (p = 0.065) and compared to HBV-infected liver tissues with either the rs4646287 G > A (p = 0.039) or -GG genotype (p = 0.017). A significant difference in IFNα mRNA levels was found between the groups (p < 0.0001). IFNα levels were significantly reduced in HBV-infected liver tissues. No significant differences in IFNβ or IFITM3 levels were found.

Pearson correlation analyses (Fig. 3) showed an inverse correlation between NTCP and IFITM3 levels only in HBV-infected livers (rs4646287 G > A, r = − 0.672, p = 0.047; rs4646287-GG, r = − 0.571, p = 0.003). A trend of positive correlation between NTCP and IFNα was detected only in HBV-infected rs4646287 G > A liver tissues (r = 0.638, p = 0.064). An inverse correlation between type I interferons and IFITM3 was detected in HBV-infected liver tissues with rs4646287-GG (IFNα: r = − 0.530, p = 0.006; IFNβ: r = − 0.664, p < 0.001) and NBNC liver tissues with the rs4646287 G > A (IFNα: r = − 0.562, p = 0.012; IFNβ: r = − 0.588, p = 0.008) or -GG genotype (IFNα: r = − 0.639, p < 0.001; IFNβ: r = − 0.615, p < 0.001), but not in HBV-infected liver tissues with the rs4646287 G > A genotype. The unique correlation patterns of NTCP, type I interferons, and IFITM3 expressions were only found in HBV-infected liver cells carrying the rs4646287 G > A genotype.

Fig. 3
figure 3

Correlations between the rs4646287 genotype and NTCP, IFITM3, IFNα, and IFNβ expression. Pearson correlation tests were performed to determine the relationships between IFITM3 and NTCP (a), IFNα and NTCP (b), IFITM3 and IFNα (c), and IFITM3 and IFNβ (d) transcripts in non-tumor liver tissues from non-HCC patients classified into four groups: HBV infections with the NTCP genotype rs4646287 G > A (N = 9) or wild-type -GG (N = 25); and non-HBV/non-HCV infection (NBNC) with the rs4646287 G > A (N = 19) or wild-type -GG (N = 29). Pearson correlation coefficients r and p values are shown in each scatter plot

Discussion

An association of NTCP genetic variants with HBV susceptibility and disease progression has been reported in different populations [18, 19, 26]. In this study, we demonstrated that rs4646287 (c.356 + 702) G > A variant located in intron 1 of the NTCP gene is associated with a reduced risk for HBV vaccine failure in children of HBeAg-positive mothers. Among children of HBsAg and HBeAg double-positive mothers, the frequency of the rs4646287 GA/AA genotype in the non-carrier children (28.3%) was significantly higher than in vaccine failure chronic HBV carrier children (15.3%). HBeAg is a marker of high infectivity and is associated with higher vaccine failure rates [5]. Our results suggest that the NTCP genetic variant rs4646287 G > A might have a protective effect against HBV vaccine failure under high maternal viral loads.

The SNP rs4646287 (c.356 + 702) G > A variant in intron 1 is protective for the host in response to HBV infections [21]. G > A may introduce a new DNA-binding site for the transcription co-activator p300. Protein p300 is an adenoviral E1A-binding protein and functions as a transcriptional co-factor and histone acetyltransferase. Lower NTCP mRNA levels have been reported in HBV-infected HCC tissues with rs4646287 GA/AA compared to the wild-type genotype, but no difference has been detected in paired non-tumor tissues [21]. Our study also found no significant differences in NTCP levels between HBV-infected non-tumor liver tissues with the wild-type genotype and those with rs4646287 G > A. The NBNC non-tumor liver tissues with rs4646287 G > A expressed the highest NTCP levels. Whether the genetic variant rs4646287 G > A affected NTCP transcription in liver tissues requires further investigation.

HBeAg seroconversion is indicative of reduced HBV replication and hepatitis activity in patients with chronic HBV [27]. Vaccine failure and maternal HBeAg are associated with a prolonged period of HBeAg seroconversion in such patients [28, 29]. In this study, SNP rs4646287 G > A of the NTCP gene was significantly associated with higher peak ALT levels in HBV carrier children who achieved HBeAg seroconversion. Besides, HBV carrier children with rs4646287 G > A achieved spontaneous HBeAg seroconversion at younger age. This suggests a stronger immune response in livers carrying the rs4646287 G > A genotype, which may have a protective effect on clinical progression in chronic HBV carrier children. Protective effects of the other three SNPs, namely, rs7154439 (in the 5′ upstream region) [17], rs4646285 (in exon 1) [20], and rs2296651 (in exon 4) [22, 23], against chronic HBV infections have been reported. In our study, these three SNPs showed little impact on HBV vaccine failure, peak ALT levels, and spontaneous HBeAg seroconversion.

The association between NTCP and innate immunity was examined by quantifying the expression of NTCP, type I interferons, and IFITM3 in non-tumor liver tissues of non-HCC patients. IFNα expression was significantly reduced in HBV-infected liver tissues compared to NBNC livers. HBV infections have a reported association with a lack of induction [30] and even suppression of innate immune responses [31, 32]. Our results indicate that chronic HBV infections may reduce the intrahepatic expression of IFNα but not IFNβ.

Pearson correlation analyses showed that NBNC liver tissues with the wild-type genotype or rs4646287 G > A displayed similar gene correlation patterns. However, different correlation patterns were detected in HBV-infected liver tissues. A trend of positive correlation between NTCP and IFNα was present in HBV-infected livers with rs4646287 G > A (r = 0.638, p = 0.064) but not those with the wild-type genotype. In addition, there was an inverse correlation between type I interferons and IFITM3 in HBV-infected livers with the wild-type genotype but not those with rs4646287 G > A. Whether these negative correlations between type I interferons and IFITM3 contribute to the protective effects of rs4646287 G > A requires further investigation.

In conclusion, the genetic variant rs4646287 (c. 356 + 702) G > A in intron 1 of the NTCP gene is associated with a lower risk of vaccine failure in children born to HBeAg-positive mothers and with younger spontaneous HBeAg seroconversion ages in chronic HBV carrier children (Fig. 4). HBV-infected liver cells bearing the NTCP rs4646287 G > A genotype had unique innate immune responses that might be protective against HBV infections.

Fig. 4
figure 4

Protective effects of the NTCP genetic variant rs4646287 G > A in children born to HBeAg-positive mothers. There was a higher frequency of rs4646287 G > A in non-carrier children compared to chronic HBV carriers. Lower spontaneous HBeAg seroconversion ages were associated with the rs4646287 G > A genotype in HBV carriers