Introduction

Hepatitis C virus (HCV) infection is one of the major causes of liver inflammation, and characterized by high risk of developing persistent virus survival and chronic liver disease [1]. It has been reported that more than 170 million people have been infected with this virus worldwide so far [2]. Recent data from WHO declared that around 75–85 % of newly infected people developed into chronic infection and 60–70 % of chronically infected people developed into chronic liver disease, including cirrhosis or hepatocellular carcinoma (HCC) (http://www.who.int/mediacentre/factsheets/fs164/en/index.html).

Although the mechanisms of chronic HCV infection are not clarified clearly yet, it has been accepted that the outcomes of the infection may be influenced by virus variants, host genetic factors and environment, such as clinical treatment, source of infection, co-infected with other virus and so on [3, 4]. Because of the individual variability and highly different response to antiviral therapy, it was reasonable to postulate a genetic basis for this virus evading host immune surveillance and leading to chronic infection. Several association studies have revealed that the gene polymorphisms of human leukocyte antigen (HLA), interleukin-28(IL-28) and IL-18, are associated with viral persistence and disease progression [58].

Considering the factors of virus, the effects of HCV core protein have been studied, including pro- and anti-apoptotic effects, regulation of cell growth, and immune-regulatory properties [911]. In addition, a novel HCV protein, called alternate reading frame protein (F protein) and translated from the core encoding region by ribosomal frame shifting, has been reported challenging various activities attributed to core protein [1214]. However, different from core protein, it seemed that the expression of F protein was related to the disease progression and individual variability. That was to say, the rate of anti-F antibodies positive was not 100 % and was increased along with the progression of hepatitis C virus infection [13, 14]. And we have gained an intriguing observation that decreased Th1 cytokine [IFN-gamma, (IFN-γ)] and increased Th2 cytokines (IL-2, IL-4) in anti-F protein antibody seropositive patients may not be a beneficial sign for the clearance of the virus [13, 14]. We suspected that F protein may be a novel product for the host to adapt to the immune environment and make a difference in Th1/Th2 cytokine balance.

T-bet, also known as TBX21, is a T cell specific T-box transcription factor and has been demonstrated to play essential roles in Th1/Th2 balance [15]. TBX21 is located in human chromosome 17q21.32 and can regulate helper T cell differentiated to Th1 cells through the promotion of Th1 cytokines and inhibition of Th2 cytokines [16]. Its single nucleotide polymorphisms (SNPs) have been shown to be associated with many diseases recently, such as rheumatoid arthritis (RA), asthma, and type 1 diabetes mellitus (DM-1) [1720]. Recent data have shown that the SNPs of rs17250932 and rs4794067 were linked to the production of IL-4 and IFN-γ, and viral persistence in hepatitis B infection was also associated with the polymorphism of rs 4794067 inTBX21 [21, 22].

The aim of our study was to further determine whether virus susceptibility, disease improvement, and F protein generation are associated with TBX21 gene polymorphisms in Chinese Han people. Three SNPs (rs17250932, rs2074190, rs4794067) in TBX21 gene were systematically selected and genotyped in 190 anti-F protein negative cases, 557 anti-F protein positive subjects and 354 healthy controls. By assessing the frequencies of the SNPs mentioned above, this was the first study to explicate the possible mechanisms of T-bet in the progression of chronic HCV infection.

Materials and methods

Study subjects

A total of 1,101 subjects were enrolled in our study, including 747 chronic HCV infection cases and 354 healthy controls. All subjects were recruited from Jurong and Danyang, Jiangsu Province, in southern China from June 2012 to December 2012 and had blood histories from the past 20–30 years. Patients co-infected with any other virus (such as HBV, HIV), with other types of liver diseases (such as autoimmune, alcoholic or metabolic liver diseases), with other high-risk infection behavior (which was a small population such as drug use or hemodialysis) or previous interferon and/or ribavirin therapy were excluded. All subjects were categorized into three groups for analysis. The healthy control group (group A) included 354 subjects who were tested anti-HCV seronegative. The patients were diagnosed with chronic HCV infection which was confirmed anti-HCV seropositive, HCV RNA seropositive, elevated alanine animotransferase (ALT) levels for more than two years. In addition, the HCV F protein was expressed and purified in our term and the anti-F protein antibodies were detected in all patients’ serum using indirect enzyme linked immunosorbent assay (ELISA) according to our previous report [13]. Subsequently, all patients were assigned into two groups: anti-F negative (group B) and anti-F positive (group C) according to the presence of anti-F antibodies.

All subjects provided informed consent to participate in this study and were interviewed by well-trained interviewers. The standardized questionnaire included information about demographic data and environmental exposure history. Subjects’ venous blood was collected for serological, virological, and immunological analyses, and the stage of liver fibrosis was measured by transient elastography (FibroScan®, Echosens, Paris, France) [23]. This study was approved by the ethics committee of Nanjing Medical University and ethics approval was obtained from the Human Investigation Committee of the Huadong Research Institute for Medicine and Biotechnics (Nanjing, China).

Virological tests

An approximately 5 mL venous blood sample was collected from each subject, and blood cells were isolated and stored at −80 °C for further extraction of genomic DNA. Viral RNA in patients’ sera was extracted using the Trizol reagent according to the manufacturer’s suggested protocol (Trizol LS Reagent, Life Technologies, Rockville, MD, USA) and the HCV genotypes were determined by reverse transcription RNA into cDNA and polymerase chain reaction (PCR) with type-specific primers in the 5′ non-coding region (5′NCR) [24]. Anti-HCV antibodies were tested by the third generation HCV enzyme immunoassays (KHB, Shanghai, China). Anti-F protein antibodies were tested as described above.

SNPs selection

We searched the SNPs from the NCBI dbSNP database (http://www.ncbi.nlm.nih.gov/SNP) and public HapMap SNP database (http://www.hapmap.org). SNPs meeting the following conditions were prior to be brought into our study: minor allele frequency (MAF) >5 % in Chinese Han population; reported to be associated with other diseases, especially liver or immune related disorders; no association with HCV infection in previous researches [1722]. Given the genetic variation scanning for the promoter and/or exons of human TBX21 gene and studies about disease susceptibility, three SNPs were represented in Table 1.

Table 1 SNPs that meet the selection inclusion

Genotyping assays

Genomic DNA was extracted from peripheral blood by sodium dodecyl sulfate lysis and protease K digestion, followed by phenol-chloroform extraction and ethanol precipitation. Genotyping of SNPs was performed using PCR-ligase detection reaction (TaqMan assay) on an ABI Prism 377 Sequence Detection System (Applied Biosystems, Foster City, CA, USA) and the technical support was from Genesky Biotechnologies Inc., Shanghai, China. Briefly, a 10-ul PCR reaction was prepared for each sample, containing 1 × GC-I buffer (Takara, Japan), 3.0 mM Mg2+, 0.3 mM dNTPs, 1 U HotStarTaq polymerase (Qiagen, Hilden, Germany), 1 μL of sample DNA and 1 μL of each primer. The PCR program was described as followed: 95 °C for 2 min; 11 cycles of 94 °C for 20 s, 65 °C for 40 s, 72 °C for 1.5 min; 24 cycles of 94 °C for 20 s, 59 °C for 30 s, 72 °C for 1.5 min; and a final extension at 72 °C for 2 min. PCR products were treated with 5U of Shrimp Alkaline Phosphatase and 2U of Exonuclease I (Qiagen) and inactivation for 5 min. For ligase detection reaction, two allele-specific probes and one fluorescently labeled probe were used and this reaction was carried out in 1 μL of 10 × binding buffer, 0.25 μL of thermostable Taq DNA ligase (Takara), 0.4 μL of 1 μM 5′ ligation primer, 0.4 μL of 2 μM 3′ ligation primer, 2 μL of multiplex PCR product, and 6 μL of double-distilled H2O for 38 cycles of 94 °C for 1 min, 58 °C for 4 min and maintained at 4 °C. The products were denatured at 95 °C for 5 min before loaded on a ABI3730XL sequencer. Data were analyzed using GeneMapper Software v.4.0 (Applied Biosystems). The information of probes and primers for the selected SNPs was listed in Table 2. All the genotyping was performed in a blinded fashion without knowledge of the patients’ clinical data. The accordance rate of each SNP was 100 % for the repeated experiments of 10 % random samples.

Table 2 Probes and primers for analysis of TBX21 polymorphisms

Statistical analysis

Statistical analysis was carried out using SPSS 18.0 (SPSS Inc, Chicago, IL, USA). The distributions of general demographic, clinical and virological features were evaluated using the Student’s t test, χ 2-test, Kruskal-Wallis test. Allele frequencies for each SNP were determined by gene counting, and Hardy-Weinberg equilibrium (HWE) was tested using a goodness-of-fit χ 2-test. The association between SNPs and chronic HCV infection risk and anti-F antibody generation was determined using logistic regression analysis by computing the odds ratios (ORs) and 95 % confidence intervals (CIs), and P-value less than 0.05 was considered statistically significant. A linear regression model was used to analyse the relevance between HCV RNA load and stage of liver fibrosis. TBX21 haplotypes were assigned by the PHASE program.

Results

Samples’ characteristics

The selected characteristics of 354 healthy controls, 190 anti-F seronegative patients and 557 anti-F seropositive patients were summarized in Table 3. Significant differences were observed in the distribution of ALT/AST levels between the healthy group and patients, and HCV genotype as well as stages of liver fibrosis also showed differences between the two patients’ groups (all P < 0.001). Also, subjects showed no differences in the distribution of age and sex among three groups in our study (all P > 0.05). HCV RNA loads also showed no differences in the distribution between anti-F seronegative subjects and anti-F seropositive subjects (P = 0.192). In this study, the rate of anti-F antibodies positive was 74.6 % (557/747) and was consistent with our previous research and others reports (data not shown) [13, 14, 25].

Table 3 Distributions of selected variables in healthy controls and HCV infection patients

Association analysis of TBX21 genetic polymorphisms with chronic HCV and anti-f antibody generation

The genotype distributions of rs17250932, rs2074190 and rs4794067 in TBX21 among three groups are presented in Table 4. The observed genotype frequencies among the control subjects were in agreement with the Hardy-Weinberg equilibrium (rs17250932: χ 2 = 2.56, P = 0.11; rs2074190: χ 2 = 0.10, P = 0.75; rs4794067: χ 2 = 0.38, P = 0.54). As shown in Table 4, there was a significantly increased frequency of rs4794067 C allele in chronic HCV carrier subjects (OR = 1.335, 95 % CI = 1.058–1.684, P = 0.015), compared to those in healthy controls. In addition, the frequency of rs4794067 C allele was higher exceptionally in anti-F positive subjects compared to those in anti-F negative subjects (OR = 1.547, 95 % CI = 1.140–2.101, P = 0.005). However, no significant associations between the other two SNPs and HCV infection susceptibility and/or persistence were observed in the overall analysis (all P > 0.05).

Table 4 TBX21 genotype distribution in healthy controls and chronic HCV patients

Stratified analysis of the SNPs in TBX21 and the risk of persistent HCV infection

To determine the subgroup differences in the association of genetic polymorphisms (rs17250932, rs2074190 and rs4794067) with HCV infection or anti-F antibody generation, we performed a stratified analysis by disease-related characteristics (Tables 5 and 6). As shown, the CC genotype frequency of rs4794067 in anti-F seropositive subjects was significantly more common among subgroups of HCV 1b genotype, mild fibrosis than anti-F seronegative subjects (P = 0.021, P = 0.010, respectively), and the rs2074190 G allele showed increased frequency in the HCV non-1b genotype subgroup between the two groups (P = 0.029). Furthermore, patients carrying the rs4794067 C allele might be related to an increased risk of anti-F antibody generation in the mild fibrosis subgroup and HCV 1b genotype subgroup (OR = 1.861, 95 % CI = 1.162–2.981; OR = 1.500, 95 % CI = 1.063–2.116, respectively). And linear regression modeling also suggested the HCV RNA loads were positively correlated with the stage of liver fibrosis (r = 0.821, t = 39.247, P < 0.001) (data not shown). But no evidence showed significant associations between the genotypes of rs17250932 and the risk of disease.

Table 5 Stratified analysis for HCV genotype and stage of liver fibrosis on association between TBX21 rs17250932, rs2074190 and rs4794067 polymorphisms between anti-F negative subjects and anti-F positive subjects
Table 6 Stratified analysis for age and sex on association between TBX21 rs17250932, rs2074190 and rs4794067 polymorphisms among controls and case

Table 6 shows that the carriage of the C allele in the rs4794067 gene locus appeared to be associated with an increasing risk of persistent HCV infection in a younger subgroup (≤56 years, OR = 1.423, 95%CI = 1.028–1.970, P = 0.034) and female subgroup (OR = 1.374, 95 % CI = 1.032–1.831, P = 0.030). In addition, the results also revealed that the rs4794067 C allele could increase the risk of anti-F antibody generation in an older subgroup (>56 years, OR = 1.858, 95%CI = 1.151–2.998, P = 0.011) and a male subgroup (OR = 2.139, 95%CI = 1.084–4.220, P = 0.028).

Haplotype analysis among the three groups

To further evaluate the combined effect of three TBX21 polymorphisms (order of SNPs comprising haplotype: rs17250932T/C, rs2074190A/G and rs4794067T/C) on the risk of HCV persistent infection and F protein generation, we performed the haplotype analysis after measurement of pairwise linkage disequilibrium (LD) which was determined using the CubeX software (http://www.oege.org /software /cubex/) to calculate D’ (standardized linkage disequilibrium coefficient) and r2 (correlation coefficient) [26]. TBX21 rs17250932, rs2074190 and rs4794067 polymorphisms were found weak LD to each other: rs17250932/rs2074190 (D’ = 0.83, r2 = 0.40), rs17250932/ rs4794067 (D’ = 0.90, r2 = 0.53), rs2074190/ rs4794067 (D’ = 0.91, r2 = 0.56). A total of eight haplotypes were derived from the observed genotypes and the haplotype frequencies are shown in Table 7. Compared with the most frequent TAT haplotype, TAC haplotype distribution was significantly different between the chronic HCV carrier group and the healthy group, as well as the anti-F seronegative group and anti-F seropositive group (all P < 0.001). There was a significantly increased frequency of TAC haplotype in the chronic HCV infection subjects, exceptionally in anti-F seropositive subjects (OR = 4.708, 95%CI = 2.264–9.789; OR = 4.728, 95%CI = 2.584–8.651, respectively).

Table 7 Distribution of the estimated haplotype frequencies for TBX21 polymorphisms in controls and case

Discussion

HCV is a single-stranded (ss) RNA virus and the genome (~9.6 kb) includes a long open reading frame (ORF) encoding a polyprotein of approximately 3,000 amino acids which is processed by viral and cellular proteases into structural (core, E1, and E2) and nonstructural (p7, NS2, NS3, NS4A, NS4B, NS5A, and NS5B) proteins [27]. Apart from these proteins, a novel HCV protein, called alternate F protein, has been previously reported [12]. As a ribosomal frameshift protein of HCV core, it expresses in all HCV genotypes with ribosomal frame shift occurring at codons 11–161 in HCV 1a genotype and 42–141 in HCV 1b genotype [12, 28, 29]. However, the expression of F protein showed high sequence diversity and individual variability with a very short half-life [13]. So we detected the anti-F antibody instead of the F protein itself in our study.

Persistent HCV infection is characterized by invalid adaptive immune response which included inefficient CD4+T cell early and weak CD8+cytotoxic T-lymphocyte (CTL) response subsequently [1, 2, 30]. Apart from CTL, the Th1-type cytokine IFN-γ may also exert an inhibition effect on HCV replication in chronically infected patients [30]. We have demonstrated that IFN-γ declined while IL-4 elevated in anti-F protein seropositive HCV patients and F protein stimulated groups [14]. In addition, HCV F could induce IL-6 secretion in HCC patients [13]. Our previous results also demonstrated that the rate of anti-F protein seropositivity was increased along with the progression of the disease [13]. All these data implied that the expression of the F protein during HCV infection plays an important role in the development of persistent infection. And F protein generation may be advantageous to virus survival through making a difference in Th1/Th2 cytokine balance, although the underlying mechanisms remain unclear.

The gene TBX21 encodes for transcription factor T-bet, which appears to act as a master switch for Th1 development responsible for the induction of Th1 cells and the repression of Th2 cells from naive T lymphocytes [15]. In addition, T-bet also shares an effect on Th1/Th2 cytokine dysregulation and it could increase IFN-γ production in Th1 cells, whereas represses the expression of Th2 cytokines such as IL-4, IL-5 and IL-13 [15, 3134]. This could play an important role in disease progression in chronically infected patients.

Although the relationship between the genetic variations in TBX21 and persistent HCV infection has not been clearly demonstrated yet, several SNPs in TBX21 have been known to affect T-bet expression levels and be associated with disease susceptibility [1720]. In our study, we showed that genetic variation in the TBX21 gene promoter, specifically the rs4794067 polymorphism, was significantly associated with sustainability to chronic HCV infection. A minor allele with C instead of T in this position was significantly more common among persistent HCV-infected subjects than healthy controls. This result was in accordance with genetic variation of TBX21 in hepatitis B virus carriers in Chinese Han people, as demonstrated by Chen et al. [22]. In addition, Sullivan et al. showed that lacking T-bet might increase susceptibility to Mycobacterium tuberculosis infection with increased IL-10 and decreased IFN-γ production in mice [35]. More recently, Yao et al. demonstrated that HCV could replicate in T cells and inhibit T cells proliferation through the Jak/T-bet/STAT pathway [36, 37]. A previous report also has demonstrated that TBX21 T-rs4794067C is a common functional polymorphism that suppresses TBX21 promoter activity in activated CD4+ T cells and that such suppression is mediated through binding affinity to the Yin Yang 1 (YY1) transcription factor. The TBX21 promoter carrying rs4794067C allele possesses significantly stronger affinity to the YY1 which affects T-bet expression and Th1 cytokine production [38]. In summary, the rs4794067 polymorphism may participate in T-bet expression and interact with infection factors, such as bacteria or virus. The results of cooperation seem to lead to inefficient T cells response with lower proliferation and imbalance in production of Th1/Th2 cytokines, which might be involved in the development of chronic HCV and help to provide a comfortable microenvironment for pathogen existence and new product generation.

In our present study, a minor allele with C instead of T in rs4794067 was significantly more common in anti-F protein seropositive subjects compared with anti-F protein seronegative subjects. The frequency of TAC haplotype at positions rs17250932, rs2074190 and rs4794067 was also significantly higher. Thus, carriage of the C allele in the rs4794067 gene locus may increase the risk of HCV F protein generation. Other studies also considered that the expression of the F protein would relatively decrease virus pathogenicity and play a protective role in the host. The hypothesis was building on lower HCV viral load in anti-F protein seropositive individuals [13]. However, HCV RNA load showed no differences in the distribution of anti-F seronegative subjects and anti-F seropositive subjects in our present study. The difference was possibly due to the variety of subjects or the small sample size in a previous experiment. Additionally, our previous results showed that F protein may participate in a protective T-cell-mediated immune response to protect liver cell injury from acute inflammation, with increased Th2 cytokine (IL-4) production and decreased Th1 cytokine (IFN-γ) production [13, 14]. This was according to Fyall et al., who reported that rs4794067C polymorphism was associated with decreased IFN-γ production [21]. The allele frequencies of rs17250932T/C and rs2074190A/G polymorphism were low in the population of our study and no significant associations were observed with the susceptibility of HCV infection or F protein generation.

The previous report declared that the differences in disease exposure should also be taken into account in disease-related characteristics [39]. So we assessed the genotype frequency of rs17250932, rs2074190 and rs4794067 in several subgroups. It has been publicly accepted that both pathogen and host characteristics can influence the development and progress of the disease. In our study, the frequency of rs4794067 C allele was significantly different between anti-F protein seropositive subjects and anti-F protein seronegative subjects in subgroups of HCV 1b genotype and patients with mild fibrosis, whereas the rs2074190 G allele showed increased frequency in the HCV non-1b genotype subgroup between the two groups. This pathogen-host interaction may provide a novel way to understand the complex mechanism of the formation of chronic infection with different viral genotypes. We also conducted an analysis stratified by age and sex. It showed that the carriage of C allele in the rs4794067 gene locus appeared to be associated with an increasing risk of persistent HCV infection in a younger subgroup and female subgroup. This was contrary to popular belief with young subjects showing a higher rate of spontaneous clearance in HCV infection [40]. The possible interpretations were that an appropriate immune response in younger infections was to protect liver cell injury from acute inflammation and a reduced immune response by the action of estradiol in females [41]. In addition, the results also revealed that rs4794067 C allele could increase the risk of anti-F protein antibody generation in an older subgroup. In a word, these results suggested a complex relationship between age, viral genotype, and genomic factors.

In summary, we studied the association of the SNPs of TBX21 gene rs17250932, rs2074190 and rs4794067 with susceptibility to HCV infection and the F protein generation in a high-risk Chinese population for the first time. To reduce the inescapable selection bias, we applied many strategies to control and minimize the potential confounding factors, such as matching the age and residential area, taking the stage of liver fibrosis and viral genotype into account, and testing anti-F protein antibody in patients with chronic HCV infection. However, the number of subjects in our study was relatively limited. So this conclusion needs to be verified by similar studies in other populations. With accumulating researches revealing that the genetic variations in TBX21 may be involved in the etiology of susceptibility and/or sustainability to HCV infection, our study conducted a novel pathogenic antigen (HCV F protein) to provide a new perspective to understand the mechanism of HCV evasion strategy for the first time. However, further studies are necessary to demonstrate how these TBX21 gene polymorphisms are implicated in the pathogenesis of chronic HCV infection.