Introduction

Hydroxymethylcytosine has become known as the “sixth base” following methylcytosine as a modified nucleotide found in the mammalian genome [1]. The role of this new epigenetic mark may be twofold. First, hydroxymethylcytosine acts as an intermediate in the removal of methyl groups from cytosine bases, returning the cytosine back to its unmodified form [1, 2]. Additionally, hydroxymethylcytosine may have a role in transcriptional regulation because proteins that bind to the DNA through methylcytosine, the so-called methyl-binding proteins, have less affinity for hydroxymethylcytosine. Because methyl-binding proteins can alter transcription of a gene by binding to the DNA in a gene’s promoter region, hydroxymethylcytosine may also change transcription through this alteration in binding affinity [3, 4]. Hydroxymethylation of DNA in a promoter, CpG island or gene body is associated with an increase in transcription of that gene [5, 6].

The percentage of cytosines that are modified to hydroxymethylcytosine is variable from tissue to tissue, with the most being in the central nervous system [2, 7]. Interestingly, there is a marked decrease in hydroxymethylation in various types of cancer [8]. In the liver, global decreases in hydroxymethylcytosine have been recorded in cases of hepatocellular carcinoma [911]. While this research is promising, the genomic locations of which these changes in hydroxymethylcytosine are occurring are currently unknown. Like DNA methylation, gene-specific changes in hydroxymethylation may occur in hepatocellular carcinoma [12].

Chronic alcohol consumption and aging are both risk factors for the development of hepatocellular carcinoma, which is associated with the development of hepatic steatosis and cirrhosis [13, 14]. Additionally, changes in DNA methylation and other epigenetic marks have been identified in alcoholic liver disease, which has been contested as an initial step in the development of hepatocellular carcinoma [15]. It is unknown, however, whether hydroxymethylcytosine may play a role in this stage of liver disease. Both aging and alcohol are known to alter DNA methylation and may also affect DNA hydroxymethylation patterns [16, 17]. We have previously found global fluxes in DNA hydroxymethylation in rodents exposed to a chronic alcohol diet [18]. More specifically, we found that global hepatic DNA hydroxymethylation is significantly decreased in young mice fed an alcohol-containing diet when compared to their young control counterparts, and that aging alone also decreases global DNA hydroxymethylation. This previous study, however, was unable to determine the locations of these changes within the genome, a limitation because alteration in genome-wide hydroxymethylation patterns may alter transcription of specific genes, having the downstream consequence of predisposing the cell to carcinogenesis or other metabolic changes.

Using a microarray and bioinformatic analysis, we assessed genome-wide and gene-specific changes in hepatic hydroxymethylcytosine associated with aging and chronic alcohol consumption in hepatic DNA of a mouse model.

Materials and methods

Animal study and diets

This study was reviewed and approved by the Animal Care and Use Committee of the USDA Human Nutrition Research Center on Aging at Tufts University. The manufacture of this array recommends a minimum of three subjects per study group. To our experience, three animals per group are too small of a sample size to compensate for epigenetic variability; therefore, we chose to use four animals per study group. Eight young (4 months old) and eight old (18 months old) male C57Bl/6 mice (Charles River Laboratories, Wilmington MA) were fed a l-amino acid defined Lieber–DeCarli control liquid diet for 3 weeks [19, 20] (Dyets, Bethleham, PA, 210011). Mice were then randomized to continue either the control diet or an isocaloric diet containing ethanol for 5 weeks [21]. Ethanol was included in the diet at a concentration of 3.1 % (v/v), comprising 18 % of total energy, and was introduced over an adaption period of 2 weeks. Maltodextrin was added to the control diet to equal the number of calories delivered from ethanol in the alcohol diet.

Individually housed mice were group pair-fed to decrease variability of energy and nutrient intake within and between dietary groups. The amount of food supplied to each diet group was matched to the mean daily food consumption of the group with the lowest food consumption. Therefore, all mice in each age group were offered the same volume of liquid daily and adjusted when necessary to ensure all mice consumed equal volumes. After 5 weeks of feeding, mice were killed and liver samples were harvested as previously described [22].

Preparation of samples

To determine which locations in the genome contain differentially hydroxymethylated regions in aging or alcohol consumption, a microarray method was conducted. Genomic DNA was extracted via the standard phenol/chloroform/isoamyl alcohol [25:24:1 (v/v/v)] (Invitrogen Life Technologies, Grand Island, NY) method with precipitation by 100 % ethanol and 3 mol/L sodium acetate, pH 5.2. DNA was re-dissolved in Tris–EDTA buffer, the quality was assessed on an Agarose gel, and both 260/280 and 260/230 ratios were measured on a Nanodrop 1000 (Thermo Scientific, Wilmington DE). All samples used in this study had a 260/280 and a 260/230 ratio equal to or greater than 1.8. DNA was then randomly fragmented by sonication into 200- to 1000-bp fragments, which was verified on an Agarose gel. DNA was divided into two aliquots per sample, and one aliquot underwent immunoprecipitation with an antibody (Diagenode, Denville NJ, AF-110-0016) against hydroxymethylated cytosine (hmeDIP), resulting in samples enriched with hydroxymethylcytosine [3]. The immunoprecipitation was verified by qRT-PCR using internal positive and negative controls provided. Non-hmeDIP (input) and hmeDIP DNAs were then amplified using a whole genome amplification kit as described by the manufacturer (Sigma-Aldrich, St. Louis, MO, WGA2).

Hydroxymethyl-enriched samples were fluorescently labeled (Cy5) and competitively hybridized with fluorescently labeled (Cy3) (Roche, Madison WI, 6370250001 & 5583683001) input samples to a high-throughput 3 × 720 k NimbleGen array containing CpG islands and all annotated gene promoters in the mouse genome (Roche, Madison WI, 5924537001). Signals were detected with the NimbleGen microarray scanner MS 200 (Roche, Madison WI).

Microarray quality control and finding differentially hydroxymethylated regions

A modified version of the comprehensive high-throughput array for relative methylation (CHARM) method in R Studio was used to analyze our hmeDIP DNA methylation arrays for all four age and diet contrasts [23]. The original CHARM code was written for samples that underwent methylation-dependent enzyme restriction, yielding samples that are depleted of methylcytosine instead of enriched in methylcytosines. We modified the CHARM R code in R Studio to invert the M ratio to hmeDIP/input and to eliminate the percent methylation calculation to apply CHARM to our hmeDIP-generated DNA methylation arrays. The CHARM package is a freely available open software program from Bioconductor (http://www.bioconductor.org/packages/release/bioc/html/charm.html).

Quality control of DNA hydroxymethylation array

Each slide used in this study contained three microarrays, which increases the possibility that samples will resemble each other if they were on the same slide resulting in slide bias. Principal component analysis (PCA) was used to evaluate the variability contributing to the dataset by each sample following CHARM normalization. Because the three samples on the same slides did not cluster together, we can be confident that there was not an issue of slide bias.

Annotation of DhMRs and bioinformatic analysis

Upon generation of DhMRs for each contrast of interest, regions were annotated to the mouse genome using the freely available genomic regions enrichment of annotations tool (GREAT) [24]. This analysis annotates chromosomal regions back to the UCSC mm9 mouse genome and associates them with the nearest gene [25]. For this study, regions that were 5.0 kb upstream or 1.0 kb downstream of a gene’s transcription start site were mapped to that gene, regardless of other nearby genes. Additionally, curated regulatory domains that have demonstrated experimental evidence of directly regulating a gene that otherwise would fall outside of the above-stated region are included in the mapping process.

Bioinformatic analysis of the gene lists generated in GREAT was conducted using Ingenuity Pathway Analysis, IPA (Ingenuity® Systems, http://www.ingenuity.com). IPA maps each gene to a biological relationship and using published literature classifies these genes into categories of biological function, molecular networks, and canonical pathways. For the network analysis, which finds enriched biological networks in a gene set, a Fisher’s exact test is conducted, which calculates a p value to determine the significance of the enriched network found in the gene list, resulting in a Network score:

$$\rm{Network\,score} = -log \, (Fisher's\,exact\,test\,result)$$

For this study, networks were generated using the published literature regarding hepatic mouse data only, and biological networks that contained fewer than five genes were filtered out.

From these networks, the expression of specific genes of interest was determined. Genes of interest were those with a low permutation p value and that were identified to be biologically interesting within the physiological context of aging and alcohol consumption. Furthermore, genes that are transcriptionally regulated by DNA methylation were considered.

Quantification of gene expression

An RNA extraction was conducted using TRIzol® reagent (Invitrogen, Grand Island, NY), and cDNA was synthesized through a standard reverse transcriptase kit as described by the manufacturer (Invitrogen Life Technologies, Grand Island, NY). Quantitative RT-PCR was conducted on the Roche LightCycler® 480 real-time PCR machine (Roche, Madison WI) using TaqMan Fast Advanced Master Mix (Invitrogen, Grand Island, NY) and TaqMan primer assays (Invitrogen Life Technologies, Grand Island, NY). Gene expression was calculated as ΔCt, following normalization to the housekeeping gene GAPDHCt = Ct GeneXCt GAPDH). All expression results are displayed as fold difference (FD = 2−ΔΔCt) relative to the control group for each comparison.

Differences in gene expression were determined using a t test, which was performed in SAS version 9.3 (SAS Institute Inc., Cary NC). The level of significance was p < 0.05 for gene expression analyses.

Results

Genome-wide DNA hydroxymethylation

Four contrasts or group comparisons were performed in this study: Young versus Young + Alcohol, Old versus Old + Alcohol, Young versus Old, and Young + Alcohol versus Old + Alcohol. All probes from the microarray that had a difference in log 2 ratios >0.5 and a t test p values <0.01 were used to generate a heat map. From the clustering of these heat maps, we conclude that samples within the same experimental group had similar hydroxymethylation patterns at specific sites throughout the genome (Fig. 1). Only heat maps from the Young versus Young + Alcohol, and the Young versus Old contrasts are shown. Further analysis identified these groups as having the most differences and the focus of subsequent data interpretations. The heat maps also confer that hydroxymethylation of DNA is occurring at specific sites of DNA, rather than randomly.

Fig. 1
figure 1

Heat maps of differentially hydroxymethylated probes in the contrast of the Young and the Young + Alcohol groups (left), and the Young and Old groups (right). Red demonstrates relative hypohydroxymethylation, and yellow demonstrates relative hyperhydroxymethylation

Following per-probe quality control techniques, probes were lumped into differentially hydroxymethylated regions, DhMRs. Genomically neighboring probes are lumped into a DhMR if they both had a p value of <0.01 when calculating the difference in hydroxymethylation of that probe between two groups. The number of DhMRs for each group comparison gives insight into the degree of changes in hydroxymethylation that occurred in the DNA. Overall, comparing hydroxymethylation in the Young group to the Young + Alcohol group gave the highest number of differentially hydroxymethylated regions. This is particularly interesting when compared to the Old versus Old + Alcohol contrast, which had far less DhMRs (431 vs 189), signifying that hydroxymethylation of specific regions in the DNA of young mice is more easily changed by chronic alcohol consumption than in old mice, or that old mice have more variable hydroxymethylation patterns. There are also a large number of regions that had altered hydroxymethylation when comparing the DNA from young mice to old (331 DhMRs). This suggests that aging alone, as compared to aging with alcohol consumption (196 DhMRs), seems to alter hydroxymethylation patterns to a larger degree. Most DhMRs found in this study were near a transcription start site of a gene (Fig. 2), leading us to speculate that hydroxymethylcytosine may affect gene transcription.

Fig. 2
figure 2

Generated from Stanford University’s GREAT analysis. The number above bars represents the number of regions located at that site relative to a gene’s transcription start site (TSS)

The contrasts of Young to Young + Alcohol, and Young to Old yielded the most DhMRs with greater differences in hydroxymethylcytosine compared to the DhMRs found in the Old versus Old + Alcohol and the Young + Alcohol versus Old + Alcohol contrasts. For these reasons, the Young versus Young + Alcohol and the Young versus Old contrasts are the focus of subsequent analysis presented here.

Gene-specific changes in DNA hydroxymethylation

After the determination of DhMRs, each region was mapped back to the mouse genome. Some regions were between multiple genes; therefore, we included all of these genes in our downstream network analysis. Gene-specific changes in hydroxymethylation for each contrast were analyzed using a network analysis in IPA (Ingenuity® Systems; http://www.ingenuity.com). From these networks, we can determine biologically relevant gene systems that may be affected by either alcohol consumption or aging.

Gene-specific changes in hydroxymethylcytosine associated with alcohol consumption in young mice

Conducting a network analysis in IPA in the Young and the Young + Alcohol groups resulted in two networks with the biological functions of: (1) lipid metabolism, molecular transport, small molecule biochemistry; and (2) cell death and survival, liver necrosis/cell death, cellular development (Table 1). Within these networks, alcohol seemed to increase DNA hydroxymethylation of these genes, with a few exceptions. This may be due to the increased oxidative environment caused by exposure to alcohol, or it may be a programmed biological response.

Table 1 Gene networks found in the list of DhMRs generated from the comparison of Young and Young + Alcohol groups

The gene nuclear receptor subfamily 3, group C, member 1 (glucocorticoid receptor) (Nr3c1; permutation p = 0.055) is a central node in network 1 (Fig. 3). The glucocorticoid receptor is involved in cellular differentiation and acts as a transcription factor that can bind to glucocorticoid response elements to activate transcription [26]. Both of these genes are well conserved among mammalian species and have been previously demonstrated as being epigenetically regulated [27, 28].

Fig. 3
figure 3

Network of genes involved in lipid metabolism that contained a DhMR in the Young versus Young + Alcohol contrast. Red demonstrates genes that were more hydroxymethylated in the Young group, and green demonstrates genes that were more hydroxymethylated in the Young + Alcohol group. White demonstrates a gene within the biological network that did not contain a DhMR. Solid line represents a direct relationship between genes; dashed line represents an indirect relationship between genes

The region of the glucocorticoid receptor gene, in which our DhMR falls, is at a CpG island roughly 2.5 kb upstream of the gene’s transcription start site and has increased hydroxymethylation in the Young + Alcohol group relative to the Young. This gene also had significantly increased mRNA expression in the Young + Alcohol group relative to the Young (p = 0.015; Fig. 4). The increased hydroxymethylation of the CpG islands in the Young + Alcohol group may be altering the binding of proteins in that region, thereby increasing transcription.

Fig. 4
figure 4

mRNA expression of the glucocorticoid receptor (Nr3c1) in Young (n = 4) and Young + Alcohol (n = 4) groups. Error bars represent SEM; * represents a significant difference from Young (p = 0.015)

Gene-specific changes in hydroxymethylcytosine associated with aging

Overall, three biological networks encompassing two biological functions were enriched in our dataset (two of these networks of genes are involved in lipid metabolism, but are separated for easier viewing) (Table 2). This suggests that aging may have an effect on DNA hydroxymethylation of genes involved in lipid metabolism in hepatic DNA even without the consumption of alcohol.

Table 2 Gene networks found in the list of DhMRs generated from the comparison of Young and Old groups

Leptin is an important peptide hormone involved in feeding patterns and energy expenditure. Decreases in either leptin, or the leptin receptor, are associated with obesity [2931]. A region of DNA ~1 kb upstream of the leptin receptor gene (Lepr) was found to have decreased hydroxymethylation in old mice relative to young (permutation p = 0.044; Fig. 5). Old mice also had a significant decrease in the expression of the leptin receptor relative to the young mice (p = 0.029; Fig. 6).

Fig. 5
figure 5

Network of genes involved in protein synthesis and hepatic system development that contained a DhMR in the Young versus Old contrast. Red demonstrates genes that were more hydroxymethylated in the Young group, and green demonstrates genes that were more hydroxymethylated in the Old group. White demonstrates a gene within the biological network that did not contain a DhMR. Solid line represents a direct relationship between genes, and dashed line represents an indirect relationship between genes

Fig. 6
figure 6

mRNA expression of the leptin receptor (Lepr) in Young (n = 4) and Old (n = 4) groups. Error bars represent SEM; * represents a significant difference from Young (p = 0.029)

Discussion

Through the analysis of genome-wide and gene-specific hydroxymethylcytosine patterns, we can better understand the degree and directionality of changes occurring in the hepatic hydroxymethylome that are associated with aging and chronic alcohol consumption. Chronic alcohol consumption and aging are both associated with increased risk of hepatocellular carcinoma. The genomic and gene-specific alterations in hydroxymethylcytosine patterns presented here may shed light on the early stages of carcinogenesis as well as metabolic shifts that occur with the aging the process and the dysregulation caused by alcohol.

By comparing the number of regions that are differentially hydroxymethylated between experimental groups, we conclude that alcohol has a larger effect on hydroxymethylation in young mice relative to old. The larger effect on young mice is similar to our previous DNA methylation study [21] and suggests that the response to alcohol in young mice is stronger than in old mice. This may create an environment in young mice that is either protective against or prone to the alcoholic liver injury, but the clinical significance needs to be clarified in future studies. Aging alone is associated with large genome-wide alterations in hydroxymethylcytosine. Furthermore, consumption of alcohol may mask the effects of aging on the hydroxymethylome. These changes largely occur within 5 kb up or downstream of the transcription start site of genes, which suggests that those hydroxymethylcytosine changes (DhMRs) are genomically near the transcription starting sites, which are heavily regulated regions yielding functionally relevant changes in gene expression.

To our knowledge, this is the first report of gene-specific changes in hydroxymethylation of hepatic DNA that are associated with aging and chronic alcohol consumption. Our finding that chronic alcohol consumption in young mice increases hydroxymethylation at a CpG island in the promoter of the glucocorticoid receptor gene, Nr3c1, and is associated with increased levels of glucocorticoid receptor mRNA is novel. Glucocorticoid receptor signaling is metabolically important in the liver and may play a role in hepatic lipogenesis and steatosis, both of which are increased during chronic alcohol consumption [32, 33]. Glucocorticoids are involved in array of physiological pathways, from glucose homeostasis to apoptosis to the stress response [26, 34]. The activity of the glucocorticoid receptor is tissue specific. For example, increased lymphocyte glucocorticoid receptor expression has been associated with anxiety and post-traumatic stress disorder [35], while decreased glucocorticoid receptor expression in the brain is associated with helplessness and depression [36, 37]. In other tissues, the methylation of the glucocorticoid receptor gene, Nr3c1, seems to be sensitive to external environmental exposures such as stress and the diet [38, 39] and can be passed onto future generations [40, 41]. Glucocorticoid receptor-mediated apoptosis is a mechanism by which some cancer treatments suppress growth of cells in blood cancers [42]. Importantly, the glucocorticoid signaling pathway has previously been implicated in the development of hepatic steatosis and hepatic lipogenesis [33, 43], both of which are associated with alcohol intake [32]. Therefore, the changes in hydroxymethylcytosine and mRNA expression we report here may be an important step in the liver’s response to chronic alcohol consumption. Moreover, because we only see changes in Nr3c1 hydroxymethylation and expression when alcohol is consumed by the young mice, it is possible that the older mice have an impaired cellular response to chronic alcohol consumption.

We report that aging is associated with a decrease in hydroxymethylcytosine at a region located ~1 kb upstream of the transcription start site of the leptin receptor gene, Lepr, and is also associated with lowered levels of Lepr mRNA. Previous studies have reported a decline in leptin hormone associated with aging, although this is interrupted by increased fat mass [44, 45]. An age-associated change in Lepr transcription has not been previously reported. In addition to its association with obesity, rodents with a deficiency in leptin or the leptin receptor have a tendency to develop hepatic steatosis [46, 47]. Conversely, leptin is a mediator of hepatic fibrosis, and patients with steatohepatitis have elevated serum leptin levels [48, 49]. These findings indicate that the leptin signaling pathway is important in hepatic lipid homeostasis and fibrosis and may be important in carcinogenesis. Through a series of studies, we found that alcohol and aging, cancer risk factors, can alter DNA hydroxymethylation. Our earlier studies found that alcohol and aging can alter global DNA hydroxymethylation in mice [18]. We have also found global hydroxymethylation changes in alcohol-associated hepatocellular carcinoma in humans [11]. This study aimed to reveal how altered genome-wide and region-specific patterns of DNA hydroxymethylation may be an initial step in the development of alcohol-associated liver cancer. Taken together, the results presented here provide novel insight into the epigenetic ramifications of aging and chronic alcohol consumption. These findings demonstrate that hydroxymethylcytosine is site specific and may regulate transcription of genes. These early investigations identify associations that could be further followed up with functional assays and longer-term hepatic cancer studies. Future research may validate these findings and determine whether changes in protein–DNA binding are occurring at the genomic locations of these DhMRs.