Introduction

The increasing demand for animal proteins over the years have compelled for an increased efficiency of global food production (Thundathil et al. 2016). Improving reproductive efficiency, especially bull fertility, is particularly critical, as one bull can breed thousands of females by artificial insemination (AI) (Thundathil et al. 2016). Recent adoption of bovine genotyping arrays has facilitated early-life, genotype-based selection of candidate bulls. Consequently, semen from these young bulls can be marketed as soon as they reach puberty. However, there is considerable variation among bulls regarding age at puberty (Brito et al. 2012); furthermore, semen quality and fertility improve markedly for ~ 3 months after puberty (Brito et al. 2007b). Therefore, early onset of puberty and production of freezable semen have substantial financial implications (Dance et al. 2016). Although genetic selection affects those parameters, the environment (e.g. nutrition) has a profound impact on performance of animals and their progeny (Dance et al. 2016, 2015). In addition, recent studies have documented profound benefits of strategic supplemental nutrition on reproductive performance of bulls (Brito et al. 2007a; Barth et al. 2008; Dance et al. 2015, 2016; Johnson et al. 2019). We reported that feeding beef and dairy bulls supplemental energy and protein during calfhood (2–30 weeks) increased serum luteinizing hormone concentrations before 20 weeks of age, hastened puberty (~ 1 month) and produced mature bulls with larger testes and greater sperm production (~ 20–25% for each) compared to bulls fed less than recommended amounts of energy and protein with no apparent effect on sperm function (Dance et al. 2016, 2015). Although nutrition modulates reproductive potential of bulls, its impact on post-pubertal sperm epigenome remains largely unknown (Wennmann et al. 2014).

The epigenome represents a group of molecular processes targeting the chromatin that influences gene expression without altering the DNA sequence (Goldberg et al. 2007). Among these molecular processes, modifications to DNA (DNA methylation; Holliday and Pugh 1975) or its associated proteins (histone modifications; Allfrey et al. 1964) act in combination to drive the transcriptomic profiles related to cell identity and function. Furthermore, alterations of these epigenetic marks could be influenced by environmental conditions (e.g. nutrition; Burton and Lillycrop 2019). In human, nutritional influences on epigenome were also evocated to explain inter-generational effects, opening a completely new field of research backing up to Barker’s hypothesis on developmental origin of health and diseases (DOHaD) (Barker 2004, 1992; Barker and Martyn 1992; Kaati et al. 2002). More and more studies also demonstrated that paternal environment could influence the non-genetic information transmission carried by sperm cells and modify the trajectory of offspring (Donkin and Barres 2018; Champroux et al. 2018). Indeed, the differentiation of male germ cells into mature spermatozoa is based on a long and complex process including a large panel of epigenetic modifications taking place during the foetal life, the prepubertal period and after puberty during each spermatogenesis cycle (Champroux et al. 2018). All these epigenetic remodelling steps are plastic to environmental influences. Specifically, it has been shown that pre-pubertal nutrition has effects on gonad development through the proliferation of Sertoli cells, spermiogenesis supporting cells (Gaysinskaya et al. 2018; Perrier et al. 2018). In bull, we observed differential regulation of several genes in testicular tissue of post-pubertal bulls exposed to either low-, normal- or high-nutrition diets during their pre-pubertal period (Johnson et al. 2019). One hypothesis would be that dietary modulations done on bull calves during their pre-pubertal period influence the epigenetic maturation of germ cells through alterations of Sertoli cell transcriptome and function.

Although the DNA methylation profile appears to be acquired at the stage of prespermatogonia at birth (Oakes et al. 2007), a transient DNA methylation reduction was observed at onset of meiosis suggesting potential plasticity at prepubertal period (Gaysinskaya et al. 2018). The main objective of our study is to generate preliminary data on effects of pre-pubertal dietary modulations on bull sperm DNA methylation using the reduced-representation bisulfite sequencing (RRBS) technique, which offers a cost-effective genome-wide view of DNA methylation changes at gene- and CpG-rich regions and at a base resolution level. The data generated will allow us to conduct focused studies on the effects of dietary modulations during specific stages of a bull’s reproductive life on the sperm epigenome and predict consequences for his fertility and for progeny development.

Materials and methods

Animals and treatments

Dietary treatments have been described in detail in Dance et al. (2015). Briefly, 1-week-old Holstein bull calves (n = 9), randomly allocated to 3 groups (n = 3/group), were fed either a high, medium or low diet from 2 to 32 weeks of age. From 2 to 8 weeks of age, calves were fed milk (8, 6 and 4L/day in high-, medium- and low-diet groups respectively), followed by a transition into a barley silage-based diet. All diets contained 1.6% vitamin-mineral premix (as fed). The high nutrition diet consisted of 49.7% rolled barley, 9.7% rolled corn, 7.6% canola meal and 7.6% soybean meal (20.0% crude protein (CP) and 67.9% total digestible nutrients (TDN)). The medium nutrition diet contained 4.8% rolled barley, 4.8% rolled corn, 3.8% canola meal and 3.8% soybean meal (overall, 17.0% CP and 66.0% TDN). The low-nutrition diet was barley silage (plus premix, but no concentrate) and had 12.2% CP and 62.9% TDN (note that for this and all other diets, CP and TDN are reported on a dry matter basis). The high-nutrition group was fed ad libitum and based on their intake, the same amount of feed (on an as-fed basis) was offered to the low- and medium-nutrition groups. Calves were on their respective diets until 32 weeks of age, after which they were all fed the medium-nutrition diet. Semen was collected from bulls by electroejaculation at frequent intervals from 51 to 73 weeks, but for the present study we utilised semen samples collected from two specific time points 55–59 (Y) and 69–71 (O) weeks of age. The semen samples were then chilled, diluted and cryopreserved as described previously (Dance et al. 2016).

Genomic DNA extraction

Frozen-thawed sperm cells from two 0.5-ml straws were used for extraction from all 18 samples (3 diets, 2 time points, n = 3 per condition) as described previously (Perrier et al. 2018). In brief, semen samples were washed with water followed by phosphate-buffered saline and incubated overnight with lysis buffer (10 mM Tris–HCl pH 7.5, 25 mM EDTA, 1% SDS, 75 mM NaCl, 50 mM dithiothreitol and 0.5 μg glycogen) containing 0.2 mg/ml proteinase K. Following RNAse digestion (25 μg/ml RNAse A for 1 h, 37 °C), genomic DNA was extracted twice using phenol:chloroform (1:1) and chloroform, then ethanol precipitated and washed. The dried pellet was re-suspended in TE buffer (10 mM Tris–HCl, pH 7.5, 2 mM EDTA). Both Nanodrop (2000/2000c; ThermoFisher Scientific) and Qubit (ThermoFisher Scientific) were used to analyse the purity and concentration of extracted DNA. The integrity of genomic DNA was confirmed by electrophoresis on agarose gel. Mitochondrial DNA elimination was not ensured as sperm tails were included in the DNA extraction procedure.

Preparation of RRBS libraries

Library preparation for RRBS was performed as described elsewhere [64_65] from 200 ng of genomic DNA digested with MspI (Fermentas), except that magnetic bead–based size selection was performed using SPRIselect magnetic beads (Beckman-Coulter). Briefly, after ligation to 55-bp methylated Illumina adapters for paired-end sequencing, H2O was added up to 50 µl, which was followed by the addition of 35-µl magnetic beads. The larger fragments bound to the beads were removed using a magnetic rack according to the manufacturer’s instructions, and 85 µl of supernatant containing the smaller fragments were transferred into a new tube. The addition of 25 µl of fresh beads next allowed to select fragments ranging from 150 to 400 bp. After washing with 85% ethanol, the DNA bound to the beads was eluted in 20 µl EB buffer (Qiagen). The DNA was then bisulfite converted twice using the EpiTect bisulfite kit (Qiagen) followed by PCR amplification of converted DNA for 14 cycles using Pfu Turbo Cx hotstart DNA polymerase (Agilent). All pipetting steps before the final amplification were carried out using a NGS STARlet liquid handling system with four channels (Hamilton). The final libraries were purified using AMPure XP beads (Beckman-Coulter). Concentration of libraries was estimated using Qubit and concentrations suitable for sequencing were confirmed in all the 18 samples. To evaluate the integrity of the final libraries, electrophoresis was performed on a pre-cast 4–20% polyacrylamide gel (Invitrogen) to reveal a smear of DNA from 150 to 400 bp (Supplementary File 1). Sequencing was performed on an Illumina HiSeq4000 sequencer to produce 75-bp paired-end reads (Integragen SA, France).

RRBS data analysis

RRBS sequences were analysed as described previously (Perrier et al. 2018). An integrated pipeline combining scripts developed in house in Python, R and Shell, together with external tools (https://github.com/FAANG/faang-methylation/tree/master/RRBS-toolkit/) was used for this purpose. In brief, RRBS libraries were subjected to FastQC quality control analysis. Additional quality check and trimming was performed with TrimGalore v0.4.0 to remove adapter sequences, unsatisfactory bases and reads, also reads shorter than 20 nucleotides (Krueger 2014). Good-quality reads were aligned using Bismark v0.14.3 with Bowtie 1 [67–68] to the bovine reference genome (UMD3.1) in which the sequence of Y chromosome has been incorporated (GenBank: CM001061.2).

Principal component analysis (PCA) and hierarchical clustering were computed using the FactoMineR R package on the matrix of CpGs covered by at least 10 reads per sample. For hierarchical clustering, the distance between samples was calculated using Pearson correlation coefficients.

MethylKit (Akalin et al. 2012) software was used to detect differentially methylated CpGs (DMCs). A minimum threshold of 10 reads was applied for CpG coverage. All CpGs fulfilling this coverage threshold in at least 4 samples within a treatment (2 groups related to age, n = 9/group; 3 groups related to diet, n = 6/group) were included in the analysis. CpGs with a q-value lower than 0.01 and methylation difference greater than 25% were considered differentially methylated. A region should have had minimum 3 DMCs with a maximum inter-DMC distance of 100 bp to be considered a differentially methylated region (DMR). Identified DMRs were further extended by the incorporation of CpGs with a q-value comprised between 0.05 and 0.01 (Perrier et al. 2018).

Annotation and enrichment analysis

All analysed CpGs, DMCs and DMRs were annotated relative to gene features, CpG density and repetitive elements using an in house pipeline (https://github.com/FAANG/faang-methylation/tree/master/RRBS-toolkit/Annotation) as described previously (Perrier et al. 2018). The reference files were downloaded at the following sites: ftp://ftp.ensembl.org/pub/release-94/gtf/bos_taurus/Bos_taurus.UMD3.1.94.gtf.gz, http://oct2018.archive.ensembl.org/biomart/martview/2c4d22063af3b241ef9322512fae0cdc (Ensembl Genes 94, Cow genes (UMD3.1)), http://hgdownload.cse.ucsc.edu/goldenPath/bosTau6/database/cpgIslandExt.txt.gz and http://hgdownload.cse.ucsc.edu/goldenPath/bosTau6/database/rmsk.txt.gz. The following criteria were applied for annotation: promoter-TSS, − 2000 to + 100 bp relative to the transcription start site (TSS); TTS, − 100 to + 100 bp relative to the transcription termination site (TTS); shore, up to 2000 bp from a CpG island (CGI); and shelve up to 2000 bp from a shore. A site/fragment was considered to belong to a CGI (respective shore and shelve) if an overlap of at least 75% was observed between the site/fragment and the CGI (respective shore and shelve). A site/fragment was considered being overlapped by a repetitive element whatever the extent of this overlapping.

CpGs from all analysis covered a total of 17,257 Ensembl gene IDs and this was used as background for enrichment analysis. The Ensembl gene IDs covered by DMCs and DMRs from low vs high, low vs medium and medium vs high comparisons were uploaded to Database for Annotation, Visualization and Integrated Discovery (DAVID, v6.8) to check enrichment for Gene Ontology (GO) terms (Huang et al. 2009) and KEGG pathway analysis (Kanehisa and Goto 2000). The functional annotation and clustering option were used for most analysis. All significant GO terms and pathways with an adjusted P value ≤ 0.01 (Benjamini-Hochberg) and gene count ≥ 2 have been reported.

Bisulfite pyrosequencing

The DMRs associated with IGF2, INS, WWC2 and WNT1 were subjected to bisulfite pyrosequencing (Tost and Gut 2007). Bisulfite conversion was performed either using the Epitect bisulfite kit (Qiagen) or as described previously (Dupont et al. 2004) using 1 µg of genomic DNA. In brief, genomic DNA was denatured followed by treatment with sodium bisulfite (5 M) and hydroquinone (130 mM) for 4 h at 55 °C followed by purification using a pressure activated aspirator system and the Wizard DNA cleanup system (Promega). After purification, DNA was eluted with water, following incubation with 0.3 N NaOH and addition of ammonium acetate (6 M), glycogen and 100% ethanol. After overnight incubation, the DNA pellet was ethanol precipitated and resuspension in water.

PCR cycle conditions used for amplification of bisulfite converted DNA were as follows: 15 min at 95 °C followed by 45 cycles of 30 s at 94 °C (denaturation), 30 s at 56 °C (annealing), 30 s at 72 °C (extension) and 10 min at 72 °C. The primers for all genes were designed using Pyromark assay design software (Qiagen) and sequences are provided in Table 1. The reverse primers were 5′ biotinylated for all genes evaluated in our study. Amplifications were performed using the Pyromark PCR kit (Qiagen) according to the manufacturer’s instructions with varying concentration of MgCl2 (Table 1). Electrophoresis on 1% agarose gels confirmed the appropriate size and homogenous amounts of the PCR products among samples (data not shown).

Table 1 Details of primers used for PCR amplification. Reverse primers are biotinylated

Pyrosequencing was used to validate quantitative DNA methylation data (Tost and Gut 2007). PCR products (15 μL) were added to a mixture containing 2 μL Streptavidin Sepharose HP™ (Amersham Biosciences), 38 μL binding buffer (Qiagen), 25 μL MilliQ H2O and mixed at 1400 rpm for 10 min at room temperature. PyroMark Q24 Vacuum Workstation (Qiagen) was used to prepare the single-stranded PCR products according to the manufacturer’s instructions. The sepharose beads with the single stranded templates attached were released into a Pyromark Q24 plate (Qiagen) containing 0.3 µM of the sequencing primer (Table 2) diluted in annealing buffer (Qiagen). Pyrosequencing reactions were performed in duplicate with the PyroMark Q24 instrument using PyroMark Gold Q24 reagents, according to the manufacturer’s instructions (Qiagen). Methylation analysis of CpG sites was performed using PyroMark Q24 Software (Qiagen), and the average methylation values obtained from duplicated assays were considered for statistical analysis.

Table 2 Details of primers used for pyrosequencing including their oligonucleotide sequence and CpG sites included

All statistical analysis was performed using R software (v 3.5.3; Team 2008). Diet groups were compared at each CpG using non-parametric tests suited to small samples (permutation tests for two independent samples, with Monte-Carlo sampling of 100,000 permutations and with correction for the stratification of the population according to the age). Significance was declared at P < 0.05.

Results and discussion

Results

RRBS library characterisation

An average of 28 million read pairs were obtained from all the 18 libraries included in our study (Supplementary Dataset 1). Average mapping rate was 88.9%, with a unique mapping rate of 36.4%. Taking account of the high representation of repeat sequences in the bovine genome, this unique mapping rate was expected and in accordance with the rate observed in a recent RRBS study in cattle (Perrier et al. 2018). CpGs were covered by 21.4 reads on average, with 55.1% of CpGs covered by at least 10 reads (CpG10) displaying an average methylation of 49.6%. Bisulfite conversion rate was on average 98.7% with a standard deviation of 0.2 suggestive of low technical variability. Further details on the individual libraries are provided in Supplementary Dataset 1.

Descriptive analysis

The matrix of methylation percentages for every CpG10 and every sample was used to perform correlation clustering and PCA. With correlation clustering, the samples from the same bull (young and older) were grouped together irrespective of dietary treatments (Fig. 1a). No specific clustering was observed in the PCA plot (Fig. 1b). The results suggest that inter-individual variation unrelated to either age or dietary treatment represents the major variation in our methylation data.

Fig. 1
figure 1

a Correlation clustering and b principal component analysis (PCA) run on the totality of CpG10 covered in all 18 samples. Low and young (n = 3, blue), low and older (n = 3, pink), medium and young (n = 3, green), medium and older (n = 3, black), high and young (n = 3, orange), high and older (n = 3, red)

Determination of age or diet effects by differential analysis

Two independent factors were considered for analysis of differential methylation: diet and age. The young vs older comparison for age (n = 9/group) revealed differential methylation in 39 CpG sites with no DMRs, supporting the findings from our descriptive analysis of lack of intra-individual age differences. Accordingly, paired samples collected from the same bull were grouped to run the dietary comparisons. Since in methylKit, reads obtained under the same condition are aggregated regardless of the intra-condition inter-individual variability; this approach allows increasing the coverage at each tested CpG. Pairwise comparisons were therefore conducted on sperm samples from bulls fed low vs high diets, low vs medium and medium vs high diets (n = 6 samples from 3 bulls per diet group). A total of 7931 CpGs were differentially methylated in the low- vs high-diet comparison (5678 hypermethylated and 2253 hypomethylated in the low diet as compared to the high diet), 6144 in low vs medium (4073 hypermethylated and 2071 hypomethylated in the low diet as compared to the medium diet) and 5244 in medium- vs high-diet (2762 hypermethylated and 2482 hypomethylated in the low diet as compared to the high diet) comparisons (Fig. 2a). In addition, 395 DMRs were identified in the low vs high (367 vs 28, hyper vs hypo relative to low diet), 270 in low vs medium (238 vs 32, hyper vs hypo relative to low diet) and 118 in medium- vs high-diet (70 vs 48, hyper vs hypo relative to low diet) groups (Fig. 2b). All together, these results suggest that pre-pubertal low diet led to hypermethylation of some regions of the genome compared to enriched diets. A list of DMCs and regions significantly different across treatments (low vs high, low vs medium and medium vs high) is provided in Supplementary Datasets 2, 3 and 4 (DMCs) and Supplementary Datasets 5, 6 and 7 (DMRs) respectively. The overlap between the three DMC lists was limited (Fig. 2c), since 217 DMCs were common to all the three dietary datasets without significant enrichment for any function or pathway.

Fig. 2
figure 2

Number of hypermethylated and hypomethylated a DMCs and b DMRs in low vs high, low vs medium and medium vs high datasets. Y axis—number of DMCs or DMRs; X axis—dataset. The black bars represent the number of DMCs hypermethylated in low diet (low vs high and low vs medium comparisons), or in medium diet (medium vs high comparison). c Venn diagram showing the DMCs identified in low (L) vs high (H), low (L) vs medium (M) and medium (M) vs high (H) comparisons

Genomic distribution of diet-DMCs

The genomic distribution of the DMCs relative to gene features, CpG islands and repetitive elements for the three comparisons between diets was compared to the genomic distribution of the 1,254,808 CpG10 analysed by RRBS (Background) (Fig. 3a–l). Interestingly, the genomic distribution of the DMCs was similar for the three comparisons and differed from that of the background. Concerning the gene features, the DMCs were more associated with intergenic regions and gene introns, in detriment of the promoter-TSS regions. The diet-DMCs were also more associated to CpG poor regions. Finally, diet-DMCs targeted more LINE and SINE regions and less simple repeat that was observed for the background. Together these data suggested that an important part of diet effect may concern more a general chromatin organisation than gene-specific regulatory regions.

Fig. 3
figure 3

Gene, CpG and repeat annotations for DMCs across a–c low vs high, d–f low vs medium, g–i medium vs high and j–l background

Gene specific enrichment

A focus was done on DMCs included in genes. It was interesting to observe that the 522 DMC-containing genes were shared by all pairwise comparisons, demonstrating that their methylation status was significantly different across all three diets. Among the 522 genes, MAPK (mitogen-activated protein kinase) signalling pathway was enriched (adjusted P value < 0.1, Benjamini–Hochberg correction, gene count-18, Fig. 4). This could point towards the vulnerability of the pathway to mild environmental influences, which in our case was pre-pubertal nutrition.

Fig. 4
figure 4

Venn diagram showing the DMC-containing genes identified using RRBS in low vs high, low vs medium and medium vs high comparisons. A total of 522 genes were common across all datasets and were significantly enriched for MAPK signalling

The results of enrichment analysis have been summarised in Table 3 (Adj P ≤ 0.01, Benjamini–Hochberg correction). Briefly, in the low- vs high-diet dataset, a single biological process (BP), “regulation of Rho protein signal transduction” involving genes SPATA13 (spermatogenesis-associated protein), ARHGEF2, ARHGEF7, ARHGEF10 and ARHGEF19 (Rho guanine nucleotide exchange factors), was enriched (total 22 genes). The cellular component (CC) plasma membrane and proteinaceous extracellular matrix were also enriched. Rho protein signal transduction (BP, 20 genes), GTPase activity (molecular function (MF)) and Rho guanyl-nucleotide exchange factor activity (MF) were the GO terms enriched in the low vs medium dataset. KEGG pathways were only significantly enriched in the medium- vs high-diet comparison, and this included “MAPK signalling” (40 genes), “focal adhesion” (33 genes) and “proteoglycans in cancer” (33 genes). Major genes involved in the MAPK pathway included AKT1, AKT2, EGFR and MAPK1.

Table 3 (a) KEGG pathways and (b) GO terms enriched in the list of DMC-containing genes identified using DAVID (Huang et al. 2009) (Adj P ≤ 0.01, Count ≥ 2). BP biological process, CC cellular component

Validation by bisulfite-pyrosequencing

Pyrosequencing was used to validate quantitative DNA methylation data (Tost and Gut 2007). Four DMRs located within (i) WWC2, (ii) INSR, (iii) IGF2 and (iv) WNT1 genes were selected for this. IGV browser views of these regions are provided in Fig. 5. According to our RRBS data, the DMR of WWC2 located within the exon was hypermethylated in the low-diet group relative to both the medium- and high-diet groups. The DMRs of WNT1 (located in an exon) and INSR (intron) were both hypermethylated in the low (vs high) diet and the DMR of IGF2 (promoter) being hypomethylated in low (vs high) diet. IGF2 is a major regulator of development and the disruption of its paternal allele reduced foetal growth in mice (Burns and Hassan 2001). Some of the growth-promoting effects of IGF2 is mediated by the INSR (Louvi et al. 1997). WNT signalling (WNT1) is critical for mammalian spermatogenesis and is also involved in sexual differentiation (Ronfani and Bianchi 2004). WWC2 is a regulator of hippo signalling involved in the male reproductive development and spermatogenesis in sheep (Zhang et al. 2019). Results obtained using pyrosequencing successfully validated the results of RRBS (Fig. 5).

Fig. 5
figure 5

Validation of RRBS data by bisulfite-pyrosequencing. IGV browser views of gene regions and methylation percentages of the CpGs assayed by pyrosequencing (a, b WNT1; c, d IGF2; e, f INSR; g, i WWC2) targeted for pyrosequencing. For a, c, e, g and i, the red, green and blue bar charts represent the methylation percentages at each CpG10 position for low-, medium- and high-diet samples, respectively. The DMCs are enclosed in the black box. For b, d, f and h, low-, medium- and high-diet groups are represented in red, green and blue, respectively (n = 3 bulls/group), while the two different time points are indicated in plain and dashed lines. X-axis: CpG sites; Y-axis: Methylation %. *Significant difference between groups (permutation test, P < 0.05)

Discussion

Through our RRBS study, we analysed the effects of differential pre-pubertal diet and advancing age on methylation of bovine sperm DNA. Even though age did not impact the sperm methylome, probably due to the short interval between semen collections (2–3 months), our results suggest that pre-pubertal nutrition can cause dynamic changes in the sperm epigenome.

Sexual development in the bull is a complicated process in which the testes grow comparatively slow during the first 6 months followed by a rapid phase during puberty (9–12 months, Rawlings et al. 2008). The proliferation and differentiation of germ cells occur during the rapid phase and are characterised by the increased production of testosterone. Even though a significant portion of the germ cell pattern is established prior to type A spermatogonia stage, de novo methylation and demethylation changes have been reported during spermatogenesis, specially during meiosis (Oakes et al. 2007). It is possible that in young bulls these mechanisms are not fully established and could be characterised by differences in DNA methylation. Thus, age was included as an independent variable in our study to determine if methylation changes induced by pre-pubertal diet could be modified over time in bulls. A study investigating effects of age on sperm DNA methylation in bulls detected no DMRs when evaluating sperm at 12 (late-pubertal) and 16 months (post-pubertal) of age supporting our findings (Lambert et al. 2018). Since testes growth is maximum during puberty, most DMRs were detected across 10 (early pubertal) and 16 months (post-pubertal) of age in the above study. Other studies have also reported age-related DMRs in bovine sperm (Takeda et al. 2017, 2019). Even though isolated CpG sites were differentially methylated in our study, no DMRs were detected, demonstrating that sperm DNA methylation is stable over an interval of 2–3 months within the post-pubertal period. Another study in bovine evaluated the effects of feeding a high vs medium nutrition in bulls until 24 weeks of age followed by medium nutrition until puberty at two different ages (15 vs 16 months, Perrier et al. 2020). No differential methylation was detected in sperm for age, but for diet. Based on their findings, they speculated that the changes in DNA methylation were associated with advanced sexual maturity of high-diet vs medium-diet bulls as no differences were detected when comparing medium group at 16 months vs high at 15 months as methylation changes would be counterbalanced with maturity (Perrier et al. 2020). Our lab previously reported advanced puberty, larger testes and increased sperm production in bulls fed a high vs low pre-pubertal nutrition (Dance et al. 2016; 2015). A similar speculation that the advanced puberty of high-diet bulls facilitated by their enhanced diet induced differential methylation in sperm is plausible but inconclusive.

It is also imperative to note that majority of the differences in methylation were irrespective of diet or age, but between individuals likely rely on genetic polymorphisms as described in humans (Lappalainen and Greally 2017). Considering the random selection of bulls in our study and the vital role of genetics in addition to environment (nutrition) in epigenetic regulation (Triantaphyllopoulos et al. 2016), this is conceivable and indirectly ascertains the supremacy of genetics in epigenetic regulation.

With regard to diet, we identified several DMCs and regions (DMR). Interestingly, for all the comparisons, the major part of DMCs were more methylated in the low diet, indicating that nutritional limitation is associated with an increase of DNA methylation at specific sites. Previous studies investigating human infertility have associated global sperm DNA hypermethylation to lowered sperm quality (Rahiminia et al. 2018; Kumar et al. 2013). Even though there is a lack of consensus in this regard (Olszewska et al. 2017), it is worthy to note that in a previous study done in our lab, the semen from bulls fed low pre-pubertal diet also exhibited low progressive motility compared to the high-diet bulls (Johnson et al. 2020). Thus, a negative association between sperm motility and global sperm hypermethylation is plausible based on our data.

The genomic distribution of DMCs was quite similar for the three comparisons: intragenic regions and gene promoters, CpG islands and simple and low complexity repeat regions were depleted in DMCs. These data suggest nutritional effects on global architecture of spermatozoa nuclei which would have consequences on the epigenetic reprogramming of paternal nucleus after fertilisation as described for other epigenetic marks (Ost et al. 2014).

Most prominent changes in the sperm epigenome associated with pre-pubertal diet could be related to spermatogenesis, sperm function and early embryo development. We found an enrichment of MAPK signalling in the 522 genes differentially methylated in all three dietary datasets. This is important considering the involvement of MAPK pathway in numerous male reproductive processes including spermatogenesis, sperm function (maturation and activation, capacitation, acrosome reaction) and Sertoli cell function (Li et al. 2009). MAPKs have been implicated in the progression of meiosis where incubation of pachytene spermatocytes with ERK1 (MAPK downstream) inhibitor prevented chromatin condensation and progression to metaphase stages in mouse (Sette et al. 1999). In primates and rats, MAPK signalling induced germ cell apoptosis in response to mild hyperthermia and lowered testosterone levels (Johnson et al. 2008). ERK1/2 was detected in the tail of human spermatozoa and regulated sperm capacitation and hyperactivation in sperm. It is possible that Rho GTPases, and putative ERK substrates, are involved in the abovementioned regulation of sperm motility by ERK1/2 (Almog et al. 2008). MAPK signalling activated in response to cytokines lowered the steady-state protein levels of occludin, ZO-1 and cadherins, disrupting blood testes barrier integrity and spermatogenesis in adult rat testes (Xia et al. 2006). Thus, dietary modulation in cattle could alter MAPK signalling that plays a key role in the regulation of gene expression, cellular growth and survival.

Some effects of MAPK signalling on sperm function could be activated via small GTPases as described earlier. Through analysis in DAVID, we identified the biological process, “rho protein signal transduction”, to be enriched in two datasets (low vs high, low vs medium) in our study. Rho protein signal transduction has been implicated in many cellular processes including transcription, cell cycle, cell proliferation, migration and most importantly actin reorganisation (Bustelo et al. 2007). Coming from the Ras superfamily, Rho GTPases are small (~ 20 kDa) signalling G proteins regulating actin-based cytoskeletal rearrangements. GTPases respond to growth factors, integrins, ion channels or G-protein-coupled receptors activated by environmental influences/external stimuli. Activation of Rho GTPase mediates its transition from the inactive GDP bound form to an active GTP bound form (Lu et al. 2009). A major regulatory protein of Rho GTPase, guanine nucleotide exchange factor, facilitates the exchange of GDP with GTP. Many downstream targets of Rho have been identified including PKN (protein kinase C–like protein) and PIP5-kinase (phosphatidylinositol 4-phosphate 5-kinase) with major regulatory effects on actin reorganisation (Mukai 2003; Tolias et al. 1995). Rho GTPases have been identified in the sperm of many mammalian species including bovine. A multi-species localisation study of Rho GTPases in the sperm revealed its presence in both the head and tail of bovine sperm and confirmed it being conserved through mammalian evolution (Ducummon and Berger 2006). Previously, in 1993, a research group performed ADP ribosylation of rho GTPases using an exoenzyme like toxin and reported lowered motility in bovine sperm (Hinsch et al. 1993). Evaluating bulls fed differential pre-pubertal diets, we previously identified greater sperm progressive motility and mitochondrial function in both the testes and sperm of high- compared to medium- and low-diet bulls (Johnson et al. 2020). It is possible that the underlying differences in DNA methylation (rho protein signal transduction) drove the differences in sperm motility observed in our previous study. Considering the association between Rho signalling and sperm motility, this is a possibility. A recent review investigated the role of Rho GTPases in spermatogenesis and supported its role in maintaining the junctional dynamics by actin organisation within the testes (Lui et al. 2003). In addition, the localisation of Rho proteins in the acrosome region of many mammals supports its role in acrosomal exostosis (Ducummon and Berger 2006). However, the association between the methylation status of genes involved in Rho signalling and its impact on sperm function is unknown.

When comparing DMCs in the medium- vs high-diet dataset, we identified focal adhesions and MAPK pathways to be significantly enriched. Focal adhesions are multi-protein complexes forming mechanical links between the extracellular matrix and the cytoskeleton. Focal adhesion kinase has been localised in mammalian sperm and is important for signalling mechanisms required for capacitation and acrosome reaction (Roa-Espitia et al. 2016). Focal adhesion kinases can also modify the actin cytoskeletal assembly through activation of Rho GTPases for enabling cell motility (Tomar and Schlaepfer 2009).

In addition to the above, we identified several differentially methylated genes in our study with a direct impact on sperm function and embryo development. Among the many hypermethylated DMRs in the low-diet bulls, CYP26B1 (cytochrome P450, family 26, subfamily B, polypeptide 1), a gene associated with spermatogenesis was also present. CYP26B1, an inhibitor of retinoic acid action, maintains the undifferentiated state of germ cells regulating their meiotic entry. In mice, mutations in CYP26B1 gene in both germ and Sertoli cells disrupted spermatogenesis and lowered fertility in male (Hogarth et al. 2015). Hypermethylation could imply lowered gene expression in low-diet bulls (vs high and medium) causing disruption in spermatogenesis. In our previous study where we evaluated testicular gene expression of pre-pubertal bulls, a cytochrome P450 gene (CYP51A1) was downregulated in the low-diet (vs high diet)-fed bulls (Johnson et al. 2019). Another hypermethylated DMR in the low diet was located within the DDX4 gene, a commonly used marker for germ cells. The gene is critical for germ cell development in many mammals including bovine (Bartholomew and Parks 2007) and was reported to have lowered expression in oligozoospermic human patients (Guo et al. 2007). Other DMRs with related functions included SPATA13 (exonic location, spermatogenesis-associated protein 13) and EQTN (Equatorin; acrosome reaction). Both genes were hypermethylated in low compared to high diet, and this could be speculated as lowered gene expression in low-diet bulls. To corroborate this, our RNA-seq study on pre-pubertal testicular tissue detected differential expression of genes supporting enhanced Sertoli cell maturation in the high-diet-fed bulls (Johnson et al. 2019). Since Sertoli cells support spermatogenesis in tremendous ways, it is reasonable to speculate a positive impact of pre-pubertal feeding in bulls.

DNA methylation of sperm have a critical role in the regulation of imprinted genes and is a crucial determinant of embryo/foetal development and post natal growth (Reik et al. 2001). Compared to the somatic genome where methylation patterns are stable and heritable, gametes encounter two developmental periods (germ cell and pre-implantation embryo) where DNA methylation is reprogrammed establishing cells with vast developmental potential (Reik et al 2001). Even though a major wave of demethylation is complete by 8-cell stage in bovine embryos, a small portion (~ 20%) of DNA methylation is retained including the imprinted genes, which could be critical in regulation of embryo growth and development (Jiang et al. 2018, Wu and Sirard 2020). Thus, the effects of environmental modulation on DNA methylation could be immediate in terms of sperm function and fertility or long term (progeny development). According to a recent study in sheep, a minor dietary modulation in prepubertal F0 rams altered SC and weight at puberty in F1 rams. Additionally, differential methylation was detected in genes associated with sexual development and body weight (Gross et al. 2020). In porcine, feeding methylating micronutrients vs a standard diet to F0 males lowered fat and increased shoulder muscle percentage in F2 generation which was postulated to be epigenetic inheritance via differential methylation in the liver of treatment males (Braunschweig et al. 2012).

As discussed previously, embryo development is characterised by the development of new methylation patterns during the different windows of epigenetic programming (Reik et al. 2001) which involves DNMT3a, the enzyme involved in de novo methylation and crucial for early embryonic development, parental imprinting and X chromosome inactivation (Kaneda et al. 2004; Okano et al. 1999). Interestingly within the DNMT3a gene, 2 CpG sites with an intronic location were found hypermethylated in low-diet bulls when compared to high-diet bulls. In human testicular germ cell tumours, intronic hypomethylation of DNMT3a was associated with upregulated expression (Ishii et al. 2007). Ablation of DNMT3a in mice resulted in death shortly after birth (Okano et al. 1999), and lowered expression of DNMT3a was correlated with increased rates of early embryonic death in humans (Gu et al. 2017). Thus, one hypothesis could be that low diet in prepubertal bulls leads to hypermethylation and lower gene expression of DNMT3a, resulting to lower reprogramming efficiency after fertilisation.

Several genes within the insulin/IGF family were also differentially methylated in our study. This included DMRs in INSR, IGF2 and DMCs in IGF-IR, IGF2R, IRS2 and INSIG1 genes. A significant finding was the hypomethylation (low vs high) of a DMR (196 bp) with 7 CpGs located in the promoter region of IGF2, a well-recognised imprinted gene in bovine (Dindot et al. 2005) as well in mice and human (Constancia et al. 2002). IGF2 gene is widely expressed during embryo development and encodes a critical regulator of foetal growth, nutrient delivery and placental size (Constancia et al. 2002; Fowden et al. 2009). A previous study in bulls reported hypomethylation of IGF2 in sperm collected from peripubertal bulls (Lambert et al. 2018) and discussed the importance of age of semen collection as a critical factor. It has been shown that some of the growth-promoting effects of IGF2 during embryogenesis is mediated via the INSR (Louvi et al. 1997) and IGF-IR (Pitetti et al. 2013), another gene differentially methylated in our study. For better coordination of regulation of parental imprinting, IGF2 shares an imprinting control region (ICR) with a non-coding gene H19 and is located upstream of the H19. In this study, diet does not affect the ICR methylation state, suggesting that IGF2 parental imprinting is correctly operating. We found a diet-induced alteration of methylation in IGF2 promoter. A previous study reported a correlation between hypermethylation of IGF2 promoter and repressed expression (Zheng et al. 2018). As previously indicated, an abnormal methylation of IGF2 promoter could be reprogrammed with lower efficiency after fertilisation, affecting IGF2 gene expression and embryo development.

Moreover, aberrant methylation in the imprinted genes was associated with spermatogenic failure in a human case control study comparing idiopathic and normal sperm (Tang et al. 2018). IGF2R located on chromosome 9 is also a verified imprinted (maternal) gene in bovine (Killian et al. 2001). A single DMC located within an intron was hypermethylated in the medium-diet bulls compared to the high diet. No clear hypothesis could be established concerning the impact of this DNA methylation alteration.

In bovine species, it has been proven that nutrition impacts fertility but the transgenerational effects of the same through sperm DNA methylation specially in the progeny have not been clear. When comparing embryos generated from sperm of peripubertal (10–12 months) vs post-pubertal bulls (16 months), differential methylation and expression were detected for genes associated with energy metabolism (Wu and Sirard 2020). In a different study, DMRs in sperm and its correlation with gestation length, days to first breeding after calving, somatic cell score, body type, milk production and other traits were reported (Fang et al. 2019) suggestive of transgenerational effects via epigenetic programming. Even though we detected DMRs correlated with sperm function and embryo development in our study, we lack gene expression data which could have been correlated with sperm DNA methylation. Therefore, a future direction would be the evaluation of gene expression in earlier spermatogenic cell types and/relevant somatic cells of the offspring derived from the sperm. To accomplish this, evaluation of gene expression of embryos derived from sperm of bulls fed differential pre-pubertal diet is important which could then be correlated with our DNA methylation data to expand our understanding about the implications of our dietary modulation on the next generation.

Another limitation of our study was the inability to characterise mitochondrial DNA methylation albeit their inclusion via sperm tails. Differential mitochondrial DNA methylation in response to diet has been reported previously in human trials (Corsi et al. 2020). In our previous studies, we reported enhanced mitochondrial function and motility of sperm from bulls fed high levels of nutrition (Johnson et al. 2020). Thus, it would also be interesting to evaluate its impact on sperm mitochondrial DNA methylation and its correlation with sperm function in future studies.

Conclusion

In brief, our results suggest that pre-pubertal diet can alter DNA methylation in the mature sperm of bulls with potential impacts on genes implicated in spermatogenesis, sperm function and early embryo development, appealing for further investigations about the programming of reproductive development in bulls during early calfhood. The question as to the impacts on embryogenesis or phenotype of the next generation is still vague in bovine and therefore will deserve further attention.