Introduction

The glycopeptide antibiotic bleomycin is used as a cancer chemotherapeutic agent to treat squamous cell carcinoma, Hodgkin’s lymphoma, and testicular cancer [13]. Its mechanism of action is thought to involve DNA damage and DNA cleavage [4]. Both double-strand and single-strand breaks are formed by bleomycin although double-strand breaks are thought to be most important for the anti-tumor activity of bleomycin [5].

The reaction of bleomycin with DNA requires a divalent cation, usually Fe2+. The reaction mechanism for the damage and cleavage of DNA involves the abstraction of a hydrogen atom from the C4′ of the deoxyribose sugar in DNA. The resulting free radical then reacts with O2 and ultimately leads to breakage of the DNA phosphodiester backbone with production of 3′-phosphogycolate and 5′-phosphate ends and a pyrimidine propenal [611].

Previous bleomycin DNA sequence specificity studies with purified plasmid DNA sequences have indicated that the dinucleotides 5′-GT and 5′-GC are preferentially cleaved [1216]. The DNA sequence specificity has also been determined in human cells at centromeric alphoid DNA sequences [17], telomeric DNA sequences [18], globin [1922], and retinoblastoma genes [23]; and the DNA sequence specificity was again found to be mainly at the dinucleotides 5′-GC and GT in human cells. For globin and retinoblastoma DNA sequences, the bleomycin was able to footprint transcription factors and positioned nucleosomes in human cells. Bleomycin is known to cleave in the linker region of nucleosomes [17, 24, 25] and hence bleomycin is a useful agent to probe chromatin structure in human cells.

Next-generation sequencing techniques have been employed to examine several areas of molecular biology [2628]. The Illumina Hi-Seq 2000 can generate more than 200 million short sequences in a single run. DNase I hypersensitive sites have been mapped at high resolution [29, 30] and shown to be associated with promoters, enhancers, insulators and locus control regions [31]. Next-generation sequencing has also been useful in demonstrating that these open chromatin regions are associated with post-translational histone modifications. The location of acetylated and non-acetylated histones, as well as methylated histones and positioned nucleosomes, has also been pinpointed with precision [3239].

In this study we examined the cleavage sites of bleomycin in the entire human genome in human cells. We utilized next-generation sequencing techniques to map the double-strand break sites of bleomycin onto the entire human DNA sequence. This enabled features such as transcription start sites (TSSs) and chromatin structure to be correlated with the sites of bleomycin cleavage. This study represents the first occasion that the DNA damage sites of an anti-tumor agent have been mapped at high resolution for the entire human genome.

Materials and methods

Bleomycin was obtained from Bristol Laboratories (USA). Endonuclease IV was purchased from New England Biolabs. Human HeLa cells were treated with 0.5 mM bleomycin for 30 min at 37 °C in phosphate buffered saline and the DNA was extracted from the cells [17]. In parallel, purified HeLa DNA was treated with 0.33 mM bleomycin [17]. The bleomycin-treated purified DNA and purified bleomycin-treated cellular DNA were treated with the enzyme endonuclease IV to remove 3′-phophoglycolate termini with its 3′-diesterease activity [40]. The 3′-phophoglycolate termini are not removed by the “polishing” 3′-5′-exonuclease reaction of DNA polymerase I and hence the endonuclease IV treatment was necessary in order that the bleomycin cleavage site were compatible with the Illumina preparation procedure of 3′-A addition and subsequent ligation of the linker oligonucleotides. The CplusE and #1CplusE cellular samples were treated with endonuclease IV (these two samples are duplicates from independent experiments); the CminE cellular sample was not treated with endonuclease IV; while the DplusE purified DNA sample was treated with endonuclease IV.

After linker addition, the bleomycin-treated DNA samples were electrophoresed and purified on an Agilent Technologies 2100 expert high sensitivity DNA assay system. DNA fragments between 200 and 1,000 bps were purified and subjected to DNA sequencing in an Illumina flow cell.

The resulting DNA sequence reads were mapped onto the hg19 repeat masked human genome sequence with Bowtie [41]. Using Samtools [42], the mapped reads were sorted and unmapped reads were removed. With HTSeq in the Python environment (http://www-huber.embl.de/users/anders/HTSeq/doc/tss.html#tss), the bleomycin cleavage sites were mapped onto HeLa cell transcription start sites (TSSs).

The 143,600 HeLa TSSs were obtained from the file “J80-J81_TSSGencv7.gff” obtained from “http://genome.ucsc.edu/cgi-bin/hgFileUi?db=hg19&g=wgEncodeRikenCage“ as the file “wgEncodeRikenCageHelas3CellPapTssGencV7.gtf.gz”.

The 143,600 HeLa TSSs were split into the 82,596 TSSs with zero expression and 61,004 TSSs with some expression level. These 61,004 TSSs were further categorized into five quintiles of differing expression levels, 0–20, 20–40, 40–60, 60–80, and 80–100 %, with the 0–20 % being the highest expressed TSSs and the 80–100 % TSSs being the lowest expressed in HeLa cells. The relative range of expression for the 0–20 % quintile was 1000 to 1.9, for the 20–40 % quintile 1.9 to 0.35, 40–60 %, 0.35 to 0.041, 60–80 %, 0.041 to 0.0062, and 80–100 %, 0.0062 to 0.0018.

Using HTSeq, bleomycin cleavage sites for these six categories of HeLa expression were mapped to within 3,000 bp of the TSS using a sliding window (fragment size) of 10 bp. Using a similar methodology, the bleomycin cleavage sites were also mapped onto the sense and antisense strands [34].

The RNA-seq data “J80-DA_20100603_Solexa_HWUSI-EAS566_lane5.txt” was downloaded from “http://hgdownload-test.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeRikenCage/“ as the file “wgEncodeRikenCageHelas3CellPapRawDataRep1.fastq.gz”.

Results

For all four of the samples, a read length of 51 nucleotides was produced. The CminE sample produced 2.53 × 108 sequences of which 96.7 % mapped to the hg19 human genome;

CplusE produced 2.42 × 108 sequences (96.8 % mapped); DplusE −2.17 × 108 sequences (94.7 % mapped); and #1CplusE −5.79 × 107 sequences (96.8 % mapped).

Preliminary results with 143,600 human TSSs indicated that enhanced cleavage was present for bleomycin at TSSs. In order to examine the effect of transcriptional activity on the enhanced cleavage, the 143,600 identified human TSSs were split into non-transcribed genes (82,596) and transcribed genes (61,004), which were further sub-divided into five quintiles.

In Fig. 1, the cellular bleomycin cleavage sites were mapped on the combined top 20 and 20–40 % quintiles of genes. In Fig. 1 the values are presented as a ratio of cellular bleomycin DNA cleavage compared with purified DNA at each nucleotide. Hence, any bias due to the varying DNA cleavage specificity of bleomycin was removed. Enhanced cleavage of up to twofold was observed for the top 40 % quintiles compared with the non-transcribed genes. Actively transcribed genes were preferentially cleaved by bleomycin within approximately 700 bp of the TSS.

Fig. 1
figure 1

Bleomycin damage at transcription start sites for actively transcribed genes and non-transcribed genes in human HeLa cells. The actively transcribed genes (blue) comprise the combined 0–20 and 20–40 % quintiles and are the 24,402 most highly expressed genes in HeLa cells. The non-transcribed genes (red) comprise the 82,596 non-transcribed genes in HeLa cells. The data is from the Cplus and Dplus samples and the y-axis is the ratio of the cellular/purified DNA values at each nucleotide surrounding the TSS (x-axis)

A further point to notice is that the peaks were approximately 200 bp apart, especially downstream of the TSS. This indicated that phased nucleosomes were present in these regions. There is also an indication that the peaks were multiple in nature that could be caused by overlapping genes that were slightly out of phase.

In Supplementary Figure 1 bleomycin cleavage sites at the TSSs were mapped for the five expressed quintiles as well as the non-expressed genes. The top 20 and 20–40 % quintiles had a similar profile, while the 40–60 % quintile was intermediate. The 60–80 and 80–100 % quintiles were close to the non-expressed gene profile. This was quantified in Fig. 2 where the enhancement above the ratio of 1 was summated between −700 and +700 bp around the TSS. It can be seen in Fig. 2 that as the level of transcription declines, the degree of enhancement is also reduced, especially from the 20–40 % quintile to the 80–100 % quintile. However, the level of enhancement is not precisely related to the level of transcription since the top 20 and 20–40 % quintiles had a similar profile but the level of transcription varied by over 1,000-fold within these quintiles (Fig. 2). In Fig. 2, the three different cellular samples all gave a very similar pattern for the five quintiles of gene expression and no expression. These similar patterns from independent experiments gives confidence that the result is reproducible.

Fig. 2
figure 2

The relationship between the level of gene expression and enhancement of bleomycin cleavage at the TSSs. The enhancement of the bleomycin cleavage ratio (cellular/purified DNA) between −700 and 700 bp was quantified for each of the five expression quintiles and for the non-transcribed genes. The transcription level for each of the quintiles is also indicated on the column graph (purple). The enhancement of the bleomycin cleavage ratio is presented for the Cplus (blue), Cmin (red), and #1Cplus samples (green). The y-axis is the normalized bleomycin enhancement ratio and has been normalized to an arbitrary value of 1,000 for the top 20 % quintile

The bleomycin cleavage sites were also mapped onto the sense and antisense strands at the TSSs (Fig. 3a). Peaks with spacing of approximately 200 bp were seen for both sense and antisense strands. There was also a slight offset between the two strands. Figure 3b shows data from a study utilizing micrococcal nuclease as the cleaving agent in CD4+ human nuclei [34]. It can be seen that the bleomycin data is very similar to the micrococcal nuclease data for the regions −1,000 to −200 and 0 to 1,000 bp on the sense strand and −1,000 to −100 and 100 to 1,000 bp on the antisense strand. However, between −200 to 0 bp (sense) and −100 to 100 bp (antisense), there were peaks at −120 bp (sense) and 0 bp (antisense) for the bleomycin data but there were no peaks in the same regions for the micrococcal nuclease data (Fig. 3b). Nucleosomes are not thought to be present in these regions at the TSSs [34].

Fig. 3
figure 3

The bleomycin cleavage ratio for the sense and antisense strands at the TSSs as well as Micrococcal Nuclease-Seq data. a The bleomycin cleavage ratio (cellular/purified DNA) for the combined 0–20 and 20–40 % quintiles for the sense (red) and antisense (green) strands. The data is from the Cplus and Dplus samples and the y-axis is the ratio of the cellular/purified DNA values at each nucleotide surrounding the TSS (x-axis). b Comparison data for Micrococcal Nuclease-Seq on the sense (red) and antisense (green) strands is shown and has been reprinted from Schones et al. [34]. Positioned nucleosomes are indicated

The bleomycin cleavage sites at the TSSs were mapped for the five expressed quintiles and the non-expressed genes for the sense strand (Supplementary Figure 2a) and the antisense strand (Supplementary Figure 2b). The top 20 and 20–40 % quintiles had a similar profile for each strand, while the 40–60 % quintile was again intermediate. The 60–80 and 80–100 % quintiles were close to the non-expressed gene profile.

The sense and antisense bleomycin data were quantified for the region from −700 to 700 bp around the TSSs for the sense and antisense strands (Fig. 4a). The profiles for the five quintiles and no expression data showed a decline in enhancement with the decline in the level of gene expression, especially from the 20–40 % quintile to the 80–100 % quintile. This Fig. 4a data was very similar on both strands, for the three different cellular samples, and also compared with the combined strand data in Fig. 2.

Fig. 4
figure 4

The relationship between the level of gene expression and enhancement of bleomycin cleavage on the sense and antisense strands at the TSSs. The enhancement of the bleomycin cleavage ratio is presented for each of the five expression quintiles and for the non-transcribed genes on the sense and antisense strands with the Cplus, Cmin and #1Cplus samples. The y-axis is the normalized bleomycin enhancement ratio and has been normalized to an arbitrary value of 1,000 for the top 20 % quintile. a The enhancement of the bleomycin cleavage ratio (cellular/purified DNA) between −700 and 700 bp relative to the TSS. b The enhancement of the bleomycin cleavage ratio (cellular/purified DNA) between –170 and −60 bp on the sense strand and between −90 and 30 bp on the antisense strand

The extra enhanced peaks at −120 bp (sense) and 0 bp (antisense) in the bleomycin data compared with the micrococcal nuclease data (see above) were quantified as shown in Fig. 4b. The profile of these peaks with respect to the level of gene expression was very similar to the −700 to 700 bp enhancement in Fig. 4a.

The bleomycin cleavage data for the sense and antisense strands is shown in Supplementary Figure 3 with the location of the positioned nucleosomes superimposed. It can be seem that the nucleosomes are slightly offset, with the sense strand nucleosomes approximately 30 bp to the right of the antisense strand nucleosomes. There is space for a potential nucleosome between approximately −140 and 40 bp.

The protocols and procedures for mapping the bleomycin data to the sense and antisense strands was verified by using RNA-seq data from HeLa cells. As can be seen in Supplementary Figure 4, the HeLa RNA-seq data mapped almost entirely to the sense strand as expected and not to the antisense strand.

Discussion

In this study, for the first time, the pattern of DNA cleavage for an anti-tumor drug was determined at high resolution for the entire human genome in human cells. Bleomycin is utilized as a cancer chemotherapeutic agent to treat Hodgkin’s lymphoma, squamous cell carcinoma, and testicular cancer [13]. Since the mechanism of action of bleomycin is thought to involve DNA damage and DNA cleavage [4], it is very important to determine the position of the bleomycin cleavage sites.

The preferred sites of bleomycin cleavage are at 5′-GT and 5′-GC dinucleotides [1216]. In order to allow for varying bleomycin cleavage efficiencies at different nucleotide sequences, a ratio of cellular to purified DNA bleomycin cleavage was calculated at each nucleotide. This enabled any bias due to the DNA sequence specificity of bleomycin cleavage to be removed.

The main outcome of this study was that bleomycin preferentially cleaved at the transcription start sites (TSSs) of actively transcribed genes compared with non-transcribed genes. Bleomycin is a relatively large molecule (molecular weight approximately 1,500 Daltons) and would have difficulty in accessing the closed chromatin structures of non-transcribed genes [19, 25].This indicated that genes with an open configuration gave bleomycin easy access to DNA compared with closed structures where DNA is protected by histones and other proteins in chromatin [17]. It has previously been shown that bleomycin preferentially cleaved actively expressed ovalbumin and globin genes but no preferential cleavage occurred when these genes were not being expressed [43].

The enhanced bleomycin cleavage at TSSs correlated with the level of transcriptional activity. There was a gradual reduction in the level of enhanced bleomycin cleavage from the 20–40 % quintile to the 80–100 % quintile; however, the enhanced bleomycin cleavage was similar for the 0–20 % and 20–40 % quintiles (see below).

This information could be used to enhance the cancer chemotherapeutic activity of bleomycin. Targeting of these actively transcribed genes in conjunction with bleomycin treatment could enhance the cancer chemotherapeutic efficacy of bleomycin by the down-regulation of these highly expressed genes [44, 45].

It should be noted that the concentrations used in this study are higher than those in a clinical setting. Several studies have shown that the DNA sequence specificity of bleomycin is similar over a range of concentrations in cellular and purified DNA [17, 46, 47]. However, it has also been demonstrated that at very low bleomycin concentrations with small hairpin oligonucleotide sequences, bleomycin has a different sequence specificity [4850].

The pattern of DNA cleavage had a peak-to-peak distance of approximately 200 bp. This indicated that positioned nucleosomes exist at actively transcribed genes as found by others in yeast [32, 33] and human cells [34]. This correlated with the known ability of bleomycin to cleave in the linker region of nucleosomes [17, 24, 25].

The detection of similar enhanced bleomycin cleavage patterns in the top 20 and 20–40 % quintiles suggests that a particular chromatin structure is being recognized by bleomycin. The degree of transcription varied by over a 1,000-fold within these two quintile groups. Hence, this chromatin structure, detected by bleomycin, does not predict the level of transcription but indicates a potential for high-level transcription. Further factors, e.g., the recruitment of transcription factors, would be expected to be required for high-level transcription to be achieved.

The peaks at 0 bp on the sense strand and −120 bp on the antisense strand were not found in the micrococcal nuclease experiments (MNase-seq) [34]. There are several explanations for these differences. First, in the micrococcal nuclease experiments the nucleosomal linker DNA is trimmed back to the nucleosome core and hence any free DNA that is not present in the nucleosome core is digested away. However, in the bleomycin experiments, the DNA is not trimmed back to the core and any free (linker) DNA will be included in the analysis. Another difference is that CD4+ cells were used for the MNase-seq experiments while it was HeLa cells for the bleomycin experiments.

Second, the bleomycin experiments could be detecting nucleosomes. The peaks at 0 bp on the sense strand and −120 bp on the antisense strand are approximately 200 bp from the neighboring peaks and that is characteristic of the presence of a nucleosome core. However, other experiments with anti-histone antibodies have indicated that a nucleosome is not present in this part of the gene [34, 51]. The bleomycin experiments may be revealing the transient presence of a nucleosome [52, 53].

Third, the bleomycin experiments may be detecting a particular structural element at TSSs. Since the bleomycin cleavage is enhanced at this region, it could have been caused by a structural distortion in the DNA. An unusual structural distortion in the DNA at the TSS could be present as part of the transcriptional initiation complex. Alternatively, a large protein (or complex) could be present on one side of the enhanced region. A strong candidate for a large protein complex would be the RNA polymerase complex, which has similar dimensions to that of a nucleosome and would fit into the DNA region between the peaks. There are several arguments against the presence of the RNA polymerase complex in these regions. RNA polymerase should only be bound to the antisense strand and in our experiments the peak was found on both strands. Also anti-RNA polymerase antibody pull-downs [34] indicate that the RNA polymerase is present at 0 bp at the TSS, i.e., not in the region between the peaks but at the peak.

Fourth, the extra enhanced peaks at −120 bp (sense) and 0 bp (antisense) had a similar profile with respect to the level of gene expression compared with the −700 to 700 bp enhancement (Fig. 4). This indicates that structure at the TSS that is being detected by bleomycin is present to a similar degree at the −120-bp and 0-bp sites as the overall enhancement structure, and is probably part of TSS structure that is poised for transcription.

Conclusions

In conclusion, we have shown for the first time the pattern of bleomycin cleavage for the whole genome in human cells. Bleomycin cleavage was preferentially observed at the TSSs of expressed genes. This study provided insight into the mechanism of action of this anti-tumor drug where actively transcribed genes were preferentially targeted. This information could be utilized to enhance the activity of bleomycin and other anti-tumor drugs.