Introduction

Breast cancer represents the most widespread tumor in Western women: incidence in Italy is more than 30,000 new cases per year. Nearly 80% of human mammary carcinomas express estrogen receptor (ER) at clinically significant levels [1]. Estrogen-bound ER induces the expression of peptide growth factors that are responsible for cancer cell proliferation.

In estrogen-dependent breast tumors, control of tumor growth can be achieved by treatment with estrogen antagonists that block the binding of the hormone to ER: tamoxifen has been the predominant treatment for the past two decades [1, 2]. In postmenopausal women, aromatase inhibitors (AIs) have shown to be more effective than tamoxifen as adjuvant therapy [36]. Aromatase is the enzyme that synthesizes estrogens by converting C19 androgens to aromatic C18 estrogenic steroids. Experimental results indicate that autocrine produced estrogen plays a more important role than circulating extradiol in breast tumor promotion: aromatase is expressed in breast tumor tissue (both cancer and surrounding adipose stromal cells) probably at a higher level than in normal breast tissue [7, 8]. In hormone dependent breast cancer, neoadjuvant therapy with third generation AI therapy (anastrozole, letrozole, exemestane) reduces proliferation, causes cell death and produces tumor regression [9, 10]. Although of proven clinical efficacy, such therapies are still associated with non-response in about 20% of cases [11]. Up to now, it is still impossible to accurately predict who will respond to such treatments.

Furthermore, little is known of (i) the effects of anti-aromatase treatment in vivo, (ii) how these effects vary between tumors and (iii) whether the molecular changes fully reflect and are responsible for the clinical response. Preoperative treatment followed by surgical resection provides an excellent opportunity to correlate baseline molecular markers measured in a diagnostic needle biopsy with subsequent response to treatment, and to delineate molecular changes induced by the treatment itself.

A pivotal study on anastrozole treatment in 12 patients identified 298 genes differentially expressed between partial response and progressive disease [12]. Signatures correlated to response to neoadjuvant treatment have been also identified for exemestane, although on a very small patient cohort, and letrozole [13, 14].

Our goals were to (i) examine changes in gene expression with anastrozole therapy and (ii) identify genetic signatures that might distinguish between ER+ post-menopausal breast cancers benefiting from treatment and those not. To elucidate mode of action, we compared the transcriptional profile before and after three months of anastrozole treatment in 13 patients. In order to derive a predictive signature, we analysed global gene expression profiles of 17 ER+ breast cancers before neoadjuvant therapy and associated them to response values. Genes predictive of response were then tested on a publicly available independent dataset [14] and on an independent cohort of fixed tissues.

Methods

Patients

All tumor samples and clinical data were collected with the Fondazione S. Maugeri ethical committee’s approval and patients’ informed consent. The 17 patients, enrolled into this study between July 2004 to November 2007, were post-menopausal and had breast cancer stage T2 or T3, greater than 2.5 centimeters, any lymph node status and no distant metastasis. All the tumors were HER2/neu- and ER+/PgR+, except 2 that were ER+/PgR− (Table 1). They received neoadjuvant therapy with Anastrozole (Arimidex™, Astra Zeneca) 1 mg po/die for three months. Clinical response was evaluated by serial tumor clinical examination and mammary ultrasound bidimensional measurements, performed by single operator (LR) before, during and after treatment. Patients with decrease in tumor volume greater than or equal to 30%, according to RECIST criteria [15], were classified as responders.

Table 1 Pathophysiological characteristics and percentage of response to therapy of analyzed patients

Microarray hybridization and data processing

Samples were removed by tru-cut biopsy or surgical resection and stored in RNAlater (Ambion Inc., Austin, TX) for processing. Tissues were homogenized and total RNA was isolated using TriReagent (Sigma-Aldrich, St. Louis, MO) and RNeasy Mini Kit (Qiagen, Dusseldorf, Germany). mRNA amplification and labeling, quality control and microarray hybridization were carried out as previously described [16]. Two oligonucleotide glass arrays (Human Whole Genome Oligo Microarray 1x44K or 4x44K, Agilent Technologies, Palo Alto, CA) were utilized for each sample, using the Human Universal Reference Total RNA (Clontech, Palo Alto, CA) as baseline and dye-swap duplication.

Slides were scanned with the Agilent G2505B scanner and the TIFF images loaded into the Feature Extraction software (Agilent Technologies). Raw data were processed using the statistical computing software “R” and packages from Bioconductor (www.bioconductor.org); “limma” was used for preprocessing and differential expression analysis. Raw intensities were first background corrected (“normexp” method, with an offset value of 50 added to red and green intensities so that log-ratios are shrunk towards zero at lower intensities). In order to set log-ratios average to zero within each array and to have similar log-ratio distributions across all arrays, “loess” and “Aquantile” normalization were performed [17].

Gene ontology analysis was accomplished with GOEAST (http://omicslab.genetics.ac.cn/GOEAST/) and pathway analysis with “MetaCore 5.0” (GeneGO, St. Joseph, MI).

Signature extraction and evaluation

Genes differentially expressed by the treatment were obtained as previously described [18]. The difference matrix was defined as the difference between post and pre treatment expression matrices. To derive a predictive signature, we first calculated the Pearson correlation coefficient between tru-cut expression and response data, used as continuous variable. Then the two sample modified t-test available in the “limma” package was performed, using response data as a binary variable. The best predictors from the two lists were selected using the rank product method.

For sample classification, leave-one-out cross validation procedure was applied. Briefly, for each list of predictors, the following procedure was carried out: one sample was leaved out and two centroids, one representing responders and the other non-responders, were derived by kmeans clustering and response status was assigned based on majority rule. Then the sample initially leaved out was assigned a class based on Euclidean distance from the two centroids. The procedure was repeated 17 times leaving out each time one sample from centroid calculation and the number of correct prediction reflects the performance of the signature.

To validate the procedure, radical leave-one-out cross validation was performed, where the whole process of signature derivation was repeated leaving each time one sample out of the analysis and the derived signature was used to classify the sample initially leaved out.

The standalone JAVA-based Signature Evaluation Tool (http://microarray.ym.edu.tw) was used to predict responsiveness on test samples in radical leave-one-out cross validation and on the independent dataset (GSE5462).

RT-qPCR

Total RNA was extracted from an independent cohort of 10 formalin-fixed, paraffin-embedded breast cancer pre-treatment biopsies using Recover All Nucleic Acid Isolation kit (Ambion Inc., Austin, TX), according to manufacturer instructions.

Reverse Transcriptase quantitative Polymerase Chain Reaction (RT-qPCR) was performed as previously described [19] to validate a subset of differently expressed transcripts (CXCL3, GSS, FBXL18 and DUSP1) identified by microarray analysis on fresh samples. Primers sequences are available in Supplementary Table 1. First strand cDNA synthesis was performed using 200 ng of total RNA. The relative abundance of transcripts was established using the mean expression of TBP and RPLP0 for normalization of input cDNA.

Results

A flow chart outlying the statistical methods used for analyzing expression data is shown in Fig. 1.

Fig. 1
figure 1

Flow-chart of data processing and analysis

Change in gene expression with therapy

The molecular profiles of all the 17 patients listed in Table 1 were analyzed before therapy, and for 13 of them (identified by a star after the patient’s code) we could also evaluate the change in gene expression with treatment. Three different approaches were used to find probes differentially expressed between pre and post-treatment samples [18]: frequency of change, which measures the consistency by which a probe is up or down-regulated; magnitude of change, which identifies probes having greatest median fold change across all patients; statistically significant change, which identifies probes most significantly changed with therapy, applying two sample paired Student t-test.

Combining the lists from the three methods resulted in 634 down-regulated and 646 up-regulated probes, 78 (47 up and 31 down-regulated) of which were reported differentially expressed by all the three methods (Table 2) Venn diagrams representing overlaps between the three lists are shown in Fig. 2. Hierarchical clustering on 1280 probes using the difference matrix was not able to separate responders from non-responders in a statistically significant manner (Supplementary Figure 1). Differentially expressed probes were compared with a list of 2897 estrogen-regulated genes [20], found modulated in response to estrogens and also present on Agilent 44K array. Hypergeometric test showed that our list is significantly enriched in estrogen-regulated genes (Fisher exact test P value = 2.2 × 10−16), as revealed by Metacore analysis, too (data not shown).

Table 2 78 probes changed with treatment and found by all three statistical methods
Fig. 2
figure 2

Overlap among differentially expressed probes. Venn diagram representing overlap among down-regulated probes (a) and up-regulated probes (b) from three methods. Using the frequency of change approach (fc), 93 down-regulated and 203 up-regulated probes in 10 out of 13 patients were found (Supplementary Tables 2 and 3). Using the magnitude of change (mc) method (change in at least 10 cases), we found 109 down-regulated and 343 up-regulated probes (Supplementary Tables 4 and 5). With the statistical significant change (sc) approach, we found 543 down-regulated and 339 up-regulated probes with two sample paired t-test adjusted P-value less than 0.01 (Supplementary Tables 6 and 7)

In order to understand the functional characteristics of genes changed with treatment, gene ontology analysis was performed separately for down and up-regulated probes. The down-regulated list is basically enriched in terms related to cell cycle (Supplementary Tables 8 and 9), as cell cycle phase, M-phase, mitotic cell cycle, cell cycle, mitosis cell division and regulation of mitotic cell cycle. Other overrepresented GO terms were response to DNA damage stimulus, DNA replication and DNA integrity checkpoint. GO analysis of up-regulated genes showed that this list is more heterogeneous and less specific in terms of functions. The up-regulated genes feature terms like MHC protein complex, antigen processing and presentation, phosphoinositide phosphorylation, immune system process and insulin like growth factor binding. The top GO processes, pathway maps and network processes found for the up and down-regulated probes with their P-values are listed in Supplementary Table 10 and graphically shown in Fig. 3.

Fig. 3
figure 3

Go analysis of genes differentially expressed in pre-post treatment samples. Each rectangular represents a single category and is ordered according to ascending P-value (as in Supplementary Table 10). Similar pathways or processes have the same color code

Signature predictive of response

Using the response criteria as described in methods, 9 patients were classified as responders and 8 as non-responders (Table 1). No patient showed disease progression during the treatment. In this limited series of 17 breast cancers it has not been possible to identify any statistically significant association between the response and the base-line expression of ER, PGR, Ki67, or lymph nodes involvement. We only observed that tumors with low PgR and N1 responded less than tumors with high PgR tumors and N0, respectively.

So we applied correlation analysis using pre-treatment expression data and the percentage of response as a continuous variable. The log ratios for each probe across 17 samples and the response data were scaled to have mean zero and standard deviation one. A filter was applied to remove all the probes with more than three missing values or standard deviation less than 0.2 across 17 pre-treatment samples. After the filtering step, 35,181 probes were selected for further analysis. 159 probes had correlation test P-value less than 0.01. Out of these, 107 were positively correlated and 52 negatively correlated with response. Probes with their correlation and associated P-value are listed in Supplementary Table 11. Hierarchical clustering with Euclidean distance over 159 probes classifies patients into 2 distinct groups (Fig. 4): the cluster on the left contains only non-responders, while the one on the right contains all the responders and two patients with response values slightly below the cut-off. Repeated runs of k-means clustering find similar grouping when patients are classified in two clusters. Leave-one-out cross validation (with 2-means clustering) correctly predicts responsiveness of 14 patients out of 17 (Fisher P-value = 0.015).

Fig. 4
figure 4

Hierarchical clustering with Euclidean as distance measure over the 159 probes most correlated with response classifies patients into 2 distinct groups. Blue rectangle = non-responders; red rectangle = responders

Then two Sample t-test was performed on pre-treatment samples to identify genes differentially expressed between responders and non-responders. Analysis selected 234 probes with P-value < 0.01 (Supplementary Table 12). Hierarchical clustering over these probes separated responders from non-responders with one exception (data not shown). The patients misclassified had response percentage 27 and was actually placed into the responsive cluster group. k-Means clustering produced the same results when samples are classified into two groups. Leave-one-out cross validation correctly predicts 16 patients (Fisher P-value = 0.0004). The wrongly predicted sample was the same previously misclassified case.

The predictive signatures identified by the two methods have 39 probes in common. In order to have best compromise for informative probes, the two signature lists were combined using the rank product method. Probes were ordered according to the product of their ranks. The top 77 probes were selected as final signature (54 unique genes, Tables 3 and 4), as they performed with 100% accuracy in separating responders from non-responders in leave-one-out cross validation. To validate the procedure, radical leave-one-out cross validation was performed and SET (Signature Evaluation Tool) was used to predict the responsiveness of the test sample. We were able to correctly predict responsiveness for 82% of the samples (Fisher exact P-value = 0.015).

Table 3 Genes which are up-regulated in non-responders compared to responders in pre-treatment samples and also are correlated with response data
Table 4 Genes which are down-regulated in non-responders compared to responders in pre-treatment samples and also are correlated with response data

The 77-probe signature predictive of response to anastrozole was then tested on a totally independent dataset (GSE5462) of 52 patients treated with neoadjuvant letrozole therapy [14], using SET. Five genes (in bold in Tables 3 and 4) correctly classify 26 out of 37 responders and 11 out of 15 non-responders (Fisher exact test P-value = 0.0056).

qRT-PCR results

Four of those genes (CXCL3, GSS, FBXL18 and DUSP1) were also tested by means of RT-qPCR on a cohort of 10 FFPE breast pre-treatment biopsies from the anastrozole study, that were not used for the gene expression signature derivation. Their expression changes were in accordance with microarray analysis results, although the differences between the two groups (responders and non-responders) were not statistically significant (Supplementary Figure 2).

Changes with treatment in responder and non-responder sub-groups

A separate analysis for responders and non-responders was carried out in order to investigate whether therapy affected the expression in a different manner. The separate analysis showed that a larger number of probes were affected by the treatment in non-responders compared to responders (303 in responders and 587 in non responders). No probe was found oppositely modulated in the two groups. Most of the probes found differentially expressed in one group were not affected by the treatment in the other (only 49 probes were common between two analyses).

GO analysis of the genes differentially expressed in responders, pointed out an enrichment in GO processes, pathway maps and network processes linked to down-regulation of cell cycle, stimulation of the immune response, and cell adhesion and cytoskeleton (Fig. 5, right panel, and Supplementary Table 10). On the contrary, non-responders showed an increase in mechanisms of induction of T-Cell energy by MHC antigen presentation without IL-2 production, and were characterized by a positive regulation of processes and pathways related to androgen receptor nuclear signaling, vesicle trafficking and synaptic transmission (Fig. 5, left panel, and Supplementary Table 10).

Fig. 5
figure 5

Go analysis of genes differentially expressed in non-responders, and in responders. Each rectangular represents a single category and is ordered according to ascending P-value (as in Supplementary Table 10). Similar pathways or processes have the same color code. The right panel refers to responders while the left one to non-responders

Discussion

Studies to investigate the effect of estrogen or endocrine agents on gene expression have usually utilized in vitro systems in which cells were cultured in presence of agents for only hours or days before expression analysis [21]. Early responsive genes have been identified whose expression may or may not remain modulated after a longer period of exposure. Considering the complexity of breast tumor, it is important to utilize different approaches to investigate the effect of estrogen on tumor proliferation [2225]. In this respect, neoadjuvant treatment provides a unique opportunity to study molecular/genetic changes induced by therapy in breast cancer patients. However, relative few studies have been published so far on pharmacogenomic predictors of response [1214, 18, 26, 27].

In the present study, we analyzed the molecular changes induced by anastrozole in post-menopausal breast cancer patients and looked for a signature predictive of response. For this purpose, whole genome expression profiles of biopsies collected before treatment were analyzed and associated with clinical–pathological response to treatment.

Gene ontology analysis of differentially expressed genes between pre and post-treatment samples showed that the list of down-regulated genes mainly contains features related to cell cycle progression, growth, DNA damage stimulus and DNA integrity checkpoint. Proliferation promoting genes are also differentially expressed between pre and post-treatment samples. Gene network analysis also showed that many genes transcriptionally activated by ESR1 (estrogen receptor 1) were among the down-regulated list. Similar results for down-regulated genes were also obtained by Miller et al. [18] and Mackay et al. [27].

By contrast, among up-regulated genes we found the overrepresentation of processes linked to immune system, histamine metabolism and inflammation, cell adhesion and cytoskeleton, whereas previous reports only pointed out organ development and extracellular matrix turnover. In support of our findings, Wang and coworkers [28] showed that, in a rat model, anastrozole induced increased levels of proinflammatory cytokines and suppressed the differentiation of naive T cells to Treg cells: T-regs inhibit the antitumor functions of tumor-specific T cells by direct cell-to-cell contact.

In general, the comparison between our results on the total cohort and Mackay’s gave an overlap of 47 down- and 84 up-regulated common genes, even if Mackay’s observations were after 14 days of treatment.

GO analysis of the genes differentially expressed in responders only, revealed enrichment of processes linked to down-regulation of cell cycle and to stimulation of the immune response. On the other hand, non-responders showed an increase in mechanisms of induction of T-cell energy by MHC antigen presentation without IL-2 production: these processes could reduce an immune response versus the tumor cell induced by anastrozole treatment [28]. Furthermore, non-responder tumors showed an up-regulation of network processes related to androgen receptor nuclear signaling: this suggests that tumor resistant cells could divert androgens to another cell proliferation signal to escape from aromatase cell cycle inhibition [29]. Sikora et al. [30] hypothesized that androgens and their metabolites, independently of aromatase activity, may contribute to breast cancer growth. Moreover, non-responders show a down-regulation of negative regulation of synaptic transmission, a mechanism that could be connected to cancer cell proliferation [31] and an up-regulation of Rab-regulated recycling pathways of vesicle: some elements of Rab protein family contribute to breast cancer progression [32, 33].

Our results are the first to demonstrate specific pathways involved into response to AI, with a specific distinction between tumors that respond and those that show intrinsic resistance.

To find a signature predictive of response, probes having maximum correlation with response variable or differentially expressed between responders and non-responders were selected in pre-treatment samples. The differentially expressed list performed better in separating responders from non-responders. To combine the predictive power of the two lists, probes were arranged according to the product of their ranks and the top 77 were selected as final signature (100% accuracy). In order to test such predictor on an independent cohort, we downloaded the dataset used by Miller and coworkers [14], who derived a signature predictive of response to neoadjuvant treatment with letrozole. Our signature and the 69 pre-treatment variables in Miller’s one have no common genes. However, after mapping the 77 unique Agilent probes from our analysis to 66 unique Affymetrix probes on Miller’s dataset, we were able to identify six probes predictive of response to letrozole (p = 0.0056), by using SET. The six probes correspond to five unique genes, which are CXCL3, GSS, PAX4, KIAA0913, FBXL18. Among these, CXCL3 is up-regulated in non-responders and was previously found associated with a shorter relapse-free survival of ERalpha-positive breast cancer patients treated with tamoxifen [34].

Only 17 patients satisfying the eligibility criteria discussed above completed the therapy and provided good quality RNA for microarray analysis. Due to the limited number of patients in our cohort, we used them as training-set to derive a signature predictive of response and then used a bigger and totally independent cohort of patients with comparable characteristics as test-set. This was the only larger neoadjuvant series available to best of our knowledge to use as validation set. Although our signature derives from a small patient cohort, it succeeds in predicting response to another AI, very similar to anastrozole, in such a larger independent series.

Four of those genes were also tested on an independent cohort of 10 FFPE biopsies. Dividing patients into two groups, responders and non-responders, we obtained the same modulation trend as in the previous microarray analysis.

To our knowledge, a limited number of papers have been published on this subject, and none has results comparable to ours. Kristensen et al. [12] used anastrozole too, but analyzed few and heterogeneous (ER+ and ER−) patients, defining only a group of differentially expressed genes between partial response and progressive disease. Harvell et al. [13] used exemestane, alone or in combination with tamoxifen, on 7 ER+ breast cancers and individuated 50 genes associated with response; however, treatment of the 7 patients was not homogeneous: all responders were treated with exemestane only, but among non responders, three were treated with the two drugs in combination and one received exemestane only. Miller and colleagues [14] identified a combination of pre-treatment (biopsy before therapy) and changed-by-treatment (biopsy after 14 days therapy) genes able to predict response on a cohort of 52 samples treated by neoadjuvant letrozole therapy. However, it has to be emphasized that none of the former studies tested their signatures on independent datasets.

If our results will be further validated on a bigger fixed cohort, AI neoadjuvant treatment could be focused on these “likely responder” patients, pointed out by gene expression evaluation of few genes on RNA extracted from fixed biopsy. On the other hand, patients showing a “non-responsive” signature will be operated immediately, reducing risks and costs of an ineffective treatment. Molecular evaluation on tru-cut biopsy only has the advantage of avoiding treatment on patients predicted to be non-responders, without waiting for 14 days and re-biopsing the tumor, as required to translate previous results to a clinical setting.