Introduction

Reproductive factors that increase the woman’s lifetime exposure to hormones, such as early menarche and late menopause, are established risk factors for breast cancer. Studies of breast cancer risk by receptor subtype have shown inconsistent results [1]. The importance of the androgen receptor (AR) as a prognostic factor and a possible treatment target in breast cancer is currently debated [2, 3]. Immunohistochemical (IHC) expression of AR is reported in 70–90% of all invasive breast cancers [4, 5]. Several studies have indicated that AR is an independent positive prognostic marker in breast cancer, as well as a predictive factor for response to endocrine treatment [4, 610]. AR has frequently been co-expressed with the estrogen receptor alpha (ER) and the progesterone receptor (PR) [8, 11, 12]. The AR acts as a transcription factor and has been shown to bind to ER-regulated genes to inhibit ER-dependent cell proliferation [6]. Recently, in vitro studies have suggested that activated AR up-regulates estrogen receptor beta gene expression, which inhibits breast cancer cell growth [13]. AR has also been suggested to interact with cell cycle checkpoint protein p21, which is involved in epidermal growth factor signaling [14]. Further, AR expression may be associated with response to tamoxifen (TAM) [15]. The significance of AR expression in ER negative (ER-) breast cancer and triple-negative breast cancer (TNBC) has been contradictory. Some studies show no prognostic value of AR in ER- and/or TNBC, whereas others report either improved or worsened outcomes for breast cancer patients with AR positive (AR+) in ER- and/or TNBC as reviewed by McNamara et al. [16]. However, a clinical phase II trial recently showed promising results for the anti-androgen bicalutamide in AR+/ER-/PR- metastatic breast cancer, high-lighting AR as a promising predictive marker in breast cancer treatment [17]. Another anti-androgen, enzalutamide, is currently being tested in an ongoing clinical trial (ClinicalTrials.gov identifier: NCT01597193). It has also recently been shown that the ER/AR ratio could be an important predictor of response to endocrine treatment [18].

Androgens exert stimulating effects directly on breast cancer cells through binding to the AR, as well as indirect effects through androgen aromatization into estrogens, which in turn bind to the ER [1921]. The levels of circulating androgens are most likely affected by hormonal factors, such as reproductive history and anthropometric measures [22, 23]. However, the potential impact of lifestyle factors including anthropometry, reproductive history, and exogenous hormone use on AR expression in breast cancer has not yet been addressed. We hypothesized that lifestyle factors and body constitution associated with endogenous androgen levels may affect the risk of developing AR-defined breast cancer as they do for ER-defined breast cancer risk [1, 24].

The aim of this study was to evaluate AR expression in relation to clinically established tumor markers and to analyze the association between lifestyle factors and AR-defined breast cancer risk using the Malmö Diet and Cancer Study (MDCS). The MDCS is a large prospective population-based study consisting of 17,035 women, among whom a total of 747 have been diagnosed with incident invasive breast cancer.

Materials and methods

Malmö Diet and Cancer Study

MDCS is a population-based prospective cohort study of women living in Malmö, Sweden. Women born between 1923 and 1950 were eligible for inclusion, and enrollment took place between 1991 and 1996. At baseline, participants underwent anthropometric measures, answered questionnaires (multiple-choice and open-ended questions) regarding demographics, lifestyle, medical history, and reproductive history including exogenous hormone use. Exclusion criteria included only language incapacities and mental disabilities that prevented the respondent from answering the extensive questionnaire. Further details concerning the study design have been described previously [25]. In order to retrieve information on incident cancer and vital status among participants, the MDCS has been linked annually to the Swedish Cancer Registry, the Regional Tumor Registry for Southern Sweden, and the Swedish Cause of Death Registry. Ethical permission for this study was obtained from the Ethical Committee at Lund University (Dnr 472/2007).

The study cohort and follow-up

The study population consisted of 17,035 women enrolled in the MDCS. In this study, women with a history of breast cancer at baseline were excluded (n  = 576), resulting in a total study cohort of 16,459 women. By the end of the follow-up period, on the 31 December 2007, a total of 826 women had been diagnosed with breast cancer. Review of tumor data revealed a total of 79 ductal carcinoma in situ (DCIS) tumors and 747 invasive breast cancers (Fig. 1).

Fig. 1
figure 1

Flow chart of the study population from the Malmö Diet and Cancer Study (MDCS)

Histopathological analyses

Tumor material was collected from formalin-fixed paraffin-embedded breast cancer blocks. All tumor samples obtained from women diagnosed from 1991 to 2004 were stained with hematoxylin and eosin (H&E), mounted on slides, and reviewed by one breast pathologist for confirmation of the histopathological diagnosis, i.e., the histological type according to WHO classification guidelines [26] and tumor grade. Tumor grading was performed according to Elston and Ellis [27] and included tubular formation, nuclear atypia, and mitotic index. For women diagnosed 2004–2007, the histological type and grade were collected from hospital records including pathology reports. Data on size and lymph node status were retrieved from pathological reports. In this study, tumor size was dichotomized into ≤20 or >20 mm groups and axillary lymph node involvement (ALNI) was recorded as negative or positive (≥1 metastatic nodes).

Tissue microarray and tumor markers

Representative areas of invasive cancer from the donating tumor block were selected for tissue microarray (TMA) construction as described in detail previously [28]. From each tumor, two cores of either 0.6 mm (1991–2004) or 1.0 mm (2005–2007) were mounted in a recipient block and TMA slides of 4 µm were cut and prepared for IHC analyses. The previously studied IHC markers included ER, PR, the proliferation marker Ki67, and human epidermal growth factor 2 (HER2) (1994–2004) [28]. Cut-off values in accordance with current clinical practice (ER, PR) and previous MDCS studies (Ki67) were used [28], and expression levels were defined as negative/low with ≤10% of positively stained nuclei and as positive/high with >10% of positively stained nuclei. HER2 was assessed either through IHC score, ranging from 0 to 3+ according to the HercepTest [29] (1991–2004), or by regional and national cancer registries and hospital records (2005–2007) including both IHC and in situ hybridization (ISH) analysis data. When the score of the IHC evaluation was applied, a score of 0 and 1+ was considered negative and a score of 3+ was considered positive. Tumors with an IHC score of 2+ were excluded as missing, if not confirmed as either amplified or normal in ISH analysis. An ISH analysis result was used when available. AR was assessed using the monoclonal antibody Ab-1 (clone AR441, dilution 1:200, Thermo Scientific, Fremont, CA, USA). Scoring of the AR was performed semi-quantitatively, using fractions of 0, 1–10, 11–50, 51–75, and 76–100% positive nuclei. In this study, a dichotomized variable with a cut-off at 10% was used. All arrays were evaluated independently twice (KE), and in case of discrepancy, a third examination was performed (SiB), followed by a final decision. In the case of heterogeneity of AR expression between the two duplicate cores, the decision was based on visual evaluation of the total tumor area of the two cores pooled together. A valid AR score was obtained for 516 of the 747 invasive breast cancer tumors. Among the 231 cases with missing AR status, there were 96 cases with no tissue available for histopathology. In 52 cases, there was lack of tissue cores in the array due to prior sectioning for marker analysis, and in 17 cases, tumor tissue cores displayed no invasive tumor foci. In the remaining 66 cases, the tissue samples in the cores were damaged during processing such that the tissue structure was destroyed/melted or dislocated on the TMA and could not be assessed for AR expression.

Participants’ baseline characteristics

Educational level was categorized into O-level college (9 years of school attendance), A-level college (another 3 years of education), and university level (education following A-levels). Type of occupation was recorded as manual worker, nonmanual worker, or as employer/self-employed. Alcohol use was divided into three categories: (1) nothing last year, (2) something last year but not last month, or (3) something last month. Smoking habits were categorized into never, current, and former smokers. Anthropometric measures such as weight (multiples of 0.1 kg) and height (to the nearest 0.005 m) were measured by a trained nurse, and body mass index (BMI) was calculated as kg/m2. BMI categories used were <25, ≥25–<30, or ≥30 kg/m2 [30]. Waist circumference was measured at the midpoint between the lower ribs and the iliac crest (to the nearest 0.01 m) and categorized as ≤80, >80–≤88, and >88 cm [31]. Hip circumference was measured at the level of greatest lateral extension (to the nearest 0.01 m). Waist and hip measurements were used to construct a waist–hip ratio (WHR) (m/m), and categories used were ≤0.80, >0.80–≤0.85, and >0.85 [31, 32]. Body fat percentage was calculated from measurements of bio-impedance (BIA 103, RLJ-Systems, Detroit, MI, USA).

Information on reproductive history included age at menarche, parity, age at first childbirth, breast-feeding (<1, 1–<12 or ≥12 months), age at menopause, and exposure to oral contraceptives (OC) (ever/never). Menopausal status at baseline was defined as pre-, peri-, or postmenopausal as described in detail previously [28]. Use of hormonal replacement therapy (HRT) was categorized as current use or nonuse and as type of HRT, coded according to the Anatomic Therapeutic Chemical classification (ATC) system [28]. The present study grouped use of HRT into estrogen only (eHRT) and combined estrogen plus progestin HRT (cHRT). Progestin-only HRT use was uncommon (n = 106, 0.65%) and thus excluded from analysis of HRT.

Statistical analysis

Statistical analyses were performed using the statistical software SPSS Statistics 19 (IBM, Chicago, IL, USA). Variables were categorized either as described earlier or as tertiles based upon the distribution in the entire study cohort, excluding prevalent cases. Age at baseline was not normally distributed due to participant recruitment; thus, age was presented as median and interquartile range and analyzed as a categorized variable of four equally sized groups. Height (normally distributed) and the natural logarithm of weight (skewed distribution) were used as continuous variables in the multivariable analyses. Hazard ratios (HR) stratified for AR status were calculated using Cox proportional hazards model and presented as age-adjusted HR with 95% confidence intervals (CI). Multivariable analyses were adjusted for age at baseline (categorized into quartiles), height (continuous), weight (natural logarithm), occupation type, age at first childbirth (categorized, including nulliparous women), ever OC use, and current HRT use and type, if not otherwise specified. Univariable analyses were carried out using log-rank test and presented as Kaplan–Meier curves for both AR- and ER-defined breast cancer. Each woman in the study cohort was followed from inclusion until the event of breast cancer, death, emigration, or end of the follow-up period by the 31 December 2007. In the subanalyses of AR+ breast cancer, time of follow-up was censored at the time of AR negative (AR-) breast cancer, cancer in situ, or breast cancer with missing AR status. Subanalyses for AR- breast cancer were performed correspondingly.

Tumor characteristics, such as tumor size, ALNI, histological grade, ER, PR, HER2, and Ki67 were categorized and analyzed in relation to AR status with X 2-analyses or Fishers’ exact test. Odds ratios (ORs) with 95% confidence intervals (CIs) and p values are presented. Logistic regression was used to compare AR status in relation to combined ER/PR status and histological grade. ORs and 95% CIs are presented. Three two-way interaction terms were created between the following three variables: age at first childbirth (≤20, >20–≤25, nulliparous, >25–≤30, >30) and ever OC use (yes/no) or cHRT (yes/no), and finally, ever OC use (yes/no) and cHRT (yes/no). p values of less than 0.05 were considered significant. All p values were two-tailed. Nominal p values without adjustment for multiple testing are presented.

Results

Tumor characteristics

A total of 467 tumors (90.5%) were AR+, and 49 tumors (9.5%) were AR- (Table 1). AR negativity in the tumors was significantly associated with higher tumor grade and higher proliferation rates. ER- and PR- expression showed highly significant associations with AR- expression, mainly driven by co-expression between AR and ER. No significant association with AR was observed for tumor size and ALNI. In analyses using a variable combining HER2 and hormone receptor status, AR negativity was associated with the triple-negative phenotype rather than the HER2+/ ER- /PR- phenotype. In order to address potential selection bias, tumors with missing AR data were analyzed in line with tumors with known AR. Risk factors for women with missing AR status are presented separately and show no major differences compared to women with known AR status (Table 1). Furthermore, sensitivity analysis was performed for tumor characteristics with regard to missing AR status. In the analysis that considered all tumors with missing AR status to be AR+, all results remained statistically significant (data not shown). When missing tumors were all assumed to be AR-, the results changed; this is in accordance with nearly 90% of invasive breast cancer tumors being AR+. The distribution of the baseline characteristics among all participants is described in Supplementary Table 1.

Table 1 Associations between AR status and clinicopathological parameters

Risk factors for AR-defined breast cancer

The higher the age at first childbirth, the higher the risk of AR- breast cancer. This trend became somewhat stronger in the multivariable analysis. In contrast, age at first childbirth did not affect the risk of AR+ breast cancer or the overall risk of invasive breast cancer (Table 2; Fig. 2). Breast-feeding did not affect the risk of developing AR-defined breast cancer. Ever users of OC had a significantly increased risk of AR- breast cancer compared with never users of OC (Table 2; Fig. 2). Ever use of OC was not associated with risk of invasive breast cancer or AR+ breast cancer. Results remained significant in multivariable analyses. The use of cHRT was significantly associated with an increased breast cancer risk, irrespective of AR status, compared with nonusers of HRT (Table 2; Fig. 2) and irrespective of OC use prior to cHRT. The results remained essentially the same in multivariable analyses. There were no significant interactions between OC use and cHRT use, OC use and age at first childbirth, or age at first childbirth and cHRT use (p ≥ 0.17). As for anthropometric measures, the mid-category of waist measurements, >80–≤ 88 cm, showed a significantly decreased risk of AR- breast cancer (Table 3). The result remained significant in multivariable analysis. In analyses stratified for menopausal status, the result only remained significant among postmenopausal women (age-adjusted HRpost_mp 0.24 (0.057–1.01) p  =  0.052). No associations between waist circumference and overall breast cancer risk or AR+ breast cancer risk were seen, neither in the entire cohort nor in analyses stratified according to menopausal status. No associations between hip measurements and risk of invasive or AR-defined breast cancer were seen, neither for the entire cohort nor when stratified according to menopausal status. WHR was not associated with risk of invasive breast cancer in general or AR- breast cancer in particular. When stratified according to menopausal status, no significant differences were seen for WHR measurements among invasive or AR- breast cancer. The mid-category of WHR, >0.80–≤0.85, showed an increased risk of AR+ breast cancer (Table 3). The result remained significant in multivariable analysis and was most prominent among the postmenopausal women in analysis stratified for menopausal status (age-adjusted HRpost_mp 1.31 (1.02–1.68) p  =  0.034). Height and weight were significant risk factors for invasive breast cancer in general; however, they did not differ according to AR status (Table 3). Education level, smoking, and alcohol use were not associated with either invasive or AR-defined breast cancer risk. Nonmanual workers had an increased risk of invasive breast cancer in general compared to manual workers (age-adjusted HR 1.26 (1.07–1.47) p  =  0.004). However, the association was not related to AR-defined breast cancer risk.

Table 2 Breast cancer risk by AR status; reproductive factors and exogenous hormone use
Fig. 2
figure 2

Kaplan–Meier estimate of risk of AR negative or AR positive defined breast cancer in relation to a + b age at first childbirth, c + d ever use of OCs, and e + f use of estrogen only or combined estrogen plus progestin HRT

Table 3 Breast cancer risk by AR status; anthropometry

Risk factors for ER-defined breast cancer

In order to assess whether the AR-defined breast cancer risk factors add independent information in addition to ER-defined risk factors, the analyses were repeated stratified according to ER status. Neither use of OC nor age at first childbirth was associated with ER-defined breast cancer risk (Supplementary Fig. 1).

Discussion

Here, we present data from the large MDCS showing that advanced age at first childbirth and ever use of OC increased the risk of AR- breast cancer, without influencing the risk of AR+ breast cancer. Use of cHRT was associated with increased risk of breast cancer irrespective of AR status.

The distribution of tumor characteristics in relation to AR status was consistent with similar studies, markedly for ER status, ALNI, size, and grade [4, 12, 33]. No association was seen between AR expression and HER2 status alone. When HER2 status was combined with ER and PR, AR negativity was associated with the triple-negative phenotype rather than the HER2+/ER-/PR- phenotype. Similar findings on AR and HER2 status have been described previously [12] although a review has reported heterogeneous results regarding AR and HER2 status [16]. Ni et al. [34] reported that targeting the AR in ER-/HER2+ breast tumors effectively inhibits tumor growth. Thus, androgen receptor blocking drugs (e.g., bicalutamide [17] or enzalutamide (ClinicalTrials.gov identifier: NCT01597193)) may be a treatment option for breast cancer patients with ERAR+ tumors [18].

In this study, risk assessment analyses for ER-defined breast cancer differed from AR-defined breast cancer risk. Consequently, we suggest that risk factors for AR-defined breast cancer differ from those of ER-defined breast cancer and that AR status can add information independent from ER. We therefore consider risk assessment in relation to AR to be of interest, especially as we show that these risk estimates add information to the risk defined by ER stratification.

The main limitations of this study are the limited number of women who developed AR- tumors and the number of tumors with missing AR data; both factors may impact the robustness of the results. However, in accordance with the results of the sensitivity analysis on missing AR status, missing tissue was not considered to exert a risk of selection bias in this study. Some additional methodological concerns should be addressed. The participation rate in the MDCS was 40%, and a previous study on the background population has shown a selection of higher socioeconomic status in participants [35]. The participating women in the MDCS were often postmenopausal; hence, the risk factors elucidated from this study may primarily cover postmenopausal breast cancer. The information on menopausal status was obtained at baseline and not at diagnosis, making adjustment for menopausal status in this study difficult. We did, however, stratify the findings of anthropometry according to menopausal status at baseline, since it is well documented that the association of breast cancer with BMI varies according to menopausal status. Furthermore, the vast majority of participants were of Swedish ethnicity, thereby possibly limiting the applicability of our results to women in general. Information on exposures was obtained from baseline questionnaires, and it is possible for these exposures to have changed over time since women were 44 years or older at inclusion. HRT use may be underestimated as participants may have initiated HRT use after inclusion in the study. However, age at first childbirth and ever use of OC are likely to be correctly reported since few women would give birth or initiate OC use after age 44. The nulliparous women might be a heterogeneous group with some women postponing pregnancy for social reasons, whereas others may experience fertility problems. Thus, the hormonal influence may differ between such subgroups, which may have an impact on the interpretation of the results. However, this study was not able to address this issue in depth as information on infertility was not available. Since the aim of this study was to compare AR expression in relation to lifestyle factors including anthropometry, reproductive history, and exogenous hormone use, we consider it possible to make internal comparisons between the subjects in order to obtain relative risks. Moreover, all multivariable analyses were adjusted for potential confounders and showed similar results as compared to age-adjusted analyses.

The breast epithelium is considered most sensitive to hormonal stimuli during the period between menarche and first childbirth. Further, the breast tissue matures during breast-feeding, and it may take multiple pregnancies before the breast tissue is fully matured [36]. In line with this, advanced age at first childbirth has been shown both to increase the risk of breast cancer in general [37] and to affect histological types and ER/PR-defined breast cancer differently [1, 38]. To our knowledge, analyses stratified according to AR status have not been performed.

Ma et al. [24] hypothesized that the protective effect of parity could be explained by reduced circulating hormone levels, increased levels of sex hormone binding globulin SHBG, or by increasing the breast epithelium differentiation to a less susceptible state concerning the effects of estrogen and progestin. Previous results from the MDCS have suggested that advanced age at first childbirth and parity correlate with more aggressive breast cancer subtypes [39], which is in agreement with the current findings of an association between advanced age at first childbirth and risk of AR- breast cancer. A recently published case study showed increasing frequency of TNBC among women with an older age at first child birth. However, AR expression was not investigated in the study [40].

In this study, a higher risk of AR- breast cancer among ever users of OC compared to never users was found, whereas no effect was seen on AR+ breast cancer. The Collaborative Group on Hormonal Factors in Breast Cancer reported a small increase in breast cancer risk, especially for women younger than 20 years using combined OC and up to 10 years afterward [41]. Results of analyses on OC use by receptor subtype, although inconsistent, have shown an increased risk of ER- breast cancers over ER+ breast cancers [1], which would be in line with our results of increased AR- tumors among ever users of OC.

With regard to HRT use, the Collaborative Group reported an obvious risk increase, lasting up to 5 years after cessation of therapy [42]. Previously, we have shown an increased risk of breast cancer among cHRT users in the MDCS [28]. The same pattern was seen in this study, the risk increment for women using HRT being irrespective of AR status. In relation to breast cancer subtypes, HRT use has been associated with ER+/PR- breast cancers and the progestin component of cHRT has been suggested to be of considerable importance [43, 44]. The importance of a stable testosterone/estrogen ratio in order to avoid breast cancer development has been stressed [45]. Use of either OC or HRT is capable of distressing this ratio by reducing free androgens, thus leaving the estrogenic stimulation on the breast unopposed [45]. It has been postulated that the synthetic progestins disrupt androgen signaling that normally exerts protective effects on the breast [46].

Furthermore, OC users have been shown to have lower testosterone levels compared to nonusers of OCs [23]. Androgens have been suggested to protect breast tissue from excessive hormone-induced proliferation, contrasted by findings indicating that androgens might increase breast cancer risk [45]. Preclinical studies implied that androgens promote apoptosis in human breast cancer cell lines [47]. Other preclinical data showed that AR blocks ER-stimulated growth in breast cancer cells [6] and that overexpression of AR decreased ER transcriptional activity [48]. Whether different AR genotypes are reflected in AR expression in breast tumors is unknown, but we have recently reported on AR genotypes predicting response to TAM treatment [49]. The treatment predictive value of tumor-specific AR expression has been addressed in endocrine-treated cohorts, demonstrating high AR expression to be a predictive factor for response to endocrine treatment [10]. Low or intermediate AR expression might be used as an indication to supplement endocrine treatment with chemotherapy or not [10].

Summary and future perspectives

In summary, age at first childbirth and use of OC increase the risk of AR- breast cancer without influencing the risk of AR+ breast cancer. This information may contribute to the comprehension of AR’s role in breast carcinogenesis and potentially play a role in breast cancer prevention by improving risk stratification. Further, considering the future potential for AR as a treatment target in breast cancer, more studies elucidating AR function are needed.