Introduction

Breast cancer, the second leading cause of cancer death in women in the United States [1], has received hundreds of billions of U.S. dollars in research and drug development funding over the last 20 years and has produced thousands of peer-review publications [2]. The development of a single breast cancer drug costs approximately $610 million and requires on average 37 months of animal trials to screen for safety and efficacy [3]. Mouse models of metastatic breast cancer are the mainstay for screening for efficacy because of safety and cost, but too often drugs that pass animal screening fail in humans [46]. The number of projects terminated in phase III trials has increased from 36 in 2006–2007 to 53 in 2008–2010 and 45 in 2009–2011 [7]. Despite the central role of mouse models in breast cancer research and the great costs and inefficiencies of drug development, there is no standard mouse breast cancer metastasis model established as the optimal system for drug development [46, 8]. Moreover, in the era of gene targeted therapy where investigators increasingly focus on tumor genome profiles, little is known about the differences in the genetic signatures of the tumors produced by different models [9].

There has been a remarkable proliferation of publications using murine metastatic breast cancer models over the last 20 years, rising to more than 500 per year in the last 3 years. Simultaneously, there has been a divergence in the types of models used, from 3 in 1990 to 7 in 2008–2012, without a consensus model arising. In xenograft models, drugs are screened for efficacy against human tumors in immunodeficient mice. However, xenograft models obviate the anti-tumor immune response, confound tumor-host interactions important for cancer, obliterate the native mammary gland and associated vessel architecture and mammary microenvironment, and often produce false positive results [46]. Transgenic mouse models are useful to study tumorigenesis, but it usually takes months for the mice to develop tumors that are variable in timing and size, and can require more than a year to test one drug. Further, expensive diagnostic imaging modalities are required to monitor metastasis in vivo in these models, and unstable tumors can often lead to false positive results for novel therapeutics [5]. Syngeneic mouse models, such as murine mammary adenocarcinoma 4T1 cells tagged with firefly luciferase implanted in immune competent mice, can be used to screen for drug efficacy in vivo using bioluminescent technology. Such syngeneic models take into account the anti-tumor immune response, cancer-stromal interactions, and maintain and utilize the mammary microenvironment, and have been shown to produce more efficient cancer progression and metastasis than xenografts [4, 10, 11].

The choice of implantation site for xenografts, orthotopically directly by injection into the subcutaneous space (SQ) of the mammary gland or through a small incision (orthotopic implantation under direct vision, ODV) remains controversial, and critical analysis of the optimal approach for research and drug development is lacking [4, 5, 1216]. Although the orthotopic implantation model was first described over 20 years ago [17], and the advantages of this model over other models were also described 15 years ago [18], ectopically implanted SQ model is widely used to date. In fact, it was also demonstrated utilizing high resolution fluorescent protein imaging as well as whole body fluorescent imaging in real time how this model can be utilized as an ideal in vivo system to study metastatic breast cancer [1922]. However, the metastatic progression of each model, the feasibility of determining drug efficacy in mice with endpoints that are clinically important to humans, and differences in tumor genome profile have not been examined in enough detail to persuade the drug developers and critics to adopt it as the consensus model [46]. It has been argued that compared to SQ, ODV requires advanced surgical skill, and the tumors that are produced are difficult to monitor [4, 5, 1216]. On the other hand, there is some evidence that ODV produces more efficient cancer progression and metastasis than SQ [46, 10, 23, 24].

The purpose of this study was to compare commonly used methods of implantation of breast cancer cells in syngeneic mice to determine which produces the most stable results and best mimics the progression of human disease. We also examined gene expression in tumors produced to evaluate the impact of implantation site on genetic targets important for cancer biology and therapy.

Materials and methods

Materials and animals used

Virginia Commonwealth University Institutional Animal Care and Use Committee approval was obtained for all experiments and all protocols were followed. Female Balb/c, nude, and C57Bl/6 mice, 12 weeks of age, weighing approximately 20 g were obtained from Jackson Laboratories. 4T1-luc2 murine mammary adenocarcinoma cell line genetically manipulated to overexpress the firefly luciferase gene was obtained from Caliper (Perkin Elmer). The cells were cultured in RPMI media, suspended at a concentration of 1 × 106 cells/100 µl, and 10 µl was then injected into the mice, unless specified otherwise.

Bioluminescent quantification of tumor burden

D-Luciferin (0.2 mL of 15 mg/mL stock; Perkin Elmer) was injected intraperitoneally (ip) into 4T1-luc2 tumor bearing mice and Xenogen IVIS 200 and Living Image software (Perkin Elmer) were used to quantify photons emitted by the cells as described [25, 26]. Metastasis was examined ex vivo 10 min after ip injection of luciferin.

Surgical techniques of cell implantation

Subcutaneous implantation (SQ) Under anesthesia, the skin was tented up, and 10 µl of the 4T1-luc2 cell suspension was implanted into the subcutaneous space. Orthotopic Percutaneous mammary gland implantation (OP) Under anesthesia, 10 µl of the 4T1-luc2 suspension was implanted under the nipple into the mammary gland. Orthotopic mammary gland surgical implantation under Direct Vision (ODV) Under anesthesia, a 5-mm incision was made medial to the nipple, and a sterile cotton swab was used to expose the mammary gland. A 27G needle was used to implant 4T1-luc2 cells directly into the mammary gland under direct vision using x10 microscope magnification.

Gene expression profiling of 4T1 ODV and SQ tumors

Ten days after ODV or SQ (8 Balb/c mice per group, one implantation site each), the tumors were harvested and snap-frozen at −80 °C. Frozen tissue Sections (10 µm) were used for histopathological scoring after staining with hematoxylin and eosin and adjacent sections used for RNA isolation. All samples contained more than 70 % tumor. Total RNA was extracted, and the quality evaluated using a previously established method of sample processing [27]. Total RNA was extracted from multiple 10 µm frozen tissue sections using the MagMAX™-96 for Microarrays Total RNA Isolation Kit (Invitrogen Life Technologies, Carlsbad, CA), in an automated fashion using the MagMAX™ Express magnetic particle processor. RNA purity was judged by spectrophotometry at 260, 270, and 280 nm. RNA integrity was assessed by running 1 µl of every sample in RNA 6000 Nano LabChips on the 2100 Bioanalyzer (Agilent Technologies, Foster City, CA). The Affymetrix protocol (Affymetrix, Santa Clara, CA) has previously been described [27]. Briefly, starting with 500 ng of total RNA, cDNA synthesis and cRNA labeling were performed using the GeneChip® 3′ IVT Express Kit (Affymetrix). Ten µg of fragmented cRNA were hybridized to the GeneChip® Mouse Genome 430A 2.0 array for 16 h at 60 rpm in a 45 °C hybridization oven. This array provides a comprehensive coverage of the transcribed murine genome by including over 22,600 probe sets that analyze the expression level of over 14,000 well-characterized mouse transcripts. The arrays were washed and stained with streptavidin phycoerythrin (SAPE; Molecular Probes, Eugene, OR) in the Affymetrix fluidics workstation. Every chip was scanned at a high resolution, with pixelations ranging from 2.5 µm down to 0.51 µm by the Affymetrix GeneChip® Scanner 3000 according to the GeneChip Expression Analysis Technical Manual procedures (Affymetrix). After scanning, the raw intensities for every probe were stored in electronic files (in.DAT and.CEL formats) by the GeneChip Operating Software (GCODV v1.4, Affymetrix). The overall quality of each array was assessed by monitoring the 3′:5′ ratios for a housekeeping gene (GAPDH) and the percentage of “Present” genes (%P), with arrays exhibiting GAPDH 3′:5′ <3.0 and  %p > 40 % considered good quality.

Quantitative real-time PCR

Real-time, quantitative reverse transcriptase polymerase chain reaction (QPCR) was used to assess gene expression levels of selected genes using the ABI Prism 7900 sequence detection system (Applied Biosystems) with probes and primer sets obtained from Applied Biosystems as previously described [28].

Statistical analysis

Student’s t test and Kaplan–Meier survival statistical analysis were utilized for photon emission and tumor surface area, and survival data, respectively. Standard statistical methods were utilized for microarray data analysis, including the Robust Multi-array Analysis (RMA) method [29], hierarchical cluster analyses, and false discovery rates [30].

Results

ODV generates larger and less variable tumors than OP and SQ

One of the challenges in the use of animal breast cancer models is the variability in size of the tumors between each animal after implantation. Higher variability requires a larger number of animals to reach statistical power resulting in soaring cost and efforts. Therefore, we compared three different implantation methods, ODV, OP, and SQ, commonly used in breast cancer research. Using luciferase-tagged 4T1-luc2 cells, we found that the ODV implantation method more consistently produced larger tumors than the OP or SQ methods, as determined by quantitative in vivo luminescence imaging and by qualitative imaging of the tumors (Fig. 1).

Fig. 1
figure 1

ODV produced greater cancer growth with less variability than OP or SQ. Four days after implantation of 1 × 105 cells by the respective methods in 4 separate locations in 5 mice in each group, bioluminescence was quantified, the black line represents the mean for each group of N = 20 (*p < 0.02, **p < 0.001)

Pathologic analysis demonstrated that cells implanted by OP, the most commonly used method of orthotopic implantation, were not uniformly distributed into the mammary gland, and cancer cells were observed all along the needle tract, from the skin to the mammary gland (Fig. 2a). This in part may explain why OP resulted in greater variability (Fig. 1). The OP method also can miss the targeted mammary fat pad entirely, resulting in intradermal, subcutaneous or intramuscular implantation. In contrast, ODV allows for accurate implantation into the mammary fat pad, as the procedure is performed under direct visualization of the target. Tumor size was determined by caliper measurements, where ODV produced larger tumors compared to SQ at day 28 after implantation (Fig. 2b). Pathologic analysis revealed a scar-like fibrotic capsule formed around the tumor after SQ, but not after ODV implantation (Fig. 2b). Mice implanted by the ODV method had a significantly shorter survival than SQ (Fig. 2c).

Fig. 2
figure 2

Pathological analyses demonstrate the possible cause of variabilities in OP or SQ. a H&E pathology demonstrated implantation along the needle tract (black arrow demonstrates skin implantation) and in the mammary gland (white arrow) after OP method. b ODV produced larger tumors than SQ (**p < 0.001; N = 8). H&E staining showed capsule formation around SQ but not ODV tumors. c Kaplan–Meier analysis demonstrated ODV produced shorter survival than Sq (p < 0.001)

Implantation of breast cancer cells into the chest 2nd mammary pad by ODV is a better mimic of human breast cancer progression

The vast majority of studies using orthotopic implantation as a metastatic breast cancer model implant cells into the 4th mammary pad, which is located at the abdomen. The rationale for this approach is that the 4th mammary pad is the largest and relatively easy to access. Therefore, it was of interest to examine metastasis to different organs after implantation of 4T1-luc2 cells into the abdominal 4th mammary pad compared to into the chest 2nd mammary pad. Twenty eight days after ODV implantation of 4T1-luc2 cells into the 2nd chest pad developed distant metastases, mostly to the lungs and to a lesser extent to the kidneys, liver and spleen detected by in vivo and ex vivo bioluminescence technology, (Fig. 3). This is consistent with our previous observation that ODV into the chest pad developed axially lymph node metastasis first and then lung metastasis, which is the same sequence as human breast cancer progression [25, 26]. In contrast, 28 days after ODV implantation into the 4th abdominal pad generated tumors that invaded through the abdominal wall, leading to peritoneal carcinomatosis, dissemination of cancer cells in the abdominal cavity, and much less lung metastasis was observed (Fig. 3). The majority of the tumor burden after ODV into abdominal pad was in the form of carcinomatosis, which is not seen in human breast cancer.

Fig. 3
figure 3

ODV implantation into the chest 2nd mammary pad better mimic human breast cancer progression than into the abdomen 4th mammary pad. 4T1-luc2 cells were implanted into either abdomen 4th mammary pad (Abdomen; closed bar) or chest 2nd mammary pad (Chest; open bar), and ex vivo IVIS imaging was obtained 28 days after implantation. Upper panel demonstrate the percentage of total photon counts of the metastatic lesions that had tumor in the respective organs, and their representative images are demonstrated below. Vast majority of the metastases from abdomen ODV was peritoneal carcinomatosis, whereas that of chest ODV was distributed among the distant organs. N = 5

ODV is not difficult to perform or learn

A review of the literature revealed that there was no uniformity in the orthotopic injection volume. The most commonly used injectate volumes, 10 and 100 µl, were compared (Fig. 4a–e). ODV implantation of isosulfan blue dye demonstrated greater spillage with 100 µl than with 10 µl (Fig. 4a, b). H&E staining revealed that ODV implantation of 4T1-luc2 cells in a volume of 100 µl (Fig. 4c) produced tissue disruption by dissection of the tissue plane between the fascia and subcutaneous fat and scattering of cancer cells in that layer, whereas in contrast, cells remained in the mammary pad after implantation of the same number of cells in a volume of 10 µl (Fig. 4d). Seven days after ODV implantation of cells in a 10 µl volume, larger tumors were produced than when the same number of cells were implanted in 100 µl determined by bioluminescence (Fig. 4e).

Fig. 4
figure 4

Less fluid is advantageous for ODV, and it is not difficult to learn. 100 µL ODV injection of isosulfan blue spilled out of the mammary gland (a), while 10 µL ODV stayed in the mammary gland (b). Implantation of the same number of cells, 100 versus 10 µL, cause greater tissue disruption, (c) versus (d), and less cancer growth (e). ODV required approximately 3 min longer to perform than other methods (f), and it only required 5 attempts for a researcher with no animal experience (closed diamond) to be proficient in this technique (g)

The major criticisms of ODV in favor of OP is that ODV is technically more demanding and requires more time to master. When the average time to perform these methods was directly compared, ODV required only 3 min more than SQ or OP (Fig. 4f). ODV learning curves were plotted for 3 investigators with no animal experience, 3 with some animal experience, and 3 professionally trained surgeons. As shown in Fig. 4g, even those with no animal experience were able to master the ODV method and were as swift as trained surgeons after 5 trials. In sum, ODV implantation of 10 µl minimized spillage out of the mammary gland and tissue disruption, optimized cancer growth, and was not burdensome to learn or perform.

Implantation of cancer cells in different sites produces tumors that differentially express genes important for breast cancer research and drug development

Because the tumor microenvironment plays a critical role in cancer biology [4, 5, 1216], we sought to examine the effect of implantation of cancer cells into different microenvironments on gene expression profiles of the tumors that were produced. OP was not included in this analysis due to the limitations of this implantation method demonstrated above (Fig. 1). Initially, we compared the gene expression profiles of 4T1-luc2 cells in vitro (Cells) to ODV and SQ tumors using genome-wide microarrays. Unsupervised cluster analysis demonstrated that there was a significant difference in the clustering of the gene expression profiles between these groups (Fig. 5a). There were 10,350 genes (45.7 % of the entire transcriptome covered by the array) that were differentially expressed between the cells in vitro compared to the ODV and SQ tumors (FDR ≤ 1 %; p < 0.0001). Interestingly, genome-wide microarray analysis comparing SQ and ODV tumors showed that approximately 700 probe sets (3.1 % of the entire transcriptome covered by the array; lower white circle of Fig. 5a) were differentially expressed (FDR ≤ 15 %; p < 0.01). On the other hand, there were no significant differences in the gene expression profiles of tumors in different animals within each group. Supervised cluster analysis of ODV and SQ tumors demonstrated that 206 genes were differentially expressed by a 1.5-fold difference with p < 0.01 (Fig. 5b). A literature search identified 75 % of these 206 genes as either known targets of cancer therapy, cancer biomarkers, or tumor antigens (Fig. 5c). When the gene interaction networks of differentially expressed genes were analyzed for a significance of p ≤ 1 × 10−30, 8 networks were found with functions of known significance in cancer research and drug development (Table 1). Further analysis illustrated how these genes interact with pathways of interest to cancer research and targets of drug development (Fig. 6). These results were further validated by qPCR of 10 specific target genes of interest with significant differences in expression (p < 0.0001) in the development of targeted therapy including wnt5a (Fig. 7a), pdgfa (Fig. 7b), twist2 (Fig. 7c), s100a16 (Fig. 7d), cxcl14 (Fig. 7e), epha3 (Fig. 7f), hsp110 (Fig. 7g), cd59a (Fig. 7h), hdac1 (Fig. 7i), and notch4 (Fig. 7j). Although HDAC1 and NOTCH4 were differentially expressed, HDAC2 and NOTCH1 were not, as shown in Fig. 7i, j, respectively. Taken together, analysis of the differential expression of genetic pathways, gene interaction networks, and qPCR validation of specific targets suggest that the differences between the microenvironments in which SQ and ODV produce tumors differentially regulate gene profiles. Many of those are cancer biomarkers, know targets of cancer therapy, or genes that have been implicated tumor progression suggesting that the choice of the human breast cancer model can influence the development of new therapeutics in the era of targeted therapy.

Fig. 5
figure 5

ODV versus SQ: site of implantation produced tumors with different gene expression profiles. a Unsupervised cluster analysis using centered correlation and average linkage demonstrated that 10,350 genes were commonly different between Cells versus ODV and Cells versus SQ, which implicate the difference in gene expression profile between Cells and tumor in vivo is larger than ODV versus SQ. In total, 700 genes were differentially expressed between ODV and SQ tumors. b Supervised cluster analysis demonstrated the differential expression of 206 significant probe sets (3.1 % of entire transcriptome) by greater than 1.5-fold between ODV and Sq tumors, N = 8. c PubMed search of these 206 significant probe sets demonstrated that 75 % are known targets of therapy, cancer biomarkers, or cancer antigens, with 25 % of unknown clinical significance

Table 1 SQ and ODV Differentially Expressed Significant Gene Pathways Differentially expressed genes were organized into the known gene pathways reported in the literature, with a out off of significance score of 30 or higher
Fig. 6
figure 6

Significant Gene Interaction Networks. The gene interaction networks with a significance p ≤ 1 × 10−30

Fig. 7
figure 7

ODV and SQ Differentially Expressed Genes Important for Cancer Research and Drug Development. With a significance of p < 0.0001, by genome-wide microarray and qPCR validation, there was a difference in the expression of Wnt5a (a), Cxcl14 (b), Pdgfa (c), S100a16 (d), Twist2 (e), Epha3 (f), Cd59a (g), Hsp110 (h), Hdac1 (i), and Notch4(j), but not Hdac2 (i) or Notch1 (j)

Discussion

Numerous mouse models have been used to study human breast cancer development and metastasis, ranging from spontaneous and carcinogen-induced models to genetically engineered and transplantation mouse models. Although it has been advocated that there should be a wider adoption of orthotopic implantation for cancer research instead of SQ [4, 10, 14, 15, 3138], there has been a trend toward OP rather than ODV over the last 20 years. This trend has occurred despite the description of the orthotopic method over 20 years ago and of its advantages over other methods 15 years ago [17, 18].

Our results suggest that SQ and OP do not reliably form tumor into their target locations. These results support the reported variability of cancer cell growth and high false positive results in SQ [46, 10, 23, 24]. We also found that ODV produced greater tumor growth without a scar-like capsule and shorter survival than SQ. ODV implantation into the chest 2nd mammary pad produced a distribution of metastatic progression that modeled human breast cancer better than implantation into the abdominal 4th mammary pad, the most commonly used site of implantation. Implantation into the chest mammary pad produces metastasis to lung and other distant organs, while implantation into the 4th abdominal mammary pad mainly produced peritoneal disseminations secondary to direct invasion, which is almost never seen in human breast cancer patients. Despite the common misconception, we found that ODV implantation into the chest 2nd mammary pad is not difficult to perform or learn. We also found that a 10 µl volume of implantation was optimal and minimized disruption of the breast tissue. Considering that SQ and OP are the most commonly used methods, the inaccuracies and variable progression demonstrated in this study is an important consideration in favor of adopting ODV for preclinical testing of anti-cancer drug efficacy. These findings further support the multiple reports of Dr. Hoffman et al. that orthotopic models provide an optimal system for breast cancer research and drug development [17, 18].

Transcriptional profiling has been a common way to identify genes and signaling pathways important for carcinogenesis. However, rarely has there been a critical evaluation of how different methods and sites of implantation affect tumor genome-wide gene expression profiles. We have found that there was a significant difference in expression of hundreds of genes between the ODV and SQ tumors. Because the tumor microenvironment is critical to cancer biology, it is not surprising that implanting the same cancer cells into different microenvironments can alter tumor characteristics and gene expression. Strikingly, three quarters of these differentially expressed genes in ODV are of known importance in cancer cell biology and anti-cancer drug development. These differences might explain why SQ implantation models have failed to recapitulate the complexity of human breast cancer progression and failures of translation of positive preclinical trials to the clinic. Evaluation of gene interaction networks indicated that 8 of the most significant divergent networks upregulated in ODV tumors compared to SQ tumors have functions of importance for tumor development and are currently the focus of cancer research and drug development. Examples include Twist2, which is an epithelial-mesenchymal transition inducing E-cadherin repressor [39], and Epha3, which is a key player in tumor growth and survival [40]. These two genes are subjects of intense cancer biology research with the hope for future applications in therapy. Other examples include Wnt5a, an important player in breast epithelial cell migration and a marker of disease-free survival in breast cancer [41]; pdgf-alpha receptor which is correlated with breast cancer invasiveness and aggressiveness [42]; S100a16 (Fig. 7d) involved with cell motility, growth, differentiation, cell cycle progression, transcription, secretion, and a marker for circulating tumor cells, which is used as cancer diagnostic and prognostic markers [43, 44]; Cxcl14 promotes cell motility, which is associated with breast cancer development and progression, and its expression level is even associated with clinical time to breast cancer metastasis [45]. HSP110 (Fig. 7h) has been correlated with clinical stage in colorectal cancer and breast cancer, and is currently a target of cancer vaccines and molecular targeted chemotherapy [46, 47]. Antibodies to CD59a are currently under investigation to improve the efficacy of the chemotherapuetic agent rituximab [48]. HDAC1 is also currently under investigation as a target with a new class of inhibitors undergoing clinical trials in targeted cancer treatment. Interestingly, HDAC2 inhibitors showed promise in animals and failed in humans, and similarly HDAC2 demonstrated no significant differential expression between SQ and ODV, but HDAC1 which is thought to be a more appropriate target was differentially expressed [49]. NOTCH inhibitors provided a similar example: NOTCH 1 inhibitors demonstrated promise in xenograft mouse models, failed in humans, and recent findings suggest that NOTCH4 inhibitors instead offer greater promise [50]. These findings underscore the importance of characterizing the genomic profiles of the tumors produced by mouse models when screening for the efficacy of novel agents in the era of targeted therapy.