Introduction

Breast cancer is the most common malignant tumor among women and is a highly heterogeneous disease [1]. According to the hormone status and human epidermal growth factor receptor 2 (Her2) status, it can be divided into four subtypes: luminal A, luminal B, Her2-positive and triple negative breast cancer (TNBC) [2]. Due to improvements in treatment and early detection, the mortality of breast cancer declines year by year. However, as a result of resistance to chemotherapy, some patients especially those who can not benefit from other treatments still have low survival rates [3]. Therefore, it is very necessary to identify effective biomarkers and novel drug targets to improve the prognosis of breast cancer patients.

MicroRNA(miRNA) is a class of small noncoding RNAs whose length is 19–22 nucleotides. MiRNAs can regulate gene expression at the post-transcriptional level by binding to the complementary sequence in the untranslated regions (UTRs) of their target mRNAs [4]. Besides, miRNAs also can interact with long non-coding RNAs (lncRNAs), preventing miRNAs from binding to their target genes, thereby interfering with gene expression [5]. In addition to cancer, aberrant expression of miRNAs is also associated with many other human diseases, like Parkinson’s disease and Coronary Artery disease [6, 7]. In breast cancer, studies have shown that miRNAs are involved in proliferation, apoptosis, migration, invasion and drug-resistance [8,9,10,11].

In this study, we detected differentially expressed miRNAs between chemo-sensitive breast cancer tissues and chemo-resistant breast cancer tissues through the integration of two microarray profiling datasets from Gene Expression Omnibus (GEO) database. Then, we used miRNA-Genes interaction prediction database to obtain their target genes and compared them with chemo-resistant related genes from another dataset to generate a link between chemo-resistant miRNAs and genes. Finally, we applied bioinformatics analysis to identify the miRNAs and genes that play an important role in breast cancer chemotherapy resistance, which might become new biomarkers or drug targets.

Materials and methods

MiRNA microarray

The microarray datasets GSE71142, GSE73736 and GSE6434 were downloaded from the GEO database. GSE71142 and GSE73736 contained miRNA microarrays from both chemo-sensitive tissues and chemo-resistant tissues of breast cancer. GSE6434 contained the gene expression data from chemotherapy sensitive and resistant patients.

Screening for DE-miRNAs

The raw data were normalized and analyzed by R package “limma” from the Bioconductor project. The miRNAs with |log2 fold changes (FC)|>1 and P value<0.05 were considered as DE-miRNAs. The overlapping DE-miRNAs from the two datasets were finally selected.

Prediction of target genes of DE-miRNAs and identification of DE-genes

The target genes of DE-miRNAs were predicted by miRWalk3 database, which predicted target genes by using its own TarPmiR algorithm or other miRNA-target prediction datasets like TargetScan, miRDB and miRTarBase. Only common target genes predicted by at least two methods were chosen as potential target genes. The common genes from both the potential target genes and differentially expressed genes in GSE6434 were considered as DE-genes.

Gene ontology annotation (GO) and KEGG pathway analysis

The online website, DAVID (http://david-d.ncifcrf.gov/), was used to perform gene ontology annotation and KEGG pathway enrichment analysis for DE-genes. All the data were visualized by R package “ggplot2”.

PPI network analysis, hub gene identification and miRNA-gene network construction

The STRING database is an online tool to analyze protein-protein interaction. We submitted the DE-genes to the STRING database and only chose the interactions with a confidence score > 0.7. The PPI network was constructed by software Cytoscape. The degree of connectivity in networks was calculated to obtain the significant nodes and hub genes (25). The miRNA-gene network was visualized by software Cytoscape.

Survival analysis of hub genes

Kaplan-Meier plotter website (https://kmplot.com/analysis/) is able to assess the effect of fifty thousand genes on survival in several cancer types including breast, ovarian, lung and gastric cancer. In order to analyze the relapse free survival (RFS), the patient samples were split into two groups according to the median level of each gene’s expression. The website can automatically compare the two patient groups by a Kaplan-Meier survival plot and calculate the hazard ratio (HR) with 95% confidence intervals and log rank P values.

Results

Identification of DE-miRNAs and their target genes

According to the previously defined threshold, 22 DE-miRNAs were finally screened out, including 10 upregulated and 12 downregulated miRNAs (Fig. 1) in chemo-resistant tissues, compared with chemo-sensitive tissues. MiRWalk3 database was employed to predict the target genes of the DE-miRNAs and finally generated 3754 potential target genes, including 1824 genes related to upregulated miRNAs and 1930 genes related to downregulated miRNAs. Then, we took the intersection of two miRNA sourced target genes and chose the common part of genes between miRNA sourced genes and GSE6434 sourced genes. Totally, 1932 genes were acquired, including 978 genes for upregulated miRNAs and 954 genes for downregulated miRNAs.

Fig. 1
figure 1

Heatmap for DE-miRNAs. a GSE71142, b GSE73736

Functional and pathway enrichment analysis

To further investigate the biological functions, KEGG pathway analysis and GO analysis including molecular function (MF), cellular component (CC) and biological process (BP) were performed through DAVID tools. The top 10 most significant Go terms and KEGG pathway terms for the target genes of upregulated DE-miRNAs were shown in Fig. 2. On the BP level, the genes were mainly enriched in positive regulation of transcription from RNA polymerase II promoter, regulation of phosphatidylinositol 3-kinase signaling and signal transduction. On the CC level, the genes were mainly enriched in cytosol, nucleoplasm and cytoplasm. On the MF level, the genes were mainly enriched in protein binding, RNA polymerase II core promoter proximal region sequence specific binding and kinase activity. The enriched KEGG pathways included ErbB signaling pathway, pathways in cancer and insulin signaling pathway.

Fig. 2
figure 2

GO and KEGG pathway analysis of the overlapping target genes from the upregulated DE-miRNAs. a Biological process, b Cellular component, c Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis and d Molecular Function

We also conducted GO functional and KEGG pathway analysis for the target genes of the downregulated DE-miRNAs. The top 10 most significant Go terms and KEGG pathway terms were shown in Fig. 3. On the BP level, the genes were mainly enriched in positive regulation of transcription from RNA polymerase II promoter, negative regulation of transcription from RNA polymerase II promoter and intracellular signal transduction. On the CC level, the genes were mainly enriched in cytosol, cytoplasm and nucleoplasm. On the MF level, the genes were mainly enriched in protein binding, transcription factor binding and sequence-specific DNA binding. The enriched KEGG pathways included proteoglycans in cancer, neurotrophin signaling pathway and GnRH signaling pathway.

Fig. 3
figure 3

GO and KEGG pathway analysis of the overlapping target genes from the downregulated DE-miRNAs. a Biological process, b Cellular component, c Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis and d Molecular Function

PPI network analysis, hub gene identification and miRNA-gene network construction

As we all know, proteins usually perform their functions by interacting with other proteins. Thus, we used STRING database to analyze the PPI network of the DE-genes of the both upregulated and downregulated miRNAs. Software Cytoscape was used to identify the top 25 hub genes according to degree level (Fig. 4). For the upregulated miRNAs, the top 10 hub genes were MAPK1, CTNNB1, PIK3CA, PIK3R1, MAPK3, KRAS, APP, HSP90AA1, PTEN and MAPK8. Among these genes, MAPK1 showed the highest node degree(degree=64). For the downregulated miRNAs, the top 10 hub genes were PRKACA, CDC42, MAPK1, MAPK8, KRAS, POLR2E, SMAD3, STAT3, PTPN11 and PTEN. Among these genes, PRKACA showed the highest node degree(degree=54). As shown in Fig. 5, miRNA-hub genes network was constructed by software Cytoscape. Hub genes of upregulated miRNAs could be regulated by miR-200c-3p, miR-214-3p, miR-33b-3p, miR-3927-3p, miR-4277, miR-4422 and miR-4771. Among them, miR-214-3p was predicted to target the most hub genes (n=12), indicating that this miRNA may play an important role in chemotherapy resistance in breast cancer. The hub genes of downregulated miRNAs could be regulated by miR-30c-2-3p, miR-586, miR-587, miR-592, miR-1178-3p, miR-2681-3p, miR-3116, miR-3162-5p, miR-4282 and miR-4472. Among them, miR-587 was predicted to target the most hub genes (n=9). Besides, miR-4472 and miR-4422 could respectively regulate 6 and 7 hub genes suggesting these four miRNAs may have great effect on chemotherapy resistance in breast cancer.

Fig. 4
figure 4

The PPI network of top 25 hub genes. a Hub genes targeted by upregulated miRNAs, b Hub genes targeted by downregulated miRNAs

Fig. 5
figure 5

miRNA-hub genes interaction networks. a The network of hub genes and upregulated miRNAs b The network of hub genes and downregulated miRNAs

Prognostic value of hub genes

The prognostic value of 16 unrepeated hub genes was obtained from Kaplan–Meier plotter website. As shown in Fig. 6, 13 out of 16 genes had statistical significance including SMAD3, PTPN11, KRAS. High expression of PIK3R1, MAPK3, KRAS, MAPK8, PRKACA, CDC42, SMAD3, STAT3, PTPN11 and lower expression of PIK3CA, HSP90AA1, PTEN, POLR2E were associated with a better RFS of breast cancer patients (p<0.05).

Fig. 6
figure 6

Kaplan–Meier survival curves of 16 hub genes. a MAPK1, b CTNNB1, c PIK3CA, d PIK3R1, e MAPK3, f KRAS, g APP, h HSP90AA1, i PTEN, j MAPK8, k PRKACA, l CDC42, m POLR2E, n SMAD3, o STAT3, p PTPN11

Discussion

Although significant progress has been achieved in targeted therapy, immunotherapy and endocrine therapy, breast cancer chemotherapy resistance is still the main reason for limiting patients’ survival rates [12]. Yusuf Baran et.al pointed out that there are many reasons for chemotherapy resistance and the main reason that limits the success of chemotherapy is the development of multidrug resistance [13]. Wang et.al showed that changing the expression of miRNAs can effectively re-sensitise cancer cells to cisplatin regimen in certain types of cancers including breast, gastric, ovarian and prostate [14]. Here, we explored the possible biomarkers and drug targets of breast cancer chemotherapy resistance from the perspective of miRNA. In this study, we used the method of bioinformatics to download data from public databases and screened out the disordered miRNAs associated with breast cancer chemotherapy resistance and their corresponding target genes. We found that mir-3927-3p had the largest difference multiple in the upregulated miRNAs, but probably because mir-3927-3p was found late, it has not been studied yet in the aspect of tumor proliferation or drug resistance. Mir-619 had the largest difference multiple in the downregulated miRNAs. Study has shown that the expression of mir-619 decreased in drug resistant head and neck tumors [15]. As a sponge to lncRNA and mRNA, mir-619 can not only promote tumor angiogenesis but also the proliferation and invasion of colorectal cancer [16, 17]. In our study, the genes interacting with miR-214-3p were the most, so we can speculate that miR-214-3p may play a big role in chemotherapy resistance of breast cancer. Study showed that miR-214-3p can enhance cells’ proliferation by directly binding to the 3’UTR of FOXP2 and down-regulating its expression in breast cancer [18]. MiR-214-3p also can regulate the viability, invasion, migration and Epithelial-Mesenchymal Transition (EMT) of TNBC cells by targeting to ST6GAL1 [19]. Besides, overexpression of miR-214-3p in NCCIT cells may lead to cisplatin resistance [20]. Therefore, what roles these miRNAs play in chemotherapy resistance is worth studying in depth.

Subsequently, we performed functional enrichment analysis and pathway analysis of target genes of dysregulated miRNAs. Consistent with the predictions, we found that these genes were mainly concentrated in the cell signal transduction, and the signaling pathways were concentrated in the cell proliferation and invasion-related pathways, such as Ras signaling pathway, mTOR signaling pathway and ErbB signaling pathway. In the study of hub genes, we also found the similar situation. Most of the identified hub genes were key genes in the signal transduction pathways, such as MAPK1, CTNNB1 (encoding β-catenin), KRAS, PTEN, SMAD3, BRCA1, etc. They mainly focus on Ras/MAPK pathway, PI3K / AKT/mTOR pathway, Wnt / β-catenin/mTOR pathway and SMAD / TGF- β pathway. Ras/MAPK pathway is a classical pathway in tumor. About 15% TNBC patients were Ras/MAPK altered at the genomic level and Ras/MAPK activity can suppress expression of MHC-I and MHC-II, which suggests cancer cells can circumvent antigen presentation pathways by activating the MAPK pathway [21]. It is believed that Ras/MAPK pathway is involved in chemotherapy resistance in breast cancer. Steelman et.al pointed out that PI3K/AKT/mTOR and Ras/ MAPK signaling pathways can regulate HER2 expression, while HER2 overexpression can also activate both the PI3K/Akt/mTOR and Ras/ MAPK pathways [22]. New evidence showed that activation of PI3K/AKT/mTOR pathway can partly cause the drug resistance in HER2-positive breast cancer with brain metastasis [23]. In addition, TGF-β/SMAD3 pathway also plays an important role in the induction and maintenance of resistance to anti-HER2 drugs and inhibiting SMAD3 can attenuate resistance to anti-HER2 drugs in HER2-positive breast cancer cells [24]. All these studies indicate that our research is credible and the interactions between miRNAs and genes may have a great effect on the chemotherapy resistance in breast cancer.

In summary, our study identified several miRNAs and hub genes involved in chemotherapy resistance in breast cancer. According to the functional analysis and survival analysis, these genes and miRNAs may serve as potential biomarkers or drug targets contributing to improve the survival rates of breast cancer patients with chemotherapy resistance. In the next plan, we will continue to explore the underlying mechanisms of breast cancer chemotherapy resistance in more depth.