1 Introduction

Head and neck squamous cell carcinoma (HNSCC) is the sixth most common cancer in the world [13] and annually approximately 900,000 new cases are reported worldwide, including 300,000 deaths [4]. HNSCC is diverse in nature, both biologically and clinically. The primary risk factors for the development of HNSCC include tobacco use, alcohol consumption, human papilloma virus (HPV) infection (for oropharyngeal cancer) and Epstein-Barr virus (EBV) infection (for nasopharyngeal cancer). In the initial stages of the disease a patient may not show any clinical symptoms and, consequently, a significant number of patients present with metastatic disease at the time of diagnosis (regional nodal involvement in 43 % and distant metastasis in 10 %), leading to 5-years survival rates of less than 40 % [5]. The early diagnosis of HNSCC holds promise for an improved prognosis, but this option is currently impeded by the lack of clinically validated biomarkers to reliably detect the disease at an early stage and the paucity of targeted molecular therapeutics [6].

Recently, the use of saliva to detect different pathologies, including cancer, has been shown to hold great promise [610]. Indeed, saliva sampling is rapidly expanding and may provide a non-invasive and cheap alternative to traditional tissue and blood sampling [11]. The ease to conduct saliva-based tests also turns it into an attractive option for large scale population-based screening and repeated sampling to monitor and manage disease progression [12]. Li et al. [13] reported that the salivary transcriptome contains more than 3,000 RNA species, mostly mRNAs, and Park et al. [14] and Patel et al. [15] reported the possibility of using salivary microRNAs (miRNAs) to detect oral cancers. Additionally, Wong et al. [16] emphasized the potential use of these miRNAs as biomarkers for cancer screening programs.

miRNAs are small noncoding RNAs (approximately 22 nucleotides long) that are known to play a crucial role in the post-transcriptional regulation of gene expression by either inhibiting target mRNA translation and/or by degrading mRNA targets [17, 18]. To date, the miRNA registry contains 1,872 human miRNA precursors, giving rise to 2,578 mature miRNAs (MiRBase, Release 20), which are thought to regulate at least 30 % of all human genes. Within the human genome miRNAs constitute 3-5 % of the predicted genes, and they can be located in intergenic, intronic or exonic regions of either protein-coding or non-protein-coding genes [19]. miRNAs are frequently located in cancer-associated genomic regions [20] and have been shown to govern important cellular process such as proliferation, differentiation, apoptosis and cell cycle regulation. As such, miRNAs may play a crucial role in cancer initiation, progression and/or metastasis [21].

From an initial study, using miScript™ miRNA PCR Arrays from Qiagen, we selected a subset of five differentially expressed miRNAs (based on > 2-fold change) to determine their clinical utility. Our aims were (i) to develop a robust method to isolate miRNAs from small volumes of saliva and (ii) to develop a novel salivary biomarker panel for the early detection of HNSCC.

2 Materials and methods

2.1 Participants and samples

Unstimulated saliva samples were collected from HNSCC patients (n = 61, including the 5 samples used in the miRNA PCR arrays) at various stages (II-IV) of the disease, who presented to the Head and Neck Clinic at the Princess Alexandra Hospital, Woolloongabba, Australia. Similarly, saliva samples were collected from healthy individuals (n = 61, including the 5 samples used in the miRNA PCR arrays) with no previous history of any malignancy, and who were non-smokers with a good oral hygiene. All samples were immediately stored in dry ice and transferred to the laboratory for processing. The participants’ demographics are listed in Table 1 and their clinical data are listed in supplementary Table 1. All participants provided written informed consent before sample collection. This study was approved by the University of Queensland Medical Ethical Institutional Board and by the Princess Alexandra Hospital Ethics Review Board.

Table 1 Participant demographics

2.2 Sample characteristics

We collected whole mouth resting saliva in sterile 50 mL Falcon tubes (Sarstedt, Australia) as reported before [7, 22]. Cell-free salivary supernatants allow the detection of miRNAs that are secreted into the saliva, whereas whole mouth salivas allow the detection of both secretorty and non-secretory miRNAs. Park et al. [14] concluded from their work that the miRNA levels in salivary cell-free supernatants are lower than those in whole mouth salivas and, in addition, that more heterogeneous miRNA populations can be found in whole saliva. Moreover, Patel et al. [15] reported that they were able to obtain high resolution microRNA signatures from whole saliva samples.

2.3 miRNA extraction and cDNA synthesis

QIAzol lysis reagent (Qiagen, Valencia, CA, USA) was used as an extraction solvent to isolate total RNA and a NucleoSpin miRNA kit (Macherey-Nagel, Düren, Germany) was used to enrich for miRNAs. When using a NucleoSpin miRNA kit, there are 3 options: (i) to isolate total RNA (miRNAs and large RNAs) in the same fraction, (ii) to isolate miRNAs and large RNAs in two separate fractions or (iii) to only isolate miRNAs. We have used option (iii). In brief, 800 μL QIAzol lysis reagent was added to 200 μL whole saliva. This mixture was briefly vortexed and incubated for 5 min at room temperature (RT). Next, 200 μL chloroform was added, and the mixture was vortexed vigorously and incubated at RT for another 5 min. The samples were then centrifuged at 10,000 × g for 10 min at 4 °C. The upper aqueous layer (800 μL) was transferred into a new microcentrifuge tube, another 200 μL chloroform was added and the mixture was vortexed again. After this, the samples were again centrifuged at 10,000 × g at 4 °C. The upper aqueous layer (800 μL), containing total RNA, was carefully removed and transferred to a new microcentrifuge tube and 200 μL 100 % ethanol was added. This mixture was used for miRNA enrichment using a NucleoSpin miRNA kit. To this end, the solution containing total RNA was passed through a Nucleospin column (blue ring) by centrifugation at 11,000 × g for 30 s at RT. The flow through was collected and 800 μL MX buffer was added. Next, the sample was loaded onto a Nucleospin column (green ring) and centrifuged at 11,000 × g for 30 s at RT. The columns were then washed with 600 μL MW1 buffer and centrifuged at 11,000 × g for 30 s. The wash was repeated with 700 μL MW1 buffer followed by 250 μL MW2 buffer and centrifugation at 11,000 × g for 30 s. The columns were centrifuged at 11,000 × g for another 2 min to remove all the ethanol. Then, the miRNA was eluted in 30 μL RNase free water. The quantity of the isolated miRNA was determined using a Nanodrop ND-1,000 spectrophotometer (Thermo scientific Wilmington DE). In supplementary Table 2 the concentrations of the miRNAs are listed. OD 260:280 ratios ≥ 1.8 were accepted as pure.

Using the isolated miRNAs as template (input 250 ng), cDNA synthesis was carried out via a miScript II RT Kit with Hispec buffer (Qiagen, Valencia, CA, USA) according to the manufacturers’ instructions. In brief, miRNAs were polyadenylated with polymerase A and transcribed into cDNA using oligo-dT primers (in parallel in the same tube). The oligo-dT had a 3′degenerate anchor and a universal tag sequence at the 5′end allowing amplification of mature miRNA through a real-time PCR step. The polyadenylation and universal tag primer ensure that genomic DNA is not amplified. Therefore, DNAase treatment was omitted.

2.4 miRNA PCR array assay

miScript™ miRNA PCR arrays (Qiagen, catalogue number MIHS-3102Z) were used according to the manufactur’s instructions. Briefly, 100 μl saliva samples from five HNSCC patients and from five healthy controls were pooled separately to reduce biological variation. From these two saliva sample pools, miRNA was extracted and converted to cDNA as described above. Next, the cDNA was diluted 1:10 in RNAse/DNAse free water and used as a template for miRNA PCR array amplification. Before that, the PCR array plate was sealed and centrifuged at 1,000 × g for 1 min at RT to remove air bubbles that may interfere with the PCR amplification process. Then, PCR reactions were performed using a thermo cycler (Bio-Rad, Hercules, CA, USA). The reaction mixtures were incubated at 95 °C for 10 min to activate the HotStart DNA Taq polymerase, followed by 40 cycles of 95 °C for 15 s and 60 °C for 60 s, monitored by melt curve analysis. Ct values >35 were excluded from the analyses.

miScript™ miRNA PCR Arrays allow the profiling of 84 miRNAs that were selected based their presumed correlation with the diagnosis, staging, progression and/or prognosis of various cancers. A set of controls present on this array enables data analysis using the ΔΔCt method, the assessment of reverse transcription performance and the assessment of PCR performance. The endogenous normaliser SNORD96A [23, 24] showed a relatively stable expression across control and HNSCC patient samples. The miScript miRNA PCR Array Data Analysis Web Portal (http://pcrdataanalysis.sabiosciences.com/mirna/arrayanalysis.php) was used to analyse the microarray data (Qiagen, Germany).

2.5 Quantitative real-time PCR (RT-qPCR)

To confirm the microarray-based expression data, saliva collected from 21 HNSCC patients and 21 healthy controls were first analysed using RT-qPCR. Next, we expanded the cohorts of HNSCC patients and controls with 35 each to validate the clinical utility of the identified miRNAs. To this end, the cDNAs were diluted 1:10 in RNAse/DNAse free water and used as a template for PCR amplification. The PCRs were run in duplicate using a miScript Syber green PCR master mix (Qiagen, Valencia, CA, USA) and carried out in a thermo cycler (Bio-Rad, Hercules, CA, USA) The reaction mixtures were incubated at 95 °C for 15 min to activate the hot start Taq DNA polymerase, followed by 40 cycles of 94 °C for 15 s, 55 °C for 30s and 70 °C for 30s, monitored by melt curve analysis. According to the Minimum Information for Publication of Quantitative Real-Time PCR Experiments guidelines [25], the threshold value (Ct) was determined for each well and the values were averaged for each gene (see supplementary Table 3). Samples with Ct values > 35 were excluded from the analysis.

2.6 Independent validation using TCGA data

Additionally, The Cancer Genome Atlas (TCGA) miRNAseq data were used to validate our findings. In our study, we used saliva samples for miRNA profiling. There are two ways by which tumour-specific molecules can enter the saliva (i) via direct release or (ii) via the secretion of exsosomes/microvesicles [2629]. From the TCGA data portal (HNSCC cancer) we downloaded Illumina HiSeq miRNAseq data from 334 tumours and 39 normal tissues, and performed PCA expression analysis (see below) for both tumour and normal tissues. We excluded 8 samples (4 from tumours and 4 from normal tissues) that were found to be outliers.

2.7 Statistical analysis

All statistical analyses were performed using GraphPad Prism 5 software version 5.03 (GraphPad Software Inc., USA). Data analyses were performed using the miScript miRNA PCR Array Data Analysis Excel web portal (http://pcrdataanalysis.sabiosciences.com/mirna/arrayanalysis.php), including quantification by the ΔΔCT method [30]. Data normalization was performed using an endogenous control (SNORD96A) to correct for sample variation. miRNA expression levels were compared using the Mann Whitney U- test for non-parametric analysis. Receiver-operating characteristics (ROC) curves were generated for five selected miRNAs to determine their clinical utility as diagnostic biomarkers. For TCGA data analysis, differential expression was evaluated on the reads per million counts (on a log2 scale) using the Wilcoxon rank sum test. The Bonferroni method was used for multiple hypothesis correction.

3 Results

3.1 High miRNA yields from small saliva samples

We have established a robust and reproducible method to isolate high yield miRNAs from both fresh and archival saliva samples based on QIAzol (Qiagen) extraction followed by solid phase enrichment of miRNAs on silica columns. This method allowed us to isolate miRNAs from as little as 200 μL of whole mouth saliva. This method also turned out to be robust for isolating high yield miRNAs from archival saliva samples stored at −80 °C for up to 2 years. Supplementary Table 2 depicts the concentrations and the OD 260/280 measurements for the individual samples. The amounts of miRNAs isolated from 200 μL of saliva ranged from 11.50 ng to 644 ng.

3.2 miRNA expression in saliva from HNSCC patients and healthy controls

After miScript™ miRNA PCR Array based expression analysis using pooled samples from five HNSCC patients and five healthy controls, a subset of five differentially expressed miRNAs (based on > 2-fold change) was selected for confirmation and validation in independent (the first 5 pooled samples are included in these cohorts: see material and methods and supplementary Tables 1, 2, and 3) cohorts using RT-qPCR. These miRNAs were: miR-9, miR-127, miR-134, miR-191 and miR-222. For the confirmatory study we used saliva from 21 HNSCC patients and 21 healthy controls and for the validation study we used saliva from an independent cohort of 35 HNSCC patients and 35 healthy controls. When comparing the microarray and RT-qPCR results side-by-side after normalizing both data sets with the same control (SNORD96A) we found that, with the exception of miR-191, in the confirmatory cohort (n = 21) all other miRNA microarray data were in agreement with the RT-qPCR data (Fig. 1a). The average SNORD96A Ct values for the saliva collected from the healthy controls (Ct = 22.9) and the HNSCC patients (Ct = 22.5) were similar (Mann–Whitney U Test), thus confirming its applicability as a normaliser (see supplementary Table 3). In Fig. 1b the RT-qPCR based fold-changes for the five selected miRNAs are listed. miR-127 (P = 0.07), miR-222 (P = 0.11), miR-191 (P < 0.01) and miR-9 (P = 0.68) were found to be up-regulated in the saliva from HNSCC patients as compared to the healthy controls (fold changes 2.3, 1.92, 8.2 and 3.7, respectively). In contrast, miR-134 was found to be down-regulated in the microarray data set but not in the RT-qPCR data set. In the second independent validation study (n = 35), except for miR-134, the other four miRNAs were consistently up-regulated in saliva from the HNSCC patients as compared to the saliva from the healthy controls (Fig. 2a).

Fig. 1
figure 1

a Relative expression levels of five miRNAs determined by microarray and RT-qPCR technologies, respectively. Microarray and RT-qPCR data sets were normalized to SNORD96A. b Expression levels of miR-222, miR-134, miR-127, miR-9 and miR-191 in saliva collected from HNSCC patients (n = 21) and healthy controls (n = 21) determined by RT-qPCR (confirmatory study)

Fig. 2
figure 2

a Expression levels of miR-222, miR-134, miR-127, miR-9 and miR-191 in saliva collected from HNSCC patients (n = 35) and healthy controls (n = 35) determined by RT-qPCR. Differences were determined using Wilcoxon Statistical test and statistical significance was obtained for miR-191, miR-9 and miR-134 (P < 0.001). b Illumina HiSeq miRNAseq data were downloaded from The Cancer Genome Atlas (TCGA) portal (HNSCC) of 334 tumours and 39 normal tissues. The Y-axis represents reads per million and the X-axis represents five selected miRNAs investigated

3.3 Independent validation using TCGA data

TCGA HNSCC miRNAseq data revealed that four of the five miRNAs: (miR-9, miR-127, miR-191 and miR-222) were differentially expressed (P < 0.01) after multiple hypothesis correction. miR-9, miR-191 and miRNA-222 showed a significant over-expression in tumours derived from HNSCC patients compared to normal tissues collected from the same patients. In contrast to salivary miRNA expression data for miRNA 127 and miRNA 134, both of these miRNAs were higher in normal tissues as compared to tumour samples from the same patients (Fig. 2b).

3.4 Discriminatory power of miR-9, miR-134 and miR-191

Receiver Operator Curves (ROC) were generated to evaluate the discriminatory power of miR-9, miR-134 and miR-191 for their ability to differentiate between control and disease groups. All three miRNAs were found to provide a good discriminative ability with AUC values of 0.85 (P < 0.0001), 0.74 (P < 0.001) and 0.98 (P < 0.0001), respectively. When combining all three miRNAs, the AUC was 0.74 (P < 0.0001). In contrast, miR-127 and miR-222 yielded AUCs values of 0.73 (P < 0.01) and 0.58 (P = 0.24), respectively. Figure 3 shows ROC analyses for these three miRNAs using salivary samples from all 56 HNSCC patients and healthy controls. Figure 4 depicts the signature cluster profiles of five selected salivary miRNAs.

Fig. 3
figure 3

Receiver operator characteristics (ROC) curve analysis of the independent validation study using saliva-derived miR-9 (a), miR-134 (b) and miR-191 (c) for discriminating HNSCC patients from normal subjects (P < 0.001)

Fig. 4
figure 4

miScriptTM microarray signature cluster profiles of five selected miRNAs in saliva from healthy controls (n = 56) and HNSCC patients (n = 56)

We also evaluated whether the miRNA expression profiles can discriminate HNSCC patients based on anatomical sites (Supplementary Table 4). To this end, again ROCs were generated. We found that the fold changes of miR-222 (P < 0.05), miR-127 (P < 0.05) and miR-191 (P < 0.05) could differentiate HNSCC patients belonging to Group 1 and Group 4 (Group 1 = oral cavity and Group 4 = pharynx). The expression of miR-9 could (P < 0.05) differentiate HNSCC patients in Group 2 and Group 3 (Group 2 = oropharynx and Group 3 = larynx) with an AUC of 0.77. Similarly, we found that the expression of miR-191 could (P < 0.05) differentiate HNSCC patients from Group 2 and Group 4 with an AUC of 0.94, and that the expression of miR-127 could (P < 0.01) differentiate HNSCC patients from Group 3 and Group 4 with an AUC of 1.00.

4 Discussion

Currently, the early diagnosis of head and neck squamous cell carcinoma (HNSCC) is severely hampered by the lack of suitable biomarkers. We have developed a protocol to isolate high yields of miRNAs from as little as 200 μL of fresh saliva, as well as from archival samples stored at −80 °C. By using these samples, we have identified and validated a novel salivary biomarker panel, consisting of miR-9, miR-191 and miR-134, to reliably detect HNSCC. Through an independent validation using the TCGA data base, the clinical utility of this panel was further confirmed.

Previously, Zhu et al. [31] reported that miR-9 over-expression may be associated with colorectal carcinoma metastasis by promoting cell motility. It was also noted that, when miR-9 was over-expressed in non-metastasising breast carcinoma cells, it was able to promote lung micro-metastases in a mouse model. Conversely, it was found that down-regulation of miR-9 in highly malignant cells may impair metastatic changes [32]. Furthermore, Hui et al. [33] reported that miR-9 and miR-9* were strongly associated with HPV/p16-status in oropharyngeal carcinomas. This observation is compatible with a report of Kozaki et al. [34] showing a significant up-regulation of miR-9 and miR-9* in oral squamous cell carcinoma (OSCC) cell lines. Khew-Goodall et al. [35] identified miR-9 as a negative regulator of E-cadherin by suppressing the ability to promote carcinoma cell motility and invasiveness.

We observed a down-regulation of miR-134 in the saliva samples from HNSCC patients. In contrast, Lin et al. [36] found that this miRNA was up-regulated in oral cancerous tissues compared to non-cancerous tissues, and that this up-regulation was associated with vascular invasion and tumour size. More recent studies have shown that miR-134 is involved in tumour suppression in prostate and lung cancers, and that its down-regulation is related to drug resistance [37].

Work carried out by Elyakim et al. [38], using a mouse model for hepatocellular carcinoma (HCC), has shown that dioxin (a liver carcinogen) can up-regulate miR-191 expression. Subsequent in vitro inhibition of miR-191 revealed a reduction in cellular proliferation and an increase in apoptosis. Similarly, a significant in vivo reduction in tumour load was observed. In addition, miR-191 was found to exhibit a tissue-specific expression pattern, and to be over-expressed in colon, lung, pancreas, prostate and stomach cancers [39]. Under hypoxic conditions, HNSCC cells showed an up-regulation of miR-191 [40]. Barker et al. [41] sought to establish a method to predict the origin of an unknown primary HNSCC tumour by utilising the expression of miR-191. In their study, miR-191 expression levels were found to be consistent between primary and local metastatic lesions within an individual patient and between different patients within a specific site [41]. They noted an association between primary and regional metastatic nodal sites in base-of-tongue and tonsillar carcinoma patients.

From our preliminary work we conclude that miR-9, miR-191 and miR-134 may serve as novel non-invasive biomarkers in HNSCC. Notably, the expression of salivary miR-9, miR-191 and miR-134 was independently validated using a HNSCC cohort consisting of 334 tumour samples and 39 normal adjacent tissues, curated in the TCGA database.