Introduction

Oral cancer is the sixth most common cancer in the world and it is one of the major public health problems. Consumption of various tobacco products are associated with higher risk of developing oral cancer compared to persons without any adverse habit. Optimal selection of therapy plays a crucial role in the successful treatment of oral cancer, along with the importance of early diagnosis. The World Health Organization (WHO) has reported a mortality rate of 45% for oral cancer patients across all stages within five years of diagnosis. Notably, individuals with oral squamous cell carcinoma (OSCC) who receive timely treatment exhibit more favourable outcomes, with 80% 5-year survival rates [1]. Enhancing early cancer diagnosis and treatment relies on two key factors: identifying specific biomarkers for cancer detection and developing dependable, user-friendly techniques [2]. Fourier transform infrared (FTIR) spectroscopy emerges as a promising method in this field [3,4,5]. The vibrational spectroscopic techniques mentioned are characterized by their simplicity, reproducibility, non-destructiveness to tissue, and minimal sample preparation requirements, involving small amounts of material ranging from micrograms to nanograms. Furthermore, these techniques offer valuable molecular-level insights, enabling the study of functional groups, bonding types, and molecular conformations. Spectral bands observed in vibrational spectra are unique to each molecule, providing direct information about its biochemical composition. This method has been previously utilized for detecting and monitoring the treatment of breast cancer [6, 7]. Differences have been shown in the absorption spectra of blood serum from lung cancer patients and healthy individuals [8, 9]. There have been reports in the literature on the diagnosis of cancer of the stomach [10], intestines [11, 12], liver [13], prostate [14, 15], thyroid [16], testicles [17], bladder [18], and other organs [19,20,21,22].

Attention is now being focused on saliva as an informative biomaterial for the non-invasive diagnosis and monitoring of the human organism [24, 25]. Saliva is a complex and dynamic biological fluid composed of DNA, RNA, proteins and metabolites. Alterations in the levels of these substances can serve as indicators of systemic diseases or the presence of early-stage oral cancer [9]. The salivary proteome is highly dynamic [10], contains over 3652 proteins and 12,562 peptides, and shares nearly 51% of the proteins and 79% of the plasma peptides [11]. Hence, saliva could be an effective source for screening or monitoring of the malignancy. However, the use of the IR spectroscopy of saliva for oral cancer diagnosis has been limited. In the present work, FTIR spectroscopy of saliva is used to discriminate between OSCC and Healthy control (HC) subjects using support vector machine (SVM).

Material and Method

Study Population

This study was conducted in the Department of Oral Medicine and Radiology, Dental College, RIMS Ranchi. Approval from Internal Research Review Committee and Institutional Ethics Committee were taken before the study protocol was initiated. In this study, a total of 60 patients divided into 2 groups were recruited. Group A included 30 histo-pathologically confirmed OSCC patients and Group B included 30 healthy controls. All patients gave written informed consent to participate in this study.

A detailed demographic data, history of habit of tobacco and alcohol intake and its duration, frequency, quantity were obtained. History of previous malignancies and previous treatments were also obtained along with thorough examination of the head and neck and the soft and hard tissues within the oral cavity for the detection of an OSCC. Adjacent lymphnodes along with cervical lymph nodes for the location, size, mobility, texture, and tenderness were also done by following standard protocol [23].

Saliva Sample Collections

A standardized procedure was followed to collect unstimulated whole saliva samples. The patients were instructed to rinse their mouths with water and refrain from consuming food, drinks, cigarettes, smokeless tobacco, or oral hygiene products for a minimum of one hour before sample collection. The unstimulated whole saliva samples were collected in the morning, specifically between 9 and 10 a.m., while the patient sat comfortably and expectorated saliva into a plastic container as it accumulated in their mouth. Roughly 10 ml of saliva was collected from each patient.

FTIR Spectroscopy Method

FTIR spectroscopy was done for all the saliva samples collected from both Group A and B at Birla Institute of Technology, Ranchi. An IR Prestige-21 (Shimadzu Corp, Japan) spectrometer was used to record IR spectra in the 40–4000 cm-1 range.

The samples were thawed and brought to room temperature. One drop of liquid saliva sample was put on sample table by using micropipette and then spectrum was recorded. The spectrum of air was used as a background before each sample analysis. Background and sample spectra were taken at room temperature (around 21–23 °C), at a spectral resolution of 4 cm-1 and for each measurement 32 scans were performed. All IR spectra reflecting the dependency of infrared absorbance (abs. units) on the wavenumber were obtained in no less than 3 consequent tests with 3–5 repetitions each. The original FTIR spectra were normalized and the baseline was corrected using OPUS software.

Spectral Data Processing

All the raw data were transferred into excel sheet and graph was plotted. The spectrum is recorded on a graph with wavenumber (cm−1) recorded on X axis and transmittance on Y axis. Absorbance is related to Transmittance as follows A = log1/T where A is absorbance and t is transmittance. Absorbance is linearly proportional to concentration [Beer –Lambert Law] and used for quantitative analysis. Graphs obtained by each group were compiled.

Feature Selection and Data Reduction

FTIR data was also processed with the Kruskal Wallis test along with the principal component analysis (PCA) technique for features selection and data reduction. The Kruskal Wallis test is a non-parametric technique used to test whether the samples originate from the same distribution. Kruskal-Wallis test was used to select significant features based on the median value of two samples and to obtain the p-value of each wave number-variable. If the p value is less than the significance level (P < 0.05) for the features it means features with discriminative information are present and those features were selected. PCA is used for statistical analyses of different samples. With PCA the dimensionality of FTIR microscopic imaging datasets is reduced while as much information as possible is retained. The Kruskal Wallis values were selected as features to be used in the SVM classifier.

Support Vector Machine (SVM)

SVM classifier was applied in the classification of disease state from normal subjects using FTIR data. The classifier draws the hyperplane in separating the data points. The total of 60 samples (30 samples of OSCC and 30 samples of healthy control), were divided into training set (80% of data) and test data set (20% of data). The SVM model was implemented using the Matlab version 2020b using Gaussian kernel. A two-class model was trained to classify the normal and cancer data set. The FTIR data (1000 to 2000 cm−1) were analysed using Kruskal Wallis technique along with PCA as features selected and feature reduction technique, respectively. The PCA used atleast 95% variance in selection of the data. The derived values of true positive (TP), true negative (TN), false positive (FP), and false-negative (FN) have also been used to determine sensitivity, specificity, and accuracy.

$$Sensitivity= \frac{TP}{TP+FN}$$
$$Specificity= \frac{TN}{TN+FP}$$
$$Accuracy= \frac{TP+TN}{TP+TN+FP+FN}$$

Results

Demographic Detail

The OSCC patients in this study ranged from 35 years to 82 years with a mean of 54.13 years (SD ± 13.17). The male: female ratio in OSCC group was 3.2:1. The mean age of healthy control group subject was 24.9years (SD±5.6), ranging from 18 to 42 years. In control group, there were 22 males and 8 female (Table 1). Out of 30 OSCC cases, 18 were well differentiated OSCC, 9 were moderately differentiated OSCC and 3 were Poorly differentiated OSCC.

Table 1 Showing age wise demographic distribution of the study population

3.2 FTIR Interpretation

Result of FTIR

The peaks were identified at wave no 1180 cm−1, 1230 cm−1, 1340 cm−1, 1360 cm−1, 1420 cm−1, 1460 cm−1, 1500 cm−1, 1540 cm−1, 1560 cm−1, and 1637 cm−1. It represents the functional group given in Table 2. 1637 cm−1 Amide I band of protein and H–O–H deformation of water is the highest peak identified (Fig. 1, Table 2).

Table 2 Assignments of main FTIR peaks, component identification and biomolecules
Fig. 1
figure 1

Showing Graph obtained by compiling OSCC and healthy control group

Principal Component Analysis

Loading Plot (PCA) demonstrated that the sample 8, 26 and 33 are negatively correlated while others are forming the small angle thus positively correlated (Fig. 2a) and steep scree plot revealed that principal component 1 captured all the information (Fig. 2b). Comparison of principal components to the wave numbers with respect to the Normal and cancerous sample is shown in Fig. 2c

Fig. 2
figure 2

a Loading Plot (PCA). b Scree Plot. c Comparison of principal components to the wave numbers with respect to the Normal and cancerous sample

Support Vector Machine Result

The observed results demonstrated the accuracy of 91.66 % in the classification of Cancer tissues from the normal subjects with sensitivity of 83.33% while specificity and precision of 100.0 % (Table 3). The receiver operating characteristic curve presented the area under the curve of 91% (Fig 3).

Table 3 Support vector machine result
Fig. 3
figure 3

The receiver operating characteristics of the binary classification of cancer and normal subjects

Discussion

The current study used FTIR spectroscopy combined with machine learning techniques to build discrimination models between the two groups OSCC and Healthy control. FTIR interpretation were performed for 60 saliva samples (30 from oral cancer group and 30 from normal individuals).The peaks obtained were at wave no 1180 cm−1, 1230 cm−1, 1340 cm−1, 1360 cm−1, 1420 cm−1, 1460 cm−1, 1500 cm−1, 1540 cm−1, 1560 cm−1, and 1637 cm−1. Although peaks identified are similar in both the groups, the absorbance of oral cancer group at 1637 cm−1 wavenumber was observed to be more than the normal individual group. At 1637cm−1 Amide I band of protein and H–O–H deformation of water is observed. Amides I and II bands are commonly used to investigate the secondary structure of proteins [3]. Example of Secondary Structure of Protein are Prolin and Glycin [4]. Absorbance is linearly proportional to concentration [Beer–Lambert Law] and used for quantitative analysis. Therefore, it can be interpreted that quantity of anticipated proteins at 1637 cm−1 is more in oral cancer group than that of normal individual group.

Wu et al. used fiber-optic ATR spectroscopy on freshly-cut human oral tissues and found that the intensity variation at 1745 cm−1 band among normal and malignant tissue. The relative intensities of the 1745 cm bands were strong in the spectra of normal tissues but weak or absent in malignant tissues and the amide I band at 1646 cm−1 were good markers for distinguishing normal oral tissue from malignant ones. Where as in current study showed more distinguishing peak at 1637 cm−1 between OSCC and HC group in salivary sample [28].

Shaikh et al. studied on saliva based non-invasive screening of Oral Sub mucous Fibrosis which is a potentially malignant disorder using ATR-FTIR spectroscopy. The researchers noted that the transmission of protein predominantly occurs within the 1300 to 1800 cm−1 range. Spectral bands in this region are indicative of the amide II region, likely originating from transmembrane proteins. Moreover, a significant contrast in intensity was observed in the wavenumber range of 900–1200 cm−1. Within this range, the biomolecules primarily exhibit carbohydrate vibrations associated with glucose, fructose, and glycogen. In current study between 900 and 1200 cm−1, only one peak 1180 cm−1 is observed [29].

Ayelet Zlotogorski-Hurvitzetal analyzed the Fourier-transform infrared (FTIR) spectra of exosomes found in saliva from patients with oral squamous cell carcinoma (OSCC) and healthy individuals (HI), and evaluates its potential as a diagnostic tool using computational models. The results revealed consistent differences between the IR spectra of OSCC and HI at specific frequencies: 1072 cm−1 (indicating nucleic acids), 2924 cm−1 and 2854 cm−1 (representing membranous lipids), and 1543 cm−1 (associated with transmembrane proteins). By employing a PCA–LDA discrimination model, the samples were accurately classified with 100% sensitivity, 89% specificity, and 95% overall accuracy. Furthermore, the SVM model exhibited a training accuracy of 100% and a cross-validation accuracy of 89%. Notably, the current study found the IR spectrum at 1540 cm−1 to be particularly close to the identified characteristics [30].

Banerjee et al. In their study, utilized FTIR spectroscopy to differentiate between oral leukoplakia and histological tissues of oral squamous cell carcinoma (OSCC) using linear and quadratic support vector machine (SVM) with tenfold cross-validation. Through the Feature Forward Selection method, they identified six spectral features (at 1782, 1713, 1665, 1545, 1409, and 1161 cm−1) that enabled classification between leukoplakia and OSCC. The classification achieved a sensitivity of 81.3%, specificity of 95.7%, and an overall accuracy of 89.7%. However, in the current study, spectral features at 1540, 1420, and 1180 cm−1 were found to be different from the identified peaks but closer in proximity [31].

Rai et al. [32] used FTIR spectroscopy together with chemometric techniques to differentiate the serum metabolic signatures of OSMF patients from healthy controls.19 major peaks (p b 0.001) were observed at 1313, 1544, 1650, 1035, 1169, 1025, 1045, 1151, 1122, 1078, 1725, 1243, 1453, 1398 1020, 1100, 1039 and 1117 cm−1. By employing multivariate statistical techniques such as PCA, HCA, and PLS-DA, the study successfully achieved a remarkable distinction between spectra of oral submucous fibrosis (OSF) and normal samples. This differentiation was accomplished using nineteen significant wavenumbers (p ≤ 0.001) ranging from 1725 to 1020 cm−1, which correspond to alterations in lipids, proteins, carbohydrates, and nucleic acids. These findings strongly suggest that the combination of FTIR spectroscopy and chemometric analysis holds significant potential in enabling rapid and accurate preoperative screening and diagnosis of OSF [31]. Nearer to these findings in current study 10 peaks between 1180 and 1637 cm−1 in saliva sample of OSCC and HC patients. Summarizing all the data obtained from different sources we found some of the FTIR peaks were common in tissue, serum as well as saliva of OSCC or Oral potentially malignant disorders, which can represent the distinguishing biomolecules among both the groups. Hence we can infer that saliva which is the safest biological sample may be used as diagnostic sample in place of serum and tissue for FTIR spectroscopy in diagnosis of OSCC.

Support Vector Machine (SVM) is a classification and regression prediction tool that uses machine learning theory to maximize predictive accuracy while automatically avoiding over-fit to the data. In current study SVM classifier was applied in the classification of disease state from normal subjects using FTIR data. PCA was used assessing the dimensionality of FTIR microscopic imaging datasets. The observed results demonstrated the accuracy of 91.66% in the classification of Cancer tissues from the normal subjects with sensitivity of 83.33% while specificity and precision of 100.0%.

Banerjee et al. achieved a classification between leukoplakia and OSCC with 81.3% sensitivity, 95.7% specificity, and 89.7% overall accuracy [31]. The SVM showed a training accuracy of 100% and a cross-validation accuracy of 89% between OSCC and Healthy Control (Ayelet Zlotogorski-Hurvitzetal 2019) [30]. Although the current study contained a rather low number of subjects, the discrimination function model was able to differentiate OSCC from HI salivary sample with accuracy of 91.66%, sensitivity of 83.33% while specificity and precision of 100.0%.

Conclusion

From the research that has been carried out it possible to conclude that salivary samples can be successfully used for ATR-FTIR spectroscopy for distinguishing between healthy individuals and oral cancer patients. We highlighted spectroscopic biomarkers associated with oral cancer development, in particular bands attributed to different proteins and polysaccharide. It has been demonstrated that there are significant changes in secondary structure of proteins upon oral cancer development. These data also highlight the potential of ATR-FTIR platforms coupled with machine learning as a sustainable, reagent-free, non-invasive, and highly sensitive tool for screening and monitoring oral cancer patients. For a comprehensive evaluation, it is essential to have a sufficient sample size in order to ensure accuracy and reliability.