Introduction

Idiopathic pulmonary fibrosis (IPF) is a chronic, progressive, and heterogeneous lung disease of unknown cause [14]. The main pathologic features of IPF is paraseptal fibrosis and honeycombing in the subpleural area, and mild inflammation with patchy lymphoplasmacytic interstitial infiltrate [1]. Recent studies about the pathogenesis of IPF have indicated that the fibrotic response is driven by abnormally activated alveolar epithelial cells [1, 2]. During chronic inflammation in the interstitium, epithelial cells, fibroblasts, lymphocytes, or macrophages interact with each other and release metabolic products that diffuse into the alveoli from the interstitium. These cells also produce many mediators that induce the formation of fibroblast foci, attraction of circulating fibrocytes, and stimulation of the epithelial to mesenchymal transition. Therefore, pathological process in IPF consists of complex interactions between epithelial and mesenchymal cells at the interstitium in the lung.

More than a thousand different volatile organic compounds (VOCs) have been detected at low concentrations in the normal human breath, with wide inter-individual variations [5]. Exhaled breath contains some endogenous molecules produced during metabolic processes in epithelial cells, mesenchymal cells, or blood-borne substances [5]. VOCs in exhaled breath have been reported as biomarkers for discriminating several respiratory diseases form healthy subjects [620]. However, few studies have examined the VOCs in IPF patients. As far as we known, there was one report that exhaled ethane was elevated in patients with interstitial lung disease and was correlated with clinical outcome [21].

Recently, the multi-capillary column (MCC), which is a variation of a capillary gas chromatography, has been used in combination with ion mobility spectrometer (IMS) [6, 10, 11, 19, 2225]. MCC/IMS provides quick separation and highly selective analyses of VOCs from exhaled breath samples. The general operation of ion mobility spectrometry was reviewed recently by Cumeras et al. [26, 27].

The aim of this study was to find the characteristic VOCs in the exhaled breath of IPF patients for discrimination from healthy subjects using MCC/IMS, and to examine the relationship between the VOCs and the clinical parameters, including pulmonary function tests (PFT), blood gas analysis (BGA) and serum biomarkers of IPF.

Methods

Study Design and Study Subjects

The study had a cross-sectional case-control design. We recruited patients with interstitial pneumonia from Sapporo Medical University Hospital between December 2012 and May 2013, and radiologic findings including high-resolution computed tomography (HRCT) and clinical information from medical record were evaluated retrospectively. In total, 81 patients were identified: 25 had non-specific interstitial pneumonia (NSIP), 3 had desquamative interstitial pneumonia and 11 had collagen vascular disease-associated interstitial pneumonia (CVD-IP), two had other interstitial pneumonia; thus, 40 IPF patients were recruited into our study.

The diagnosis of IPF patients was made based on the ATS/ERS/JRS/ALAT statement [4]. In all patients, a medical history was obtained and physical examination, HRCT, PFT, BGA and serum biomarker measurements were performed. The diagnosis was performed by a multidisciplinary clinical-radiological-pathological review of the patient data. Patients with inflammatory diseases or malignancies were excluded. The control subjects were healthy volunteers recruited from our medical staff.

All patients and controls underwent VOC measurement in the exhaled breath. Informed written consent was obtained from all patients and the controls. This study was approved by the Institutional Review Board of Sapporo Medical University.

MCC/IMS

We used an ion mobility analyser for breath analysis. This device was composed of a breath-sampling unit (SpiroScout, Ganshorn Medizin Electronic, Niederlauer, Germany) and MCC/IMS (BioScout, B&S Analytik, Dortmund, Germany).

The exhaled breath from the subjects was collected through a mouthpiece connected to the SpiroScout, feeding the exhaled breath to the BioScout. At the end of the collection of exhaled breath, 10 mL of gas in the sample loop was given to the MCC (type Ov-5, Multichrom Ltd, Novosibirsk, Russia). In the MCC, the exhaled breath gas was sent through 1000 parallel capillaries, each with an inner diameter of 40 μm and a film thickness of 200 nm. After a 95 MBq 63Ni β-radiation source was applied to ionize the carrier gas, sample gas was fed into the IMS by the carrier gas (compressed air with grade 1, Taiyo Nippon Sanso Co, Japan).

The carrier gas was ionized by the 63Ni β-radiation source and ion–molecule reactions formed ionized molecules of the analytes through different types of collisions, including charge transfer reactions (chemical ionization, e.g. proton or electron transfer). The drift gas flowed from the Faraday-plate to the ionization region to prevent neutral substances from entering the drift region. Ionized analytes were detected on the Faraday plate at the end of the drift region of the IMS by directly measuring the charge. The drift times of the ions formed from the analytes were recorded.

Using an MCC as a pre-separation unit, IMS chromatograms were represented as a three-dimensional matrix as described in previous studies [19]. The intensity is related to the concentration of the analyte as well as other parameters, including drift time and retention time. Therefore, the detected metabolites are represented as a list of peaks described by the drift time (ms) and the retention time (s) according to their local maxima in the data matrix. Retention time and drift time were plotted on the x- and y-axes, respectively. The intensity of charge (V) measured at the Faraday plate was plotted on the z-axis as a measure of the analyte concentration within the breath sample. Each analyte was calibrated.

Measurement of VOCs

We measured VOCs in the exhaled breath of IPF patients and controls using MCC/IMS. The IPF patients and controls had fasted for at least 6 h before examination and stopped consuming alcohol on the previous day. Current smokers had not smoked for at least 24 h before examination, while IPF patients continued oral medication. Just before examination, all subjects promptly washed their mouths by gargling with pure water.

IPF patients and controls breathed through a disposable mouthpiece (Ganshorn Medizin Electronic) connected to a tube on the MCC/IMS. Collection of exhaled breath gas took for 30 s during respiration at rest. Tidal breathing of IPF patients and controls were approximately 10–15 times per minute. The MCC/IMS system was used to analyse all VOCs in the exhaled breath within 12 min.

After the breath samples were analysed by MCC/IMS, VOC peaks in the IPF patients and controls were determined using the software Visual Now 3.4 (B&S Analytik). The VOCs were identified by peak comparisons with reference data using the software Visual Now Version 110,801 (B&S Analytik).

PFT

A Chestac 9800 (Chest Co, Tokyo, Japan) was used for the PFT. We examined the following parameters: vital capacity (VC), predicted percentage of vital capacity (%VC), forced vital capacity (FVC), predicted percentage of FVC (%FVC), forced expiratory volume in one second to FVC ratio (FEV1/FVC), residual volume divided by total lung capacity (RV/TLC), and predicted percentage of RV to TLC (%RV/TLC). We measured diffusion capacity (DLco), predicted percentage of diffusion capacity (%DLco), DL divided by the alveolar volume (DLco/VA) and predicted percentage of DLco/VA (%DLco/VA) according to single-breath carbon monoxide uptake.

BGA and Serum Biomarkers

We measured arterial oxygen tension (PaO2), arterial carbon dioxide tension (PaCO2), alveolar-arterial difference of oxygen (AaDO2) by BGA, surfactant protein A (SP-A), surfactant protein D (SP-D), and Krebs von den Lungen-6 (KL-6) by sera in IPF patients.

Statistical Analysis

Differences of VOC peaks between the IPF patients and controls were assessed using the Mann–Whitney U test and the results were confirmed by the permutation test (n = 100). The multiple testing was corrected using the Benjamini-Hchberg method for false discovery rate (FDR). Significance was set at FDR-adjusted p-value less than 0.05.

Cut-off value of each VOC was determined by area under the receiver operating characteristic (ROC) curve. Diagnostic accuracy of significant VOC was expressed in sensitivity, specificity, positive predictive value, negative predictive value, accuracy and odds ratio. The bootstrap method was used to obtain 95% confidence intervals (CI) of area under the ROC curve. Correlations between clinical parameters and signal intensities of the significant VOC peaks were compared using the Spearman’s signed rank test. The effect of smoking on the results was examined in a logistic regression model or a linear regression model. In the logistic regression model and linear regression model, we used standardized values of VOCs which were transformed using the formula of z = (x − μ)/σ because the raw data of VOCs were extremely small.

The differences were considered statistically significant when the p-value was less than 0.05. Analyses were completed using IBM SPSS statistics version 22 (SPSS Inc., Chicago, IL, USA). In addition, AUC with 95% CI by bootstrap method and p-value by permutation tests were calculated using R 3.0.2 with ‘pROC’ package.

Results

VOC Peaks in IPF Patients

Forty IPF patients and 55 control subjects underwent measurement of VOC in the exhaled breath (Table 1). A total of 85 VOC peaks were detected and numbered in both IPF patients and the controls (Supplementary Fig. 1). In a comparison of the signal intensities of VOC peaks in both groups, significant differences were found in 5 VOCs: peaks 2, 5, 10, 18, and 67. According to the database of known VOCs measured by MCC/IMS, peaks 2, 5, 10, and 18 were p-cymene (IUPAC name: 1-methyl-4-(1-methylethyl)benzene), acetoin (3-hydroxy-2-butanone), isoprene (2-methyl-1,3-butadiene), and ethylbenzene, respectively (Table 2). However, the peak 67 compound was not identified in the database.

Table 1 Study population
Table 2 Five significant VOC peaks and the equivalent compounds

As shown in Fig. 1, the signal intensity of p-cymene was significantly lower in IPF patients than in controls (p < 0.001; Mann–Whitney U test). In contrast, the signal intensities of acetoin, isoprene, ethylbenzene, and the peak 67 compound were significantly higher in IPF patients than in controls (p < 0.001 for all compounds).

Fig. 1
figure 1

Comparison between patients with IPF and control subjects in significant 5 VOC peaks including p-cymene, acetoin, isoprene, ethylbenzene and an unknown VOC. Box plots show the signal intensity of VOC peaks in the ion mobility spectrometry (MCC/IMS). The box represents the 25th and 75th percentiles, the whiskers represent the extreme value, the lines in the box represent the median and the circle represents the outlier

To identify the optimal cut-off value of each significant VOCs for IPF patients, ROC analysis were performed using threshold value. As a result, each significant VOC cut-off value was provided (Supplementary Fig. 2). Using the value, we classified each significant VOC into four groups; true positive, true negative, false positive and false negative groups. P-cymene showed a specificity of 0.5 and a high specificity of 0.909 with an area under the curve of 0.803. Isoprene showed a high sensitivity of 0.825 and a specificity of 0.836 with an area under the curve of 0.810.

As our study subjects include current and former smokers, we evaluated influence of smoking factor using the multivariate analysis. A logistic regression model demonstrated that these results were not influenced by smoking factor and 5 VOCs had a statistically significant impact on discrimination of IPF from controls (Supplementary Table 1).

Correlation Between VOC Peaks and Clinical Parameters

We compared the signal intensities of 5 VOCs with clinical parameters in IPF patients. For p-cymene, acetoin, and ethylbenzene, we found significant correlations between the VOC peaks and several clinical parameters including PFT, BGA, and KL-6 (Table 3). In p-cymene, we found negative correlations with VC (r = −0.393, p = 0.013), %VC (r = −0.569, p < 0.001), FVC (r = −0.440, p = 0.004), %FVC (r = −0.539, p < 0.001), DLco (r = −0.394, p = 0.018), and %DLco (r = −0.413, p = 0.008) and a positive correlation with KL-6 (r = 0.432, p = 0.005) (Fig. 2). In acetoin, there was a negative correlation with PaO2 (r = −0.359, p = 0.027) and a positive correlation with AaDO2 (r = 0.369, p = 0.023). In ethylbenzene, there was a negative correlation with %DLCO/VA (r = −0.347, p = 0.038). However, there was no correlation between clinical parameters and both isoprene and the peak 67 compound.

Table 3 Correlations of signal intensities of p-cymene, acetoin and ethylbenzene with clinical parameters of IPF
Fig. 2
figure 2

Correlations of the signal intensities of p-cymene with pulmonary function tests and KL-6 in IPF patients. There are negative correlations with VC, %VC, %FVC, DLco, DLco/VA and %DLco/VA, and a positive correlation with serum levels of KL-6. The unbroken line and the dotted line indicate the regression line and 95% confidence intervals, respectively

We examined the influence of smoking factor on the results by multivariate analysis. A linear regression model showed that the results of p-cymene were not influenced by smoking factor (Supplementary Table 2). However, both acetoin and ethylbenzene had a significant association with smoking factor as a confounder.

Discussion

In this study, we found 5 characteristic VOCs in IPF patients compared with controls by using MCC/IMS. Of the 5 VOCs, 4 were identified as p-cymene, acetoin, isoprene and ethylbenzene. However, peak 67 could not be identified. VOC peaks of p-cymene was significantly decreased, whereas VOC peaks of acetoin, isoprene, ethylbenzene and the peak 67 were significantly increased in IPF patients. Therefore, we thought that p-cymene was a negative biomarker and the others were positive biomarkers of IPF. These VOCs may be useful for discrimination of IPF patients from healthy subjects.

Although the source of the VOCs of IPF is unclear, these compounds are thought to be produced from interstitial inflammation and fibrosis during or after metabolism of epithelial and mesenchymal cells. Isoprene is an endogenous VOC that is a byproduct of cholesterol biosynthesis. Foster et al. [28] demonstrated isoprene in exhaled breath showed a significant increase following a temporally decrease after exposing to ozone which causes oxidative stress. They speculated that reactive oxygen species (ROS) as free radicals damaged cell membranes and epithelial membrane repair mechanisms would increase cholesterol biosynthesis in order to replenish the cholesterol fraction within cellular membrane. Therefore, epithelial membrane repair during persistent inflammation in IPF patients might lead to an increase of cholesterol biosynthesis focally in the lung and isoprene as intermediate would be increased in the breath. Acetoin is a flavoring chemical, which is widely used for food, cigarette, cosmetics, detergent, and so on. Jaiali et al. [29] reported acetoin was a significantly higher VOC in exhaled breath of workers exposed to silica and silicosis patients compared to the healthy subjects. Silicosis is considered to be an oxidative stress disease that can lead to the development of lung fibrosis and lung cancer. Increased oxidative DNA damage has been detected in IPF patients as well as workers with silicosis [30]. Acetoin in exhaled breath might be produced as a result of cellular damage due to ROS. In addition, ethylbenzene has carcinogenic activity and is considered to be a VOC breath biomarker of lung cancer. Experimental exposure to ethylbenzene resulted in an increase of lung tumor in mice [31, 32]. Similar to ethylbenzene, both isoprene [33] and acetoin [33, 34] are interestingly reported as a VOC breath biomarker of lung cancer [34]. These VOCs might be related with complication of lung cancer in IPF. On the other hand, p-cymene, a monoterpene and a major constituent of essential oils of various species of plants, has an antioxidant potential. In mouse model of lipopolysaccharide-induced acute lung injury, intraperitoneal preconditioning with p-cymene resulted in a significant reduction of proinflammatory cytokines, water gain, inflammatory cell infiltration, and lung tissue myeloperoxidase activity [35]. p-Cymene might play a protective effect against progression of IPF and a decrease of p-cymene might promote interstitial inflammation and fibrosis.

Of the 5 VOCs, p-cymene, acetoin, and ethylbenzene were correlated with several clinical parameters of interstitial pneumonia. The signal intensities of p-cymene revealed a negative correlation with both the decrease in spirometric testing including VC, %VC, FVC, and %FVC and decline in diffusion capacity including DLco, %DLco, DLco/VA, and %DLco/VA. These findings indicate the signal intensities of p-cymene might represent the deterioration of pulmonary function. Moreover, the signal intensities of p-cymene revealed a positive correlation with serum levels of KL-6, suggesting a relationship with the extent of alveolitis followed by fibrosis in the lung. P-cymene may be clinically useful for evaluating alveolitis in IPF patients. However, the signal intensities of p-cymene were paradoxically significantly higher in control subjects than in IPF patients. For the reason, we speculated that the production amount or metabolism of p-cymene in IPF patients differed from that in controls. On the other hand, acetoin showed a positive correlation with PaO2 and a negative correlation with AaDO2. Ethylbenzol showed a negative correlation with %DLco. Acetoin and ethylbenzene were thought to be related to oxygenation in the lung. In these results, we examined the influence of smoking factor on p-cymene, acetoin, and ethylbenzene using a linear regression model. The multivariate analysis showed a significant correlation between p-cymene and clinical parameters.

There were several limitations in our study. First, although isoprene is endogenously produced during cholesterol synthesis, there is no exact knowledge about biochemical pathway for the production of endogenous acetoin, ethylbenzene and p-cymene in human at present. Second, serial measurement of VOCs was not performed. The VOC measurement was only conducted once. Variability in VOCs during the course of IPF remains unknown. VOC may vary with the progression of IPF. Third, pathologic confirmation was not performed in our study. IPF lesions are distributed unevenly in the lung, and the degree varies depending on the site [24]. The relationship between pathologic changes and VOCs may be important for speculating the sources of the VOCs. Fourth, age distribution of the controls is different from that of IPF group. We were not able to compare IPF group with age-matched controls. We need further investigation between IPF and age-matched control group in the future. Fifth, MCC/IMS did not identify all VOCs in the exhaled breath. We could not identify the peak 67 compound because there was no matching material in the database. A complete database is expected to be established in the future.

Since the measurement of VOCs using MCC/IMS is noninvasive and simple [22, 23, 25], patients with impaired pulmonary function can be easily measured. The advantages of MCC/IMS are its ability to detect very low concentrations of compounds without pre-concentration and the short time required for analysis. In contrast to other methods, MCC/IMS enables the detection of a large number of VOCs in exhaled breath. Analysis of VOCs in exhaled breath using MCC/IMS may be a clinically useful screening for IPF patients.

Conclusions

We found characteristic 5 VOCs in the exhaled breath of IPF patients. Among them, the VOC peak of p-cymene was related to the clinical parameters. These VOCs may be useful biomarkers of IPF.