Introduction

Snoring is reported to occur either as a symptom of obstructive sleep apnoea or as sound phenomenon occurring independent of obstructive apnoeas. Depending on the site and mechanism of obstructions and vibrations, snoring sounds have specific acoustic characteristics. Whilst palatinal snoring is reported to have a main frequency below 500 Hz, the one found for tongue base snoring is above 500 Hz. Snoring episodes terminating obstructive apnoeas can exceed frequency maxima up to 4000 Hz [18]. Nevertheless, the data vary extremely depending on the method of recording and the parameters analysed. Most data deal with nocturnal snoring sounds, leading to the methodical problem that the analysed snoring sounds are not classified visually. The classification of a certain type of obstruction or vibration is performed by indirect classification on base of poly(somno)graphic information.

A potential visualisation can be obtained by flexible nasal endoscopy during simulated snoring under wake conditions [9]. Nevertheless, simulated snoring sounds have been shown to differ from real nocturnal snoring [10].

Drug-induced sleep endoscopy (DISE) is commonly used for evaluation of the upper airway in patients with sleep-disordered breathing [1116]. During the procedure, the site of vibration and obstruction can be detected visually. Besides that, the different types of obstruction- and vibration-induced snoring sounds can be recorded.

The aim of the study was to evaluate the specific characteristics of different types of snoring sounds, which are induced by DISE in order to create algorithms for acoustic analyses of snoring sounds in the future.

Methods

General procedure

The study was performed in male patients (n = 41) with suspected sleep-disordered breathing after written consent of the patients at a tertiary referral centre. The study was approved by the local ethic committee (2011–55). A polygraphic or polysomnographic recording to verify the sleep-disordered breathing by apnoea-hypopnea-index (AHI) was performed prior to DISE. Age and the body mass index (BMI) were documented.

Drug-induced sleep endoscopy

Drug-induced sleep endoscopy (DISE) was performed in an operation room, and patients were monitored with oxygen saturation, non-invasive blood pressure and ECG by an anaesthesiologist. DISE was performed by bolus-wise application of 10–20 mg Disoprivan 1 % (Propofol-Lipuro 10 mg/ml; B. Braun Melsungen AG, Bonn, Germany). Depth of sedation was monitored by detection of the BiSpectral Index ® (BIS). A BIS value of 40 was aimed at to simulate slow wave sleep (S3 sleep) [1719]. During DISE, a continuous video documentation of the upper airway was achieved by flexible nasal endoscopy. Parallel to the visual recording, an external condenser microphone with a flat frequency response curve from 20 to 20,000 Hz (RODE NT3, Silverwater, Australia) was placed in front of the patients’ mouth in a distance of 30 cm. The microphone was linked to the video recording system (AIDA, Storz, Tuttlingen, Germany).

Visual evaluation

The video sequences were analysed with respect to the pattern of collapse and vibration. One hundred and eight snoring episodes were included in the study. Between one and three episodes were chosen from each patient. The depth of sedation measured by BIS at which the snoring samples were chosen ranged between 70 and 40. The video samples of each snoring episode were classified on the basis of the visual appearance at a mute mode by the first author. Obstruction and vibration patterns were classified as follows:

  1. 1.

    Snoring terminating an obstructive apnoea (the first snoring episode after an apnoeic event independent from the pattern of vibration n = 22)

  2. 2.

    Tonsillar snoring (oropharynx; lateral, n = 21)

  3. 3.

    Velar snoring (velum; anterior–posterior, n = 50)

  4. 4.

    Obstructive velar snoring (velum; anterior–posterior and lateral or circular, n = 15)

One episode of snoring of each type is shown in Fig. 1a–d

Fig. 1
figure 1

a Series of consecutive images of one snoring episode terminating an obstructive apnoea. A complete lateral collapse is visible at the first image. The deep oropharynx expands within the following three images (arrows indicating the direction of movement). The fifth image shows an open airway with a view to the epiglottis. b Series of consecutive images of one snoring episode with a lateral collapse of the tonsils. Hyperplastic tonsils (arrows indicating the direction of movement) are visible in the first two pictures leaving a narrow airway with view to the edge of the epiglottis. During the snoring episode, a complete lateral obstruction occurs leading to vibrations at the level of the tonsils. c Series of consecutive images of one snoring episode with an anterior–posterior vibration of the soft palate. The uvula (asterisk) vibrates at an anterior–posterior direction. No obstruction is detectable. d Series of consecutive images of one snoring episode with an anterior–posterior vibration and a circular collapse at the velum level. Additional to the anterior–posterior vibration, a circular collapse (bent arrows) can be detected at the velum level leading to an obstructive component

The distributions of the depth of sedation and different snoring types were as follows:

BIS 70: n = 20 (tonsillar snoring n = 3, velar snoring n = 17)

BIS 60: n = 39 (post-apnoeic snoring n = 1, tonsillar snoring n = 10, velar snoring n = 22, obstructive velar snoring n = 6)

BIS 50: n = 37 (post-apnoeic snoring n = 11, tonsillar snoring n = 6, velar snoring n = 11, obstructive velar snoring n = 9)

BIS 40: n = 12 (post-apnoeic snoring n = 10, tonsillar snoring n = 2)

Acoustic evaluation

After visual classification, the sound tracks of each snoring sample were transformed into a WAV format (16,000 Hz, 16-Bit, mono signal; Adobe Audition 1.5, Adobe Systems, München, Germany). The sound pressure level was by an external sound level metre (sound level metre 322 Datalog, Voltcraft, Conrad Electronic, Hirschau, Germany). The samples were analysed by psychoacoustic software (dbSONIC, Metravib, Limonest, France) as previously published [20] with regard to the following parameters: sound pressure level ([dB], A-weighted), loudness (sone), sharpness (acum), roughness (cAsper), fluctuation strength (cVacil) and centre frequency (Hz) (mean values for each parameter). The psychoacoustic parameters loudness, sharpness, roughness and fluctuation strength were described by Fastl and Zwicker [21].

Statistic evaluation

The psychoacoustic parameters were analyses by mixed model analyses providing the regression coefficient beta, standard errors, 95 % confidence intervals (95 % CI), p values and estimate mean values. Calculations were controlled for random effects for patients and depth of sedation. Analyses were performed with SAS 9.3 (SAS Institute, Cary, NC, USA).

Results

Patients’ baseline data

The mean age of the patients was 53.4 years (standard deviation (SD) 7.2). The mean BMI was 28.7 (SD 6.3). The mean AHI was 12.4 (SD 8.1; min 1.2; max 30.9).

Level

The mean sound pressure level ([dB], A-weighted) was highest for obstructive velar snoring (mean 79.45; 95 % CI 78.27; 80.64). Post-apnoeic (estimated difference −6.63; 95 % CI −10.81; −2.46), velar (estimated difference −10.29; 95 % CI −13.99; −6.59) and tonsillar snoring (estimated difference −7.40; 95 % CI −11.44; −3.36) did not differ from each other and showed significant lower levels than obstructive velar snoring (Fig. 2a, Tables 1 and 2).

Fig. 2
figure 2

af Analyses of the parameters sound pressure level (a), loudness (b), sharpness (c), roughness (d), fluctuation strength (e) and centre frequency (f). Data are given as raw mean values (δ), median and percentile 25/75 % (box), whisker 5/95, 1 and 99 % (x), and minimum and maximum value (−). Obstructive velar snoring showed a higher sound pressure level than the other snoring patterns (a). Obstructive snoring patterns (post-apnoeic, tonsillar, velar obstructive) reveal a higher loudness than non-obstructive velar snoring (b). The sharpness is elevated as soon as lateral tonsillar obstructions occur (c). An enhanced roughness is present in velar snoring patterns regardless of an obstructive component (d). Post-apnoeic snoring episodes reveal a high level of fluctuation strength than non-apnoeic sounds (e). Velar non-obstructive snoring presented the lowest centre frequency of the different snoring patterns (f)

Table 1 Descriptive analyses of psychoacoustic parameters for different types of snoring. Data are given as mean values, standard deviation (SD) and 95 % confidence intervals
Table 2 Data of the mixed model with controlled random effects for patients and depth of sedation

Loudness

Velar snoring with vibrations with an anterior–posterior pattern revealed a lower loudness (sone) (mean 18.99; 95 % CI 17.88; 20.09, estimated difference to obstructive velar snoring −11.05; 95 % CI −16.30; −5.81) than all other types of snoring. As soon as obstructive patterns are classified (post-apnoeic (estimated difference to obstructive velar snoring − 0.30; 95 % CI −6.12; 5.52), tonsillar (estimated difference to obstructive velar snoring −0.37; 95 % CI −6.20; 5.46), velar obstructive), the loudness raises (Fig. 2b, Tables 1 and 2).

Sharpness

The acoustic characteristics of obstructions and vibrations at tonsillar level revealed a higher sharpness (acum) (mean 1.65; 95 % CI 1.61; 1.69; estimated difference to obstructive velar snoring 0.24; 95 % CI 0.12; 0.36) compared to the other types of snoring sounds (Fig. 2c, Tables 1 and 2).

Roughness

Velar snoring (mean 216.71; 95 % CI 184.77; 248.65; estimated difference to obstructive velar snoring 28.05; 95 % CI −48.45; 104.56) as well as obstructive velar snoring (mean 184.41; 95 % CI 147.61; 221.22) revealed a higher roughness (cAsper) compared to tonsillar and post-apnoeic snoring (Fig. 2d, Tables 1 and 2).

Fluctuation strength

Post-apnoeic snoring showed higher fluctuation strength (cVacil) (mean 71.51; 95 % CI 56.92; 86.11; estimated difference to obstructive velar snoring 38.76; 95 % CI 18.78; 58.74) than the other snoring types (Fig. 2e, Tables 1 and 2).

Centre frequency

Velar snoring revealed a lower centre frequency (Hz) (mean 2673.40; 95 % CI 1957.42; 3389.39; estimated difference to obstructive velar snoring 3592.98; 95 % CI −6461.34; −724.63) than other types of snoring episodes. No differences with respect to the centre frequency were detected between post-apnoeic, tonsillar and obstructive velar snoring (Fig. 2f, Tables 1 and 2).

Control for random effects for patients and depth of sedation

Vibrations of the velum (velar snoring) without obstructive patterns were more frequent at less deeper levels of sedation (BIS 70 and 60). The deeper the sedation was performed (BIS 50 and 40), the more obstructive patterns were detected (post-apnoeic snoring). A potential influence of the individual patient (n = 41) and the depth of sedation (BIS 70–40) were controlled for random effects. The controlled estimate means are given in Table 2. Velar obstructive snoring was taken as reference to which the other snoring patterns were compared. The difference between the estimate means are given as parameter estimate as an estimated difference to the respective parameter of obstructive velar snoring with standard error, p value and 95 % CI (Table 2).

Discussion

Although snoring is a main symptom in obstructive sleep apnoea, the acoustic characteristics of snoring sounds remain unanalysed by routine poly(somno)graphy. In recent years, various publications differentiate specific snoring types and anatomic structures that may create specific snoring sounds. Tongue base snoring is reported to have predominant frequency patterns above 500 Hz and palatinal snoring below 500 Hz [3, 4, 1, 2, 58]. Most data are based on the visualisation of vibrating structures during sleep endoscopy. Thereby, a possible differentiation between palatal and non-palatal snoring is frequently reported. Contrary to that, other sites of obstruction can hardly be detected by acoustic analyses [4, 1, 7, 2225].

As recent publications report multi-level obstruction patterns during sleep and even a change of obstructions levels depending on the depth of sedation during sleep endoscopy [26, 27, 13, 16], other structures apart from the velum and tongue base need to get into focus of acoustic investigations. To the opinion of the authors, focusing on only one pattern of vibration (e.g. just palatinal vs. non-palatinal) seems to be too idealised. Therefore, the presented study evaluates several subtypes of vibration and collapse patterns (velar, velar obstructive, tonsillar, post-apnoeic) in order to achieve a more practical view on the different types of snoring.

Moreover, mostly, a single method of acoustic analysis (e.g. centre or peak frequency) is performed depending on the available equipment of software. This results in the inability to distinguish other sites of obstruction or vibration apart from palatal or non-palatal. The acoustic information of a snoring noise cannot be evaluated by simple frequency analyses alone.

A more complex approach of analysis seems to be necessary. The present study provides a standardised procedure of analysis by psychoacoustic parameters as previously described [20, 28, 29, 21]. A review of Pevernagie et al. lists several methods of acoustic evaluation of snoring sounds [8]. These approaches are mostly a single method approach resulting in a limited informative value. In our opinion, multiple parameters provide a wider range for classifying specific subgroups of snoring. Therefore, the use of several psychoacoustic parameters is intended in the presented study. An “acoustic fingerprint” of different subtypes of snoring can be obtained.

The present data enable a differentiation between obstructive and non-obstructive snoring sounds, which might be of clinical relevance. Non-obstructive snoring as it can be observed at the velum caused by an anterior–posterior vibration reveals less loudness and a lower level than snoring with obstructive components. In addition, the velum can be determined as site of vibration or obstruction by an enhanced roughness compared to tonsillar or post-apnoeic snoring episodes.

Contrary to that, an increased sharpness gives evidence for snoring sounds, which are caused by a lateral collapse of the oropharynx at the tonsil level.

Despite the fact that the patterns of vibration in post-apnoeic snoring are not as homogeneous as in the other three types, it can be demonstrated that the fluctuation strength of snoring sounds terminating an apnoeic event is higher than in any other snoring type.

By taking all measured psychoacoustic parameters into account, different types of snoring and patterns of vibration or obstruction can be differentiated. As demonstrated in Table 3, different patterns of distribution of psychoacoustic parameters can be detected. The characterisation of a specific pattern of vibration or obstruction can only be achieved by merging different psychoacoustic parameters. To our knowledge, the present study for the first time provides an acoustic classification in a large number of visually classified patterns of obstruction.

Table 3 Simple algorithmic classification of snoring sounds with different types of vibrations or obstructions

Snoring sounds and patterns of obstruction induced by DISE may vary from natural nocturnal snoring [30, 1]. This fact has to be taken into account when interpreting data of DISE. Up to now, no better method of getting obstructions and vibrations visualised than DISE is available, and nocturnal sleep endoscopy (during natural sleep) remains a goal, which seems hardly achievable in the near future on a routine basis.

DISE remains a valuable procedure for the detection of the site of obstruction. Often, a potential surgical therapy is based on the results of DISE [31, 32]. It needs to be discussed, whether the obtained data of obstruction site and acoustic characteristics of snoring resemble the nocturnal situation. Induced sleep does not resemble natural sleep in terms of brain activity, which is altered by drug application. It has been demonstrated, however, that respiratory parameters are not altered by drug-induced sleep [33]. The principle of DISE is based on this assumption. If the visualised patterns of obstructions and vibrations do not only exist in induced sleep but also in natural sleep, it can be assumed that the corresponding acoustic characteristics exist in both states as well. Apart from that, the present study does not aim to compare snoring sounds during natural and induced sleep. The main goal was to introduce an alternative way of acoustic analysis apart from simple frequency analyses in order to distinguish between different subgroups of acoustic characteristics.

Further research is needed to establish cut-off values for the different types of snoring. A simple algorithm to characterise different snoring samples taking all psychoacoustic parameters into account is suggested in Table 3. On the basis of these findings, a potential classification of vibration and obstruction patterns might lead to a more focused surgical approach in the treatment of snoring and obstructive sleep apnoea. Further research needs to evaluate the therapeutic outcome of specific surgical techniques in specific patterns of obstruction (e.g. surgery of the soft palate in a palatal collapse anterior–posterior vs. lateral or circular).

Conclusion

Different patterns of obstruction and vibration during DISE create specific psychoacoustic parameters. Psychoacoustic parameters enable a differentiation between visually classified patterns of vibration and obstruction in snoring episodes obtained during DISE. The “acoustic fingerprint” obtained might be a step towards an evaluation of nocturnal snoring sounds which can be performed during poly(somno)graphy on a routine basis in the future. The aim should be an algorithm for acoustic analysis which enables the differentiation between different sites of obstruction and vibration as well as apnoeic and non-apnoeic snoring which might contribute to a more individualised treatment of snoring.