Introduction

The acoustical analysis of snoring sounds has been in focus of research during the last two decades [1, 6, 812, 17, 24, 2628]. According to several authors, it seems possible to distinguish between different types of snoring like palatal or tongue base snoring. The vast majority of studies perform a frequency analysis of the investigated snoring sounds by fast Fourier transformation (FFT). By means of FFT, the mean and peak frequencies as well as power spectral bands can be obtained. Despite numerous efforts, a sufficient determination of snoring sounds has not been developed yet. Snoring sounds, which obviously sound different to the listener, might reveal a similar power spectrum during frequency analysis. This specific flaw remains still a major problem for the acoustic evaluation of snoring.

Apart from the regular FFT analysis, the determination of psychoacoustic parameters is in use, especially for the investigation of environmental sounds [1416, 2023, 29, 33]. By means of psychoacoustic analysis, the perceptual aspects of noise like annoyance can be determined.

The aim of the present study was to evaluate the application of psychoacoustic parameters for the analysis of snoring sounds.

Materials and methods

Recording of snoring sounds

Three snoring sounds which were subject to investigation were recorded during routine polysomnography in male patients with suspected sleep disordered breathing. The aim of the study was not to investigate the psychoacoustic properties of a large population sample, but to establish a novel technique of investigation by means of psychoacoustic analysis. Therefore, only three snoring samples, which were clearly acoustically different, were elected. Snoring samples were chosen on base of the acoustical appearance and polysomnographic results. Sample 1 (primary snoring, PS) was from a 45-year-old patient with primary snoring without apnoeic events (AHI: 1.2; Sao2 mean: 98.4%; oesophageal pressure (cm H2O): at specific snoring episode, −5.1; mean, −2.3). Sample 2 (upper airway resistance syndrome snoring, UARS) was taken from a 47-year-old patient with the clinical diagnosis of an upper airway resistance syndrome (AHI: 3.7; Sao2 mean: 98.2%; oesophageal pressure (cm H2O): at specific snoring episode, −20.4; mean, −10.3).The authors are aware that the UARS is no longer included in the new ICSD. Nevertheless, the snoring sample was chosen as there is a difference between PS and snoring during an obstructive sleep apnea syndrome in terms of acoustical appearance and pathogenesis. Sample 3 (obstructive sleep apnea snoring, OSAS) was chosen from a 46-year-old patient with an obstructive sleep apnea syndrome (AHI: 43.5; Sao2 mean: 86.4%; oesophageal pressure (cm H2O): at specific snoring episode, −33.7; mean, −20.8). All three snoring samples were original snoring sounds of real patient recordings. Sound recordings were performed by an air coupled dual-diaphragm condenser microphone (C3 Behringer, Willich, Germany) which was placed 30 cm in front of the patients’ mouths. A digital mixer was used for recording the snoring sounds (Phonic Helix Board FW 18 MKII Compact Mixer, Phonic Cooperation, Tampa, FL, USA). Data acquisition was performed by Adobe Audition II (Adobe Systems, München, Germany).

Creation of snoring samples

All the three snoring samples were amplified according to their peak sound pressure levels [Lpeak (dB) SPL (sound pressure level)]. The peaks were amplified to the levels 60, 65, 70, 75, 80 and 85 dB without filtering the specific frequencies. Amplification was performed by the software programme Adobe Audition II.

Frequency analysis

Frequency analysis was performed by Adobe Audition II. The peak intensities of the specific snoring sample were detected (Fig. 1a, b, c) at all different levels of Lpeak (60–85 dB).

Fig. 1
figure 1

Properties of snoring sound samples. a Snoring sample 1 (PS) represents a snoring episode with predominately deep frequency spectra with a peak intensity at 100 Hz. b Snoring sample 2 (UARS) represents a snoring episode with a high frequency spectrum with a peak intensity at 3,500 Hz. c Snoring sample 3 (OSAS) represents a snoring episode with spectra at 500 and 3,500 Hz. Left row recording track, axis time (sec), intensity (dB); right row FFT, axis frequency (Hz), sound pressure (dB)

Psychoacoustic analysis

Psychoacoustic analysis was performed by VIPER V 3.14 (Visual Perception of Audio Signals, CORTEX INSTRUMENTS GmbH). The parameters “loudness”, “roughness” and “sharpness” were calculated of the specific snoring sample under increasing SPLs.

Loudness

The parameter “loudness” describes the hearing perception of a specific sound pressure as the magnitude of an auditory sensation. The unit of loudness N is sone. The loudness level L N of 40 phon (narrowband noise centred at 1 kHz) with a sound pressure level of 40 dB equals 1 sone. A 10-phon gain in the loudness level L N doubles the loudness N. The loudness level and the loudness were analysed in accordance with DIN 45631 and ISO 532 B respectively [4]: the time interval was fixed as 2 ms. The high temporal resolution corresponds to the temporal resolution of the human ear. For each time interval, the sound levels in 20 approximated critical bands were computed using a third octave filter bank. The critical band levels were weighted according to the transfer from the free sound field through the outer- and middle ear. By a power law relationship, the weighted critical band levels were transformed into specific loudness values N′. Loudness N was calculated by integrating the specific loudness N′ [5].

Roughness

Fast modulation (between 20 and 300 Hz) in sounds is perceived as roughness, e.g. rough voice, noise of a flying beetle, electric motor or piston engine. The maximum of the roughness for a 1-kHz tone is reached with modulating frequencies by 70 Hz. The unit of roughness is asper (1 asper = 100 cA), referenced to a 1 kHz tone at 60 dB sound pressure level that is frequency-modulated by a 70-Hz sine wave with a modulation factor of 1 (100%). The instances of roughness were calculated in accordance with a model of Aures [2].

Sharpness

The parameter “sharpness” describes the perception of timbre (tone colour), depending on the spectral envelope of the sound. The sharper a noise sounds, the more the percentage of high frequencies in the sound grows. Perceived sharpness is proportional to spectral centre of gravity. Sharpness is measured in acum referenced to narrowband noise centred at 1 kHz with a sound pressure level of 60 dB.

Statistical analysis

Statistical analyses were performed by SPSS 18 (Chicago, Ill, USA). The Mann–Whitney U test was performed where appropriate. A probability level of 5% was considered as significant. Approximately, 30,000 samples per noise were statistically analysed. In needs to be mentioned that even minimal differences between the specific noise samples or the specific SPLs turn significant with such a high amount of values. Therefore, the obtained statistical data need to be interpreted carefully.

Results

Frequency analysis (FFT) under rising Lpeak

There was no change in the peak intensity of the FFT under rising Lpeak (65–85 dB) in all the three snoring samples.

Psychoacoustic analysis under rising Lpeak

Loudness

Under rising Lpeak (dB), the snoring sample “OSAS” revealed a lower loudness than the snoring samples “PS” and “UARS”. PS revealed a higher loudness, especially under high SPL levels than UARS (Z and p values given in Tables 1, 2). The increase in loudness for PS and UARS was more intense than for OSAS (Fig. 2).

Table 1 Statistical analysis of the psychoacoustic parameters
Table 2 Statistical analysis of the psychoacoustic parameters
Fig. 2
figure 2

Alteration of the psychoacoustic parameter “loudness” under increasing sound pressure levels. All snoring samples show an increase in loudness (y axis) under increasing SPLs (x axis). The loudness of sample OSAS was lower than samples PS and UARS and increased to a lower extent (median values of the specific sample shown; black PS, red UARS, green OSAS)

Sharpness

The sharpness for PS and UARS was higher than for OSAS at all SPLs. The sharpness increased at high SPLs for PS and OSAS, whereas the sharpness of UARS tended to decrease (Z and p values given in Tables 1, 2) (Fig. 3).

Fig. 3
figure 3

Alteration of the psychoacoustic parameter “sharpness” under increasing sound pressure levels. Snoring samples PS and UARS revealed a high level of sharpness (y axis) for all SPLs (x axis), whereas the sharpness of sample OSAS increased under rising SPLs. The sharpness of sample OSAS was lower than sample PS and UARS (median values of the specific sample shown; black PS, red UARS, green OSAS)

Roughness

The roughness of all the three snoring samples increased under increasing SPLs. Sample PS revealed a greater roughness than samples UARS and OSAS (Z and p values given in Tables 1, 2) (Fig. 4).

Fig. 4
figure 4

Alteration of the psychoacoustic parameter “roughness” under increasing sound pressure levels. The roughness (y axis) of all three samples increased under increasing SPLs (x axis). Sample PS revealed a greater roughness than sample UARS and OSAS (median values of the specific sample shown; black PS, red UARS, green OSAS)

Psychoacoustic fingerprint

Merging the psychoacoustic parameters “loudness”, “sharpness” and “roughness”, a three dimensional diagram can be created. The curves of the snoring samples differ from each other and create a psychoacoustic fingerprint of the specific noise (Fig. 5).

Fig. 5
figure 5

Three dimensional plot of the psychoacoustic parameters “loudness”, “sharpness” and “roughness”. A psychoacoustic “fingerprint” of the three different snoring sounds is demonstrated. The SPL is given as outlined number of the respective snoring sound. x axis: sharpness, y axis: loudness, z axis: roughness (black PS, red UARS, green OSAS)

Discussion

The present results demonstrate that the perception of snoring sounds depends on various psychoacoustic parameters. Although the sound pressure levels are equal for the three investigated snoring samples, the psychoacoustic parameters “loudness”, “sharpness” and “roughness” alter under increasing sound pressure levels. These individual changes in psychoacoustic parameters enable the creation of “acoustic fingerprints” of snoring sounds to identify specific types of snoring and to evaluate the degree of annoyance.

The frequency analysis by fast Fourier transformation (FFT) has been described as a method to investigate snoring sounds in the recent two decades [1, 6, 8, 10, 31, 32]. According to several authors, it seems to be possible to distinguish between palatal and tongue based snoring as the two major sources of origin of snoring. In general, the differences base mostly on the dominant frequency spectra of the snoring sounds. Palatal snoring is reported to consist of low frequency spectra (80–300 Hz), whereas tongue based snoring reveals predominately higher frequency spectra (1,000–4,000 Hz) [1, 3, 8, 10, 34]. Despite these efforts, a precise acoustical discrimination between different snoring sounds is still not possible in somnological standard diagnostic procedure. Moreover, the degree of annoyance of specific snoring sounds is not possible to determine the means of FFT.

The use of psychoacoustic parameters is predominately present in environmental acoustic investigations like studying traffic noise, aircraft noise and occupational environments to describe the degree of annoyance of a specific noise [13, 25, 29, 30]. In medical use, psychoacoustic analysis of sounds has been only reported in few cases although the use of psychoacoustic parameters seems to be an adequate tool to classify a specific noise. Nielsen et al. [19] investigated mechanical heart valve noises according to their annoying potential to the listener and found the loudness and sharpness of the valve noises, which are crucial for the degree of annoyance. Haderlein et al. [7] used psychoacoustic parameters for the evaluation of speech quality of tracheoesophageal substitute voices after laryngectomy.

Despite encouraging results in other medical fields in describing the quality a specific noise, the analysis of snoring sounds by psychoacoustic parameters is still a rarely investigated field. A study by Ng and Koh [18] investigated the degree of annoyance of 13 snoring samples estimated by 25 individuals. Psychoacoustic parameters were also used to classify different snoring sounds in combination with the individual estimation of the listening person. The parameters loudness, roughness and annoyance seem to be the best parameters to distinguish between apnoeic and non-apnoeic snoring in the cited study.

It needs to be marked that polysomnography will remain the gold standard for the detection of sleep related breathing disorders. But the use of psychoacoustic analyses might contribute to topo-diagnose the snoring noise and to evaluate the degree of annoyance of a specific noise.

According to the present data, the assumption that the loudness represents a crucial parameter for the annoyance of snoring noises can be confirmed. The loudness and roughness of the low-frequent rhythmic and high-frequent rhythmic snoring samples increase to a bigger extent under increasing SPL level compared to the high-frequent non-rhythmic sample.

By merging the psychoacoustic parameters “loudness”, “roughness” and “sharpness”, it is possible to create an “acoustic fingerprint” of a specific noise. As demonstrated in Fig. 5, the curves for each snoring noise differ in the plotted diagram. Based on those results, psychoacoustic analyses of a larger sample of snoring patients need to be performed to verify the present preliminary data in a heterogeneous group of snoring individuals.

Conclusion

The analysis of snoring sounds by psychoacoustic parameters provides a promising approach to differentiate different types of snoring sounds. Compared to conventional methods like FFT, a more precise description is obtained by calculating “loudness”, “roughness” and “sharpness”, which cumulates in the creation of “psychoacoustic fingerprints” of the snoring sounds. Psychoacoustic analyses could be implemented in routine polysomnography for a sufficient investigation of snoring sounds and could provide an adequate tool to evaluate the annoyance by snoring noises.