1 Introduction

Hemodialysis is an essential treatment for removing waste products and excess fluid from the blood of end-stage renal disease (ESRD) patients [37]. The permanent vascular access connects an artery and a vein to provide sufficient blood flow to access dialysis. In performing hemodialysis, permanent vascular access is generally obtained using arteriovenous (AV) fistulas, AV grafts or venous catheters [16]. However, following prolonged usage, irreversible thrombosis caused by stenosis hinders the use of vascular access [12]. Accordingly, reliable surveillance methods for detecting vascular access stenosis at an early stage of development are essential in facilitating remedial treatment such as percutaneous transluminal angioplasty (PTA) [2]. PTA is an invasive procedure to dilate the lumen of the stenotic site in the vascular access [34].

The literature contains various proposals for detecting hemodynamically significant vascular access stenosis [3, 13, 27, 29, 40]. For example, the noninvasive imaging techniques, namely magnetic resonance (MR) angiography, are currently used for detecting stenosis in the cardiovascular system [3]. However, while such methods provide an accurate indication of the degree of vascular access stenosis, they require the use of expensive apparatus and involve certain surgical risks in using catheterization and contrast [13]. Accordingly, various authors have proposed noninvasive methods for detecting the lumen narrowing in the vascular access region by means of ultrasound imaging techniques [27, 29]. Currently, ultrasonic detection method has become a standard procedure to evaluate the vascular stenosis in the hospital [40]. However, the methods proposed in [3, 13, 27, 29, 40] are highly skill dependent and are impractical for initial screening in hemodialysis centers.

Phonoangiography is a noninvasive and inexpensive procedure for evaluating the sound pattern of bruit produced by the local blood flow through the vascular system [24, 28]. When blood flows through a constricted blood vessel, the resulting flow jet induces a turbulent flow and a pressure fluctuation in the blood vessel wall [31]. The resulting acoustic noise emitted from the vessel is defined as a murmur or bruit. Knox et al. [21] used a spectral bruit analysis (SBA) method to evaluate the severity of the carotid artery stenosis. Lees [23] founded that the peak systolic portion of the bruit processed by fast Fourier transform (FFT) is directly related to the residual lumen diameter of the stenotic internal carotid artery. Mansy et al. [25] evaluated the vascular patency of hemodialysis access by measuring the spectral change of the SBA. However, the disadvantage of SBA is their narrow sharp peaks induced from FFT calculation, which is hard to implement the spectral peak analysis in programming. Chen et al. [8, 9] proposed a method based on the Burg autoregressive (AR) model combined with fractional-order chaos system and fuzzy petri nets (FPNs) to evaluate the relationship between the power spectral density (PSD) and the vascular access degree of stenosis (DOS). Notably, the methods proposed in [8, 9, 21, 23, 25] are all based on a spectral analysis of the phonoangiography signal. In practice, phonoangiography is recorded with the mixture of ambient environment noise and friction noise generated by the movement of sensor head. The unwanted noises sometimes have similar spectral information as bruit caused by the turbulent blood flow in the narrow blood vessel. Only using spectral information is hard to directly distinguish the dynamic characteristics of bruit directly from noisy phonoangiography. However, the clinicians can recognize the stenosis sound pattern in noisy environments. The auditory perception mechanism is based on the basilar membrane mechanical response induced by sound and more sensitive to the transient response than steady-state signal. Thus, intuitively, the auditory perception models are considered when performing bruit analysis approach. In our proposal, the auditory spectrogram and auditory spectro-temporal features are used to overcome the limits in traditional time–frequency representation (TFR) methods.

Auscultation is a noninvasive procedure to initially evaluate the vascular access stenotic signs in hemodialysis patients. However, the outcome of auscultation is both subjective and skill dependent. Consequently, great interest exists in visualizing the time–frequency pattern of the auscultation signal so as to obtain a more objective and reliable prediction of the stenosis degree. Time–frequency representation (TFR) methods are widely used to analyze biomedical signals such as phonocardiograms [6]. Among the various TFR methods, the short-time Fourier transform (STFT) method uses a simple FFT operation to analyze the time signal. However, the fixed resolution of STFT is a compromise for the analysis of non-stationary signals [7]. In the wavelet transform (WT) method, the WT overcomes the fixed resolution limitation of STFT to get enormous spectral-temporal information and is suitable for analyzing the non-stationary biomedical signal [14]. However, the performance of WT is critically dependent on the choice of basis kernel function, e.g., Morlet, Daubechies [10].

In a previous study, Sung et al. [32] presented a computer-assisted auscultation approach for detecting abnormal heart sounds due to congenital heart disease. The present study proposes a similar system integrated with auscultation technique to evaluate vascular access stenosis. Figure 1 presents a functional block diagram of the proposed method. As shown, the phonoangiography signal is obtained by the ESRD patient. The signal is then transformed into an auditory spectrogram in the time–frequency domain using a process mimicking that of the human ear. Two spectro-temporal features of the spectrogram, namely the auditory spectrum flux (ASF) and the auditory spectral centroid (ASC), are then extracted and processed using a multivariate Gaussian distribution (MGD) model. Finally, the presence of stenosis is detected by examining the distributions of the two features in the MGD model.

Fig. 1
figure 1

Functional block diagram of proposed auditory auscultation system for hemodialysis vascular stenosis detection

2 Materials and methods

2.1 Subjects and experimental procedure

Sixteen hemodialysis patients (four male and 12 female; mean age 60.5 ± 10.0 years) were invited to participate in the study. All of the participants had straight type AV grafts over the upper arm region. The institutional review board (IRB) of National Cheng Kung University Hospital approved both the data collection process for the 16 patients and the follow-up tracking program, under the IRB protocol No. ER-99-186. For each patient, phonoangiography signals were recorded continuously over a minimum period of 20 s to ensure getting ten pulse cycles segmentation for further analysis. The recording process will be repeated until the extracted phonoangiography signals are stable without surrounding noise interference. Moreover, the bruits were acquired at both the arterial anastomosis site (A-site) and the venous anastomosis site (V-site) before and after undergoing PTA treatment. The bruits were recorded using an electronic stethoscope (3 M LITTMANN® 4100 series) with a sampling rate of 4,000 Hz and a resolution of 16 bits. The phonoangiography was then converted into an auditory spectrogram, and the spectro-temporal features extracted and processed, using a self-written program implemented in Matlab®.

2.2 Signal denoising

The sound signals acquired by an acoustic transducer generally contain unexpected interference as a result of environmental noise and motion artifacts. Therefore, the denoising operation to improve robust signal acquisition is required. Lee and Zhang [22] proposed a method for reducing the effects of motion artifacts in photoplethysmographic recordings using a wavelet threshold approach. However, the optimal value of the threshold was set in accordance with a trial-and-error approach and was not suitable for all of the considered subjects. As described in Sect. 1, the blood flow through a stenotic region has the form of a pulsatile flow. Thus, it is reasonable to assume that the phonoangiography signal is effectively a modulation signal. Accordingly, in the present study, the phonoangiography signals are denoised by removing the high-frequency noise interference components using the demodulation technique, as shown in Fig. 2.

Fig. 2
figure 2

Procedure of signal denoising method based on demodulation technique

In developing the demodulation scheme, it is assumed that a discrete input signal x[n] can be expressed as the sum of the clean signal s[n] and the additive noise e[n], i.e.,

$$x[n] = s[n] + e[n].$$
(1)

where the noise e[n] may result from a combination of ambient noisy sound, instrument noise, frictional noise of sensor head and physiological acoustic noise from body.

The input signal x[n] can then be normalized in the preprocessing stage as follows:

$$x_{N} [n] = x[n]/\hbox{max} \left| {x[n]} \right|,$$
(2)

where x N [n] denoted the normalized bruits signal.

In mathematics, an analytic signal can be represented as the sum of a real part and an imaginary part [18]. It can also be specified as exponential form or joint both compositions of amplitude modulation (AM) and frequency modulation (FM), as follows:

$$\begin{aligned} x_{a} [n] &= x_{N} [n] + j\text{H}\left\{ {x_{N} [n]} \right\} \\ &= {\text{AM}}[n] \cdot \exp \left\{ {j\phi_{m} [n]} \right\}, \\ \end{aligned}$$
(3)

where H{·} is the Hilbert transform; AM[n] = ∥x a [n]∥ denotes an instantaneous amplitude or AM envelope; ϕ m [n] is instantaneous phase of x a [n]; note that the derivative of ϕ m [n] is instantaneous frequency (IF).

Hilbert transform is one of the demodulation techniques to estimate the envelope of AM and the IF of the signal. However, the Hilbert envelope is sensitive to high-frequency modulation interference [11]. Thus, in the proposed system, the envelope of the phonoangiography signal is further estimated by means of autoregressive modeling technique, namely frequency domain linear prediction (FDLP) [35].

The N-dimensional real vector which represents a finite-duration discrete-time AM envelope AM[n] is processed by forward discrete cosine transform (DCT). The DCT property of AM[n] can be calculated by approximating the discrete Fourier transform (DFT) of the AM envelope. Then, the DCT of AM can be extracted as

$${\text{AM}}_{\text{DCT}} [k] = a[k]\left| {{\text{AM}}_{\text{DFT}} [k]} \right|\cos \left( {\theta [k] - \frac{\pi k}{2N}} \right),\quad k = 0,1, \ldots ,N - 1,$$
(4)

where the |AMDFT[k]| and θ[k] are the magnitude and phase of the zero-padded DFT sequence, respectively, and N is the signal samples of the discrete enhanced phonoangiography signal.

Moreover, the linear predictions of the DCT components calculated by the autocorrelation method are taken as the output of the FDLP process [5]. The AM envelope calculated using the FDLP technique is then passed through a zero-phase low pass filter in order to remove the high-frequency modulation interference and obtain the enhanced AM envelope (as shown in the red solid lines in Fig. 3a, b). Finally, the enhanced AM envelope is multiplied by the original FM signal to reconstruct the enhanced phonoangiography signal, shown in Fig. 2.

Fig. 3
figure 3

Angiogram of the stenosis site in hemodialysis patient with arteriovenous access

2.3 Auditory time frequency representation

In the proposed auditory auscultation method, the bruits are transformed into the time–frequency plane using a cochlear wavelet transform (CWT) module, which mimics the auditory response of the human ear. The CWT module modified the gammachirp function presented by Irino [20] as complex-valued cochlear wavelet function to better characterize the level-dependent mechanical motion of the basilar membrane as follows:

$$\psi_{a,b} \left[ t \right] = \left[ t \right]^{\alpha - 1} \exp \left\{ { - j2\pi \beta \cdot {\text{ERB}}\left[ {f_{\text{center}} } \right] \cdot \left[ t \right]} \right\} \cdot \exp \left\{ {j2\pi f_{\text{center}} \left[ t \right] + j\gamma \ln \left[ t \right] + \varphi } \right\},$$
(5)

where α and β are parameters describing the envelope of the gamma distribution, α = 4, β = 1.019 are used in this paper; f center is the critical center frequency; γ is the chirp term; φ is the initial phase; t is t = [b − n]/a i , n is the analyzed sample index, a i is the scale factor of each frequency bin i; b is the sample location variable, and the term ERB[f center] is the equivalent rectangular bandwidth which varies with the center frequency [19].

In implementing the CWT into mathematical software, the output of the CWT module is modeled as the convolution of the enhanced phonoangiography signal x E [n] with the K analyzed wavelet functions, i.e.,

$${\mathbf{AS}}[k,n] = \sum\limits_{k = 1}^{K} {\frac{1}{{\sqrt {a_{i} } }}\psi_{a,b} [k,n] \times x_{E} [n]} ,$$
(6)

where AS[k, n] is the complex-valued output of the discrete CWT, k is the index of the analyzed wavelet function which can be regarded as frequency bin; K is the total number of frequency bin, this paper adapts K = 64;

As described above, the magnitude of the CWT output, i.e., AS|[k, n]| gives the auditory spectrogram shown in Fig. 1 and represents the characteristics of the bruits in the time–frequency domain.

2.4 Auditory spectro-temporal features

In the system proposed in this study, the detection of vascular access stenosis is performed by means of two spectral-temporal features extracted from the auditory spectrogram, namely the ASF and the ASC.

2.4.1 Auditory spectrum flux

The ASF feature provides an indication of the rate at which the magnitude of the auditory spectrum changes. The ASF feature is typically used to classify the types of audio signals and is defined as the average variation value of the auditory spectrum magnitude between two adjacent time frames [23]. In terms of stenosis detection, the peaks of time-varied ASF curve become sharp means that the severity of DOS increases. The spectrum flux of the auditory spectrogram is calculated by taking the ordinary Euclidean norm of the auditory spectrum magnitude difference with time, i.e.,

$$\text{SF}\left[ n \right] = \frac{1}{K}\sqrt {\sum\limits_{k = 1}^{K} {\left[ {\left| {{\mathbf{AS}}\left[ {k,n} \right]} \right| - \left| {{\mathbf{AS}}\left[ {k,n - 1} \right]} \right|} \right]}^{2} } .$$
(7)

2.4.2 Auditory spectral centroid

In a practical auscultation procedure, pitch variation is taken as an important indicator of the severity of stenosis. The auditory pitch can be defined as the attribute of the auditory sensation in terms of the frequency components of the sound. Moreover, the spectral centroid provides a convenient approach for parameterizing the center of gravity of the magnitude auditory spectrum, which corresponds to the brightness of the sound [36]. Accordingly, in the present study, the ASC feature of the auditory spectrogram is used to model the auditory pitch of the phonoangiography signal. Mathematically, the ASC feature can be modeled as

$$f_{{\text{centroid}}} \left[ n \right] = \frac{{\sum\nolimits_{k = 1}^{K} {\left( {\left| {{\mathbf{AS}}\left[ {k,n} \right]} \right| \cdot f_{{\text{center}}} \left[ k \right]} \right)} }}{{\sum\nolimits_{k = 1}^{K} {\left| {{\mathbf{AS}}\left[ {k,n} \right]} \right|} }}.$$
(8)

The spectral distribution of the bruits varies in accordance with the degree of narrowing of the stenosis region. Therefore, the severity of vascular access stenosis can be evaluated by monitoring the variation of the ASC feature. More specifically, a more intense and higher-pitched sound (i.e., a higher time-variation ASC curve) suggests the possible presence of stenosis, while little or no sound suggests a completely occluded condition.

2.4.3 Multivariate Gaussian distribution

In the present study, the ASF and ASC features described above are redrawn as two-dimensional data distribution in ASC–ASF plane (i.e., x-, y-axis represents ASC and ASF, respectively) to describe the severity of the vascular access stenosis. As shown in Fig. 1, the distribution density of the two-dimensional data is described using a MGD model. The MGD function can be expressed as [33]

$$g\left( {{\mathbf{x}}\left| {\mu_{i} ,\varSigma_{i} } \right.} \right) = \frac{1}{{(2\pi )^{D/2} \left| {\varSigma_{i} } \right|^{1/2} }} \cdot \exp \left\{ { - \frac{1}{2}\left( {{\mathbf{x}} - \mu_{i} } \right)^{\text{T}} \varSigma_{i}^{ - 1} \left( {{\mathbf{x}} - \mu_{i} } \right)} \right\},$$
(9)

where x represents the D-dimensional valued data, D = 2 for both ASC and ASF, i is the mixture weights of the Gaussian density components, μ i is the mean value of MGD and Σ i is the MGD covariance matrix.

The parameters of the MGD model can be estimated using the iterative expectation–maximization (EM) algorithm [39]. Furthermore, the mean values and covariance matrix coefficients of the MGD model can then be used to obtain a qualitative assessment of the presence of vascular access stenosis.

3 Results

In this study, 64 bruits distributed over a total of 640 cardiac cycle signals were analyzed in order to evaluate the relationship between the proposed auditory spectro-features and the severity of vascular access stenosis. As described in Sect. 2.1, the data were obtained from 16 hemodialysis patients with upper arm straight type AV grafts under different DOS conditions. Furthermore, the bruits were obtained at both the A-site and the V-site of each patient in order to evaluate the most suitable placement of the acoustic transducer for vascular access stenosis detection. In accordance with [26], the severity of the obstruction in the VA access was quantified using the DOS metric. The DOS is measured using angiography and defined as the area ratio of the luminal narrowing segment compared to that of the normal vascular segment, i.e.,

$${\text{DOS}} = \left( {1 - \frac{{d_{r}^{2} }}{{D_{u}^{2} }}} \right) \times 100\,\% ,$$
(10)

where d r is the residual lumen diameter at the location of maximum stenosis and D u is the unobstructed lumen diameter, as shown in Fig. 3.

For the 16 participants, the DOS ranged from 8 to 98 % and had a mean value of 64 %. Generally speaking, a DOS of more than 50 % is considered to indicate vascular access stenosis. Moreover, based on clinical experience, a DOS of more than 70 % is taken to indicate a significant vascular access stenosis condition. Utilizing these definitions of the 32 sites considered in the present study (i.e., 16 A-sites and 16 V-sites), 18 sites indicated a significant access stenosis condition, while 14 sites indicated a non-significant access stenosis condition.

Figure 4 shows the phonoangiography waveforms, STFT spectrograms and auditory spectrograms obtained for two cases with DOS values of 90 and 27 %, respectively. A qualitative difference in the phonoangiography waveforms recorded at A-sites of the two patients is clearly observed in Fig. 4a, b, respectively. Specifically, the systolic peaks in the waveform corresponding to a significant DOS (Fig. 4a) are far sharper than those in the waveform corresponding to a normal DOS (Fig. 4b). Figure 4c, d shows the energy distributions of the two waveforms in the time–frequency plane, as calculated by STFT. Meanwhile, Fig. 4e, f shows the corresponding auditory spectrograms of the two waveforms. Note that in Fig. 4c–f, the x-, y- and z-axes represent time, frequency and magnitude, respectively. Comparing the four figures, it is noted that the auditory spectrograms provide a clearer visual representation of the energy distribution in the phonoangiography waveform than the STFT spectrograms. A detailed inspection of Fig. 4e shows that in the case of severe stenosis, the spectral energy is distributed over a broad range of 50–800 Hz and exist high-frequency components, which is induced by the turbulent flow through stenotic region at vascular access. However, for the case of normal stenosis, the energy is distributed over a more limited range of 50–450 Hz. Furthermore, the spectrogram has a flatter envelope profile for each cardiac cycle.

Fig. 4
figure 4

Left side a waveform, c STFT spectrogram and e auditory spectrogram of phonoangiography at the condition, DOS of 90 %, respectively. The right side is b waveform, d STFT spectrogram and f auditory spectrogram of phonoangiography at the condition, DOS of 27 %

Although the auditory spectrograms provide a useful visualization of the phonoangiography waveform in the time–frequency domain, they are not amenable to quantitative analysis. Thus, as described in Sect. 2.4, the auditory spectrograms of the bruits were quantified by means of their ASF and ASC features. Figure 5a, b shows the ASF values of the bruits corresponding to DOS values of 90 and 27 %, respectively. It is seen that both profiles have a similar regular periodic tendency due to the pulsatile nature of the local blood flow. However, the peaks in the flux envelope corresponding to DOS = 90 % are sharper than those in the envelope corresponding to DOS = 27 %. Figure 5c, d shows the ASC curves of the two bruits. It is seen that the ASC curve corresponding to DOS = 90 % exhibits a greater variation range than that corresponding to DOS = 27 %. Moreover, the peaks in the ASC curve (corresponding to high-pitch points) coincide with the cardiac cycle peaks in the case of severe stenosis (i.e., DOS = 90 %). Figure 5e, f shows the 2D features of the waveforms corresponding to DOS = 90 % and DOS = 27 %, respectively. The MGD function mentioned in Sect. 2.4 was used to describe the 2D features of both (as shown in the red solid lines in Fig. 5a, b). It is seen that in the case of severe stenosis, the elliptical circle of MGD is more widely distributed in the domain, whereas in the case of normal stenosis, the 2D features points are more densely packed.

Fig. 5
figure 5

a Auditory spectrum flux, c auditory spectral centroid, e both 2D features at DOS of 90 % and b auditory spectrum flux, d auditory spectral centroid, f both 2D features at DOS of 27 %, respectively

In this study, the stenosis detection performance of the auditory spectro-temporal features was evaluated by means of the receiver operating characteristic (ROC) curve [15]. The ROC curve is widely used to visualize the achievable fraction between sensitivity and specificity for different decision thresholds of the MGD parameters, shown as Fig. 6. The results demonstrated that the MGD covariance matrix of ASC, Σ ASC is a good indicator for stenosis detection. The overall accuracy of each stenosis detector feature set was evaluated by measuring the area under the respective ROC curve (AUC), as listed in Table 1. Table 1 shows the AUCs of all feature sets at A-site is larger than 0.80. However, only the MGD covariance matrix of ASC achieves 0.81 at V-site. The confusion matrix which composes four outcomes, namely true positive (TP), true negative (TN), false positive (FP) and false negative (FN), was used to evaluate the accuracy. The TP group comprised the patients with significant vascular access stenosis; the TN group comprised the patients without significant stenosis. The FP group comprised the normal patients wrongly identified as having significant vascular access stenosis, while the FN group comprised the patients with significant vascular access stenosis but wrongly identified as having normal vascular access stenosis. Table 1 shows the accuracy, sensitivity and specificity of the various features considered in the present study. It is seen that the accuracy of all feature sets at A-site are higher than V-site. Among the features, the MGD covariance matrix of the ASC at A-site and V-site yield better accuracy of 83.87 and 70.97 %, respectively.

Fig. 6
figure 6

ROC curve of auditory spectro-temporal features extracted from bruits at A-site

Table 1 Accuracy of the stenosis detector for different MGD feature sets

4 Discussion

Previous studies have shown that the acoustic characteristics of vascular sound can be used to evaluate occluded blood vessels [17]. Moreover, it has been shown that SBA was used to investigate obstructed arteries [1]. In general, bruits are produced as the blood flows turbulently through the constricted portion of the blood vessel. Since the turbulent flow results in spectral resonance, the detection of VA stenosis can be performed using spectral analysis methods [30]. Spectral analysis methods typically involve the cycle segmentation and performing an FFT to calculate the PSD of the analyzed signal. In previous study [9], the authors proposed a parametric method based on the Burg AR algorithm to overcome spectral leakage and smooth the PSD curve. The results showed that the difference in the Burg AR frequency spectra with and without occluded VA, respectively, provided an indication of VA stenosis. However, the accuracy of the Burg AR spectral analysis method is also dependent on the AR order and the choice of analyzed signal segment.

Various envelope-based extraction methods have been proposed for assisting in the segmentation of signals, including the Hilbert envelope and the Shannon energy envelope [11]. However, their performance is seriously degraded in noisy environments.

In clinical diagnoses, physicians auscultate the acoustic emission sounds from the body based on their intensity, pitch and temporal periodicity. An experienced physician can often recognize a stenotic blood flow condition based solely on the acoustic characteristics of the bruits. However, the diagnosis is intuitively skill dependent and subjective. Thus, more objective methods are required to support robust vascular access stenosis detection. The bruit pattern is too complicated to describe using only one-dimensional features. Thus, in the present study, the proposed auditory spectrogram is used to visualize the energy magnitude of the bruit pattern in the time and frequency domains simultaneously. The bruit induced by the blood flow through the vascular access has a pulsatile characteristic, and thus the change in the flow rate dominates the variation of the bruit pattern over time. As a consequence, the variation rate of the spectral energy in the auditory spectrogram, defined as the ASF, can be used to characterize the flow pattern of the bruits. For example, given a severe stenosis condition, the envelope of the ASF contains sharp peaks, which coincide with the systolic peaks in the cardiac cycles, as shown in Fig. 5a. By contrast, given a mild vascular access stenosis condition, the envelope of the spectrum flux is flatter and is characterized by a series of ripples rather than peaks, as shown in Fig. 5b. Thus, even though the spectrum flux envelope does not provide a precise indication of the flow velocity (as in Doppler ultrasonography, for example), the shape tendency of the flux profile provides a good qualitative indication of the presence (or otherwise) of stenosis.

The variation of the bruit pitch is also an essential characteristic in determining the severity of VA access stenosis. In the present study, the bruit pitch is quantified using the ASC feature, as shown in Fig. 5c, d. In general, the results obtained in this study have shown that a severe stenosis condition results in a higher pitch variation than that produced under moderate (DOS = 27 %) stenosis conditions. However, neither the ASC feature nor the auditory spectral flux feature provides sufficient information to permit the automatic recognition of vascular access stenosis. Thus, in the present study, the two features have been combined into a single ASC–ASF domain. The distribution of the data points in the ASC–ASF domain can describe by MGD model or other data-clustering model (i.e., support vector machine or k-nearest neighbor algorithm) to classify the stenosis and non-stenosis condition. Based on this classification model, the proposed system can be integrated with automatic stenosis detection system to realize the clinical application.

In this study, the bruits were recorded at both the A-site and the V-site of each patient in order to evaluate which of the two positions provided more reliable feature information for stenosis detection. Table 1 shows that the auditory spectro-temporal features extracted from the A-site yield a better performance than those extracted from the V-site. However, previous studies [4, 19] have shown that stenosis most commonly occurs nearly the V-site. It was shown in a previous numerical study that the blood flow upstream of a stenotic region has the form of steady laminar flow, whereas that downstream of the stenotic region has the form of turbulent flow [38]. As a result, the bruit sound patterns downstream of the stenotic region are unstable. However, the characteristic of the blood flow through constricted blood vessel is complicated. The suitable recording position can be further studied by the combination use of the Doppler ultrasonography.

5 Conclusions

Current AV graft stenosis surveillance methods based on static dialysis venous pressure, flow monitoring and duplex ultrasound involve the use of sophisticated and expensive equipment, and are therefore impractical for use in the home. Accordingly, the present study has proposed an auditory auscultation method for detecting vascular access stenosis by means of a simple phonoangiography procedure. The performance of traditional stenosis detection methods based on a spectral analysis of the phonoangiography waveform is limited by the selection of the analyzed window and the frequency resolution. Moreover, the outcome of the detection process is significantly dependent on the skill and experience of the practitioner. Therefore, in the present study, a CWT module which mimics the hearing response of the human ear has been used to represent the phonoangiography signal in the form of an auditory spectrogram in the frequency-time domain. Furthermore, two spectro-temporal features, namely ASF and ASC, have been extracted from the auditory spectrogram and used to characterize the relationship between the bruits and the DOS by means of a MGD model. Although the parameters of the MGD model are unable to provide an exact indication of the DOS, the results have shown that they have a sensitivity of 88.89 % and an accuracy of 83.87 % in predicting the occurrence of significant vascular access stenosis.

In summary, the results obtained in this study suggest that the proposed system provides a suitable basis for the development of a noninvasive stenosis surveillance method capable of use in the home by the hemodialysis patient or an untrained assistant. In future studies, additional auditory spectro-temporal features will be extracted and combined with an automatic recognition system in order to provide a more precise indication of the actual DOS condition.