Introduction

Embolism is intravascular migration of an insoluble body such as gas bubble, a fat globule, a blood clot, an atheromatous plaque, or a piece of thrombus. Embolus formation inside the body could be attributed due to different physiological, physical, and intervention mechanisms [1]. It can travel to any part of the body, accounting for many serious (and sometimes life-threatening) disorders thus the importance of an automatic classification system.

Transcranial Doppler (TCD) [1] is a non-invasive ultrasound approach employed to assess blood flow velocity in the major basal intracranial arteries on a real time basis. In order to evaluate the ability of TCD to detect and classify intracranial emboli, several experimental studies have been carried since the early 1960s. One of the approaches relies on detecting the appearance of abrupt changes (known as high intensity transient signals) in the TCD waveform with higher amplitudes than the other recorded signal. These abrupt changes are exploited as indicators of the presence of emboli, in several clinical areas: carotid and cardiac surgery [2, 3], cerebral angiography and following prosthetic heart valve insertion [46]. Many reports emphasized the ability of TCD, combined with signal processing methods for example discrete wavelet transform (DWT), to detect microemboli in the brain circulation [710]. Aydin et al. [7]. used a signal-processing algorithm based on DWT to characterize embolic signals (ES), Doppler speckle, or artifacts. In their work, they showed that the detection parameters derived from DWT coefficients are likely to improve the sensitivity of an automated system [7]. Unfortunately, this technology presents today some limitations to determine the embolus composition [11, 12]. Especially, before performing interventional procedures for neuroprotection where the nature of the emboli is a vital diagnostic criteria. This is the case during cardiac surgery and carotid endarterectomy where solid and gaseous embolus are strongly intricate. TCD can detect a high intensity transient signals indicating a presence of emboli but it is very difficult to differentiate gaseous from solid microemboli. Solid emboli involve different therapeutic approaches than gaseous emboli [13] and they are potentially far more damaging.

Indeed, when solid emboli are identified, an antiagregant or anticoagulant treatment will be employed. However, gaseous emboli are considered as a risk indicator and their detection involves to investigate for the cause of emboli.

Recently, a new approach based on the analysis of backscatter radio frequency signals using the nonlinear characteristics of gaseous bubbles and artificial neural network to classify emboli was investigated [13, 14]. A number of researchers have reported that DWT performs better than fast Fourier transform (FFT) for the analysis and the detection of ES [7, 15]. The existence of fast algorithms to implement DWT, makes also the investigation of the feasibility of ES detection systems based on the DWT worthwhile.

We suggest in this experimental study to exploit Radio-Frequency ultrasound signals backscattered by the emboli since they contain additional information on the embolus in comparison to the classically used Doppler signals [16].

First, we employ DWT algorithm based on the Daubechies (db6) to decompose RF signals into different frequency bands and identify which features lead to a better recognition performance. Several features are evaluated from the detail coefficients. It should be noted that the features used in this study are the same used in the work by Aydin et al. [7]. These features are used as inputs to the classification models without using feature selection method. Second, and due to curse of dimensionality, we employ a wrapper feature selection approach based on differential evolution (DE) algorithm. This algorithm, referred to as differential evolution feature selection (DEFS).

Feature selection techniques usually fall into three main categories; filter, wrapper, or embedded methods. Filter approaches perform feature selection using the intrinsic properties of the data independently of the learning algorithm. Wrapper approaches conduct feature selection using a learning algorithm as part of the evaluation function to estimate the relevance of a given set of features, in embedded approaches the search for an optimal set of features is built into the learning algorithm construction [17].

DEFS utilizes the differential evolution float number optimizer in a combinatorial optimization problem like feature selection. The DEFS highly reduces the computational cost while at the same time proves to present a powerful performance [18, 19].

In order to evaluate the performance of the proposed system, we implemented SVM classifier. For nonlinear classification problem with limited number of samples SVMs appear to be advantageous compared to neural networks approaches [2022]. Furthermore, in [23] a detailed comparison of SVM, radial basis neural network, and K-nearest neighbor is performed. The results indicated that SVM approach demonstrated superior performance compared to the other two classifiers, it was concluded that SVM has less sensitivity to the curse of dimensionality [23]. We use hold-out-set cross-validation strategy to fix the parameters of the classifier and to select the most relevant features. While there are other validation techniques, including, but not limited to, bootstrap and leave-one-out, we focus on the more commonly used cross-validation approach in this paper. We randomly divide the dataset, into two subsets training set and test set (hold-out-set). After that, we apply cross-validation technique only on the training set to randomly generate new training and validation sets. The test set is used only for the assessment of the model selected by the cross-validation algorithm. While the validation set is used to tune the parameters of the classifier. The experimental evaluation is performed using hold-out-set to avoid overfitting and assure statistical validity of the results [24].

The main purpose of this experimental study is to evaluate the performances of the proposed technique in the detection and classification of microemboli using Radio-Frequency ultrasound signals backscattered by the emboli instead of Doppler signals.

Discrete wavelet transform of backscatter RF signals

Discrete wavelet transform

The basic idea of the wavelet transform is to represent any arbitrary signal x(t) as a superposition of a set of wavelets or basis functions. These basis functions are obtained from a single prototype wavelet called the mother wavelet by dilation (scaling) and translation (shifts). The wavelet transform of a continuous signal x(t) is defined as [7, 25]:

$$c\left( a,b \right)=\int\limits_{R}{x(t)}\frac{1}{\sqrt{a}}\psi \frac{(t-b)}{a}dt,$$
(1)

where, the indexes c(a,b) are called wavelet coefficients of signal x(t), a is dilation and b is translation, Ψ(t) is the mother wavelet. DWT of a signal is defined with respect to a mother wavelet and maps continuous finite energy signals to a two-dimensional grid of coefficients [7, 25]. The scale a in the discrete wavelet transform case becomes \(a=a_{0}^{m}\), and the translation b becomes \(b=n{{b}_{0}}a_{0}^{m}\) [7, 25]. DWT of a discrete signal with length N is defined as:

$$c(m,n)=\frac{1}{\sqrt{a_{0}^{m}}}\text{ }\sum\limits_{k=0}^{N-1}{\text{ }s\left( k \right)\text{ }\psi \left( \frac{k-n{{b}_{0}}a_{0}^{m}}{a_{0}^{m}} \right)}.$$
(2)

DWT of a discrete signal yields a set of coefficients including all the detailed coefficients and the last approximation coefficients [7, 25].

DWT description of backscatter RF signals

The backscatter RF signal are transformed into different time–frequency scales through the wavelet analysis, DWT uses two functions as high-pass filters and low pass filters. The high-frequency filter generates a detailed version of the backscatter RF signal (D), while the low-frequency filter produces its approximate version (A).

In our study, we decomposed the backscatter RF signal x(t) into five levels, so we can write as:

$$x\left( t \right)=A5(t)+D5(t)+D4(t)+D3(t)+D2(t)+D1(t).$$
(3)

Wavelet functions used for this study are standard DWT functions available in Matlab Wavelet toolbox [26], namely Daubechies (db6). An example of the decomposition of gaseous embolus and solid embolus signals is shown in Fig. 1. The detail and approximation coefficients are not directly used as classifier inputs. In this study, several features are used. All features are individually applied to the detail coefficients of each decomposition level.

Fig. 1
figure 1

Examples of DWT using Daubechies (db6) as mother wavelet of backscatter RF signal at low mechanical index (MI = 0.2)

Detection algorithm

The general block diagram of the detection system is shown in Fig. 2. The input backscatter RF signals are first detected and collected (signal acquisition). At the second stage, DWT coefficients of the signals are collected. Input signals are decomposed into an optimum number of frequency bands using DWT. Therefore, it is vital to select an appropriate wavelet for the signal being analyzed. Suitability of the wavelet filters and orders are determined experimentally. It should be noted that we shown in previous work that the best mother wavelet on the same types of backscatter RF signals is Daubechies (db6) [27]. In the third step, after applying DTW on the backscatter RF signals, several features are evaluated from the detail coefficients. It should be noted that the features used in this study are the same in the paper by Aydin et al. [7]. Table 2 shows the ten (10) features for each decomposition level and their formulas. In the last step, all these features are first used as inputs to the classification model without feature selection method. Second, due to the curse of dimensionality, we employ a feature selection algorithm based on DE algorithm [18, 19]. The motivation for this approach is that the more powerful among the existing machine learning algorithms tend to get confused when supplied with a large number of features [28].

Table 1 shows the confusion matrix in which the correct and incorrect predictions made by the classifier compared with the true labels in the test data are depicted.

Fig. 2
figure 2

Block diagram of the DWT-based detection system

Table 1 Confusion matrix for binary classification model

Before classification and feature selection tasks, and since the generalization performance of an algorithm should be estimated using unseen samples, we randomly divided the dataset into two subsets (training set and test set). After that, we applied cross-validation technique only on the training set to tune the classifier parameters and to rank the features. Thus the algorithms have only access to the training set, and the test set is kept unseen both to the ranking step and to the classifying step. The experimental evaluation is performed using hold-out-set cross-validation to avoid overfitting and assure statistical validity of the results [24].

The reported results are obtained with different types of measures, such as classification accuracy, sensitivity, and specificity.

Accuracy: (Number of correct predicted labels)/(Number of all labels)

$$Accuracy=\frac{TP+TN}{TP+TN+FP+FN}$$
(4)

Sensitivity: proportion of actual positives which are predicted positive it can be expressed as:

$$Sensitivity=\frac{TP}{TP+FN}$$
(5)

and then Specificity which is the proportion of actual negative which are predicted negative:

$$Specificity=\frac{TN}{TN+FP}$$
(6)

Feature extraction

The instantaneous power (IP) is calculated for the DWT coefficients of each level. A threshold value for each level is determined. Figure 3 illustrates the associated IP and threshold values, which are used in the detection algorithm.

Fig. 3
figure 3

Instantaneous power and corresponding threshold values for each level (gaseous and solid embolus signals)

The threshold is calculated from the data using a statistical method, which depends on the data length and the standard deviation [29], and it is given by:

$${{A}_{\text{th}}}={{\sigma }_{\text{n}}}\sqrt{{{\log }_{2}}N}$$
(7)

where: σn is the standard deviation of the signal power at the nth level and \(N\) is the length of the observation.

The following parameters relating the threshold are determined:

The ratio of the embolic signal to the background signal (EBR) is the most widely used feature in microemboli detection [7]. EBR shows how strong an embolic signal is relative to the background. P2TR is one of the definitions of the EBR, P2TR can be calculated using the measures given in Fig. 4.

$$\text{P2TR}=10\log \frac{{{\text{A}}_{\text{pk}}}}{{{\text{A}}_{\text{th}}}}\text{ (dB)}$$
(8)

Another feature, the total power to the threshold ratio (TP2TR) which is the quantity of power a signal has relative to the background energy, and it is given by:

$$\text{TP2TR}=10\text{ }\log \frac{{{\text{A}}_{\text{tot}}}}{{{\text{A}}_{\text{th}}}}=10\text{ }\log \frac{\sum\nolimits_{k={{t}_{on}}}^{{{t}_{{{o}_{ff}}}}}{{{\text{A}}_{f}}\left( k \right)}}{{{\text{A}}_{\text{th}}}}\text{ (dB)}$$
(9)

where A tot is the total power of the signal A(k). It is calculated by integrating the IP of signal between t on and t off , as illustrated in Fig. 4.

Fig. 4
figure 4

Instantaneous power of the detail coefficient and parameters used to calculate detection features

Two other features, which use threshold indirectly, are rise rate (RR) and fall rate (FR):

$$\text{RR}=\frac{10\text{ log}\frac{{{\text{A}}_{pk}}}{{{\text{A}}_{\text{th}}}}}{{{\text{t}}_{\text{pk}}}-{{t}_{on}}}=\frac{\text{P2TR}}{{{\text{t}}_{\text{pk}}}-{{t}_{on}}}\text{ (dB/ms)}$$
(10)
$$\text{FR}=\frac{10\text{ log}\frac{{{\text{A}}_{pk}}}{{{\text{A}}_{\text{th}}}}}{{{\text{t}}_{\text{off}}}-{{t}_{pk}}}=\frac{\text{P2TR}}{{{\text{t}}_{\text{off}}}-{{t}_{pk}}}\text{ (dB/ms)}$$
(11)

where: t s is the average time of the signal and f s is the average frequency of the signal.

t s and f s are calculated, respectively, as

$${{t}_{s}}=\frac{1}{{{E}_{s}}}\int\limits_{-\infty }^{+\infty }{t\text{ }{{\left| \text{ }s\left( t \right)\text{ } \right|}^{2}}dt}$$
(12)
$${{f}_{s}}=\frac{1}{{{E}_{s}}}\int\limits_{-\infty }^{+\infty }{f\text{ }{{\left| \text{ }S\left( f \right)\text{ } \right|}^{2}}df}$$
(13)

where S(f) is the Fourier transform of the signal s(t).

Time spreading Ts 2 and frequency spreading Bs 2 are defined as:

$$T_{s}^{2}=\frac{1}{{{E}_{s}}}{{\int\limits_{-\infty }^{+\infty }{{{\left( t-{{t}_{s}} \right)}^{2}} {{\left|s(t)\right|}}^{2}}}}dt$$
(14)
$$B_{s}^{2}=\frac{1}{{{E}_{s}}}{{\int\limits_{-\infty }^{+\infty }{{{\left( f-{{f}_{s}} \right)}^{2}}{{\left|S(f)\right|}}^{2}}}}df$$
(15)

where:

$${{E}_{s}}=\underset{-\infty }{\overset{+\infty }{\mathop \int }}\,{{\left| s(t) \right|}^{2}}dt$$
(16)

The instantaneous envelope and instantaneous frequency of a signal [30] are used to describe a signal simultaneously in time and in frequency. These two parameters are defined, respectively, as:

$$a\left( t \right)=\left| {{s}_{a}}(t) \right|$$
(17)
$$f\left( t \right)=\frac{1}{2\pi }\frac{d(arg({{s}_{a}}(t)))}{dt}$$
(18)

where s a (t) is the Hilbert transform of s(t).

$${{s}_{a}}\left( t \right)=s\left( t \right)+j\hat{s}\left( t \right).\hat{s}(t)$$
(19)

The variances of instantaneous envelope and instantaneous frequency (VIE and VIF) are used as other types of features.

Processing steps for the classification of microemboli can be summarized as follows:

  • Apply DWT to both solid and gaseous signals in order to collect DWT coefficients;

  • calculate Instantaneous power for each level;

  • from each level, derive a threshold value to be employed in detection;

  • evaluate previously described parameters for each DWT level;

  • apply: (i) classification, (ii) dimensionality reduction.

Table 2 summarizes the feature extractor methods that are used in this study.

Table 2 The features used in this study for each decomposition level and their formulas

As shown in Fig. 3, and unlike all the detail coefficients, the approximation coefficients does not have a similar shape (peak or Gaussian) thus we can’t extract the parameters t on and t off , therefore we discarded the approximation coefficients from the feature extraction phase.

Classification

For binary classification problems with limited number of samples it is crucial to validate the classification model with cross validation technique. Before building the classification model, the samples are subdivided into three subsets training set, validation set, and test set. The test set is used only for the assessment of the model selected by the cross-validation technique, while the validation set is used to tune the classifiers parameter. Therefore the algorithm has only access to the training and validation sets, the test set is kept unseen in the selection process of the best model.

Our experimental data consists of 102 samples (51 solid embolus and 51 gaseous embolus). In order to evaluate the predictive ability of a model, we randomly divide the dataset, into three subsets training set, validation set, and test set. The test set, approximately one-third (1/3) of the data (14 solid embolus and 14 gaseous embolus), is used only for the assessment of the model selected by the cross-validation technique, while the rest of the data will belong to the learning set (used for building the models) which will be divided into a training set (approximately two-third) and a validation set (approximately one-third). The validation set is used to tune the classifiers parameter.

In order to evaluate the performance of the proposed system, we employ support vector machines classifier.

Support vector machines classifier

Support Vector Machines, proposed by Vapnik’s group in 1995 [31], are supervised learning techniques based on a statistical learning theory. SVM has gained increasing interest in areas that range from regression estimation to pattern recognition, due to its great learning performance. Indeed, SVMs don’t have local extrema problems that are present for traditional neural networks, which involve large numbers of training patterns [32].

SVM was originally designed for linear binary classification problems. For nonlinear classification, a transformation maps the data from the input space into a higher dimensional feature space that allows linear separation. We then seek the optimization separation plane in feature space using a kernel function. In our case, radial basis kernel is used to differentiate between solid embolus and gaseous. More details about recent developments of SVM can be found in [3133].

The kernel parameter γ and the penalty parameter C are the two major parameters of the RBF applied in SVM-RBF which have to be set appropriately in order to improve SVM learning. γ is the width of RBF kernel. C is the regularization parameter of error which allows one to trade off training error versus model complexity. In order to empirically select the best classification model, the value of each parameter is varied in a given predefined range according to a grid search [34]. We carried out experiments by trying exponentially growing sequences of C and γ on the training set [34]. The combination that results in a model with highest validation accuracy is picked as the best choice of the classification problem and then used to accurately predict the testing set.

Feature selection based on differential evolution algorithm

The feature vector cited in “Discrete wavelet transform of backscatter RF signals” section is too large to be handled properly by a classifier during training. To overcome this curse of dimensionality, we apply a supervised feature selection approach based on DE algorithm which belongs to the family of evolutionary algorithms. Later DE has been successfully applied to a large number of problems for example feature selection [18, 19]. To solve an optimization problem, DE starts by iteratively modifying a randomly generated initial population of candidate solutions using a floating-point encoding instead of binary numbers [18, 19, 35]. This process is then enhanced using selection, mutation, and crossover operations. DE combines different randomly chosen populations (\({{X}_{r0}}, {{X}_{r1}},\text{ and }{{X}_{r2}}\)) to create a mutant element (\({{V}_{i,g}}\)) from the current generation \(g\) [19]:

$${{V}_{i,g}}={{X}_{\text{r}0,g}}+\text{F * }\left( {{X}_{r1,g}}-{{X}_{r2,g}} \right)$$
(20)

where \(\text{F}\in \left( 0,\text{ }1 \right)\) is a scale factor that controls the rate at which the population evolves. The index \(g\) indicates the generation to which a vector belongs.

In addition, DE also uses discrete recombination (crossover), in order to construct trial vectors out of parameter values that have been copied from two different populations:

$${\text{U}}_{{j,i,g}} = \left\{ {\begin{array}{*{20}c} {{\text{V}}_{{j,i,g}} } & {{\text{if rand }}\left( {0{\text{ }},{\text{ }}1} \right) \le C_{r} {\text{ or}}} \\ {{\text{X}}_{{j,i,g}} } & {{\text{otherwise,}}} \\ \end{array} } \right.$$
(21)

where \({{\text{U}}_{j,i,g}}\)is the j’th trial element along i’th dimension from the current population g and \({{\text{C}}_{r}}\in \left[ 0,\text{ }1 \right]\) is the crossover probability which controls the fraction of parameter values that are copied from the mutant.

If the newly generated element results in a better fitness (classification accuracy) than the predetermined population member, then the resulting element replaces the vector with which it was compared.

The steps used in DEFS algorithm are [18, 19]:

  • Generate new population elements from the original population.

  • Create a new mutant vector for each position in the population matrix.

  • A trail vector is obtained by crossing the mutant vector with the original vector.

  • The corresponding position in the new population will contain either:

    • the trial vector or its corrected version,

    • or the original target vector depending on which one of them achieved a better fitness.

  • The process is repeated until each of the population elements have competed against a randomly generated trial element.

Once the last trial element has been evaluated, the survivors of the population pairwise competitions become parents for the next generation in the evolutionary cycle. For further details on the above DEFS method, please refer to [18, 19].

Datasets

In order to verify the usefulness of RF ultrasound signal processing for microemboli classification, an in vitro set-up is developed. It consists of a Doppler flow phantom containing 3 tubes of 0.2, 0.4 and 0.8 mm in diameter. The tube of 0.8 mm is chosen since its size approximates the size of a human vessel (Fig. 5). In order to imitate the ultrasonic behavior of gaseous emboli, contrast agents consisting of microbubbles are used in our experimental setup. Indeed, several studies have revealed that the acoustic behavior of gaseous emboli and microbubbles are similar, mainly the nonlinear behavior [14, 36]. Hence, a continuous flow carries the Sonovue microbubbles [37, 38] (contrast microbubbles) through the insonified vessel is used to mimic the ultrasonic behavior (scattering) of gaseous emboli. We used Sonovue microbubbles since it is the only commercially available contrast agent in Europe. The concentration of microbubbles and the flow are controlled by the operator. Since in clinical situations the scattering amplitude of emboli approaches that of blood, we fixed the concentration of Sonovue microbubbles such as its scattering amplitude at the fundamental frequency is comparable to the scattering of the surrounding tissue at the same frequency.

Fig. 5
figure 5

Experimental set-up

However, solid emboli present acoustic properties comparable to those of biological tissue, in their scattered signal no harmonic components can be seen, therefore the scattering of solid emboli is purely linear. Thus the tissue mimicking material surrounding the vessel is chosen to mimic the behavior of solid emboli. Moreover, the applied acoustic pressures are not sufficiently high to induce nonlinear propagation effects and does not generate any harmonic components during ultrasound propagation.

The ultrasound waves are generated by a VF13-5 probe connected to a Siemens Antares scanner (Anthares, Siemens, MV, CA). The acquisitions are carried out at 1.82 MHz transmit frequency in Tissue Harmonic Imaging (THI) mode at 14 fps frame rate, and 20 MHz as the sampling rate of the signals. The acoustic focus is set at 2 cm which is the depth at which the flowing contrast microbubbles are situated. Ultrasound waves are transmitted at two different intensities corresponding to mechanical indices (MI) of 0.2 (low MI) and 0.6 (high MI). The User Research Interface is used to grab the unfiltered RF signals to a personal computer for further analysis. Two concentrations of contrast agent/Isoton were used: 0.025 and 0.05 µl/ml [13].

The regions of interests where RF signals corresponding to gaseous embolus and solid embolus are shown in Fig. 6.

Fig. 6
figure 6

Grey scale images acquired: a MI = 0.2, b MI = 0.6 for two microbubbles concentrations

Figure 7 illustrates two types of RF signals extracted from the obtained grayscale images. Panel a shows RF signal backscattered by gaseous and solid embolus at low MI (0.2). It is noteworthy that the acoustic pressure is not enough to create nonlinear microbubbles oscillations. Panel b illustrates the RF signal of each type of embolus at higher MI (0.6) in this case the propagation of an ultrasound wave becomes nonlinear, thus a nonlinear behavior of both gaseous and solid particles is observed [13].

Fig. 7
figure 7

Examples of RF signals. a MI = 0.2, b MI = 0.6

Results

Classification performance is evaluated in terms of two measures, which are: (i) the overall accuracy, which is the percentage of correctly classified embolus among all the embolus independently of the classes they belong to; (ii) the accuracy of each class that is the percentage of correctly classified embolus among the embolus of the considered class.

Table 3 summarizes the percentage of correct classification rates of microemboli using SVM analysis as a function of all input features (50 features) and mechanical indexes for the two microbubble concentrations (0.025 and 0.05 µl/ml).

Table 3 Classification rates of gaseous and solid emboli using all features

The average classification rate doesn’t exceed 85.71% for all datasets using all features as input vector. Therefore the feature vector is too large to be handled properly by the classifier. To overcome this limitation, we grouped the input parameters into small feature vectors regarding the nature of each feature then we applied the classification algorithm on each set of features separately, the results are shown in Table 4.

Table 4 Classification rates of gaseous and solid emboli using each set of features separately

The features P2TR n , RR n , FR n , and VIF n for the two concentrations do not provide significant classification rates neither at low MI (0.2) nor at high MI (0.6). When the features TP2TR n , \({{t}_{s}}_{n}\),\({{f}_{s}}_{n}\), \(T{{_{s}^{2}}_{n}}\), \(B{{_{s}^{2}}_{n}}\), and VIE n are introduced as input parameters into the SVM model, the correct average rate of classification of microemboli reached 89.28%, thus a significant improvement in the classification rates is observed.

In order to reduce the dimensionality we perform feature selection using differential evolution algorithm. DEFS algorithm is a wrapper approach; it performs feature selection using SVM classifier. We select the set of features that maximizes accuracy of the classification model (refer to Fig. 8).

Fig. 8
figure 8

Classification rates vs number of selected features with concentration of microbubbles (0.025 µl/ml) at high MI (0.6)

Figure 8 draws the evolution of the average classification rate vs number of selected features at low microbubble concentration (0.025 µl/ml) with a higher MI (0.6), the best result (Accuracy = 96.42%) is obtained using four features. The training data set is used to select the most relevant features and to fix the parameters of the classifier. The final results presented, are based on the system’s performance using one unseen test data set.

The output of the designed approach using DEFS algorithm and SVM classifier based on hold-out-set cross validation are illustrated in Table 5.

Table 5 Generalisation performances using feature selection based DEFS algorithm

For the concentration of microbubbles (0.025 µl/ml) at low MI (0.2) our proposed method achieved 96.42% classification rate using a total of 4 features. For the other three acquisitions the performance measures are quite similar, the proposed method reached a classification rate of 92.85%.

More particular parameters can also be derived, such as the sensitivity and the specificity of the classification system (refer to Eqs. 5, 6) which are used to discriminate relevant information that provide more insight into the characteristics of the model in order to make meaningful decisions. Sensitivity relates to a test’s ability to correctly identify those patients with pathology as positive. Specificity of a test refers to its ability to correctly identify individuals without the pathology as negative. Furthermore, we investigated other statistical measure which better estimates the accuracy of a given trial test by analyzing sensitivity and specificity simultaneously, this approach is the area under curve (AUC) associated to the receiver operating characteristic (ROC) curve. AUC allows to quantify the ROC curve performance using a single value. It is well known that the higher the AUC value, the more efficient the classifier.

Figure 9 shows the ROC curves of the four datasets using several detection thresholds in which TP rate is plotted on the Y axis and FP rate is plotted on the X axis. The best results (that maximizes sensitivity and specificity) are achieved using a threshold of 0.1163. At this cut-offs, the sensitivity is 100% and specificity is 92.85%. The ROC curves in Fig. 9 indicate that the proposed model performs better at identifying gaseous emboli than at identifying solid emboli for the concentrations C = 0.025 µl/ml and C = 0.05 µl/ml at high MI (0.6), the ROC curves produce an AUC of 97.44 and 96.93% respectively. On the contrary, for the concentration C = 0.025 µl/ml at low MI (0.2), the classifier recognizes better solid emboli than gaseous emboli. The classification performances for gaseous and solid emboli are quite similar for the concentration C = 0.05 µl/ml at low MI (0.2).

Fig. 9
figure 9

ROC curve at different detection thresholds: a MI = 0.2, b MI = 0.6 for two microbubbles concentrations

Table 5 illustrates the analysis of ROC curves using DEFS algorithm. The best value of accuracy based on the optimal threshold is obtained for concentration C = 0.025 µl/ml at high MI (0.6), the ROC point at (0.07, 1) produces its highest accuracy (96.42%), with an AUC of 97.44%. For all the acquisitions a significant improvement in terms of accuracy, sensitivity, and specificity is observed when using feature selection based on DEFS algorithm.

Table 6 illustrates the confusion matrix of the proposed system. The numbers of correct and incorrect predictions made by the proposed model compared to the target values in the test data are shown in Table 6. For example, at microbubble concentration 0.025 µl/ml and high MI (0.6) the proposed classification model succeeded in classifying 14 gaseous embolus out of 14 (Sensitivity = 100%) and 13 solid embolus out of 14 (Specificity = 92.85%). Thus 1 solid embolus is not recognized i.e. classified as gaseous embolus, the gaseous embolus are all recognized.

Table 6 Confusion matrix of the proposed model

In order to validate the proposed approach; we compared the obtained results in this study with those obtained in a recently published study [13] on the same backscatter RF signals (refer to Fig. 10). The average percentage of correct classification of microemboli using DEFS algorithm with SVM classifier for two microbubbles concentrations (0.025 µl/ml and 0.05 µl/ml) at high and low mechanical index (0.2 and 0.6) are given in Table 7.

Fig. 10
figure 10

Generalisation performances using DEFS algorithm and comparison with the results obtained in Ref. [13]: a MI = 0.2, b MI = 0.6 for two microbubbles concentrations

Table 7 Generalisation performances using DEFS algorithm and comparison with the results obtained in Ref. [13]

Discussion

In this experimental study we exploit RF signals in the detection and the classification of microemboli into gaseous or solid embolus. Several features are evaluated from the detail coefficients using DWT technique. It should be noted that the features used in this study are the same used in the work by Aydin et al. [7]. These features are used as inputs to the classification models without feature selection method. The average classification rate doesn’t exceed 89.28% for all datasets, this can be explained by the fact that even the more powerful among the existing machine learning algorithms tend to get confused when supplied with a large number of features.

Building quantitative models (classifiers) using a large number of features most often requires using a smaller set a features than the initial one. Indeed, a too large number of features feed to a model (classifier) results in a too large number of parameters, leading to overfitting and poor generalization abilities [27]. It is noteworthy, in our case, the original data set contains d features (d = 50), an extensive search of all possible combinations would involve the design of 2d − 1 different models. This value grows exponentially, making an exhaustive search impractical even for moderate values of d. In order to reduce the dimensionality and select a relevant set of features, we implemented a wrapper feature selection approach based on differential evolution algorithm. For all the acquisitions a significant improvement in the classification rates is observed when using feature selection based on DEFS algorithm. The average classification rate goes down when the number of selected features gets larger which validates that learning might be deteriorated by irrelevant features (refer to Fig. 8).

In order validate the proposed approach; we compared the obtained results in this study with those obtained in a recently published study [13] on the same backscatter RF signals (refer to Table 7). In Ref. [13] the authors employed a neural network (MLP and RBF) analysis using the fundamental and the second harmonic components information contained in the RF signal backscattered by an embolus. The experimental results show clearly that our proposed method achieves better average classification rates compared to the method cited in [13] using also the same backscatter RF signals.

The superiority of SVM over the Neural Network approach can be explained by the fact that SVM has a high capacity for generalization using limited numbers of training data points; furthermore, SVMs don’t have local extrema problems that are present for traditional neural networks, which involve large numbers of training patterns. The performance of SVMs relies on the choice of kernel type and kernel parameters, but this dependence is less influential.

It is noteworthy that the results depicted by Tables 3, 4, 5, 6 and 7 are reproducible. The best results appear in bold. The algorithm has only access to the training and validation sets, the test set is kept unseen in the selection process of the best model. The test set is used only for the assessment of the model selected by the cross-validation technique.

According to the obtained results, one may need to combine other types of approaches (e.g., filter and embedded) in order to improve the discriminative power of the proposed method. The processing time is an essential consideration if we have to implement this approach in clinical decisions. It should be noted that the training/feature selection phase lasts for a few minutes while the processing time for the test phase (classification) is less than 1 s. The implementation of such algorithm in real time applications makes also the investigation of the feasibility of emboli signals detection systems using RF signals worthwhile.

Conclusion

Emboli detection and classification remain promising research due to the correlation between embolism consequences and embolus nature (gaseous or solid emboli). The detection and classification of circulating microemboli is nowadays considered as a main challenge in the field of emboli detection.

The results presented in this experimental study demonstrate the usefulness of RF ultrasound signal processing in detection and classification of microemboli. A first proof of the concept of emboli classification based on the combination of a time–frequency based feature extraction technique (discrete wavelet transform), feature selection (DEFS), and backscatter RF signals is demonstrated. The results indicated that feature selection not only has the ability to find the most relevant set of inputs that result in higher average classification rate but also has the ability to reduce the size of feature vector. We demonstrated in this experimental study that combining DEFS algorithm and discrete wavelet transform provides better average classification rates (96.42%) in comparison to a previous study using also the same backscatter RF signal [13] that was evaluated extensively in a comparable manner.

The technique as suggested in this study proves to be effective in improving classification. The implementation of such algorithm in real time applications makes also the investigation of the feasibility of emboli signals detection systems using RF signal worthwhile. However, further validations in in-vivo situations are required to demonstrate the additional benefit.