Abstract
In this paper, we are interested in the problem of Blind Source Separation using a Second-order Statistics (SOS) method in order to separate autocorrelated and mutually independent sources mixed according to a bilinear (BL) model. In this context, we propose a new approach called Bilinear Second-order Blind Source Separation, which is an extension of linear SOS methods, devoted to separate sources present in BL mixtures. These sources, called extended sources, include the actual sources and their products. We first study the statistical properties of the different extended sources, in order to verify the assumption of identifiability when the actual sources are zero-mean and when they are not. Then, we present the different steps performed in order to estimate these actual centred sources and to extract the actual mixing parameters. The obtained results using artificial mixtures of synthetic and real sources confirm the effectiveness of the new proposed approach.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
Blind Source Separation (BSS) consists in decomposing several observed signals into a set of source signals and their mixing parameters, with almost no prior knowledge about them (Comon and Jutten 2010; Deville 2016). Most of the researches dealing with BSS methods suppose that the mixing model is linear, where the observed signals result from linear combinations of the source signals. Nevertheless, for some applications, the linear mixing model is not valid, and must be replaced by a nonlinear one. This nonlinear model provides a better description of the mixing process and the interactions between sources. Due to the complexity of nonlinear models, the nonlinear BSS methods are more complex and remain less studied (Taleb 2002; Deville and Hosseini 2009; Hosseini and Deville 2013). This complexity may be reduced by constraining the structure of the mixing models. Indeed, the addition of simplifying assumptions has allowed the development of exploitable nonlinear models. Among these models, the linear-quadratic (LQ) model has drawn significant attention [see e.g. the survey in Deville and Duarte (2015)]. Research focused on the LQ model has demonstrated the relevance of its use in several applications such as remote sensing (Meganem et al. 2014a, b; Eches and Guillaume 2014; Jarboui et al. 2014, 2016), analysis of gas sensor array data (Bedoya 2006; Ando et al. 2015), and scanned document processing (Merrikh-Bayat et al. 2011; Duarte et al. 2011; Almeida and Almeida 2012; Liu and Wang 2013). The particularity of the LQ model as compared with the linear one is the presence of the second-order terms. Thereby, considering K observations resulting from an LQ mixture of L sources, the relationship between the observed and the source signals can be characterized by the following equation
where \(x_i(n)\) is the ith observed signal at time n, \(s_j(n)\) is the jth unknown source signal, \( a_{j}(i) \) is the linear coefficient associated with the jth source signal and the ith observed signal, \( a_{j,k}(i) \) is the quadratic mixing coefficient associated with the ith observed signal and resulting from the interaction between the jth and the kth sources. All the actual sources \(s_j\) and the pseudo-sources \(s_j \times s_k\) are called extended sources, as in Deville and Duarte (2015); Meganem et al. (2014b). There is a particular case of the LQ model called the bilinear (BL) model where the squared term coefficients \( a_{j,j}(i) \) are null. The model (1) then becomes
Various methods applicable to the LQ model have been proposed (Deville and Duarte 2015). While some of them are only devoted to Blind Mixture Identification (BMI), in order to only identify the mixing parameters (Krob and Benidir 1993; Abed-Meraim et al. 1996), the others are dedicated to BSS, which aims also at estimating the source signals. They include Sparse Component Analysis (SCA) methods (Jarboui et al. 2014; Deville and Hosseini 2007), which are only applicable to sparse sources, Non-negative Matrix Factorization (NMF) methods (Meganem et al. 2014b; Eches and Guillaume 2014; Jarboui et al. 2016), which may be used only when sources and mixing coefficients are non-negative, and Independent Component Analysis (ICA) methods (Deville and Hosseini 2009; Hosseini and Deville 2013; Castella 2008), which are based on the assumption that the source signals are statistically independent. Nevertheless, the use of LQ ICA-based methods, either BMI or BSS, is usually constrained by other properties of the sources and/or the mixture. For example, some of them can be used only when the sources are complex-valued and circular (Krob and Benidir 1993; Abed-Meraim et al. 1996) or binary (Castella 2008). Others are suited only to determined mixtures (Deville and Hosseini 2009; Hosseini and Deville 2013; Almeida and Almeida 2012), which generally means overlooking a useful part of the available observations. Moreover, most of the existent LQ ICA-based methods are time-consuming.
In this paper, we study the bilinear model in an over-determined configuration. In addition, we suppose that the sources are real-valued, stochastic, auto-correlated, mutually independent and jointly strict-sense stationary signals. Then, we propose a new and fast BSS method, called Bilinear Second-Order Blind Source Separation (B-SO-BSS), based on Second-order Statistics (SOS) and the joint diagonalization of correlation matrices of the whitened centred observed signals. Such SOS methods have already been proposed in the framework of linear BSS (Tong et al. 1990; Belouchrani et al. 1997). We first study the correlation between different extended sources in Sect. 2, then we present our proposed method developed based on the results of this study in Sect. 3, and in Sect. 4, we eventually provide some simulation results using artificial mixtures of synthetic and real-world sources.
2 Mutual correlation of the extended sources
In this section, by supposing that the actual sources \(s_j\) are real-valued, stochastic, auto-correlated , mutually independent and jointly strict-sense stationary, we investigate whether all the extended sources are mutually uncorrelated in two different cases: when the actual sources are zero-mean and when they are not.
2.1 Case of zero-mean actual sources
We here detail the study of the correlation between the different extended sources when the actual sources are zero-mean. The pseudo-sources are then zero-mean, as will now be shown. Indeed, these pseudo-sources are defined as \( s_j(n)s_k(n) \), with \( j \ne k \). Their factors \( s_j(n) \) and \( s_k(n) \) are independent and thus uncorrelated, which yields
where \( E\{ . \} \) stands for expectation.
All extended sources are thus zero-mean in the considered case. Therefore, their cross-covariance functions, which should be used to measure their correlation, are here equal to their cross-correlation functions. The latter functions are derived hereafter.
2.1.1 Correlation function of \(s_i(n)\) and \(s_j(n)\)
Two different actual sources \(s_i(n)\) and \(s_j(n)\) being independent and zero-mean, their cross-correlation function is equal to zero:
2.1.2 Correlation function of \(s_i(n)\) and \(s_j(n)\times s_k(n)\)
\( \mathrm {If} \ (i \ne j) \ \mathrm {and} \ (i \ne k)\), since \(s_i(n)\), \(s_j(n)\) and \(s_k(n)\) are independent, we can write
\( \mathrm {If} \ (i = j) \ \mathrm {or} \ (i = k)\), the reasoning is similar for the two cases, e.g. considering \((i = j)\), we get
2.1.3 Correlation function of \(s_i(n)\times s_j(n)\) and \(s_k(n)\times s_l(n)\)
\(\mathrm {If} \ (i \ne k),\ (i \ne l),\ (j \ne k) \ \mathrm {and} \ (j \ne l)\) , the independence of \(s_i(n)\), \(s_j(n)\), \(s_k(n)\) and \(s_l(n)\) yields
\( \mathrm {If} \ (i = (k \ \mathrm {or} \ l)) \ \mathrm {xor} \ (j = (k \ \mathrm {or} \ l))\), the reasoning is similar for all cases, e.g. considering \((i=k)\) and therefore \((j \ne (k \ \mathrm {and} \ l))\), the sources \(s_i\), \(s_j\) and \(s_l\) are independent so that
Thus, in the case of zero-mean actual sources, all the extended sources are mutually uncorrelated.
2.2 Case of non-zero-mean actual sources
In this case, (3) shows that the pseudo-sources are non-zero-mean, so that the correlation of the extended sources must be measured using their cross-covariance functions. Assuming that the actual sources are auto-correlated such that the covariance function \(C_{s_i,s_i}(\tau ) \ne 0\) at a lag \(\tau \), we here present two types of extended sources which are mutually correlated.
2.2.1 Covariance function of \(s_i(n)\) and \(s_j(n)\times s_k(n)\) when \((i = j) \ \mathrm {or} \ (i = k)\)
The reasoning is similar for the two cases, e.g. considering \((i=j)\), calculating the covariance function yields
Therefore, \(s_i(n)\) and \(s_i(n)\times s_k(n)\) are correlated.
2.2.2 Covariance function of \(s_i(n)\times s_j(n)\) and \(s_k(n)\times s_l(n)\) when \((i = (k \ \mathrm {or} \ l)) \ \mathrm {xor} \ (j = (k \ \mathrm {or} \ l))\)
The reasoning is similar for all cases, e.g. considering \((i=k)\) and therefore \((j \ne (k \ and \ l))\), the covariance function reads
Therefore, \(s_i(n)\times s_j(n)\) and \(s_i(n)\times s_l(n)\) are correlated.
3 Proposed BSS method
In this section, we propose a new BSS method, called Bilinear Second-Order Blind Source Separation (B-SO-BSS), first for zero-mean and then for non zero-mean actual sources.
3.1 Case of zero-mean actual sources
The bilinear mixing model (2) can be written in the following matrix form
where \(\mathbf{x}(n)=[x_1(n),\ldots ,x_K(n)]^T\) is the vector of K observed signals at time n, \(\mathbf{s}(n)=[s_1(n),\ldots ,s_L(n),s_1(n)s_2(n),\ldots ,s_{L-1}(n)s_L(n)]^T\) is the vector of all the extended sources at time n, and the mixing matrix \(\mathbf{A}\), which contains both linear and quadratic mixing parameters, reads
Then, the bilinear mixture can be considered as a linear mixture of the \(L(L+1)/2\) extended sources. In the following, we assume that \(K \ge L(L+1)/2\) so that this reformulated linear mixture is not under-determined. As shown in Sect. 2.1, all the extended sources are mutually uncorrelated in the case considered here. If they are also auto-correlated with different auto-correlation functions, the source separation may be achieved by jointly diagonalizing the correlation matrices of the whitened centred observations at different lags as will be detailed in Sect. 3.4.
3.2 Case of non-zero-mean actual sources
As shown in Sect. 2.2, in this case some extended sources are mutually correlated. Here, we show how the original bilinear mixing model may be used to derive a new mixing model with new mutually uncorrelated extended sources. The mean \( E\{ s_j(n) \} \) of the actual source \( s_j(n) \) does not depend on the considered time n, since the actual sources are assumed to be strict-sense stationary. The expectation of \({s}_j\) will be denoted by \(\bar{s}_j\) hereafter. The centred version of \( s_j(n) \) is thus \(\widetilde{s}_j(n)=s_j(n)-\bar{s}_j\). The bilinear model (2) can then be written as
The fourth term on the right hand side of (13), denoted as F in the following, can be rewritten as
where the last term above is obtained just by inverting the roles of symbols j and k.
Then, we introduce the coefficients \(a_{j,k}(i)\) with \(j>k\), defined with respect to the actual coefficients of (2), as \(a_{j,k}(i)=a_{k,j}(i)\).
This yields
The above sum contains all possible combinations of \(j\in \left[ 1,L\right] \), \(k\in \left[ 1,L\right] \) such that \(j\ne k\). It can then be rewritten as
Replacing (16) in (13) leads to
which yields
where \( \widetilde{a}_{j}(i) \) are the linear coefficients of the new model which are defined as
and \(C_i\) is a constant defined as
Since the actual centred sources \(\widetilde{s}_j(n)\) and \(\widetilde{s}_k(n)\) are zero-mean and independent, from (18) the mean of the observed value \(x_i(n)\) is equal to \(\bar{x}_i = C_i\). Thus, its centred version can be written as follows:
As can be seen, the centred observations form a new bilinear mixture of the actual centred sources, although the mixing parameters of the linear part in this new model are not the same as those in the original mixture. According to the results provided in Sect. 2.1, the new extended sources \(\widetilde{s}_j(n)\) and \( \widetilde{s}_j(n) \widetilde{s}_k(n) \) are all mutually uncorrelated. We can then rewrite this new bilinear model in the matrix form (11) just by replacing \(\mathbf{s}\) and \(\mathbf{x}\) by \(\widetilde{\mathbf{s}}\) and \(\widetilde{\mathbf{x}}\), and the parameters \(a_j(i)\) by \(\widetilde{a}_j (i)\) in the expression (12) of the matrix \(\mathbf{A}\) to obtain the matrix \(\mathbf{\widetilde{A}}\).
The approach that we developed at this stage therefore yields a modified set of observations, namely the centred observations \( \widetilde{x}_i(n) \), which form a determined (or overdetermined) linear mixture of a modified set of mutually uncorrelated source signals, namely the extended centred sources related to the actual centred sources. Moreover, we hereafter consider the case when these modified source signals are auto-correlated with different auto-correlation functions. With respect to these modified observations and source signals, the configuration that we thus derived meets the same main assumptions as those which have previously been used in the literature, for plain linear mixtures, to derive second-order BSS methods, such as the Algorithm for Multiple Unknown Signals Extraction (AMUSE) (Tong et al. 1990) or its improved version, that is the Second-Order Blind Identification (SOBI) method (Belouchrani et al. 1997). This then allows us to derive extended versions of the above standard methods, which were initially intended for linear mixtures, in order to process our configuration based on bilinear mixtures. In particular, we hereafter propose an extension of SOBI.
3.3 Identifiability condition
A necessary step in the proposed method is to check the mixture identifiability condition. Similarly to the SOBI method, the proposed method uses several correlation matrices of the whitened centred observations for a fixed set of different non-zero lags \(\tau _i\in \{\tau _1,\ldots ,\tau _m\}\). For a lag \(\tau _i\), the correlation matrix of the \( L(L+1)/2 \) whitened centred observations \(\mathbf{z}(n)=\mathbf{W}\widetilde{\mathbf{x}}(n)\) (where \(\mathbf{W}\) is a whitening matrix) is given by:
where \(\mathbf{U}\) denotes an orthogonal matrix, \(^T\) stands for transposition, and \(R_{\widetilde{\mathbf{s}}}(\tau _i)\) denotes the correlation matrix of the extended centred sources associated with the lag \(\tau _i\), which is a diagonal matrix since the extended centred sources are mutually uncorrelated.
Let us consider the following theorem:
Theorem
Let \(\tau _i=\{{\tau }_1, \ldots ,{\tau }_m\}\) be m non-zero lags, \(\mathbf{V}\) be an orthogonal matrix, such that:
Then, \(\mathbf{U}\) and \(\mathbf{V}\) are essentially equal, i.e. they are equal up to a multiplication by a matrix \(\mathbf{P}\), such that \(\mathbf{U}=\mathbf{VP}\), where \(\mathbf{P}\) has one nonzero entry in each row and column, whose value is equal to \(\pm 1\).
This theorem provides a uniqueness condition for the matrix \(\mathbf{U}\) and consequently the mixing matrix \(\widetilde{\mathbf{A}}\). Note that the mixing matrix cannot be identified when the extended centred sources have identical normalized spectra. But, if they have different normalized spectra, it is possible to find a set of lags \(\tau _i\) satisfying the theorem condition. More details are provided in Belouchrani et al. (1997).
It should in particular be noted that if one of the actual centred sources \(\widetilde{s}_i(n)\) is temporally uncorrelated, then all the pseudo-sources related to it, i.e. \(\widetilde{s}_i(n)\widetilde{s}_j(n)\) with \(i \ne j\) are temporally uncorrelated too, so that all these extended centred sources have identical (constant) normalized spectra. Thus, a necessary condition for identifiability is that all the actual centred sources must be autocorrelated.Footnote 1
3.4 Proposed algorithm
Our proposed algorithm (B-SO-BSS), which provides estimates of centred actual sources up to a permutation and scale factors, is summarized in Algorithm 1.
In the particular case in which the actual sources are zero-mean, the same algorithm may be used just by choosing \(\widetilde{\mathbf{s}}=\mathbf{s}\), \(\widetilde{\mathbf{A}}=\mathbf{A}\), and \(\widetilde{\mathbf{x}}=\mathbf{x}\).
The last step of the proposed algorithm, which consists in identifying the estimated actual centred sources among all the estimated extended centred sources, is detailed below.
3.5 Identifying the estimated actual centred sources
The first steps of the proposed method yield a set of signals \(\hat{\mathbf {s}}(n)\) composed of estimates of the \(L(L+1)/2\) unordered extended centred sources, up to a permutation and scale factors. Thus, if e.g. \(\hat{s}_j(n)\) and \(\hat{s}_k(n)\) correspond to two centred actual sources and \(\hat{s}_i(n)\) corresponds to their product, then \(\hat{s}_i(n)\) must ideally be proportional to \(\hat{s}_j(n)\times \hat{s}_k(n)\). As a result, the absolute value of the correlation coefficient between \(\hat{s}_i(n)\) and \(\hat{s}_j(n)\times \hat{s}_k(n)\) must be close to one. Thus, by computing this correlation coefficient for all the possible triplets \(\{i,j,k\}\,, i\ne j\ne k\), we can identify the estimated actual centred sources among all the estimated extended centred sources.Footnote 2
3.6 Estimation of actual mixing coefficients
Many BSS applications only aim at estimating the actual source waveforms, which are provided by our Algorithm 1. In some applications (like in hyperspectral image unmixing), however, it is also needed to estimate the mixing parameters. In the case of non-zero-mean actual sources, our algorithm provides an estimate of matrix \(\widetilde{\mathbf{A}}\), and not \(\mathbf{A}\) (up to a permutation and a diagonal matrix). As mentioned in Sect. 3.2, matrix \(\widetilde{\mathbf{A}}\) consists of:
-
columns containing quadratic coefficients \(a_{j,k}(i)\), like in matrix \(\mathbf{A}\),
-
columns containing modified linear coefficients \(\widetilde{a}_{j}(i)\), which are different from the actual coefficients \({a}_{j}(i)\) included in matrix \(\mathbf{A}\), and are defined by (19).
In the following, we propose a method to recover an estimate of the actual matrix \(\mathbf{A}\) (up to classical indeterminacies) from the estimate of matrix \(\widetilde{\mathbf{A}}\) provided by Algorithm 1.
From (19), we have
where \(d_j\) and \(d_{j,k}\) are unknown arbitrary scale factors, up to which the extended centred sources and the columns of matrix \(\widetilde{\mathbf{A}}\) have been estimated by Algorithm 1. The above result can be written in the following matrix form
where \(\mathbf{c}=[C_1,\ldots ,C_K]^T\), \(\mathbf{e}=[d_1\bar{s}_1, \ldots , d_L \bar{s}_L, (-d_{1,2}\bar{s}_1\bar{s}_2), \ldots , (-d_{L-1,L}\bar{s}_{L-1}\bar{s}_L) ]^T\), and \(\widetilde{\mathbf{A}}_1\) is the result of dividing the columns of matrix \(\widetilde{\mathbf{A}}\) by unknown scale factors \(d_j\) and \(d_{j,k}\). In other words, \(\widetilde{\mathbf{A}}_1\) is the matrix \(\hat{\mathbf{A}}\) provided by Algorithm 1 up to estimation errors.Footnote 3
Note that \(\mathbf c \) can easily be obtained by estimating the means of observations. As a result, \(\mathbf{e}\) can be obtained using
where \(\dagger \) stands for pseudo-inverse.
Furthermore, (25) can be rewritten as
As mentioned above, \(\frac{\widetilde{a}_{j}(i)}{d_j}\) and \(\frac{a_{j,k}(i)}{d_{j,k}}\) (\(\forall i=1,\ldots , K\)) correspond to the columns of \(\widetilde{\mathbf{A}}_1\), estimated by Algorithm 1, and \((d_j\bar{s}_j)\) and \((-d_{j,k}\bar{s}_j \bar{s}_k)\) are the entries of \(\mathbf{e}\), estimated using (28). Consequently, \({a}_{j}(i) \; \forall i=1,\ldots , K\) can be estimated up to unknown factors \(\bar{s}_j\) according to (29). In other words, this approach allows one to estimate the columns of the actual matrix \(\mathbf{A}\) containing the linear coefficients up to scale factors. Note that the columns of this matrix containing the quadratic coefficients are directly provided by Algorithm 1 (up to scale factors too).
4 Simulation results
In this section, we present and discuss the results obtained by the proposed B-SO-BSS method presented in Algorithm 1 to unmix the bilinear mixtures, with adding the step described in Sect. 3.6 to estimate the actual mixing coefficients. Herein, we just present the results obtained when the sources are non-zero-mean since we found nearly the same performance for both zero-mean and non-zero-mean cases. In our simulations, the processed data are non-negative, and hence it is possible to compare the obtained results to those obtained by the NMF-Grd-LQ algorithm presented in Meganem et al. (2014b) which is an NMF-based method adapted to LQ mixtures, exploiting the non-negativity of data involved in mixtures. Note that the physical constraints of the NMF-Grd-LQ algorithm, originally adapted to remote sensing applications, i.e the sum of the linear coefficients equal to 1 and the quadratic mixing coefficients lower than 0.5, have been omitted in our simulations and the NMF-Grd-LQ method has been modified accordingly.
4.1 Performance criteria
In order to evaluate the performance of the methods, we calculate the Signal-to-Interference Ratio (SIR) and the Normalized Mean Square Error (NMSE) related to each actual centred source according to the following equations
where N represents the number of available samples for each signal and the notation ‘ \(\hat{}\) ’ refers to the estimated values after removing the permutation and scale factor indeterminacies. In the same way, we calculate \(SIR_a\) and \(NMSE_a\) related to all the mixing parameters, \(a_j\) and \(a_{j,k}\), estimated according to Sect. 3.6.
4.2 Tests
We performed the following four experiments:
Experiment 1
We considered artificial mixtures of synthetic sources. The mixing parameters \(a_j(i)\) and \(a_{j,k}(i)\) were generated randomly with values uniformly distributed between 0 and 1. The generation of two sources was realized as follows: at first, we generated two independent and identically distributed (i.i.d.) signals \(e_1(n)\) and \(e_2(n)\) uniformly distributed over [0, 1], then we filtered them by two first-order auto-regressive filters in order to obtain two auto-correlated source signals according to the model \(s_i(n)=e_i(n)+ \rho _i s_i(n-1)\). The chosen parameters were \(\rho _1=0.7\) and \(\rho _2=0.5\). The tests were repeated using different source sample numbers N: 10,000, 1000, and 100. Finally, three observed signals were generated using the BL model (2).
Experiment 2
We generated artificial mixtures of two real-world sources. These sources, shown in Fig. 1 and described in Duarte et al. (2014), correspond to the activities (which can be seen as effective ionic concentrations) of ions Na\(^+\) and K\(^+\) measured for 41 samples. As in Experiment 1, the mixing parameters \(a_j(i)\) and \(a_{j,k}(i)\) were generated with random values uniformly distributed between 0 and 1. Thereafter, we generated three observed signals using the BL model (2), even if the mixture model of the concentrations of the chemical species is usually approximated by a linear-quadratic model.
Experiment 3
The third experiment aims at evaluating the robustness of our method to noise. The mixtures were, first, generated in the same way as in Experiment 1, with \(N=10{,}000\). A zero-mean Gaussian i.i.d noise was then added to the observed signals in order to obtain a noisy setting. The SNR (Signal to Noise Ratio) values are varied from 60 dB down to 30 dB.
Experiment 4
With the same strategy concerning noise effect adopted in Experiment 3, we added a zero-mean Gaussian i.i.d. noise to the observed signals generated in Experiment 2. As in Experiment 3, the SNR values are varied from 60 dB down to 30 dB.
4.3 Results
In order to separate the mixed sources and to estimate the mixing parameters, we applied the steps described in Algorithm 1 and Sect. 3.6 in two different configurations: using only one lag (\(\tau _i=1\)), and using 4 lags (\(\tau _i=\{1,2,3,4\}\)).
In the first two experiments, we performed 100 Monte Carlo simulations for our method and NMF-Grd-LQ. At each simulation, we, randomly, modified the source signals and the mixing parameters in the case of Experiment 1, and only the mixing parameters in the case of Experiment 2. The mean of SIR and NMSE of the sources and all the mixing parameters, and the CPU time,Footnote 4 averaged over all 100 simulations realized in Experiment 1 and Experiment 2, are shown in Table 1.
As can be seen, our proposed method leads to the best results. In terms of run-time, the proposed method is much faster than NMF-Grd-LQ.
Moreover, to evaluate the quality of the source estimation, comparisons are then provided in Figs. 2 and 3 between the actual and estimated sources obtained by different methods. Indeed, the actual sources used in Fig. 2 correspond to an example of Experiment 1 where \(N = 1000\) and the random mixing matrix \(\mathbf{A}_\mathbf{Exp1}\) is given by
while Fig. 3 shows an example of Experiment 2 where the random mixing matrix \(\mathbf{A}_\mathbf{Exp2}\) is as follows
We notice in Fig. 2, which only shows 50 samples corresponding to \(n \in [200,249]\) for the sake of clarity, that both synthetic sources are much better estimated by our method than by the NMF-Grd-LQ method. Figure 3 shows that both chemical sources are well estimated by our method, while only one source is well estimated by the NMF-Grd-LQ method.
In the following, our goal is to evaluate the performance of the proposed method with only one, then four lags, when the observed signals are corrupted by noise. Moreover, comparisons with the NMF-Grd-LQ method are carried out.
In the last two experiments, we performed 100 Monte Carlo simulations and, at each simulation, we modified the source signals and the mixing parameters in case of Experiment 3, and only the mixing parameters in case of Experiment 4.
The mean of the SIR and NMSE of the sources and of all the mixing parameters obtained for these two experiments are presented in Tables 2 and 3. For clarity, \(SIR_s\) representation was performed versus the SNR values as shown in Fig. 4 (Experiment 3) and Fig. 5 (Experiment 4).
Considering Experiment 3, it is noted that the \(SIR_s\) obtained by the B-SO-BSS method are acceptable down to SNR equal to 40 dB. By comparing these results with those obtained by NMF-Grd-LQ, we notice that for high SNR values, B-SO-BSS gives the best results, however for low ones, NMF-Grd-LQ seems more efficient. Indeed, it can be seen that the \(SIR_s\) obtained by NMF-Grd-LQ remain acceptable down to \(SNR=30\) dB.
In the case of Experiment 4, the results obtained by the B-SO-BSS method are acceptable down to SNR equal to 50 dB. On the contrary, for all considered SNR values, NMF-Grd-LQ fails to yield accurate enough estimates.
In that regard, as expected, the presence of noise in the observed signals decreases the separation performance of our method. Nevertheless, performance remains acceptable for relatively high SNR values.
5 Conclusion
In this paper, we proposed a new and fast BSS method, called Bilinear Second-Order Blind Source Separation (B-SO-BSS), which is an extension of linear SOS methods, to separate sources mixed according to the bilinear model. First, we studied the statistical properties of the different extended sources when the actual sources are zero-mean and when they are not. Then, we presented the different steps performed in order to separate the actual centred sources and to estimate the actual mixing parameters. Finally, we presented the experimental results, obtained by our proposed method. As a first step, we evaluated the method separation performance when applied to noiseless artificial mixtures of synthetic or chemical sources. We, therefore, clearly noticed the effectiveness of our method as compared to the NMF-Grd-LQ method. As a second step, we evaluated the method robustness to noise, and as expected, we noticed that the presence of noise in the generated mixtures decreases the effectiveness of our method. However, the performance remained acceptable for relatively high SNR values.
Notes
Note that in the linear SOBI, at most one source may be temporally uncorrelated.
In the special case of \(L=2\) actual centred sources, it is also possible to identify the estimated actual centred sources among 3 estimated extended centred sources using a criterion measuring statistical independence, like mutual information. Actually, we know that \(\widetilde{s}_1\) and \(\widetilde{s}_2\) are mutually independent while \(\widetilde{s}_1\times \widetilde{s}_2\) is not independent from \(\widetilde{s}_1\) and \(\widetilde{s}_2\).
In fact, the columns of the matrix estimated by Algorithm 1 are also permuted. However, using the method explained in Sect. 3.5, we can identify the columns containing linear coefficients and the columns containing quadratic ones. It is then possible to arrange the columns of the estimated matrix so that the first L ones correspond to the linear part (their order is not really important) and the other ones are matched correctly to these first L columns.
Computation has been performed with Matlab, on a computer with an intel core i7 processor, a frequency of 2.7 GHz, and a RAM size of 16 GB.
References
Abed-Meraim, K., Belouchiani, A., & Hua, Y. (1996). Blind identification of a linear-quadratic mixture of independent components based on joint diagonalization procedure. In Proceedings of the international conference on acoustics, speech, and signal processing (ICASSP-96), Atlanta, GA, USA, pp. 2718–2721.
Almeida, M. S. C., & Almeida, L. B. (2012). Nonlinear separation of show-through image mixtures using a physical model trained with ICA. Signal Processing, 92(4), 872–884.
Ando, R. A., Duarte, L. T., Jutten, C., & Attux, R. (2015). A blind source separation method for chemical sensor arrays based on a second order mixing model. In 23rd European signal processing conference (EUSIPCO-2015) (pp. 933–937). France: Nice.
Bedoya, G. (2006). Non-linear blind signal separation for chemical solid-state sensor arrays. Ph.D. thesis, Universitat Politecnica de Catalunya
Belouchrani, A., Abed-Meraim, K., Cardoso, J. F., & Moulines, E. (1997). A blind source separation technique using second-order statistics. IEEE Transactions on Signal Processing, 45(2), 434–444.
Castella, M. (2008). Inversion of polynomial systems and separation of nonlinear mixtures of finite-alphabet sources. IEEE Transactions on Signal Processing, 56(8), 3905–3917.
Comon, P., & Jutten, C. (2010). Handbook of Blind Source Separation. Independent Component Analysis and Applications. Oxford: Academic Press.
Deville, Y. (2016). Blind Source Separation and Blind Mixture Identification Methods. New York: Wiley. doi:10.1002/047134608X.W8300.
Deville, Y., & Duarte, L. T. (2015). An overview of blind source separation methods for linear-quadratic and post-nonlinear mixtures. In Proceedings of the 12th international conference on latent variable analysis and signal separation (LVA/ICA 2015), Springer International Publishing Switzerland, LNCS 9237 (pp. 155–167). Liberec: Czech Republic.
Deville, Y., & Hosseini, S. (2007). Blind identification and separation methods for linear-quadratic mixtures and/or linearly independent non-stationary signals. In Proceedings of the international symposium on signal processing and its applications (ISSPA-2007) (pp. 1–4). United Arab Emirates: Sharjah.
Deville, Y., & Hosseini, S. (2009). Recurrent networks for separating extractable target nonlinear mixtures. Part I: Non-blind configurations. Signal Processing, 89(4), 378–393.
Duarte, L. T., Tomazeli, L., Jutten, C., & Moussaoui, S. (2011). Bayesian source separation of linear and linear-quadratic mixtures using truncated priors. Journal of Signal Processing Systems, 65(3), 311–323.
Duarte, L. T., Romano, J. M. T., Jutten, C., Chumbimuni-Torres, K. Y., & Kubota, L. T. (2014). Application of blind source separation methods to ion-selective electrode arrays in flow-injection analysis. IEEE Sensors Journal, 14(7), 2228–2229.
Eches, O., & Guillaume, M. (2014). A bilinear-bilinear nonnegative matrix factorization method for hyperspectral unmixing. IEEE Geoscience and Remote Sensing Letters, 11(4), 778–782.
Hosseini, S., & Deville, Y. (2013). Recurrent networks for separating extractable target nonlinear mixtures. Part II: Blind configurations. Signal Processing, 93, 671–683.
Jarboui, L., Hosseini, S., Deville, Y., Guidara, R., & Ben-Hamida, A. (2014) A new unsupervised method for hyperspectral image unmixing using a linear-quadratic model. In Proceedings of the 1st international conference on advanced technologies for signal and image processing (ATSIP-2014) (pp. 423–428).
Jarboui, L., Hosseini, S., Guidara, R., Deville, Y., & Ben-Hamida, A. (2016) A MAP-based NMF approach to hyperspectral image unmixing using a linear-quadratic mixture model. In Proceedings of the international conference on acoustics, speech and signal processing (ICASSP-2016) (pp. 3356–3360).
Krob, M., & Benidir, M. (1993) Blind identification of a linear-quadratic model using higher-order statistics. In Proceedings of the international conference on acoustics, speech, and signal processing (ICASSP-93) (Vol. 4, pp. 440–443).
Liu, Q., & Wang, W. (2013). Show-through removal for scanned images using non-linear NMF with adaptive smoothing. In Proceedings of ChinaSIP, Beijing (pp. 650–654).
Meganem, I., Déliot, P., Briottet, X., Deville, Y., & Hosseini, S. (2014a). Linear-quadratic mixing model for reflectances in urban environments. IEEE Transactions on Geoscience and Remote Sensing, 52(1), 544–558.
Meganem, I., Deville, Y., Hosseini, S., Déliot, P., & Briottet, X. (2014b). Linear-quadratic blind source separation using NMF to unmix urban hyperspectral images. IEEE Transactions on Signal Processing, 62(7), 1822–1833.
Merrikh-Bayat, F., Babaie-Zadeh, M., & Jutten, C. (2011). Linear-quadratic blind source separating structure for removing show-through in scanned documents. International Journal on Document Analysis and Recognition (IJDAR), 14(4), 319–333.
Taleb, A. (2002). A generic framework for blind source separation in structured nonlinear models. IEEE Transactions on Signal Processing, 50, 1819–1830.
Tong, L., Soon, V. C., Huang, Y. F., & Liu, R. (1990) Amuse: A new blind identification algorithm. In Proceedings of the international symposium on circuits and systems (ISCAS-1990) (Vol. 3, pp. 1784–1787).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Jarboui, L., Deville, Y., Hosseini, S. et al. A second-order blind source separation method for bilinear mixtures. Multidim Syst Sign Process 29, 1153–1172 (2018). https://doi.org/10.1007/s11045-017-0493-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11045-017-0493-9