1 Introduction

Frequency-modulated signals also known as chirp signals are frequently observed in nature. Bats and dolphins use chirp signals for locating food and navigating in the skies or oceans [23]. Similarly, in order to communicate with each other, whales use chirp-like signals to sing long and complex songs. In radar, sonar and remote sensing, chirp signals are used to navigate and detect the desired targets [14]. In communication systems, information can be transmitted using chirps, where transmitter encodes the information by phase and frequency modulations. In astrophysics, gravitational waves that emerge due to the disturbances in the curvature of space–time also posses chirp-like characteristics [5]. Chirp signals are also used for jamming in electronic warfare because of their wideband characteristics.

Detection of weak chirp signals in a very noisy environment is important for many military and commercial applications [5]. Commonly used methods for signal detection include energy detector, matched filter and detectors exploiting cyclostationarity [6, 30, 31]. The performance of these methods depends on the use of available prior knowledge about signals. For example, if the signal is known to the receiver, matched filter is an optimal detector [3]. Energy detector on the other hand does not require knowledge about the signal, but requires the noise power to be known [30, 31]. Energy detector performs very poorly in the case of noise uncertainty as, in practice, obtaining accurate estimates of noise powers is not possible [2, 21, 25, 25, 26]. Hence, there is a need of completely blind detection schemes that require very little or no prior information about signal and noise [3, 6, 30, 31].

In many practical cases received signals are correlated in space or time due to the presence of spatially or temporally correlated channels oversampling at the receiver, or because of the inherent characteristics of a transmitted signal [28, 32]. In order to achieve better detection performances, extra features such as spatial or temporal correlation could be exploited in addition to signal energy [2, 3]. These correlations can be considered as side information that can also be used to bring robustness to the detection process against noise power uncertainty [2, 3, 31]. However, these methods are used for stationary signals and are expected to under-perform for non-stationary signals.

Time–frequency distributions (TFDs) are powerful tools for the analysis of non-stationary FM signals. TFDs concentrate energy of such non-stationary signals along their instantaneous frequencies while spreading the noise in entire tf plane. One commonly used tf signal detection approach is to employ matched filtering or correlation [20]. A given signal is transformed in the tf domain and then correlated with a set of predetermined templates. These methods have been successfully applied for the detection of seizures in newborns [20]. Experimental results have shown that tf matched filtering can outperform conventional time-domain matched filter in case of non-Gaussian noise [20]. However, effective implementation of these methods requires prior information regarding the nature of underlying signal to prepare templates [9]. For example, in case of EEG signals piecewise linear frequency modulation characteristics of the seizure signals are exploited for preparing templates [9]. Similarly, signal energy concentrated along the instantaneous frequency (IF) of a frequency-modulated signal can be used as a test statistic for the detection of non-stationary signals [22]. This method does not require any prior information regarding the IF structure of the given signal.

The aforementioned tf-based detection methods either assume known signals, or only exploit signal energy along IF curves for signal detection and fail to exploit spatial or temporal correlation in slowly changing amplitudes of signal components. In this study, we develop a new completely blind scheme for signal detection that:

  1. 1.

    is completely blind and does not require prior information regarding the structure of the desired signal.

  2. 2.

    assumes no knowledge about the noise and performs well even when noise power is unknown. Therefore, the proposed scheme is robust against the noise power uncertainty.

  3. 3.

    is equally applicable to both mono-sensor and multi-sensor recordings.

In order to achieve the above goals, the proposed detection scheme uses TFD to estimate IF estimation, which is then employed to de-chirp the given signal. The de-chirping results in a temporally correlated stationary version of a non-stationary signal. This inherent correlation is then exploited using covariance matrix-based approach for signal detection in both mono-sensor and multi-sensor recordings.

Experimental results indicate the superiority of the proposed method as compared to conventional time-domain approaches while considering a challenging scenario when noise power is uncertain.

2 Time–Frequency Mono-sensor Detector

2.1 Signal and System Model

In this section we consider a receiver with a single sensor that receives N samples of the desired signals in the presence of additive white Gaussian noise (AWGN) with uncertain noise power [25]. These N observed samples are stacked in a \(N\times 1\) vector \(\mathbf {x}\), and the detection model for the received signal can be described as:

$$\begin{aligned} \mathcal {H}_{\text {1}}&: \mathbf {x}=\mathbf {s}+\mathbf {w}\nonumber \\ \mathcal {H}_{0}&: \mathbf {x}=\mathbf {w}, \end{aligned}$$
(1)

where \(\mathbf {s}=\left[ s\left[ 1\right] ,s\left[ 2\right] ,\ldots ,s\left[ N\right] \right] \) with \(s(n)=a(n)\exp \left( j\varphi \left( n\right) \right) \) being the nth sample of noise-free desired signal, in which \(\varphi \left( n\right) \) is the instantaneous phase and a(n) is the slowly varying instantaneous amplitude whose spectrum does not overlap with the spectrum of \(e^{j{\varphi }(n)}\). With this received signal model, the goal is to design robust detection scheme against noise power uncertainty.

2.2 Proposed Detection Scheme

Existing covariance-based approaches [1, 3, 4, 30] are blind and assume no a priori knowledge. They are robust against the noise power uncertainty as they instantly estimate noise powers. However, these methods exploit temporal correlation assuming that the given signal is stationary. The aim of this work is to extend the aforementioned detector for frequency-modulated signals. The aforementioned model of FM signals indicate that instantaneous amplitude of such signals is correlated in time, while non-stationarity is caused by \(e^{j{\varphi }(n)}\). In the following section, a tf approach will be developed to stationarize the received signal by first estimating the instantaneous frequency and then using it to stationarize the received signal. For the case of non-stationary process, the proposed scheme that exploits both tf techniques and temporal correlation is described in the following subsections and illustrated in Fig. 1.

Fig. 1
figure 1

Steps for stationarization of the received signals at mono-sensor receiver

2.2.1 Computation of Time–Frequency Distribution

A TFD of a received signal \(\left\{ x\left( n\right) \right\} _{n=1}^{N}\) is computed as:

$$\begin{aligned} \rho \left[ n,k\right] =\sum _{m} G\left[ n,m\right] \underset{n}{*}x\left[ n+m\right] x^{*}\left[ n-m\right] e^{-\frac{j2{\pi }mk}{N}}, \end{aligned}$$
(2)

where n is time index, k is frequency bin, N is total number of time-domain samples, and \(G\left[ n,m\right] \) is a time-lag kernel. In this study, we have selected modified B distribution (MBD) though we can employ other sophisticated methods such as locally adaptive directional TFDs to improve robustness against noise [7, 10, 15, 19]. The selected time-lag kernel is defined as:

$$\begin{aligned} G[n,m]=\frac{\cosh ^{-2\beta }[n]}{\sum _i{\cosh ^{-2\beta }[i]}}, \end{aligned}$$
(3)

where \(\beta =0.2\) in this study.

2.2.2 Instantaneous Frequency and Phase Estimation

After having clear TFD \(\rho \left[ n,k\right] \) of the received signals, we estimate the instantaneous frequency of the signal by detecting the location of the peak frequency at each time instant as:

$$\begin{aligned} \hat{f}\left[ n\right] =\underset{k}{\text {arg }\text {max}}\rho \left[ n,k\right] . \end{aligned}$$
(4)

Note that we can employ sophisticated methods such as random sample consensus (RANSAC) and Viterbi-based algorithms to obtain accurate estimate of the instantaneous frequency [12, 24]. For continuous time signals, the instantaneous phase, i.e., \(\varphi (t)\), is related to instantaneous frequency, i.e., f(t), by the integral operation [8].

$$\begin{aligned} \varphi \left( t\right) =\int _{0}^{t}{f(\tau )\hbox {d}\tau }. \end{aligned}$$
(5)

In case of discrete signals, the instantaneous phase of the signal is estimated using the following operation:

$$\begin{aligned} \hat{\varphi }\left[ n\right] =\hat{\varphi }\left[ n-1\right] +\frac{2\pi }{N}\hat{f}\left[ n\right] , \end{aligned}$$
(6)

where N is the total number of frequency bins in a TFD. Note that the above-mentioned expression can estimate phase up to a constant as initial phase is not known and assumed to be zero, i.e., \(\hat{\varphi }\left[ n\right] =0\).

2.2.3 Time–Frequency Filtering (De-chirping)

Once we have the estimate of instantaneous phase, the next step is to de-chirp the given signal using the estimated phase. The de-chirping operation converts a non-stationary signal into a stationary signal by removing the frequency modulation. The de-chirping operation has found widespread applications in non-stationary signal processing, such as component extraction, signal analysis, parameter estimation, instantaneous frequency estimation [11, 13, 18, 27]. In case of a multi-component signal, the strongest component is converted into DC or stationary signal. The process in the case of hypothesis \(\mathcal {H}_{1}\) can be represented as:

$$\begin{aligned} \begin{aligned} y\left[ n\right]&=x\left[ n\right] \exp \left( -j\hat{\varphi }\left[ n\right] \right) \\&=s\left[ n\right] \exp \left( -j\hat{\varphi }\left[ n\right] \right) +w\left[ n\right] \exp \left( -j\hat{\varphi }\left[ n\right] \right) \\&\approx \alpha \exp \left( -j\left( \hat{\varphi }\left[ n\right] -\varphi \left[ n\right] \right) \right) +\tilde{w}\left[ n\right] \\&\quad \quad \alpha \exp \left( -j\phi \right) +\tilde{w}\left[ n\right] \\&\approx \tilde{s}\left[ n\right] +\tilde{w}\left[ n\right] , \end{aligned} \end{aligned}$$
(7)

where \(\phi \) is a constant phase. The above equation is based on the assumption that estimated instantaneous phase is approximately equal to original instantaneous phase up to a constant, i.e., \(\hat{\varphi }[n]\approx \varphi [n]+\phi \). The above equation illustrates that the de-chirped signal, i.e., \(\tilde{s}\left[ n\right] =\alpha \exp \left( -j\phi \right) \), is a constant as it has no time-varying part. Note that this derivation is based on assumption that the observed signal is a frequency-modulated signal, i.e., \(s[n]=\alpha \exp \left( -j{\varphi }\left[ n\right] \right) \), with no variations in instantaneous amplitude. Such signals are frequently encountered in radars. The tf signature of such de-chirped signals is a straight line parallel to the time axis in the tf plane [29], e.g., Fig. 2 illustrates how tf representation of a non-stationary signal changes when it is stationarized by de-chirping.

Fig. 2
figure 2

Transformation of a non-stationary frequency-modulated signal into a stationary signal by de-chirping. a Time–frequency representation of non-stationary quadratic chirp. b Time–frequency representation of a stationarized signal

The uncorrelated white Gaussian noise remains uncorrelated with the same power even after de-chirping, i.e.,

$$\begin{aligned} E[\tilde{w}^{*}[n]\tilde{w}[n+l]]=e^{j2{\pi }(\theta (n+l)-\theta (n))}E[w^{*}[n]w[n+l]]={\sigma }^{2}\delta (l). \end{aligned}$$
(8)

Now the \(N\times 1\) vector \(\mathbf {y}=\left[ y\left[ 1\right] ,y\left[ 2\right] ,\ldots ,y\left[ N\right] \right] ^\mathrm{T}\) contains stationary version of non-stationary signals in vector \(\mathbf {x}\). The detection problem becomes

$$\begin{aligned} \begin{aligned} \mathcal {H}_{\text {1}}:&\mathbf {y}=\tilde{\mathbf {s}}+\tilde{\mathbf {w}}\\ \mathcal {H}_{0}:&\mathbf {y}=\tilde{\mathbf {w}}, \end{aligned} \end{aligned}$$
(9)

where \({\tilde{\mathbf {s}}}=\left[ \tilde{s}\left[ 1\right] ,\tilde{s}\left[ 2\right] ,\ldots ,\tilde{s}\left[ N\right] \right] \) and \({\tilde{\mathbf {w}}}=\left[ \tilde{w}\left[ 1\right] ,\tilde{w}\left[ 2\right] ,\ldots ,\tilde{w}\left[ N\right] \right] \). The covariance matrices of \(\mathbf {y}\) and \(\tilde{\mathbf {s}}\) are given as:

$$\begin{aligned} \varvec{\Sigma }_{\mathbf {y}}&=\text {E}[\mathbf {y}(n)\mathbf {y}^\mathrm{T}(n)]\\ \varvec{\Sigma }_{\mathbf {s}}&=\text {E}[\tilde{\mathbf {s}}(n)\tilde{\mathbf {s}}^\mathrm{T}(n)]\\ \varvec{\Sigma }_{\mathbf {w}}&=\text {E}[\tilde{\mathbf {w}}(n)\tilde{\mathbf {w}}^\mathrm{T}(n)]. \end{aligned}$$

These two covariance matrices \(\varvec{\Sigma }_{\mathbf {y}}\) and \(\varvec{\Sigma }_{\mathbf {s}}\) can be related as:

$$\begin{aligned} \varvec{\Sigma }_{\mathbf {y}}=\varvec{\Sigma }_{\mathbf {s}} +\varvec{\Sigma }_{\mathbf {w}}, \end{aligned}$$
(10)

where \(\varvec{\Sigma }_{\mathbf {w}}={\sigma }^{2}\mathbf {I}\) as noise remains uncorrelated even after de-chirping. Off-diagonal elements of signal covariance matrix \(\varvec{\Sigma }_{\mathbf {s}}\) capture temporal correlation between signals that can be exploited as side information to detect the presence of the desired signal in the observed signals. Exploiting the temporal correlation using covariance matrix as a priori information can effectively circumvent the problem against the noise power uncertainty faced by the traditional energy-based detection mechanisms.

2.2.4 Generalized Likelihood Ratio Test

The obtained signals, \(\mathbf {y}=\left[ y\left[ 1\right] ,y\left[ 2\right] ,\ldots ,y\left[ N\right] \right] ^\mathrm{T}\), posses temporal correlation. In order to exploit the temporal correlation jointly with the energy of the signals, we need to solve the following generalized likelihood ratio test (GLRT) equation [4]:

(11)

where \(f\left( \mathbf {y};\varvec{\varSigma }_{1,\mathbf {y}}\right) \) and \(f\left( \mathbf {y};\varvec{\varSigma }_{0,\mathbf {y}}\right) \) are the probability distributions (likelihood functions) with \(\varvec{\varSigma }_{1,\mathbf {y}}\) and \(\varvec{\varSigma }_{0,\mathbf {y}}\) exhibiting temporal correlation present in \(\mathbf {y}\) under hypothesis \(\mathcal {H}_{1}\) and \(\mathcal {H}_{0}\), respectively. In (11), \(\underset{\varvec{\varSigma }_{1,\mathbf {y}}}{\text {max}}f\left( \mathbf {y};\varvec{\varSigma }_{1,\mathbf {y}}\right) \) and \(\underset{\varvec{\varSigma }_{0,\mathbf {y}}}{\text {max}}f\left( \mathbf {y};\varvec{\varSigma }_{0,\mathbf {y}}\right) \) are maximum likelihood estimation-based solutions under hypothesis \(\mathcal {H}_{1}\) and \(\mathcal {H}_{0}\), respectively. Solving (11), we get

(12)

where \(\text {det}\) is short for determinate, \(\hat{\varvec{\varSigma }}_{1,\mathbf {y}}=\text {E}\left[ \mathbf {y}\mathbf {y}^{H}\right] \), and \(\hat{\varvec{\varSigma }}_{0,\mathbf {y}}\approx \text {diag}\hat{\varvec{\varSigma }}_{1,\mathbf {y}}\) [3]. The traditional GLRT scheme would be similar to (12) but replacing \(\hat{\varvec{\varSigma }}_{1,\mathbf {y}}\) with \(\hat{\varvec{\varSigma }}_{1,\mathbf {x}}=\text {E}\left[ \mathbf {x}\mathbf {x}^{H}\right] \) in (11). It is to be noted that \(\mathbf {x}\) is the preprocessed version of \(\mathbf {y}\). We further remark that GLRT is asymptotically an optimal detection method [2].

2.2.5 Covariance Absolute Value Detector

The covariance absolute value (CAV) detector is a ratio between the sum of absolute value of elements of the sample covariance matrix \(\hat{\varvec{\varSigma }}_{1,\mathbf {y}}\) and the sum of diagonal elements of it. Let \(T_{1}\) be defined as sum of all elements of \(\hat{\varvec{\varSigma }}_{1,\mathbf {y}}\)

$$\begin{aligned} T_{1}(\mathbf {y})=\sum _{n=1}^{N}{\sum _{m=1}^{N}{\hbox {abs}(\lambda _{nm})}}, \end{aligned}$$
(13)

where \(\lambda _{nm}\) represents the elements of covariance matrix. Let \(T_{2}\) be defined as sum of all elements of \(\hat{\varvec{\varSigma }}_{0,\mathbf {y}}\).

$$\begin{aligned} T_{2}(\mathbf {y})=\sum _{n=1}^{N}{\text {abs}(\lambda _{nn})}. \end{aligned}$$
(14)

If signal is not present, i.e., \(\varvec{\Sigma }_{\mathbf {s}}=0\), then all off-diagonal elements of \(\varvec{\varSigma }_{\mathbf {y}}\) become equal to zero. In such scenario, sum of all elements of \(\varvec{\varSigma }_{\mathbf {y}}\) would be equal to sum of its diagonal elements, i.e., \(T_{1}=T_{2}\) [31]. In case if signal is present then all off-diagonal elements of \(\varvec{\varSigma }_{\mathbf {y}}\) would be nonzero, which will make \(T_{1}>T_{2}\) [31]. This implies we can use the following test static for signal detection.

(15)

The traditional CAV detector scheme is similar to the proposed methods expressed in Eq. (15). For original time-domain detector, the covariance matrix is estimated from original time-domain signal, whereas the proposed method uses de-chirped signal, i.e., \(\hat{\varvec{\varSigma }}_{1,\mathbf {y}}\) has replaced \(\hat{\varvec{\varSigma }}_{1,\mathbf {x}}=\text {E}\left[ \mathbf {x}\mathbf {x}^{H}\right] \).

2.3 Performance Comparison

Let us consider a non-stationary signal defined as:

$$\begin{aligned} s=e^{2{\pi }j(an+bn^{3})}, \end{aligned}$$
(16)

where \(a=0.5\) and \(b=1.2207e-05\). The given signal is corrupted with white Gaussian noise with uncertain noise power. The uncertain noise power is generated as:

$$\begin{aligned} \sigma _{un}^{2}\sim \mathcal {U}\left( \frac{\sigma ^{2}}{\alpha _{nu}}, \alpha _{nu}\sigma ^{2}\right) , \end{aligned}$$
(17)

where \(\alpha _{nu}\ge 1\) and \(\alpha _{nu}=1\) means no noise uncertainty [25]. In this study, we have assumed \(\alpha _{nu}=2\). It is to be noted that \(\sigma ^{2}\) is mean noise power of \(\sigma _{un}^{2}\). Such an uncertainty in the noise power knowledge deteriorates the performance of detection schemes. In the presence of such an uncertainty, the performance of the proposed tf signal detection techniques is compared with original time-domain covariance methods [31], ridge energy detector [22], tf correlator [20] and energy detector using receiver operating characteristic (ROC) curve analysis, whereas the energy detector is formulated as:

(18)

The ROC curve obtained using GLRT and CAV detectors are shown in Figs. 3 and 4, respectively. It can be observed that the proposed schemes perform better than the traditional schemes. Areas under each ROC curve (AUCs) for these two approaches are given in Table 1. Note that the AUC is always between 0.5 (worst) and 1.0 (best). It is because the worst ROC curve lies along the diagonal and it has an area of 0.5. The best ROC curve has an area of 1. All of the above experimental results indicate that the proposed signal detection method outperforms the conventional schemes.

Fig. 3
figure 3

Performance comparison between tf generalized likelihood ratio test versus time-domain generalized likelihood test [3, 4], tf ridge detector [22], tf correlator [20] and energy detector Eq. (18). Noise power uncertainty parameter \(\alpha _{nu}=2\)

Fig. 4
figure 4

Performance comparison between tf covariance absolute value detector versus time-domain covariance absolute value detector [31], tf ridge detector [22], tf correlator [20] and energy detector Eq. (18). Noise power uncertainty parameter \(\alpha _{nu}=2\)

In order to further analyze the proposed schemes in comparison with the traditional schemes we plot the probability of detection at a false alarm rate of 1% versus SNR. These plots are shown in Fig. 5 for GLRT and in Fig. 6 for CAV detectors. Once again we can clearly see that the proposed methods achieve better performance as compared to traditional methods. This improved performance is due to conversion of a non-stationary signal into stationary one by the use of tf processing thus allowing us to exploit temporal correlation in addition to signal energy. These results are plotted for the noise power uncertainty value \(\alpha _{nu}=2\).

Table 1 Area under ROC curve of tf detectors versus conventional detection methods
Fig. 5
figure 5

Performance comparison between tf GLRT versus traditional GLRT [3, 4]. Noise power uncertainty parameter \(\alpha _{nu}=2\)

Fig. 6
figure 6

Performance comparison between tf CAV detector versus traditional CAV detector [31]. Noise power uncertainty parameter \(\alpha _{nu}=2\)

3 Extension to Multi-sensor Detection Scheme

In this section we consider the multi-sensor detector at the receiver side to exploit the spatial correlation in addition to temporal correlation [3]. Earlier signal detection methods given in [3] assume stationary signals and are expected to under-perform for the detection of non-stationary signals. In this study, we extend these conventional multi-sensor signal detection approaches to non-stationary signals by using tf processing. The proposed multi-sensor schemes are similar to mono-sensor signal detection schemes in a way that the key step of signal stationarization is common in both methods. However, in order to exploit spatial diversity for multi-sensor recordings, the following additional steps are performed.

  • Spatial averaging of TFDs computed at each sensor is performed to overall SNR, thus improving the accuracy of IF estimate.

  • Spatiotemporal GLRT and spatiotemporal CAV detectors are employed instead of temporal GLRT and temporal CAV to exploit both spatial diversity and temporal diversity.

3.1 Multi-sensor Signal Model

Lets us assume that M sensors are available to detect the non-stationary signal received from the source. When the signal is present then at the mth sensor, the received signal vector can be represented as: \(\mathbf {x}_{m}=\mathbf {s}_{m}+\mathbf {w}_{m}\), whereas in the absence of source signal, we have \(\mathbf {x}_{m}=\mathbf {w}_{m}\). Hence, the signal model at the mth sensor of the receiver can be written as:

$$\begin{aligned} \begin{aligned} \mathcal {H}_{1}:&\mathbf {x}_{m}=\mathbf {s}_{m}+\mathbf {w}_{m}\\ \mathcal {H}_{0}:&\mathbf {x}_{m}=\mathbf {w}_{m}, \end{aligned} \end{aligned}$$
(19)

where \(\mathbf {x}_{m}=\left[ x_{m}\left[ 1\right] ,x_{m}\left[ 1\right] ,\ldots ,x_{m}\left[ N\right] ,\right] \) contains N samples of the observed signals at the mth sensor, \(\mathbf {s}_{m}=\left[ s_{m}\left[ 1\right] ,s_{m}\left[ 1\right] ,\ldots ,s_{m}\left[ N\right] ,\right] \) contains noiseless signal, and \(\mathbf {w}_{m}=\left[ w_{m}\left[ 1\right] ,w_{m}\left[ 1\right] ,\ldots ,w_{m}\left[ N\right] ,\right] \) contains N samples of Gaussian noise.

Now the goal is to exploit both spatial and temporal correlations in the detection process. Therefore, we need to perform the tf processing to extract the embedded spatiotemporal correlation.

3.2 Time–Frequency Processing

The process of stationarization for multi-sensor detection scheme is illustrated in Fig. 7. Just like in the mono-sensor case, quadratic TFD is computed for the signal acquired at the mth sensor as:

$$\begin{aligned} \rho _{m}\left[ n,k\right] =\sum _l G\left[ n,l\right] \underset{n}{*}x_{m}\left[ n+l\right] x_{m}^{*}\left[ n-l\right] e^{-\frac{2{\pi }jlk}{N}}, \end{aligned}$$
(20)

where \(m= 1,2,\ldots ,M\). These TFDs are then spatially averaged as

$$\begin{aligned} \rho _{\text {avg}}\left[ n,k\right] =\frac{1}{M}\sum _{m=1}^{M}\rho _{m}\left[ n,k\right] . \end{aligned}$$
(21)

The \(\rho _{\text {avg}}\left[ n,k\right] \) is now a single TFD . Using \(\rho _{\text {avg}}\left[ n,k\right] \), the IF \(f\left( n\right) \) and phase \(\hat{\varphi }\left( n\right) \) of the source signal can be estimated following the process given in Sect. 2.2. Once we have the estimate of instantaneous phase, the next step is to de-chirp the given signal using the estimated phase \(\hat{\varphi }\left( n\right) \). The de-chirping process at the mth sensor in the case of hypothesis \(\mathcal {H}_{1}\) can be represented as:

$$\begin{aligned} \begin{aligned} y_{m}\left[ n\right]&=x_{m}\left[ n\right] \exp \left( -j\hat{\varphi }\left( n\right) \right) \\&=\tilde{s}_{m}\left[ n\right] +\tilde{w}_{m}\left[ n\right] . \end{aligned} \end{aligned}$$
(22)

The N samples of the stationary version of signals received at M sensors are stacked in \(M\times N\) matrix \(\mathbf {Y}\) as:

$$\begin{aligned} \mathbf {Y}=\left[ \begin{array}{c} \mathbf {y}_{1}\\ \mathbf {y}_{2}\\ \vdots \\ \mathbf {y}_{M} \end{array}\right] , \end{aligned}$$
(23)

where \(\mathbf {y}_{m}=\left[ y_{m}\left[ 1\right] ,y_{m}\left[ 1\right] ,\ldots ,y_{m}\left[ N\right] ,\right] \) is the stationary version of \(\mathbf {x}_{m}\). By using vec operator we find \(\mathbf {z}=\text {vec}\mathbf {\left( Y\right) }\) as:

$$\begin{aligned} \mathbf {z}=\left[ \left[ \begin{array}{cccc} y_{1}\left[ 1\right]&y_{2}\left[ 1\right]&\cdots&y_{M}\left[ 1\right] \end{array}\right] ,\ldots ,\left[ \begin{array}{cccc} y_{1}\left[ N\right]&y_{2}\left[ N\right]&\ldots&y_{M}\left[ N\right] \end{array}\right] \right] ^\mathrm{T}. \end{aligned}$$
(24)

The \(NM\times 1\) vector \(\mathbf {z}\) is used to design detection scheme using spatiotemporal correlation in addition to the energy of the received signals.

Fig. 7
figure 7

Illustration of the process of stationarization of the received signals at multi-sensor receiver

3.3 Spatiotemporal GLRT

In this section we exploit the spatiotemporal correlation for the detection problem introduced in Eq. (19) by adopting the GLRT approach given in (11). In this case the GLRT scheme can be formulated as:

$$\begin{aligned} \begin{aligned} \varLambda _{\text {ST}}\left( \mathbf {z}\right)&=\frac{\underset{\varvec{{\Sigma }}_{{z,1}}}{\text {max}}f_{\mathbf {z}}\left( \mathbf {z};\varvec{\Sigma }_{{z,1}}\right) }{\underset{\varvec{\Sigma }_{{z,0}}}{\text {max}}f_{\mathbf {z}}\left( \mathbf {z};\varvec{\Sigma }_{{z,0}}\right) } \end{aligned} \gtrless _{\mathcal {H}_{0}}^{\mathcal {H}_{1}}\gamma , \end{aligned}$$
(25)

where \(f_{\mathbf {z}}\left( \mathbf {z},\varvec{\Sigma }_{z,0}\right) \) and \(f_{\mathbf {z}}\left( \mathbf {z};\varvec{\Sigma }_{z,1}\right) \) are the likelihood functions under hypothesis \(\mathcal {H}_{0}\) and \(\mathcal {H}_{1}\), respectively. Solving (25), we can get the final expression of the GLRT as:

$$\begin{aligned} \begin{aligned} \varLambda _{\text {ST}}\left( {\mathbf {z}}\right)&=\left| \frac{{\hat{\varvec{\Sigma }}}_{z,1}}{{\hat{\varvec{\Sigma }}}_{z,0}}\right| \gtrless _{{\mathcal {H}}_{1}}^{{\mathcal {H}}_{0}}\gamma , \end{aligned} \end{aligned}$$
(26)

where \({\hat{\varvec{\Sigma }}}_{z,1}=\text {E}\left[ \mathbf {z}\mathbf {z}^{H}\right] \) and \(\hat{\varvec{\Sigma }}_{z,0}=\text {diag}\left[ {\hat{\varvec{\Sigma }}}_{z,1}\right] \). Note that the detection scheme in (26) assumes no structure for the covariance matrix, except that the covariance matrix is symmetric.

3.4 Spatiotemporal CAV Detector

In order to use the detection scheme (25), a critical requirement is that the sample covariance matrices \({\hat{\varvec{\Sigma }}}_{z,1}\) must be non-singular and positive definite; otherwise, (26) degenerates [16] [17]. Taking into account this fact the CAV detector is useful to avoid such problems. As we have seen above that the CAV detector is a ratio between the sum of elements of the sample covariance matrix and the sum of diagonal elements of that matrix as:

$$\begin{aligned} \varLambda _{\text {CAV}}\left( {\mathbf {z}}\right) =\frac{\sum _{i}^{NM}\sum _{j}^{NM}\text {abs}\left( \lambda _{ij}\right) }{\sum _{i}^{NM}\text {abs}\left( \lambda _{ii}\right) }\lessgtr _{{\mathcal {H}}_{0}}^{{\mathcal {H}}_{1}}\gamma , \end{aligned}$$
(27)

here \(\text {abs}\left( .\right) \) is for absolute value, whereas \(\lambda _{ij}\) is the ijth element of sample covariance matrix \({\hat{\varvec{\Sigma }}}_{{z,1}}\). Test statistics (27) is not optimal like GLRT, but has robustness against the high-dimensionality effects in small sample support.

Fig. 8
figure 8

Performance comparison with ROC between the proposed tf spatiotemporal GLRT detector versus traditional spatiotemporal GLRT detector [3]. Noise power uncertainty parameter \(\alpha _{nu}=2\)

Fig. 9
figure 9

Performance comparison with ROC curve between the proposed tf spatiotemporal CAV detector versus traditional spatiotemporal CAV detector [3]. Noise power uncertainty parameter \(\alpha _{nu}=2\)

3.5 Experimental Results

Let us compare the performance of the traditional time-domain detectors exploiting both spatiotemporal correlations [3] with the proposed tf-based detectors. In order to numerically analyze these schemes, we assume that the following signal is received by \(m=3\) sensors.

$$\begin{aligned} s=e^{2{\pi }j(an+bn^{3})}, \end{aligned}$$
(28)

where \(a=0.5\) and \(b=1.2207e-05\). The given signal is corrupted with white Gaussian noise with uncertain noise power, where noise power uncertainty is modeled using Eq. (17). ROC curves illustrating the performance comparison between traditional multi-sensor spatiotemporal detectors based on GLRT and CAV [3] and their tf counterparts are shown in Figs. 8 and 9. Table 2 shows the area under the ROC curves of the two detection schemes. Experimental results indicate that the tf detection scheme performs better than the corresponding traditional time-domain detector.

Table 2 Area under ROC curve of the proposed tf detectors (tf CAV detector; tf GLRT detector versus conventional CAV detector [3] and GLRT detector [3])
Fig. 10
figure 10

Performance comparison between the proposed tf-based spatiotemporal GLRT detector versus traditional spatiotemporal GLRT detector [3] using probability of detection at a false alarm rate of 1% plotted for different SNRs. Noise power uncertainty parameter \(\alpha _{nu}=2\)

For further analysis of the above-mentioned schemes we plot probability of detection at a false alarm rate of 1% versus SNR. To compare the performance of the proposed tf schemes with the traditional schemes, we perform ROC analysis. The ROC curves for GLRT- and CAV-based detectors are shown in Figs. 10 and 11, respectively. These plots confirm the superiority of the proposed schemes over the traditional schemes because of their ability to exploit both spatial and temporal correlations, whereas the time-domain approach fails to exploit temporal correlation due to non-stationary characteristics of frequency-modulated signals.

Furthermore, we can also observe that in multi-sensor and mono-sensor cases, CAV detectors (tf and time domain) perform similar to GLRT detectors (tf and time domain) as both exploit correlations in addition to energy of the received signals though, asymptotically, GLRT is proven to outperform the CAV detector [3].

Fig. 11
figure 11

Performance comparison between the proposed tf spatiotemporal CAV detector versus traditional spatiotemporal CAV detector [3] using probability of detection at a false alarm rate of 1% plotted for different SNRs. Noise power uncertainty parameter \(\alpha _{nu}=2\)

4 Conclusion

A tf approach to signal detection has been developed for both mono-sensor and multi-sensor recordings. The proposed approach uses instantaneous frequency estimation and de-chirping procedure to remove frequency modulation, thus resulting in temporally correlated stationary signal. Experimental results demonstrate the superiority of the proposed method over existing techniques such as time-domain covariance-based signal detection method and energy detector. The proposed method has been developed aiming at the scenarios when there is an uncertainty in the noise power. The improvement in performance comes at the expense of increased computational cost of the proposed algorithm that requires the additional computations for obtaining a QTFD. The computational cost of computing a QTFD is \(O(KN^2\hbox {log}N)\), where N is number of samples in a given signal and K is number of sensors employed. The proposed method has been developed for mono-component signals. However, it can be applied to multi-component signals as well. For multi-component signals, the method would require the IF estimation of the strongest component followed by de-chirping. The IF of the strongest component can be estimated by robust IF estimation algorithm given in [24]. This topic will be further explored in a future study.