1 Introduction

Alongside booming growth in wireless communication applications, the efficient utilization of spectrum has become a critical consideration for researchers and developers. The cognitive radio (CR) [1] was first proposed for effective spectrum resource usage. This technology allows unlicensed users to access wireless channels when primary users are in an inactive state. Spectrum sensing is an essential functionality to avoid access interference [2, 3]. In a CR system, the wideband frequency spectrum is generally sparse, as not all of the subbands are occupied at the same time; active subbands can be detected by compressive sensing [4], known as sub-Nyquist sampling. The method, based on the sub-Nyquist rate, reduces computational burden and memory requirements. The Nyquist rates of wideband signals in a CR system may exceed the specifications of high speed, high-rate analog-to-digital converters (ADCs) [5], which are too power hungry for direct wideband spectrum sensing. However, sub-Nyquist sampling can solve this problem.

Numerous researches refer to acquiring multi-band signal from a rate below the Nyquist sampling rate. Compressive sensing theory is firstly put forward to sense wideband spectrum in [6]. This method reconstructs the wideband spectrum in the first step, then wavelet-based edge detection detects spectral opportunities across the wideband spectrum. A two-step CS scheme was introduced in another study [7] for minimizing sampling rate, where actual sparsity is estimated in the first time slot,then compressed measurements are adjusted in the second slot. Another study explored an adaptive compressive spectrum sensing algorithm for wideband CR [8], in which compressed measurements are adaptively adjusted without any sparsity estimation efforts to improve the throughput. Yet another study [9] developed a multi-rate asynchronous sub-Nyquist sampling (MASS) system to perform wideband spectrum sensing, robust against lack-of-time synchronization and with excellent performance in fading/shadowing scenarios. An efficient collaborative spectrum sensing method with low sample rate was presented by other research [10], employing adopting matching pursuit to detect spectrum holes in a cognitive network. All these studies, however, focused on frequency spectrum estimation or reconstructing original signals. Sparsity order estimation for cognitive radio was investigated in one previous study [11], to provide a general methodology to quantify the minimum number of samples required to estimate sparsity order for compressive spectrum sensing. These works based on sub-Nyquist sampling require an assumption of sparsity in the frequency domain.

In a typical CR system, it is not necessary to recover the original signal but to detect active subbands for dynamic spectrum access. Wideband power spectrum sensing from sub-Nyquist samples no longer requires sparsity assumption. Related works are presented in [1216]. In [12], wideband power spectrum sensing was considered in the form of inequalities, and only needed a few bits from the nodes in a sensor network. In another work [13], a single-stage frequency-domain sensing technique based on sub-Nyquist sampling was proposed to blindly detect the locations of active subbands. The approach in [14, 15] reconstructed power spectrum of wide-sense stationary signals by exploiting the cross-correlation between the outputs of different channels. A method for finite resolution approximation of power spectrum estimation was proposed in another study [16]. The schemes in [13, 14] based on multi-coset sampling is sensitive to timing jittering. It is difficult to maintain accurate time intervals between channels. The method proposed in [9] introduced an additional estimation error with the high sampling rate of ADCs. In other studies [611], methods which used normal compressive sensing reconstruction algorithm with \(\ell _{1}/\ell _{2}\) norm constraint were proposed, but these usually cause high complexity. In this study, we focus on least square (LS) reconstruction without any sparse constraint. An analog-to-information converter (AIC) [17] is employed with multi-channel parallel architecture, providing a sampler for sparse multi-band signals according to the general concept of random filtering [18]. The sampling framework is similar to the modulated wideband converter (MWC) prototype described in a previous study [19], which is implemented in a wideband receiver in practice.

The rest of this paper is organized as follows. In the second section, an analog signal sub-Nyquist sampling model is described, and the frequency spectrum relationship between samples and the original signal is established. In the third section, we describe the power spectrum relationship between samples and the original signal, which exploits the cross-power spectrum between the outputs of different channels. The process changes to solve overdetermined equations with a rank condition for reliable estimation. An adaptive resolution frequency averaging (ARFA) scheme is proposed with low complexity in the fourth section. A multiband signal is used for simulation and normalized mean square error (NMSE) is applied to measure the quality of estimation performance. The final section provides a summary and conclusion.

2 Sampling Model

Based on Fig. 1, \(M\) branches are employed along with a wide-sense stationary complex-valued analog input \(x\left( t\right) \). Only considering branch \(i\), \(x\left( t\right) \) is modulated with a pseudo random signal \(p_{i} \left( t\right) \). The signal \(p_{i}\left( t\right) \) is a periodic extension of distinct, finite-duration random square waves that share a common period NT, yielded from the piecewise constant function \(c_{i} \left( t\right) \). \(T\) is the sampling time interval on the Nyquist rate. The length of samples of each branch is \(L\). The average sampling rate is equal to the Nyquist rate multiplied by \(M/N\).

Fig. 1
figure 1

Multi-channel parallel AIC sub-Nyquist sampling framework

The relation can be expressed as follows:

$$\begin{aligned} \left\{ \begin{aligned}&p_{i}(t)= c_{i} (t_{0} ) \quad t=t_{0} +l{\textit{NT}},\; 0\le t_{0} <{\textit{NT}},l=0,1,\ldots ,L \\&c_{i}(t_{0} )=c_{i}[-n] \quad nT\le t_{0} <(n+1)T,n=0,1,\ldots ,N-1 \\&c_{i} [-n]\in \{-1,1\} \end{aligned}\right. \end{aligned}$$
(1)

The output of the i-th branch with N-fold down-sampling can be represented as

$$\begin{aligned} \begin{aligned} y_{i} [k]&=\frac{1}{{\textit{NT}}} \int _{k{\textit{NT}}}^{(k+1){\textit{NT}}}p_{i} (t)x(t)dt \\&=\frac{1}{T} \sum _{n=0}^{N-1}c_{i} [-n] \int _{(kN+n)T}^{(kN+n+1)T}x(t)dt \\&=\sum _{n=1-N}^{0}c_{i} [n] x[kN-n] \\&=s_{i}[kN] \end{aligned} \end{aligned}$$
(2)

There is a key operation of \(c_{i}(t_{0})=c_{i}[-n]\) in (1). The output of each branch can be regarded as the N-fold down-sampling of \(s_{i} [n]=c_{i} [n]*x[n]\) from (2), where \(*\) represents the linear convolution operator.

The output is equal to the N-fold down-sampling from \(s'_{i} [n]\). \(s'_{i} [n]=c_{i}'[n] \circledast x[n]\), where \(\circledast \) is the circular convolution operator. \({\mathbf{c }}_{i}'=[{\mathbf{c }}_{i}, {\mathbf{0 }}_{1\times (L-1)N}]\) (where \({\mathbf{0 }}\) is zero matrix, \({\mathbf{c }}_{i} =[c_{i} [0],c_{i} [1],\ldots ,c_{i} [N-1]\,]\)). The length of \(c'_{i}[n]\) is identical to \(x[n]\). Then following is then obtained

$$\begin{aligned} \begin{aligned} y_{i}[l]&=s'_{i}[(l+1)N-1] \\&=\frac{1}{\textit{LN}} \sum _{k=0}^{{\textit{LN}}-1} S'_{i}[k]e^{j\frac{2\pi }{{\textit{LN}}}k[(l+1)N-1]} \quad l=0,1,\ldots ,L-1 \end{aligned} \end{aligned}$$
(3)

where \(S'_{i}[k]\) is the LN-points discrete Fourier transform (DFT) of \(s'_{i}[n]\). In the frequency domain, with the circular convolution theorem:

$$\begin{aligned} S'_{i} [k]=C_{i} '[k]X[k] \end{aligned}$$
(4)

where \(C'_{i} [k]\) is LN-points DFT of \(c'_{i} [n]\), and \(X[k]\) is the LN-points DFT of \(x[n]\). We then can have the L-points DFT of \(y_{i} [l]\):

$$\begin{aligned} \begin{aligned} Y_{i}[k]&=\frac{1}{{\textit{LN}}}\sum _{l=0}^{L-1}y_{i}[l]e^{-j\frac{2\pi }{L} lk}\\&=\frac{1}{{\textit{LN}}} \sum _{l=0}^{L-1}\left[ \sum _{m=0}^{{\textit{LN}}-1}X[m]C'_{i}[m]e^{j\frac{2\pi }{{\textit{LN}}} m[(l+1)N-1]} \right] e^{-j\frac{2\pi }{L} lk}\\&=\frac{1}{N}\sum _{m}X[m]C'_{i}[m]e^{j\frac{2\pi }{{\textit{LN}}}m(N-1)}\frac{1}{L}\sum _{l=0}^{L-1}e^{j\frac{2\pi }{L}(m-k)l} \end{aligned} \end{aligned}$$
(5)

It is obvious that

$$\begin{aligned} \begin{aligned} \frac{1}{L} \sum _{l=0}^{L-1}e^{j\frac{2\pi }{L} (m-k)l} {\rm{=}}\left\{ \begin{array}{l} {1\quad m-k=0,L,2L,\ldots ,(N-1)L} \\ {0\quad others} \end{array}\right. \end{aligned} \end{aligned}$$
(6)

From (56), we can have

$$\begin{aligned} \begin{aligned} Y_{i} [k]&=\frac{1}{N} \sum _{n=0}^{N-1}X[nL+k]C'_{i} [nL+k]e^{j\frac{2\pi }{{\textit{LN}}} (nL+k)(N-1)} \\&=\frac{1}{N} \sum _{n=0}^{N-1}X[nL+k]C''_{i} [nL+k] \end{aligned} \end{aligned}$$
(7)

where \(C''_{i} [nL+k]=C'_{i} [nL+k]e^{j\frac{2\pi }{{\textit{LN}}} (nL+k)(N-1)}\). From (7), the multi-branch output can be written in matrix form as follows:

$$\begin{aligned} {\mathbf{y }}[k]=\frac{1}{N} {\mathbf{C }}''[k]{\mathbf{x }}[k]\quad k=0,1,\ldots ,L-1 \end{aligned}$$
(8)

where the relation is

$$\begin{aligned} \begin{aligned} {\mathbf{y }}[k]&=[Y_{0} [k],Y_{1}[k],\ldots ,Y_{M-1}[k]]^{T} \\ {\mathbf{x }}[k]&=[X[k],X[L+k],\ldots ,X[L(N-1)+k]^{T} \\ {\mathbf{C }}''[k]&=\left[ \begin{array}{cccc} C''_{0} [k]&{} C''_{0} [L+k]&{} \cdots &{} C''_{0} [L(N-1)+k]\\ C''_{1} [k]&{} C''_{1} [L+k]&{} \cdots &{} C''_{1} [L(N-1)+k]\\ \vdots &{} \vdots &{} \vdots &{} \vdots \\ C''_{M-1} [k]&{} C''_{M-1} [L+k]&{} \cdots &{} C''_{M-1} [L(N-1)+k]\\ \end{array} \right] \end{aligned} \end{aligned}$$
(9)

From (8), we finally obtain

$$\left[\begin{array}{c} {\mathbf{y }}[0]\\ {\mathbf{y }}[1] \\ \vdots \\ {\mathbf{y }}[L-1] \end{array}\right] =\frac{1}{N} \left[\begin{array}{ccc} {\mathbf{C }}''[0]&&{\mathbf{0 }} \\&{\mathbf{C }}''[1]&\\&\ddots&\\ {\mathbf{0 }}&&{\mathbf{C }}''[L-1] \end{array}\right] \left[\begin{array}{c} {\mathbf{x }}[0]\\ {\mathbf{x }}[1] \\ \vdots \\ {\mathbf{x }}[L-1] \end{array}\right] $$
(10)

The frequency spectrum of \(x[n]\) is expressed as:

$$\begin{aligned} {\mathbf{x }}=vec\{ [{\mathbf{x }}[0]^{T},{\mathbf{x }}[1]^{T} ,\ldots ,{\mathbf{x }}[L-1]^{T} ] \} \end{aligned}$$
(11)

where vec{.} stacks all columns of a matrix into a vector. We cannot directly obtain \({\mathbf{x }}[k]\) from (8) because \({\mathbf{C }}''[k]\) is not a full column rank matrix (where \(M<N\)).

3 Wideband Power Spectrum Estimation

In a CR system, spectrum sensing detects the active subbands rather than recovering the original signal. In this section, we estimate the power spectrum of wide-sense stationary analog input to detect the active bands. The signal exhibiting cyclostationary has an auto-correlation function \({R_x}(t,\tau )={\rm{E}}\{x(t){x^{*} }(t + \tau )\), which is periodic in the time domain with a period \(T_0\)

$$\begin{aligned} {R_x}(t + {T_0},\tau ) = {R_x}(t,\tau ) \end{aligned}$$
(12)

The power spectrum of x[n] is given as

$$\begin{aligned} P_{x} [k]=X[k]X^{*} [k] \end{aligned}$$
(13)

where \(^{*}\) denotes conjugation. The cross-power spectrum of \(y_{i}\)[k] and \(y_{j}\)[k] can be written as

$$\begin{aligned} P_{y_{i,j}}[k]=Y_{i}[k]Y_{j}^{*}[k] \end{aligned}$$
(14)

The cross-power spectrum of \(c'_{i} [n]\) and \(c'_{j} [n]\) can be computed offline as

$$\begin{aligned} P_{c_{i,j}}[k]=C'_{i}[k]{C'_{j}}^{*}[k] \end{aligned}$$
(15)

It is obvious that \(C'_{i} [k]{C'_{j}}^{*}[k]=C''_{i} [k]{C''_{j}}^{*} [k]\), then a relation similar to (8) is drawn

$$\begin{aligned} {\mathbf{p }}_{y} [k]=\frac{1}{N^{2}}{\mathbf{P }}_{C}[k]{\mathbf{p }}_{x}[k]\; \quad k=1,2,\ldots ,L \end{aligned}$$
(16)

where the relation is

$$\begin{aligned} \begin{aligned} {\mathbf{p }}_{y}[k]&=[P_{y_{0,0}}[k],P_{y_{0,1} }[k]\ldots ,P_{y_{i,j} } [k],\ldots ]^{T} \\ {\mathbf{p }}_{x}[k]&=[P_{x} [k],P_{x} [L+k],\ldots ,P_{x} [L(N-1)+k]]^{T} \\ {\mathbf{P }}_{C} [k]&=\left[ \begin{array}{cccc} P_{c_{0,0} }[k]&{} P_{c_{0,0}}[L+k]&{} \cdots &{}P_{c_{0,0} }[L(N-1)+k]] \\ P_{c_{0,1} }[k]&{} P_{c_{0,1}}[L+k]&{} \cdots &{}P_{c_{0,1} }[L(N-1)+k]] \\ \vdots &{} \vdots &{} \vdots &{}\vdots \\ P_{c_{0,0} }[k]&{} P_{c_{0,0}}[L+k]&{} \cdots &{}P_{c_{i,j} }[L(N-1)+k]] \\ \end{array} \right] \end{aligned} \end{aligned}$$
(17)

\({\mathbf{P }}_{C}[k]\) is a \(M^{2} \times N\) deterministic matrix. From (16), we finally obtain

$$ \left[\begin{array}{c} {\mathbf{p }}_{y}[0]\\ {\mathbf{p }}_{y}[1] \\ \vdots \\ {\mathbf{p }}_{y}[L-1] \end{array}\right] =\frac{1}{N^2} \left[\begin{array}{ccc} {\mathbf{P }}_{C}[0]&&{\mathbf{0 }} \\&{\mathbf{P }}_{C}[1]&\\&\ddots&\\ {\mathbf{0 }}&&{\mathbf{P }}_{C}[L-1] \end{array}\right] \left[\begin{array}{c} {\mathbf{p }}_{x}[0]\\ {\mathbf{p }}_{x}[1] \\ \vdots \\ {\mathbf{p }}_{x}[L-1] \end{array}\right]$$
(18)

The power spectrum of x[n] is written as

$$\begin{aligned} {\mathbf{p }}_{x} =vec\{ [\ldots ,{\mathbf{p }}_{x} [k],\ldots ]\} \; \; \; \; k=0,1,\ldots ,L-1 \end{aligned}$$
(19)

While the compressive sampling matrix \({\mathbf{C }}\) is given as

$$\begin{aligned} {\mathbf{C }}_{M\times N} =\left[ {\mathbf{c }}_{0} ^{T} ,{\mathbf{c }}_{1} ^{T},\ldots {\mathbf{c }}_{M-1} ^{T}\right] ^{T} \end{aligned}$$
(20)

The rank of \({\mathbf{C }}\) is \(r_{C}=rank({\mathbf{C }})\) (where \(rank ({\mathbf{X }})\) denotes the rank of matrix X). When \(M\ll N\), the value of \(r_{C}\) is usually equal to M.

Proposition

If \(\textit{M}^2 \ge \textit{N} \), \({\mathbf{P }}_{C} [k]\) is a full column rank matrix.

Proof

From (15), the element in \({\mathbf{P }}_{C}[k]\) is given

$$\begin{aligned} \begin{aligned} P_{c_{i,j} } [nL+k]&=C'_{i} [nL+k]{C'_{j}} ^{*} [nL+k] \\&=\left( \sum _{m=0}^{N-1}c_{i} [m]e^{-j\frac{2\pi }{{\textit{LN}}} m(nL+k)} \right) \times \left( \sum _{m=0}^{N-1}c_{j} [m]e^{j\frac{2\pi }{{\textit{LN}}} m(nL+k)} \right) \\&=c_{i} {\mathbf{f }}_{nL+k} ^{T} {\mathbf{f }}_{nL+k} ^{*} c_{j} ^{T} \\&=c_{i} {\mathbf{F }}_{nL+k} c_{j} ^{T} \end{aligned} \end{aligned}$$
(21)

where \({\mathbf{f }}_{nL+k} =[1,W^{(nL+k)},\ldots , W^{(N-1)(nL+k)} ]\) is the Fourier transformation vector and \(W=e^{-j\frac{2\pi }{{\textit{LN}}}}\) is an exponential factor. \({\mathbf{F }}_{nL+k} ={\mathbf{f }}_{nL+k}^{T} {\mathbf{f }}_{nL+k} ^{*}\). From (17) and (20), we can have

$$\begin{aligned} {\mathbf{p }}_{c_{i,j} }[k]={\mathbf{C }}_{A_{i} } {\mathbf{F }}_{D} {\mathbf{c }}_{B_{j} } \end{aligned}$$
(22)

where \({\mathbf{C }}_{A_{i}}={\mathbf{U }}_{N}\otimes {\mathbf{c }}_{i}\) (\(\otimes \) denotes Kronecker product, \({\mathbf{U }}_{N}\) is a unit matrix with size N), \({\mathbf{c }}_{B_{j} }=[{\mathbf{c }}_{j},{\mathbf{c }}_{j} ,\ldots ,{\mathbf{c }}_{j} ]^{T} \) and \({\mathbf{F }}_{D}={\rm{diag}}({\mathbf{F }}_{k},{\mathbf{F }}_{L+k} ,\ldots ,{\mathbf{F }}_{(N-1)L+k} )\). \({\rm{diag}}({\mathbf{X }}_{1} ,{\mathbf{X }}_{2},\ldots ,{\mathbf{X }}_{g} )\) denotes a block diagonal matrix with principal diagonal blocks \({\mathbf{X }}_{1} ,{\mathbf{X }}_{2},\ldots ,{\mathbf{X }}_{g}\). \({\mathbf{c }}_{B_{j} }\) is a vector containing \(N\) groups of \({\mathbf{c }}_{j}\). From (17) and (22), it is given

$$\begin{aligned} {\mathbf{P }}_{C}[k]={\mathbf{C }}_{B}{\mathbf{F }}{\mathbf{C }}_{A} \end{aligned}$$
(23)

where \({\mathbf{C }}_{A} =[{\mathbf{C }}_{A_{0} },{\mathbf{C }}_{A_{1} },\ldots {\mathbf{C }}_{{A}_{M-1}}]^{T}\), \({\mathbf{C }}_{B} =U_{N} \otimes {\mathbf{C }}_{B_{0} }\), \({\mathbf{C }}_{B_{0} } =[{\mathbf{c }}_{B_{0} },{\mathbf{c }}_{B_{1} } \ldots {\mathbf{c }}_{B_{M-1} } ]^{T}\) and \({\mathbf{F }}={\mathbf{U }}_{M} \otimes {\mathbf{F }}_{D} ^{T}\) (\({\mathbf{U }}_{M}\) is a unit matrix with size M).

\({\mathbf{F }}\) is a block diagonal matrix with the \(M\times N\) nonzero blocks. \({\mathbf{C }}_{A}\) is a matrix with M nonzero block diagonal matrix arranged in a column. While assuming \(r_{C} {\rm{=}}M\), each block \({\mathbf{C }}_{A_{i} } \) in \({\mathbf{C }}_{A}\) is independent (which means that each block matrix \({\mathbf{C }}_{A_{i}}\) cannot have a linear expression with other blocks). \({\mathbf{C }}_{B}\) is a block diagonal matrix with \(M\) blocks, each block is equal to \({\mathbf{C }}_{B_{0} }\), and \(rank({\mathbf{C }}_{B_{0}})=rank({\mathbf{C }})=M\). Thus, the rank of \({\mathbf{P }}_{C}[k]\) is given

$$\begin{aligned} rank({\mathbf{P }}_{C} [k])=\min \{ M^{2},N\} \end{aligned}$$
(24)

where min {.} denotes the smaller element of the two numbers. From (24), \({\mathbf{P }}_{C} [k]\) has full column rank when

$$\begin{aligned} M ^{2} \ge N \end{aligned}$$
(25)

Because \({\mathbf{C }}_{M\times N}\) is a random matrix, it is easy to design according to the requirements of (25). If \({\mathbf{P }}_{C}[k]\) is a full column rank matrix, (15) is solvable using LS. Denoting its pseudo inverse as \({\mathbf{P }}^{\dag }_{C} [k]\), \(\widehat{{\mathbf{p }}}_{x} [k]\) is attained as the estimation of \({\mathbf{p }}_{x} [k]\) without any additional constraints.

$$\begin{aligned} \frac{1}{N^{2}} \widehat{{\mathbf{p }}}_{x}[k]={\mathbf{P }}^{\dag } _{C} [k]{\mathbf{p }}_{y} [k] \end{aligned}$$
(26)

\(\square \)

4 Adaptive Resolution Frequency Averaging (ARFA) Method

The modulated signal in a communication system is wide-sense stationary. This underlying stationary assumption allows us to use the frequency averaging method. This section introduces three methods based on frequency averaging in effort to decrease the envelope fluctuation.

Definition

The length of \(\widehat{{\mathbf{p }}}_{x}[k]\) is LN. The length of \(\widehat{{\mathbf{p }}}_{x}[k']\) computed by frequency averaging is \(L'N\) and \(Z=L/L'\), where \(L'\) is the adaptive resolution factor. The spectrum averaging of the original signal is expressed as

$$\begin{aligned} X[k']=\frac{1}{Z} \sum _{k=k'Z}^{(k'+1)Z-1}X[k] \quad k'=0,1,\ldots ,L'N-1\; \end{aligned}$$
(27)

The reference power spectrum is written as

$$\begin{aligned} {\mathbf{P }}_{x} [k']={\mathbf{X }}[k']{\mathbf{X }}^{*} [k'] \end{aligned}$$
(28)

The resolution of \({\mathbf{p }}_{x} [k']\) is \(1/L'N\).

The first of the three methods is the direct frequency averaging method (DFAM). After obtaining the estimation results of power spectrum with (26), DFAM computes the average value of the segments divided from the estimation. The number of segments is \(L'N\). The results are provided by

$$\begin{aligned} \widehat{{\mathbf{p }}}_{x}[k']=\frac{1}{Z} \sum _{k=k'Z}^{(k'+1)Z-1}\widehat{{\mathbf{p }}}_{x}[k] \quad k'=0,1,\ldots ,L'N-1\; \end{aligned}$$
(29)

The second method is the periodogram averaging method (PAM). The periodogram method [20] is a basic power spectrum estimation method that decreases envelope fluctuation. PAM first divides the signal into \(L'N\) segments, then the power spectrum of each segment is estimated by (26) in turn. The output is the average value of all segments, expressed as follows:

$$\begin{aligned} \widehat{{\mathbf{p }}}_{x}[k']=\frac{1}{Z} \sum _{j=0}^{Z-1}\widehat{{\mathbf{p }}}_{x_j}[k'] \quad k'=0,1,\ldots ,L'N-1\; \end{aligned}$$
(30)

where \(j\) is segment index. In practice, it is difficult to reconstruct \({\mathbf{p }}_{x} [k]\; (k=1,2,\ldots ,L)\) when the value of L is large. An adaptive resolution frequency averaging (ARFA) method is proposed. ARFA first divides the frequency spectrum \({\mathbf{Y }}[k]\) into \(L'N\) segments, then the average value of the frequency spectrum of all segments is computed. Equation (26) provides the estimation results. The corresponding operation is given

$$\begin{aligned} \begin{aligned} {\mathbf{c }}_{i} '[n]&=[\; {\mathbf{c }}_{i} [n]\;, \; {\mathbf{0 }}_{1\times (L'-1)N}] \\ Y_{i} [k']&=\frac{1}{Z} \sum _{k=k'Z}^{(k'+1)Z-1}Y_{i} [k] \\ P_{y_{i,j} } [k']&=Y_{i} [k']Y_{j}^{*} [k']\quad k'=0,1,\ldots ,L'-1\; \end{aligned} \end{aligned}$$
(31)

(7) is rewritten in averaging approximate form as

$$\begin{aligned} \begin{aligned} Y_{i} [k']&=\frac{1}{N} \sum _{n=0}^{N-1}\left( \frac{1}{Z} \sum _{k=nL+k'Z}^{nL+(k'+1)Z-1}X[k]\right) \left( \frac{1}{Z} \sum _{k=k'Z}^{(k'+1)Z-1}C''_{i} [k] \right) \\&=\frac{1}{N} \sum _{n=0}^{N-1}X[nL'+k']C'_{i} [nL'+k']) \end{aligned} \end{aligned}$$
(32)

where \(C'_{i}\) is the \(L'N\)-points DFT of \(c_{i} '\) in (32). Equation (26) is reexpressed as

$$\begin{aligned} \frac{1}{N^{2}} \widehat{{\mathbf{p }}}_{x}[k']={\mathbf{P }}^{\dag } _{C} [k']{\mathbf{p }}_{Y} [k'] \quad k'=0,1,\ldots ,L'-1\; \end{aligned}$$
(33)

There are only \(L'\) group equations to be solved by (33) for ARFA, while DFAM and PAM methods need to solve \(L\) group equations with (26). The frequency resolution of \(\widehat{{\mathbf{p }}}_{x}[k']\) is \(\frac{1}{L'N}\).

5 Simulation Results

This section provides some simulations that to demonstrate the performance of power spectrum estimation with the ARFA method. The multi-band orthogonal frequency division multiplexing (OFDM) signal is selected as the analog input, that investigated as a candidate transmission technology for CR [21, 22]. The spectrum shape of each subband is rectangular.

Fig. 2
figure 2

\(=\) 1000, N \(=\) 256, Compression rate \(=\) 0.25. a Original power spectrum estimation. b Reference power spectrum, \(L'=4\)

Fig. 3
figure 3

Power spectrum estimation with different frequency resolution, L \(=\) 1000, N \(=\) 256, a \(L'=1\), b \(L'=4\)

Figure 2a is the estimation result computed by (26). The reference power spectrum in Figure 2b was computed by (2728). Figure 3 illustrates the estimation performance versus frequency resolution. High frequency resolution estimation introduces envelope fluctuation. Figure 3a demonstrates a special case of \(L=1\), which is only needed to solve one equation.

Fig. 4
figure 4

Power spectrum estimation from compressive samples. a, c M \(=\) 16, N \(=\) 32. b, d M \(=\) 32, N \(=\) 64. a, b PSBS method. c, d ARFA method

The number of total subcarriers is \(N_{c }\) (\(N_{c} =2048\) in the following test), which indicates signal bandwidth. The number of active subbands is \(N_{b}\). Figure 4 shows a test result with noise-free input. The number of active bands is \(N_{b}=4\), L \(=\) 1000, and each subband employs 40 subcarriers. In Fig. 4a, c, M \(=\) 16, N \(=\) 32. In Fig. 4b, d, M = 32, N \(=\) 64. The adaptive resolution factor is \(L' =16\) in Fig. 4c, d.

The PSBS method proposed in [15] introduces estimation error, as shown in Fig. 4a, b. The estimation error increases when the value of N is smaller (Fig. 4a, b). However, this modification barely affects estimation performance with the ARFA method in Fig. 4c, d.

NMSE is used to measure the quality between power spectrum estimation results and the reference power spectrum. It is defined as

$$\begin{aligned} {\rm{NMSE}}:={\mathbb {E}}\left[ \frac{\left\| \hat{P}_{x} [i]-P_{r} [i]\right\| ^{2} }{\left\| P_{r} [i]\right\| ^{2}} \right] \end{aligned}$$
(34)

where \(P_{r}\) is the reference power spectrum, expressed as

$$P_{r} = \left\{ {\begin{array}{ll} {P_{x} + \delta ^{2} } & {\textit{In\;the\;active\;band}} \\ {\delta ^{2} } & {\textit{Out\;of\;the\;active\;band}} \end{array} } \right.$$
(35)

where \(P_{x}\) is the theoretical power spectrum of the input and \(\delta ^{2}\) is the additive white Gaussian noise power level.

Fig. 5
figure 5

Estimation performance with different SNR in active band

Fig. 6
figure 6

The NMSE in the active band with different sensing time

The estimation performance of the active band is analyzed in the following experiments. In Fig. 5, compression rate is M/N, where N = 128. The SNR is only calculated in active bands. The number of active bands \(N_{b}\) is 4, and each band employs 40 subcarriers. In Fig. 6, the NMSE in active bands with different sensing times (denoted with L) is illustrated, and SNR = 10 dB.

Fig. 7
figure 7

Power spectrum estimation for less sparse signal. Compression rate is 0.25

Fig. 8
figure 8

The NMSE in the active band and the out band with different \(N_{b}\)

In Fig. 7, N = 128 and compression rate is 0.25. The original signal occupies six subbands. The sparse level is 0.6. The ARFA method is also suitable for less sparse signals. The compression rate can not be less than 0.6 in the spectrum sensing methods based on signal recovery, and such methods are always sensitive to sparsity level. The results in Figs. 5 and 6 also illustrate that the proposed method is not sensitive to sparsity level, because the NMSE does not converge quickly as compression rate increases.

In Fig. 8, N = 128, and M = 32. The SNR in active bands is 10 dB. The number of active bands \(N_{b}\) are 3 and 6 respectively, and each subband employs 32 subcarriers. The effect of sparsity level on estimation performance in the active band is considerable, but the higher sparsity level leads increases of NMSE out of the active band. The error is mainly resulted from the power leakage from the active subbands. Lower value of \(L'\) indicates lower frequency resolution and lower operation complexity; however, when \(L'\) is too small, estimation performance degradation results, mainly from the envelope fluctuation.

6 Conclusion

In this study, sub-Nyquist sampling was employed for spectrum sensing in a cognitive radio system, focusing primarily on power spectrum estimation of the wide-sense stationary analog signal. To account for practical implementation, an adaptive resolution frequency averaging (ARFA) scheme was proposed with accepted complexity. The proposed method consists of a set of LS problems with a rank condition for reliable estimation. Compared to other spectrum sensing methods based on sub-Nyquist sampling, this scheme is suitable for both sparse and nonsparse signals. The scheme is easily implemented to detect active subbands with a threshold in CR systems.