1 Introduction

Cognitive radio (CR) has been proposed as a promising solution to improve the current severely underutilized radio spectrum. In this technique, CRs (or secondary users: SUs) opportunistically access the temporarily unused licensed spectrums by finding them through spectrum sensing (SS), and without degrading quality of service of the primary users (PUs) [1].

In wireless channels, signals often suffer from shadowing or fading, which may lead to the problem of not being able to correctly detect an active PU. To overcome this problem and increase the probability of correctly detecting a PU, a CSS is proposed in [1], which has three successive stages: local sensing, reporting, and data fusion. In local sensing stage, spectrum is sensed by several local SUs. Then, in reporting stage, SS results are sent to a fusion center (FC), and in the last stage, the FC combines them by a specific rule to make a better overall decision.

To perform the above three stages, four great challenges must be considered. First, the reporting channels suffer from noise and fading causing CSS performance is severely degraded [13]. In [4, 5], undesired bounds on the sensing performance caused by imperfect reporting channels are discussed. These bounds limit performance of CSS irrespective of local sensing performances achieved by SUs. In [3], these performance limitations are defined in the form of bit error probability walls.

Second, data fusion rule at the FC has a significant impact on CSS. Depending on the type of local SS results that are reported (soft or hard), there are different rules to combine them and taking the final decision [13]. But two important points, which should be considered in the design of a fusion rule, are the imperfect reporting channels, and the amount of information available about a CRN (Such as instant SNR).

Third, the overhead time for reporting SS results is directly related to the number of cooperative SUs. When the local sensing time is fixed, increasing the number of cooperative SUs increases reporting time, and reduces the time-slot available for data transmission by each SU. However, in [6, 7], it was shown that the CSS performance is improved by increasing the number of cooperative SUs. So, there is an important tradeoff between the throughput degradation associated with reduced time-slot for data transmission and the throughput gain associated with a successful transmission [5].

Fourth, energy efficiency should be considered for CSS schemes, since a cooperative method and its reporting time significantly affect the lifetime of CRN [8, 9]. The above four challenges have not been considered simultaneously, since they are dependent on each other.

To improve sensing performance caused by errors due to reporting channels, either a cluster-based CSS is used in [10, 11], or a spatial diversity-based CSS is applied [4, 5]. However, inner wireless channels (i.e. intra-cluster channels which are between a SU and each of cluster members) in [10, 11] are considered as error-free. Furthermore, using spatial diversity-based CSS of [4, 5] imposes a lot of overhead and complexity to the CRN, in which the CRN has to switch between two different reporting modes (cooperative and non-cooperative), repeatedly [12].

In the design of fusion rules in [7, 13], reporting channels are considered as error-free. In [2, 14], channels are considered as noisy. But fusion rule in [2] uses all reported decisions, which increases overhead, and in [14], it requires much network information.

In [8, 9], energy efficient CSS schemes are proposed, in which all SUs are not participated in CSS. However, in both schemes, the reporting channels are considered as error-free.

In [15], with the objective of maximizing the achievable throughput of the CRN, a cluster-based CSS is proposed to obtain a proper assignment policy. This policy determines the number of SUs required in each cluster, to cooperate for sensing a PU’s channel. However, in [15], reporting channels are considered to be error-free, which is impractical. Moreover, the main parameter: overhead time for reporting is ignored in the formulation of the achievable throughput.

In this paper, we consider the above four challenges simultaneously for a practical CRN, in which all channels (such as reporting channels and inner channels) suffer from noise and fading. To improve the spectrum efficiency of this CRN, we propose a new cluster-based CSS strategy, with an initial phase and four periodic phases. In initial phase, SUs participating in the CSS are divided into smaller clusters. In Phase 1, each SU senses PU’s spectrum independently. In Phase 2, each cluster member improves its local decisions by using its own and other cluster members’ decisions. In Phase 3, each cluster separately selects more reliable decisions from its improved decisions, and sends them to the FC by using a cooperative communication. In Phase 4, the FC makes final decision.

The novelties and contributions of this paper are as follows:

  • In contrast to previous works (which ignore energy-efficiency of their CSS schemes in the non-error-free environment such as non-cluster-based CSS schemes in [2, 5, 8, 9, 13] and cluster-based CSS schemes in [10, 11, 15]), we propose a clustering algorithm for the CSS strategy to improve its performance, as well as to guarantee maximum the CRN lifetime. This is done by using a method that evenly distributes the total energy consumption of the CRN among clusters.

  • Local sensing decisions are improved by using the proposed incremental weighing (IW) fusion rule. This method increases the probability of detecting the PU without increasing overhead traffic. Furthermore, the IW rule adapts itself to inner channels SNR, and so, unlike many conventional methods (such as [35]), does not allow inner channels errors causing any undesired bounds on the sensing performance.

  • We propose a reliability method, based on IW fusion rule, to selecting and sending more reliable decisions to the FC. By using this method, the overhead traffic is reduced in reporting channels while sensing performance at FC is maintained.

  • To overcome fading effects in reporting channels, we employ cooperative communications by using orthogonal distributed space–time block code (ODSTBC) [12]. We chose this method for the following reasons:

    • This method is more bandwidth efficient compared to the other repetition-based methods, such as [1620], in which data is sent to the FC in different time slots. Also, the method in [16] requires some parity bits for verification of received data at relay users, which add undesirable overhead time. Furthermore, these repetition-based methods are non-energy-efficient since relay users consume more energy compared with those of other users in the network. Thus, relay users will have shorter lifetimes, which is equivalent to a shorter useful lifetime for the network.

    • ODSTBCs can be directly obtained from conventional orthogonal STBCs. So, they have low decoding complexity at the FC [12, 21].

    • As we proved in [12], even when inner channels suffer from noise and fading, it is possible to achieve a better BER with this method compared to those of the non-cooperative methods.

  • We derive the detection probability and the false alarm probability for the proposed IW rule at each cluster, and the final fusion rule at the FC. By using them, a closed-form expression for the CRN throughput is derived. Finally, we find the optimal settings of the proposed CSS strategy by maximizing the achievable throughput of the CRN under the constraint of the SUs’ interference on the PU.

The remainder of this paper is organized as follows. In Sect. 2, the proposed network model is described. Section 3 presents optimal energy efficient clustering. In Sect. 4, performance, and in Sect. 5, throughput of the proposed CSS strategy is analyzed. In Sect. 6, simulation results are presented to evaluate the proposed CSS strategy. In Sect. 7, conclusions are presented.

2 System Model

In the proposed CSS system, there are \(S\) SUs, each equipped with a single antenna, and one FC (Fig. 1). In this CRN, SUs are interested in detecting the presence of PU(s) in a single underutilized frequency band (i.e. the sensing channel). Toward this goal, we propose a cluster-based CSS strategy. This CSS strategy has an initial phase and four periodic phases, which are described in the following sub-sections.

Fig. 1
figure 1

A CR network in fading environments

2.1 Initial Phase (Clustering)

As shown in Fig. 1, the clustering algorithm divides N selected SUs, \(1 \le N \le S\), into M clusters. Number of members belonging to cluster \(m\), \(m = 1,2, \ldots ,M\), denoted by n m . As will been seen in Sect. 3, the optimal energy efficient algorithm determines \(\vec{\varvec{N}}_{M} = \left\{ { n_{1} , n_{2} , \ldots , n_{M} } \right\}\) to maximize the CRN lifetime. Moreover, in Sect. 4, we show this flexibility to select N cooperative SUs and M clusters, \(1 \le M \le N\), helps the CRN to enhance sensing performance.

This initial phase is performed once, but as shown in Fig. 2, the following four phases are repeated in ach time frame.

Fig. 2
figure 2

Frame structure of the periodic phases in the proposed CSS strategy

2.2 Phase 1 (Local Spectrum Sensing)

In local sensing time-slot (\(T_{s}\)), in Fig. 2, each SU performs a local spectrum sensing independently and simultaneously, and then makes a binary local decision.

2.3 Phase 2 (Cluster-Based Decisions Improving)

This phase is composed of M time-slots, where time-slot m (\(m = 1,2, \ldots ,M\)), has time duration \(T_{r - in}^{slot} \left( m \right)\), consisting of n m mini time-slots (Fig. 2). The process performed by cluster \(m\) is as follows:

figure c

This proposed IW fusion rule will be explained later in Sect. 4.2 using (20) and (21). At the end of Phase 2, all SUs in a cluster are aware of each other’s updated decisions.

2.4 Phase 3 (Cluster-Based Decisions Reporting)

This phase is composed of M time-slots, where time-slot m (\(m = 1,2, \ldots ,M\)) has time duration \(T_{r - out}^{slot} \left( m \right)\) (Fig. 2). The process performed by cluster \(m, m = 1,2, \ldots ,M\) is as follows:

  1. 1.

    Based on a proposed reliability method, described in Sect. 4.3, each SU selects \(l_{m}\) of \(n_{m}\) decisions, which can include its own decision and decisions received in Phase 2.

  2. 2.

    Cluster members act like a virtual antenna array, and each SU makes a bit sequence of \(l_{m}\) selected decisions.

  3. 3.

    Each SU simultaneously encodes and transmits its bit sequence to the FC, according to a distinct column of a matrix called OSTBC matrix [12]. This matrix is designed using the method presented in [22].

2.5 Phase 4 (Decisions Combining)

In this phase, which takes \(T_{FC}\) seconds, the FC combines the improved decisions received from all clusters to make a final decision. If the FC decides the PU is absent in the sensing channel, SUs proceeds to transmit data in the rest of the frame (\(T_{Data}\)). Otherwise, they must stay quiet and wait until the next time frame to do the CSS again.

As explained in this section, to combat fading in reporting channels, we employ a spatial diversity by using a cooperative communication (ODSTBC). The cooperative communication has two steps: data exchange and cooperative transmission [12, 21], which are equivalent to Phases 2 and 3 in the proposed CSS. But it should be noted that both reporting channels (i.e. the channels between a cluster member and the FC), and inner channels affect the achievable diversity gain and sensing performance. Therefore, in the time of Phase 2, we improve local sensing decisions by using the proposed IW fusion rule. This IW fusion rule depends on the SNR of inner channels. As will be explained in Sect. 4.2, this fusion rule will greatly reduce the destructive effects of errors caused by inner channels.

3 Optimal Energy Efficient Clustering for Proposed CSS

In this section, by analyzing energy consumption in the proposed CSS, we show why we selected the clustering approach for the CSS. Then, we formulate our optimization problem to maximize useful lifetime of the CRN, and evenly distribute the energy consumption of the proposed CSS among clusters. By solving this optimization, we find the optimum clustering algorithm for the proposed CSS.

If there is no clustering, N SUs perform the CSS, and C of S SUs can participate in the reporting to the FC by using cooperative communications. By increasing C, cooperative diversity order increases, and bit error rate (BER) of reporting channels decrease. As a result, the CSS performance can be improved. However, by increasing C, total time duration \(T_{r - out} ,\) of transmissions between SUs and the FC increases non-linearly, and a longer \(T_{r - out}\) left a shorter \(T_{Data}\) (Fig. 2), which result in a lower achievable throughput of CRN. So, selecting the value of C, is an important issue since it makes a tradeoff between diversity and throughput. As a result, \(C = N\) is not always an optimum solution to maximize CRN performance. On the other side, there is another problem, when \(C \ne N\). In this case, C SUs, which report to the FC, consume more energy compare with other SUs, which don’t report. Thus, when C SUs run out of energy, the network loses its full coverage and optimum performance, which means the CRN will have a shorter useful lifetime.

Considering the above two challenges, we use clustering approach in the proposed CSS, which has been explained in Sect. 1. In addition to degrees of freedom N, the proposed clustering approach provides two degrees of freedom:

  1. 1.

    The number of clusters (M), and

  2. 2.

    Clustering algorithm.

We use M to determine the diversity order within each cluster (i.e. act like C). As will be described in Sect. 5, optimal value for this parameter depends on the maximization of achievable throughput.

The clustering algorithm plays an important role in determining a relationship between N cooperative SUs, and M clusters, with n m SUs in cluster \(m\) (\(m = 1,2, \ldots ,M\)). In the rest of this section, we use a clustering algorithm to evenly distribute the energy consumption among clusters, which increases the lifetime of the CRN. Toward this goal, n m should be determined in such a way to minimize the mean square difference of energy consumption (MSDE) between any two arbitrary clusters. So, the energy optimization problem can be modeled as follows

$$\begin{array}{*{20}l} {\mathop {\hbox{min} }\limits_{{\vec{\varvec{N}}_{M} \, = \,\left\{ { n_{1} , n_{2} , \ldots , n_{M} } \right\} }} MSDE\left( {\vec{\varvec{N}}_{M} } \right) = \frac{2}{{M\left( {M - 1} \right)}}\sum\limits_{i = 1}^{M - 1} {\sum\limits_{m = i + 1}^{M} {\left[ {E_{i} \left( {n_{i} } \right) - E_{m} \left( {n_{m} } \right)} \right]^{2} } } } \hfill \\ {s.t. \sum\limits_{m = 1}^{M} {n_{m} = N} ,} \hfill \\ \end{array}$$
(1)

where \(E_{m} \left( {n_{m} } \right)\) is the average energy consumption of cluster m with n m SUs, which consists of three main components as follows

$$E_{m} \left( {n_{m} } \right) = \frac{1}{{n_{m} }}\left( {n_{m} P_{s} T_{s} + E_{in,m} \left( {n_{m} } \right) + E_{r,m} \left( {n_{m} } \right)} \right).$$
(2)

In (2), \(P_{s}\) is the power consumption of each SU in local sensing time-slot (Phase 1), \(E_{in,m} \left( . \right)\) is the energy consumption of inner communications among SUs of cluster m in Phase 2, and \(E_{r,m} \left( . \right)\) is the energy consumption of reporting decisions by cluster m to the FC in Phase 3. The last two energy functions are related to the number of SU in the cluster, \(n_{m}\), which are given by

$$E_{in,m} \left( {n_{m} } \right) = P_{in,m} \left( {SNR_{in}^{m} } \right) \times T_{r - in}^{slot} \left( m \right) = \frac{{P_{in,m} \left( {SNR_{in}^{m} } \right)n_{m} }}{{R_{b} }} ,$$
(3)
$$E_{r,m} \left( {n_{m} } \right) = P_{r,m} \left( {SNR_{r}^{m} } \right) \times T_{r - out}^{slot} \left( m \right) = \frac{{P_{r,m} \left( {SNR_{r}^{m} } \right)l_{m} }}{{R_{b}^{eff} \left( {n_{m} } \right)}} ,$$
(4)

where \(P_{in,m} \left( {SNR_{in}^{m} } \right)\) and \(P_{r,m} \left( {SNR_{r}^{m} } \right)\) are the total average power consumptions in Phases 2 and 3, whose values, respectively, depend on the average SNR of inner and reporting channels of cluster m (i.e. \(SNR_{in}^{m}\) and \(SNR_{r}^{m}\)), \(l_{m}\) is total number of bits transmitted from cluster m to the FC, which will be defined in (33), \(R_{b}\) is SUs bit rate, and \(R_{b}^{eff} \left( {n_{m} } \right)\) is the effective bit rate of the ODSTBC transmission.

Suppose that \(T_{coh}\) is the coherence time of the fading reporting channels, and \(\lfloor x \rfloor\) denotes the greatest integer less than or equal to x. So, each group of \(F = \lfloor R_{b} T_{coh}\rfloor\) bits sent to the FC, consists of \(\vartheta \times n_{m}\) training bits, which are used to obtain channel state information [5]. As a result, \(R_{b}^{eff} \left( {n_{m} } \right)\) in (4) can be represented by

$$R_{b}^{eff} \left( {n_{m} } \right) = \frac{{F - \vartheta n_{m} }}{F} \times R_{STBC} \left( {n_{m} } \right) \times R_{b} ,$$
(5)

where \(R_{STBC} \left( . \right)\) is code rate, which can be expressed as follows for the complex modulations [5, 22]

$$R_{STBC} \left( {n_{m} } \right) = \left\{ {\begin{array}{*{20}l} {\frac{{n_{m} + 2}}{{2n_{m} }},} \hfill & {n_{m} \in even,} \hfill \\ {\frac{{n_{m} + 3}}{{2\left( {n_{m} + 1} \right)}},} \hfill & {n_{m} \in odd.} \hfill \\ \end{array} } \right.$$
(6)

Now, by using these definitions, we present Theorem 1, which gives the optimal clustering algorithm.

Theorem 1

The solution of the optimization problem given by (1), determine the optimal number of SUs in each cluster, \(\hat{n}_{m}\) , which can have a lower and upper bounds as follows

$$N - \sum _{{\begin{array}{*{20}c} {q = 1} \\ {q \ne m} \\ \end{array} }}^{M} \left \lceil {E_{q}^{{ - 1}} \left( {E_{m} \left( {\hat{n}_{m} + 1} \right)} \right)} \right\rceil \le \hat{n}_{m} \le N - \sum _{{\begin{array}{*{20}c} {q = 1} \\ {q \ne m} \\ \end{array} }}^{M} \lfloor E_{q}^{{ - 1}} \left( {E_{m} \left( {\hat{n}_{m} - 1} \right)} \right)\rfloor,\quad 1 \le m \le M,$$
(7)

where \(\left\lceil x \right\rceil\) denotes the smallest integer bigger than or equal to x, and \(E_{q}^{ - 1} \left( . \right)\) is the inverse of the total energy consumption in (2).

Proof

Proof of (7) is presented in “Appendix 1”.

Suppose SUs sparsely dispersed in the CRN, and its dimensions are small as compared to the distances between each SU and the FC. So, we can assume the average SNR per bit in reporting channels of all clusters are approximately the same, and denoted by \(SNR_{r}\) [2, 14]. Furthermore, a clustering algorithm assigns each node to a cluster whose cluster center is closer to the node. Hence, it is a reasonable assumption that the average SNR per bit in each inner channel of a cluster is approximately the same, which denoted by \(SNR_{in}\). By using these assumptions, we present Proposition 1, which gives a simple version of (7).

Proposition 1

If SNR of inner channels, as well as SNR of reporting channels are respectively almost the same, then the optimal number of SUs in each cluster, \(\hat{n}_{m}\) , in (7) can be simplified as follows

$$\hat{n}_{m} = \left\{ {\begin{array}{*{20}l} {n + 1,} \hfill & {1 \le m \le r,} \hfill \\ {n,} \hfill & {r + 1 \le m \le M,} \hfill \\ \end{array} } \right.$$
(8)

where

$$n = \lfloor\frac{N}{M}\rfloor,\quad r = N - nM$$
(9)

Proof

By these assumptions, we can infer that \(P_{in,m}\) and \(P_{r,m}\) are approximately the same for all clusters (\(P_{in} \left( {SNR_{in} } \right) = P_{in,m} \left( {SNR_{in}^{m} } \right), P_{r} \left( {SNR_{r} } \right) = P_{r,m} \left( {SNR_{r}^{m} } \right)\) for \(\forall m\)). Therefore, by using (3) and (4), we have

$$E_{m} \left( {\hat{n}_{m} } \right) = E_{q} \left( {\hat{n}_{m} } \right) , \quad 1 \le m \le M, 1 \le q \le M.$$
(10)
$$E_{q}^{ - 1} \left( {E_{m} \left( {\hat{n}_{m} } \right)} \right) = E_{q}^{ - 1} \left( {E_{q} \left( {\hat{n}_{m} } \right)} \right) = \hat{n}_{m} .$$
(11)

By using (11), we can rewrite (7) as

$$\frac{{N - \left( {M - 1} \right)}}{M} \le \hat{n}_{m} \le \frac{{N + \left( {M - 1} \right)}}{M}, \quad 1 \le m \le M.$$
(12)

Since \(\hat{n}_{m}\) can only be a positive integer, we have

$$\lceil\frac{N}{M} - \frac{{\left( {M - 1} \right)}}{M} \rceil\le \hat{n}_{m} \le \lfloor\frac{N}{M} + \frac{{\left( {M - 1} \right)}}{M}\rfloor,\quad 1 \le m \le M.$$
(13)

Substituting (9) into (13), we have

$$\lceil n + \frac{{r - \left( {M - 1} \right)}}{M}\rceil \le \hat{n}_{m} \le \lfloor n + \frac{{r + \left( {M - 1} \right)}}{M}\rfloor,\quad 1 \le m \le M.$$
(14)

From (9), it’s clear that \(0 \le r \le M - 1\). As a result, \(\hat{n}_{m}\) can only be one of two values, \(n\) or \(n + 1\). Now, we need to determine how many clusters containing \(n\) and \(n + 1\) SUs. If \(M - J\) and \(J\), respectively, denoting the number of clusters that have \(n\) and \(n + 1\) SUs, the total number of SUs will be calculated as follows:

$$\sum\limits_{m = 1}^{M} {\hat{n}_{m} } = \left( {M - J} \right)n + J\left( {n + 1} \right) = Mn + J = N.$$
(15)

Looking at (9), we find that \(J = r\). Therefore, \(r\) and \(M - r\) clusters have \(n + 1\) and \(n\) SUs, respectively, and we get the same description as (8) that we wanted. According to (8) and (9), when N is an integer multiples of M, the numbers of SUs in each cluster are the same; otherwise the maximum difference between the two clusters will be equal to 1. Therefore, as expected, the optimal clustering algorithm uses a uniform distribution for the number of SUs, to achieve an evenly distribution of energy consumption. Thereby, achieving the maximum effective life-time is guaranteed.

4 Performance Analysis of the Proposed CSS

In this section, the phases described in Sect. 2 for the proposed CSS strategy, are analytically studied in the following sub-sections.

4.1 Local Spectrum Sensing

In this phase, each SU evaluates a decision statistic based on the received signal from the sensing channel with bandwidth \(W_{s}\), over the sensing time-slot \(T_{s}\). Then, SU k (\(1 \le k \le n_{m}\)) in cluster m (\(1 \le m \le M\)) compares this decision statistic with a common detection threshold (\(\psi_{E}\)) to make a one-bit hard local decision \(d_{k}^{m}\). It’s notable that \(d_{k}^{m} = 1\) corresponds to the hypothesis \(H_{1}\) (a PU signal is present) and \(d_{k}^{m} = 0\) corresponds to the hypothesis \(H_{0}\) (a PU signal is absent).

We assume that SUs experience a same average SNR on the sensing channel, which denoted by \(SNR_{s}\) [2, 5, 6, 14]. Therefore, we have

$$\Pr \left\{ {d_{k}^{m} = 1|H_{0} } \right\} = P_{f}^{Local} ,\quad \Pr \left\{ {d_{k}^{m} = 0|H_{0} } \right\} = 1 - P_{f}^{Local} ,$$
(16)
$$\Pr \left\{ {d_{k}^{m} = 1|H_{1} } \right\} = P_{d}^{Local} , \quad { \Pr }\left\{ {d_{k}^{m} = 0|H_{1} } \right\} = 1 - P_{d}^{Local} ,$$
(17)

where \(P_{d}^{Local}\) and \(P_{f}^{Local}\) are the local probabilities of detection, and false alarm. These probabilities for energy detection method on Rayleigh fading channel can be expressed as [6]

$$P_{f}^{Local} = \frac{{\Gamma \left( {u,\psi_{E} /2} \right)}}{{\Gamma \left( u \right)}},$$
(18)
$$P_{d}^{Local} = \frac{{{{\varGamma }}\left( {u - 1,\frac{{\psi_{E} }}{2}} \right)}}{{{{\varGamma }}\left( {u - 1} \right)}} + e^{{\frac{{ - \psi_{E} }}{{2\left( {1 + uSNR_{s} } \right)}}}} \left( {1 + \frac{1}{{uSNR_{s} }}} \right)^{u - 1} \left[ {1 - \frac{{{{\varGamma }}\left( {u - 1,\frac{{u\psi_{E} SNR_{s} }}{{2\left( {1 + uSNR_{s} } \right)}}} \right)}}{{{{\varGamma }}\left( {u - 1} \right)}}} \right],$$
(19)

where \({{\varGamma }}\left( . \right)\) is gamma function, \({{\varGamma }}\left( {.,.} \right)\) is a complementary incomplete gamma function, and \(u = \lfloor W_{s} T_{s}\rfloor\) is the time-bandwidth product.

4.2 Cluster-Based Decisions Improving

In this phase, we are looking to simultaneously achieve three main goals for each cluster:

  1. 1.

    All SUs in a cluster are aware of each other’s decisions, which is a prerequisite step in a cooperative communication [5, 12, 21].

  2. 2.

    SUs improve and modify their local decisions based on other SUs’ decisions.

  3. 3.

    These decisions improvement is done in a way that the BER of inner channels has minimal impact on it.

Therefore, as described in Sect. 2, we proposed a process in which SUs share their decisions simultaneously in order to improve their decisions. In this sub-section, this process is analyzed mathematically for cluster m as follows.

As explained in Sect. 2, this process starts from SU 1 and ends by SU \(n_{m}\) in \(n_{m}\) similar steps (i.e. time-slots). In step k, SU k after receiving \(k - 1\) decisions of the \(k - 1\) previous SUs, computes soft valued decision statistics \({{\varLambda }}_{IW} \left( {m,k} \right)\) as

$${{\varLambda }}_{IW} \left( {m,k} \right) = w_{m} \sum\limits_{i = 1}^{k - 1} {\hat{x}_{i,k}^{m} + d_{k}^{m} } ,$$
(20)

where \(\hat{x}_{i,k}\) is the decoded binary decision by SU k based on the binary decision \(x_{i}^{m}\) transmitted by SU i, and \(w_{m}\) is the weight assigned to \(k - 1\) decoded binary decisions.Then, SU k makes the final decision as

$$x_{k}^{m} = \left\{ {\begin{array}{*{20}c} {1,\quad } & {{{\varLambda }}_{IW} \left( {m,k} \right) \ge \psi_{IW} \left( {m,k} \right),} \\ {0,\quad } & {{{\varLambda }}_{IW} \left( {m,k} \right) < \psi_{IW} \left( {m,k} \right),} \\ \end{array} } \right.$$
(21)

where \(\psi_{IW} \left( {m,k} \right)\) is the fusion threshold of SU k. We call the proposed fusion rule in (20) and (21) as the incremental weighing (IW) fusion rule, as by increasing SU members from 1 to \(n_{m} ,\) number of elements of its decision statistic increases linearly. We set \(\psi_{IW} \left( {m,1} \right) = 1\) for \(\forall m\). So, from (20) and (21), we have \(x_{1}^{m} = d_{1}^{m}\) for \(\forall m\).

Due to inner channels errors, the decisions received by SU k may differ from those transmitted from the k-1 SUs, which can have a devastating impact on sensing performance. Therefore, the weight assigned to \(k - 1\) decoded binary decisions, \(w_{m}\), in (20) is calculated using SNR of inner channels, \(SNR_{in}^{m}\), as follows

$$w_{m} = \frac{{SNR_{in}^{m} }}{{SNR_{in}^{m} + \alpha }}.$$
(22)

In above, \(\alpha\) is a positive constant, which controls the influence of \(SNR_{in}^{m}\) on \(w_{m}\). As seen in (22), higher weights are assigned to those decoded decisions received with higher SNRs since such decisions are more reliable. Furthermore, as expected, \(w_{m}\) tends to one as the \(SNR_{in}^{m}\) tends to infinity and all error-free decisions have a same weight. Therefore, using the proposed IW fusion rule, we can achieve all three goals at this phase.

Theorem 2

The probabilities of detection (\(P_{d}^{IW} \left( {m,k} \right)\)) and false-alarm (\(P_{f}^{IW} \left( {m,k} \right)\)) of the proposed IW fusion rule, are given as

$$P_{f}^{IW} \left( {m,k} \right) = \Pr \left\{ {x_{k}^{m} = 1|H_{0} } \right\} = \mathop \sum \limits_{a = 0}^{{2^{k - 1} - 1}} \left( {1 - F_{k} \left( {\left. {\vec{\varvec{A}}_{k}^{a} } \right|H_{0} } \right)} \right){ \Pr }\left\{ {\left. {\vec{\varvec{X}}_{k}^{m} = \vec{\varvec{A}}_{k}^{a} } \right|H_{0} } \right\},$$
(23)
$$P_{d}^{IW} \left( {m,k} \right) = \Pr \left\{ {x_{k}^{m} = 1|H_{1} } \right\} = \mathop \sum \limits_{a = 0}^{{2^{k - 1} - 1}} \left( {1 - F_{k} \left( {\left. {\vec{\varvec{A}}_{k}^{a} } \right|H_{1} } \right)} \right){ \Pr }\left\{ {\left. {\vec{\varvec{X}}_{k}^{m} = \vec{\varvec{A}}_{k}^{a} } \right|H_{1} } \right\},$$
(24)

where \(\vec{\varvec{X}}_{k}^{m} = \left\{ {\hat{x}_{1,k}^{m} ,\hat{x}_{2,k}^{m} , \ldots ,\hat{x}_{k - 1,k}^{m} } \right\}\) is the vector of decoded decisions received from \(k - 1\) SUs; \({\varvec{\Pi}}_{\text{k}}\) is the set of all possible values of a \(k - 1\) bit binary vector, like \(\vec{\varvec{X}}_{k}^{m}\) ; the cardinality of \({\varvec{\Pi}}_{\text{k}}\) is \(2^{k - 1}\) ; and \(\vec{\varvec{A}}_{k}^{a}\), \(\vec{\varvec{B}}_{k}^{b}\) and \(\vec{\varvec{C}}_{k}^{c}\) are the ath, bth, and cth entries in \({\varvec{\Pi}}_{\text{k}} ,\) which is corresponding to the \(k - 1\) bit vector equivalent to the decimal values a, b, and c, respectively.

In (23) and (24), \({ \Pr }\left\{ {\left. {\vec{\varvec{X}}_{k}^{m} = \vec{\varvec{A}}_{k}^{a} } \right|H_{y} } \right\},\) \(y \in \left\{ {0,1} \right\}\) is the probability mass function (pmf) of \(\vec{\varvec{X}}_{k}^{m}\) , which is given by

$${ \Pr }\left\{ {\left. {\vec{\varvec{X}}_{k}^{m} = \vec{\varvec{A}}_{k}^{a} } \right|H_{y} } \right\} = \mathop \sum \limits_{b = 0}^{{2^{k - 1} - 1}} \left( {\mathop \prod \limits_{j = 1}^{k - 1} p_{j} \left( {\left. {\vec{\varvec{B}}_{j}^{b} } \right|H_{y} } \right)^{{B_{j}^{b} }} \left( {1 - p_{j} \left( {\left. {\vec{\varvec{B}}_{j}^{b} } \right|H_{y} } \right)} \right)^{{1 - B_{j}^{b} }} \times \left( {P_{e1} } \right)^{{\left| {A_{j}^{a} - B_{j}^{b} } \right|}} \left( {1 - P_{e1} } \right)^{{1 - \left| {A_{j}^{a} - B_{j}^{b} } \right|}} } \right).$$
(25)

In above, \(A_{j}^{a}\) is the bit corresponding to bit j in vector \(\vec{\varvec{A}}_{k}^{a}\), and auxiliary functions \(p_{j} \left( {\left. {\vec{\varvec{B}}_{j}^{b} } \right|H_{y} } \right)\) is defined as follows

$$p_{j} \left( {\left. {\vec{\varvec{B}}_{j}^{b} } \right|H_{y} } \right) = \left\{ {\begin{array}{*{20}l} {\mathop \sum \limits_{c = 0}^{{2^{j - 1} - 1}} \left\{ {\left( {1 - F_{j} \left( {\left. {\vec{\varvec{C}}_{j}^{c} } \right|H_{y} } \right)} \right) \times \left( {P_{e2} } \right)^{{\mathop \sum \limits_{i = 1}^{j - 1} \left| {B_{i}^{b} - C_{i}^{c} } \right|}} \left( {1 - P_{e2} } \right)^{{j - 1 - \mathop \sum \limits_{i = 1}^{j - 1} \left| {B_{i}^{b} - C_{i}^{c} } \right|}} } \right\},\quad } \hfill & {j \ne 1,} \hfill \\ {1 - F_{j} \left( {\left. {\vec{\varvec{B}}_{j}^{b} } \right|H_{y} } \right),\quad } \hfill & {j = 1.} \hfill \\ \end{array} } \right.$$
(26)

In (25), \(P_{e1} = P_{e2} = P_{e}^{in} \left( {SNR_{in}^{m} } \right)\) which \(P_{e}^{in} \left( {SNR_{in}^{m} } \right)\) denotes the BER of inner channel, and \(F_{j} \left( {\left. {\vec{\varvec{A}}_{j}^{a} } \right|H_{y} } \right)\) is cumulative distribution function (CDF) of random variable \(d_{j}^{m}\) which can be computed as follows

$$\begin{aligned} F_{j} \left( {\left. {\vec{\varvec{A}}_{j}^{a} } \right|H_{y} } \right) & = \Pr \left\{ {\left. {d_{j}^{m} < \psi_{IW} \left( {m,j} \right) - w_{m} \mathop \sum \limits_{i = 1}^{j - 1} A_{i}^{a} } \right|H_{y} } \right\}, \\ & = \left\{ {\begin{array}{*{20}l} {1,} \hfill & {1 < \psi_{IW} \left( {m,j} \right) - w_{m} \mathop \sum \limits_{i = 1}^{j - 1} A_{i}^{a} ,} \hfill \\ {\Pr \left\{ {d_{j}^{m} = 0|H_{y} } \right\},} \hfill & {0 < \psi_{IW} \left( {m,j} \right) - w_{m} \mathop \sum \limits_{i = 1}^{j - 1} A_{i}^{a} \le 1} \hfill \\ {0,} \hfill & {\psi_{IW} \left( {m,j} \right) - w_{m} \mathop \sum \limits_{i = 1}^{j - 1} A_{i}^{a} \le 0.} \hfill \\ \end{array} } \right., \\ \end{aligned}$$
(27)

Proof

Proof of (23) and (24) are presented in “Appendix 2”.

Moreover, this IW fusion rule doesn’t make any limit on sensing, although there are errors in inner channels. This means if we choose appropriate values for \(\alpha\) and \(\psi_{IW} \left( {m,k} \right)\), then there is no lower bound for \(P_{f}^{IW} \left( {.,.} \right)\) or upper bound for \(P_{d}^{IW} \left( {.,.} \right)\) due to inner channel errors (unlike other fusion rules such as K-out-of-N [35]). The following propositions prove the above discussion.

Proposition 2

If for a specific \(SNR_{in}^{m}\) , we have

$$\psi_{IW} \left( {m,k} \right) > \left( {k - 1} \right)\left( {\frac{{SNR_{in}^{m} }}{{SNR_{in}^{m} + \alpha }}} \right),$$
(28)

and if \(P_{f}^{Local} \mathop \to \limits^{{}} 0\) , then \(P_{f}^{IW} \left( {m,k} \right)\mathop \to \limits^{{}} 0\).

Proof

When \(P_{f}^{Local} \mathop \to \limits^{{}} 0\), (27) can be simplified to

$$1 - F_{k} \left( {\left. {\vec{\varvec{A}}_{k}^{a} } \right|H_{0} } \right) = \left\{ {\begin{array}{*{20}c} {0,} & {\psi_{IW} \left( {m,k} \right) - w_{m} \mathop \sum \limits_{i = 1}^{k - 1} A_{i}^{a} > 0,} \\ {1,} & {\psi_{IW} \left( {m,k} \right) - w_{m} \mathop \sum \limits_{i = 1}^{k - 1} A_{i}^{a} \le 0.} \\ \end{array} } \right.$$
(29)

Furthermore, for \(\forall \vec{\varvec{A}}_{k}^{a}\) we know

$$\psi_{IW} \left( {m,k} \right) - w_{m} \left( {k - 1} \right) \le \psi_{IW} \left( {m,k} \right) - w_{m} \mathop \sum \limits_{i = 1}^{k - 1} A_{i}^{a} \le \psi_{k}^{m} .$$
(30)

So, if \(\psi_{IW} \left( {m,k} \right) > w_{m} \left( {k - 1} \right),\) then \(\psi_{IW} \left( {m,k} \right) - w_{m} \mathop \sum \nolimits_{i = 1}^{k - 1} A_{i}^{a} > 0\) for \(\forall \vec{\varvec{A}}_{k}^{a}\), and \(1 - F_{k} \left( {\left. {\vec{\varvec{A}}_{k}^{a} } \right|H_{0} } \right) = 0\) for \(\forall \vec{\varvec{A}}_{k}^{a}\). Therefore, by looking at (23), \(P_{f}^{IW} \left( {m,k} \right)\mathop \to \limits^{{}} 0\). Defining \(w_{m}\) as given by (22), then \(\psi_{IW} \left( {m,k} \right) > w_{m} \left( {k - 1} \right)\) is given by (28).

Proposition 3

If we have

$$\psi_{IW} \left( {m,k} \right) \le 1,$$
(31)

and if \(P_{d}^{Local} \mathop \to \limits^{{}} 1\) , then \(P_{d}^{IW} \left( {m,k} \right)\mathop \to \limits^{{}} 1\).

Proof

When \(P_{d}^{Local} \mathop \to \limits^{{}} 1\), (27) can be simplified to

$$1 - F_{k} \left( {\left. {\vec{\varvec{A}}_{k}^{a} } \right|H_{0} } \right) = \left\{ {\begin{array}{*{20}c} {0,} & {\psi_{IW} \left( {m,k} \right) - w_{m} \mathop \sum \limits_{i = 1}^{k - 1} A_{i}^{a} > 1,} \\ {1,} & {\psi_{IW} \left( {m,k} \right) - w_{m} \mathop \sum \limits_{i = 1}^{k - 1} A_{i}^{a} \le 1.} \\ \end{array} } \right.$$
(32)

So, looking at (24) and (30), we find that if \(\psi_{IW} \left( {m,k} \right) \le 1,\) then \(\psi_{IW} \left( {m,k} \right) - w_{m} \mathop \sum \nolimits_{i = 1}^{k - 1} A_{i}^{a} \le 1\) for \(\forall \vec{\varvec{A}}_{k}^{a}\), and \(P_{d}^{IW} \left( {m,k} \right)\mathop \to \limits^{{}} 1\).

4.3 Cluster-Based Decisions Reporting

In this phase, first, SUs in each cluster select last \(l_{m}\) of \(n_{m}\) improved decisions, where

$$l_{m} = \left\lfloor {n_{m} - \left( {w_{m} } \right)^{1/3} \left( {n_{m} - 1} \right)} \right\rfloor .$$
(33)

Then, these SUs employ cooperative transmission using ODSTBC, and transmit selected decisions, \(x_{k}^{m} \left( {n_{m} - l_{m} + 1 \le k \le n_{m} } \right)\) to the FC.

In (33), by increasing \(SNR_{in}^{m}\) (i.e. when \(w_{m} \to 1\)), SUs send less number of decisions to the FC, since these decisions have sufficient reliable knowledge about other decisions.

As shown in [12], ODSTBC BER depends on the number of cooperative SUs, \(n_{m} ,\) SNR of inner channels, and SNR of reporting channels. A closed-form expression for the upper bound of ODSTBC BER is derived in [12], which we use it as \(P_{e}^{r} \left( {n_{m} ,SNR_{in}^{m} ,SNR_{r}^{m} } \right)\) in the rest of this paper.

4.4 Decisions Combining

In the final phase, the FC, after receiving selected improved decisions from \(M\) independent clusters, decides which hypothesis is more likely to be true, which is denoted by \(z_{FC}\). This is done by using a fusion rule to construct a decision statistic, \({{\varLambda }}_{FC}\), and then it is compared with a threshold.

The optimal fusion rule at the FC for the proposed CSS strategy is the sum of log-likelihood ratios (LLRs) of the received decisions from each cluster [3, 13], i.e.,

$${{\varLambda }}_{FC}^{OPT} = \mathop \sum \limits_{m = 1}^{M} log\left[ {\frac{{{ \Pr }\left\{ {\left. {\vec{\varvec{X}}_{FC}^{m} } \right|H_{1} } \right\}}}{{{ \Pr }\left\{ {\left. {\vec{\varvec{X}}_{FC}^{m} } \right|H_{0} } \right\}}}} \right].$$
(34)

In above, \(\vec{\varvec{X}}_{FC}^{m} = \left\{ {\tilde{x}_{{n_{m} - l_{m} + 1}}^{m} ,\tilde{x}_{{n_{m} - l_{m} + 2}}^{m} , \ldots ,\tilde{x}_{{n_{m} }}^{m} } \right\}\) is the vector of decoded decisions \(\tilde{x}_{k}^{m}\) by the FC, which is based on the selected binary decision \(x_{k}^{m}\) transmitted by cluster m. Looking at (58) and (59), we see that the pmf of \(\vec{\varvec{X}}_{FC}^{m}\), \({ \Pr }\left\{ {\left. {\vec{\varvec{X}}_{FC}^{m} } \right|H_{y} } \right\}\) for \(y \in \left\{ {0,1} \right\}\), is the same as the pmf of \(\vec{\varvec{X}}_{k}^{m}\) in (25), with the exception that \(k = l_{m} + 1\) and \(P_{e1} = P_{e}^{r} \left( {n_{m} ,SNR_{in}^{m} ,SNR_{r}^{m} } \right).\) Therefore, \({ \Pr }\left\{ {\left. {\vec{\varvec{X}}_{FC}^{m} } \right|H_{y} } \right\}\) in (34) is given by

$${ \Pr }\left\{ {\left. {\vec{\varvec{X}}_{FC}^{m} = \vec{\varvec{A}}_{k}^{a} } \right|H_{y} } \right\} = \mathop \sum \limits_{b = 0}^{{2^{{l_{m} }} - 1}} \left( {\mathop \prod \limits_{j = 1}^{{l_{m} }} p_{j} \left( {\left. {\vec{\varvec{B}}_{{l_{m} + 1}}^{b} } \right|H_{y} } \right)^{{B_{j}^{b} }} \left( {1 - p_{j} \left( {\left. {\vec{\varvec{B}}_{{l_{m} + 1}}^{b} } \right|H_{y} } \right)} \right)^{{1 - B_{j}^{b} }} \times \left( {P_{e1} } \right)^{{\left| {A_{j}^{a} - B_{j}^{b} } \right|}} \left( {1 - P_{e1} } \right)^{{1 - \left| {A_{j}^{a} - B_{j}^{b} } \right|}} } \right),$$
(35)

where \(p_{j} \left( {\left. {\vec{\varvec{B}}_{{l_{m} + 1}}^{b} } \right|H_{y} } \right)\) is defined in (26).

As seen in (35) and (26), the optimal fusion rule requires average SNR of inner and reporting channels, i.e. \(SNR_{in}^{m}\), \(SNR_{r}^{m}\), and the local sensing performance indices, i.e., the \(P_{d}^{Local}\) and \(P_{f}^{Local}\), for all clusters, which may be unavailable. Furthermore, it has a non-linear complex form, since according to Eq. (20), the improved decisions of SUs in each cluster are correlated. Therefore, we use a general fusion rule for the FC, which is easy to be implemented, and it reduces the complexity of the throughput optimization problem in the next section. This fusion rule is OR-rule, which is computed as

$${{\varLambda }}_{FC} = \mathop \sum \limits_{m = 1}^{M} \mathop \sum \limits_{{k = n_{m} - l_{m} + 1}}^{{n_{m} }} \tilde{x}_{k}^{m} ,$$
(36)
$$z_{FC} = \left\{ {\begin{array}{*{20}c} {1,} & {{{\varLambda }}_{FC} \ge 1,} \\ {0,} & {{{\varLambda }}_{FC} = 0.} \\ \end{array} } \right.$$
(37)

Considering independent clusters, we can evaluate the false alarm probability, \(P_{f}^{FC}\), and the detection probability, \(P_{d}^{FC}\), at the FC as

$$P_{f}^{FC} = \Pr \left\{ {z_{FC} = 1|H_{0} } \right\} = 1 - \mathop \prod \limits_{m = 1}^{M} { \Pr }\left\{ {\left. {\vec{\varvec{X}}_{FC}^{m} = {\vec{\mathbb{O}}}_{{l_{m} + 1}} } \right|H_{0} } \right\},$$
(38)
$$P_{d}^{FC} = \Pr \left\{ {z_{FC} = 1|H_{1} } \right\} = 1 - \mathop \prod \limits_{m = 1}^{M} { \Pr }\left\{ {\left. {\vec{\varvec{X}}_{FC}^{m} = {\vec{\mathbb{O}}}_{{l_{m} + 1}} } \right|H_{1} } \right\},$$
(39)

where \({\vec{\mathbb{O}}}_{{l_{m} + 1}}^{{}}\) is a vector of \(l_{m}\) zeros, and \({ \Pr }\left\{ {\left. {\vec{\varvec{X}}_{FC}^{m} = {\vec{\mathbb{O}}}_{{l_{m} + 1}}^{{}} } \right|H_{0} } \right\}\) is defined in (35).

5 Throughputs of the Proposed CSS Strategy

As mentioned in previous sections, the values of \(N,\) \(M,\) \(\alpha ,\) \({\text{and}}\) \(\psi_{IW} \left( {m,k} \right)\) affect the performance of the proposed CSS. So, we should complete the proposed CSS strategy by providing a way to calculate the optimal values of its parameters. Toward this goal, several criterions can be considered. Since a CR is originally designed to improve the spectrum efficiency, maximizing the throughput of a CRN is one of more practical interest [5, 8, 9, 11, 15]. Therefore, our object is to find the optimal settings of the proposed CSS by maximizing the throughput of the CRN under the constraints of protecting PUs from SUs.

Let C 1 and C 0 denote the throughput of the CRN with and without existence of a PU, respectively. Hence, as seen in Fig. 2, the average achievable throughput by CRN is [5, 13]

$$R = \left( {1 - P_{f}^{FC} } \right)\left( {1 - \frac{{T_{s} + T_{FC} + \mathop \sum \nolimits_{m = 1}^{M} \left\{ {T_{r - in}^{slot} \left( m \right) + T_{r - out}^{slot} \left( m \right)} \right\}}}{{T_{F} }}} \right)C_{0} P\left( {H_{0} } \right) + \left( {1 - P_{d}^{FC} } \right)\left( {1 - \frac{{T_{s} + T_{FC} + \mathop \sum \nolimits_{m = 1}^{M} \left\{ {T_{r - in}^{slot} \left( m \right) + T_{r - out}^{slot} \left( m \right)} \right\}}}{{T_{F} }}} \right)C_{1} \left( {1 - P\left( {H_{0} } \right)} \right),$$
(40)

where \(P\left( {H_{0} } \right)\) is the probability that PU’s signal is absent. Since due to the interference \(C_{0} \gg C_{1}\), the first term in the right hand side of (40) dominates the achievable throughput [5, 13]. Therefore, the normalized achievable throughput can be approximated by

$$\bar{R}\left( {N,M,\alpha ,\psi_{IW} \left( {m,k} \right),\psi_{E} } \right) = \left( {1 - P_{f}^{FC} } \right)\left( {1 - \frac{{T_{s} + T_{FC} + \mathop \sum \nolimits_{m = 1}^{M} \left\{ {T_{r - in}^{slot} \left( m \right) + T_{r - out}^{slot} \left( m \right)} \right\}}}{{T_{F} }}} \right).$$
(41)

Furthermore, to simplify the following optimization problem, we consider \(\psi_{IW} = \psi_{IW} \left( {m,k} \right)\) for \(\forall m,k\). So, the optimization problem can be modeled as

$$\begin{array}{*{20}l} {\mathop { \hbox{max} }\limits_{{N,M,\alpha ,\psi_{IW} ,\psi_{E} }} } \hfill & {\bar{R}\left( {N,M,\alpha ,\psi_{IW} ,\psi_{E} } \right),} \hfill \\ {s.t. } \hfill & {\begin{array}{*{20}l} {P_{d}^{FC} \left( {N,M,\alpha ,\psi_{IW} ,\psi_{E} } \right) \ge P_{d,T} ,} \hfill \\ {1 \le N \le S,} \hfill \\ {1 \le M \le N,} \hfill \\ \end{array} } \hfill \\ \end{array}$$
(42)

where \(P_{d,T}\) denotes the target for detection probability so that the PU is sufficiently protected.

The optimization problem in (42) is non-convex, and in order to solve it, we use an exhaustive search. Since \(N\) and \(M\) are integers, and, as we show in the following, \(\alpha\) and \(\psi_{IW}\) have discrete search space, the search space is limited.

Using \(I_{j} = \mathop \sum \nolimits_{i = 1}^{j - 1} A_{i}^{a}\), we can simplify (27) as

$$F_{j} \left( {\left. {\vec{\varvec{A}}_{j}^{a} } \right|H_{y} } \right) = \left\{ {\begin{array}{*{20}l} {1,} \hfill & {I_{j} < L_{2},} \hfill \\ {\Pr \left\{ {d_{j}^{m} = 0|H_{y} } \right\},} \hfill & {L_{2} \le I_{j} < L_{1},} \hfill \\ {0,} \hfill & {L_{1} \le I_{j} ,} \hfill \\ \end{array} } \right.$$
(43)

where \(L_{1} = \psi_{IW} /w_{m}\) and \(L_{2} = \left( {\psi_{IW} - 1} \right)/w_{m}\). Since \(I_{j}\) is an integer, one, the smallest integer, is a reasonable step-size for \(L_{1}\) and \(L_{2}\) to change \(F_{j} \left( {\left. {\vec{\varvec{A}}_{j}^{a} } \right|H_{y} } \right)\) (i.e. \(P_{f}^{FC}\) and \(P_{d}^{FC}\)). Therefore, the search space of \(\alpha\) and \(\psi_{IW}\), which is equivalent to \(L_{1}\) and \(L_{2}\), are discrete.

6 Simulation Results

In this section, we evaluate the proposed CSS strategy using simulations and assuming reporting and inner channels are Rayleigh flat fading. Other related CRN parameters are defined in Table 1.

Table 1 CRN parameters

In the first step, we evaluate our proposed clustering algorithm to verify the evenly distribution of energy consumption. In simulations, \(N = 14\) SUs in the CRN and \(SNR_{in} = SNR_{i} = 10dB\), and we divide them into three clusters. Accordingly, Fig. 3 shows the MSDE for different values of \(n_{1}\) and \(n_{2}\), (\(n_{3} = N - n_{1} - n_{2}\)), that are equivalent to different clustering algorithms. As seen in Fig. 3, the least MSDE, which is equivalent to the best evenly distribution of energy consumption, occurs at 3 points, given in Table 2. On the other hand, using (8), the optimal number of SUs in each cluster can be expressed as follows

$$\hat{n}_{m} = \left\{ {\begin{array}{*{20}l} {5,} \hfill & {1 \le m \le 2,} \hfill \\ {4,} \hfill & {m = 3.} \hfill \\ \end{array} } \right.$$
(44)
Fig. 3
figure 3

MSDE versus different values of \(n_{1}\) and \(n_{2}\) \(\left( {N = 14, M = 3,\;SNR_{in} = SNR_{r} = 10 \,{\text{dB}}} \right)\)

Table 2 Optimal numbers of SUs in each cluster based on simulation results \(\left( {N = 14, M = 3, \;SNR_{in} = SNR_{r} = 10\, {\text{dB}}} \right)\)

Since the ordering of clusters is not important here, these results are equivalent. Thus, the clustering algorithm given in Theorem 1, introduce an energy efficient algorithm.

In the second step, we compare minimum values of \(P_{f}^{IW} \left( {m,k} \right)\) subject to \(P_{d}^{IW} \left( {m,k} \right) \ge 0.9\) achieved by the proposed IW fusion rule with various hard combining fusion rules and no-improvement case (\(w_{m} = 0\)). Without loss of generality, we choose \(\psi_{IW} \left( {m,k} \right) = 0.98\) for \(\forall k\) in the IW fusion rule, which needs to know only the value of \(SNR_{in}^{m}\), and OR (\(w_{m} = 1,\psi_{IW} \left( {m,k} \right) = 1\)), AND (\(w_{m} = 1,\psi_{IW} \left( {m,k} \right) = k\)), and MAJORITY (\(w_{m} = 1,\psi_{IW} \left( {m,k} \right) = \lceil k/2\rceil\)) fusion rules as references. Figure 4 shows this comparison for SU 6 in cluster m for different \(SNR_{in}^{m}\) values. As seen, by selecting an appropriate value for \(\alpha\), the proposed IW fusion rule shows a better performance at low to high SNRs. At very low SNR, the proposed IW fusion rule is similar to no-improvement case (\(w_{m} \to 0\)), where the MAJORITY fusion rule shows good sensing performance. At very high SNR, the proposed IW fusion rule is similar to the OR fusion rule (\(w_{m} \to 1\)).

Fig. 4
figure 4

Minimum \(P_{f}^{IW} \left( {m,6} \right)\) subject to \(P_{d}^{IW} \left( {m,6} \right) \ge 0.9\) achieved by various \(SNR_{in}^{m}\) over Rayleigh faded inner channel (SU 6)

Figure 5 shows the above comparison as sensing improvement at SU k in cluster m for \(SNR_{in}^{m} = 6\,{\text{dB}}\). As seen, the proposed CSS strategy outperforms all fusion rules for \(k > 3\). For \(k \le 3\), OR-rule shows better performance, which can be seen as a especial case of the proposed IW fusion rule, when \(\alpha \to 0\) and \(\psi_{IW} \left( {m,k} \right) = 1.\)

Fig. 5
figure 5

Minimum \(P_{f}^{IW} \left( {m,k} \right)\) subject to \(P_{d}^{IW} \left( {m,k} \right) \ge 0.9\) achieved by SU \(k\) in the cluster m over Rayleigh faded inner channel (\(SNR_{in}^{m} = 6\,{\text{dB}}\))

In the final step, we compare our CSS with three baseline schemes. The first scheme, called S1 has \(w_{m} = 0\), in which the CRN doesn’t use IW fusion rule and its reliability method for the selection of decisions. In the second scheme, S2, the cooperative SUs report to the FC using a direct transmission with no diversity gain (\(N = M\)), which is the same as usual CSS with OR-rule. The third scheme, S3, has fixed \(M = 1\), in which SUs don’t participate in the proposed clustering. The optimal settings for the proposed strategy and these three schemes are obtained by solving the optimization problem given by Eq. (42). As mentioned in Fig. 5 of [15], for one PU channel (one sensing channel), the proposed CSS doesn’t use clustering approach, and its sensing performance is the same as those of usual CSS with OR-rule at the FC. On the other hand, as described in Sect. 2, we assume a CRN with one sensing channel. Therefore, with one sensing channel and imperfect reporting channels, the CSS scheme in [15] is the same as S2 scheme.

Figures 6 and 7 show the maximum normalized achievable throughput of our proposed strategy and these schemes in scenarios with average SNR of inner and reporting channels for \(P_{d,T} = 0.9\). When errors in inner channels are worse than reporting channels (so that approximately when \(SNR_{r} > SNR_{in}\)), cooperative communication methods are inefficient, and the optimum method of reporting is direct transmission (non-cooperative) to the FC. As seen in Figs. 6 and 7, and as expected, in this condition, the proposed strategy sets the number of clusters equal to the number of cooperative SUs, and its performance is the same as S2. However, in practical environments, the errors in inner channels are often better than reporting channels (i.e. \(SNR_{r} < SNR_{in}\)). In this condition, the performance of proposed CSS strategy is much better than S2, which is used by default (Figs. 6, 7). Furthermore, in this condition (\(SNR_{r} < SNR_{in}\)), the importance of using the proposed IW rule and its reliability method in the CSS can be seen more clearly. As seen in Figs. 6 and 7, CRN with S1 scheme achieves less throughput compared to the proposed CSS strategy, which this difference increases by increasing \(SNR_{in}\) since, in S1 scheme, all local decisions are sent to the FC without any improvement. However, in proposed strategy, by increasing \(SNR_{in}\), without losing sensing efficiency, the less number of decisions in a shorter overhead time are sent to the FC.

Fig. 6
figure 6

Maximum normalized achievable throughput versus \(SNR_{in}\) for different CSS strategies and \(P_{d,T} = 0.9,\) a \(SNR_{r} = 2\,{\text{dB}},\) b \(SNR_{r} = 7\,{\text{dB}}\)

Fig. 7
figure 7

Maximum normalized achievable throughput versus \(SNR_{r}\) for different CSS strategies and \(P_{d,T} = 0.9,\) a \(SNR_{in} = 9\,{\text{dB}},\) b \(SNR_{in} = 14\,{\text{dB}}\)

In addition, as seen in Figs. 6 and 7, the performance of S3 is very poor in low values for \(SNR_{in}\) since, as we proved in [12], when \(SNR_{in}\) decreases in one cluster, the ODSTBC BER increases. To overcome this problem, number of SUs in the corresponding cluster must be reduced. However, when \(M = 1\), this reduction reduces the number of cooperative SUs, \(N\), causing severe degradation in S3 performance. Therefore, we use clustering in the proposed strategy to benefit diversity-throughput tradeoff. Furthermore, this clustering is energy-efficient that guarantees the maximum life-time of the CRN. As a result, we see that the proposed CSS strategy outperforms the three mentioned schemes at all SNRs. For example in Fig. 7b, the average of this throughput improvement is more than 17% compared to scheme S2.

Tables 3, 4, 5 and 6 show optimal values for key design parameters, which are the number of cooperative SUs, \(N_{opt}\), number of clusters, \(M_{opt} ,\) and IW fusion parameters, \(\alpha_{opt} , \psi_{IW,opt}\), for different SNRs of inner and reporting channels, and \(P_{d,T} = 0.9\). It is notable that when \(N_{opt} = M_{opt}\), the proposed CSS strategy uses non-cooperative transmission. So, there are no optimal values for \(\alpha_{opt}\) and \(\psi_{IW,opt}\) in Tables 5 and 6.

Table 3 Optimal number of cooperative SUs, \(N_{opt}\), for the proposed CSS strategy (\(P_{d,T} = 0.9\))
Table 4 Optimal number of clusters, \(M_{opt} ,\) for the proposed CSS strategy (\(P_{d,T} = 0.9\))
Table 5 Optimal IW fusion rule parameter, \(\alpha_{opt} ,\) for the proposed CSS strategy (\(P_{d,T} = 0.9\))
Table 6 Optimal threshold of IW fusion rule, \(\psi_{IW,opt}\) for the proposed CSS strategy (\(P_{d,T} = 0.9\))

Figure 8 shows the maximum normalized achievable throughput of our proposed strategy and the mentioned three schemes versus target probability of detection, \(P_{d,T}\). As seen, regardless of \(P_{d,T}\), by using the proposed CSS strategy, we can achieve highest throughput for the CRN; so that the average of the throughput improvement is more than 11% compared to scheme S2 (the proposed CSS scheme in [15]).

Fig. 8
figure 8

Maximum normalized achievable throughput versus \(P_{d,T}\) for different CSS strategies (\(SNR_{r} = 7\,{\text{dB}}, \;SNR_{in} = 14\,{\text{dB}}\))

7 Conclusions

In this paper, we consider the adverse effects of the reporting stage in a CSS, such as error and overhead time, which has not been fully studied in most of the literatures. Based on this, we proposed a new cluster-based CSS for a CRN to maximize CRN’s performance (CRN’s throughput) in a practical environment, as well as, to guarantee maximum CRN lifetime. Throughput maximization is done by using four techniques: an optimal clustering algorithm, a new IW fusion rule, a reliable method to select decisions, and a cooperative transmission based on ODSTBC. Simulation results confirm we need all four techniques to be performed simultaneously to maintain the performance of the CSS strategy for different SNR values.

Furthermore, these four techniques have four key parameters (the number of cooperative SUs, number of clusters, and two IW fusion rule parameters), which affect CRN’s throughput. Therefore, by deriving a closed-form expression for the CRN’s throughput, we obtain the optimal values of these parameters. As a result, we proposed a CSS strategy for different SNR values, which simulation results confirm its effectiveness. For example, in one usual case with imperfect channels, the average improvement of the throughput is more than 11% in the CRN, regardless of required protection for the PU.