1 Introduction

Radio spectrum is one of the scarce and valuable communication resources. In fact, many users seek to make efficient use of spectrum by using DSA techniques as introduced by cognitive radio networks (CRNs) [1]. In this regard, SUs may follow two different approaches to take advantage of these valuable resources. Given the first approach, SUs recognize the PU’s spectrum holes and use these opportunity for their transmissions. When PUs want to reuse their spectrum, SUs’ have the choice to either hop to other frequencies (spectrum overlay) or reduce their power as much as noise level, by using the power control and spread spectrum techniques (spectrum underlay) [2, 3]. The second approach is based on an awareness spectrum leasing method from the PUs to the SUs. PUs in a beneficial action lease a portion of their spectrum to SUs. SUs compete with each other to access these resources for their own transmissions. The cost of access to these resources is given by SUs who have won the competition. They communicate primary signals via cooperative transmission or pay remuneration.

If PUs suffer bad performance due to the channel fading, or they have a heavy traffic, then suitable SUs are selected as the cooperative relays to improve the performance of primary transmission. However, for most primary services, when the required traffic demand is satisfied, primary systems have no interest to increase their transmission rate any more, instead, they want to achieve certain benefit in other format, for example revenue, which is more interesting to them [4, 5]. Increasing the PU’s rate via cooperation, more opportunities are provided to SUs. Thus SUs can exploit these opportunities to their own transmissions and promote their QoS. Therefore, by exploiting cooperation between primary and secondary systems, both systems can increase their own interest and a win–win situation can be achieved [6, 7].

In the same vein in [8], authors considered the system, where a primary transmitter (PT) communicates with the intended receiver (PR). In the same spectrum band, a secondary (unlicensed) network composed of multiple transmitters receivers pairs{STi, SRi}, is seeking to exploit possible transmission opportunities. By comparing the cooperative and Non-cooperative transmission rates, PT decides whether to use the entire slot for direct transmission to PR or to employ cooperation. If PT chooses cooperative transmission, a portion of time frame belonging to PU, is leased to the suitable secondary relays which exploit decode and forward (DF) scheme and the remaining frame is divided into two subslots. The first and second subslots are dedicated for PT to ST and ST to PR transmissions, respectively. Since both SUs and PU are rational and selfish, which are interested into maximizing their own utilities, an optimization problem which is analyzed by the Stackelberg Game is proposed.

In [9] a mechanism which is focused amplify and forward (AF) cooperation protocol has been proposed. Therefore it is necessary to define power control policy on SUs, indicating how much power they are willing to spend for relaying the PUs signals. Thus the SU’s are forced to have variable power.

To overcome the problems that we encountered in TDMA, a two dimensional time–frequency domain leasing for an OFDM system has been proposed. To achieve high rates, PT allocates portion of these resources in a fraction of time and frequency with the subset of SUs who have the highest bid. SUs allocate some subcarriers for relaying the primary data and use the rest of the subcarriers for their own transmissions [10, 11].

This spectrum leasing scenario will be more complicated in a cellular network where multiple PUs live in coexistence of multiple SUs [12, 13]. Due to emerge of heterogeneity in both spatial and frequency domains in a realistic scenario, different spectrum provided by different owners have different costs [14, 15]. The prior works done on the spectrum leasing scenario have assumed that spectrums are identical while frequency diversity may cause non-identical conflicts among spectrum buyers since frequencies have different communication ranges (FH) and Spectrum availability varies in different geo-locations (SH). Hence, existing spectrum leasing schemes cannot provide truthfulness or efficiency in the realistic scenario. This paper addresses this issue by developing multiple primary users’ spectrum leasing scenario in presence of SH and FH. Since several PTs may simultaneously choose same secondary user, it is necessary to consider multi-antenna equipment on cooperative STs [16]. The main contributions of this work can be summarized as follows: (1) Multi primary users’ spectrum leasing in presence of space and frequency heterogeneities is designed and implemented based on the analytical result. Therefore it can provide truthfulness and efficiency in the realistic scenario. (2) To implement this system, an OFDMA system based on Frequency Hopping algorithm has been applied to the CCRN for the first time. (3) Proposing a developed technique in CR network by introducing a new parameter (SUs transmission rate satisfaction degree) as well as Reward and Punishment policy in order to modeling and overcome SH and FH respectively. (4) This model is formulated as a Stackelberg game and a unique NEP is achieved in analytical format. (5) Numerical analysis reveal that under our framework, both primary and secondary systems achieve more reliable and truthful performance compared to previous works.

The rest of this paper is outlined as follows. Section 2 present the detailed system model, including the network structure, TF strategy adopted in this paper, is described. New utility functions by adding new parameter according to SH and FH are defined for primary and secondary networks In Sect. 3. Backward induction is adopted to analyze the formulated Stackelberg game and the NEP is given, whose property is demonstrated with numerical results in Sect. 4. Finally Sect. 5 concludes the paper.

2 System Model

In this section, the model of CCRN and the Frequency Hopping algorithm have been described. Consider a communication cell with normalized radius as shown in Fig. 1. This system composed of K p number of \( \left\{ {PT_{i} } \right\}_{i = 1}^{{k_{p} }} \) plan to communicate with their Primary Access Point (PAP) and K s number of \( \left\{ {STj,SRj} \right\}_{j = 1}^{{k_{s} }} \) are seeking spectrum holes In order to exploit for their own transmission. All nodes except the PAP are mobile in our scenario. It should also be considered a predefined traffic requirement in terms of transmission rate R0i for each PT i contrary to secondary network. Each ST accesses the channel in best-effort manner.

Fig. 1
figure 1

Proposed system model for multi primary spectrum leasing scenario; a all PTs transmit directly; b To perform spectrum leasing PT1 and PT2 divide their own OFDM frame into two main parts and transmit to their selected subset (S1 = {ST1,ST2}, S2 = {ST2,ST3}) in 1st part; c Each STj uses a number of subcarriers for PTi retransmission toward the PAP; d remained subcarriers are leased for STj intra-link communication to it’s respective receiver

Each PT depending on it’s utility, decides to transmits directly (Fig. 1a) or cooperate with subsets of STs (Fig. 1b–d). In the latter case PT select suitable STs as cooperative relays, and in return, give them the chance to access the channel which is belongs to primary system. To overcome FH phenomenon, a simple Frequency Hopping technique [17] during the time slots which is performed by PAP has been used in an OFDMA system sketched in Fig. 2. Otherwise it may increases the secondary attention to the specific PT due to FH. This will be described in more detail later. The channels between the nodes are modeled as independent proper complex Gaussian random variables, with frequency coefficients assumed to be constant in time within a block of OFDM symbols (i.e., Rayleigh block-fading channels).

Fig. 2
figure 2

OFDMA technique with simple Frequency Hopping

The whole parameters used in this paper are gathered in Table 1. Therefore the non-cooperative transmission rate for each \( \left\{ {PT_{i} } \right\}_{i = 1}^{{k_{p} }} \) in an OFDMA-based system is calculated as follow:

$$ \left[ {R_{i,p} } \right]_{{k_{p} \times 1}} = \log_{2} \left( {1 + \frac{{\left| {h_{i,p} } \right|^{2} p_{{p_{i} }} }}{{N_{0} }}} \right) $$
(1)

Proposed System model for multi primary spectrum leasing scenario which has exploited DF relaying scheme has the following steps (Fig. 3): (1) Each PTi divides it’s own TF plane into two main parts and transmit to their selected subset Si in the first subsection. The first portion \( \alpha_{i} T_{s} \), \( \left( {0 < \{ \alpha_{i} \}_{i = 1}^{{k_{p} }} < 1} \right) \) is devoted for PTi transmission and the second \( \left( {1 - \alpha_{i} } \right)T_{s} \) dedicated for secondary intra link transmission (without loss of generality we assume \( T_{s} = 1 \)) (2) PTi transmits it’s data to the STj, \( j \in \left\{ {s_{i} } \right\} \) in the first subsection \( \alpha_{i} \beta_{i} T_{s} \), \( \left( {0 < \{ \beta_{i} \}_{i = 1}^{{k_{p} }} < 1} \right) \). (3) Each STj, \( j \in \left\{ {s_{i} } \right\} \) uses a number of subcarriers for PTi retransmission toward the PAP in the second subsection \( \alpha_{i} \left( {1 - \beta_{i} } \right)T_{s} \). (4) In the latter subsection, the selected STs access the channel in frequency-division multiplexing access (FDMA) mode to communicate with their intended receivers. Each STj, \( j \in \left\{ {s_{i} } \right\} \) takes advantage of PTi resources (\( \alpha_{ij} \)) proportional to the contribution it makes in the cooperative process (\( \theta_{ij} \)), which is related to it’s payment to PTi (\( c_{ij} {\text{which}} \, 0 \le c_{ij} \le \overline{c} \)).

Fig. 3
figure 3

Reward and punishment policy to overcome FH phenomenon

$$ \alpha_{ij} = \left( {1 - \alpha_{i} } \right)\theta_{ij} = \left( {1 - \alpha_{i} } \right)\frac{{c_{ij} }}{{\mathop \sum \nolimits_{{j \in \left\{ {s_{i} } \right\}}} c_{ij} }} $$
(2)

Therefore the PTi’s cooperative transmission rate to STj \( ({\text{R}}_{{{\text{i}},{\text{j}}}} ) \) (subsection A. Fig. 3) which is dominated by the worst channel \( {\text{h}}_{{{\text{i}},{\text{j}}}} \) in the subset Si is:

$$ R_{i,j} \left( {s_{i} } \right) = \log \left( {1 + \frac{{min_{{j \in \left\{ {s_{i} } \right\}}} \left| {h_{i,j} } \right|^{2} p_{{p_{i} }} }}{{N_{0} }}} \right) $$
(3)

For the subsection B, Assuming the PAP exploits maximum ratio combining before decoding the signal. Hence the effective SNR is equal to the sum of all the SNRs of each STj. Therefore, the achievable rate of the cooperative link is given by:

$$ R_{{J_{i} ,p}} = \log \left( {1 + \mathop \sum \limits_{{j \in \left\{ {s_{i} } \right\}}} \frac{{\left| {h_{i,j} } \right|^{2} p_{{s_{j} }} }}{{N_{0} }}} \right) ,\quad \forall j \in \left\{ {s_{i} } \right\} $$
(4)

The overall achievable rate of the DF cooperative transmission equals to the minimum rate of the two above stages:

$$ R_{{coop_{i} }} = min\left\{ {\beta_{i} R_{i,j} \left( {s_{i} } \right), \left( {1 - \beta_{i} } \right)R_{{J_{i} ,p}} \left( {s_{i} } \right)} \right\} $$
(5)

Each PTi allocates \( \alpha_{ij} \) fraction of it’s resources to the STj transmission, thus the achievable rate for each STj, \( j \in \left\{ {s_{i} } \right\} \) is calculated as a function of their contributions to cooperative process with PTi (\( \alpha_{ij} \)). This parameter will be considered later in (9):

$$ \left[ {R_{{jj_{i} }} } \right]_{{k_{p} \times 1}} = \log_{2} \left( {1 + \frac{{\left| {h_{j,j} } \right|^{2} p_{{s_{j} }} }}{{N_{0} }}} \right) ,\quad \forall j \in \left\{ {s_{i} } \right\} $$
(6)

It should be noted that the spectrums are non-identical and the use of PTi resources is affected by FH phenomenon. As is clear in (7), with the assumption of distance -invariant between nodes, FH states that the frequencies have different communication ranges [17] and different costs.

$$ L\left( {dB} \right) = 10\log f_{i}^{2} + \gamma \log d_{i,j} + p_{f} \left( n \right) - 28 $$
(7)

This issue affected the secondary users’ interests and causes to emerge two problems: (1) As mentioned before it may increases the secondary attention to the PTs who have lower frequencies and We solve it by using Frequency Hopping technique. (2) If several SUs were selected by PTi, Since SUs interest to lower frequencies it may cause to non-optimal use of TF plane belongs to the PTi. We also solve this problem by using Reward and Punishment policy as shown in Fig. 3. Therefore if STj cooperate with PTi at higher frequencies in subsections A and B, lower frequencies will be leased to it with lower prices in subsection C (Reward). Also if STj cooperate at lower frequencies, higher frequencies will be leased to it with higher prices (Punishment). Hence all frequencies will be identical.

Since PU is licensed user parameters announces α, β to selected subset STj, \( j \in \left\{ {s_{i} } \right\} \), so as to maximize its own utility in terms of both traffic rate and revenue. The STj selects it’s strategy (\( c_{ij} \)) to exploiting PTi’s spectrum based on the amount of the leased resources from the PTi (\( \theta_{ij} \left( {1 - \alpha_{i} } \right) \)). Each STj by defining a utility function which expresses it’s benefit in the cooperation process, tries to maximize it’s utility function. Hence it is important for each STj to select the best strategy without making too much payment and suggest it to the PTi in a competitive procedure. Each PTi chooses the best relay for cooperation after listening to different suggestion coming from the STs. In the next section, we formulate these spectrum leasing interactions.

3 Utility Functions and Nash Equilibrium Point

Based on aforementioned description, it can be said we have two-stage leader–follower game which can be analyzed under Stackelberg game framework. Thus PTi (game leader), optimizes its strategy \( \left\{ {\alpha_{i} ,\beta_{i} ,\left\{ {s_{i} } \right\}} \right\} \) based on the knowledge of the effects of its decision on the behavior of the followers (STs). Therefore we first define the primary and secondary utility functions then achieve unique NEP by solving the game.

3.1 Primary Utility Function

The PTi utility function consists of two components: (1) The PTi—utility with respect to it’s transmission rate satisfactory depending on chosen decision to cooperate or not. It has been shown that sigmoid function is a proper function to express user’s satisfaction with respect to demand traffic [6]. This decision will be taken after comparing between utility of direct link and cooperative transmission which is denoted by Di for each PTi. (2) The overall revenues obtained from all STj, \( j \in \left\{ {s_{i} } \right\} \):

$$ U_{{p_{i} }} = \frac{{\omega _{{p_{i} }} }}{{1 + e^{{ - \tau \left( {\alpha _{i} \left( {D_{i} R_{{p,i}} + \left( {1 - D_{i} } \right)R_{{coop_{i} }} } \right) - R_{{0_{i} }} } \right)}} }} + \left( {1 - D_{i} } \right)\sum\limits_{{j \in \left\{ {s_{i} } \right\}}} {c_{{ij}} } ,\quad \forall i \in \left\{ {1,2, \ldots ,k_{p} } \right\} $$
(8)

where all parameters are gathered in Table 1.

Table 1 Description of the symbols

3.2 Secondary Utility Function

Since the STj may be existed in several subsets which are chosen by different PTi, thus The STj utility function due to cooperate with PTs which had chosen STj in several subsets (\( {\text{S}}_{\text{i}} \)) can be defined as the sum of utility with respect to transmission rates they are able to achieve \( {\text{R}}_{{{\text{jj}}_{\text{i}} }} \left( {{\text{s}}_{\text{i}} } \right) \) minus its payment to the primary network. Since SUs have no traffic requirement on their transmission, their utility functions are linear with \( {\text{R}}_{{{\text{jj}}_{\text{i}} }} \left( {{\text{s}}_{\text{i}} } \right) \), which are proportional to the payment they are willing to pay.

$$ U_{{s_{j} }} = \mathop \sum \limits_{i} \left( {\omega_{ij} \alpha_{ij} R_{{jj_{i} }} \left( {s_{i} } \right) - c_{ij} } \right) $$
(9)

where \( \omega_{ij} \) is the STj transmission rate satisfaction degree contributes to the overall utility. In [69] it is a predefined coefficient and has the same value for all STs. While frequency diversity may cause non-identical conflicts among spectrum buyers since frequencies have different communication ranges (FH) and spectrum availability varies in different geo-locations (SH). We already removed FH by exploiting reward and punishment policy. Therefore if PTi cooperate with STj who is far from PTi, primary utility will be reduced due to \( R_{{coop_{i} }} \) decreasing. To compensate this utility decreasing the PTi will prefer to lease it’s own resources with more expensive prices (\( c_{ij} \)) to the farther STs. We solve this spectrum heterogeneity due to variable distances by using \( \omega_{ij} \) which is the level of STj transmission rate satisfactory contribute to the overall utility. Based on aforementioned description, it can be said \( \omega_{ij} \) is inversely proportional to \( d_{ij} \):

$$ \omega_{ij} = \frac{{d_{ij}^{p} }}{{\mathop \sum \nolimits_{i,j} d_{ij}^{p} }} $$
(10)

As is clear p should be negative and is a constant predefined parameter which will be obtained by plotting normalized utility difference (NUD) versus this parameter variation. Hence we define the NUD for each PTi as below:

$$ NUD = \frac{{U_{p} \left( p \right) - U_{p} \left( {p = 0} \right)}}{{U_{p} \left( {p = 0} \right)}} \times 100 $$
(11)

As can be observed in Fig. 4, when p is equal to −0.15 we have about 3 % improvement in the primary utility functions (PT1 and PT2). When p = 0, the space heterogeneity parameter \( \omega_{ij} \) is the same for all STs. In other words, with this value of p the model is more robust to the space heterogeneity. It is noteworthy that for the rest of our experiment, p is assumed to be equal to −0.15.

Fig. 4
figure 4

Normalized utility difference (NUD) versus different value of P

3.3 Nash Equilibrium Point

As mentioned before STj, \( j \in \left\{ {s_{i} } \right\} \) compete with each other in a Non-cooperative Payment selection Game (NPG), \( G = \left[ {\left\{ {s_{i} } \right\}, \left\{ {c_{ij} } \right\}, \left\{ {U_{{s_{j} }} \left( . \right)} \right\}} \right] \) based on the selected strategy by leader \( \left\{ {\alpha_{i} ,\beta_{i} ,\left\{ {s_{i} } \right\}} \right\} \). Each STj chooses its strategy within the strategy space \( {\mathbf{C}} = \left[ {C_{ij} } \right]_{{j \in s_{i} ,i \in \left\{ {1,2, \ldots ,K_{p} } \right\}}} \) which is given by solving gradient Eq. (11):

$$ \nabla_{{\underline{c} }} U_{{s_{j} }} = \left[ {\frac{{\partial U_{{s_{j} }} }}{{\partial c_{1j} }} \frac{{\partial U_{{s_{j} }} }}{{\partial c_{2j} }} \ldots \frac{{\partial U_{{s_{j} }} }}{{\partial c_{{k_{p} j}} }}} \right]^{\text{t}} = 0 ,\quad \forall j \in \left\{ {1,2, \ldots , k_{s} } \right\} $$
(12)
$$ \varvec{C} = \left[ {\begin{array}{*{20}c} {c_{11} } & \cdots & {c_{{1k_{s} }} } \\ \vdots & \ddots & \vdots \\ {c_{{k_{p} 1}} } & \cdots & {c_{{k_{p} k_{s} }} } \\ \end{array} } \right]_{{k_{p} \times k_{s} }} $$
(13)

It should be noted that solving \( \frac{{\partial U_{{s_{j} }} }}{{\partial c_{1j} }} = 0 \) yields the first column of the C contains \( \left[ {c_{11} ,c_{12} , \ldots ,c_{{1k_{p} }} } \right]^{t} \). If STj doesn’t cooperate with any PTi it leads to the corresponding matrix element will be equal to zero. It has been shown that necessary and sufficient conditions for this class of NPG game to demonstrate NE existence and uniqueness are satisfied [6]. Now solving (12) yields a unique NE for NPG game as:

$$ c_{{mn}}^{*} = \frac{{\left( {1 - \alpha _{m} } \right)\left( {k_{{s_{m} }} - 1} \right)\left[ {\left( {\sum\nolimits_{{j \in \left\{ {s_{m} } \right\}}} {\frac{1}{{\omega _{{mj}} R_{j} }}} } \right) - \frac{{k_{{s_{m} }} - 1}}{{\omega _{{mn}} R_{n} }}} \right]}}{{\left( {\sum\nolimits_{{j \in \left\{ {s_{m} } \right\}}} {\frac{1}{{\omega _{{mj}} R_{j} }}} } \right)^{2} }},\quad \forall m \in \left\{ {1,2, \ldots ,k_{p} } \right\},n \in \left\{ {1,2, \ldots ,k_{s} } \right\} $$
(14)

Since we knew \( c_{ij} \) is bounded (i.e. \( 0 \le c_{ij} \le \overline{c} \)), we should define new constraints which is adapted to this achieved optimal point (\( c_{mn}^{*} \)). These constraints will be used by the primary user to select optimal cooperative relay set.

$$\left( {\sum\limits_{{j \in \left\{ {s_{m} } \right\}}} {\frac{1}{{\omega _{{mj}} R_{j} }}} } \right) - \frac{{k_{{s_{m} }} - 1}}{{\omega _{{mn}} R_{n} }} > 0 $$
(15)
$$ \frac{{\left( {1 - \alpha _{m} } \right)\left( {k_{{s_{m} }} - 1} \right)\left[ {\left( {\sum\nolimits_{{j \in \left\{ {s_{m} } \right\}}} {\frac{1}{{\omega _{{mj}} R_{j} }}} } \right) - \frac{{k_{{s_{m} }} - 1}}{{\omega _{{mn}} R_{n} }}} \right]}}{{\left( {\sum\nolimits_{{j \in \left\{ {s_{m} } \right\}}} {\frac{1}{{\omega _{{mj}} R_{j} }}} } \right)^{2} }} < \bar{c} $$
(16)

Now the sign of Di must be determined as follow:

$$ U_{{p_{iD} }} = \frac{{\omega_{p} }}{{1 + e^{{ - \tau \left( {\alpha_{i} R_{i,p} - R_{{0_{i} }} } \right)}} }} $$
(17)
$$ U_{{p_{{i_{coop} }} }} = \frac{{\omega_{p} }}{{1 + e^{{ - \tau \left( {\alpha_{i} R_{{coop_{i} }} - R_{{0_{i} }} } \right)}} }} + \mathop \sum \limits_{{j \in s_{i} }} c_{ij} $$
(18)
$$ D_{i} = \left\{ {\begin{array}{*{20}c} {0,} & {U_{{p_{iD} }} < U_{{p_{{i_{coop} }} }} } \\ {1,} & {U_{{p_{iD} }} > U_{{_{{p_{{i_{coop} }} }} }} } \\ \end{array} } \right. $$
(19)

Now PTi (game leader) can optimize its strategy \( \left\{ {\alpha_{i} ,\beta_{i} ,\left\{ {s_{i} } \right\}} \right\} \) based on the analytical result of NPG game with substituting (14) into (18) and calculate the first order derivative of \( U_{pi_{coop}} \) in respect with \( \alpha_{i} \). It is important to know that this decision will affect the behavior of the followers (STs).

$$ \frac{{dU_{{p_{{i_{coop} }} }} }}{{d\alpha_{i} }} = 0 \mathop \Rightarrow \limits^{yields} \alpha_{i}^{ * } = \frac{{\left( { - \frac{\ln \left( X \right)}{\tau } + R_{{0_{i} }} } \right)}}{{R_{{coop_{i} }} }} $$
(20)

where X is:

$$ X = \frac{{ \pm \sqrt {\left( {B - A} \right) - 4B^{2} } - \left( {2B - A} \right)}}{2B} $$
(21)

In which:

$$ A = \tau \omega _{p} \beta _{m} R_{{m,j}} \left( {s_{m} } \right),\quad B = \frac{{\left( {k_{{s_{m} }} - 1} \right)\left[ {\left( {\sum\nolimits_{{j \in \left\{ {s_{m} } \right\}}} {\frac{1}{{\omega _{{mj}} R_{j} }}} } \right) - \frac{{k_{{s_{m} }} - 1}}{{\omega _{{mn}} R_{n} }}} \right]}}{{\sum\nolimits_{{j \in \left\{ {s_{m} } \right\}}} {\frac{1}{{\omega _{{mj}} R_{j} }}} }} $$
(22)

We also should be aware that transmitted symbols allocated for PTi to STj should be equal to the number of subcarriers in the STj to PAP link in our proposed cooperative model [11]. Hence we have:

$$ \beta_{i} R_{i,j} \left( {s_{i} } \right) = \left( {1 - \beta_{i} } \right)R_{{J_{i} ,p}} \left( {s_{i} } \right) $$
(23)
$$ \beta_{i}^{ * } = \frac{{R_{{J_{i} ,p}} \left( {s_{i} } \right)}}{{R_{i,j} \left( {s_{i} } \right) + R_{{J_{i} ,p}} \left( {s_{i} } \right)}} $$
(24)

4 Numerical Results

In this section, simulation results are presented to demonstrate the impact of different spectrum leasing characteristics on the optimal multi primary cooperation scheme. A cognitive network includes kp number of PTi and ks number of STj–SRj pairs is considered in which the distance between the PTi and PAP is assumed to be normalized to 1 and secondary nodes are all placed at approximately the same normalized distance d (0 < d < 1) from the PT and 1 − d from the PAP. All parameters used in the simulation are set as follows in Table 2. Both primary user and secondary users transmit at a fixed power level without power control. To demonstrate SH and PH phenomenon’s separately, we consider two scenario in which the number of PTs are selected different so that kp (SH) and kp (FH) are set to 2 and 3 respectively. Therefore we present our simulation results in two subsections as follow:

Table 2 Description of the symbols

4.1 Multi Primary Spectrum Leasing Scenario with Considering Space Heterogeneity

In this subsection we only consider SH with respect to predefined parameter p which is calculated in subsection 3.2. Figure 5 shows the optimal parameters α and β, versus the distance between \( \{ PT_{i} \}_{i = 1}^{2} \) and various numbers of STj, \( 1 < j < 8 \) which are in subset \( \{ S_{i} \}_{i = 1}^{{k_{p} }} \). As is expected, with the increase of dij, secondary utilities are reduced (Fig. 6). Hence it leads to decrease the number of STj which interest to cooperate with PTi and the amount of leased resources from PTi (\( 1 - \alpha_{i}^{ * } \)). Increasing dij also causes to decreasing the broadcast transmission rate from PTi to STj (\( R_{i,j} \left( {s_{i} } \right) \)), while the cooperative transmission rate from STj to PAP (\( R_{{J_{i} ,p}} \left( {s_{i} } \right) \)) is increased. To receive certain amount of data and forward the same amount to intended receiver (PAP), more time is needed for the first broadcast stage and less is needed for the second cooperation stage. Therefore, \( \beta^{ * } \) increases when the normalized distance d becomes larger, which also agrees with the analysis result given in [6].

Fig. 5
figure 5

Optimal parameters \( \alpha^{ * } \) and \( \beta^{ * } \) versus normalized distance between STj and PTi

Fig. 6
figure 6

Secondary utilities versus normalized distance between STj and PTi

As is expected if PTi cooperate with STj which is far from PTi, primary utility will be reduced (Fig. 9) due to \( {\text{R}}_{{{\text{coop}}_{\text{i}} }} \) decreasing. In this case the PTi will prefer to lease it’s own resources with more expensive prices (\( {\text{c}}_{\text{ij}} \)) for the farther STs and not interested to increase it’s rate. We solve this spectrum heterogeneity due to variable distances by using \( \omega_{ij} \). Hence if distance becomes larger, \( \omega_{ij} \) should be decreased to prevent distant STs not to cooperate with PTi. Figures 7 and 8 show the total secondary users utility which are located in subset S1 and a specifc secondary utility (i.e. secondary utility 1) under different schemes versus the normalized distance dij respectively. To compare between different scheme we set p = −0.15 and p = 0 to introduce a system in precense of SH with and without considering \( \omega_{ij} \) as a STj transmission rate satisfaction degree contributes to the overall utility, respectively. As can be observed a system with considering \( \omega_{ij} \) is more robust against space variations.

Fig. 7
figure 7

Secondary utility 1 which cooperate with \( \{ PT_{i} \}_{i = 1}^{2} \) versus normalized distance between ST1 and PTi

Fig. 8
figure 8

Total secondary utility (S1) which cooperate with PT1 utility versus normalized distance between STj and PTi

Figure 9 illustrates primary utility under different schemes versus distance between STj and PTi. U P denotes the utility function of the optimal scheme, in which primary user leases some portion of it’s resources for secondary user and leverages them to transmit cooperatively. U 0 and U p with direct link denotes the primary user’s utility function when α = 0 and α = 1 respectively. U 0 implies that all the primary user’s resources is given to the secondary users to receive payment without sending any of its own data while U p with direct link refers to the primary utility when no cooperation is leveraged and all the channel is used for the licensed service. As we can see, the benefit is brought by the appropriate trade-off between two strategies (α = 0 and α = 1) and selecting the optimal amount of leased resources (\( 1 < \alpha_{i}^{ * } < 8 \)). Increasing \( \alpha_{i}^{ * } \) causes to decreasing the leased resources from PTi to STj and optimal scheme utility function U p is decreased dramatically (Fig. 10). This also cause to secondary utility decreasing which are selected by PTi (Fig. 11). Figure 12 also shows secondary utility 1 versus \( \left\{ {\alpha_{ij} } \right\}_{i = 1\,and\,j = 1}.\)

Fig. 9
figure 9

Different scheme of primary utility versus normalized distance between STj and PTi

Fig. 10
figure 10

Different scheme of primary utility versus normalized optimal parameter α

Fig. 11
figure 11

Total secondary utility (S1) which cooperate with PT1 utility versus normalized optimal parameter \( \alpha_{1}^{ * } \)

Fig. 12
figure 12

Secondary utility 1 which cooperate with PT1 versus optimal parameter \( \alpha_{11}^{ * } \)

4.2 Multi Primary Spectrum Leasing Scenario with Considering Space Heterogeneity

In this subsection we only consider FH which makes to emerge two problems. (1) It may affect the secondary users’ interests to the specific PTs who have lower frequencies and we solved it by using Frequency Hopping technique. Figure 13a shows \( \{ PT_{i} \}_{i = 1}^{3} \) which use OFDM subcarriers for their transmissions in 0 < t < 1. At this time PT3 which has lower frequency attract more attentions of STs and achieve more utility via cooperation with subset \( {\text{S}}_{3} = \left\{ {ST1,ST3,ST5,ST8} \right\} \). At the next time (1 < t < 2) PT2 has lower frequency and achieve more utility compare to other PTs (Fig. 13b).

Fig. 13
figure 13

Frequency Hopping technique to remove FH among PTs spectrums

(2) If several SUs were selected by PTi, Since SUs interest to lower frequencies it may cause to non-optimal use of TF plane belongs to the PTi. We also solve this problem by using Reward and Punishment policy which is described in Sect. 2. As we can observed in Figs. 14 and 15 this algorithm make system more robust against FH phenomenon.

Fig. 14
figure 14

Total secondary utility (S1) which cooperate with PT1 utility versus normalized optimal parameter \( \alpha_{1}^{ * } \) with considering FH phenomenon

Fig. 15
figure 15

Secondary utility 1 which cooperate with PT1 versus optimal parameter \( \alpha_{11}^{ * } W \) with considering FH phenomenon

5 Conclusions

The present paper was intended to propose a model to deal with the problem of space and frequency heterogeneity. To this end, multi primary users were utilized to state the space and frequency heterogeneity and subsequently, a new parameter \( \omega_{ij} \) as well as the Frequency Hopping and Reward and Punishment policy were introduced in order to consider the Space and Frequency Heterogeneity respectively. Finally to achieve optimal strategies (unique Nash Equilibrium Point (NEP)) which are selected by leaders (primary users) and followers (secondary users), the Stackelberg Game is applied. Numerical analysis reveal that under our framework, both primary and secondary systems achieve more reliable and truthful performance.