1 Introduction

Cognitive radio (CR) and multi-input multi-output (MIMO) communications have been proposed as promising solutions to improve spectrum utilization and efficiency in wireless networks. Dynamic and opportunistic spectrum access allows secondary users (SUs) to communicate on temporarily idle or underutilized frequencies. Moreover, MIMO systems enhance spectral efficiency by having a multi-antenna node simultaneously transmit multiple data streams. In newly emerging systems and standards (e.g., WiMAX, 4G Advanced-LTE, IEEE 802.16e) MIMO communications has been as a core feature. TV white bands have also been approved by the FCC for opportunistic, secondary use [1]. As a result, the current trend followed by the research society incorporates recent innovations of the two technologies into a single system.

The non-participation of primary users (PUs), as assumed in the traditional dynamic spectrum access schemes, is inefficient in terms of utilizing the spectrum. As a remedy, we can allow PUs to proactively manage the amount of secondary activity in their licensed band [2]. In [2, 3], a framework for dynamic spectrum leasing was proposed, where the PUs are rewarded for allowing SUs to operate in their licensed spectrum. Thus, the PUs have an incentive to allow SUs to access the spectrum whenever possible to the maximum extent. The spectrum leasing problem in a decentralized cognitive radio model was studied in [4], whereby a PU leases its bandwidth for a fraction of time to a network of independent SUs in exchange for cooperation.

The issue of resource allocation in MIMO CR networks was explored in [5,6,7,8]. The authors in [5] presented a low complexity semi-distributed algorithm for resource allocation in MIMO-OFDM based CR networks, using game theory approach, the strong duality in convex optimization and the primal decomposition method. In [6], the authors extended the pricing concept to MIMO-OFDM based CR networks and presented two iterative algorithms for resource allocation in such systems. In order to obtain an optimal subcarrier pairing, relay assignment and power allocation in MIMO-OFDM based CCRNs, the dual decomposition technique was recruited in [7] to maximize the sum-rate subject to the interference temperature limit of the PUs. Moreover, because of high computational complexity of the optimal approach, a suboptimal algorithm was further proposed in [7] and [8].

The optimal resource allocation in MIMO cognitive radio networks with heterogeneous secondary users, centralized and distributed users, was investigated in [9]. The authors in [10] modeled the problem of joint relay selection and power allocation in MIMO-OFDM based CCRNs as a two-level cooperative game problem with two objectives. The first objective is to assign each weak SU to one of the relays (rich SUs) through solving a problem achieved by a non-transferable utility coalition graph game, and the second one is to jointly allocating available channels to the SUs such that no subchannel is allocated to more than one SU and simultaneously optimize the transmit covariance matrices of nodes based on the Nash bargaining solution, which is the second level of the game.

The resource allocation problem in a spectrum leasing scenario in cooperative cognitive radio networks (CCRN) was addressed from various aspects, such as channel allocation for secondary system, relay selection, and power allocation [11, 14]. In [11], the joint problems of subchannel allocation, relay selection, and relay strategy in a multi-channel CCRN were considered and a unified framework based on Nash Bargaining Solutions was further proposed. When there exist one PU and multiple SUs in a single-channel system, the resource allocation problem for spectrum leasing was formulated as a Stackelberg game in [12, 13]. One of contributions of this work is to consider the existence of multiple channel and multiple PUs in the system which complicates the analysis and we further utilize the coalitional game theory to study the behavior of the various users in the system. The authors solved the problem of resource allocation in a spectrum leasing scenario in CCRN in [14], where the CR users allocate the whole their transmission power in a portion of transmission frame to relay the primary signals. In return, the PU pairs lease their unused portion of transmission frame to the SUs.

In this work, we firstly study a coalitional game-based approach for resource allocation in a multi-channel cooperative cognitive radio network with multiple PUs and SUs. We form the grand coalition through grouping all PUs and SUs in a set, where each PU can lease its spectrum to all SUs in a time-division manner. Meanwhile, the SUs help the data transmission of PUs by relaying, in return. Through validating that the solution concept of the coalitional game (the core) is nonempty, we prove that the grand coalition is stable in the proposed scenario. Therefore, we conclude that from the coalitional game- based point of view, the optimal case for the spectrum leasing scenario in MIMO cooperative cognitive radio networks (MIMO-CCRN) is to put all the PUs and the SUs in a coalition.

Afterwards, we study a MIMO-CCRN system with practical assumptions like the minimum rate demand of the PUs. We aim at maximizing simultaneously the data rate of the PUs and the SUs. Meanwhile, the interactions among the SUs to access the leased channels is modeled using the bargaining games to allocate the spectrum in a fair and optimal manner. In a brief, we propose a two-level cooperative game based approach to tackle the aforementioned problems. To provide more clarification, in the proposed framework, the interactions between the PU and the SUs are modeled as the first level bargain game while the second level bargain game is used to formulate the SUs decision making process on spectrum sharing. We analyze the optimal actions of the PU and the SU and derive the theoretic results for the one-PU one-SU scenario. We further extend the achievements for the one-PU multi-SU scenario using a revised numerical searching algorithm and then, we prove its convergence.

2 System Model and Basic Assumptions

2.1 System Setup

We consider a single-cell network where the \({N_{PU}}\) primary users (PUs) communicate with the primary base station (PBS), while \({N_{SU}}\) pairs of SUs are seeking vacant channels for transmitting data to their respective secondary receivers (SRs). It is assumed that the PUs transmit to the PBS taking advantage of the SUs cooperation and there is no direct link between the PUs and the PBS. All users are assumed to be equipped with multiple antennas and without losing the generality, it is further assumed that all the users are equipped with N antennas. We denote the set of PUs and SUs by \({\mathcal {N}_{PU}} = \left\{ {1, \ldots ,{N_{PU}}} \right\} \) and \({\mathcal {N}_{SU}} = \left\{ {1, \ldots ,{N_{SU}}} \right\} \), respectively. Without losing the generality and for ease of exposition, we assume that the i-th PU communicates with PBS using the i-th subchannel only, and the set of all PUs’ subchannels in the primary network is denoted by \({\mathcal {N}_F} = \left\{ {1, \ldots ,{N_{PU}}} \right\} \). The secondary transmitters can only communicate their respective receivers using the subchannels leased from a group of PUs, and as a result, a coalition is formed between the PUs and the SUs.

As described earlier, we firstly aim at studying a coalitional game based method to resource allocation in a multi-channel cooperative cognitive radio network with multiple PUs and SUs. A coalition \(\mathcal {S} = {\mathcal {S}_{PU}} \cup {\mathcal {S}_{SU}}\) is a subset of \({\mathcal {N}_{PU}} \cup {\mathcal {N}_{SU}}\) in which multiple PUs and SUs cooperate, where \({\mathcal {S}_{PU}} \subseteq {\mathcal {N}_{PU}}\) and \({\mathcal {S}_{SU}} \subseteq {\mathcal {N}_{SU}}\). The coalition \({\mathcal {N}_{PU}} \cup {\mathcal {N}_{SU}}\) is called the grand coalition. A simple example of the system model with two PUs and two SUs is shown in Fig. 1. According to Fig. 1, PU 1 and SU 1 may form a coalition \({\mathcal {S}_1}\) and PU 2 and SU 2 form another coalition \({\mathcal {S}_2}\), or all of them form a grand coalition. The benefits of the grand coalition for PUs and SUs are summarized in the following:

  • It will be possible for the PUs to select different relay nodes and thereby, PUs gain more revenues.

  • Forming the grand coalition enables the SUs to obtain more opportunities to access subchannels leased by different PUs.

Our objective at the first part of this work is to validate the above two statements using the the coalitional game theory [15].

Fig. 1
figure 1

The system model for the spectrum leasing in MIMO-CCRN from a coalitional game based point of view and the transmission slots: a secondary transmitters (STs) receive primary signals In the first time slot; b In the second time slot, STs cooperate with PUs or transmit signals to their respective secondary receivers (SRs)

subsectionCoalition-based Transmission Conventions

Before proceeding to the game theoretic formulation and analysis, we describe the cooperation mechanism and assumptions in the following:

  1. 1.

    It is possible for the SUs to use different subchannels at any given time.

  2. 2.

    In the formed coalition, each PU has only one available subchannel to lease to the SUs. When an SU relays the PU is traffic, it uses the subchannel i leased by PU i.

  3. 3.

    Cooperative transmissions for PUs and SUs are realized in a time-division fashion. As shown in Fig. 1, the first time slot is for PUs transmission and the second time slot for SUs transmission. The second time slot is further divided into two parts, i.e., SUs relaying PUs’ data and transmitting their own data. In the second time slot, the fraction of time that SU j relays PU i’s traffic (using subchannel i) and transmits its own traffic using subchannel l are denoted by \({\beta _{j,i}}\) and \({\alpha _{j,l}}\), respectively, where \(0 \le {\beta _{j,i}} \le 1\) and \(0 \le {\alpha _{j,l}} \le 1\) for all \(i \in {\mathcal {S}_{PU}}\), \(j \in {\mathcal {S}_{SU}}\), and \(l \in {\mathcal {S}_F}\).

  4. 4.

    The cooperation strategy in the proposed scenario is Amplify-and-forward (AF). Meanwhile, we assume flat Rayleigh fading for all channels which is invariant within each time-slot. The PBS uses maximal ratio combining to combine signals from the direct link and cooperative links. The PBS is also aware of the channel state information (CSI).

2.2 Utility Function

When PU m transmits to the PBS taking advantage of the cooperation of SU k, the received signal at the PBS can be written as

$$\begin{aligned} {{\mathbf{y}}_m} = {{\mathbf{G}}_k}{{\mathbf{A}}_{m,k}}{{\mathbf{H}}_{m,k}}{{\mathbf{x}}_m} + {{\mathbf{G}}_k}{{\mathbf{A}}_{m,k}}{{\mathbf{z}}_k} + {{\mathbf{w}}_m} \end{aligned}$$
(1)

where \({{\mathbf{x}}_m} \in {\mathbb {C}^{N \times 1}}\) denotes the transmit signal of PU m; \({{\mathbf{H}}_{m,k}}\) and \({{\mathbf{G}}_k}\) are channel matrices from PU m to SU k and from SU k to PBS, respectively; \({{\mathbf{A}}_{m,k}}\) denotes the amplification matrix at SU k for the signals of PU m; The noise contribution at SU k and the PBS are denoted by \({{\mathbf{z}}_k}\sim \mathcal {CN}\left( {{\mathbf{0}},{\sigma ^2}{{\mathbf{I}}_N}} \right) \) and \({{\mathbf{w}}_m}\sim \mathcal {CN}\left( {{\mathbf{0}},{\sigma ^2}{{\mathbf{I}}_N}} \right) \), respectively, where we have assumed the same noise power at all SUs and PBS. Due to the availability of the full CSI at all nodes, we can use the singular value decomposition of the channel matrices to determine transmit and receive beamforming matrices at all nodes. Since these matrices are unitary they do not change the statistics of the channel and therefore preserve the mutual information and error performance of this link. The singular value decomposition of the channel matrices is given by

$$\begin{aligned} \begin{aligned} {{\mathbf{H}}_{m,k}}&= {{\mathbf{U}}_{m,k}}{\varLambda _{m,k}}{\mathbf{V}}_{m,k}^H \\ {{\mathbf{G}}_k}&= {{\mathbf{S}}_k}{\varGamma _k}{\mathbf{D}}_k^H \\ \end{aligned} \end{aligned}$$
(2)

Using the appropriate beamforming, the resulting transmit signal at PU m is thus given by \({{\mathbf{x}}_m} = {{\mathbf{V}}_{m,k}}{{\tilde{\mathbf{x}}}_m}\) and the amplification matrix has the structure \({{\mathbf{A}}_{m,k}} = {{\mathbf{D}}_k}{\varPi _{m,k}}{\mathbf{U}}_{m,k}^H\). The PBS multiplies the received signal in (1) by \({\mathbf{S}}_k^H\) such that it can be expressed as

$$\begin{aligned} {{\tilde{\mathbf{y}}}_m} = {\mathbf{S}}_k^H{{\mathbf{y}}_m} = {\varGamma _k}{\varPi _{m,k}}{\varLambda _{m,k}}{{\tilde{\mathbf{x}}}_m} + {\varGamma _k}{\varPi _{m,k}}{{\tilde{\mathbf{z}}}_k} + {{\tilde{\mathbf{w}}}_m} \end{aligned}$$
(3)

where \({{\tilde{\mathbf{z}}}_k} = {\mathbf{U}}_{m,k}^H{{\mathbf{z}}_k}\) and \({{\tilde{\mathbf{w}}}_m} = {\mathbf{S}}_k^H{{\mathbf{w}}_m}\) denote the equivalent noise contributions.

Up till now, we have no restriction on the structure of the gain matrix \({\varPi _{m,k}}\). The optimal structure of \({\varPi _{m,k}}\) is found through the maximization of the achievable data rate of PU m which is given by

$$\begin{aligned} R_{m,k}^P = {\beta _{k,m}}\log \det \left( {{{\mathbf{I}}_N} + \frac{1}{{{\sigma ^2}}}{{\left( {{{\mathbf{I}}_M} + {\varGamma _k}{\varPi _{m,k}}\varPi _{m,k}^H\varGamma _k^H} \right) }^{ - 1}}{\varGamma _k}{\varPi _{m,k}}{\varLambda _{m,k}}{\varPhi _m}\varLambda _{m,k}^H\varPi _{m,k}^H\varGamma _k^H} \right) \end{aligned}$$
(4)

where we have assumed that transmit signals Gaussian with \({\varPhi _m} = E\left\{ {{{{\tilde{\mathbf{x}}}}_m}{\tilde{\mathbf{x}}}_m^H} \right\} \). Owing to Hadamard inequality, a requirement for maximizing the determinant in (4) is that the rows (or columns) of the matrix within the determinant have to be orthogonal [16]. Using this condition and the observation that all matrices in (4) are diagonal, we pick the gain matrix \({\varPi _{m,k}}\) also as a diagonal matrix. This leads to a diagonal overall matrix in the determinant which maximizes (4). Therefore, we define \({\varPi _{m,k}}\) as

$$\begin{aligned} {\varPi _{m,k}} = {\text{diag}}\left\{ {{\pi _{m,k,1}}, \ldots ,{\pi _{m,k,N}}} \right\} \end{aligned}$$
(5)

with

$$\begin{aligned} {\pi _{m,k,n}} = \sqrt{\frac{{P_{k,n}^s}}{{P_{m,n}^p{\lambda _{m,k,n}} + {\sigma ^2}}}} \end{aligned}$$
(6)

where \(P_{m,n}^p\) and \(P_{k,n}^s\) denote the transmit power of PU m and cooperating SU k on the n-th spatial subchannel, respectively. \({\lambda _{m,k,n}}\) represents the n-th diagonal element of \({\varLambda _{m,k}}\). Therefore, the achievable data rate of PU m, taking advantage of the cooperation of SU k, can be written as

$$\begin{aligned} R_{m,k}^P = \sum \limits _{n = 1}^N {{\beta _{k,m}}\log \left( {1 + \frac{{P_{m,n}^p{a_{m,k,n}}P_{k,n}^s{b_{k,n}}}}{{1 + P_{m,n}^p{a_{m,k,n}} + P_{k,n}^s{b_{k,n}}}}} \right) } \end{aligned}$$
(7)

where \({a_{m,k,n}} = \frac{{{\lambda _{m,k,n}}}}{{{\sigma ^2}}}\) and \({b_{k,n}} = \frac{{{\gamma _{k,n}}}}{{{\sigma ^2}}}\). \({\gamma _{k,n}}\) denotes the n-th diagonal element of \({\varGamma _k}\). Hence, the achievable data rate of PU m can be expressed as

$$\begin{aligned} R_m^P = \sum \limits _{k \in {\mathcal {S}_{SU}}} {R_{m,k}^P} \end{aligned}$$
(8)

In case of leasing the subchannel of PU m to SUs, the cost of PU m’s spectrum leasing can be modeled as the total fraction of time that its subchannel is used by SUs, i.e.,

$$\begin{aligned} {\mu _m} = \sum \limits _{k \in {\mathcal {S}_{SU}}} {{\alpha _{k,m}}} \end{aligned}$$
(9)

Therefore, the utility of PU m can be written as

$$\begin{aligned} U_m^P = F\left( {R_m^P} \right) + c_m^P - G\left( {{\mu _m}} \right) \end{aligned}$$
(10)

where \(F\left( . \right) \) is a concave function which maps the achievable data rate of the PU to a utility gain, \(G\left( . \right) \) is a convex function which maps the cost of each PU to a utility loss, and \(c_m^P = \eta \sum \limits _{k \in {S_{SU}}} {{\alpha _{k,m}}} \) denotes the amount of payment of SUs to PU m with \(\eta \) representing the price of frequency use per unit time. The achievable data rate of SU k can be expressed as

$$\begin{aligned} R_k^S = \sum \limits _{j \in {\mathcal {S}_F}} {R_{k,j}^S} \end{aligned}$$
(11)

where \(R_{k,j}^S = {\alpha _{k,j}}{R_{k,j}}\) represents the maximum data rate of the k-th SU link using the j-th channel and using the appropriate transmit and receive beamforming in each SU link, we have,

$$\begin{aligned} {R_{k,j}} = \sum \limits _{n = 1}^N {\log \left( {1 + \frac{{P_{k,n}^s{\theta _{k,j,n}}}}{{{\sigma ^2}}}} \right) } \end{aligned}$$
(12)

where \({\theta _{k,j,n}}\) denotes the n-th eigen-value of the channel matrix in the k-th SU link over the j-th channel. Then, the utility of the k-th SU link can be expressed as

$$\begin{aligned} U_k^S = L\left( {R_k^S} \right) - c_k^S \end{aligned}$$
(13)

where \(L\left( . \right) \) is a concave function to project the rate to the revenues and \(c_k^S = \eta \sum \limits _{k \in {S_{SU}}} {{\alpha _{k,j}}} \) denotes the payment of the k-th SU to the PUs.

In the next section, we recruit the coalitional cooperative games to formulate and analyze the performance of the proposed system model.

3 Coalitional Cooperative Game Based Formulation

3.1 Concepts and Game-Theoretic Problem Formulation

A classification of the coalitional games can be based on the transferability of the utility of the game [15]. Therefore, a coalitional game can be either with transferable utility (TU) or with non-transferable utility (NTU). The coalitional game in which the coalition value can be apportioned arbitrarily between the coalition members is called a TU game. Similarly, when the apportioning of the coalition value depends on the joint actions of the members in the coalition, the resulting game is referred to as an NTU game. As will be proved, since we define the coalition value as the sum of utilities generated by PUs and SUs in the coalition, a TU game applies to our considered network. Moreover, as will be shown, the considered network in this paper fulfills the properties of a canonical coalitional game, e.g., grand coalition. Thus, we formulate the cognitive radio network as a canonical coalitional game with TU. In what follows, we present some coalitional game-related terminologies firstly, and then we prove that this coalitional game satisfies the desired properties that ensure a stable grand coalition and a fair payoff allocation.

A coalitional game with TU comprises of a set of players who form cooperative groups and the coalition value which correlates each nonempty subset of players with a number [15]. First of all, we provide the definition of the coalition value for the considered game.

Definition 1

For the proposed canonical coalitional game with TU, the coalition value \(\upsilon \left( \mathcal {S} \right) \) of a coalition \(\mathcal {S}\) is the maximum sum utility of PUs and SUs in \(\mathcal {S}\)and depends only on the actions of the PUs and SUs in \(\mathcal {S}\) (not those in \({\mathcal {N}_{PU}} \cup {\mathcal {N}_{SU}}\backslash \mathcal {S}\)).

Recruiting Definition 1 and using the utility definitions in (10) and (13), the coalition value \(\upsilon \left( \mathcal {S} \right) \) is the solution to the following convex optimization problem:

$$\begin{aligned} \upsilon \left( \mathcal {S} \right) \triangleq&\max \sum \limits _{m \in {\mathcal {S}_{PU}}} {U_m^P} + \sum \limits _{k \in {\mathcal {S}_{SU}}} {U_k^S} \nonumber \\ {\text{s.t. }}{\text{}}&{\text{(C1):}}\sum \limits _{j \in {\mathcal {S}_F}} {{\alpha _{k,j}}} + \sum \limits _{m \in {S_{PU}}} {{\alpha _{k,m}}} = 1 \nonumber \\&{\text{(C2):}}\sum \limits _{k \in {\mathcal {S}_{SU}}} {\left( {{\alpha _{k,j}} + {\beta _{k,m}}} \right) } = 1 \nonumber \\&{\text{(C3): 0}} \le {\alpha _{k,j}} \le {\text{1, 0}} \le {\beta _{k,m}} \le {\text{1}} \end{aligned}$$
(14)

where \(U_m^P\) and \(U_k^S\) the utility functions for PU m and SU k are given in (10) and (13), respectively. We require the set of time fractions \(\mathbf{\alpha } = \left\{ {{\alpha _{k,j}}\left| {k \in {\mathcal {S}_{SU}},j \in {\mathcal {S}_F}} \right. } \right\} \) and \(\mathbf{\beta } = \left\{ {{\beta _{k,m}}\left| {k \in {\mathcal {S}_{SU}},m \in {\mathcal {S}_{PU}}} \right. } \right\} \) to solve the problem in (14). Note that \(\mathbf{\alpha } \) is in \(U_m^P\) and \(U_k^S\); while \(\mathbf{\beta } \) exists in \(U_m^P\). The first constraint in (14), \(\left( {{\text{C1}}} \right) \), makes sure that each SU works in either relay or access mode but not simultaneously. The second constraint in (14), \(\left( {{\text{C2}}} \right) \), guarantees that the total fraction of time that an SU in coalition \(\mathcal {S}\) accesses the spectrum will not exceed one. Moreover, \(\left( {{\text{C3}}} \right) \) is to guarantee a feasible joint action of coalition \(\mathcal {S}\) [17]. If the solution to problem in (14) is not feasible, then \(\upsilon \left( \mathcal {S} \right) \triangleq - \infty \).

3.2 The Core of the Game

As stated earlier, for a coalitional game with TU, the coalition value can be divided arbitrarily among the participants in the coalition. A feasible payoff allocation policy for a canonical coalitional game guarantees that if a subset of players separates from the grand coalition, at least one PU or SU in the separated subset has a utility worse than that in the grand coalition. To obtain a feasible payoff allocation policy, we recruit the core solution concept [15] as defined in the following:

Definition 2

The core of the proposed coalitional game is the set of feasible payoff allocation vectors \({\mathbf{y}} = \left[ {y_1^P, \ldots ,y_{{N_{PU}}}^P,y_1^S, \ldots ,y_{{N_{SU}}}^S} \right] \) (where \(y_m^P\) and \(y_k^S\) represent the payoff value of PU m and SU k, respectively) for the grand coalition and is called an imputation if \(\sum \limits _{m \in {\mathcal {S}_{PU}}} {y_m^P} + \sum \limits _{k \in {\mathcal {S}_{SU}}} {y_m^S} = \upsilon \left( {{\mathcal {N}_{SU}} \cup {\mathcal {N}_{PU}}} \right) \) and \({y_i} \ge \upsilon \left( {\left\{ i \right\} } \right) \), \(\forall i \in \left( {{\mathcal {N}_{SU}} \cup {\mathcal {N}_{PU}}} \right) \). The core \(\mathcal {C}\) is the set of imputations for which \(\sum \limits _{m \in {\mathcal {S}_{PU}}} {y_m^P} + \sum \limits _{k \in {\mathcal {S}_{SU}}} {y_m^S} \ge \upsilon \left( \mathcal {S} \right) \) for all \(\mathcal {S} \subseteq {\mathcal {N}_{PU}} \cup {\mathcal {N}_{SU}}\), i.e.,

$$\begin{aligned} \mathcal {C}&= \left\{ {{\mathbf{y}}:\sum \limits _{m \in {\mathcal {S}_{PU}}} {y_m^P} + \sum \limits _{k \in {\mathcal {S}_{SU}}} {y_m^S} = \upsilon \left( {{\mathcal {S}_{SU}} \cup {\mathcal {S}_{PU}}} \right)\, {\text{ and}}} \right. \nonumber \\&\left. {{\text{ }}\sum \limits _{m \in {\mathcal {S}_{PU}}} {y_m^P} + \sum \limits _{k \in {\mathcal {S}_{SU}}} {y_m^S} \geqslant \upsilon \left( S \right) ,\forall S \subseteq {\mathcal {N}_{PU}} \cup {\mathcal {N}_{SU}}} \right\} \end{aligned}$$
(15)

According to Definition 2, if the core exists, it offers a set of stable payoff allocations because no coalition \(\mathcal {S} \subseteq {\mathcal {N}_{PU}} \cup {\mathcal {N}_{SU}}\) has the incentives to refuse the proposed payoff allocation and leave the grand coalition to form a separate coalition instead.

In the following, we prove that the proposed canonical coalitional game with TU has a nonempty core. At first, we find the dual problem of (14) using the duality theorem [18]. Using the transformation of the primal problem in (14) to the dual problem, the relation of the solution set between the dual problem and the primal problem can be obtained. In other words, the optimal solution of the dual problem is the upper bound of the optimal solution of the primal problem [19]. Meanwhile, the solution set of the dual problem can be shown to be a subset of the core, as will be proved in Theorem 1. The Lagrangian functions of PUs and SUs can be expressed as

$$\begin{aligned}&{f_{m,j}}\left( {\delta ,\rho ,\tau } \right) = \mathop {\max }\limits _{R_m^P \ge 0,{\mu _m} \ge 0,c_m^P \ge 0,\forall m \in {\mathcal {S}_{PU}}} \left( {F\left( {R_m^P} \right) + c_m^P - G\left( {{\mu _m}} \right) + {\delta _m}R_m^P + {\rho _m}c_m^P + {\tau _m}{\mu _m}} \right) \end{aligned}$$
(16)
$$\begin{aligned}&g\left( {\psi ,\varphi } \right) = \mathop {\max }\limits _{c_k^S \ge 0,R_k^S \ge 0,k \in {\mathcal {S}_{SU}}} \left( {L\left( {R_k^S} \right) - c_k^S + {\psi _k}R_k^S + {\varphi _k}c_k^S} \right) \end{aligned}$$
(17)

where \({\delta _m}\), \({\rho _m}\), \({\tau _m}\), \({\psi _k}\) and \({\varphi _k}\) are the Lagrange multipliers. Therefore, the dual problem of the problem in (14) for \(\mathcal {S} = {\mathcal {S}_{PU}} \cup {\mathcal {S}_{SU}}\) is formulated as

$$\begin{aligned} {\mathbf{D}}\left( S \right) :\mathop {\min }\limits _{\mathbf{\alpha } ,\mathbf{\beta } }&\sum \limits _{m \in {\mathcal {S}_{PU}}} {\sum \limits _{k \in {\mathcal {S}_F}} {\left( {{f_{m,j}}\left( {\delta ,\rho ,\tau } \right) + {\omega _{m,j}}} \right) } } + \sum \limits _{k \in {\mathcal {S}_{SU}}} {\left( {g\left( {\psi ,\varphi } \right) + {\chi _k}} \right) } \nonumber \\ {\text{s.t.}}{\text{}}{\text{ }}&{\rho _j} + {\psi _k}R_{k,j}^S + \eta {\tau _j} + \eta {\varphi _k} + \sum \limits _{m \in {\mathcal {S}_{PU}}} {{\omega _{m,j}}} \ge 0,\forall k \in {\mathcal {S}_{SU}},j \in {\mathcal {S}_F} \nonumber \\ {\text{ }}&{\delta _m}R_{k,m}^P + {\chi _k} + \sum \limits _{j \in {\mathcal {S}_F}} {{\omega _{m,j}}} \ge 0,\forall k \in {\mathcal {S}_{SU}},m \in {\mathcal {S}_{PU}} \nonumber \\ {\text{ }}&{\varphi _k},{\chi _k},{\psi _k} \ge 0,\forall k \in {\mathcal {S}_{SU}} \nonumber \\ {\text{ }}&{\rho _j},{\tau _j} \ge 0,\forall j \in {\mathcal {S}_F} \nonumber \\ {\text{ }}&{\omega _{m,j}} \ge 0,\forall m \in {\mathcal {S}_{PU}},j \in {\mathcal {S}_F} \nonumber \\ {\text{ }}&{\delta _m} \ge 0,\forall m \in {\mathcal {S}_{PU}} \end{aligned}$$
(18)

where \({\omega _{m,j}}\) and \({\chi _k}\) are also the Lagrange multipliers, derived from the transformation of the problem in (14) into its dual problem in (18). Let \(\mathcal {D}\) denote the set of optimal solutions of the dual problem in (18) for \({\mathcal {S}_{PU}} = {\mathcal {N}_{PU}}\) and \({\mathcal {S}_{SU}} = {\mathcal {N}_{SU}}\), and let

$$\begin{aligned} \begin{gathered} \mathcal {O} = \left\{ {{{\mathbf{y}}^*}:y_m^{{P^*}} = {f_{m,j}}\left( {{\delta ^*},{\rho ^*},{\tau ^*}} \right) + \omega _{m,j}^*,m \in {\mathcal {N}_{PU}},j \in {\mathcal {S}_F}} \right. \\ y_k^{{S^*}} = g\left( {{\psi ^*},{\varphi ^*}} \right) + \chi _k^*,k \in {\mathcal {N}_{SU}} \\ \left. {{\text{for }}\left( {{\delta ^*},{\rho ^*},{\tau ^*},{\omega ^*},{\psi ^*},{\varphi ^*},{\chi ^*}} \right) \in \mathcal {D}} \right\} \\ \end{gathered} \end{aligned}$$
(19)

correspond to the dual payoff generated by the solution of the dual problem. The dual payoff is the set of the optimal payoff allocation of PUs and SUs. In the following theorem, we prove that the core of the proposed coalitional game with TU is nonempty.

Theorem 1

The core of the proposed canonical coalitional game with TU is nonempty and \(\mathcal {O} \subseteq \mathcal {C}\).

Proof

The set \(\mathcal {O}\) is nonempty, because the set \(\mathcal {D}\) is nonempty. We consider an arbitrary \({{\mathbf{y}}^*} \in \mathcal {O}\) which corresponds to some \(\left( {{\delta ^*},{\rho ^*},{\tau ^*},{\omega ^*},{\psi ^*},{\varphi ^*},{\chi ^*}} \right) \in \mathcal {D}\). Therefore, \(\sum \nolimits _{m \in {\mathcal {S}_{PU}}} {y_m^{{P^*}}} + \sum \nolimits _{k \in {\mathcal {S}_{SU}}} {y_m^{{S^*}}} \) is the optimal value of the objective function of \({\mathbf{D}}\left( \mathcal {S} \right) \). The objective function of (14) is concave, because \(F\left( . \right) \) and \(L\left( . \right) \) are concave, and \(G\left( . \right) \) is convex. Moreover, the constraints \(\left( {{\text{C1}}} \right) \) \(\left( {{\text{C3}}} \right) \) are linear. \({\mathbf{D}}\left( \mathcal {S} \right) \) is the dual of the problem in (14) for each \(\mathcal {S} \subseteq {\mathcal {N}_{PU}} \cup {\mathcal {N}_{SU}}\). Therefore, based on the strong duality [18], the duality gap is zero. Hence, \(\sum \nolimits _{m \in {\mathcal {S}_{PU}}} {y_m^{{P^*}}} + \sum \nolimits _{k \in {\mathcal {S}_{SU}}} {y_m^{{S^*}}} = \upsilon \left( {{\mathcal {N}_{PU}} \cup {\mathcal {N}_{SU}}} \right) \). We only need to show that \(\sum \nolimits _{m \in {\mathcal {S}_{PU}}} {y_m^{{P^*}}} + \sum \nolimits _{k \in {\mathcal {S}_{SU}}} {y_m^{{S^*}}} \ge \upsilon \left( \mathcal {S} \right) \) for all \(\mathcal {S} \subseteq {\mathcal {N}_{PU}} \cup {\mathcal {N}_{SU}}\). Assume that (14) is feasible. Then, strong duality states that \(\upsilon \left( \mathcal {S} \right) \) equals the optimal value of the objective function of \({\mathbf{D}}\left( \mathcal {S} \right) \). The sub-vectors \(\left( {\delta ,\rho ,\tau ,\omega ,\psi ,\varphi ,\chi } \right) \) consisting of the components of \(\left( {{\delta ^*},{\rho ^*},{\tau ^*},{\omega ^*},{\psi ^*},{\varphi ^*},{\chi ^*}} \right) \) in \(\mathcal {S}\) satisfy the constraints of the dual problem in (18). Thus, \(\sum \nolimits _{m \in {\mathcal {S}_{PU}}} {y_m^{{P^*}}} + \sum \nolimits _{k \in {\mathcal {S}_{SU}}} {y_m^{{S^*}}} = \upsilon \left( {{\mathcal {N}_{PU}} \cup {\mathcal {N}_{SU}}} \right) \) is the value of the objective function of the problem in (18) for the above feasible solution. It follows that the optimal value of the objective function of the problem in (18) is a lower bound for \(\sum \nolimits _{m \in {\mathcal {S}_{PU}}} {y_m^{{P^*}}} + \sum \nolimits _{k \in {\mathcal {S}_{SU}}} {y_m^{{S^*}}} \). Therefore, \({{\mathbf{y}}^*} \in \mathcal {C}\). \(\square \)

Based on Theorem 1, the optimal payoff vector \({{\mathbf{y}}^*}\) lies in the core and can be achieved by solving the dual problem \({\mathbf{D}}\left( {{\mathcal {N}_{PU}} \cup {\mathcal {N}_{SU}}} \right) \). In other words, the solution of the dual problem \({\mathbf{D}}\left( {{\mathcal {N}_{PU}} \cup {\mathcal {N}_{SU}}} \right) \) provides the optimal payoff allocation of the PUs, \(y_m^{{P^*}}\) , and the optimal payoff allocation of the SUs, \(y_k^{{S^*}}\), in the core.

4 A Two-Level Cooperative Game-Based Approach for Spectrum Leasing in MIMO-CCRN

In this section, we study the interactions among the SUs and PUs from a bargaining game-based point of view. We consider a single-PU multi-SU scenario and formulate the cooperative interactions between the PU and SUs as the first-level bargain game, while the interactions among the SUs is modeled as the second-level bargain game. In what follows, we derive the optimal actions of the PU and SU in a single-PU single-SU scenario, firstly. Then, a revised numerical searching algorithm is proposed for the single-PU multi-SU case and the convergence of the proposed scheme is proved.

As depicted in Fig. 2, the PU communicates with the PBS using the licensed band, while \({N_{SU}}\) SUs are hunting for vacant channels to transmit data to their respective secondary receivers (SRs). We assume that the PU has a minimum rate demand denoted by \(R_{\min }^P\). Moreover, it is further assumed that the communications between the PU and the PBS does not meet the minimum rate requirement of the PU and thereby, the PU will recruit the existing SUs, equipped with cognitive radio capabilities, for cooperative transmission in order to meet the minimum rate demand. The SUs are also interested in cooperating with the PU through relaying the signals transmitted from the PU to the PBS. The SUs are granted a portion of the transmission frame to transmit their signals from secondary transmitters (STs) to their intended receivers (SRs), in return, as shown in Fig. 2. After the SUs assist the PU, they bargain to share the granted bandwidth using the Time Division Multiple Access (TDMA) strategy. Similar to the previous section, we focus our discussions on Amplify-and-Forward (AF) strategy.

According to Fig. 2, the PU allocates \(\left( {1 - \alpha } \right) \) fraction of its transmission frame to SUs. It is presumed that the PU is able to achieve its required QoS by using only \(\alpha \) fraction of its transmission frame (for \(\alpha \in \left[ {0,1} \right] \)). This is the origin of the so-called spectrum holes that leads to the spectrum under-utilization. Therefore, at the beginning of each transmission frame, the PU decides that it can free-up up to an \(\left( {1 - \alpha } \right) \) fraction of its transmission frame. As shown in Fig. 2, the cooperative transmission for the PU consists of two phases. In the first phase, the PU transmits its signals to the SU. The bandwidth that the PU retains for its own use is divided into two sub-slots with equal length. In the first sub-slot, the PU transmits to the SUs, and then the SUs relay the primary data to the PBS in the second sub-slot. In the following, we study the proposed spectrum leasing scenario.

Fig. 2
figure 2

The modified system model for the spectrum leasing scenario in MIMO-CCRN with a single PU: a The PU transmits to the secondary transmitters (STs); b STs relay the primary signals to the PBS; c STs transmit to their respective secondary receivers (STs)

4.1 Two-level cooperative game-based Problem Formulation

Recall from (8) that the utility function of the PU (the achievable data rate of PU), as a result of the cooperation of SU k, can be written as

$$\begin{aligned} {U^P} = \sum \limits _{k \in {\mathcal {N}_{SU}}} {\sum \limits _{n = 1}^N {\frac{\alpha }{2}\log \left( {1 + \frac{{P_n^p{a_{k,n}}{d_{k,n}}P_{k,n}^s{b_{k,n}}}}{{1 + P_n^p{a_{k,n}} + {d_{k,n}}P_{k,n}^s{b_{k,n}}}}} \right) } } \end{aligned}$$
(20)

where \({a_{k,n}} = \frac{{{\lambda _{k,n}}}}{{{\sigma ^2}}}\) and \({b_{k,n}} = \frac{{{\gamma _{k,n}}}}{{{\sigma ^2}}}\); \({\lambda _{k,n}}\) and \({\gamma _{k,n}}\) denotes the n-th eigen-value of the channel matrices from PU to SU k and from SU k to PBS, respectively; \(P_n^p\) and \(P_{k,n}^s\) denote the transmit power of the PU and cooperating SU k on the n-th spatial subchannel, respectively; We further assume that the SU k uses only part of its power in each spatial mode n, i.e., \({d_{k,n}}P_{k,n}^s\) (\(0 \le {d_k} \le 1\)), to relay primary signal. Using (12), the utility function of the k-th SU can be expressed as

$$\begin{aligned} U_k^S = \sum \limits _{n = 1}^N {\left( {{e_k}\log \left( {1 + \frac{{P_{k,n}^s{\theta _{k,n}}}}{{{\sigma ^2}}}} \right) - \frac{\alpha }{2}{c_s}{d_{k,n}}P_{k,n}^S} \right) } \end{aligned}$$
(21)

where \({e_k}\) is the fraction of time in the third sub-slot which is allocated to the k-th SU to transmit to its intended receiver; \({c_s}\) is the price of per unit power for the SUs.

Therefore, the PU should determine the fraction of time which is supposed to be granted to the SUs, \(\alpha \), and each SU k in the secondary system should find the fraction of its cooperative transmission power on the n-th spatial subchannel to relay primary signal, \({d_{k,n}}\). Moreover, each SU k, is supposed to determine the fraction of the granted spectrum it can use for its own transmission, \({e_k}\). To stimulate the cooperation between the PU and the SUs, as well as the cooperation among the SUs, we propose to cast such a decision making process into a two-level bargain framework.

Before formulating the problem as a cooperative game, we introduce the Nash Bargaining Solution (NBS) briefly. A special type of cooperative games are bargaining games, where players negotiate/bargain their actions/strategies to reach an agreement with guaranteed minimum payoffs. The agreement is associated with a utility vector \({\mathbf{u}} = \left[ {{u_1}, \ldots ,{u_N}} \right] \), where \({u_i}\) is the utility of player i and there are N players in the game. Let \({d_i}\) and \({D_i}\) denote the action and action space for player i, respectively (\({d_i} \in {D_i}\)). The utility \({u_i}\) is a function of the action vector \({\mathbf{d}} = \left[ {{d_1}, \ldots ,{d_N}} \right] \). The utility space U is the set of all possible payoff allocations \({\mathbf{u}}\) that result from all possible action vectors \({\mathbf{d}}\). It is also possible that no agreement is reached after bargaining, a situation referred to as a disagreement point. A disagreement point is associated with a utility vector \({{\mathbf{u}}_0}\), which consists of minimum payoffs that players insist on having.

Given the variety of outcomes, Nash [20] suggested to, instead of study all possible outcomes, specify characteristics or axioms of one or several outcomes that we expect and find how to drive the bargaining process to that agreement point. Nash proposed following axioms that describe a Nash Bargaining Solution (NBS) [20], denoted by \(S\left( {{\mathbf{u}},{{\mathbf{u}}_0}} \right) \):

  • A NBS is Pareto optimal, i.e. there is no other solution that two or more players can simultaneously be better off.

  • At the NBS, all players are guaranteed their minimum payoffs that they insist on at the beginning of the bargaining process.

  • A NBS is symmetric, meaning that all players have the same priority. Specifically, if all users have identical action space \({D_i}\), then they have the same utility/payoff at the NBS.

  • If the action spaces of players shrink, leading to a shrunken utility space \(\tilde{U}\), the old NBS \(S\left( {{\mathbf{u}},{{\mathbf{u}}_0}} \right) \) remains optimal provided that \(S\left( {{\mathbf{u}},{{\mathbf{u}}_0}} \right) \) remains feasible in the new utility space \(\tilde{U}\). This property ensures that eliminating solutions that would not have been selected does not affect the NBS.

Given the above properties, the problem is whether a unique NBS exists, and how to find such a unique NBS. Nash proved the following theorem that answers these three key questions [20]:

Theorem 2

If the utility space U is upper-bounded, closed and convex, then there exists a unique NBS which is the solution of the following problem:

$$\begin{aligned} {\mathbf{u}} = \arg \mathop {\max }\limits _{\left\{ {{\mathbf{u}} \in U} \right\} } \prod \limits _{i = 1}^N {\left( {{u_i} - {u_{0,i}}} \right) } \end{aligned}$$
(22)

In the first level of the resource allocation game and with the target of maximizing the social welfare of the secondary system, we model all SUs in the secondary system all together which bargain with the PU, i.e., \({U^S} = \sum \limits _{k \in {\mathcal {N}_{SU}}} {U_k^S} \). Therefore, the game between the PU and the SUs, also known as the first level bargain game, can be formulated as \({\mathcal {G}_1}\left( {{U^P},{U^S};u_p^0,u_s^0} \right) \), where \(u_p^0\) and \(u_s^0\) are the disagreement point of the PU and the SUs, respectively. The disagreement point of PU should be set as \(R_{\min }^P\). Besides, the goal of the SUs is to get the opportunity to access the spectrum. Then, the SUs wish to cooperate if their gain from rate increasing by spectrum leasing is not less than the cost from power consumption. Therefore, the disagreement point of the secondary system has to be set to \(u_s^0 = 0\). At last, the first level Nash bargaining based game can be formulated as

$$\begin{aligned} {\mathcal {G}_1}:\mathop {\max }\limits _{\alpha ,{d_{k,n}}}&\left( {{U^P} - u_0^P} \right) \left( {{U^S} - u_0^S} \right) \nonumber \\ {\text{s.t.}}&{U^P} \ge u_0^P,{U^S} \ge u_0^S \end{aligned}$$
(23)

The benefits of the cooperation with the PU should be allocated among the SUs in a fair fashion. Thus, another Nash bargaining game, known as the second level game, should be played among the SUs, which is defined as \({G_2}\left( {U_k^S;u_{s,k}^0|k \in {\mathcal {N}_{SU}}} \right) \), with , \(u_{s,k}^0 = 0\) for all \(k \in {\mathcal {N}_{SU}}\), being the disagreement point of the SUs. Similarly, the solution of \({\mathcal {G}_2}\) can be achieved through solving the following problem

$$\begin{aligned} {\mathcal {G}_2}:\mathop {\max }\limits _{{e_k}}&\prod \limits _{k \in {\mathcal {N}_{SU}}} {\left( {U_k^S - u_k^S} \right) } \nonumber \\ {\text{ s.t.}}&\sum \limits _{k \in {\mathcal {N}_{SU}}} {{e_k}} = 1 - \alpha \end{aligned}$$
(24)

4.2 Solution of the Two-Level Game

In this section, we explore the interactions between the PU and the SUs and among the SUs. First of all, the case of single-PU single-SU is considered and the optimal actions of the PU and the SU are derived. Afterwards, the multi-user scenario is considered and a numerical searching algorithm is proposed and its convergence is proved.

4.2.1 Single-PU Single-SU Case

In the following, we prove that the NBS for the single-PU single-SU bargaining game is unique.

Theorem 3

There exists a unique NBS for the single-PU single-SU bargaining game, and the solution is given by

$$\begin{aligned} \left\{ {\begin{array}{ll} {{\alpha ^*} = \frac{{AC + B\left( {C + D} \right) }}{{2A\left( {C + D} \right) }},{{\mathbf{d}}^*} = {\text{arg min}} \sqrt{\frac{{C + D}}{A}} }&{}{{\text{if }}{\alpha _L} \le {\alpha _M} \le {\alpha _H}{\text{ and }}X\left( {{{\mathbf{d}}^*}} \right) < \sqrt{\frac{C}{B}} } \\ {{\alpha ^*} = 1,{{\mathbf{d}}^*} = {\mathbf{0}}}&{}{{\text{otherwise}}} \end{array}} \right. \end{aligned}$$
(25)

where:

$$\begin{aligned} A= & {} \frac{1}{2}\sum \limits _{n = 1}^N {\log \left( {1 + \frac{{P_n^p{a_n}{d_n}P_n^s{b_n}}}{{1 + P_n^p{a_n} + {d_n}P_n^s{b_n}}}} \right) } \end{aligned}$$
(26)
$$\begin{aligned} B = \, & {} u_0^P \end{aligned}$$
(27)
$$\begin{aligned} C= & {} \sum \limits _{n = 1}^N {\log \left( {1 + \frac{{P_n^s{\theta _n}}}{{{\sigma ^2}}}} \right) } \end{aligned}$$
(28)
$$\begin{aligned} D= & {} \frac{1}{2}\sum \limits _{n = 1}^N {{c_s}{d_n}P_n^S} \end{aligned}$$
(29)

Proof

For the single-PU single-SU scenario, the utility functions (20) and (21) can be written as

$$\begin{aligned} {U^P}= & {} \sum \limits _{n = 1}^N {\frac{\alpha }{2}\log \left( {1 + \frac{{P_n^p{a_n}{d_n}P_n^s{b_n}}}{{1 + P_n^p{a_n} + {d_n}P_n^s{b_n}}}} \right) } \end{aligned}$$
(30)
$$\begin{aligned} U_k^S= & {} \sum \limits _{n = 1}^N {\left( {\left( {1 - \alpha } \right) \log \left( {1 + \frac{{P_n^s{\theta _n}}}{{{\sigma ^2}}}} \right) - \frac{\alpha }{2}{c_s}{d_n}P_n^S} \right) } \end{aligned}$$
(31)

Therefore, the following problem should be solved:

$$\begin{aligned} {\mathcal {G}_1}:\mathop {\max }\limits _{\alpha ,{d_{k,n}}}&\left( {\sum \limits _{n = 1}^N {\frac{\alpha }{2}\log \left( {1 + \frac{{P_n^p{a_n}{d_n}P_n^s{b_n}}}{{1 + P_n^p{a_n} + {d_n}P_n^s{b_n}}}} \right) } - u_0^P} \right) \left( {\sum \limits _{n = 1}^N {\left( {\left( {1 - \alpha } \right) \log \left( {1 + \frac{{P_n^s{\theta _n}}}{{{\sigma ^2}}}} \right) - \frac{\alpha }{2}{c_s}{d_n}P_n^S} \right) } } \right) \nonumber \\ {\text{s.t.}}&\sum \limits _{n = 1}^N {\frac{\alpha }{2}\log \left( {1 + \frac{{P_n^p{a_n}{d_n}P_n^s{b_n}}}{{1 + P_n^p{a_n} + {d_n}P_n^s{b_n}}}} \right) } \ge u_0^P \nonumber \\ {\text{ }}&\sum \limits _{n = 1}^N {\left( {\left( {1 - \alpha } \right) \log \left( {1 + \frac{{P_n^s{\theta _n}}}{{{\sigma ^2}}}} \right) - \frac{\alpha }{2}{c_s}{d_n}P_n^S} \right) } \ge 0 \end{aligned}$$
(32)

\(\square \)

At first, we consider the fraction of SU power for cooperating with the PU in the n-th spatial mode, \({d_n}\) for \(n = 1, \ldots ,N\), as a constant and attempt to find \(\alpha \). Then, we can rewrite (32) using (26)–(29), according to the following:

$$\begin{aligned} {\mathcal {G}_1}:\mathop {\max }\limits _{\alpha ,{d_{k,n}}}&\left( {\alpha A - B} \right) \left( {\left( {1 - \alpha } \right) C - \frac{\alpha }{2}D} \right) \nonumber \\ {\text{s.t.}}\,{\text{}}{\text{ }}&\alpha A \ge B \nonumber \\ {\text{ }}&\left( {1 - \alpha } \right) C \ge \frac{\alpha }{2}D \end{aligned}$$
(33)

By setting the first derivative of (33) to zero, we can solve (33) and summarize the results according to the following:

$$ {\alpha ^*} = {\alpha ^*}\left( {\mathbf{d}} \right) = \left\{ {\begin{array}{ll} {1,}&{}{{\text{if }}{\alpha _H}< {\alpha _L}} \\ {{\alpha _L},}&{}{{\text{if }}{\alpha _M}< {\alpha _L}} \\ {{\alpha _M},}&{{\text{if }}{\alpha _L} \le {\alpha _M} \le {\alpha _H}} \\ {{\alpha _H},}&{}{{\text{if }}{\alpha _H} < {\alpha _M}} \end{array}} \right. $$
(34)

where \({\mathbf{d}} = \left[ {{d_1}, \ldots ,{d_N}} \right] \), \({\alpha _L} = \frac{B}{A}\), \({\alpha _M} = \frac{{AC + B\left( {C + D} \right) }}{{2A\left( {C + D} \right) }}\), and \({\alpha _H} = \frac{C}{{C + D}}\). Afterwards, by replacing (34) into (33) we can find the optimal value of \({\mathbf{d}} = \left[ {{d_1}, \ldots ,{d_N}} \right] \), \({{\mathbf{d}}^*}\), under different conditions. If \({\alpha _H} < {\alpha _L}\), \({\alpha _M} < {\alpha _L}\) and \({\alpha _H} < {\alpha _M}\), then \({\alpha ^*} = 1\) and \({{\mathbf{d}}^*} = {\mathbf{0}}\), meaning that the PU will not lease its spectrum to the SU. As a result, no cooperation will take place between the PU and the SU. However, if \({\alpha _L} \le {\alpha _M} \le {\alpha _H}\), by replacing \({\alpha ^*} = {\alpha _M} = \frac{{AC + B\left( {C + D} \right) }}{{2A\left( {C + D} \right) }}\) into (33), and assuming that \({X^2} = \frac{{C + D}}{A}\), we have

$$\begin{aligned} \mathop {\max }\limits _{\mathbf{d}}&{\text{ }}\frac{1}{4}{\left( {\frac{C}{X} - BX} \right) ^2} \nonumber \\ {\text{s.t.}}{\text{}}&{\text{ }} 0< X < \sqrt{\frac{C}{B}} \end{aligned}$$
(35)

For \(X \in \left( {0,\left. {\sqrt{\frac{C}{B}} } \right] } \right. \), \(f\left( X \right) = \frac{1}{4}{\left( {\frac{C}{X} - BX} \right) ^2}\) is a decreasing function of X. In this case, (35) is equivalent to the following:

$$\begin{aligned} \mathop {\max }\limits _{\mathbf{d}}&{\text{ X}}\left( {\mathbf{d}} \right) \nonumber \\ {\text{s.t.}}{\text{}}{\text{ }}&0< X\left( {\mathbf{d}} \right) < \sqrt{\frac{C}{B}} \end{aligned}$$
(36)

Ultimately, the optimum value of \({\mathbf{d}}\), \({{\mathbf{d}}^*}\), can be achieved by

$$ {{\mathbf{d}}^*} = \left\{ {\begin{array}{ll} {\arg \min \sqrt{\frac{{C + D}}{A}} }&{}{{\text{if }}X\left( {{{\mathbf{d}}^*}} \right) < \sqrt{\frac{C}{B}} } \\ 0&{}{{\text{otherwise}}} \end{array}} \right. $$
(37)

Finally, the NBS for the single-PU single-SU scenario can be summarized as

$$\begin{aligned} \left\{ {\begin{array}{ll} {{\alpha ^*} = \frac{{AC + B\left( {C + D} \right) }}{{2A\left( {C + D} \right) }},{{\mathbf{d}}^*} = {\text{arg min}} \sqrt{\frac{{C + D}}{A}} }&{{\text{if }}{\alpha _L} \le {\alpha _M} \le {\alpha _H}{\text{ and }}X\left( {{{\mathbf{d}}^*}} \right) < \sqrt{\frac{C}{B}} } \\ {{\alpha ^*} = 1,{{\mathbf{d}}^*} = {\mathbf{0}}}&{{\text{otherwise}}} \end{array}} \right. \end{aligned}$$
(38)

4.2.2 Single-PU Multi-SU Case

After finding the optimal solution of the single-PU single-SU case, we consider the single-PU multi-SU case. In order to derive the optimal solution for such a scenario, we first calculate the solution of \({\mathcal {G}_2}\), i.e., \({e_k}\), by assuming that \(\alpha \) and \({{\mathbf{d}}_k}\), for all \(k \in {N_{SU}}\), are known.

Theorem 4

If \(\alpha \) and \({{\mathbf{d}}_k}\), for all \(k \in {N_{SU}}\), are given, the optimal solution of \({G_2}\) is

$$\begin{aligned} e_k^* = \frac{1}{{{N_{SU}}}}\left[ {\left( {1 - \alpha } \right) - \frac{\alpha }{2}\sum \limits _{i \in {\mathcal {N}_{SU}}} {\frac{{{c_s}{{\mathbf{d}}_i}{\mathbf{P}}_i^S}}{{R_i^S}}} } \right] + \frac{\alpha }{2}\frac{{{c_s}{{\mathbf{d}}_k}{\mathbf{P}}_k^S}}{{R_k^S}} \end{aligned}$$
(39)

where \(R_i^s = \sum \limits _{n = 1}^N {\log \left( {1 + \frac{{P_{i,n}^s{\theta _{i,n}}}}{{{\sigma ^2}}}} \right) } \), \({{\mathbf{d}}_i} = \left[ {{d_{i,1}}, \ldots ,{d_{i,N}}} \right] \) and \({\mathbf{P}}_i^S = {\left[ {P_{i,1}^S, \ldots ,P_{i,N}^S} \right] ^T}\).

Proof

The Lagrangian function of the problem in (24) can be written as

$$\begin{aligned} L = \prod \limits _{k \in {\mathcal {N}_{SU}}} {\left( {{e_k}R_k^S - \frac{\alpha }{2}{c_s}{{\mathbf{d}}_i}{\mathbf{P}}_i^S} \right) } - \lambda \left( {\sum \limits _{k \in {\mathcal {N}_{SU}}} {{e_k}} - 1 + \alpha } \right) \end{aligned}$$
(40)

where \(\lambda \) is the Lagrange multiplier. We set to zero the first order derivative of (40) with respect to \({e_j}\), for all \(j \in {N_{SU}}\), we have

$$\begin{aligned} \frac{{\partial L}}{{\partial {e_j}}} = \prod \limits _{k \in {\mathcal {N}_{SU}},k \ne j} {\left( {{e_k}R_k^S - \frac{\alpha }{2}{c_s}{{\mathbf{d}}_i}{\mathbf{P}}_i^S} \right) } R_j^S - \lambda = 0 \end{aligned}$$
(41)

By some simple manipulation, (41) can be rewritten as

$$\begin{aligned} \prod \limits _{k \in {\mathcal {N}_{SU}}} {\left( {{e_k}R_k^S - \frac{\alpha }{2}{c_s}{{\mathbf{d}}_i}{\mathbf{P}}_i^S} \right) } = \frac{\lambda }{{R_j^S}}\left( {{e_j}R_j^S - \frac{\alpha }{2}{c_s}{{\mathbf{d}}_j}{\mathbf{P}}_j^S} \right) \end{aligned}$$
(42)

for all \(j \in {\mathcal {N}_{SU}}\). Then, it can be concluded that

$$\begin{aligned} {e_k} - \frac{\alpha }{{2R_k^S}}{c_s}{{\mathbf{d}}_i}{\mathbf{P}}_i^S = {e_j} - \frac{\alpha }{{2R_j^S}}{c_s}{{\mathbf{d}}_j}{\mathbf{P}}_j^S \end{aligned}$$
(43)

for all \(k,j \in {\mathcal {N}_{SU}}\). Finally, the optimal value of \({e_k}\), \(e_k^*\), can be expressed as

$$\begin{aligned} e_k^* = \frac{1}{{{N_{SU}}}}\left[ {\left( {1 - \alpha } \right) - \frac{\alpha }{2}\sum \limits _{i \in {\mathcal {N}_{SU}}} {\frac{{{c_s}{{\mathbf{d}}_i}{\mathbf{P}}_i^S}}{{R_i^S}}} } \right] + \frac{\alpha }{2}\frac{{{c_s}{{\mathbf{d}}_k}{\mathbf{P}}_k^S}}{{R_k^S}} \end{aligned}$$
(44)

\(\square \)

Now, the optimal solution of \({\mathcal {G}_1}\), i.e. \({\alpha ^*}\) and \({\mathbf{d}}_k^*\), should be found. Using (44), the total utility of the SUs can be expressed as

$$\begin{aligned} {U^S} = \sum \limits _{k \in {\mathcal {N}_{SU}}} {U_k^S} = \left( {\frac{{1 - \alpha }}{{{N_{SU}}}} - \frac{\alpha }{{2{N_{SU}}}}\sum \limits _{i \in {\mathcal {N}_{SU}}} {\frac{{{c_s}{{\mathbf{d}}_i}{\mathbf{P}}_i^S}}{{R_i^S}}} } \right) \sum \limits _{k \in {\mathcal {N}_{SU}}} {R_k^S} \end{aligned}$$
(45)

By replacing (45) into (23), the following complex non-linear optimization problem is obtained:

$$\begin{aligned} \mathop {\max }\limits _{\alpha ,{{\mathbf{d}}_k}}&\left( {\sum \limits _{n = 1}^N {\frac{\alpha }{2}\log \left( {1 + \frac{{P_n^p{a_n}{d_n}P_n^s{b_n}}}{{1 + P_n^p{a_n} + {d_n}P_n^s{b_n}}}} \right) } - u_0^P} \right) \left( {\left( {\frac{{1 - \alpha }}{{{N_{SU}}}} - \frac{\alpha }{{2{N_{SU}}}}\sum \limits _{i \in {\mathcal {N}_{SU}}} {\frac{{{c_s}{{\mathbf{d}}_i}{\mathbf{P}}_i^S}}{{R_i^S}}} } \right) \sum \limits _{k \in {N_{SU}}} {R_k^S} } \right) \nonumber \\ {\text{ s.t.}}{\text{}}{\text{ }}&\sum \limits _{n = 1}^N {\frac{\alpha }{2}\log \left( {1 + \frac{{P_n^p{a_n}{d_n}P_n^s{b_n}}}{{1 + P_n^p{a_n} + {d_n}P_n^s{b_n}}}} \right) } \ge u_0^P \nonumber \\ {\text{ }}&\left( {\frac{{1 - \alpha }}{{{N_{SU}}}} - \frac{\alpha }{{2{N_{SU}}}}\sum \limits _{i \in {\mathcal {N}_{SU}}} {\frac{{{c_s}{{\mathbf{d}}_i}{\mathbf{P}}_i^S}}{{R_i^S}}} } \right) \sum \limits _{k \in {\mathcal {N}_{SU}}} {R_k^S} \ge 0 \end{aligned}$$
(46)

It is possible to obtain a closed-form solution for the problem in (46). Therefore, we propose a numerical searching algorithm which recruits the sequential unconstrained minimization technique (SUMT).

Let \({\mathbf{z}} = {\left[ {\alpha ,{{\mathbf{d}}_1}, \ldots ,{{\mathbf{d}}_{{N_{SU}}}}} \right] ^T}\). Then (46) can be reformulated in the form of the following:

$$\begin{aligned} \min {\text{ }}&v\left( {\mathbf{z}} \right) \nonumber \\ {\text{s.t.}}{\text{}}{\text{ }}&{w_k}\left( {\mathbf{z}} \right) \ge 0,{\text{ }}k = 1,2, \ldots ,3{N_{SU}} + 4 \end{aligned}$$
(47)

Let \({\mathcal {Z}_1} = \left\{ {{\mathbf{z}}\left| {{w_k}\left( {\mathbf{z}} \right) > 0,k = 1, \ldots ,3{N_{SU}} + 4} \right. } \right\} \). Then, it would be possible to solve (47) through sequentially solving a series of unconstrained optimization problems in the form of

$$\begin{aligned} \mathop {\min }\limits _{{\mathbf{z}} \in {\mathcal {Z}_1}} {\text{ }}\phi \left( {{\mathbf{z}},{t_k}} \right) \end{aligned}$$
(48)

where \(\phi \left( {{\mathbf{z}},{t_k}} \right) = v\left( {\mathbf{z}} \right) + \varphi \left( {{\mathbf{z}},{t_k}} \right) \), and \(\varphi \left( {{\mathbf{z}},{t_k}} \right) = - {t_i}\sum \nolimits _{k = 1}^{3{N_{SU}} + 4} {\ln \left( {{w_k}\left( {\mathbf{z}} \right) } \right) } \) is the barrier item, k is a positive integer and \(\left\{ {{t_k}} \right\} \) is a positive descending series with \(\mathop {\lim }\limits _{k \rightarrow \infty } {t_k} = 0\). The steps to find the solution of (47), i.e., \({\mathbf{z}}\), is presented in Table 1 (Algorithm 1).

Table 1 Algorithm 1 (Interior Point Method to solve (48))
Table 2 Algorithm 2 (Searching for A Strictly Interior Initial Point for Algorithm 1)

In order to run Algorithm 1, we need a strictly interior point to start the iteration. In Algorithm 2, we propose a scheme to obtain a strictly interior point. Therefore, in order to solve (47), the first step is to call Algorithm 2. Then, using the initial point found in Algorithm 2, Algorithm 1 is recruited to solve the reformulated optimization problem in (48). It is noteworthy that in each round of iteration in Algorithm 1 and Algorithm 2, a modified Broyden-Fletcher-Goldfarb-Shanno (MBFGS) method [21] is used to solve the associated unconstrained optimization problems. The point that Algorithm 1 is converged to is announced as the optimal solution of (47). In Theorem 5, the convergence of the proposed algorithm is proved formally (Table 2).

Theorem 5

Algorithm 1 is convergent to at least a local optima of the proposed two-level game.

Proof

Based on Algorithm 1, we know that all the iterations of MBFGS are limited within the open ball with center \({\mathbf{x}} = {\left[ {0.5, \ldots ,0.5} \right] ^T}\) and radius \(R = 0.5\), i.e.,

$$\begin{aligned} \mathcal {B}\left( {\mathbf{y}} \right) = \left\{ {{\mathbf{y}} \in {\mathbb {R}^{{N_{SU}}N + 1}}\left| {\left\| {{\mathbf{y}} - {\mathbf{x}}} \right\| } \right. < R} \right\} (48) \end{aligned}$$
(49)

We refer to the functions in the iterative step of Algorithm 1 and step 3 of Algorithm 2 as the target functions. It is easy to verify that the target functions are continuously differentiable within the open ball. Since both the starting point and the domain of the target function are within this ball, the target function is continuously differentiable within the domain. According to [22], the target function is Lipschitz continuous. Then, according to [21], the MBFGS is globally convergent. According to [23], as long as the method for solving the unconstrained optimization problem, i.e., MBFGS, is globally convergent, Algorithm 1 is guaranteed to converge at least to a local optimum of the original bargaining game. \(\square \)

Similarly, we can also prove that Algorithm 2 will converge to a feasible solution if the feasible region is not empty.

5 Performance Evaluation

In this section, we evaluate the performance of the proposed algorithms are using simulations. We do a static system level simulation as a single cell. The simulation assumptions are as follows:

  • The area covers 2 km \( \times \) 2 km and the users are uniformly distributed in the area.

  • All users are assumed to be equipped with the same number of antennas, denoted by N.

  • The minimum rate requirement of the PU is set to 2 bps / Hz.

  • The noise power, \({\sigma ^2}\), is set to \({10^{ - 6}} W/Hz\).

The performance evaluation tools are the leasing parameters (such as \(\alpha \) and \({d_{k,n}}\)’s, for all \(k \in {\mathcal {S}_{SU}}\) and all spatial modes), the outage probability of the primary link and the total rate of all SUs. The performance evaluation is realized through comparing the evaluation tools when the proposed two-level game based approach (TL) is recruited, when the primary transmitter directly communicates primary receiver (direct) and when the non-cooperative approach (NCA) proposed in [24] is recruited. In NCA, a portion of the primary spectrum is leased in return for the cooperation of SUs, and the leased spectrum is non-cooperatively shared among the SUs.

Fig. 3
figure 3

The leasing parameters versus number of SUs

The superiority of the proposed scheme over NCA approach is shown in Fig. 3, where the leasing parameters, i.e., the portion of primary spectrum retained for the PU communications (\(\alpha \)) and the average portion of power assigned by the SUs to relay primary signals, \(D = \frac{1}{{{N_{SU}}N}}\sum \nolimits _{k \in {\mathcal {N}_{SU}}} {\sum \limits _{n = 1}^N {{d_{k,n}}} } \). To provide more clarifications, the leasing parameters, \(\alpha \) and D, indicate the eagerness of the users for cooperating. More specifically, it can be deduced from the smaller \(\alpha \) and the larger D that the PU and SUs are highly motivated for taking part in the proposed cooperative two-level game. According to Fig. 3, when \({N_{SU}} = 1\), the leasing parameters of the NCA are \(\alpha = 1\) and \(D = 0\), meaning that the PU and the SU are not cooperating. However, the proposed two-level game based approach results in \(\alpha = 0.95\) and \(D = 0.378\), which shows that the PU and the SU cooperate successfully. The superiority of the proposed approach (TL) over the spectrum leasing approach in [24] (NCA) can be attributed to more power that the SUs own for bargaining with the PU and further, sharing the profits of cooperation in a cooperative manner among themselves. When \({N_{SU}} > 1\), although the cooperation between the PU and the SUs takes place as a result of NCA, but the enthusiasm of the PU for cooperating with SUs does not change significantly, because of the selfish nature of the game in [24], while the SUs eagerness for cooperating diminishes as the number of the SUs grows. Opposed to NCA, the proposed two-level game approach leads in augmenting the motivations of the PU and the SUs to cooperate with each other, as the number of SUs increases. Another significant observation is that as a result of the proposed scheme, the cooperative spectrum leasing is guaranteed to happen between PU and the SUs.

Fig. 4
figure 4

The outage probability of the primary user

The outage probability of the primary user is another important tool for evaluating the performance of the proposed scheme. As shown in Fig. 4, the worst outage probability is achieved when the PU decides not to cooperate with the SUs. The cooperation of the SUs with the PU in the leasing scenario presented in [24] (NCA) reduces the outage probability. However, in comparison with NCA, our proposed spectrum leasing scheme causes a more significant reduction in the outage probability of the PU. The proposed two-level game based approach (TL) motivates the SUs to allocate a higher portion of their transmission power for relaying the primary signal and consequently, the outage probability performance of the proposed scheme is better than others.

Fig. 5
figure 5

The total rate of SUs

Fig. 6
figure 6

Fairness index for different approaches

The total rate of SUs in terms of the number of SUs for different number of the antennas and different schemes is presented in Fig. 5. Similar to previous discussions, it can be inferred from Fig. 5 that for the single-SU case, NCA does not motivates the SU enough to participate in PU transmissions and get access to the leased resources, in return. However, the proposed two-level game based approach (TL) persuades the SU to relay the primary signals and benefit from the leased resources to increase its data rate. As the number of existing SUs in the system increases, the total rate of the system implemented based on NCA does not escalate significantly, because of the competition among the SUs for sharing the leased spectrum. Meanwhile, the proposed two-level game based scheme which takes advantage of the cooperative games, provides a much better sharing of the leased spectrum among the SUs and thereby higher sum rate for the SUs. As another point, Fig. 5 suggests that deploying more antennas in all users, results in higher data rates expectedly.

In order to determine whether SUs are receiving a fair share of system resource, Jain’s fairness measure as a widely used fairness measure is used. Fig. 6 depicts the Jains fairness indices of the different algorithms. Evidently and expectedly, the proposed two-level cooperative game based scheme achieves significantly better fairness than NCA. As the number of SUs increases, fairness indices under NCA decreases. However, the proposed approach maintain quite stable fairness for different network sizes.

6 Conclusions

In this work, we studied the resource allocation problem in a spectrum leasing scenario in MIMO-CCRN with multi channels using a coalitional game based approach. After proving that all PUs and SUs should be grouped in a set and as a result of such grouping, the grand coalition is formed, we analyzed the stability of the proposed coalitional game based approach using the core concept. Afterwards, we explored a practical scenario where the data rate of the primary system, falls below a minimum data rate requirement, and alternatively, the PU resorts to the leasing of its unused spectrum to SUs. However, it is guaranteed that the SUs perform as cooperatively as to meet the rate requirement of the primary system. Moreover, the benefits of cooperating with the primary system may not be exploited by the secondary system efficiently, if the SUs compete over the resources. We propose a two-level game based on the bargaining games to tackle the aforementioned problems. The simulation results confirm the theoretical achievements.