1 Introduction

Multiple base stations (BSs) cooperation, which can be called multi-cell cooperation, network MIMO, coordinated multi-point (CoMP) in wireless communication system, has been extensively studied [14]. By sharing the channel state information (CSI) and user data information over the backhaul links, several BSs in the same cluster jointly pre-code and transmit signals, the inter-cell interference can effectively be reduced or even be eliminated. In theory, the more the collaborative BSs are, the larger the system capacity gain is [5]. However, in fact, it is difficult for too many BSs to coordinate, and there are some requirements that need to be solved, such as the limited capacity constraints of backhaul links between the BSs [6, 7], the perfect CSI feedback overhead from the users to the BS [8], latency control for network synchronization [9], and so on. The most effective solution is to partition the BSs in the system into some separate groups, i.e., clusters. The common used methods include static clustering and dynamic clustering.

The works on the dynamic clustering of BSs mainly focus on the system capacity, while the complexity of the well-known dynamic clustering algorithm is high. Most energy efficiency (EE) solutions are based on static clustering [6, 10] or single cell [11, 12]. As far as we know, EE for dynamic clustering in downlink is rarely studied. The static clustering method is adopted in [13], and the optimal cluster sizes for EE in four different scenarios are discussed. The effects of BS sleep mode and multiple BSs cooperation on the cluster size are presented, in addition, the minimum user data rate requirement, the BS transmit power and the consumption of the signal processing can also have influences on the cluster size. The relationship between the effective capacity and the total energy consumption in the actual system is studied in [14]. In addition to the transmit power, the total energy consumption also includes the power due to signal processing and the backhaul overhead. For a given cluster size, in order to maximize the sum capacity of downlink and uplink, the exhaustive search method is used to achieve the best clustering results for each channel realization. A dynamic clustering algorithm with fixed cluster size in downlink is proposed in [15], and the cluster set changes over time in order to maximize system capacity. This algorithm has a higher capacity than that of the static clustering. However, this algorithm is a brute force search algorithm, which needs to calculate the system capacity of all the possible cluster sets. The computational complexity of this algorithm and the system overhead grow exponentially with the number of BSs. Another dynamic clustering algorithm is presented in [16], and the whole system is divided into several fixed large clusters, then each large cluster is divided into several sub-clusters, and the sub-clusters constitute the candidate patterns. The candidate pattern which has the maximal system capacity is considered as the optimal clustering set. Although this algorithm reduces the computational complexity, the system gain is limited. Therefore, how to find a low complex clustering algorithm for multiple BSs cooperation is a challenging issue.

In this paper, system EE with dynamic clustering is investigated, and a novel dynamic clustering algorithm for multiple BSs cooperation is proposed. With equal power allocation and per antenna equal power constraint, the formulas of system spectral efficiency (SE) and EE for the case of ideal transmit [17] and the case of actual transmit are derived, respectively. The lower bound and the upper bound for SE and EE are addressed. By calculating the mutual interference matrix according to channel norm, the BS which has the biggest element in the interference matrix is selected as the leader BS. Then the rest BSs are chosen by the leader BS according to the cluster size to form a new cluster. Compared with the known dynamic clustering algorithm, the proposed algorithm has the very low complexity.

The rest of this paper is organized as follows. In Sect. 2, the system model is introduced. In Sect. 3, SE and EE for the lower bound and the upper bound are derived, where the case of ideal transmit and the case of actual transmit are considered, respectively. In Sect. 4, a novel dynamic clustering algorithm based on channel norm is proposed, and the computational complexity is analyzed. Simulation results are presented in Sect. 5. Finally, conclusions are drawn in Sect. 6.

Notations: Bold upper and lower case letters are used to denote matrices and vectors, respectively. \({( \cdot )^T}\) and \({( \cdot )^{\mathcal {H}}}\) stand for transpose and Hermitian transpose, respectively. \({\mathbb {C}}^N\) stands for the N dimensional complex vector space. \({{{\mathbb {E}}}}[ \cdot ]\) denotes the expectation operator, and the trace is expressed by \({\text {tr}}( \cdot )\). \(\left\lceil {x} \right\rceil\) denotes the nearest integers greater than or equal to x.

2 System Model

A wireless cellular system in downlink with M BSs and K mobile stations (MSs) is used for analysis, and the frequency reuse factor is 1, which is shown in Fig. 1, and a single antenna is equipped for each BS and MS, respectively. The channel state information (CSI) is shared in the main center processing unit, then the BSs in the system are partitioned into several clusters, and the cluster size can be the same, also can not be the same. The number of the clusters for each cluster set is G, and the g-th cluster is \(M_g\), so \(\bigcup \nolimits _{g = 1}^G {\left| {M_g } \right| } = M\), where \(2 \leqslant M_g \leqslant M\). For simplicity, it is assumed that the cluster size in one cluster set is the same. If the cluster size is \(\varOmega\), and the number of cluster is \(\left\lceil {M/\varOmega } \right\rceil\). One thing to note here is that each BS can only be in one cluster, and the MS in the cell can only be served by the corresponding cluster. Let \(K_g\) denote the MSs set served by the g-th cluster, and \(\bigcup \nolimits _{g = 1}^G {\left| {K_g } \right| } = K\). The channel for the i-th MS in the j-th cluster is

$$\begin{aligned} {{\mathbf{h}}_{K_{j(i)} } = \left[ {\mathbf{h}}_{K_{j(i)} }^{M_1 } ,{\mathbf{h}}_{K_{j(i)} }^{M_2 } , \ldots ,{\mathbf{h}}_{K_{j(i)} }^{M_G } \right] } , \end{aligned}$$
(1)

where \({\mathbf{h}}_{K_{j(i)} }^{M_g }\in {\mathbb {C}}^{1 \times \left| {M_g } \right| },(i=1, \ldots ,\left| {K_j } \right| , \, j=1, \ldots , \, G, g=1, \ldots ,G\)) is the channel vector from the g-th cluster to the i-th MS in the j-th cluster. Therefore, the whole downlink channel matrix of the system is

$$\begin{aligned} {\mathbf {H}} = \left[ {\mathbf{h}}_{K_{1(1)} }^T , \ldots ,{\mathbf{h}}_{K_{1(\left| {K_1 } \right| )} }^T , \ldots ,{\mathbf{h}}_{K_{G(1)} }^T , \ldots ,{\mathbf{h}}_{K_{G(\left| {K_G } \right| )} }^T \right] ^T . \end{aligned}$$
(2)

When the MS data and CSI are known by the BSs in the same cluster, BSs can transmit the data by jointly processing. The common used precoding techniques include dirty paper coding (DPC), zero-forcing (ZF) precoding, and minimum mean squared error (MMSE) precoding, and ZF precoding is adopted in this paper. The ZF precoding matrix for the j-th cluster is \({\mathbf {V}_{K_j }} = [{\mathbf{v}}_{K_{j(1)} } ,{\mathbf{v}}_{K_{j(2)} } , \ldots ,{\mathbf{v}}_{K_{j(| {K_j } |)} } ]\), where \({\mathbf{v}}_{K_{j(i)} } \in {\mathbb {C}}^{\left| {M_j } \right| \times 1}\) is the normalized precoding vector. The transmit data symbol vector is \({{\mathbf{x}}}_{K_j } = \left[ x_{K_{j(1)} } ,x_{K_{j(2)} } , \ldots ,x_{K_{j(| {K_j }|)} } \right] ^T\), where \({{\mathbb {E}}}\left[ | {x_{K_{j(i)} } } |^2 \right]\)=1, \(i = 1, \ldots ,| {K_j } |\), and the allocated power vector is \({{\mathbf{s}}}_{K_j } = \left[ s_{K_{j(1)} } ,s_{K_{j(2)} } , \ldots ,s_{K_{j(| {K_j } | )} } \right] ^T\). Therefore, the received signal for the i-th MS in the j-th cluster is

$$\begin{aligned} y_{K_{j(i)} }& = {\mathbf{h}}_{K_{j(i)} }^{M_j } {\mathbf{v}}_{K_{j(i)} } \sqrt{s_{K_{j(i)} } } x_{K_{j(i)} } \\&\quad +\sum \limits _{l = 1,l \ne i}^{\left| {K_j } \right| } {{\mathbf{h}}_{K_{j(i)} }^{M_j } } {\mathbf{v}}_{K_{j(l)} } \sqrt{s_{K_{j(l)} } } x_{K_{j(l)} } \\&\quad + \sum \limits _{q = 1,q \ne j}^G {\sum \limits _{m = 1}^{\left| {K_q } \right| } {{\mathbf{h}}_{K_{j(i)} }^{M_q } } {\mathbf{v}}_{K_{q(m)} } \sqrt{s_{K_{q(m)} } } x_{K_{q(m)} } } + n_i , \end{aligned}$$
(3)

where \(\sqrt{U_{K_{j(i)} } } = {\mathbf{h}}_{K_{j(i)} }^{M_j } {\mathbf{v}}_{K_{j(i)} } \sqrt{s_{K_{j(i)} } } x_{K_{j(i)} }\) is the useful received signal, \(\sqrt{I_{K_{j(i)} }^{tra} } =\sum \nolimits _{l = 1,l \ne i}^{\left| {K_j } \right| } {{\mathbf{h}}_{K_{j(i)} }^{M_j } } {\mathbf{v}}_{K_{j(l)} } \sqrt{s_{K_{j(l)} } } x_{K_{j(l)} }\) is the interference signal from the other BSs of the intra-cluster, \(\sqrt{I_{K_{j(i)} }^{ter} } = \sum \nolimits _{q = 1,q \ne j}^G {\sum \nolimits _{m = 1}^{\left| {K_q } \right| } {{\mathbf{h}}_{K_{j(i)} }^{M_q } } {\mathbf{v}}_{K_{q(m)} } \sqrt{s_{K_{q(m)} } } x_{K_{q(m)} } }\) is the interference signal from the BSs of the other clusters, and \(n_i\) is the additive white Gaussian noise (AWGN) with zero mean value and variance \({{\mathbb {E}}}\left( n_i n_i^H \right) = \sigma ^2\). Unlike the traditional total power constraint, the case of per antenna power constraint is adopted in this section. This is because in practical wireless system, each antenna has its own power amplifier (PA), and the output power of each antenna is controlled by its PA type [18, 19]. Therefore, the power constraint for the m-th antenna in the j-th cluster is

$$\begin{aligned} \sum \limits _{n = 1}^{\left| {K_j } \right| } {\left| {{\mathbf{V}}_{K_j (m,n)} } \right| ^2 } s_{K_{j(n)} } \leqslant P_m , \quad m = 1,2, \ldots ,\left| {M_j } \right| . \end{aligned}$$
(4)

It is assumed that, in the j-th cluster, each symbol is allocated to the same power and each antenna has the same power constraint, i.e., \(s_{K_{j(i)} }=S , i=1, \ldots ,\left| {K_j } \right|\) and \(P_m=P, \quad m = 1,2, \ldots ,| {M_j }|\). Thus, the relation between the symbol power and the antenna power constraint is as follows

$$\begin{aligned} S = P/\mathop {\max }\limits _{m = 1,2, \ldots ,\left| {M_j } \right| } \left[ {{\mathbf{V}}_{K_j } ({\mathbf{V}}_{K_j } )^{\mathcal {H}} } \right] _{(m,m)} . \end{aligned}$$
(5)

3 Derivation of SE and EE

For a given cluster realization, system SE and EE are derived, respectively. When a cluster realization forms, for the MSs in the j-th cluster, the sum of the achieved throughput is expressed as \(R(M_j)\). There are a variety of metrics which are used to measure system EE [20, 21]. In this paper, the system throughput per unit energy consumption, i.e., the bits per Joule, will be used as the metric of system EE. The system sum-throughput is

$$\begin{aligned} R = \sum \limits _{j = 1}^G {R(M_j )} . \end{aligned}$$
(6)

The system SE \(\psi\) is

$$\begin{aligned} \psi = \sum \limits _{j = 1}^G {R(M_j )/M} . \end{aligned}$$
(7)

So the system EE \(\eta\) is

$$\begin{aligned} \eta = \frac{R}{{P_{total} }} = \frac{{\sum \nolimits _{j = 1}^G {R(M_j )} }}{{P_{total} }} , \end{aligned}$$
(8)

where \(P_{total}\) is the total power consumption of the system.

3.1 System SE and EE for Ideal Transmit

Here, system SE and EE for the case of ideal transmit are considered, respectively. The ideal transmit means that only the radio frequency (RF) output power is considered, and the circuit power consumption of BS, the power amplifiers dissipation and the energy consumption of the backhaul links will not be included. When the cluster is formed, with equal power allocation and per antenna equal power constraint, system SE in each time slot is

$${{\psi _{l} = \sum\limits_{{j = 1}}^{G} {\sum\limits_{{i = 1}}^{{\left| {K_{j} } \right|}} {\log _{2} } } \left( {1 + \frac{{U_{{K_{{j(i)}} }} }}{{I_{{K_{{j(i)}} }}^{{tra}} + I_{{K_{{j(i)}} }}^{{ter}} + \sigma ^{2} }}} \right)} \mathord{\left/ {\vphantom {{\psi _{l} = \sum\limits_{{j = 1}}^{G} {\sum\limits_{{i = 1}}^{{\left| {K_{j} } \right|}} {\log _{2} } } \left( {1 + \frac{{U_{{K_{{j(i)}} }} }}{{I_{{K_{{j(i)}} }}^{{tra}} + I_{{K_{{j(i)}} }}^{{ter}} + \sigma ^{2} }}} \right)} M}} \right. \kern-\nulldelimiterspace} M}$$
(9)

and (9) is a lower bound of SE for a given clustering method because of the inter-cluster interference. On the other hand, when there is no inter-cluster interference, the upper bound of SE can be given as follows

$${{\psi _{u} = \sum\limits_{{j = 1}}^{G} {\sum\limits_{{i = 1}}^{{\left| {K_{j} } \right|}} {\log _{2} } } \left( {1 + \frac{{U_{{K_{{j(i)}} }} }}{{I_{{K_{{j(i)}} }}^{{tra}} + \sigma ^{2} }}} \right)} \mathord{\left/ {\vphantom {{\psi _{u} = \sum\limits_{{j = 1}}^{G} {\sum\limits_{{i = 1}}^{{\left| {K_{j} } \right|}} {\log _{2} } } \left( {1 + \frac{{U_{{K_{{j(i)}} }} }}{{I_{{K_{{j(i)}} }}^{{tra}} + \sigma ^{2} }}} \right)} M}} \right. \kern-\nulldelimiterspace} M}$$
(10)

Let \(\theta _{(M_j )} = 1/\mathop {\max }\limits _{m = 1,2, \ldots ,| {M_j } | } [ {{\mathbf{V}}_{K_j } ({\mathbf{V}}_{K_j } )^{\mathcal {H}} } ] _{(m,m)}\), according to the definition of EE, system EE for the ideal transmit is

$$\begin{aligned} \eta _l = \frac{{\sum \nolimits _{j = 1}^G {\sum \nolimits _{i = 1}^{\left| {K_j } \right| } {\log _2 } } \left( 1 + \frac{{U_{K_{j(i)} } }}{{I_{K_{j(i)} }^{tra} + I_{K_{j(i)} }^{ter} + \sigma ^2 }}\right) }}{{P\left( \sum \nolimits _{j = 1}^G {{\text {tr}}} ({\mathbf{V}}_{K_j } ({\mathbf{V}}_{K_j } )^{\mathcal {H}} )\theta _{(M_j )} \right) }} , \end{aligned}$$
(11)

Due to the inter-cluster interference, (11) is a lower bound of EE for the ideal transmit. Similarly, if there is no inter-cluster interference, the upper bound of EE can be given

$$\begin{aligned} \eta _u = \frac{{\sum \nolimits _{j = 1}^G {\sum \nolimits _{i = 1}^{\left| {K_j } \right| } {\log _2 } } \left( 1 + \frac{{U_{K_{j(i)} } }}{{I_{K_{j(i)} }^{tra} + \sigma ^2 }}\right) }}{{P\left( \sum \nolimits _{j = 1}^G {{\text {tr}}} \left( {\mathbf{V}}_{K_j } ({\mathbf{V}}_{K_j } )^{\mathcal {H}} \right) \theta _{(M_j )} \right) }} . \end{aligned}$$
(12)

3.2 System SE and EE for Actual Transmit

The ideal transmit only considers the RF power, which only contains a small part of the overall power consumption of the whole system, and therefore may result in one-sides conclusion, when evaluating system EE [22]. The power consumption models which includes different components for macro and micro BSs are presented in [23], and the backhaul link power is discussed. The actual total energy consumption for a given cluster size can be classified into three categories: the total power amplifiers energy dissipation, the circuit consumption due to signal processing and the backhaul link energy consumption [10]. Therefore, the actual power model of the j-th cluster is

$$\begin{aligned} {P_{act(j)}} = \sum \limits _{i = 1}^{\left| {{M_j}} \right| } {{p_i}} /\xi + {P_c}\left| {{M_j}} \right| + {P_{b}}{N_{b}}_{({M_j})} , \end{aligned}$$
(13)

where \(p_i\) represents the transmit output power of the i-th power amplifier, and \(\xi \in [0,1]\) is the power amplifier efficiency. \(P_c\) is the circuit power consumption value of a BS, which is caused by base band digital signal processing, the active transceiver filter, digital to analog converter (DAC), analog to digital converter (ADC), the frequency synthesizer, and so on. \(P_{b}\) is the additional power requirement per backhaul link. \({N_{b}}_{({M_j})}\)is the number of backhaul links of the j-th cluster, which is related to the network structure.

The system SE for the case of actual transmit is the same as that for the ideal transmit. In this case, the power P is replaced by the output power of the power amplifier. The system EE for the actual power model is

$$\begin{aligned} \tilde{\eta }_l = \frac{{\sum \nolimits _{j = 1}^G {\sum \nolimits _{i = 1}^{\left| {K_j } \right| } {\log _2 } } \left( 1 + \frac{{U_{K_{j(i)} } }}{{I_{K_{j(i)} }^{tra} + I_{K_{j(i)} }^{ter} + \sigma ^2 }}\right) }}{{\sum \nolimits _{j = 1}^G {{\text {tr}}} ({\mathbf{V}}_{K_j } ({\mathbf{V}}_{K_j } )^{\mathcal {H}} )\theta _{(M_j )} P/\xi + {P_c}\left| {{M_j}} \right| + {P_{b}}{N_{b}}_{({M_j})} }} , \end{aligned}$$
(14)

and (14) is a lower bound of EE for the actual transmit because of the inter-cluster interference. When there is no inter-cluster interference, the upper bound of EE is

$$\begin{aligned} \tilde{\eta }_u = \frac{{\sum \nolimits _{j = 1}^G {\sum \nolimits _{i = 1}^{\left| {K_j } \right| } {\log _2 } } \left( 1 + \frac{{U_{K_{j(i)} } }}{{I_{K_{j(i)} }^{tra} + \sigma ^2 }}\right) }}{{\sum \nolimits _{j = 1}^G {{\text {tr}}} ({\mathbf{V}}_{K_j } ({\mathbf{V}}_{K_j } )^{\mathcal {H}} )\theta _{(M_j )} P/\xi + {P_c}\left| {{M_j}} \right| + {P_{b}}{N_{b}}_{({M_j})} }} . \end{aligned}$$
(15)

When the cluster size and the CSI are given, with equal power allocation and per antenna equal power constraint for a cluster set, EE for the actual transmit is the function of the output power of the power amplifier. When the range of the output power of the PA is given, it is an one dimensional optimization problem to solve EE. The problem can be solved by search methods and/or approximation methods, and the detailed process can be found in [24].

4 Proposed Dynamic Clustering Algorithm

By BSs clustering, system performance can be improved, which mainly because the inter-cell interference in the cluster can be alleviated or eliminated. There are two main kinds of clustering methods, i.e., static clustering method and dynamic clustering method.

4.1 Static Clustering

For static clustering, the cluster realization is formed according to the geographic locations of the BSs. The selected BSs in the cluster are fixed, which does not change over time. The advantage of this method is that the clustering course is simple, and the main center processing unit do not need the overhead of backhaul. However, there is severe inter-cluster interference when the scheduled MSs is on the edge of the two adjacent clusters.

4.2 Dynamic Clustering

In order to solve the defect of static clustering, BSs dynamic clustering has raised concerns [15, 16, 2527]. Although the optimal system performance can be achieved by exhaustive search, the complexity is too high. In [28], a downlink decentralized dynamic clustering algorithm is proposed. Each BS selects the BSs from the neighboring cells, then these BSs compose several clusters. These clusters form the clusters list according to the corresponding cluster capacity in descending order, and the cluster with the maximum capacity is proposed as the preferred cluster. In the process of iteration, if the BSs in the preferred cluster have been received by other clusters, the cluster with the second maximum sum capacity is selected as the preferred cluster, and so on.

4.3 The Proposed Dynamic Clustering Algorithm

In this subsection, a novel dynamic clustering algorithm is proposed. The higher the interference between two BSs is, the more likely they are in the same cluster. Motivated by this, a new algorithm based on the channel norm is constructed, and it turns out that the number of the candidate cluster sets for this algorithm decreases dramatically.

For the multi-cell system with M BSs and K MSs, it is assumed that the cluster size is \(\varOmega\). Let \({\mathcal {D}}\) denote the candidate BSs list, which contains all the BSs which have not been selected to form a new cluster. Let \(\widetilde{{\mathcal {D}}}\) denote the selected BSs list, which includes all the BSs which have been selected in the cluster forming. According to the whole channel matrix \({\mathbf{H}}\), the mutual interference matrix \({\mathbf{F}}\) is defined as follows

$$\begin{aligned} f_{ij}= \left\{ \begin{array}{ll} \left| {h_{ij} } \right| ^2 + \left| {h_{ji} } \right| ^2 &{} \quad {\text {for}}\; i \ne j \\ 0 &{} \quad {\text {for}}\;i = j \end{array} \right. \end{aligned}$$
(16)

where \(f_{ij}\) is the element in the i-th line and the j-th column in \({\mathbf{F}}\), and \(h_{ij}\) is the channel coefficient from the i-th BS to the j-th MS. It can be seen that \({\mathbf{F}}\) is a symmetric matrix, so we only need to calculate the elements in the upper triangular or the lower triangular of \({\mathbf{F}}\).

The main step of the proposed algorithm is to find the lead BS for each cluster. For the BS i in \({\mathcal {D}}\), if i satisfies \(\mathop {\max }\limits _{i \in {\mathcal {D}}} \sum \nolimits _{j,j \ne i} {{f_{ij}}}\), then BS i is selected as the lead BS. The rest of the candidate BSs are selected by the lead BS. If the BS \(i'\) satisfies \(\mathop {\max }\limits _{i' \in {\mathcal {D}} } {f_{ii'}}\), then the BS \(i'\) is selected to join the current cluster. When a new cluster is formed, the selected BSs are deleted from \({\mathcal {D}}\) and are added to \(\widetilde{{\mathcal {D}}}\). Another cluster is formed according to the same steps until all BSs are partitioned into separate clusters. The algorithm is summarized in Table 1.

Table 1 The Proposed Dynamic Clustering Algorithm

As was mentioned earlier, the complexity of exhaustive search increases rapidly with the number of BSs. For convenience, we only analyze the number of possible cluster sets. Assume \(\varGamma = M/\varOmega\) is an integer. Then, there are \(M!/((\varOmega !)^\varGamma \varGamma !)\) possible cluster sets for exhaustive search [15]. In sharp contrast, our proposed algorithm incurs a very low computational load. Once the lead BS of a cluster is chosen, the cluster can be formed. For example, when \(M = 18 , \varOmega = 3 , \varGamma = 6\), the number of the cluster sets for exhaustive search is 90,590,400. The average number of clusters in each BS for the decentralized dynamic clustering algorithm is 1.9, so the average number of cluster sets for the decentralized algorithm in [28] is 36. In fact, the number of cluster sets for this algorithm would be greater than 36, because each BS needs to search the candidate clusters from the neighboring cells. On the other hand, our proposed algorithm only needs one cluster set, which can be decided once by calculating the channel norm, regardless of the number of the BSs in system.

5 Simulation Results

A two rings cellular network in downlink is considered, as shown in Fig. 1. For the convenience of clustering, a BS in the second ring is omitted. The radius of the hexagon cell is 500 meters, and each BS equipped with one omnidirectional antenna locates at the center of its own cell. Single antenna MSs are randomly distributed in each cell, and only one MS in each cell is served in each time of simulation. The channel coefficient from the m-th BS to the k-th MS is

$$\begin{aligned} h_{m,k} = \sqrt{G_a } h_{m,k}^l h_{m,k}^s , \end{aligned}$$
(17)

where \(G_a =7.94\) is the power gain which includes the transmit antenna in BS and the receive antenna in MS. \(h_{m,k}^l = \sqrt{\lambda _{m,k} /d_{m,k}^\alpha }\) represents the combination of path loss and shadow fading, and \(\lambda _{m,k}\) is a log-normal distributed shadowing variable with zero mean and standard deviation \(\sigma _{sh}\), here \(\sigma _{sh} = 8\) dB. \(d_{m,k}\) is the distance from the m-th BS to the k-th MS, and \(\alpha = 3.76\) represents the path loss exponent. \(h_{m,k}^s\) represents the Rayleigh small scale fading [25, 29].

Fig. 1
figure 1

Network model

In order to assess EE of the wireless system comprehensively, besides RF power, the other energy consumption such as the circuit power consumption of BS and the backhaul link power should be considered. It is assumed that the efficiency of the power amplifier is 0.35, each BS circuit power consumption is 10 W, the backhaul links are connected with the fiber, and the average power consumption for each backhaul link is 10 W.

Fig. 2
figure 2

System SE and EE versus cluster size for static cluster and the proposed dynamic clustering algorithm, where the inter-cluster interference is considered

Figure 2 illustrates system SE and EE as a function of the cluster size for static cluster and the proposed dynamic clustering algorithm, respectively, and the inter-cluster interference is considered. The ideal transmit RF power is 20 W. When the cluster size is 1, there is no cooperation between BSs, hence this point on the curve represents the lowest performance. When the cluster size increases, SE and EE all increase. This is because with the increase of the cluster size, the inter-cell interference (ICI) can be better reduced. SE and EE of the proposed dynamic clustering algorithm are better than that of static cluster when the cluster size is 2 and 3. However, since the cluster of proposed algorithm is determined by the leader BS, which may separate some BSs, and SE of the cluster size 4 degrades, but system EE is still better than that of static cluster. Because of the feedback and backhaul consumption, the cluster size is limited in real system, so the cluster size 2 or 3 is suitable.

Fig. 3
figure 3

System SE and EE of the proposed algorithm versus the ideal transmit power, where the inter-cluster interference is considered

System SE and EE of the proposed algorithm for different cluster sizes versus the ideal transmit power are shown in Fig. 3, and the inter-cluster interference is considered. For system SE, in low transmit power regime, zero-forcing precoding results in system capacity degradation. When the ideal transmit power is larger than 10 dBW, the proposed algorithm is better than no-cooperation, and the larger the cluster size, the higher the system SE. When the ideal transmit power is larger than 20 dBW, system SE does not increase linearly, this is because the inter-cluster interference is serious. System EE decreases with the transmit power.

Fig. 4
figure 4

System SE and EE of the proposed algorithm versus the actual transmit power, where the inter-cluster interference is considered

Then system SE and EE of the proposed algorithm versus the actual transmit power for some cluster sizes are shown in Fig. 4, in which the inter-cluster interference is considered. It can be seen that when the output power of power amplifier is smaller than 36 dBW, system EE of the proposed dynamic clustering algorithm is worse than that of no-cooperation. One reason is that in low transmit power region, zero-forcing precoding results in system capacity degradation, which also affects the system EE; the other reason is that, the circuit and the backhaul links also consume the power. When the output power of power amplifier is larger than 36 dBW, system EE of the proposed algorithm is better than that of no-cooperation, and the larger the cluster size, the better the system EE.

Fig. 5
figure 5

System SE and EE versus the actual transmit power for static cluster, the proposed algorithm and the decentralized algorithm when the cluster size is 3

In Fig. 5, system SE and EE versus the actual transmit power for static cluster, the proposed algorithm and the decentralized algorithm [28] are shown, and the cluster size is 3. When the inter-cluster interference is considered, system SE and EE evaluations are the same as Fig. 4. When there is no inter-cluster interference, system SE increases with the transmit power, and system EE slightly increases with the transmit power at first, then decreases with the transmit power, so there is a maximum EE value.

Fig. 6
figure 6

The tradeoff between SE and EE for static cluster, the proposed algorithm and the decentralized algorithm when the cluster size is 3

The tradeoff between SE and EE for static cluster, the proposed algorithm and the decentralized algorithm can be seen in Fig. 6, and the cluster size is 3. When the inter-cluster interference is considered, system SE and EE increase gradually at first, then SE reaches the maximum values that smaller than 4 bit/s/Hz/cell, and the maximum EE value is smaller than 0.2 bit/Joule. With the power increasing, SE remains a constant and EE decreases. When there is no inter-cluster interference, SE increases from 2 bit/s/Hz/cell to more than 16 bit/s/Hz/cell in the transmit power region, EE increases from 0.1 to 0.48 bit/Joule, then decreases gradually. It can be seen that the inter-cluster interference has an important influence on SE and EE, so the methods to eliminate interference such as fractional frequency reuse (FFR), orthogonal frequency division multiplexing (OFDM) are beneficial to improve SE and EE. However, there is a saturation point for SE–EE curve. EE will not increased, regardless of how much transmit power is used. It will provide the scientific guidance for the design of the optimal energy consumption networks in future.

6 Conclusions

In this paper, EE based on a novel dynamic clustering algorithm for multiple BSs cooperation has been investigated. The formulas for system SE and EE were derived, respectively, and the case of ideal transmit and actual transmit were considered. A novel dynamic clustering algorithm based on channel norm was proposed, which has a better performance than that of static clustering and a lower complexity than that of a known decentralized algorithm. Simulation results showed that EE of the proposed algorithm is better than that of the static cluster, and slightly degrades than that of the decentralized algorithm. In future work, Interference cancellation such as fractional frequency reuse, power management should be considered jointly with dynamic clustering so as to further improve the system EE.