1. Introduction

Owing to their high spectral efficiency, multiple-input multiple-output (MIMO) wireless antenna systems have been recognized as a key technology for future wireless communication systems such as long-term evolution (LTE), LTE-advanced (LTE-A), WiMax, etc. Multiuser MIMO (MU-MIMO) has become one of the main features in LTE-A systems because of several key advantages over single-user MIMO (SU-MIMO) [1, 2]. There are several kinds of classic transmission methods for downlink MU-MIMO: Dirty Paper Coding (DPC) [3], Block Diagonalization (BD) [4, 5], and zero-forcing (ZF) (or channel inversion) [6]. Though DPC is optimal and can achieve sum-rate capacity, it is difficult to implement in practical systems because of its high complexity. BD and ZF are suboptimal methods with tolerable performance degradation and their lower complexities make them easier to implement. Furthermore, compared to BD, ZF is an even simpler algorithm which essentially separates multiple data streams from the same user equipment (UE) at transmitter side. So, in this article, ZF transmission method is chosen for the performance analysis of downlink MU-MIMO systems; however, similar analytical methods may be applied to other transmission methods as well.

As is well known, the availability of accurate channel state information (CSI) is very important for downlink MU-MIMO schemes. However, in practice, CSI is always imperfect because of the existence of CSI delay, quantization error, and channel estimation error. This would cause not only self-interference among different data streams of the same user, but also interference among users, severely degrading the performance especially in case of high mobile users or long delay. Hence, it is important to characterize the performance of MU-MIMO system in the presence of imperfect CSI.

Most recent study [712] about the impact of imperfect CSI on MU-MIMO focused on the frequency division duplex systems. In [7], the authors investigated the impact of feedback delay and estimation error on the sum-rate of MU-MIMO systems. In [8], the authors studied upper and lower bounds on the achievable sum-rate of a correlated/uncorrelated MU-MIMO channel with channel estimation error and feedback delay. The achievable ergodic rates were derived for multi-user MIMO systems with CSI delay and quantization error in [9, 10]. In [11], the impact of imperfect CSI on sum-rate scaling law was investigated for downlink MU-MIMO systems. In [12], the authors quantified the impact of channel estimation errors, quantization errors, and outdated quantized CSI on the rate loss of MU-MIMO system.

To the authors' knowledge, the impact of imperfect CSI on MU-MIMO in time division duplex (TDD) systems is almost rarely investigated. In this article, we study the impact of imperfect CSI caused by both CSI delay and channel estimation error on bit error rate (BER) for TDD downlink MU-MIMO ZF systems. In order to clearly indicate the impact of imperfect CSI on MU-MIMO, we only analyze un-coded MU-MIMO systems although channel coding techniques are indispensable in practical systems. In TDD system, the base station (BS) estimates CSI at transmitter side based on the uplink pilots periodically sent by the mobile users. Then, BS uses it through channel reciprocity to generate precoding matrix for the downlink data transmission. Because of the constraints of the TDD frame structure and the uplink pilot overhead, there inevitably exists both CSI delay and channel estimation error between uplink estimated channel (used to generate precoding matrix during downlink transmission) and downlink transmission channel, which degrades the system performance. In this article, using the correlation between the actual channel and the estimated one [13], as well as the channel's time-correlation [14], we obtain an expression for post-processing signal-to-interference plus noise ratio (SINR) of each data stream of TDD downlink MU-MIMO systems. Based on the post-processing SINR, we then obtain the expression for average BER of uncoded TDD MU-MIMO ZF systems with M-quadrature amplitude modulation (QAM)-modulated signals. Numerical simulations verify our analysis.

Notation: E(·), (·)H, (·)T, (·)*, and ||·||F denote expectation, Hermitian, transpose, complex conjugation, and Frobenius norm, respectively. I M is the M × M identity matrix. (·) denotes the right pseudo inversion and (A)≜AH(AAH)-1. C N ( μ , Σ ) denotes the complex Gaussian distribution with mean vector μ and variance matrix Σ.

2. System model

Consider a TDD downlink MU-MIMO system with ZF precoding, where a BS equipped with M antennas transmits signals to K mobile users, each equipped with n k (k = 1, 2, ..., K) antennas, under the assumption that M N = k = 1 K n k to guarantee the existence of a non-zero precoding matrix. This assumption can be satisfied with the help of user scheduling techniques which select a subset (active users) of the available users to communicate at each time slot such that the total number of receive antennas for active users at any time instant satisfies the above required assumption [15, 16]. Because orthogonal frequency division multiplexing divides a wideband MIMO channel into a series of parallel narrowband MIMO channels, we can assume that the channels are frequency flat. Furthermore, we assume that the channels are spatially uncorrelated, time-varying, and Rayleigh fading, and channel's power spectrum follows Jakes model [17]. The channel matrix from the BS to the k th user is denoted by H k = h k , i j n k × M , where h k , i j ~ C N ( 0 , 1 ) is the complex channel gain between the j th transmit antenna of BS and the i th receive antenna of user k. Let b k denote the n k × 1 transmit signal vector to user k. This signal vector is first multiplied by an M × n k precoding matrix T k and then transmitted through M transmit antennas. The received signal vector y k (n k × 1) of user k at the m th symbol interval is

y k [ m ] = H k [ m ] l = 1 K T l [ m ] b l [ m ] + n k [ m ] ,
(1)

where n k ~ C N ( 0 , N 0 , k I n k ) is an additive white Gaussian noise (AWGN) vector, b l satisfies E [ b l b l H ] = E s I n l , Es is the symbol energy. The system equation can be expressed in the matrix form as follows

y [ m ] = H [ m ] T [ m ] b [ m ] + n [ m ] ,
(2)

where

y [ m ] y 1 T [ m ] y 2 T [ m ] y K T [ m ] T , H H 1 T [ m ] H 2 T [ m ] H K T [ m ] T , T [ m ] T 1 [ m T 2 [ m T K [ m ] , b [ m ] b 1 T [ m ] b 2 T [ m ] b K T [ m ] T , n [ m ] n 1 T [ m ] n 2 T [ m ] n K T [ m ] T .

There are seven kinds of TDD frame configurations as defined in 3GPP specifications [18, 19]. Without loss of generality, we choose the TDD frame configuration 2 for analysis. Figure 1 describes the structure of TDD frame configuration 2. One radio frame includes 10 subframes and 1 subframe (duration is 1 ms) includes 14 symbols per subcarrier. The uplink pilots for downlink beamforming transmission can be sent via the last one or several symbols in the special subframe and/or the uplink subframe.

Figure 1
figure 1

TDD frame configuration 2.

Because only frequency flat fading channel is considered, Figure 1 can be equivalently simplified into Figure 2 for our analysis. Here, all pilot symbols are drawn in one equivalent block together with those data symbols in the sequent downlink subframes, which clearly indicates the CSI delay (denoted by Md) between uplink channel estimation and downlink data transmission. In practical TDD systems, the number of pilot symbols (denoted by np) in one equivalent block is very small compared with that of data symbols (denoted by nd), so we can consider that all pilot symbols in one equivalent block experience the stationary channel.

Figure 2
figure 2

One equivalent block in TDD system.

In TDD MU-MIMO systems, the procedures at the physical layer for the downlink data transmission are as follows:

Step 1: BS obtains the delay estimated version H ^ [ m - M d ] of CSI based on the received uplink pilots at the (m - Md)th symbol interval. Here, H ^ [ m - M d ] H ^ 1 T [ m - M d ] H ^ 2 T [ m - M d ] , . . . , H ^ K T [ m - M d ] , Md denotes the delay in symbol between the uplink channel estimation and downlink data transmission, and the value of Md ranges from 1 to nd for the different downlink data symbol as in Figure 2.

Step 2: BS generates the normalized ZF precoding matrix as follows and sends out the downlink data streams.

T [ m ] = H ^ [ m - M d ] H ^ [ m - M d ] F ,
(3)

Step 3: each user estimates the downlink channel through the downlink pilots and then detects the received signal.

3. BER analysis

In this section, we first derive the post-processing SINR under the given H ^ [ m - M d ] , and then derive the average BER based on post-processing SINR.

Substituting (3) into (2), the received signal vector (2) of system can be expressed as

y [ m ] = H [ m ] H ^ [ m - M d ] H ^ [ m - M d ] F b [ m ] + n [ m ] .
(4)

Similar to [13], we can deduce that H[m-Md] and H ^ [ m - M d ] are jointly complex Gaussian distributed

H 1 [ m - M d ] H 2 [ m - M d ] . . . H K [ m - M d ] = ρ e,1 H ^ 1 [ m - M d ] ρ e,2 H ^ 2 [ m - M d ] . . . ρ e, K H ^ K [ m - M d ] + 1 - ρ e,1 2 ζ 1 [ m - M d ] 1 - ρ e,2 2 ζ 2 [ m - M d ] . . . 1 - ρ e, K 2 ζ K [ m - M d ] ,
(5)

where the elements of N × M random matrix H ^ [ m - M d ] are independent and identically distributed (i.i.d) zero-mean complex Gaussian random variables with unit variance, the elements of the n k × M random matrix ζ k [m-Md] are also i.i.d zero-mean complex Gaussian random variables with unit variance, ρe, kis the complex correlation coefficient between the actual channel gain and its estimation for user k and is defined as

ρ e, k E h k , i j [ m - M d ] ĥ k , i j * [ m - M d ] , k = 1 , 2 , , K , i = 1 , 2 , , n k , j  = 1,2 , , M ,
(6)

where 0 ≤ |ρe, k| ≤ 1. Because the SNR of pilots of each user can be different, ρe, kof each user can be different.

Assuming channel follows a Gauss-Markov autoregressive (AR) process, similar to [14] we can also deduce that H[m] and H[m-Md] follow jointly complex Gaussian distribution

H 1 [ m ] H 2 [ m ] . . . H K [ m ] = ρ d,1 H 1 [ m - M d ] ρ d,2 H 2 [ m - M d ] . . . ρ d, K H K [ m - M d ] + 1 - ρ d,1 2 ε 1 [ m ] 1 - ρ d,2 2 ε 2 [ m ] . . . 1 - ρ d, K 2 ε K [ m ] ,
(7)

where the elements of the n k × M random matrix ε k [m] (k = 1, 2, ..., K) are i.i.d zero-mean complex Gaussian random variables with unit variance, ρd, kis the complex correlation coefficient between current channel gain and the delayed one for user k and is defined as

ρ d, k E h k , i j [ m ] h k , i j * [ m - M d ] k = 1 , 2 , , K , i = 1 , 2 , , n k , j = 1 , 2 , , M ,
(8)

where 0 ≤ |ρd, k| ≤ 1. Because each user can have a different mobile velocity, ρd, kof each user can be different.

Defining ρkρd, kρd, kk = 1, 2, ..., K, and substituting (5), (7) into (4), the received signal vector (4) of system can be further expressed as

y [ m ] = b eq [ m ] + n eq [ m ] ,
(9)

where b eq [ m ] b eq,1 T [ m b eq,2 T [ m b eq, K T [ m ] T is referred to as the effective post-processing signal, given by

b eq [ m ] = 1 H ^ [ m - M d ] F ρ 1 [ m ] b 1 [ m ] ρ 2 [ m ] b 2 [ m ] . . . ρ K [ m ] b K [ m ] ,
(10)

while n eq [ m ] n eq,1 T [ m n eq,2 T [ m . . . n eq, K T [ m ] T is referred to as the effective post-processing noise, given by

n eq [ m ] = ρ d,1 1 - | ρ e,1 | 2 ζ 1 m - M d + 1 - | ρ d,1 | 2 ε 1 m ρ d,2 1 - | ρ e,2 | 2 ζ 2 m - M d + 1 - | ρ d,2 | 2 ε 2 m . . . ρ d, K 1 - | ρ e, K | 2 ζ K m - M d + 1 - | ρ d, K | 2 ε K m H ^ [ m - M d ] H ^ [ m - M d ] F b [ m ] + n [ m ] ,
(11)

the covariance matrix of neq, k[m] can be computed as

E n eq, k [ m ] n eq, k H [ m ] = E s 1 - ρ k 2 I n k + N 0 , k I n k .
(12)

Because ZF precoding has already separated all data streams at transmitter side, from the receiver's perspective MU-MIMO system has reduced into a lot of parallel "equivalent SISO systems" one of which bears one data stream. Although a more complicated receiver could be used in each "equivalent SISO system" to demodulate the data stream to obtain better performance, the main purpose of this article is to investigate the impact of CSI delay and channel estimation error on MU-MIMO systems, so we use the simple receiver "ZF equalizer" in this article to demodulate each data stream.

Therefore, based on (10) and (12), the post-processing SINR per symbol on the i th stream of user k, denoted by γ k, i [m], can be obtained as follows

γ k , i [ m ] = γ s, k | ρ k | 2 γ s, k 1 - | ρ k | 2 + 1 H ^ [ m - M d ] F 2 , k = 1 , 2 , , K , i = 1 , 2 , , n k ,
(13)

where γs, k= Es/N0, kis the pre-processing SNR of downlink data symbol. Note that all data streams to the same user have the same SINR because we do not consider the power allocation strategy for all data streams and each data stream has the equal power.

Based on (13), we below derive the expression for the average BER of TDD MU-MIMO ZF systems with M-QAM modulated signals.

If SNR for uplink pilot symbols is γp, kand minimum mean-square error (MMSE) is chosen for channel estimation for user k, we can deduce

ρ e, k = γ p, k 1 + γ p, k , k = 1 , 2 , , K .
(14)

For a time-varying Rayleigh fading channel, its power spectrum follows the Jakes model [20], then

ρ d, k = J 0 2 π M d T s F d, k , k = 1 , 2 , , K ,
(15)

where J0 is a zeroth-order Bessel function of the first kind, Fd, k is the maximal Doppler frequency shift of user k, Ts is the symbol duration, and MdTs is the time delay between uplink channel estimation and downlink data transmission. So,

ρ k = J 0 2 π M d T s F d, k γ p, k 1 + γ p, k , k = 1 , 2 , , K .
(16)

If the uncoded M-QAM is used for transmitting signals and the constellation size is M = 2q, BER in AWGN is [21]

p b 0 . 2 exp - 1 . 6 γ 2 q - 1 ,
(17)

where γ is post-processing SNR.

Substituting (13) into (17), then the BER at the m th symbol interval, denoted by pb, k, i[m], is as follows for the i th stream of user k.

p b, k , i [ m ] 0 . 2 exp - 1 . 6 γ k , i [ m ] 2 q - 1 , k = 1 , 2 , K , i = 1 , 2 , n k .
(18)

It is observed from (13) and (18) that pb, k, i[m] is dependent on random matrix H ^ [ m - M d ] , so we need to calculate the expectation of pb, k, i[m] with respect to H ^ [ m - M d ] as follows.

Define c k γ s, k ρ k 2 γ s, k 1 - ρ k 2 + 1 , x H ^ [ m - M d ] F 2 , and c k 1.6 c k ' 2 q 1 , so p b , k , i [ m ] 0.2 exp { 1.6 c k ' ( 2 q 1 ) x } = 0.2 exp { c k x } . Let the singular-value decomposition of H ^ as follows:

H ^  =  U Σ V H
(19)

where U and V are unitary matrixes and Σ is a diagonal matrix of singular values σ i . So, we have

H ^ = H ^ H H ^ H ^ H  -  1 = U Σ V H H U Σ V H U Σ V H H - 1 = V Σ H U H U Σ V H V Σ H U H - 1 = V Σ H U H U Σ Σ H U H - 1 = V Σ H U H = Σ Σ H U U H - 1 = V Σ H U H Σ Σ H - 1 = V Σ H Σ Σ H - 1 U H = V Σ - 1 U H

which means that the singular value λ i of H ^ is equal to 1 σ i . Then according to matrix knowledge [22], there exists H ^ F 2  =  i = 1 N λ i 2 . Therefore, we can obtain

x = i = 1 N 1 σ i 2 = i = 1 N 1 σ i
(20)

where σ i = σ i 2 . So, pb, k, inow depends on the square σ i of each singular value σ i of H ^ [ m - M d ] .

As mentioned previously, the entries of H ^ [ m - M d ] are i.i.d zero-mean complex Gaussian random variables with unit variance, so the joint probability density function (PDF) of the square σ i of all singular values σ i of H ^ [ m - M d ] , denoted by f(σ1, σ2, ..., σ N ), can be written as follows according to Theorem 2.17 of [23]

f ( σ 1 , σ 2 , . . . , σ N ) = e - i = 1 N σ i i = 1 N σ i M - N N - i ! M - i ! i < j N σ i - σ j 2 .
(21)

Hence, the average BER, denoted by p ̃ b, k , i , can be obtained as follows

p ̃ b, k , i c k , N , M 0 + 0 σ 1 0 σ N - 1 0 . 2 exp - c k / i = 1 N 1 σ i f ( σ 1 , σ 2 , , σ N ) d σ N d σ 2 d σ 1 .
(22)

While it is difficult to obtain a closed-form expression for (22), the integral is fairly straightforward to evaluate numerically, at least when min(M, N) is small (in practical communication systems, the number of antennas of BS is at most eight at present), so it is valuable for the design of the adaptive modulation and coding scheme in practical communication systems. Moreover, since the BER expression includes the parameters related to channel conditions (e.g., Doppler frequency shift, uplink pilot SNR, CSI delay length, etc.) and the parameters related to system configurations (e.g., modulation mode, symbol duration, number of BS antennas, number of UE antenna, etc.), it provides the hints for people to optimize the MU-MIMO performance in TDD systems from different perspectives.

The BER function p ̃ b, k , i is only determined by three parameters including c k , N, and M. Hence, we can summarize the impact of imperfect CSI as follows.

1. Increase BER

As Md T s Fd, kincreases or γp, kdecreases, |ρ k | decreases, so c k decreases and p ̃ b, k , i in turn increases. In other words, system performance degrades when the Doppler shift is high, or when the SNR of pilot symbols is low.

2. Error floor

If CSI is perfect and γs, k→ ∞, then c k → ∞ and p ̃ b, k , i 0 . However, if CSI is imperfect and γs, k→ ∞, then c k c k Upper - bound 1 . 6 ρ k 2 2 q - 1 1 - ρ k 2 and p ̃ b, k , i 0 + 0 σ 1 0 σ N - 1 0 . 2 exp - - c k Upper - bound i = 1 N 1 σ i f ( σ 1 , σ 2 , , σ N ) d σ N d σ 2 d σ 1 , which means c k approaches an upper-bound and the BER thus exhibits an error floor when γs, kis high, further increases in γs, kgain nothing. This error floor worsens as |ρ k | decreases, i.e., as the channel estimation error or CSI delay of user k increases.

4. Simulation results

Consider an LTE TDD downlink MU-MIMO system where a BS with eight antennas transmits data to two users each equipped with two antennas. The channels are assumed to be time-varying, spatially uncorrelated, frequency flat, and Rayleigh fading. Jakes model is used to simulate the time-varying channels. The carrier frequency is 2.3 GHz. Symbol interval is 1/14 ms. TDD frame configuration 2 is used to transmit downlink data block and uplink pilots for downlink beamforming transmission are sent in the last symbol of the uplink subframe. As shown in Figure 1, the ratio of uplink subframes to downlink subframes is 1:3 in one equivalent block where the first 1 ms is for uplink and other 3 ms is for downlink, so the range of CSI delay Md is from 1 to 42 symbol intervals for the different downlink data symbol. For simplification, we make the following assumptions: (1) The noise covariance N0, kof each user is the same, which implies γs, kof each user is the same, (2) uplink pilot SNR γp, kof each user is the same, and is equal to the pre-processing SNR γs, kof downlink data symbol when the channel estimation error of CSI is considered, (3) no channel coding is considered. Owing to the assumptions above, we can ignore user index k for all related variables hereafter, e.g., replacing γs, kwith γs. MMSE channel estimation and ideal channel estimation are used by BS and each user, respectively. Simulation parameters, some of which are cited from [18, 19], are summarized in Table 1.

Table 1 Simulation parameters

According to the system configuration in Table 1, the average BER p ̃ b, k , i in (22) becomes into

p ̃ b, k , i c k , 4 , 8 0 + 0 σ 1 0 σ 3 0 . 2 exp - c k / i = 1 4 1 σ i f ( σ 1 , σ 2 , , σ 4 ) d σ 4 d σ 2 d σ 1 .
(23)

where

f ( σ 1 , σ 2 , , σ 4 ) = e - i = 1 4 σ i i = 1 N σ i 4 4 - i ! 8 - i ! i < j 4 σ i - σ j 2 .
(24)

The above integral can be evaluated with numerical calculation software, e.g., Matlab (2009a), Mathematica, etc.

Figures 3, 4, and 5 show the variation of system BER (averaged over the two users) with γs for 4QAM, 16QAM, and 64QAM, respectively. In each figure, the four typical cases are considered according to the four kinds of different relationships between CSI and downlink transmission channel:

Figure 3
figure 3

BER performance of MU-MIMO downlink system (4-QAM).

Figure 4
figure 4

BER performance of MU-MIMO downlink system (16-QAM).

Figure 5
figure 5

BER performance of MU-MIMO downlink system (64-QAM).

1. Without CSI delay and without estimation error: 0 km/h and γp = ∞

There are no CSI delay and no estimation error between CSI and downlink transmission channel. It is the perfect CSI case which is as the comparison baseline for other three cases.

2. Without CSI delay and with estimation error: 0 km/h and γp = γs

There is no CSI delay but exists estimation error between CSI and downlink transmission channel.

3. With CSI delay and without estimation error: 10 km/h and γp = ∞

There exists CSI delay but is no estimation error between CSI and downlink transmission channel.

4. With CSI delay and with estimation error: 10 km/h and γp = γs

There are both CSI delay and estimation error between CSI and downlink transmission channel.

One can see that the simulation curves match the analytical ones very well, demonstrating the correctness of our average BER expression. It is also observed that, the BER increases as the channel estimation error and/or CSI delay (or mobile velocity) increase(s), and an error floor is evident at high SNR for the cases with CSI delay, which agrees with our summary about imperfect CSI impact. Furthermore, one can find that channel estimation error causes array gain loss by comparing the curves of the same mobile velocity but with different pilot SNR while CSI delay causes diversity gain loss by comparing the curves of the same pilot SNR but with different mobile velocities. Moreover, CSI delay causes more performance degradation at high SNR than channel estimation error as the latter diminishes when the SNR is high.

Figure 6 illustrates the variation of system BER with delay Md in the case of 4QAM, 16QAM, and 64QAM. Here, γs and γp are fixed as 30 dB in order to ignore the impact of channel estimation error as much as possible, the range of Md comes from linear area 0 ≤ x ≤ 2 of J0(x), Fd is 5 Hz. One can see that BER shows the trend of increasing as Md increases and finally reaching error floor. The reason is that CSI becomes more and more imperfect as Md increases, so BER becomes more and more big; when Md is beyond a certain long delay value, CSI has already become saturated imperfect, so BER arrives at error floor.

Figure 6
figure 6

Relationship between BER and delay M d .

Figure 7 illustrates the variation of system BER with γp (reflecting channel estimation error) in the case of 4QAM, 16QAM, and 64QAM. Here, γs are fixed as 10 dB, and Md is fixed as 10 symbol intervals in order to ignore the impact of CSI delay as much as possible, Fd is 5 Hz. One can see that BER decreases as γp increases and finally arrives at error floor. The reason is that CSI become more and more perfect as γp increases; so BER becomes more and more small; when γp becomes high, channel estimation error diminishes and CSI delay dominates BER, so BER arrives at an error floor, which again agrees with our summary about imperfect CSI impact.

Figure 7
figure 7

Relationship between BER and γ p .

Figure 8 depicts the variation of system BER with the speed of user in the case of 4QAM, 16QAM, and 64QAM. Here, γs and γp are fixed as 30 dB in order to ignore the impact of channel estimation error as much as possible, Md is fixed as 10 symbol intervals, the speeds of both users are assumed as the same for simplification and change from 0 to 100 km/h. One can see that BER increases as the speed of user increases and finally reaches error floor. The reason is that Doppler frequency shift becomes more and more big as the speed of user increases, which causes that CSI becomes more and more imperfect, so BER becomes more and more big. When the speed of user is beyond a certain value, CSI has already become saturated imperfect, so BER arrives at error floor. Moreover, by comparing Figures 6 and 8, we can find that the impact of the speed of user on BER is similar to the impact of CSI delay Md on BER and the big speed value is equivalent to the big CSI delay value, which can be explained by (15). It should be pointed out that in practical communications systems, the CSI delay Md is generally fixed because of the selected frame structure in advance while the speed of user often changes.

Figure 8
figure 8

Relationship between BER and user speed.

Figure 9 depicts the variation of system BER with the number of user in the case of 4QAM, 16QAM, and 64QAM. Here, γs and γp are fixed as 30 dB, Md is fixed as 10 symbol intervals. For simplification, we make the following assumptions: (1) each user is equipped with one antenna, so the number of user K is equal to the total number of antennas of all users N; (2) the speed of each user is the same and corresponding Doppler frequency shift is 5 Hz; (3) the number of user changes from 1 to 8 because of the condition M  =  8 K = N = k = 1 K n k . One can see that BER increases as the number of user increases. The reason is that inter-user interferences inevitably exist because of the existences of estimation error and CSI delay between CSI and downlink transmission channel, and increase as the number of user increases, so the BER becomes more and more big.

Figure 9
figure 9

Relationship between BER and the number of user.

5. Conclusion

In this article, we have investigated the BER of TDD downlink MU-MIMO ZF systems in the presence of imperfect CSI. By exploiting the correlation between the actual channel and the estimated one as well as channel time-correlation, we have developed the novel BER expression for TDD downlink MU-MIMO systems with M-QAM-modulated signals. Furthermore, we find that CSI delay and channel estimation error degrade system performance and even cause error floor, among which channel estimation error causes array gain loss while CSI delay causes diversity gain loss. At high SNR, CSI delay causes more performance degradation than channel estimation error. Especially, our research is valuable for the design of the adaptive modulation and coding scheme as well as the optimization of MU-MIMO systems. Numerical simulations have verified our theoretical analysis.