1 Introduction

Multiple-input-multiple-output (MIMO) schemes are mandatory nowadays and for the next generation of wireless technologies because of the demand for higher capacity services and reliability in wireless communication system. Massive MIMO and multiuser MIMO techniques have widely attracted to achieve reliability and higher data rates performance in many standards of wireless transmission systems in recent years. Mode of gain in MIMO system is divided into two major categories: spatial diversity gain [1,2,3] and spatial multiplexing gain [4,5,6,7,8,9,10]. Spatial diversity enables link performance in wireless channels while spatial multiplexing scales up capacity by transporting number of streams in parallel through multiple antennas.

Linear detector algorithms are widely designed to utilize for their low complexity receiver as compared to maximum likelihood (ML) detection [6]. Some of the emerging wireless systems i.e., 3GPP Long Term Evolution (LTE), IEEE 802.11n and IEEE 802.16e prefer linear receivers. The minimum mean square error (MMSE) and/or zero forcing (ZF) linear detectors are too complex to be exploited in systems with large scale antennas. It is very feasible for MIMO with small scale antennas. However, the complexities overhead of ZF and MMSE detection depend on the size of data streams to be detected and order of modulations to be used. The scales up computational complexities of the ZF and/or MMSE detector in some of the aforementioned situations have aimed the design of several other schemes for the MIMO system [11]. The linear receiver [12], the successive interference cancellation (SIC) method, vertical- bell laboratories layered space time (V-BLAST) approach [13, 14] and decision feedback detection [15,16,17] are techniques that can present attractive trade-offs between complexity and performance.

Modern cellular communication standards, orthogonal frequency division multiplexing (OFDM) employed with MIMO techniques to achieve higher spectral efficiency. Moreover, one of the key issues of high peak to average power ratio (PAPR) property is in OFDM. Thus, single carrier frequency division multiplexing access (SC-FDMA) system exhibits significantly low PAPR than OFDM. The system adapts the low PAPR benefits in the mobile handsets of power amplifier, which increases the power back-off of mobile device equipped with small batteries. SC-FDMA system has been selected as a physical layer technology of LTE and LTE-advanced (LTE-A) standards [18,19,20]. The LTE-A is widely recognized as one of the strong candidates for power efficient technology in the Internet of things (IoT). There are two well known subcarrier assigning techniques in SC-FDMA, namely: localized FDMA (LFDMA) and interleaved (IFDMA) [18].

Spatial multiplexing SC-FDMA (SM SC-FDMA) scenario achieves excellent capacity gain by adding linearly with multiple number of transmit and receive antennas [4, 5, 7,8,9,10]. There are variety of receivers employed to adapt in MIMO SC-FDMA detection such as linear frequency domain ZF, MMSE, and ML receivers [4,5,6,7,8,9]. Even though, ML detection offers an optimal bit error rate (BER) performance the complexity increases in the system. The linearly combining of all the transmitted signals at each received antenna is raised in receiver computational overhead. Therefore, an efficient computational complexity reduction receiving technique is needed for SM SC-FDMA transceiver systems. In several practical applications MIMO SC-FDMA system, receiver complexity is a significant concern, and a receiver detection system with reasonable complexity is widely preferred. The linear receivers such as ZF and MMSE receiver have addressed this article.

MIMO SC-FDMA system with detection of user data by the MMSE and/or ZF filter is typically simple and efficient [2, 4, 7,8,9,10, 21, 22]. These approaches involve unaffordable inversion of matrix computation depends on equal number of subcarriers. Also it suffers scales up of complexity of receiver in these systems as large as a few thousands of subcarrier employed in realistic systems. The authors in [4, 10, 23] suggested the reduction of complexity MMSE detector where the interference matrix is approximated by a banded matrix with size of bandwidth (B), which looks for the elements of the matrix outside B are considered to have negligible impact. It is noticed that this approximation of the interference matrix has been applied to obtain the performance with low-computational complexity in OFDMA and SC-FDMA systems. This approximation results in critical performance degradation issues. Consequently, the system may not be suitable for practical use. Regularized complexity detection for MIMO SC-FDMA uplink scheme is investigated [4]. An iterative interference cancellation scheme based on the Neumann power series expansion is presented in [24, 25]. It is noted that this system requires additional modifications by converting the matrix inversion into a series of matrix multiplications and additions, which is able to alleviate the complexity issue. However, the Neumann power series based scheme suffers because of number of users increases. An iterative sequential MMSE detection approach is proposed to reduce the computational complexity for estimating data symbols in large scale MIMO systems [26]. However, its adjacent interference cancellation involves higher delay leading to suffer performance deterioration.

In addition, a low-complexity ZF and/or MMSE detection for OFDMA and/or SC-FDMA uplink system is proposed in [27,28,29] by employing a conjugate gradient (CG) method. It computes the ZF and MMSE detector output without being involved in an inverse computation. This approach has a much lower complexity compared to existing techniques and is simple and efficient, while maintaining the optimal result. It is noticed that CG technique is a fast estimation of ZF and MMSE solution and achieves better performance than the direct MMSE solution [16]. To the best of our knowledge, this work is the first to apply the CG algorithm based linear detection especially for SM SC-FDMA uplink system. This study proposes low-complexity ZF and MMSE detection for SM SC-FDMA uplink system. The views of the research are furnished bellow:

  1. 1.

    The new CG algorithm based ZF and MMSE detection for SM SC-FDMA is proposed whereas scheme [29] time domain windowing based on OFDMA system, consequently experienced some bandwidth loss.

  2. 2.

    The computational complexity of proposed scheme is quite different from the one proposed in [4], which performed only the regularized ZF based MIMO SC-FDMA system.

  3. 3.

    By implementing CG algorithm with ZF and MMSE criteria that estimates solution, without requiring matrix inversion operations, which will enable us to realize near optimal performance with low complexity detection. It has a benchmark of lower complexity linear detection for SM SC-FDMA system.

  4. 4.

    It can be utilized for a broad range of carrier frequency offsets (CFOs) and also applicable to both LFDMA and IFDMA system.

The remaining chapter is organized as follows. Section 2 describes transceiver configuration model for uplink SM SC-FDMA. Section 3 presents non-iterative linear detection. Section 4 presents implementation of proposed iterative linear detection. The low complexity analysis and comparisons are described in Sect. 5. Simulation results of the SM SC-FDMA detectors are reported in Sect. 6. Finally, Sect. 7 draws the conclusion. Upper and lower case letters represent frequency domain and time domain components, respectively. Superscripts, −1, T and * designate an inverse, transpose and conjugation operations, respectively. \(I_{p}\) represents identity matrix of P dimension. \(C\) indicates the set of complex number. The operators || || and diag{.} represent the Euclidean vector norm and diagonal matrix, respectively. The notation “\((.)_{ab}\)” and “\((.)^{a \times b}\)” represent dimension of \(a \times b\) matrix.

2 Uplink Spatial Multiplexing SC-FDMA Transmission Model

The uplink configuration of SM SC-FDMA transceiver is shown in Fig. 1. It consists of the base station (BS) with \(N_{r}\) receive antennas and user equipment (UE) with \(N_{t}\) transmit antennas. One of the MIMO schemes is spatial multiplexing that maximizes the data rate of wireless transmission scenarios. Each user’s data streams are mapped to a complex constellation symbols (e.g., QPSK or 16-QAM). SM SC-FDMA scheme converts complex mapped symbols into separate signal based on the size of transmitting antennas. The resulting complex sequence \(s\) is converted into frequency domain through an M-point DFT. The frequency domain symbol at the mth subcarrier for the kth (k = 1,…, K) user can be expressed as follows [30]:

Fig. 1
figure 1

The transreceiver stucture of SM SC-FDMA systems

$$X_{{}}^{k} (m) = \sum\limits_{n = 0}^{M - 1} {s_{{}}^{k} (n)\,} e^{{\frac{ - j2\pi nm}{M}}} \quad 0 \le m \le M - 1$$
(1)

The frequency domain symbols are then mapped to the subcarrier mapping methods. After performing an N-point inverse DFT (IDFT) to the subcarrier mapped sequence. The time-domain IDFT sequence is expressed as follows:

$$x_{q}^{k} (n) = \sum\limits_{m = 0}^{N - 1} {\tilde{X}_{q}^{k} (m)\,} e^{{\frac{j2\pi nm}{N}}} \quad 0 \le n \le N - 1,\quad q = 0,1, \ldots ,N_{t}$$
(2)

where \(\tilde{X}_{p}^{k} (m)\) is the mth sample at qth the antenna of the kth user after the assignment operation. Now,\(N_{t} = N_{r} = 2\) transmit and receive antennas are considered. Before transmission in each block, a cyclic prefix (CP), which is the last part of the block data of \(x_{{}}^{k} (n)\), is inserted in front of that block to eliminate the inter-block interference.

The assumption is that the receiver has precise synchronization, the received SM SC-FDMA symbol at the \(N_{r}\) receiving antennas, after discarding of CP, can be described as follows [10]:

$$y = \sum\limits_{k = 1}^{K} {\,\tilde{\Phi }^{k} \bar{h}^{k} } x^{k} + z$$
(3)

Then, (3) is expressed in matrix structure [4] as:

$$y = \left[ {\begin{array}{*{20}c} {y_{1} } & {y_{2} } \\ \end{array} } \right]^{T} ,\quad x = \left[ {\begin{array}{*{20}c} {x_{1} } & {x_{2} } \\ \end{array} } \right]^{T} ,\quad z = \left[ {\begin{array}{*{20}c} {z_{1} } & {z_{2} } \\ \end{array} } \right]^{T} \,{\text{and}}\;\bar{h} = \left[ {\begin{array}{*{20}c} {\bar{h}_{1\,1} } & {\bar{h}_{1\,2} } \\ {\bar{h}_{2\,1} } & {\bar{h}_{2\,2} } \\ \end{array} } \right]$$
(4)

where \(y_{p} \in C^{N \times 1}\), \(x_{q} \in C^{N \times 1}\) and \(z_{p} \in C^{N \times 1}\) denote the received symbol column vector of the pth receive antenna, the transmitted symbol column vector of the qth transmit antenna and the additive white Gaussian noise (AWGN) column vector of the pth receive antenna, respectively. \(\tilde{\Phi }_{k}\) denotes the diagonal CFO \(N \times N\) matrix modeled as \(\tilde{\Phi }_{k} = diag\left\{ {1,e^{{\frac{{j2\pi \varepsilon_{k} }}{N}}} ,\,\,.\,\,.\,\,.\,.\,,e^{{\frac{{j2\pi (N - 1)\varepsilon_{k} }}{N}}} } \right\}\). \(\varepsilon_{k}\) = \(\frac{{\Delta f_{k} }}{{f_{sub} }}\) is the normalized CFO of the kth user, \(\Delta f_{k}\) represents the CFO between the kth user and BS receiver and \(f_{sub}\) denotes subcarrier spacing and \(\bar{h}_{p\,\,q}^{k} \in C^{N \times N}\) denotes channel circulant matrix. It shows the MIMO channels between qth transmit and pth receive antenna of the kth user which are expressed as follows [7, 10, 23]:

$$\bar{h}_{p\,q}^{k} = \left[ {\begin{array}{*{20}c} {h_{p\,q}^{k} (0)} & 0 & \cdots & {} & {} & {h_{p\,q}^{k} (L - 1)} & {h_{p\,q}^{k} (1)} \\ {h_{p\,q}^{k} (1)} & {h_{p\,q}^{k} (0)} & \ddots & {} & {} & 0 & {h_{p\,q}^{k} (2)} \\ \vdots & {h_{p\,q}^{k} (1)} & \ddots & {} & {} & \ddots & \vdots \\ {h_{p\,q}^{k} (L - 1)} & \vdots & \ddots & {} & {} & \ddots & \ddots \\ 0 & {h_{p\,q}^{k} (L - 1)} & \cdots & {} & {} & \ddots & 0 \\ \vdots & \vdots & \vdots & {} & {} & \vdots & \ddots \\ 0 & 0 & \cdots & {} & {} & {h_{p\,q}^{k} (1)} & {h_{p\,q}^{k} (0)} \\ \end{array} } \right]$$
(5)

It is a frequency domain diagonal channel matrix, obtained through DFT computation of the circulant element of \(h_{p\,q}\), whose diagonal elements are channel frequency response; i.e., \(\bar{H}_{p\,q}^{k} (m) = diag\{ \sum\nolimits_{n = 0}^{L - 1} {h_{p\,q}^{k} (n)\,\,e^{{\frac{ - j2\pi nm}{N}}} } \}\)\(0 \le m \le N - 1\), where \(h_{p\,q} \in C^{L}\) denotes the column vector of channel impulse response from pth receive antenna to qth transmit antenna. The circulant matrix is effectively diagonalized by the way of the fast Fourier transform (FFT) matrix operator \(F_{N}\) with its inverse \(F_{N}^{H}\). It is formulated as follows [7]:

$$\bar{H}_{p\,q}^{k} (m) = F_{N} \tilde{\Lambda }_{p\,q}^{k} (m)F_{N}^{H}$$
(6)

where \(\tilde{\Lambda }_{p\,q}^{k}\) is an \(N \times N\) diagonal matrix whose elements consist of FFT of the circulant sequence of \(H_{p\,q}^{k}\).

The received frequency domain signal after DFT is

$$Y = \sum\limits_{k = 1}^{K} {\Phi^{k} \Lambda^{k} X^{k} \, + Z}$$
(7)

Then, (7) can be written in matrix form as [10]

$$Y = \left[ {\begin{array}{*{20}c} {Y_{1} } & {Y_{2} } \\ \end{array} } \right]^{T} ,\quad X = \left[ {\begin{array}{*{20}c} {X_{1} } & {X_{2} } \\ \end{array} } \right]^{T} ,\quad Z = \left[ {\begin{array}{*{20}c} {Z_{1} } & {Z_{2} } \\ \end{array} } \right]^{T} \,{\text{and}}\,\Lambda^{k} = \left[ {\begin{array}{*{20}c} {\tilde{\Lambda }_{11}^{k} } & {\tilde{\Lambda }_{12}^{k} } \\ {\tilde{\Lambda }_{21}^{k} } & {\tilde{\Lambda }_{22}^{k} } \\ \end{array} } \right]$$
(8)

where \(Y_{p} ,X_{q}\) and \(Z_{P}\) are the Fourier transforms of \(y_{p} ,x_{q}\) and \(z_{p}\),respectively. \(\Phi_{k}\) denotes the frequency component of circulant CFO matrix. An individual detection is carried out by each user from the receiver side.

In order to detect user data, the frequency domain detection and demapping are carried out as follows [8]:

$$\bar{r}^{u} = [A^{u} ]^{T} \prod^{u} Y$$
(9)

where \([A^{u} ]^{T}\) denotes M \(\times\) N demapping matrix of uth user. \(\Pi^{u}\) denotes the \(N \times N\) diagonal MMSE equalization frequency domain matrix of uth user i.e., \(\Pi^{u} = \,(H^{u} )^{H} [\,H^{u} (H^{u} )^{H} + \frac{{\sigma_{n}^{2} }}{{\sigma_{s}^{2} }}I_{N} ]^{ - 1}\) where \(\frac{{\sigma_{n}^{2} }}{{\sigma_{s}^{2} }}\) is variance of noise to signal ratio.

The received signal of uth (k = 1, … ,K) user’s after separation is modeled as follows [23]:

$$\tilde{r}^{u} = [A^{u} ]^{T} \Pi^{u} (\Phi_{u} H^{u} A^{u} F_{M} x^{u} + \sum\limits_{k = 1,k \ne u}^{K} {\Phi_{k} H^{k} A^{k} F_{M} } x^{k} + Z^{k} )$$
(10)

Further, the M-point IDFT, the uth user detection output can be expressed as:

$$\begin{aligned} \hat{x}^{u} & = F_{M}^{H} \tilde{r}^{u} \\ & = \underbrace {{F_{M}^{H} [A^{u} ]^{T} \Pi^{u} [\Phi_{u} ]_{1,1} H^{u} A^{u} F_{M} x^{u} }}_{I} + \underbrace {{F_{M}^{H} [A^{u} ]^{T} \Pi^{u} (\Phi_{u} - [\Phi_{u} ]_{1,1} \,I_{N} )H^{u} A^{u} F_{M} x^{u} }}_{II} \\ & \quad + \underbrace {{F_{M}^{H} [A^{u} ]^{T} \Pi^{u} \sum\limits_{k = 1,k \ne u}^{K} {\Phi_{k} H^{k} A^{k} F_{M} x^{k} } }}_{III}\, + \underbrace {{F_{M}^{H} [A^{u} ]^{T} \Pi^{u} Z}}_{IV} \\ \end{aligned}$$
(11)

From the Eq. (11), it is denoted that the Ist term is transmitted with uth user’s transmitted information symbol, the IInd term, inter-symbol-interference (ISI), the IIIrd term, multiple access interference (MAI) and the IVth term, AWGN, respectively. Further, from the receiver extract, the original transmitted information can be obtained through demodulation and detection operations.

3 Conventional Multiuser Detections

3.1 Non-iterative Linear Detection Systems

Firstly, ZF and MMSE criterions associate with non-iterative detection methods are investigated. After performing subcarrier demapping with multiuser interference cancellation, the frequency domain linear detection is formulated to equalize the received signal. The estimate of transmitted signal of nth subcarrier by ZF/MMSE equalization can be shown as [6, 7, 9, 12, 31]:

$$\hat{X}_{ZF} (n) = H^{H} (n)[H(n)H^{H} (n)]^{ - 1} Y$$
(12)

and perform frequency domain linear MMSE equalization,described as [8, 23, 32]:

$$\hat{X}_{MMSE} (n) = H^{H} (n)\left[H(n)H^{H} (n) + \frac{{\sigma_{n}^{2} }}{{\sigma_{s}^{2} }}I_{N} \right]^{ - 1} Y$$
(13)

To reduce interference between subcarriers linear equalization can be employed. However, the interference cancellations based on ZF and MMSE approach perform an N × N matrix inversion when N becomes high in many applications. Thus, high computational difficulty occurs. To elevate the complexity crisis, an iterative procedure for CG approach to obtain the result of the linear equalizer coefficients without any matrix inversion is involved. The frequency domain equalized signal,\(\hat{X}\), is transformed to time component equivalent via M-point IDFT. Finally, as per the decoding procedure the transmitted signal is retrieved.

3.2 MIMO SC-FDMA System with Combined Regularized Linear Detection

The regularized [4, 10] linear ZF receiver overcomes the direct matrix inverse, utilized in the existing linear detection. It carries out through in two step procedure, the first step is to detect received signal with low complexity by implementing the banded structure of the interference matrix and a regularization sequence is appended in the equalization process to reduce the noise amplification in the step second.

  1. (i)

    In this step MIMO signals for first user data is estranged, inter-antenna-interference is minimized by applying the matrix \(w\) and expressed as follows [4, 10]:

$$\hat{x} = w_{1}^{k} \tilde{r}$$
(14)

where \(\tilde{r}\) is obtained by expression (10).\(w_{1}\) can be described as [4, 10]:

$$w_{1}^{k} = \left[ {\begin{array}{*{20}c} {I_{N} } & { - \Delta_{2}^{k} } \\ { - \Delta_{1}^{k} } & {I_{N} } \\ \end{array} } \right]$$

where \(\Delta_{1}^{k} = \Phi_{21}^{k} \left( {\Phi_{11}^{k} } \right)^{ - 1}\) and \(\Delta_{2}^{k} = \Phi_{12}^{k} \left( {\Phi_{22}^{k} } \right)^{ - 1}\)

  1. (ii)

    In this step ISI and MAI are minimized by implementing matrix \(w_{2}^{k}\) as [4, 10] by

$$\begin{aligned} \hat{x} & = w_{1}^{k} w_{2}^{k} \tilde{r}_{d}^{k} = w_{2}^{k} (w_{1}^{k} \tilde{r}_{d}^{k} \Phi^{k} )X^{k} + w_{1}^{k} w_{2}^{k} Z \\ & = w_{2}^{k} \left[ {\begin{array}{*{20}c} {\Theta_{1}^{k} } & {0_{N \times N} } \\ {0_{N \times N} } & {\Theta_{2}^{k} } \\ \end{array} } \right]X^{k} + w_{2}^{k} w_{1}^{k} Z \\ \end{aligned}$$
(15)

where \(w_{1}\) and \(w_{2}\) are the inter-antenna interference and inter symbol interference, respectively. \(\Theta_{1}^{k} = \Phi_{11}^{k} - \Delta_{2}^{k} \Phi_{21}^{k}\) and \(\Theta_{2}^{k} = \Phi_{22}^{k} - \Delta_{1}^{k} \Phi_{12}^{k}\). It is noted that \(w_{2}^{k}\) can be constructed as follows [4, 10]:

$$w_{2}^{k} = \left[ {\begin{array}{*{20}c} {(\Theta_{1}^{k} )^{ - 1} } & {0_{N \times N} } \\ {0_{N \times N} } & {(\Theta_{2}^{k} )^{ - 1} } \\ \end{array} } \right]$$
(16)

To reduce complexity problems in ZF scheme, the authors proposed as α is a regularization constraint, which adapts to elevate the noise amplification and the MAI problems. Note down that α = 0 in the ZF equalizer in (16). It is significant to notice that the matrices \(\Phi_{11}^{k}\),\(\Phi_{12}^{k}\),\(\Phi_{21}^{k}\) and \(\Phi_{22}^{k}\) can be approximated by banded matrices. The MMSE detection output for (15) can be derived as [4, 10]:

$$w^{k} = \left[ {\begin{array}{*{20}c} {(H_{1}^{k\,H} H_{1}^{k} + \alpha I_{N} )^{ - 1} H_{1}^{k\,H} } & {0_{N \times N} } \\ {0{}_{N \times N}} & {(H_{2}^{k\,H} H_{2}^{k} + \alpha I_{N} )^{ - 1} H_{2}^{k\,H} } \\ \end{array} } \right]$$
(17)

The optimal rate of \(\alpha\) is related to signals -to- noise ratio (SNR).

3.3 A Sub-optimal ML Estimation for MIMO SC-FDMA

Following [6], the ML detection generates effective channel response between the transmitted and received signals. Before the Euclidean distance metric of each symbol is evaluated, the decision line distance metric is applied to receive signal by the selective ML post detection processing scheme. Consequently, MMSE detection process of each received symbol is introduced to ML criterion. Then, it performs the removal of multiple transmits streams interference of the received signal. The overall received signal can be described [6] as:

$$y = H_{eff} x + z.$$
(18)

This scheme can be performed as MMSE

$$\hat{x} = [H_{eff}^{H} H_{eff} + SNR\,I_{N} ]^{ - 1} H_{eff}^{H} y$$
(19)

where \(H_{eff}\) is the effective ML channel estimation.

To improve MMSE system performance, a closer of unreliable symbols are carried out of the \(MN_{q}\) estimated MMSE symbols in \(\hat{x}\). The error probability of the ith symbol can be expressed as in Euclidean distance metric

$$\gamma_{i} = \begin{array}{*{20}c} {\hbox{min} } \\ {x_{i}^{(j)} \in \{ \hat{x}_{i} \}^{C} } \\ \end{array} \left\| {y - Hx_{i}^{(j)} } \right\|^{2}$$
(20)

where \(\{ x_{i} \}^{C}\) is a subset of symbol constellation.

The reliabilities of symbols \(\hat{x}_{v}\),\(v = 1,2, \ldots ,MN_{t}\) can be estimated as:

$$\gamma_{{\hat{x},v}} = \begin{array}{*{20}c} {\hbox{min} } \\ {\{ x \in \{ x_{i} \}^{C} \} - \{ \hat{x}_{v} \} } \\ \end{array} \left\| {y - \sum\limits_{l \ne v} {h_{eff,l} \hat{x}_{v} - h_{eff,v} x} } \right\|^{2}$$
(21)

where \(h_{eff,l}\) is the \(l{\rm th}\) column of \(H_{eff}\). A new set of unacceptable symbols is then identified to further enhance performance of the system when the ML estimated symbols are combined into the computation of reliabilities. This technique is restricted, mainly due to imperfect interference cancellation. ML detection is achieved better performance but high computational complexity when the modulation order grows or number of antennas increases.

3.4 An Iterative Sequential Detection

An iterative linear sequential detection scheme is presented to reduce complexity for large scale MIMO systems [26]. The detected solution for ZF algorithm based MIMO detection can be expressed [26] as

$$\hat{X}_{ZF} = Q[(H^{H} H)^{ - 1} H^{H} Y] = Q[a^{ - 1} b]$$
(22)

where Q[.] is the quantizer, \(b = H^{H} Y\) and \(a^{ - 1} = H^{H} H\) is the Gram matrix. The estimated solution for MMSE algorithm based MIMO detection can be described [26] as

$$\hat{X}_{MMSE} = Q[(H^{H} H + SNR\,I)^{ - 1} H^{H} Y] = Q\left[ {\begin{array}{*{20}c} {a^{ - 1} } & b \\ \end{array} } \right]$$
(23)

where Q[.] is the quantizer, \(b = H^{H} Y\) and \(a^{ - 1} = G + SNR\,I\) where \(G = H^{H} H\) is the Gram matrix. The corresponding symbol of each user is estimated and updated sequential fashion while nulling the interference from all the other users at every iteration process. For the removal of interference from all other users, the estimated signal of ith user can be expressed [26] as

$$\hat{y} = y - \sum\limits_{k = 1,k \ne i}^{K} {h_{k} } \mu_{t} (k)$$
(24)

To compute the symbol related to the ith user, tth iterative process the sequence \(\hat{y}\) as

$$\mu_{t + 1} (i) = \frac{{h_{i}^{T} }}{{\left\| {h_{i} } \right\|^{2} }}\hat{y} = \frac{1}{{\left\| {h_{i} } \right\|^{2} }}(h_{i}^{T} y - h_{i}^{T} \sum\limits_{k = 1,k \ne i}^{K} {h_{k} \mu_{t} (k)} )$$
(25)
$$\mu_{t + 1} (i) = \frac{1}{{\left\| {h_{i} } \right\|^{2} }}(h_{i}^{T} y - \sum\limits_{k = 1}^{K} {h_{i}^{T} h_{k} \mu_{t} (k)} + \frac{{h_{i}^{T} h_{i} }}{{\left\| {h_{i} } \right\|^{2} }}\mu_{t} (i)$$
(26)

where \(h_{i}\) is the ith column of the matrix H, equation (b) can be modified as

$$\mu_{t + 1} (i) = \mu_{t} (i) + \frac{1}{G(i,i)}(b(i) - \sum\limits_{k = 1}^{K} {G(i,k)} \,\mu_{t} (k))$$
(27)

where G(i, k) is the (i, k)th element of matrix G, b(i) is the ith element of the vector b. Linear iterative sequential detection approach achieved better performance. However, its adjacent interference cancellation consumes higher delay leading to affect performance degradation.

4 Implementation of Proposed Iterative Linear Detection

An efficient implementation of low complexity algorithm based on linear detection method is presented for SM SC-FDMA system. The linear system equations with positive definite and Hermitian coefficient matrices are suitable to implement CG algorithm [33].

This idea was first suggested in [28, 29], where authors low complexity ZF and MMSE solution obtain through the unique arrangement of the interference matrix while the similar structure (with few slight modifications) realized great interest to this article as well [34]. To estimate various users transmitted symbols associate CG method with ZF and MMSE principle, CG formulate to obtain a solution of the term (12) is suggested [28, 29] and modified as:

$$Q\hat{X}_{ZF} = d$$
(28)

where \(Q = H^{H} H\) and \(d = H^{H} Y\). MMSE criterion implementing with CG algorithm is to evaluate a solution of expression (13), is formulated as:

$$Q\hat{X}_{MMSE} = d$$
(29)

where \(Q = H^{H} H + \frac{{\sigma_{n}^{2} }}{{\sigma_{s}^{2} }}I_{N}\) and \(d = H^{H} Y\). An algorithm I describes the Eqs. (28) and (29) in terms of CG that evaluates the linear detection solution.\(\delta\) denotes the tolerance of the output, subscript l denotes the iteration number. Residual vectors are considered as \(\gamma_{(l)}\) and \(\rho_{(l)}\), which lead search direction in the lth direction. The preconditioning matrix is formulated as W, which is assumed as identity matrix, this is introduced to speed up the convergence of the CG operation. The CG scheme may require number of iterations (Nit) to obtain an exact solution [34]. The spreading of the eigenvalue underlying matrix decides required number of iterations.

figure e

5 Complexity Analysis

The new scheme carries out numerous matrix computations for instance, matrix inverse, addition, division and multiplications. It adapts CG algorithm to enable convergence fast to estimate MMSE and ZF solution iteratively. To alleviate the cause of inter carrier interference (ICI), a direct practice (expressions 12 and 13) based on the ZF and MMSE detection, which involves \(O(N^{3} )\) computational to invert a size of N ICI matrix for each user. The ML detection compares the received signal vector of size \(N \times 1\) with all the \(2^{vN}\) users, requiring in a complexity of \(O(N \times 2^{vN} )\), where ν is the size of constellation, which is high as compared to existing approaches. But, it is not suitable for practical scenario. In practical SC-FDMA schemes specifically 3GPP LTE, which requires number of subcarrier from 128 to 2048 [19]. A computationally attractive design of CG method is utilized to drastically reduce the receiver overheads and to sustain significant performance. A CG algorithm is applicable to calculate the effective solution in this article as a benchmark. Table 1 Summarizes complexity comparison of the proposed CG method and other existing scenarios. In this scenario, K, N,\(N{}_{r}\), B and Nit assert number of user, number of subcarriers, number of transmit antenna, bandwidth of banded matrix and number of iterations, respectively.

Table 1 Computational complexity comparison for different detection systems

The main objective of CG algorithm is to carry out the solution for an inverse matrix computation. An amount of evaluation overheads for computing Q−1 and \(\hat{X} = Q^{ - 1} d\) require \(O(NK^{3} )\) and O (\(N^{2}\)) operations. Hence, direct ZF (expression 12) resulting complexity is \(O(NK^{2} )\) and direct MMSE (expression 13) is evaluating the order of \(O(N(N + K^{2} ))\). The complex computations of the frequency and domain conversions are estimated by well-known FFT/IFFT algorithms. An N-point FFT/IFFT involves (N/2) \(\frac{N}{2}\log_{2} N\) complex multiplication. The complexity of the new system is analyzed and contrasted with direct ZF and MMSE, regularized linear detection [4] and sub-optimal ML receiver [6] are reported in Fig. 2. The complexity of CG algorithm under linear detections scheme is much superior to the conventional counterparts. Also the proposed ZF computation complexity result is much lower than that of proposed MMSE complexity solution.

Fig. 2
figure 2

Complexity evaluations between proposed and different detection schemes

6 Simulation Results

The proposed linear detection’s system performance is evaluated through computer simulations and performance is compared with some recent existing approaches. This study verifies both localized and interleaved subcarriers allocations scenario for uplink SM SC-FDMA transceiver system. During simulations, performance of the proposed CG algorithm is examined through Monte Carlo process.

The Extended vehicular A channel is tested by the fading channel model with uniform power delay profile having 9 taps. The simulation parameters are followed as in the 3GPP LTE technical specifications release 10 [35, 36]. In the uplink SM SC-FDMA scenario, \(2 \times 2\) MIMO antenna model is employed between the UE and BS. Modulation scheme of 16-QAM and QPSK are used to verify effectiveness of the proposed approach for simulations. The system is considered to have N = 1024 subcarriers, which assume bandwidth of 20 MHz.The number of users in this scenario is K = 8 and number of subcarriers per user is M = 128. CP is 1/8 of symbol duration. The CFO of users is presented at random in the range [− 0.4, 0.4]. Fading coefficients for each path are employed as independent and identically distributed Gaussian coefficients. Consequently, a rate of 1/3 convolutional code is used to encode information bits with a constraint length of 7, and the 32 iteration equalization results are revealed.

Figure 3 demonstrates the BER performance of the proposed iterative ZF and MMSE detector and different detectors for localized subcarrier mapping SM SC-FDMA system with QPSK modulation technique. It reveals that the proposed ZF detection outperforms existing ZF receivers about 1 dB and the proposed MMSE detection outperforms conventional MMSE approaches about 1.5 dB at 10−3 BER. Figure 4 presents the BER performance of the proposed ZF and MMSE detection and some recent receivers for localized subcarrier allocation SM SC-FDMA scheme with the 16-QAM modulation technique. Particularly, while BER is at 10−3, the proposed ZF detector superior performance to existing ZF receivers about 2 dB with 16-QAM and the proposed MMSE detector’s better performance than conventional MMSE approaches about 1.5 dB with 16-QAM. From these results, the systems realize both MMSE and ZF receiver with only 32 iteration (Nit = 32) converges the detection result.

Fig. 3
figure 3

BER performance for uplink SM SC-FDMA scheme with proposed and conventional detection scheme

Fig. 4
figure 4

BER performance for uplink SM SC-FDMA scheme with proposed and conventional detection scheme

Figure 5 shows the BER performance of the proposed multiuser ZF and MMSE receiver for interleaved subcarrier mapping SM SC-FDMA scenario. In QPSK modulation, the proposed ZF detection outperforms the existing ZF receivers about 2 dB and the proposed MMSE detection outperforms the conventional MMSE approaches about 2 dB at the BER of 10−3. Finally, the performance of the proposed ZF and MMSE detector for interleaved subcarrier mapping with the 16-QAM modulation technique is shown in Fig. 6. This clearly shows that the proposed ZF detection outperforms the existing ZF detections about 1 dB and the proposed MMSE detection outperforms the conventional MMSE approaches about 2 dB at the BER of 10−3. From these results, the systems realize both MMSE and ZF detection under iterative CG algorithm. It converges the detection result at Nit = 32. It is noticed that the proposed system superior to conventional detection even at low complexity. It is revealed that the proposed iterative CG based linear detection method is well suited to the SM SC-FDMA system.

Fig. 5
figure 5

BER performance for uplink SM SC-FDMA scheme with proposed and conventional detection scheme

Fig. 6
figure 6

BER performance for uplink SM SC-FDMA scheme with proposed and conventional detection scheme

7 Conclusion

In this article, low-complexity signal detection scheme for the uplink SM SC-FDMA transceiver is investigated. The new CG algorithm is an attractive design of both ZF and MMSE detection scenarios when employed uplink communication at the base station, the linear filter coefficients are carried out, it notably reduced the computational complexity of the detection system. Since there is no complicated complex matrix inversion of calculation and solving equations in proposed method, while sustaining low complexity. Both ZF and MMSE equalizations are thoroughly verified various scenarios and a number of solutions are generated. The proposed scenario utilized different subcarrier allocation to users which achieve better performance than existing approaches. Simulation results for uplink SM SC-FDMA shows the effectiveness of the proposed detection approach. Finally, the complexity of the new approach is far less than direct MMSE and ZF equalization. New scheme provides much superior performance at a reduced complexity than the conventional approaches. It is also clear that this scheme performance is shown to be closer to MFB only serve few iterations of the detection.