1 Introduction

Multicarrier index keying—orthogonal frequency division multiplexing (MCIK-OFDM) or the so-called OFDM with index modulation (OFDM-IM) is an emerging multicarrier scheme [1,2,3], which can offer higher energy efficiency and reliability over conventional OFDM. In MCIK-OFDM, a subset of subcarriers are active to carry data bits through both the conventional M-ary symbols and the indices of active subcarriers. Hence, MCIK-OFDM provides a promising trade-off between spectral efficiency (SE) and reliability compared to OFDM just by varying the number of active sub-carriers.

Recently, various MCIK or IM concepts have been proposed for OFDM, which can be found in the overview [4]. Particularly, the IM concept was first applied to OFDM-based multicarrier modulation in [1], and its enhanced version was proposed in [2], while its generalized version which independently applies the IM to different subcarrier groups was developed in [3]. For the performance analysis, in [5], a tight bound on the bit error rate (BER) of OFDM-IM using the maximum likelihood (ML) detection was derived. The MCIK concept was applied to multiple input multiple output (MIMO) systems in [6]. In [7], the generalized MCIK scheme with a variable number of active subcarriers was proposed. In [8], coordinate interleaving OFDM-IM was proposed to improve the diversity order. Also inspired by the MCIK concept, code index modulation (CIM) as well as its generalized version were studied in [9, 10]. Aiming to enhance the error performance of MCIK-OFDM, several transmit diversity schemes are reported in [11,12,13,14], in which the repetition code for either the index or M-ary symbol was used in [11,12,13], while the spreading code was used in [14]. Meanwhile, there are a number of studies in [15,16,17] that focus on improving the SE of MCIK-OFDM, where the IM-based transmitters are designed to increase the number of either index or M-ary bits. Recently, deep neural networks (DNNs) have been applied to the MCIK signal detection in [18, 19], which can provide a near-optimal performance at low runtime complexity. Additionally, the use of a DNN structure called autoencoder for jointly optimizing both the transmitter and receiver of multicarrier systems was reported in [20,21,22,23], where the resulting learning-based systems can even achieve better error performance than IM-based multicarrier systems. Finally, the IM technique was applied to visible light communications for improving the BER performance in [24].

Table 1 Contribution Comparison of MCIK-OFDM Performance Analysis

Most of the aforementioned papers consider the ML or log-likelihood ratio (LLR) detector for MCIK-OFDM, which still has a significantly higher complexity than the classical OFDM. In [28], a low-complexity greedy detector (GD) was developed, which utilizes the energy detection method to estimate the active indices before decoding the M-ary symbols conveyed on these active sub-carriers. The outage probabilities and the pair-wise error probability of the GD under generalized fading were analyzed in [29] and [30], respectively. The symbol error probability (SEP) and BER of the GD in the presence of channel state information (CSI) imperfection were investigated in [25, 26], which reveal that the GD detector is less sensitive to imperfect CSI than its ML counterpart. In order to further improve the diversity gain of GD, MCIK-OFDM with hybrid GD and diversity receptions, namely selection combining (SC) and maximal ratio combining (MRC), was proposed in [27] to examine the SEP, however only for the perfect CSI case. Moreover, [27] fails to provide an analytical comparison between the MRC/SC-based GD and ML detectors, and its theoretical results are not tight, even at high signal-to-noise ratios (SNRs). Hence, this work is unable to provide a theoretical guideline of selecting a suitable detection method, particularly under different CSI uncertainties. Meanwhile, the GD shown in [25, 26] is more effective in practical systems with imperfect CSI. Therefore, it is worth investigating the performance of MCIK-OFDM with such low-complexity MRC/SC-based GD receivers under practical CSI uncertainty, and compare with its ML counterpart. In addition, the performance analysis of MCIK-OFDM using both MRC/SC and ML detection has been overlooked in the literature.

To address the aforementioned issues, in this paper, we first analyze and compare the BERs of MCIK-OFDM with the SC-based multiple-antenna receivers called MCIK-OFDM-SC, employing both ML and GD detectors, over uncertain CSI. In particular, the main contributions of this work compared with the existing works are listed in Table 1, and are summarized as follows:

  • We propose a generalized framework for deriving the BERs of both the GD and the ML receivers for MCIK-OFDM, where the BER is represented as a linear combination of the SEP and index error probability (IEP) of the classical M-ary data symbols.

  • Utilizing this proposed framework, tight, closed-form expressions for the BERs of MCIK-OFDM-SC employing both the GD and ML detectors are derived in presence of various CSI conditions, namely perfect CSI, and fixed or variable CSI uncertainties.

  • Based on the derived expressions, asymptotic results are demonstrated to further investigate effects of different CSI uncertainties on the BERs of the two detectors. More importantly, we asymptotically develop conditions under which using the GD instead of the ML is desired for MCIK-OFDM-SC under each CSI condition, particularly when the number of antennas at the receiver increases.

  • Simulation results are provided to validate the derived expressions, as well as theoretical guidelines for selecting detection type for each CSI condition. Unlike [27], our theoretical results are tight in a wide range of SNRs.

The rest of our paper is as follows. Section 2 describes MCIK-OFDM-SC and its signal detection under uncertain CSI. In Sect. 3 analyzes the BERs of both ML and GD, followed by asymptotic analysis in Sect. 4. Simulation results are performed in Sect. 5. Section 6 concludes our paper.

Notation: Lower-case bold and Upper-case bold letters and are used for vectors and matrices, respectively. \(C\left( ,\right)\) and \((.)^{T}\) denotes the binomial coefficient and transpose operation, respectively. The floor function is represented by \(\left\lfloor .\right\rfloor\). \(\mathcal{C}\mathcal{N}\left( 0,\sigma ^{2}\right)\) stands for the complex Gaussian distribution with zero mean and variance \(\sigma ^{2}\). \({\mathbb {E}}\left\{ .\right\}\) and \({\mathcal {M}}\left( .\right)\) present the expectation operator and the moment generating function (MGF), respectively.

2 System model

2.1 MCIK-OFDM-SC

Consider a uplink single-input multi-output (SIMO) MCIK-OFDM scheme with \(N_{c}=NG\) sub-carriers that are divided into G clusters with N sub-carriers per cluster. The transmitter employs a single antennas while the receiver uses L antennas. At the receiver, the SC technique is employed to combine signals received from L branches. Then, the output of the SC is used to estimate transmitted data bits using either the ML or the GD [27]. The resulting scheme is called as MCIK-OFDM-SC. Since each cluster independently operates the MCIK-OFDM technique, for simplicity and without loss of generality, hereinafter we address the problem of only one cluster, whose block diagram is illustrated in Fig. 1. Here, the role of OFDM framework is to make sub-carriers orthogonal to each other so that we can independently apply the MCIK concept to each cluster, reducing the transceiver complexity.

Fig. 1
figure 1

The block diagram of MCIK-OFDM-SC

In every MCIK-OFDM transmission per cluster, only K out of N sub-carriers are activated to carry information bits with K complex M-ary symbols, while additional data bits are delivered by the indices of active sub-carriers. More specifically, p incoming bits are partitioned into two streams (\(p=p_{1}+p_{2}\)) at the transmitter. Utilizing combinatorial method or look-up table (LUT) [3], the first \(p_{1}\) bits are mapped to a pattern of K active sub-carriers. Denote by \(\theta =\left\{ \alpha _{1},\ldots ,\alpha _{K}\right\}\) the set of K active sub-carrier indices, where \(\alpha _{k}\in \left\{ 1,\ldots ,N\right\}\) for \(k=1,\ldots ,K\). Note that \(\theta\) can be referred to as an index symbol, which is identified by \(p_{1}\) index bits. The remaining \(p_{2}\) bits are mapped to K M-ary symbols. For given NK and M, the number of index bits and symbol bits are given by \(p_{1}=\left\lfloor \log _{2}C\left( N,K\right) \right\rfloor\) and \(p_{2}=K\log _{2}M\), respectively. Denote by \({\mathcal {S}}\) the M-ary constellation. Using \(\theta\) and K non-zero symbols (determined by p incoming bits), the transmitted signal for each cluster is given as \({\textbf{x}}=\left[ x\left( 1\right) ,\ldots ,x\left( N\right) \right] ^{T},\) where \(x\left( \alpha \right) =0\) for \(\alpha \notin \theta\) and \(x\left( \alpha \right) \in {\mathcal {S}}\) for \(\alpha \in \theta\). Here, note that the K non-zero data symbols conveyed on active sub-carriers are denoted as vector \({\textbf{s}}\) in Fig. 1. The frequency domain signal \({\textbf{x}}\) is then processed by the inverse fast Fourier transform (IFFT) before being transmitted to the receiver.

The received signal at the l-th antenna in the frequency domain, i.e., the signal obtained after the FFT, is given by

$$\begin{aligned} {\textbf{y}}_{l}={\textbf{H}}_{l}{\textbf{x}}+{\textbf{n}}_{l}, \end{aligned}$$
(1)

where \({\textbf{H}}_{l}=\text {diag}\left\{ h_{l}\left( 1\right) ,\ldots ,h_{l}\left( N\right) \right\}\) is the channel matrix between the transmitter and the l-th receiver antenna, while \({\textbf{n}}_{l}=\left[ n_{l}\left( 1\right) ,\ldots ,n_{l}\left( N\right) \right] ^{T}\) is the noise vector with \(n_{l}\left( \alpha \right) \sim \mathcal{C}\mathcal{N}\left( 0,N_{0}\right)\), for \(\alpha =1,\ldots ,N\) and \(l=1,\ldots ,L\). Particularly, \(h_{l}\left( \alpha \right)\) represents the Rayleigh fading channel, which is identical and independent to each other, where \(h_{l}\left( \alpha \right) \sim \mathcal{C}\mathcal{N}\left( 0,1\right)\). Here, we assume that the cyclic prefix inserted to each OFDM symbol in the time domain is large enough to completely combat the inter-symbol interference [3]. As such, the average SNR per active sub-carrier is given by \({\bar{\gamma }}=\varphi E_{s}/N_{0},\) where \(E_{s}\) denotes the average power per non-zero M-ary symbol and \(\varphi =N/K\) is the power allocation ratio.

2.2 Post-combining detection under CSI uncertainty

We consider a practical MCIK-OFDM-SC system where the receiver imperfectly knows the CSI. Particularly, denote by \({\hat{h}}_{l}\left( \alpha \right)\) the estimate of the true channel \(h_{l}\left( \alpha \right)\), and we have

$$\begin{aligned} {\hat{h}}_{l}\left( \alpha \right) =h_{l}\left( \alpha \right) -e_{l}\left( \alpha \right) , \end{aligned}$$
(2)

where \(e_{l}\left( \alpha \right)\) represents the channel estimation error as being independent of \({\hat{h}}_{l}\left( \alpha \right)\) and \(e_{l}\left( \alpha \right) \sim \mathcal{C}\mathcal{N}\left( 0,\epsilon ^{2}\right)\), and \({\hat{h}}_{l}\left( \alpha \right) \sim \mathcal{C}\mathcal{N}\left( 0,1-\epsilon ^{2}\right) ,\) where \(\epsilon ^{2}\in \left[ 0,1\right)\) denotes the error variance.

For sub-carrier \(\alpha\), the \(l^{*}\)-th branch is selected as the output of the SC such that \(l^{*}=\arg \max _{l}\left| {\hat{h}}_{l}\left( \alpha \right) \right| ^{2}\). Hence, the output signal of the SC can be given by

$$\begin{aligned} {\textbf{y}}={\textbf{H}}{\textbf{x}}+{\textbf{n}}, \end{aligned}$$
(3)

where \({\textbf{H}}=\text {diag}\left\{ h\left( 1\right) ,\ldots ,h\left( N\right) \right\}\) denotes the channel matrix of the SC and the corresponding noise vector is \({\textbf{n}}=\left[ n\left( 1\right) ,\ldots ,n\left( N\right) \right] ^{T}\), where \(h\left( \alpha \right) =h_{l^{*}}\left( \alpha \right)\) and \(n\left( \alpha \right) =n_{l^{*}}\left( \alpha \right)\) for \(\alpha =1,\ldots ,N\). Notice in (3) that \({\textbf{y}}=\left[ y\left( 1\right) ,\ldots ,y\left( N\right) \right] ^{T},\) with \(y\left( \alpha \right) =h\left( \alpha \right) x\left( \alpha \right) +n\left( \alpha \right)\).

Let \({\hat{h}}\left( \alpha \right)\) be the estimate of \(h\left( \alpha \right)\), i.e., \({\hat{h}}\left( \alpha \right) ={\hat{h}}_{l^{*}}\left( \alpha \right)\). Based on \({\textbf{y}}\) and \({\hat{h}}\left( \alpha \right) ,\) either the ML or the GD can be employed for the signal detection as follows.

2.2.1 Post-combining ML

Under imperfect CSI, the estimated signal \(\hat{{\textbf{x}}}\) is calculated by the ML criterion as

$$\begin{aligned} \hat{{\textbf{x}}}=\arg \min _{{\textbf{x}}}\left\| {\textbf{y}}-{\hat{\textbf{H}}}\textbf{x}\right\| ^{2}, \end{aligned}$$

where \(\hat{{\textbf{H}}}=\text {diag}\left\{ {\hat{h}}\left( 1\right) ,\ldots ,{\hat{h}}\left( N\right) \right\}\) denotes the estimate of the channel matrix after the SC. Utilizing \(\hat{{\textbf{x}}}\), the index symbol \({\hat{\theta }}\) and K non-zero symbols \(x\left( \alpha \right)\) with \(\alpha \in {\hat{\theta }}\) are recovered.

2.2.2 Post-combining GD

Post-combining GD makes best antenna selections per sub-carrier before GD processing. For given \({\textbf{H}}\), the GD detects signals through two following steps. Firstly, the active indices are estimated by K sub-carriers that have the largest SC-output energies, i.e., \(\left| y\left( \alpha \right) \right| ^{2}.\) Secondly, the non-zero M-ary symbols are detected by applying the ML decision to activated sub-carrier \(\alpha\) as

$$\begin{aligned} x\left( \alpha \right) =\arg \min _{x\left( \alpha \right) \in {\mathcal {S}}}\left| y\left( \alpha \right) -{\hat{h}}\left( \alpha \right) \times \left( \alpha \right) \right| ^{2}. \end{aligned}$$
(4)

Note that the GD detector has not only lower complexity, but also less sensitivity to CSI imperfection, than the ML detector [26]. However, when the number of antennas is limited to one, the ML still perform much better than GD under certain CSI conditions, especially when M is small (e.g., \(M=2,4\)) [25].

As a result, we are prompted to examine the BER performance of both the GD and the ML in MCIK-OFDM-SC, in order to understand if the post-combining GD receiver benefits from diversity gain. For this, we intend to derive the closed-form expressions for the BERs of the two detectors, taking CSI uncertainty into consideration in the next section.

3 BER analysis with CSI uncertainty

We note that the ML performs the same performance as the log-likelihood ratio (LLR) detector [12] which also has two separate steps as the GD. Thus, we now introduce a generalized framework to derive the BERs of both the ML and the GD. Particularly, we consider bit error event consisting of two parts: the index bit error (\(p_{1}\) bits) and the symbol bit error (\(p_{2}\) bits). Let \(P_{1}\) be the index BER (IBER) and \(P_{2}\) be the symbol BER (SBER). Then, the BER of either the ML or the GD is given by

$$\begin{aligned} P_{b} = \frac{p_{1}P_{1}+p_{2}P_{2}}{p_{1}+p_{2}}. \end{aligned}$$
(5)

The IBER and the SBER are obtained by [26]

$$\begin{aligned} P_{1}\approx \eta {\overline{P}}_{I}/2, \end{aligned}$$
(6)
$$\begin{aligned} P_{2}\le \frac{{\overline{P}}_{I}}{2K}+\frac{{\overline{P}}_{M}}{\log _{2}M}, \end{aligned}$$
(7)

where \({\overline{P}}_{I}\) denotes the average index error probability (IEP), \(\eta =1\) for \(N>2\) and \(\eta =2\) for \(N=2\), and \({\overline{P}}_{M}\) is the average SEP of the M-ary symbol detection as long as the activated indices are correctly detected. Plugging (6) and (7) into (5), the generalized BER expression for both the ML and the GD is given by

$$\begin{aligned} P_{b}\approx \frac{\left( \eta p_{1}+m\right) {\overline{P}}_{I}/2+K{\overline{P}}_{M}}{p}, \end{aligned}$$
(8)

where \(m=\log _{2}M\) and \(p=p_{1}+p_{2}\).

Remark 1

As seen from (8), when K increases to N, the BER of either the ML or the GD approaches that of classical OFDM, which is \({\overline{P}}_{M}/m\). As a result, the performance gap between these two detectors gets smaller when K gets larger.

Remark 2

\({\overline{P}}_{M}\) in (8) is the same for both the ML and the GD, while \({\overline{P}}_{I}\) depends on the detection type employed. Thus, to find out the BER expressions for the GD and the ML in MCIK-OFDM-SC, we need to derive \({\overline{P}}_{I}\) for them, considering CSI uncertainty. Meanwhile, \({\overline{P}}_{M}\) is provided in the following lemma when the M-ary PSK modulation is employed.

Lemma 1

Under CSI uncertainty with the error variance \(\epsilon ^{2}\), the average SEP of the conventional M-ary PSK symbol detection in MCIK-OFDM-SC is approximated by

$$\begin{aligned} {\overline{P}}_{M}\approx \frac{\xi }{12}\left\{ \frac{L!}{\prod _{l=1}^{L}\left[ l+\frac{\left( 1-\epsilon ^{2}\right) \rho {\bar{\gamma }}}{1+\epsilon ^{2} {\bar{\gamma }}}\right] }+\frac{3L!}{\prod _{l=1}^{L}\left[ l+\frac{4\left( 1-\epsilon ^{2}\right) \rho {\bar{\gamma }}}{3\left( 1+\epsilon ^{2}{\bar{\gamma }}\right) }\right] }\right\} , \end{aligned}$$
(9)

where \(\rho =\sin ^{2}\left( \pi /M\right)\), \(\xi =1\) for \(M=2\) and \(\xi =2\) for \(M>2\).

Proof

See Appendix A. \(\square\)

3.1 BER for ML with SC reception and CSI uncertainty

We first consider the IEP of the ML in MCIK-OFDM with the SC and imperfect CSI. Denote by \(P_{I_{1}}\) the instantaneous IEP of the ML, which is approximated by [25]

$$\begin{aligned} P_{I_{1}}\approx \frac{K}{N}\sum _{\alpha =1}^{N}\sum _{{\tilde{\alpha }}\ne \alpha =1}^{N-K} \left[ \frac{1}{12}e^{-\frac{{\bar{\gamma }}\left( {\hat{\nu }}_{\alpha }+{\hat{\nu }}_{{\tilde{\alpha }}}\right) }{4+2{\bar{\gamma }}\epsilon ^{2}}}+\frac{1}{4}e^{-\frac{2{\bar{\gamma }}\left( {\hat{\nu }}_{\alpha } +{\hat{\nu }}_{{\tilde{\alpha }}}\right) }{6+3{\bar{\gamma }}\epsilon ^{2}}}\right] , \end{aligned}$$
(10)

where \({\hat{\nu }}_{\alpha }=\left| {\hat{h}}\left( \alpha \right) \right| ^{2}\), \({\hat{\nu }}_{{\tilde{\alpha }}}=\left| {\hat{h}}\left( {\tilde{\alpha }}\right) \right| ^{2}\).

Denote \({\hat{\nu }}_{\varSigma }={\hat{\nu }}_{\alpha }+{\hat{\nu }}_{{\tilde{\alpha }}}\). The moment generating function (MGF) of \({\hat{\nu }}_{\varSigma }\) can be attained by \({\mathcal {M}}_{{\hat{\nu }}_{\varSigma }}\left( s\right) ={\mathcal {M}}_{{\hat{\nu }}}^{2}\left( s\right) ,\) where \({\mathcal {M}}_{{\hat{\nu }}}\left( s\right)\) is the MGF of \({\hat{\nu }}_{\alpha }\) which is given in (33). Here, applying the MGF approach to (10), we obtain the average IEP of the ML with the SC and uncertain CSI as follows

$$\begin{aligned} {\overline{P}}_{I_{1}}\approx \frac{\Psi _{1}}{12}\left\{ \frac{\left( L!\right) ^{2}}{\prod _{l=1}^{L}\left[ l+\frac{\left( 1-\epsilon ^{2}\right) {\bar{\gamma }}}{4+2{\bar{\gamma }}\epsilon ^{2}}\right] ^{2}}+\frac{3\left( L!\right) ^{2}}{\prod _{l=1}^{L} \left[ l+\frac{2\left( 1-\epsilon ^{2}\right) {\bar{\gamma }}}{6+3{\bar{\gamma }}\epsilon ^{2}}\right] ^{2}}\right\} , \end{aligned}$$
(11)

where \(\Psi _{1}=K\left( N-K\right)\).

As observed from (11), note that as \(L=1\), the average IEP of the ML in (11) reduces to [25, Eq. (16)]. In addition, \({\overline{P}}_{I_{1}}\) mainly relies on NK and L, while being less influenced by the modulation size M.

Finally, the BER of the ML (denoted by \(P_{b_{1}}\)) can be obtained by inserting (9) and (11)–(8) as

$$\begin{aligned} P_{b_{1}}&\approx \frac{{\widetilde{\Psi }}_{1}}{24p}\left\{ \frac{\left( L!\right) ^{2}}{\prod _{l=1}^{L} \left[ l+\frac{\left( 1-\epsilon ^{2}\right) {\bar{\gamma }}}{4+2{\bar{\gamma }}\epsilon ^{2}}\right] ^{2}} +\frac{3\left( L!\right) ^{2}}{\prod _{l=1}^{L}\left[ l+\frac{2\left( 1-\epsilon ^{2}\right) {\bar{\gamma }}}{6+3{\bar{\gamma }}\epsilon ^{2}}\right] ^{2}}\right\} \nonumber \\&+\frac{K\xi }{12p}\left\{ \frac{L!}{\prod _{l=1}^{L}\left[ l+\frac{\left( 1-\epsilon ^{2}\right) \rho {\bar{\gamma }}}{1+\epsilon ^{2}{\bar{\gamma }}} \right] }+\frac{3L!}{\prod _{l=1}^{L}\left[ l+\frac{4\left( 1-\epsilon ^{2}\right) \rho {\bar{\gamma }}}{3\left( 1+\epsilon ^{2}{\bar{\gamma }}\right) }\right] }\right\} , \end{aligned}$$
(12)

where \({\widetilde{\Psi }}_{1}=\Psi _{1}\left( \eta p_{1}+m\right) =K\left( N-K\right) \left( \eta p_{1}+m\right) .\)

It is shown from (12) that increasing L improves the BER of the ML. Moreover, for given NL and \({\bar{\gamma }}\), the BER \(P_{b_{1}}\) depends on both K and \(\epsilon ^{2}\). For example, when K gets larger, the second term, which is related to the M-ary symbol detection, will dominate over \(P_{b_{1}}\). Especially, as \(K=N\), (12) reduces to the BER of the classical OFDM.

3.2 BER for GD with SC reception and CSI uncertainty

In MCIK-OFDM with the single antenna used at both the transmitter and the receiver, the IEP of the GD is independent of CSI conditions [26]. However, this is no longer true when employing the SC for MCIK-OFDM. Particularly, the instantaneous IEP of the GD is given by [26, 27]

$$\begin{aligned} P_{I_{2}}=\frac{K}{N}\sum _{\alpha =1}^{N}\sum _{i=1}^{N-K}\frac{\left( -1\right) ^{i+1}C \left( N-K,i\right) }{i+1}e^{-\frac{i{\bar{\gamma }}\nu _{\alpha }}{i+1}}, \end{aligned}$$
(13)

where \(\nu _{\alpha }=\left| h\left( \alpha \right) \right| ^{2}\) which is obviously affected by the estimate \({\hat{h}}_{l}\left( \alpha \right)\) due to \(h\left( \alpha \right) =h_{l^{*}}\left( \alpha \right)\) with \(l^{*}=\max _{l}\left| {\hat{h}}_{l}\left( \alpha \right) \right| ^{2}\). The detailed derivation of (13) over Rayleigh fading channels was presented in [28], which is not included here for the sake of brevity. Thus, the IEP of the GD in our system depends on the channel estimation errors. This makes the derivation of the average IEP for this detector non-trivial as follows.

First, it is needed to figure out the MGF of \(\nu _{\alpha }\). Using (2), \(h\left( \alpha \right)\) can be represented as \(h\left( \alpha \right) =e^{j\phi }\left| {\hat{h}}\left( \alpha \right) \right| +e\left( \alpha \right) =e^{j\phi } \left( \left| {\hat{h}}\left( \alpha \right) \right| +{\tilde{e}}\left( \alpha \right) \right)\), where \({\tilde{e}}\left( \alpha \right) =e^{-j\phi }e\left( \alpha \right) \sim \mathcal{C}\mathcal{N}\left( 0,\epsilon ^{2}\right)\) and \(\phi\) denotes the argument of \({\hat{h}}\left( \alpha \right)\). This results in

$$\begin{aligned} \left| h\left( \alpha \right) \right| ^{2}=\left| \left| {\hat{h}}\left( \alpha \right) \right| +{\tilde{e}} \left( \alpha \right) \right| ^{2}. \end{aligned}$$
(14)

From (14), the MGF of \(\nu _{\alpha }\) can be computed as

$$\begin{aligned} {\mathcal {M}}_{\nu }\left( t\right)&={\mathbb {E}}_{\left| h\left( \alpha \right) \right| ^{2}}\left\{ e^{\left| h \left( \alpha \right) \right| ^{2}t}\right\} \nonumber \\&={\mathbb {E}}_{\left| {\hat{h}}\left( \alpha \right) \right| ^{2}}\left\{ {\mathbb {E}}_{\left| \left| {\hat{h}} \left( \alpha \right) \right| +{\tilde{e}}\left( \alpha \right) \right| ^{2}}\left\{ e^{\left| \left| {\hat{h}}\left( \alpha \right) \right| +{\tilde{e}}\left( \alpha \right) \right| ^{2}t}\right\} \right\} \nonumber \\&=\int _{0}^{\infty }f_{\left| {\hat{h}}\left( \alpha \right) \right| ^{2}}\left( x\right) {\mathcal {M}}_{\left| \left| {\hat{h}}\left( \alpha \right) \right| +{\tilde{e}}\left( \alpha \right) \right| ^{2}}\left( t\right) dx, \end{aligned}$$
(15)

which motivates us to propose the following lemma.

Lemma 2

Let \({\tilde{e}}\left( \alpha \right) \sim \mathcal{C}\mathcal{N}\left( 0,\epsilon ^{2}\right) ,\) then for given \(\left| {\hat{h}}\left( \alpha \right) \right| ^{2},\) the MGF of \(\left| \left| {\hat{h}}\left( \alpha \right) \right| +{\tilde{e}}\left( \alpha \right) \right| ^{2}\) is given by

$$\begin{aligned} {\mathcal {M}}_{\left| \left| {\hat{h}}\left( \alpha \right) \right| +{\tilde{e}}\left( \alpha \right) \right| ^{2}} \left( t\right) =\frac{e^{\frac{\left| {\hat{h}}\left( \alpha \right) \right| ^{2}t}{1-\epsilon ^{2}t}}}{1-\epsilon ^{2}t}. \end{aligned}$$
(16)

Proof

See Appendix B. \(\square\)

Inserting (32) and (16) into (15), through simple manipulations, we obtain

$$\begin{aligned} {\mathcal {M}}_{\nu }\left( t\right) =\frac{L!}{\left( 1-\epsilon ^{2}t\right) \prod _{l=1}^{L} \left[ l-\frac{\left( 1-\epsilon ^{2}\right) t}{1-\epsilon ^{2}t}\right] }. \end{aligned}$$
(17)

Note that to the best of our knowledge, the approach to derive the MGF of \(\nu _{\alpha }\) in closed-form (17) is novel. This interestingly results in a simple, exact closed-form expression for the average IEP of the GD with the SC and uncertain CSI, by applying the MGF approach to (13) and using (17), as

$$\begin{aligned} {\overline{P}}_{I_{1}}=K\sum _{i=1}^{N-K}\frac{\left( -1\right) ^{i+1}C\left( N-K,i\right) L!}{\left( i+1+i\epsilon ^{2}{\bar{\gamma }}\right) \prod _{l=1}^{L}\left[ l+\frac{\left( 1-\epsilon ^{2}\right) i{\bar{\gamma }}}{i+1+i\epsilon ^{2}{\bar{\gamma }}}\right] }. \end{aligned}$$
(18)

As observed from (18), when \(L=1\), the expression for \({\overline{P}}_{I}\) becomes [26, Eq. (8)] which no longer depends on \(\epsilon ^{2}.\) In addition, as \(L>1\), the IEP performance suffers from a degradation caused by CSI uncertainty, i.e., \(\epsilon ^{2}.\) Note that for any \(\epsilon ^{2}\in \left[ 0,1\right)\), \({\overline{P}}_{I_{1}}\) always tends to 0 as \({\bar{\gamma }}\) increases to infinity, even for the worst case of \(\epsilon ^{2}=1\).

Finally, the BER of the GD with the SC and uncertain CSI can be attained by substituting (9) and (18)–(8) as follows:

$$\begin{aligned} P_{b_{2}}&\approx \frac{K\left( \eta p_{1}+m\right) }{2p}\sum _{i=1}^{N-K}\frac{\left( -1\right) ^{i+1}C \left( N-K,i\right) L!}{\left( i+1+i\epsilon ^{2}{\bar{\gamma }}\right) \prod _{l=1}^{L} \left[ l+\frac{\left( 1-\epsilon ^{2}\right) i{\bar{\gamma }}}{i+1+i\epsilon ^{2}{\bar{\gamma }}}\right] }\nonumber \\&\quad +\frac{K\xi }{12p}\left\{ \frac{L!}{\prod _{l=1}^{L}\left[ l+\frac{\left( 1-\epsilon ^{2}\right) \rho {\bar{\gamma }}}{1+\epsilon ^{2}{\bar{\gamma }}} \right] }+\frac{3L!}{\prod _{l=1}^{L}\left[ l+\frac{4\left( 1-\epsilon ^{2}\right) \rho {\bar{\gamma }}}{3 \left( 1+\epsilon ^{2}{\bar{\gamma }}\right) }\right] }\right\} . \end{aligned}$$
(19)

Observe from (19) that different from MCIK-OFDM with the single antenna [26], where \(\epsilon ^{2}\) affects only the term related to the M-ary symbol detection, in MCIK-OFDM-SC having multiple anttenas, \(\epsilon ^{2}\) influences on both the index detection error and the M-ary symbol detection error. As \(L=1\), (19) reduces to [26, Eq. (15)], which confirms the accuracy of our derivation for the BER expression of MCIK-OFDM-SC.

4 Asymptotic analysis

We now carry out the asymptotic analysis for the BERs of both ML and GD detectors at high SNRs and in a large number of antennas. In particular, we investigate the impact of various CSI uncertainties, namely perfect CSI, fixed CSI uncertainty, and minimum mean square (MMSE) based variable CSI uncertainty. In addition, the performance comparison between the two detectors is provided. This allows to recommend that when the GD should be used under each CSI condition as the number of antennas increases.

Note that existing studies [25,26,27] have not provided any analytical comparisons between the ML and the GD such as the behavior of the coding gain gap between them when the number of antennas changes. Moreover, [27] even has not included any asymptotic analysis for the GD with the SC.

4.1 Perfect CSI \((\epsilon ^{2}=0)\)

As \(\epsilon ^{2}=0\) and \({\bar{\gamma }}\) tends to infinity, the BERs in (12) and (19) can be asymptotically approximated by

$$\begin{aligned}{} & {} P_{b_{1}}\approx \Upsilon \left( \frac{\xi \Omega }{6\rho ^{L}}\right) \frac{1}{\gamma _{0}^{L}}, \end{aligned}$$
(20)
$$\begin{aligned}{} & {} P_{b_{2}}\approx \Upsilon \left[ \left( \eta p_{1}+m\right) \omega +\frac{\xi \Omega }{6\rho ^{L}}\right] \frac{1}{\gamma _{0}^{L}}, \end{aligned}$$
(21)

where \(\Upsilon =K^{L+1}L!/2pN^{L}\), \(\Omega =1+3^{L+1}/4^{L}\), \(\omega =\sum _{i=1}^{N-K}\left( -1\right) ^{i+1}C\left( N-K,i\right) \left( 1+i\right) ^{L-1}/i^{L}\), and \(\gamma _{0}=E_{s}/N_{0}\) is the average SNR per sub-carrier.

As observed from (20) and (21), both the ML and the GD attain a diversity order of L under perfect CSI. Moreover, for given N and L, a smaller K provides lower BERs.

Regarding the comparison between the GD and the ML, we consider the coding gain attained by the ML over the GD under perfect CSI (denoted by \(\Delta _{1}\)), which can be denoted by \(\Delta _{1}=10\log _{10}\left( P_{b_{2}}/P_{b_{1}}\right) ^{1/L}\). Using (20) and (21), we have

$$\begin{aligned} \Delta _{1}=\frac{10}{L}\log _{10}\left( 1+\eta _{1}\right) \text {\,(dB)}, \end{aligned}$$
(22)

where \(\eta _{1}=6\left( \eta p_{1}+m\right) \omega \rho ^{L}/\xi \Omega\). Based on this result, we introduce the following theorem.

Theorem 1

Consider MCIK-OFDM with the SC and perfect CSI. For \(M=2\), the ML performs better than the GD in terms of the BER by 3 dB, at large L, i.e, \(\lim _{L\rightarrow \infty }\Delta _{1}\approx 3\) (dB). For \(M\ge 4\), the BER of GD approaches to that of ML when increasing L, i.e., \(\lim _{L\rightarrow \infty }\Delta _{1}=0\) (dB). Especially, when \(M\ge 8\), the BERs of the two detectors rapidly converge to each other as L increases, i.e., \(\lim _{L\rightarrow \infty }\eta _{1}=0\).

Proof

Since \(\omega\) in (21) can be approximated by \(\omega \approx \left( N-K\right) 2^{L-1}\) at large L, we approximate \(\eta _{1}\) at large L as

$$\begin{aligned} \eta _{1}\approx \beta _{1}\lambda _{1}^{L}, \end{aligned}$$
(23)

where \(\lambda _{1}=2\rho ,\) recalling \(\rho =\sin ^{2}\left( \pi /M\right) ,\) and \(\beta _{1}=3\left( \eta p_{1}+m\right) \left( N-K\right) /\xi \Omega\) which decreases when increasing L due to \(\Omega =1+3^{L+1}/4^{L}.\)

For \(M=2\), we obtain \(\lambda _{1}=2\), thus \(\eta _{1}\approx \beta _{1}2^{L}\). Using (22), \(\lim _{L\rightarrow \infty }\Delta _{1}=\lim _{L\rightarrow \infty }\left( 10/L\right) \log _{10} \left( 1+\beta _{1}2^{L}\right) =10\log _{10}2\approx 3\) (dB).

For \(M\ge 4,\) we obtain \(\lambda _{1}\le 1\), thus \(1<\eta _{1}\le 1+\beta _{1}\). This leads to \(\lim _{L\rightarrow \infty }\Delta _{1}=0\) (dB).

For \(M\ge 8,\) we attain \(\lambda _{1}\le 2\sin ^{2}\left( \pi /8\right) <0.3\), which results in \(\lim _{L\rightarrow \infty }\eta _{1}=\lim _{L\rightarrow \infty }\beta _{1}\lambda _{1}^{L}=0\). \(\square\)

Remark 3

From Theorem 1, it is recommended that the GD should be used rather than the ML under perfect CSI as \(M\ge 8\), especially when L gets larger. This is because the GD can achieve a nearly optimal BER at a significantly lower complexity than the ML detector for large M and L. Note that the complexities of the ML and GD in MCIK-OFDM with the SC are \({\mathcal {C}}_{ML-SC}=N+2CM^{K}\) and \({\mathcal {C}}_{GD-SC}=2N+2KM,\) respectively, where \(C=2^{p_{1}}\) [27]. Obviously, when K and M become larger, we attain \({\mathcal {C}}_{ML-SC}\gg {\mathcal {C}}_{GD-SC}\).

4.2 Fixed CSI uncertainty \((\epsilon ^{2}>0)\)

As \(\epsilon ^{2}>0\) is fixed, the BERs in (18) and (19) can be rewritten at high SNRs, respectively, as follows:

$$\begin{aligned}&P_{b_{1}} \approx \underbrace{\frac{{\widetilde{\Psi }}_{1}}{24p}\left\{ \frac{1}{\prod _{l=1}^{L} \left[ 1+\frac{\left( 1-\epsilon ^{2}\right) }{2l\epsilon ^{2}}\right] ^{2}}+\frac{3}{\prod _{l=1}^{L} \left[ 1+\frac{2\left( 1-\epsilon ^{2}\right) }{3l\epsilon ^{2}}\right] ^{2}}\right\} }_{A_{1}}\nonumber \\&\quad +\frac{K\xi }{12p}\left\{ \frac{1}{\prod _{l=1}^{L}\left[ 1+\frac{\left( 1-\epsilon ^{2}\right) \rho }{l\epsilon ^{2}}\right] } +\frac{3}{\prod _{l=1}^{L}\left[ 1+\frac{4\left( 1-\epsilon ^{2}\right) \rho }{3l\epsilon ^{2}}\right] }\right\} , \end{aligned}$$
(24)
$$\begin{aligned}&P_{b_{2}}\approx \frac{K\xi }{12p}\left\{ \frac{1}{\prod _{l=1}^{L}\left[ 1+\frac{\left( 1-\epsilon ^{2}\right) \rho }{l\epsilon ^{2}}\right] } +\frac{3}{\prod _{l=1}^{L}\left[ 1+\frac{4\left( 1-\epsilon ^{2}\right) \rho }{3l\epsilon ^{2}}\right] }\right\} , \end{aligned}$$
(25)

where we recall that \({\widetilde{\Psi }}_{1}=K\left( N-K\right) \left( \eta p_{1}+m\right) .\)

As seen from (24) and (25), for fixed \(\epsilon ^{2}\), there exists error floors on the BERs of both the ML and the GD, or equivalently, increasing the SNR does not improve the BER. Thus, these two detectors in this case achieve a zero diversity gain for any L. Furthermore, when L gets larger or \(\epsilon ^{2}\) gets smaller, the error floors in (24) and (25) become lower.

The following theorem compares the BER between the ML and the GD in MCIK-OFDM with the SC and fixed \(\epsilon ^{2}\).

Theorem 2

In MCIK-OFDM using the SC under fixed CSI uncertainty, the GD achieves a better BER than the ML detector at high SNRs, i.e., \(P_{b_{1}}>P_{b_{2}}\).

Proof

It is shown from (24) and (25) that at high SNRs, \(P_{b_{1}}=P_{b_{2}}+A_{1}>P_{b_{2}},\) where the term \(A_{1}\) is related to the index detection error of the ML. This concludes the proof.

Remark 4

As a result of Theorem 2, under fixed CSI imperfection, the GD is able to outperform the ML in terms of both the BER and computational complexity, even for any M. This is obviously contrary to the perfect CSI case, where the BER of ML is always lower than that of GD.

4.3 MMSE-based variable CSI uncertainty

Note that the error variance provided by the MMSE channel estimator is given by [26]

$$\begin{aligned} \epsilon ^{2}=\frac{1}{1+\gamma _{0}}, \end{aligned}$$
(26)

which varies as a decreasing function of the SNR.

Inserting (26)–(12) and (19), we obtain the asymptotic BERs for the ML and the GD in this case as

$$\begin{aligned}{} & {} P_{b_{1}}\approx \Upsilon \left[ \frac{\xi \Omega \left( 1+N/K\right) ^{L}}{6\rho ^{L}}\right] \frac{1}{\gamma _{0}^{L}}, \end{aligned}$$
(27)
$$\begin{aligned}{} & {} P_{b_{2}}\approx \Upsilon \left[ \psi \left( \eta p_{1}+m\right) +\frac{\xi \Omega \left( 1+N/K\right) ^{L}}{6\rho ^{L}}\right] \frac{1}{\gamma _{0}^{L}}, \end{aligned}$$
(28)

where \(\Upsilon\) and \(\Omega\) are defined in (21), and \(\psi =\sum _{i=1}^{N-K}\left( -1\right) ^{i+1}C\left( N-K,i\right) \left( i+1+iN/K\right) ^{L-1}/i^{L}\)

As seen from (27) and (28), both the GD and the ML of MCIK-OFDM with the SC achieves the same diversity order of L in this case. However, due to the impact of MMSE channel estimation errors, the BERs in (27) and (28) are obviously greater than that of the perfect CSI case. For example, we can see from (20) and (27) that under the MMSE CSI imperfection, the ML endures a coding gain loss of \(10\log _{10}\left( 1+N/K\right)\) (dB) compared with the perfect CSI case.

As for the comparison in the BER between the ML and the GD, denote by \(\Delta _{2}\) the coding gain attained by the ML over GD detector under MMSE variable CSI uncertainty, which can be obtained from (27) and (28) as

$$\begin{aligned} \Delta _{2}=\frac{10}{L}\log _{10}\left( 1+\eta _{2}\right) \text {\,(dB)}, \end{aligned}$$
(29)

where \(\eta _{2}=6\psi \left( \eta p_{1}+m\right) \rho ^{L}/\xi \Omega \left( 1+N/K\right) ^{L}.\) Similar to Theorem 1, utilizing (29) we propose the following theorem.

Theorem 3

Consider MCIK-OFDM using the SC and the MMSE-based variable CSI imperfection. For \(M\ge 4\), the BERs of the ML and the GD rapidly converge to each other as increasing L, i.e., \(\lim _{L\rightarrow \infty }\eta _{2}=0\). When \(M=2\), the ML performs better than the GD in terms of the BER by a coding gain of \(10\log _{10}\left[ 1+K/\left( N+K\right) \right] (dB)\), at large L, i.e., \(\lim _{L\rightarrow \infty }\Delta _{2}=10\log _{10}\left[ 1+K/\left( N+K\right) \right]\) (dB), moreover \(\lim _{L\rightarrow \infty }\Delta _{2}<\lim _{L\rightarrow \infty }\Delta _{1}.\)

Proof

Akin to Theorem 1, at large L, \(\psi\) in (28) can be approximated as \(\psi \approx \left( N-K\right) \left( 2+N/K\right) ^{L-1}.\) Thus,

$$\begin{aligned} \eta _{2}\approx \beta _{2}\lambda _{2}^{L}, \end{aligned}$$
(30)

where \(\beta _{2}=6\left( \eta p_{1}+m\right) \left( N-K\right) /\xi \Omega \left( 2+N/K\right)\) which is a decreasing function of L and \(\lambda _{2}=\rho \left[ 1+K/\left( N+K\right) \right] .\)

For \(M\ge 4\), we have \(\lambda _{2}\le \left[ 1+K/\left( N+K\right) \right] /2<1\) for any \(K<N\). Hence, \(\lim _{L\rightarrow \infty }\eta _{2}=\lim _{L\rightarrow \infty }\beta _{2}\lambda _{2}^{L}=0.\)

For \(M=2,\) we attain \(\lambda _{2}=1+K/\left( N+K\right) >1\). Thus, \(\lim _{L\rightarrow \infty }\Delta _{2}=\left( 10/L\right) \log _{10}\left[ 1+K/\left( N+K\right) \right] ^{L} =10\log _{10}\left[ 1+K/\left( N+K\right) \right]\) (dB). Moreover, due to \(1+K/\left( N+K\right) <1.5\), \(\lim _{L\rightarrow \infty }\Delta _{2}<\left( 10/L\right) \log _{10}\left( 1.5^{L}\right) \approx 1.76< \lim _{L\rightarrow \infty }\Delta _{1}\approx 3\) (dB). \(\square\)

Remark 5

Compared to the perfect CSI case (Theorem 1), Theorem 3 indicates that for given M, the performance gap between the two detectors under uncertain CSI gets smaller than that under perfect CSI. Therefore, the GD becomes more attractive than the ML under the MMSE CSI condition, particularly when the receiver has more antennas.

5 Simulation results

We provide simulation results for MCIK-OFDM-SC having \(N_{c}=128\) total sub-carriers, which are divided into G clusters, each having N sub-channels. For illustrations, we consider \(N\in \left\{ 2,4\right\}\), \(K<4\), \(M\in \left\{ 2,4,8\right\}\), and \(L\in \{1,2,4,8,12\}\). The BER simulation results for the GD are compared to the ML under various MCIK parameters and CSI conditions.

5.1 Accuracy of theoretical and asymptotic expressions

Fig. 2
figure 2

BER of the GD detector in MCIK-OFDM-SC under various CSI conditions, with \((N,K,M,L)=(4,1,4,2)\)

Fig. 3
figure 3

BER of the ML detector in MCIK-OFDM-SC under various CSI conditions, with \((N,K,M,L)=(4,2,4,3)\)

Fig. 4
figure 4

BER comparison between the ML and the GD in MCIK-OFDM-SC under perfect CSI, with \((N,K,M)=(2,1,2)\) and \(L=1,2,4,8\)

Figure 2 depicts the simulation results of MCIK-OFDM-SC using the GD, along with the theoretical and asymptotic BER expressions when \((N,K,M,L)=(4,1,4,2)\), under various CSI conditions. As observed from Fig. 2, the theoretical BER expressions derived for the GD are very tight, i.e., very close to simulation results in a broad range of SNRs, while the asymptotic results are accurate in high SNR regions. This observation clearly confirms the accuracy of our theoretical analysis provided in Sects. 3 and 4. In addition, under fixed or variable \(\epsilon ^{2}\), the GD suffers from a considerable loss in the BER compared to the perfect CSI case (\(\epsilon ^{2}=0\)). For example, at BER of \(10^{-3}\) in Fig. 2, the loss of SNR gain caused by fixed or variable CSI uncertainty is more than 4 dB. Note that a similar observation can be seen in Fig. 3 for the ML detector.

5.2 BER under perfect CSI

Fig. 5
figure 5

BER comparison between the ML and the GD in MCIK-OFDM-SC under perfect CSI, with \((N,K,M)=(4,3,4)\) and \(L=1,2,8\)

Figure 4 depicts the BERs for the ML and the GD in MCIK-OFDM-SC under perfect CSI, with \((N,K,M)=(2,1,2)\) and \(L=1,2,4,8\). As observed from Fig. 4, the ML always outperforms the GD even as L increases. For instance, as \(L=8\), at BER of \(10^{-4}\), the ML achieves the SNR gain of 3 dB over the GD. This confirms Theorem 1 as \(M=2\).

In Fig. 5, the BER comparison between the two detectors under perfect CSI is illustrated for MCIK-OFDM-SC with \((N,K,M)=(4,3,4)\) and \(L=1,2,8.\) It is shown from Fig. 5 that when \(M=4\), the BER of the GD approaches to that of the ML as L gets larger. In particular, at BER of \(10^{-3}\), the coding gain attained by the ML over the GD is about 5 dB when \(L=1\), while this gain reduces to only 1 dB when \(L=8\). This validates Theorem 1 for the case of \(M=4.\)

Figure 6 illustrates the BERs for the ML and the GD when \((N,K,M)=(4,2,8)\) and \(L=1,2,4,8\). It is clear from this figure that the BER of GD rapidly tends to that of the ML as L increases. Specifically, as \(L=4\), the performance gap between these two detectors becomes negligible. This confirms Theorem 1 for the case of \(M\ge 8.\)

Fig. 6
figure 6

BER comparison between the ML and the GD in MCIK-OFDM-SC under perfect CSI, with \((N,K,M)=(4,2,8)\) and \(L=1,2,4,8\)

5.3 BER under fixed CSI uncertainty

Figure 7 depicts the BER comparison between the ML and the GD under fixed CSI uncertainty, with \((N,K,M)=(4,2,2)\), \(L=2,4,8,12\) and \(\epsilon ^{2}=0.2\). Interestingly, it can be seen from this figure that at high SNRs, the GD outperforms the ML in terms of the BER. For example, as \(L=4,\) the GD achieves the BER lower than the ML when \(E_{s}/N_{0}\ge 15\) dB. This is due to the fact that under the fixed CSI uncertainty, using the energy detection, the GD achieves better index detection performance than its ML counterpart, leading to better BER performance, as theoretically proved in Sect. IV-B. Moreover, due to the fixed error variance, i.e., \(\epsilon ^{2}=0.2\), there exists error floors on the BERs of the two detectors. These floors get lower as L increases. This observations validate Theorem 2.

Fig. 7
figure 7

BER comparison between the ML and the GD in MCIK-OFDM-SC under fixed CSI, with \((N,K,M)=(4,2,2)\), \(L=2,4,8,12\) and \(\epsilon ^{2}=0.2\)

5.4 BER under MMSE variable CSI uncertainty

Figure 8 depicts the BER comparison between the GD and ML detectors under MMSE-based variable CSI uncertainty, with \((N,K,M)=(2,1,2)\) and \(L=1,2,4,8\). As seen via Fig. 8, when L gets larger, the BERs of the two detectors become closer. However, the ML always outperforms the GD. In addition, the performance gap between them under variable CSI uncertainty gets smaller than that under perfect CSI in Fig. 4. These observations validate Theorem 3 for \(M=2\).

Fig. 8
figure 8

BER comparison between the ML and the GD in MCIK-OFDM-SC under MMSE variable CSI uncertainty, with \((N,K,M)=(2,1,2)\) and \(L=1,2,4,8\)

Figure 9 compares the BER between the two detectors under MMSE variable CSI, when \((N,K,M)=(4,1,4)\) and \(L=1,2,4,8\). Unlike the perfect CSI case, the BERs of the ML and the GD under this CSI condition quickly converge to each other as L increases even when \(M=4\). Similar to Fig. 6, as \(L\ge 2\) there is a marginal gap in the BER between the two detectors. Hence, Theorem 3 with \(M\ge 4\) is clearly validated.

Fig. 9
figure 9

BER comparison between the ML and the GD in MCIK-OFDM-SC under MMSE variable CSI uncertainty, with \((N,K,M)=(4,1,4)\) and \(L=1,2,4,8\)

6 Conclusions

We proposed a generalized framework for the BER analysis of MCIK-OFDM using either the GD or ML detector. Based on this, we derived tight, closed-form expressions for the BERs of MCIK-OFDM with the selection combining ML (or GD) receiver, taking effects of CSI uncertainty into account. We provided the asymptotic analysis to investigate impacts of imperfect CSI on their BERs. Furthermore, the BER comparison between the GD and ML detectors under various CSI conditions was presented, which allows to provide a theoretical guideline on the signal detection of MCIK-OFDM-SC under each specific CSI condition. For example, under MMSE-based variable CSI, the SC-based GD was shown to approach the SC-based ML in terms of the BER as the number of antennas increases and \(M\ge 4\). More interestingly, under fixed CSI uncertainty and at high SNRs, the SC-based GD always outperforms the SC-based ML in terms of the BER for any value of M. Finally, the derived BER expressions and theoretical guideline are validated via simulation results. It is noteworthy that the derived expressions and proposed guideline for using the GD would be useful for various designs of the practical implementation of MCIK-OFDM. In our future work, we plan to investigate the performance of MCIK-OFDM-SC in combination with a number of diversity enhancement techniques, such as coordinate interleaving [8], repetition codes [11], and spreading matrix [14].