1 Introduction

In 1982, Ungerboeck proposed, in his paper on trellis-coded modulation [1], that coding and modulation can be considered jointly as one entity to improve the performance of a communication system. This concept was subsequently developed into many variants in the field of coded modulation. In [2], Zehavi proposed replacing the symbol-wise interleaver in Ungerboeck’s method with a bit-wise interleaver, implementing binary encoding and decoding independently at both ends of the channel by decomposing the non-binary-input channel into binary channels. The results showed that, as a suboptimal scheme (assuming independence between the bit channels), bit-interleaved coded modulation (BICM) offers significant advantages in the high signal-to-noise ratio (SNR) region with low computational complexity. At the same time, the encoding and modulation procedures of BICM can be designed simply and independently, since they are decoupled by a bit-wise interleaver [3]. With these remarkable advantages, BICM has become a popular scheme in modern communication systems.

Polar codes are famous for their channel coding properties of low complexity and capacity achievement [4]; the capacity-achieving property is an asymptotic characteristic of polar codes with infinite code length, applied to binary-input discrete memoryless symmetric (B-DMS) channels. As the performance of successive cancellation (SC) decoding is disappointing for short polar codes, Tal and Vardy proposed a list decoding algorithm for polar codes, i.e., the successive cancellation list (SCL) decoding algorithm, to improve the performance of polar codes with moderate code length. The SCL decoding algorithm can practically achieve the maximum likelihood (ML) performance of polar codes [5], which has attracted further interest in the channel coding field and makes polar codes competitive with state-of-the-art codes, i.e., LDPC [6] and Turbo codes [7]. Meanwhile, since the iterative belief propagation decoding algorithm is inherently parallel, this decoder has also attracted considerable attention for hardware implementation [8,9,10].

In the following years, polar codes have been widely applied in coded modulation systems, especially \(2^m\)-order digital modulation systems [11, 12], which improve the spectral efficiency over binary polar coded modulation to some degree. With the advent of polar coded modulation, BICM has attracted renewed attention. In [13], the authors focused on polar coded BICM and designed a kind of interleaver that outperformed others with the employed mappings.

Beyond this competitive performance, since polar codes offer flexibility for practical applications in accommodating multiple coding rates and various transmission channels, they have been selected as the official coding scheme for the control channel in the fifth generation (5G) enhanced mobile broadband (eMBB) scenario [14, 15]. However, there are few works that jointly consider multilevel modulation and polar codes under SC and SCL decoding, with a general polar code construction for an equivalent channel model in a bit-interleaved polar coded modulation (BIPCM) system.

In this paper, we aim to offer a general equivalent channel model of BICM with multi-order modulation and a universal, flexible polar code construction to optimize the information transmission problem of non-binary discrete memoryless channels. The main contributions are as follows:

  • Through observing a group of BIPCM schemes and the structural features of polar codes, we find that BIPCM can be transformed into an equivalent channel model consisting of several independent parallel channels by considering modulation and polar coding jointly. We analyze the equivalence properties of the channels and analytically prove that the equivalent channels are capacity-achieving.

  • We propose a general polar code construction algorithm and design component polar codes for the equivalent channel model by applying the proposed algorithm to the independent parallel channels.

  • We use general bijective mapping rules to map coded bits to signal points derived from constellations of various modulation orders over the equivalent channel model. Detailed simulation results show the error-correcting performance of different synthetic schemes consisting of multi-order modulation with various decoding algorithms.

We analyze the time and space complexities of these BIPCM schemes, and use simulation results to compare the performance of different schemes consisting of concatenations of polar codes with multi-order modulation under SC, SCL, and log-likelihood-ratio-based (LLR-based) SCL decoding algorithms over AWGN channels.

The remainder of this paper is organized as follows. In Sect. 2, the preliminaries of the overall system are introduced, including symbol notation, the definition of polar codes, and the mapping rules. In Sect. 3, the system model of BIPCM and a general equivalent channel model are built; we then analyze and derive the capacities of both channel models and design the equivalent BIPCM by applying the general code construction algorithm under a coding rate constraint. We further analyze the detailed decoding processes of the SC, SCL, and LLR-based SCL decoding algorithms, and compare their time and space complexities. In Sect. 4, numerical results illustrate that the error-correcting performance of the schemes over the equivalent channel model with SCL decoding outperforms that of the LDPC codes of the WiMAX standard and of Turbo codes. In addition, we compare the reliability of the proposed scheme with that of several other schemes under different modulation orders. Section 5 concludes the paper.

2 Preliminaries

Let \({\mathcal{C}} \triangleq \{ 1,2, \ldots ,N\}\) denote an integer set. The cardinality of set \({\mathcal{C}}\) is represented by \(|{\mathcal{C}}| = N\). Boldface upper case letters represent matrices, while boldface lower case letters denote vectors. Lower case letters without boldface represent scalar parameters. Upper case letters without boldface denote sets or channels, depending on the context. We set \({\mathbf{u}}_{p}^{q} = [u_{p} ,u_{p + 1} \ldots ,u_{q} ],\;(0 \le p < q)\). Set \({\mathbf{F}} \triangleq \left[ {\begin{array}{*{20}c} 1 & 0 \\ 1 & 1 \\ \end{array} } \right]\) to denote the underlying kernel matrix of polar codes. Matrix dimensions are denoted by (m, n).

2.1 Polar Codes

Let the quadruple \((N,K,{\mathcal{A}},{\mathbf{u}}_{{{\mathcal{A}}^{c} }} )\) denote a polar code, where N is the code length, K is the number of information bits, and the subset \({\mathcal{A}} \subset {\mathcal{C}}\) is the index set of the information bit channels used to transmit information bits, with cardinality \(|{\mathcal{A}}| = K\). The vector \({\mathbf{u}}_{{{\mathcal{A}}^{c} }}\) denotes the frozen bit sequence, whose index set \({\mathcal{A}}^{c}\) has cardinality \(| {\mathcal{A}}^{c} | = N - K\). In general, we set N to be a power of 2, and the number of bit layers of the polar code is \(n = \log_{2} (N)\). The generator matrix of the polar code is defined by \({\mathbf{G}} = {\mathbf{B}}^{N} {\mathbf{F}}^{ \otimes n}\), where \({\mathbf{B}}^{N}\) denotes the bit-reversal permutation matrix. A given information sequence \({\mathbf{u}}_{{\mathcal{A}}} = {\mathbf{u}}_{1}^{K}\) is placed at the information indices \({\mathcal{A}}\) of the vector \({\mathbf{u}}_{1}^{N}\), and the polar codeword is generated by \({\mathbf{v}}_{1}^{N} = {\mathbf{u}}_{1}^{N} \cdot {\mathbf{G}}\). The binary field is denoted by \({\mathcal{F}}_{2}\).
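
For concreteness, this encoding rule can be sketched in a few lines of Python (a minimal illustration with our own function names, not the authors' implementation):

```python
import numpy as np

def polar_encode(u, n):
    """Encode a length-N = 2**n vector u (frozen bits already placed)
    as v = u . B^N . F^{(x)n} over GF(2)."""
    N = 1 << n
    assert len(u) == N
    F = np.array([[1, 0], [1, 1]], dtype=np.uint8)
    G = np.array([[1]], dtype=np.uint8)
    for _ in range(n):                  # build F^{(x)n} by repeated Kronecker products
        G = np.kron(G, F)
    # bit-reversal permutation B^N, applied as a permutation of the input vector
    perm = [int(format(i, f"0{n}b")[::-1], 2) for i in range(N)]
    u_rev = np.asarray(u, dtype=np.uint8)[perm]
    return (u_rev.astype(int) @ G) % 2

# example: N = 8, information bits on channels {4, 6, 7, 8} (1-indexed), frozen bits 0
u = np.zeros(8, dtype=np.uint8)
u[[3, 5, 6, 7]] = [1, 0, 1, 1]
print(polar_encode(u, 3))
```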

2.2 General Mapping

In order to generate more reliable and robust Euclidean space codes for a given \(2^m\)-order signal constellation, each signal point is assigned a label consisting of m coded bits. Specifically, we let \(M = 2^m\) denote the order of the constellation, with each label represented by m bits. In this paper, we apply a bijective mapping function \(x = f({\mathbf{v}}_{1}^{m} )\) to map the binary address index vector \({\mathbf{v}}_{1}^{m} = (v_{1} , \ldots ,v_{m - 1} ,v_{m} ),\;v_{i} \in \{ 0,1\} ,\;\;1 \le i \le m\), to the corresponding signal point of the input alphabet \(x \in {\mathcal{X}}\). Generally, the mapping arises from the procedure of successively partitioning the signal set \({\mathcal{X}}\) into subsets [16].

The labeling rule transforms a channel with \(2^m\)-ary input into m parallel bit channels, which simplifies encoding and decoding for the non-binary channel, i.e., binary encoding and decoding directly replace their non-binary counterparts. We consider two kinds of labeling rules: Gray labeling and set partitioning labeling. Since Gray labeling maximizes the capacity of the BICM transmission scheme [3], we give priority to Gray mapping in this paper.

In [17], the authors pointed out that natural binary labeling can be transformed into binary reflected Gray labeling using a linear transformation, and vice versa, for one-dimensional constellations. To make the best use of the set partitioning (SP) and Gray labeling rules, we have:

$${\mathbf{M}}_{SP,3} = \left[ {\begin{array}{*{20}c} 0 &\quad 0 &\quad 0 \\ 1 &\quad 0 &\quad 0 \\ 0 &\quad 1 &\quad 0 \\ 1 &\quad 1 &\quad 0 \\ 0 &\quad 0 &\quad 1 \\ 1 &\quad 0 &\quad 1 \\ 0 &\quad 1 &\quad 1 \\ 1 &\quad 1 &\quad 1 \\ \end{array} } \right],\quad {\mathbf{M}}_{Gray,3} = \left[ {\begin{array}{*{20}c} 0 &\quad 0 &\quad 0 \\ 1 &\quad 0 &\quad 0 \\ 1 &\quad 1 &\quad 0 \\ 0 &\quad 1 &\quad 0 \\ 0 &\quad 1 &\quad 1 \\ 1 &\quad 1 &\quad 1 \\ 1 &\quad 0 &\quad 1 \\ 0 &\quad 0 &\quad 1 \\ \end{array} } \right]$$
(1)

The index of each constellation point can be represented by a given number of bits. For example, a given \(2^m\)-order ASK (\(2^m\)-ASK) constellation is characterized by a binary \((M,m)\) matrix \({\mathbf{M}}_{SP,m}\), where each row is the m-bit binary representation of its row index in the set \(\{ 0, \ldots ,2^{m} - 1 \}\). In particular, the right-most column holds the most significant bits.

In [17], the authors showed that the set partitioning and Gray labeling rules can be converted into each other. In other words, a set partitioning labeling can be translated into a binary Gray labeling over a \(2^{m}\)-ASK constellation by multiplying by an \(m \times m\) binary matrix \({\mathbf{T}}_{m}\):

$${\mathbf{T}}_{m} = \left[ {\begin{array}{*{20}c} 1 &\quad 0 &\quad 0 &\quad \cdots &\quad 0 &\quad 0 \\ 1 &\quad 1 &\quad 0 &\quad \cdots &\quad 0 &\quad 0 \\ 0 &\quad 1 &\quad 1 &\quad \cdots &\quad 0 &\quad 0 \\ \vdots &\quad \vdots &\quad \vdots &\quad \ddots &\quad \vdots &\quad \vdots \\ 0 &\quad 0 &\quad 0 &\quad \cdots &\quad 1 &\quad 0 \\ 0 &\quad 0 &\quad 0 &\quad \cdots &\quad 1 &\quad 1 \\ \end{array} } \right]$$
(2)

Thus, the general linear mapping is given by:

$${\mathbf{M}}_{Gray,m} = {\mathbf{M}}_{SP,m} \cdot {\mathbf{T}}_{m}$$
(3)
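
As a check of Eqs. (1)–(3), the conversion can be carried out over GF(2) in a few lines (a sketch with our own helper names, following the convention that the left-most column is the least significant bit):

```python
import numpy as np

def T(m):
    """Conversion matrix T_m of Eq. (2): identity plus first subdiagonal, over GF(2)."""
    t = np.eye(m, dtype=np.uint8)
    t[np.arange(1, m), np.arange(m - 1)] = 1
    return t

def sp_labels(m):
    """Natural (set partitioning) labeling M_{SP,m}: row i is the m-bit
    representation of i, least significant bit in the left-most column."""
    idx = np.arange(1 << m)
    return ((idx[:, None] >> np.arange(m)) & 1).astype(np.uint8)

M_gray = sp_labels(3).dot(T(3)) % 2   # Eq. (3); reproduces M_{Gray,3} of Eq. (1)
print(M_gray)
```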

3 Bit-Interleaved Polar Coded Modulation

Figure 1 depicts the overall operation on the input information vector \({\mathbf{u}}_{1}^{K}\), where \({\mathbf{u}}_{1}^{K} = [u_{1} , \ldots ,u_{i} , \ldots ,u_{K} ]\), \(u_{i} \in \{ 0,1\}\), denotes the input information vector at the transmitter. The generator matrix of the encoder is \({\mathbf{G}}_{N}\). Let \(W:{\mathcal{X}} \to {\mathcal{Y}}\) denote a DMC with input random variable \(X \in {\mathcal{X}}\), where \({\mathcal{X}}\) is an alphabet of size \(2^m\), and output random variable \(Y \in {\mathcal{Y}}\) with a finite alphabet \({\mathcal{Y}}\), \(\left| {\mathcal{Y}} \right| < \infty\).

Fig. 1: BIPCM architecture

The mutual information between input X and output Y is denoted by I(X;Y). Additionally, x and y denote the realizations of the input random variable X and the output random variable Y, respectively. For the AWGN channel, the original channel W can be modeled by

$$Y = X + N$$
(4)

where \(N \sim {\mathcal{N}}(0,\sigma^{2} )\) denotes a zero-mean Gaussian random variable with variance \(\sigma^{2}\). The input X has the probability mass function (PMF) \(p_{X} (x)\) over the input symbols \(x \in {\mathcal{X}}\).

Such a channel with discrete outputs can be represented by the conditional transition PMF \(p_{Y|X} (y|x)\). The mutual information of this channel is defined by

$$I(X;Y) = \sum\limits_{{x \in {\mathcal{X}}}} {\sum\limits_{{y \in {\mathcal{Y}}}} {p_{X} (x)p_{Y|X} (y|x)\log_{2} \frac{{p_{Y|X} (y|x)}}{{p_{Y} (y)}}} }$$
(5)

where \(p_{Y} (y) = \sum\nolimits_{{x \in {\mathcal{X}}}} {p_{X} (x)p_{Y|X} (y|x)}\). In general, there is no closed-form method to compute \(I(X;Y)\), and the results are obtained by numerical methods.

A channel with a continuous output alphabet \({\mathcal{Y}}\) of infinite size can be represented by a probability density function (pdf) \(f_{Y|X} (y|x)\). The mutual information of this channel is defined by

$$I(X;Y) = \int_{ - \infty }^{ + \infty } {\sum\limits_{{x \in {\mathcal{X}}}} {p_{X} (x)f_{Y|X} (y|x)\log_{2} \frac{{f_{Y|X} (y|x)}}{{f_{Y} (y)}}} dy}$$
(6)

The mutual information of a symmetric DMC is maximized when the input random variable X is uniformly distributed over the input alphabet \({\mathcal{X}}\). The mutual information can then be written as

$$\begin{aligned} I(X;Y) & = \int_{ - \infty }^{ + \infty } {\frac{1}{{\left| {\mathcal{X}} \right|}}\sum\limits_{{x \in {\mathcal{X}}}} {f_{Y|X} (y|x)\log_{2} \frac{{f_{Y|X} (y|x)}}{{f_{Y} (y)}}} dy} \\ & =I(W) \\ \end{aligned}$$
(7)

Since the capacity of a symmetric DMC with uniform input equals its mutual information, we obtain the capacity of such a channel:

$$C(W) = \mathop {\hbox{max} }\limits_{{p_{X} (x)}} I(X;Y) = I(W)$$
(8)

Similarly, a non-binary discrete memoryless channel can be partitioned into several symmetric binary discrete memoryless channels [12]. We define a binary-input symmetric discrete memoryless channel \(B:{\mathcal{X}}^{\prime} \to {\mathcal{Y}}^{\prime}\), where \({\mathcal{X}}^{\prime} = \{ 0,1\}\). The m copies of B are denoted by \(B^{m}\), i.e., the vector channel \(B^{m} = \{ B \times B \times \cdots \times B\}\), which represents the path from the output of the encoder to the input of the decoder. The vector channel can be partitioned into m ordered bit levels [18], \(\eta :B^{m} \to \left\{ {B^{(1)} ,B^{(2)} , \ldots ,B^{(m)} } \right\}\), with set partitioning (SP) or binary reflected Gray labeling rules.
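
The per-level capacities \(I(B^{(j)})\) can be estimated numerically. The following minimal Python sketch (an illustration, not the authors' implementation; all names are ours) Monte Carlo estimates \(I(B^{(j)}) = 1 - H(b_j|Y)\) for a Gray-labeled, unit-energy \(2^m\)-ASK constellation over AWGN:

```python
import numpy as np

def gray_labels(m):
    """Binary reflected Gray labels; bit j of point i in column j (LSB first)."""
    idx = np.arange(1 << m)
    g = idx ^ (idx >> 1)
    return ((g[:, None] >> np.arange(m)) & 1).astype(np.uint8)

def bit_level_capacity(m, labels, snr_db, n_samples=200_000, seed=0):
    """Monte Carlo estimate of I(B^(j)) = 1 - H(b_j | Y) for each label
    position j, assuming uniform inputs on a unit-energy 2**m-ASK/AWGN channel."""
    rng = np.random.default_rng(seed)
    M = 1 << m
    pts = 2 * np.arange(M) - (M - 1)
    pts = pts / np.sqrt(np.mean(pts ** 2.0))          # unit average symbol energy
    sigma2 = 10 ** (-snr_db / 10)
    idx = rng.integers(0, M, n_samples)
    y = pts[idx] + rng.normal(0.0, np.sqrt(sigma2), n_samples)
    lik = np.exp(-(y[:, None] - pts[None, :]) ** 2 / (2 * sigma2))  # p(y | x')
    caps = np.zeros(m)
    for j in range(m):
        p0 = lik[:, labels[:, j] == 0].sum(1) / lik.sum(1)   # P(b_j = 0 | y)
        p_true = np.where(labels[idx, j] == 0, p0, 1 - p0)
        caps[j] = 1 + np.mean(np.log2(np.maximum(p_true, 1e-12)))
    return caps

print(bit_level_capacity(3, gray_labels(3), snr_db=10.0))
```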

3.1 Encoding and Decoding of BIPCM

First, at the transmitter, the input information vector is encoded by a standard polar encoder, i.e., the information vector \({\mathbf{u}}_{1}^{K}\) multiplies the generator matrix \({\mathbf{G}}_{N}\) of the encoder, which generates the polar codeword \({\mathbf{v}}_{1}^{N}\) with code length \(N\;(N = 2^{n} )\). The code bits are interleaved by a bit-wise interleaver, which was designed in [13]. The interleaved bit streams are then divided into subsequences of m bits each. Each group of m bits is mapped to a signal point in \({\mathcal{X}},|{\mathcal{X}}| = 2^{m}\), with the labeling mapping \(\alpha :\{ 0,1\}^{m} \to {\mathcal{X}}\), e.g., the general mapping. The linear mapping can be characterized by matrix operations, i.e., Eqs. (1)–(3). The label positions \({\mathbf{b}} = (b_{1} \cdots b_{j} \cdots b_{m} )\) of symbol x transmit the code bits through the corresponding bit channels \(B^{(j)} ,\;j \in \{ 1,2, \ldots ,m\}\), independently.
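
This mapper stage can be sketched as follows (a minimal illustration assuming a unit-energy, Gray-labeled \(2^m\)-ASK constellation; `modulate` is our hypothetical helper, reusing `gray_labels` from the sketch above):

```python
import numpy as np

def modulate(v, m, labels):
    """Split the interleaved code bits into m-tuples and map each label
    (b_1 ... b_m) to its point of a unit-energy 2**m-ASK constellation."""
    M = 1 << m
    pts = 2 * np.arange(M) - (M - 1)
    pts = pts / np.sqrt(np.mean(pts ** 2.0))
    weights = 1 << np.arange(m)
    lut = np.empty(M, dtype=int)
    lut[labels.dot(weights)] = np.arange(M)   # m-bit label value -> point index
    tuples = np.asarray(v, dtype=np.uint8).reshape(-1, m)
    return pts[lut[tuples.dot(weights)]]

x = modulate([0, 0, 0, 1, 1, 1], m=3, labels=gray_labels(3))   # two 8-ASK symbols
```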

At the receiver, we use \({\mathcal{X}}_{b}^{j} = \left\{ {x \in {\mathcal{X}}: b_{j} (x) = b} \right\}\) to denote the set of constellation points with the binary label value \(b_{j} \in \{ 0,1\}\) at the j-th label position. We apply the general mapping to map the coded bits to the indices of the signal points with Gray or set partitioning labeling in the corresponding constellation, i.e., modulation, and obtain the symbol sequences after modulation. Next, the symbol sequences are sent over the AWGN channel W. During this transmission, the symbol sequences are corrupted by noise, e.g., AWGN, and in general the receiver must take measures to mitigate this noise or interference. After the symbol sequences have been transmitted over the wireless channel, the demodulator calculates the probabilities of the label positions \(b_{j} (x)\) based on the received symbols. In general, these probabilities are converted into log-likelihood ratios (LLRs) under the hypotheses that the transmitted value is 0 or 1. Finally, the LLR values are de-interleaved and sent to the decoders, which implement the various decoding algorithms and output the estimate \({\hat{\mathbf{u}}}_{1}^{K}\) of the information sequence.

3.2 Equivalent Channel Model

According to the definition of mutual information, the capacity of the equivalent channels of BIPCM can be shown to be achievable following the proofs in [19, 20]. Considering the code construction of polar codes and the BICM system, we transform the BIPCM model into m independent parallel equivalent binary-input channels \({\mathbf{B}} = \{ B^{(1)} , \ldots ,B^{(m)} \}\), as shown in Fig. 2.

Fig. 2: Block diagram of the equivalent channel model of the vector channel \(B^{m}\) in Fig. 1

Let the coded bit sequence \({\mathbf{v}}_{1}^{mN}\) be interleaved by the bit-wise interleaver \(\pi :i \to i^{\prime}\) with index \(i = (j - 1) \cdot N + k\) and \(i^{\prime} = (k - 1) \cdot m + j\), i.e., \(i^{\prime} = \pi (i)\) for all \(1 \le j \le m,\;1 \le k \le N\). The interleaved bit stream is divided into subsequences of m bits each.
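
Under this (1-indexed) index convention, the interleaver is simply a matrix transpose; a minimal sketch:

```python
import numpy as np

def interleave(v, m):
    """pi: input position i = (j-1)N + k  ->  output position i' = (k-1)m + j,
    i.e. bit k of stream j becomes the j-th label bit of the k-th symbol."""
    N = len(v) // m
    return np.asarray(v).reshape(m, N).T.reshape(-1)

def deinterleave(llrs, m):
    """Inverse permutation, applied to the demodulated LLR sequence."""
    N = len(llrs) // m
    return np.asarray(llrs).reshape(N, m).T.reshape(-1)
```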

The binary labeling mapping \(\alpha :\{ 0,1\}^{m} \to {\mathcal{X}}\) accomplishes the mapping operations step by step. Each label position \(b_{j} (x)\), \(j \in \{ 1,2, \ldots ,m\}\), corresponds to the bit-wise index of symbol x and is uniformly distributed over the subset of the modulation constellation with the fixed label position j; i.e., \(b_{j} (x)\) is a realization of a Bernoulli random variable with probability 1/2 in the j-th equivalent channel \(B^{(j)}\). At the decoder input, the de-interleaved soft messages form the LLR sequence

$${\mathbf{l}}_{1}^{mN} = \{ llr_{1} (v_{1} ), \ldots ,llr_{i} (v_{i} ), \ldots ,llr_{mN} (v_{mN} )\}$$
(9)
$$llr_{i} (v_{i} ) = \log_{e} \frac{{B({\mathbf{y}},\hat{v}_{1}^{i - 1} |v_{i} = 0)}}{{B({\mathbf{y}},\hat{v}_{1}^{i - 1} |v_{i} = 1)}}$$
(10)

where the vector \({\mathbf{y}} = (y_{1} ,y_{2} , \ldots ,y_{N} )\) denotes the received symbol sequence, which is also the input of the demodulator in Fig. 1.

Meanwhile, we let the vector \({\mathbf{x}} = (x_{1} , \ldots ,x_{N} )\) denote the output symbol vector of the modulator. The demodulator demodulates the received symbol sequence by the following “argmax” rule:

$${\hat{\mathbf{x}}} = \mathop {\arg \hbox{max} }\limits_{{\mathbf{x}}} q({\mathbf{x}},{\mathbf{y}})$$
(11)

where \(q({\mathbf{x}},{\mathbf{y}})\) denotes the decoding metric, defined as the product of symbol metrics \(q({\mathbf{x}},{\mathbf{y}}) = \varPi_{k = 1}^{N} q(x_{k} ,y_{k} )\). The decoder is equivalent to a mapping from the output alphabet \({\mathcal{Y}}\) to the sequence space that produces the estimate \({\hat{\mathbf{x}}}\) in (11).

The natural model of the symbol decoding metric is defined by:

$$q(x_{k} ,y_{k} ) = \prod\limits_{j = 1}^{m} {p_{j} \left( {b_{j} (x_{k} ),y_{k} } \right)}$$
(12)
$$p_{j} (b_{j} (x_{k} ),y_{k} ) = {\kern 1pt} \sum\limits_{{x^{{\prime }} \in X_{j}^{b} (x_{k} )}} {p(y_{k} |x^{{\prime }} )}$$
(13)
$$p_{j} (y_{k} |b_{j} (x_{k} )) = \frac{{p_{j} (b_{j} (x_{k} ),y_{k} )}}{{p_{j} (b_{j} (x_{k} ))}}$$
(14)

where q(xk, yk) denotes the symbol probability of the equivalent channel W with input symbol xk and output symbol yk. The notation pj(bj(xk),yk) denotes the joint probability of the j-th label position bj(xk) of symbol xk and the received symbol yk. In particular, we neglect the dependence between label positions within one symbol. The term pj(yk|bj(xk)) denotes the probability of the received symbol yk conditioned on bj(xk) at the j-th label position, and pj(bj(xk)) denotes the distribution of bj(xk).

The capacity of channel W can then be bounded as:

$$\begin{aligned} I(W) &= I(X;Y) \hfill \\ &\mathop \le \limits^{a} \sum\limits_{j = 1}^{m} {I(B^{(j)} )} \hfill \\ & = \sum\limits_{j = 1}^{m} {\sum\limits_{k = 1}^{N} {\sum\limits_{{x_{k} \in {\mathcal{X}}}} {\sum\limits_{{y_{k} \in {\mathcal{Y}}}} {p_{j} (b_{j} (x_{k} ))p_{j} (y_{k} |b_{j} (x_{k} ))} } } } \hfill \\ &\quad \times \log_{2} \frac{{p_{j} (y_{k} |b_{j} (x_{k} ))}}{{\sum\nolimits_{{b_{j} (x_{k} ) = 0}}^{1} {p_{j} (y_{k} |b_{j} (x_{k} ))} }} \hfill \\ \end{aligned}$$
(15)

where I(W) denotes the mutual information of channel W with input alphabet \({\mathcal{X}}\) and output alphabet \({\mathcal{Y}}\). The inequality (a) in (15) follows from the chain rule of mutual information [21].

At the receiver, the demodulator receives the symbol sequences and then employs Eqs. (13), (14), (16) and (17) to calculate the probabilities pj(bj(xk),yk) of each label position, without considering the dependence between bit levels within each received symbol. The log-likelihood ratio of the label position bj(xk), given the input symbol xk and output symbol yk, is then computed. According to equality (10), the following formula holds:

$$llr\left( {y_{k} |b_{j} (x_{k} )} \right) = \log \frac{{p_{j} (y_{k} |b_{j} (x_{k} ) = 0)}}{{p_{j} (y_{k} |b_{j} (x_{k} ) = 1)}}$$
(16)

where llr(yk|bj(xk)) is fed to the input of the de-interleaver. We then apply the de-interleaving operation to all llr(yk|bj(xk)). Thus, we have the following relation:

$$llr_{\pi (i)} (v_{\pi (i)} ) = llr(y_{k} |b_{j} (x_{k} ))$$
(17)

where \(\pi (i) = (k - 1) \cdot m + j\). After de-interleaving, we obtain the LLR sequence \({\mathbf{l}}_{1}^{mN}\).
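
Reusing the unit-energy constellation `pts` and the `gray_labels` matrix from the sketches above, the per-position LLR computation of Eq. (16) reads as follows (a sketch; a max-log demapper could replace the exact sums):

```python
import numpy as np

def bit_llrs(y, pts, labels, sigma2):
    """Eq. (16): llr(y_k | b_j) = log of the ratio of likelihoods summed over
    the subsets X_0^j and X_1^j, for every received symbol y_k and position j."""
    lik = np.exp(-(np.asarray(y)[:, None] - pts[None, :]) ** 2 / (2 * sigma2))
    m = labels.shape[1]
    out = np.empty((len(y), m))
    for j in range(m):
        out[:, j] = np.log(lik[:, labels[:, j] == 0].sum(1)
                           / lik[:, labels[:, j] == 1].sum(1))
    return out.reshape(-1)   # symbol-major order; deinterleave() yields l_1^{mN}
```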

The demodulator thus calculates the decoding metrics, which are sent to the de-interleaver, forming an LLR sequence \({\mathbf{l}}_{1}^{mN}\) of length mN. Finally, the estimate \({\hat{\mathbf{u}}}_{1}^{K}\) of the input message is recovered by the decoders. The decoding process is described in detail in the following section.

3.3 Design of BIPCM Based on the Equivalent Channel Model

First, we need to generate an efficient code construction. In general, we can apply channel polarization (Bhattacharyya parameter estimation), the Monte Carlo method, or Gaussian approximation (GA) to choose the index set \({\mathcal{A}}\) of the information bit channels. Let \((N,K^{(j)} ,{\mathcal{A}}^{(j)} ,{\mathbf{u}}_{{{\mathcal{A}}^{(j)c} }} )\) denote the constituent polar code for the j-th bit level of the equivalent channel model in Fig. 2.

We let B denote the underlying channel of each bit level (also called the equivalent channel), with \({\mathbf{B}} = \{ B^{(1)} , \ldots ,B^{(m)} \}\). The k-th bit channel at the \(\varphi\)-th decoding layer of the component polar code with code length N for the j-th bit level of the equivalent model is denoted by \(B_{k,\varphi }^{(j)}\), and its Bhattacharyya parameter by \(Z(B_{k,\varphi }^{(j)} ) = Z_{k,\varphi }^{(j)}\). The recursive relations are given as follows:

$$\left\{ {\begin{array}{*{20}l} {Z_{k,\varphi }^{(j)} \le 2Z_{{\left\lfloor {k/2} \right\rfloor ,\varphi - 1}}^{(j)} - \left( {Z_{{\left\lfloor {k/2} \right\rfloor ,\varphi - 1}}^{(j)} } \right)^{2} ,} \hfill & {b_{k}^{(j)} = 0} \hfill \\ {Z_{k,\varphi }^{(j)} = \left( {Z_{{\left\lfloor {k/2} \right\rfloor ,\varphi - 1}}^{(j)} } \right)^{2} ,} \hfill & {b_{k}^{(j)} = 1} \hfill \\ \end{array} } \right.$$
(18)

for \(j = \{ 1, \ldots ,m\}\), \(k = \{ 1, \ldots ,N\}\), and \(0 \le \varphi \le n,\;n = \log_{2} N\). In particular, \(b_{k}^{(j)} = b_{j} (x_{k} )\) denotes the binary value of the j-th label position in the k-th received symbol, and \(Z_{0} = Z(B^{(j)} )\).
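
A minimal sketch of this recursion (exact for a BEC, an upper bound otherwise; ordering follows the natural index convention):

```python
import numpy as np

def bhattacharyya(n, z0):
    """Upper bounds Z(B_k) of all N = 2**n bit channels of one level, by n
    applications of the recursion (18), starting from Z_0 = z0 (exact for a BEC)."""
    z = np.array([z0], dtype=float)
    for _ in range(n):
        worse = 2 * z - z ** 2          # 'minus' branch: Z <= 2Z - Z^2
        better = z ** 2                 # 'plus' branch:  Z' = Z^2
        z = np.ravel(np.column_stack((worse, better)))
    return z

z = bhattacharyya(10, 0.32)              # N = 1024 bit channels of a BEC(0.32) level
best = np.argsort(z)[:512]               # indices of the 512 most reliable channels
```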

Through the polarization operation, the cumulative symmetric capacity and reliability satisfy the following relations [4]:

$$\mathop {\lim }\limits_{N \to \infty } \frac{1}{N}\sum\limits_{k = 1}^{N} {I(B_{k}^{(j)} )} = I(B^{(j)} )$$
(19)
$$\frac{1}{N}\sum\limits_{k = 1}^{N} {Z(B_{k}^{(j)} ) \le Z(B^{(j)} )}$$
(20)

The normalized sum on the left side of formula (19) tends to I(B(j)) with probability 1 as the component code length \(N \to \infty\). Conversely, when the length of the component polar codes is finite, the following relation holds

$$\frac{1}{N}\sum\limits_{k = 1}^{N} {I(B_{k}^{(j)} )} \le I(B^{(j)} )$$
(21)

In particular, Eq. (19) relies on the independence among the different bit levels of the equivalent channel model in Fig. 2.

In BIPCM, let R(j) denote the code rate of the ordered bit channel B(j) in the bit vector channel \({\mathbf{B}} = \{ B^{(1)} , \ldots ,B^{(m)} \}\). We assume that channel W in Fig. 1 is the underlying \(2^m\)-ary discrete memoryless channel. The source bits of the BIPCM scheme are encoded by component binary polar encoders with code rate R(j) for the j-th bit level. The total code rate is defined by

$$R_{BICM} = \sum\limits_{j = 1}^{m} {R^{(j)} }$$
(22)
$$R^{(j)} = \frac{{\left| {{\mathcal{A}}^{(j)} } \right|}}{N}$$
(23)
$$R^{(j)} \le I(B^{(j)} )$$
(24)

where \(j \in \{ 1,2, \ldots ,m\}\) denotes the index of the bit levels. In this process, each R(j) is chosen with respect to the capacity I(B(j)) of the j-th duplicated bit channel B(j) of the underlying channel B.

The set \({\mathcal{A}}^{(j)}\) of indices of information bit channels is chosen based on the following rule with constraint condition (22):

$$\hbox{max} \sum\limits_{j = 1}^{m} {\sum\limits_{{k \in A^{(j)} }} {I(B_{k}^{(j)} )} }$$
(25)

Instead of analyzing the terms \(I(B_{k}^{(j)} )\) exactly, we approximate the capacities of all the bit channels \(\sum\nolimits_{j = 1}^{m} {\sum\nolimits_{{k \in {\mathcal{A}} \cup {\mathcal{A}}^{c} }} {I(B_{k}^{(j)} )} }\) by channel polarization, recursively computing the Bhattacharyya parameters to estimate the capacities or bit error rates (BER) of all the bit channels. We then pick the indices of the K best bit channels satisfying the constraint conditions of the different design requirements (19), (20), (24) to constitute the information bit index sets \({\mathcal{A}}^{(j)}\).

According to Arikan’s Proposition 2 [4], we denote the probabilities of block error for DMC channel W employing polar codes \((N,K,{\mathcal{A}},{\mathbf{u}}_{{{\mathcal{A}}^{c} }} )\) by \(P_{e} (N,K,{\mathcal{A}},{\mathbf{u}}_{{{\mathcal{A}}^{c} }} )\) with the following definitions:

$$P_{e} (N,K,{\mathcal{A}},{\mathbf{u}}_{{{\mathcal{A}}^{c} }} ) = P_{r} \left\{ {\hat{u}_{i} \ne u_{i} |{\hat{\mathbf{u}}}_{1}^{i - 1} = {\mathbf{u}}_{1}^{i - 1} ,i \in {\mathcal{A}},u_{i} \in F_{2} } \right\}$$
(26)
$${\mathcal{A}} = \bigcup\limits_{j = 1}^{m} {{\mathcal{A}}^{(j)} }$$
(27)

where the block error probabilities satisfy the following constraint:

$$P_{e} (N,K,{\mathcal{A}} ,{\mathbf{u}}_{{{\mathcal{A}}^{c} }} ) \le \sum\limits_{j = 1}^{m} {\sum\limits_{{k \in {\mathcal{A}}^{(j)} }} {Z_{k}^{(j)} } }$$
(28)

To obtain the Bhattacharyya parameter values, we first calculate the capacity I(B(j)) of each channel by formula (15), and then define the corresponding Bhattacharyya parameter via \(Z_{0} = 1 - I(B^{(j)} )\). When channel B(j) is a binary erasure channel (BEC), its Bhattacharyya parameter equals its erasure probability. The reliability criterion for the polar code \((N,K,{\mathcal{A}} ,{\mathbf{u}}_{{{\mathcal{A}}^{c} }} )\) is then the following rule:

$$\hbox{min} \sum\limits_{j = 1}^{m} {\sum\limits_{{k \in A^{(j)} }} {Z_{k}^{(j)} } }$$
(29)

In this paper, under the joint rate constraint in (22), we generate the code construction corresponding to the m independent parallel channels in Fig. 2. In other words, the best strategy for assigning individual rates to the m independent parallel component polar codes is to pick the information bit sets that minimize the sum in formula (29).

3.4 General Polar Code Construction Algorithm

The preceding subsection mainly introduced the Bhattacharyya parameters as upper bounds on the block error rate (BLER) of the component polar codes used to design BIPCM schemes over the equivalent channels. In fact, several other measures exist to estimate the quality of the polarized channels, e.g., the Monte Carlo method and Gaussian approximation. To simplify the description, we summarize the design process of the polar code construction with all these measures in the following general polar code construction algorithm.

Algorithm 1: General polar code construction

In this algorithm, the reliability parameter \(\xi (B_{k}^{(j)} )\) can be the Bhattacharyya parameter, BER, or BLER of a polarized channel. The BER and BLER can be estimated by the Monte Carlo method or Gaussian approximation.
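
A compact sketch of the selection step of Algorithm 1 (the function and variable names are ours; the example reuses the `bhattacharyya` helper from the earlier sketch as one possible metric):

```python
import numpy as np

def construct(xi, K):
    """xi[j, k]: reliability metric (Bhattacharyya bound, BER or BLER estimate)
    of bit channel k at level j; smaller is better. Picks the K globally best
    channels, which fixes the sets A^(j) and the per-level rates R^(j)."""
    m, N = xi.shape
    best = np.argsort(xi, axis=None)[:K]       # K smallest metrics overall
    A = [[] for _ in range(m)]
    for f in best:
        j, k = divmod(int(f), N)
        A[j].append(k)
    rates = [len(a) / N for a in A]            # Eq. (23); they sum to K/N, Eq. (22)
    return [sorted(a) for a in A], rates

# two hypothetical levels of length 256, using the Bhattacharyya sketch above
xi = np.vstack([bhattacharyya(8, 0.3), bhattacharyya(8, 0.5)])
A, rates = construct(xi, K=256)                # overall rate 0.5
```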

Complexity of BIPCM. We assume that all component polar codes have the same block length N, that the equivalent independent channel model has m levels (corresponding to \(2^m\)-order modulation), and that the list size of the SCL decoding algorithm is L. The encoding and decoding complexities of BIPCM are given by the following formulas.

The time complexities of encoding and decoding of BIPCM are:

$$\nabla_{T,E} = O(mN\log N)$$
(30)
$$\nabla_{T,D}^{SC} = O(mN\log N)$$
(31)
$$\nabla_{T,D}^{SCL} = O(mLN\log N)$$
(32)

where \(\nabla_{T,E}\) denotes the time complexity of encoding of BIPCM; \(\nabla_{T,D}^{SC}\) denotes the time complexity of SC decoder; \(\nabla_{T,D}^{SCL}\) denotes the time complexity of SCL decoder.

The space complexities of encoding and decoding of BIPCM denote by the following relations:

$$\Delta_{S,E} = O(N\log N)$$
(33)
$$\Delta_{S,D}^{SC} = O(N\log N)$$
(34)
$$\Delta_{S,D}^{SCL} = O(LN\log N)$$
(35)

where \(\Delta_{S,E}\) denotes the space complexity of BIPCM encoding; \(\Delta_{S,D}^{SC}\) denotes the space complexity of the SC decoder; \(\Delta_{S,D}^{SCL}\) denotes the space complexity of the SCL decoder.

We assume that the underlying system implements infinite-precision arithmetic at unit cost.

3.5 Decoding Processes of Different Decoding Algorithms

Considering the construction of polar codes, we can regard all component polar codes for m equivalent channels as a sequence of mN independent uses of the B-DMC.

Since polar codes are a class of recursively concatenated codes, the polarized probabilities of the bit channels can be calculated recursively. The decoding process can likewise exploit the same recursive structure in the reverse direction.

Successive Cancellation Decoding: In this section we examine the decoding process in Fig. 1. We denote the layer index of the total coded bit sequence by \(\lambda ,\;1 \le \lambda \le \theta ,\;\theta = { \log }_{2} (mN)\). There are \(\psi = 2^{\lambda }\) bit channels in the \(\lambda\)-th layer. The \(\tau\)-th bit channel in the \(\lambda\)-th layer is denoted by \(W_{\lambda }^{{{\prime }(\tau )}} ,\;1 \le \tau \le \psi\). The transition probabilities are defined by the following recursive forms [22]:

$$W_{\lambda }^{{{\prime }(\tau )}} \left\{ {{\mathbf{y}}_{1}^{\psi } ,{\mathbf{u}}_{1}^{\tau - 1} |u_{\tau } } \right\} = \sum\limits_{{u_{\tau + 1} = 0}}^{1} {\frac{1}{2}W_{\lambda - 1}^{{{\prime }(\left\lfloor {\tau /2} \right\rfloor )}} \left\{ {{\mathbf{y}}_{1}^{\psi /2} ,{\mathbf{u}}_{1,e}^{\tau - 1} \oplus {\mathbf{u}}_{1,o}^{\tau - 1} |u_{\tau } \oplus u_{\tau + 1} } \right\}W_{\lambda - 1}^{{{\prime }(\left\lfloor {\tau /2} \right\rfloor )}} \left\{ {{\mathbf{y}}_{{\psi /2 + 1}}^{\psi } ,{\mathbf{u}}_{1,o}^{\tau - 1} |u_{\tau + 1} } \right\}}$$
(36)
$$W_{\lambda }^{{{\prime }(\tau + 1)}} \left\{ {{\mathbf{y}}_{1}^{\psi } ,{\mathbf{u}}_{1}^{\tau } |u_{\tau + 1} } \right\} = \frac{1}{2}W_{\lambda - 1}^{{{\prime }(\left\lfloor {\tau /2} \right\rfloor )}} \left\{ {{\mathbf{y}}_{1}^{\psi /2} ,{\mathbf{u}}_{1,e}^{\tau - 1} \oplus {\mathbf{u}}_{1,o}^{\tau - 1} |u_{\tau } \oplus u_{\tau + 1} } \right\}W_{\lambda - 1}^{{{\prime }(\left\lfloor {\tau /2} \right\rfloor )}} \left\{ {{\mathbf{y}}_{{\psi /2 + 1}}^{\psi } ,{\mathbf{u}}_{1,o}^{\tau - 1} |u_{\tau + 1} } \right\}$$
(37)

For the \(\tau\)-th bit channel, the SC decoder outputs its decision by the following rule:

$$\hat{u}_{\tau } = \left\{ {\begin{array}{ll} {0,} \hfill & {\tau \in {\mathcal{A}}^{c} } \hfill \\ {\mathop {\arg \hbox{max} }\limits_{{u_{\tau } \in {\mathcal{F}}_{2} }} W_{\psi }^{{{\prime }(\tau )}} \left\{ {{\mathbf{y}}_{1}^{mN} ,{\mathbf{u}}_{1}^{\tau - 1} |u_{\tau } } \right\},} \hfill & {\tau \in {\mathcal{A}}} \hfill \\ \end{array} } \right.$$
(38)

To improve the reliability and decrease the implementation complexity of the proposed schemes, we obtain the soft-decision LLRs by combining the decoding messages in each layer via the following recursive calculations:

$$LLR_{\lambda }^{(\tau )} ({\mathbf{u}}_{1}^{\tau - 1} |{\mathbf{y}}_{1}^{\psi } ) = 2\tanh^{ - 1} \left( {\tanh \left( {\frac{{LLR_{\lambda - 1}^{{(\left\lfloor {\tau /2} \right\rfloor )}} ({\mathbf{u}}_{1,e}^{\tau - 1} \oplus {\mathbf{u}}_{1,o}^{\tau - 1} |{\mathbf{y}}_{1}^{\psi /2} )}}{2}} \right)\tanh \left( {\frac{{LLR_{\lambda - 1}^{{(\left\lfloor {\tau /2} \right\rfloor )}} ({\mathbf{u}}_{1,o}^{\tau - 1} |{\mathbf{y}}_{\psi /2 + 1}^{\psi } )}}{2}} \right)} \right)$$
(39)
$$\begin{aligned} LLR_{\lambda }^{{(\tau { + }1)}} ({\mathbf{u}}_{1}^{\tau } |{\mathbf{y}}_{1}^{\psi } ) & = ( - 1)^{{u_{\tau } }} LLR_{\lambda - 1}^{{(\left\lfloor {\tau /2} \right\rfloor )}} \left( {{\mathbf{u}}_{1,e}^{\tau - 1} \oplus {\mathbf{u}}_{1,o}^{\tau - 1} |{\mathbf{y}}_{1}^{\psi /2} } \right) \\ & \quad + LLR_{\lambda - 1}^{{(\left\lfloor {\tau /2} \right\rfloor )}} \left( {{\mathbf{u}}_{1,o}^{\tau - 1} |{\mathbf{y}}_{\psi /2 + 1}^{\psi } } \right) \\ \end{aligned}$$
(40)

where \(LLR_{\lambda }^{(\tau )} ({\mathbf{u}}_{1}^{\tau - 1} |{\mathbf{y}}_{1}^{\psi } ) = \ln \frac{{W_{\lambda }^{(\tau )} \{ {\mathbf{u}}_{1}^{\tau - 1} ,u_{\tau } = 0|{\mathbf{y}}_{1}^{\psi } \} }}{{W_{\lambda }^{(\tau )} \{ {\mathbf{u}}_{1}^{\tau - 1} ,u_{\tau } = 1|{\mathbf{y}}_{1}^{\psi } \} }}\). The decision rule for index \(\tau \in {\mathcal{A}}\) over the LLRs of the \(\theta\)-th layer reads

$$\hat{u}_{\tau } = \left\{ {\begin{array}{*{20}l} {0,} \hfill & {LLR_{\theta }^{(\tau )} ({\mathbf{u}}_{1}^{\tau - 1} |{\mathbf{y}}_{1}^{mN} ) > 0} \hfill \\ {1,} \hfill & {otherwise} \hfill \\ \end{array} } \right.$$
(41)

When \(\lambda = 0\), the right-hand sides of the two recursions (39) and (40) are initialized with the initial LLRs \({\mathbf{l}}_{1}^{mN} = \{ llr_{1} (v_{1} ), \ldots ,llr_{i} (v_{i} ), \ldots ,llr_{mN} (v_{mN} )\}\) of the equivalent label position sequence for the received symbol sequence \({\mathbf{y}}_{1}^{N}\). Upon reaching the last layer, i.e., \(\lambda = \theta\), the decoder estimates the elements of the input information vector \({\mathbf{u}}_{1}^{K}\). This successive cancellation decoding process has time complexity \(O(mN\log N)\), cf. (31). If the LLR values are quantized with \(Q_{SC}\) bits, the space complexity (34) of SC decoding can be derived as [23]:

$$\Delta_{SC} = (2mN + 1) \cdot Q_{SC} + mN - 1$$
(42)
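
The recursions (39) and (40) reduce to the familiar check-node and variable-node LLR updates; a minimal sketch (our function names; the min-sum variant mentioned in the comment is a common hardware simplification, not part of the original text):

```python
import numpy as np

def f_node(a, b):
    """Eq. (39), check-node rule: 2 artanh(tanh(a/2) tanh(b/2)). A common
    hardware simplification is min-sum: sign(a) sign(b) min(|a|, |b|)."""
    return 2 * np.arctanh(np.tanh(a / 2) * np.tanh(b / 2))

def g_node(a, b, u):
    """Eq. (40), variable-node rule with partial-sum bit u: (-1)^u a + b."""
    return (1 - 2 * u) * a + b

def decide(llr, frozen):
    """Eq. (41): frozen bits are set to 0; otherwise decide by the LLR sign."""
    return 0 if frozen or llr > 0 else 1
```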

However, when the length of the polar code is finite, Arikan's SC decoding is sub-optimal compared with ML decoding of polar codes. We therefore turn to the asymptotically ML-achieving algorithm, i.e., the SCL decoding algorithm.

SCL decoding algorithm: Instead of making a hard decision at the input of channel \(W_{\lambda - 1}^{{{\prime }\,(\left\lfloor {\tau /2} \right\rfloor )}}\), we consider \(W_{\lambda }^{{{\prime }\,(\tau )}}\) and \(W_{\lambda }^{{{\prime }\,(\tau + 1)}}\) in a decoding tree simultaneously. However, as the decoding level increases, the number of decoding paths grows exponentially. The SCL decoding algorithm offers a lower-complexity scheme that retains only the L most likely decoding paths and prunes the rest. The paths are ranked by the likelihood ratios \(W_{\lambda }^{{{\prime }\,(\tau )}} ({\mathbf{y}}_{1}^{\psi } ,{\mathbf{u}}_{1}^{\tau - 1} |u_{\tau } = 0)/W_{\lambda }^{{{\prime }\,(\tau )}} ({\mathbf{y}}_{1}^{\psi } ,{\mathbf{u}}_{1}^{\tau - 1} |u_{\tau } = 1)\). Here, L denotes the number of decision paths, i.e., the list size of the SCL decoding algorithm. A rule of thumb is that the larger the list size L, the better the reliability of polar codes under SCL decoding. However, the SCL decoding algorithm has \(O(LmN\log N)\) time complexity, so increasing the list size increases the time complexity linearly. Moreover, as the index \(\tau\) grows, the path probabilities become vanishingly small, which can cause numerical underflow. Three remedies are listed below.

  I. In each SC decoding layer, we normalize the N probabilities of the layer; since there are N probabilities in each of the \(\log N\) layers, the complexity of the normalization unit is \(O(N\log N)\). To avoid numerical underflow, we use the relations in (39) and (40) to calculate the posterior probabilities of the bit channels [24], and build and expand the decision decoding tree.

  II. Another way to avoid underflow: during the update of the terms \(W_{\lambda }^{{{\prime }\,(\tau )}}\), we use the maximum term \(\hbox{max} \{ W_{\lambda }^{{{\prime }\,(r)}} ({\mathbf{y}}_{1}^{\psi } |{\mathbf{u}}_{1}^{\tau + 1} )|r \in {\mathcal{A}} \}\) as a normalization factor to scale the remaining likelihood values in the \(\lambda\)-th layer. To further reduce the complexity of the normalization steps, the computations can be performed in the logarithmic domain.

  III. In [24], the authors proposed the LLR-based SCL decoding algorithm, which computes path metrics corresponding to (39) and (40) by making use of the posterior soft decisions \(LLR_{\lambda }^{(g)} ,\;g \in \{ 1, \ldots ,\tau \}\):

    $$PM_{l}^{\tau } = \sum\limits_{g = 1}^{\tau } {\ln \left( {1 + e^{{ - (1 - 2\hat{u}_{g} [l]) \cdot LLR_{\psi }^{(g)} [l]}} } \right)}$$
    (43)
    $$LLR_{\psi }^{(g)} [l] = \log_{e} \frac{{W_{\psi }^{(g)} \{ u_{1}^{g - 1} ,u_{g} = 0|{\mathbf{y}}_{1}^{\psi } \} }}{{W_{\psi }^{(g)} \{ u_{1}^{g - 1} ,u_{g} = 1|{\mathbf{y}}_{1}^{\psi } \} }}$$
    (44)

where \(PM_{l}^{\tau }\) denotes the \(\tau\)-th path metric at the l-th decision path among the L largest paths. Notation \(LLR_{\psi }^{(g)} [l]\) denotes the g-th LLRs of the information bits in the \(\psi\)-th layer at the l-th decision path. Term \(\hat{u}_{g} [l]\) denotes the estimate of the g-th bit at the l-th decision path.

The path metrics \(PM_{l}^{(\tau )}\) are updated successively by the decoder as:

$$PM_{l}^{(\tau )} = \varphi \left( {PM_{l}^{(\tau - 1)} ,LLR_{\psi }^{(\tau )} [l],\hat{u}_{\tau } [l]} \right)$$
(45)

where we have the setting for function \(\varphi\) with

$$\varphi :R^{2} \times \{ 0,1\} \to R$$
$$\varphi (a,b,c) = a + \ln \left( {1 + e^{ - (1 - 2c)b} } \right)$$
(46)

where a, b, and c represent \(PM_{l}^{(\tau - 1)} ,\;LLR_{\psi }^{(\tau )} [l]\), and \(\hat{u}_{\tau } [l]\) respectively.
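
As a minimal illustration of (45) and (46) (our function names; the second variant is the hardware-friendly approximation commonly associated with [24]):

```python
import numpy as np

def pm_update(pm_prev, llr, u_hat):
    """Eqs. (45)-(46): phi(a, b, c) = a + ln(1 + exp(-(1 - 2c) b))."""
    return pm_prev + np.log1p(np.exp(-(1 - 2 * u_hat) * llr))

def pm_update_approx(pm_prev, llr, u_hat):
    """Hardware-friendly approximation: penalize a path by |llr| only when its
    decision disagrees with the LLR sign, and by ~0 otherwise."""
    disagree = u_hat != (llr < 0)
    return pm_prev + np.where(disagree, np.abs(llr), 0.0)
```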

The method in [24] is a hardware-friendly algorithm that can boost the throughput per unit area. The time complexity is still \(O(LmN\log N)\). If we assume that the path metrics are quantized with \(Q_{PM}\) bits, the memory requirement of this decoder can be calculated as [24]:

$$\Delta_{PM} = ((mN - 1) \cdot L + mN) \cdot Q_{SC} { + }L \cdot Q_{PM} + (2mN - 1) \cdot L$$
(47)

Both the SCL and the LLR-based SCL decoding algorithms can further improve the error-correcting performance of BIPCM schemes with the aid of a cyclic redundancy check (CRC), at negligible computational cost and a small loss of code rate.

4 Performance Evaluation

In this section, we use the MATLAB 2016b simulation platform on a 64-bit Windows 7 operating system to evaluate the performance of groups of bit-interleaved polar coded modulation schemes.

We use Monte Carlo simulation to polarize the bit channels and search for the good bit channels with the lowest bit error ratios. We assume that the transmitted codewords are all-zero. In each iteration, for all BIPCM schemes with different decoding algorithms, we apply SC decoding to estimate the transmitted (all-zero) messages. After obtaining the BER of each bit channel, we sort the results and pick the indices of the given number of most reliable channels to compose the information sets. Unfortunately, the computational complexity of the Monte Carlo code construction is \(O(\delta \cdot N\log N)\), where \(\delta\) denotes the number of iterations. Hence, to obtain the frozen bit index sets accurately, we need a large amount of training data and a long runtime.
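
A compact genie-aided sketch of this Monte Carlo ranking for a single BPSK level (our simplification: min-sum updates and natural bit order without the bit-reversal permutation, which only permutes the indices):

```python
import numpy as np

def f_minsum(a, b):
    """Min-sum check-node rule (fast surrogate for the exact tanh rule)."""
    return np.sign(a) * np.sign(b) * np.minimum(np.abs(a), np.abs(b))

def genie_llrs(llr):
    """One genie-aided SC pass for the all-zero codeword: the decision LLR of
    every bit channel, assuming all earlier bits were decoded correctly (= 0)."""
    if len(llr) == 1:
        return llr
    h = len(llr) // 2
    a, b = llr[:h], llr[h:]
    return np.concatenate([genie_llrs(f_minsum(a, b)),   # upper branch
                           genie_llrs(a + b)])           # g(a, b, 0) = a + b

def mc_rank(n, snr_db, iters=1000, seed=0):
    """Rank the N = 2**n bit channels by Monte Carlo error counts, BPSK/AWGN."""
    rng = np.random.default_rng(seed)
    N, sigma2 = 1 << n, 10 ** (-snr_db / 10)
    errors = np.zeros(N)
    for _ in range(iters):
        y = 1 + rng.normal(0.0, np.sqrt(sigma2), N)   # all-zero word, 0 -> +1
        errors += genie_llrs(2 * y / sigma2) < 0      # genie decision errors
    return np.argsort(errors)        # most reliable bit channels first
```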

In this section, we simulate several bit-interleaved polar coded modulation schemes employing BPSK, 4-ASK (ASK4), 8-ASK (ASK8), and 16-ASK (ASK16) with Gray labeling, with a fixed code length N = 2048 and rate R = 0.5, under the SC, simple successive cancellation list (SSCL), and SCL decoding algorithms. Hereafter, we use the shorthand SSCL for the SCL decoding algorithm with a single decoding path, since the number of decoding paths of SCL can range from 1 to the length of the component polar codes. In particular, the schemes with ASK8 modulation cannot be labeled by a strict Gray labeling rule, so we employ a quasi-Gray labeling, since the Gray labeling condition cannot be met [25]. Across the different BIPCM schemes, we want to maximize the code performance, cf. (25), so we fix different target signal-to-noise ratios and set the number of iterations of the Monte Carlo code construction to \(1 \times 10^{5}\).

We apply Algorithm 1 to obtain the index sets of the information bit channels: Bhattacharyya parameters, the Monte Carlo method, and Gaussian approximation are used to estimate the performance of the polarized channels and to construct the index sets of the information and frozen bit channels under the criterion (29).

Figure 3 depicts the distribution of the capacities \(I(W_{mN}^{{{\prime }(i)}} )\) of all polarized bit channels for polar codes of length 256, 512, and 1024, respectively. In other words, it depicts the polarization effect of binary erasure channels with erasure probability \(\varepsilon = 0.32\) over the natural channel indices. As the polar code length increases, the proportion of channels whose capacities fall into the middle region between 0 and 1 becomes smaller and smaller; conversely, the proportion of channels with capacities close to 0 or 1 grows beyond that of the middle region. This phenomenon is the well-known channel polarization [4].

Fig. 3: Capacities of bit channels with natural indices

Figure 4 depicts the capacities, sorted in increasing order, of the bit channels with normalized indices. As the polar code length increases, the slopes of the curves become steeper; in other words, the rate of polarization approaches optimality with increasing code length. This is consistent with Fig. 3. Both Figs. 3 and 4 show that increasing the length of polar codes can improve their reliability at the cost of higher encoding and decoding complexity.

Fig. 4: Distributions of capacities over bit channels with increasing polar code length

Figure 5 depicts the capacity comparison of the bit levels of 16-ASK modulation. The solid curves denote the capacities of the different levels for polar codes with 16-ASK modulation. The dotted and dashed curves denote the capacities of the BIPCM schemes. The rates of the component polar codes for the different levels are randomly distributed in this figure. Evidently, Fig. 5 shows that such an assignment is not optimal for designing component codes; one should select the rates of the different levels according to (24).

Fig. 5: Capacities of different mapping levels for 16-ASK modulation and 4 levels of polarized channels

We assume that the transmitted information messages are all-zero and employ Algorithm 1, with the BER derived from the Monte Carlo method, on the bit channels of the polar code \((2048,1024,{\mathcal{A}},{\mathbf{u}}_{{{\mathcal{A}}^{c} }} )\), obtaining the information bit set \({\mathcal{A}}\) and the frozen bit set \({\mathcal{A}}^{c}\). Unfortunately, the Monte Carlo code construction has time complexity \(O(\delta \cdot N\log N)\), which increases linearly with the number of iterations \(\delta\). We therefore need to consider the trade-off between the reliability and the construction complexity of the polar codes.

Figure 6 shows the rate distributions of the component polar codes for the different levels of 16-ASK modulation in BIPCM. We set the length of the component polar codes to 1024 and the overall code rate to 0.5. The rates are the ratios of the sizes of the information bit sets of the component polar codes to the total code length, cf. (22) and (23), using the Monte Carlo code construction. These curves are obtained with the natural set partitioning mapping rule; the right-most column is the least significant column, corresponding to \({\mathbf{M}}_{SP,3}\) in (1). The code rate of level 1 is the lowest because the Euclidean distance of its corresponding subsets in the constellation partition is the smallest, which results in the worst error-correction performance. Conversely, level 4 has the best reliability owing to the protection of the large subset distance. In the highest SNR region, the code rates tend to the same value, showing that the rates become uniformly distributed over the bit levels as the signal transmission power increases.

Fig. 6: Code rates of component polar codes for different levels of 16-ASK modulation over the equivalent channel model under Monte Carlo code construction

Another method of approximating the BLER of BIPCM is Gaussian approximation [26], which uses the mean LLR of each polarized channel to approximate the BLER of the corresponding channel [27]. Algorithm 1 then uses these BLERs to construct the information set \({\mathcal{A}}\) by selecting the K best sub-channels among the N channels.

Figure 7 depicts the rate distributions of the constituent polar codes for the 4 levels of 16-ASK modulation over the equivalent channel model. The polar codes have overall code length 2048 and code rate 0.5. Figure 7 shows the rate distribution of each component code over the 4 modulation levels, cf. (22) and (23). We observe that level 1 has the lowest code rate and level 4 the highest over the whole SNR region. The curves of level 1 and level 4 reflect the partition procedure of the 16-ASK constellation, similar to Fig. 6. In addition, at the lowest and highest SNR regions, the code rate distributions of the bit levels are the most separated. We nevertheless apply the Monte Carlo code construction to simulate the random process of channel polarization, considering the stability of Monte Carlo estimates with large training sets.

Fig. 7: Rate distributions of information bit indices for component polar codes on 4 levels for 16-ASK modulation over the equivalent channel model

We design the component polar codes for the equivalent channel model by applying Algorithm 1, using the BER obtained from Monte Carlo simulation with \(1 \times 10^{5}\) iterations, over BPSK, 4-ASK (ASK4), 8-ASK (ASK8), and 16-ASK (ASK16) with Gray labeling, for polar codes with code length N = 2048 and rate R = 0.5 under the SC, SSCL, and SCL decoding algorithms, respectively. Among the different BIPCM schemes, to maximize the overall sum capacity, cf. (25), we fix several different target SNRs for the different modulation schemes [13].

Figure 8 compares the error-correcting performance of polar codes, LDPC, and Turbo codes over BICM with BPSK modulation. All codes have rate 0.5. The Turbo codes employ the generator polynomial \(g(d) = [1,{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {{(1 + d + d^{2} )} \mathord{\left/ {\vphantom {{(1 + d + d^{2} )} {(1 + d)}}} \right. \kern-0pt} {(1 + d)}}]\). The length of the polar codes is 2048 and that of the LDPC codes is 2304, the fixed construction of the WiMAX standard. The error-correcting performance of SCL with L = 16 outperforms the other schemes, with a 0.25 dB gain over LDPC at BLER \(10^{-3}\) and 1.5 dB over Turbo codes. The convergence of LDPC with min-sum decoding is better than that of polar codes with SCL decoding for SNRs above 2.2 dB. The reliability of the scheme with SCL decoding (L = 1) is similar to that of SC decoding. Turbo codes with max and linear log-MAP decoding have the worst convergence. Moreover, at high SNR, the convergence of SC decoding also degrades.

Fig. 8: Error-correcting performance of polar codes and LDPC over BICM schemes with BPSK modulation

Figure 9 depicts the performance of BIPCM schemes with BPSK modulation under the SC, SSCL, and SCL decoding algorithms with Gray labeling. We fix the code length to 2048 and the code rate to 0.5. At the \(1 \times 10^{-5}\) BER level, the scheme with SCL (\(L = 4\)) decoding outperforms the SSCL scheme by about 0.7 dB and the SC scheme by about 0.8 dB. A similar gap appears at the \(1 \times 10^{-3}\) BLER level. Meanwhile, the BER and BLER curves show almost identical trends, since the BER is closely related to the BLER. The performance of the SSCL and SC decoding schemes is similar, but at higher SNR the convergence of the SSCL scheme is better than that of SC decoding.

Fig. 9: BER and BLER performance of polar codes with BPSK modulation under SC, SSCL, and SCL decoding algorithms

Figure 10 depicts the polar coded ASK4 BICM scheme under the three decoding algorithms. The error-correcting performance of the SCL scheme outperforms the others. Comparing Figs. 9 and 10, we find that the convergence of SCL improves toward the end of the curves with the equivalent partition of the encoded bit sequence in Fig. 2. The scheme with SCL (\(L = 4\)) decoding gains 0.9 dB at BER \(1 \times 10^{-5}\) over the two BICM schemes with SC and SSCL decoding. The BLER of the SCL scheme reaches \(1 \times 10^{-3}\) at an SNR of 4.5 dB, a 0.5 dB gain over the two competing schemes. The curves of the BICM schemes with SC and SSCL decoding almost coincide; however, toward the end of the SC curve the rate of decrease slows, i.e., the convergence of the SC algorithm is worse than that of SSCL.

Fig. 10: BER and BLER comparison among different decoding algorithms with code length N = 2048 and code rate R = 0.5 in the polar coded ASK4 BICM scheme

Figure 11 shows that the BIPCM scheme with SCL decoding gains about 0.9 dB and 0.7 dB over the schemes with SSCL and SC decoding, respectively, at BER \(1 \times 10^{-5}\). This indicates that, in polar coded modulation schemes, the error-correcting performance of the SSCL decoding algorithm does not always remain better than that of SC decoding as the modulation order increases.

Fig. 11: BER and BLER comparison among different decoding algorithms with code length N = 2048 and code rate R = 0.5 in the polar coded ASK8 BICM scheme

Figure 12 shows that, as the modulation order increases, the reliability improvements of the equivalent channel model with SCL decoding remain remarkable, achieving 0.5 dB and 0.6 dB gains over the competing SSCL and SC schemes at BER \(10^{-3}\) and BLER \(10^{-5}\), respectively. Nevertheless, the SCL and SC decoding schemes fluctuate to some degree at high SNR. Conversely, the BER and BLER curves of the SSCL schemes are more stable than those of the other decoding algorithms.

Fig. 12: BER and BLER comparison among different decoding algorithms for ASK16 over the BICM scheme

From the above figures, we find that the BER and BLER curves of the schemes with SCL decoding have the best convergence. At the same time, the SSCL decoding schemes show more stable curves than the schemes with SC decoding.

Figure 13 compares the BER and BLER performance of BIPCM schemes with multi-order modulation under SCL (L = 4) decoding. Since we employ the Monte Carlo method to construct the information sets used to transmit the information bits, we set distinct simulation starting points for the different modulation schemes to maximize the coding performance with respect to (25). Under the same decoding algorithm, the BIPCM scheme with BPSK modulation shows the fastest decrease in both BER and BLER. The convergence gaps among the BPSK, ASK4, ASK8, and ASK16 schemes widen as the spectral efficiency, i.e., the modulation order, increases. In other words, to improve the spectral efficiency at the same BER and BLER performance, more signal power has to be spent within a finite range.

Fig. 13: BER and BLER performance of polar coded multi-order modulation under SCL (L = 4) decoding

5 Conclusion

Through analyzing the construction of polar codes and the design of bit-interleaved coded modulation, we find that BIPCM is equivalent to a channel model composed of several independent parallel transmission channels when coding and modulation are considered jointly in practical applications. We propose a general code construction algorithm to design the constituent polar codes for these equivalent parallel channels. We then employ a general bijective mapping to map the coded bits to signal points in the constellation with Gray or set partitioning labeling rules under different polar decoding algorithms. At the receiver, the demodulator calculates the decoding metrics, composed of the LLRs of the symbol sequences received over AWGN channels. The decoders apply the SC, SSCL, and SCL decoding algorithms to estimate the transmitted information bits and recover the original messages. Simulation results show that the BIPCM schemes with SCL decoding have the best error-correcting performance among all the decoding algorithms. The schemes with SSCL and SC decoding have similar performance; however, at high SNR, the SC schemes become less stable than the SSCL schemes. Future work will focus on non-binary-input discrete memoryless channels with non-binary component polar codes in BIPCM.