1 Protection as a coding problem

Cryptographic algorithms are subject to attacks aiming at extracting their keys. When the adversary has access to the device, he is able to target the implementation of the cryptographic algorithm. Two attack paths customarily encountered are side-channel attacks (where the attacker reads some leakage from the implementation while it is running) and fault injection attacks (where the attacker modifies some intermediate variables inside the implementation).

In this article, we analyze algorithmic protections combining both side-channel prevention and fault injection detection. We survey security models for a given set of security parameters. In general, several such models can be defined, each addressing a particular kind of attacker. Proving the equivalence, or even a reduction, between security models is hard and is currently at the core of intensive research. However, when one focuses on a specific implementation operated in a given context, security notions can be clarified (e.g., shown to be equivalent).

In this paper, we focus on a protection against side-channel and fault injection attacks where the state of the cryptographic algorithm is encoded. From the security model and its parameters, we can thus derive desirable protection properties. Those result from a statistical analysis of the leakage in the presence of countermeasures.

Contributions

Regarding side-channel analysis protections, we identify that the inner product masking scheme [3, 58] is an instance of the leakage squeezing protection (see [18, 19, 37, 38] for 2 shares, and [16] for strictly more than 2 shares) using linear bijections (Section 6.4). The papers about the inner product masking scheme explain the engineering aspects related to the secure computation of finite field laws (addition and multiplication), whereas the papers about leakage squeezing highlight the accurate security level of the data representation. In this article, we bridge the gap by showing how to design inner product masking schemes with a quantifiable security level against bit-level side-channel attacks. For the first time, we relate the dual distance of the code used in the countermeasure, the mutual information between sensitive variable and leakage, and the attack success rate.

A second contribution of this paper is to analyze joint side-channel and fault attack protections. Specifically, we issue a warning: fault protections and side-channel protections combine nicely only if a careful analysis of their combined implementation is carried out (Section 6.3). Without such an analysis, the combination can be destructive security-wise.

Finally, we present a novel method to derive Boolean codes from codes over \(\mathbb {F}_{2^{k}}\) (Section 7).

Outline

The rest of the paper is structured as follows. We start in Section 2 by explaining how error correcting codes can provide a protection against both side-channel and fault injection attacks. Then, we review in Section 3 existing security models, and select some of them. Relevant security parameters are given in Section 4. The impact on the architecture of the protections is then analyzed in Section 5. Known constructions are revisited in Section 6, and a new one is given in Section 7. Our contributions beyond the state of the art are in Sections 6 and 7. Finally, conclusions are drawn in Section 8.

2 Introduction

2.1 Principle of coding

One purpose of codes is to detect (and correct) errors. Another purpose is to allow multiple users to use the same channel without interference, while maximizing the use of its capacity. In the context of protections against side-channel attacks, one user will be the cryptographic computation, and the others are noise sources, which aim at making the leakage passing through the channel as difficult as possible to interpret for an eavesdropper. Clearly, this dual use of codes allows one to kill two birds with one stone, which makes it appealing.

Let us insist more in detail on the protection against side-channel attacks. We denote by \(X{\in \mathbb {F}_{2}^{k}}\) (where \({\mathbb {F}_{2}^{k}}\) is \(\{0,1\}^{k}\) equipped with an additive group law, denoted by “⊕”) a sensitive variable that we intend to protect. It is usually a word, of bit length k. As usual in statistics, we shall use capital letters (such as X) for random variables, and small letters (such as x) for their realizations. The AES [45] block cipher will be our running example, because it is very widespread in the field and is well studied in academic papers. As AES is byte-oriented, we will consider that every variable can be represented by one or more bytes, hence k = 8 bits. In a cryptographic implementation, such a variable leaks some non-injective and noisy information. The non-injective function is denoted as \(\varphi :{\mathbb {F}_{2}^{k}}\to \mathbb {R}\), and N denotes the additive noise. Both are represented in Fig. 1, as well as the leakage \(X \rightsquigarrow \varphi (X)+N\).

Fig. 1 Leakage arising from the manipulation of variable X

Typically, φ is an extensive function (that is, it is the weighted sum over \(\mathbb {R}\) of its coordinates), such as the Hamming weight (denoted as \(w_{H}\)). This model is attested in many devices, such as smartcards, whose leakage is analyzed in Fig. 2 [31].
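
As a concrete illustration of this leakage model (our own sketch, not part of the original material; the byte width and the noise level are merely illustrative), the following Python snippet draws noisy Hamming-weight leakage samples as in Fig. 1:

import random

k = 8            # bit width of the sensitive variable (an AES byte)
sigma = 1.0      # standard deviation of the additive noise N (illustrative)

def hamming_weight(x: int) -> int:
    return bin(x).count("1")

def leak(x: int) -> float:
    """One noisy, non-injective leakage sample for the value x: phi(x) + N."""
    return hamming_weight(x) + random.gauss(0.0, sigma)

print([round(leak(0x3B), 2) for _ in range(3)])            # three noisy samples of one value
print(sorted({hamming_weight(x) for x in range(2**k)}))    # phi collapses 256 values onto 9 classes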

Fig. 2 Decomposition of the leakage per value of φ(X) = \(w_{H}(X)\) ∈ {0,…,8}

To protect against a straightforward analysis of the leakage, the masking countermeasure was initially presented by Thomas S. Messerges [40] as a two-step process:

  1. the algorithmic parameters (e.g., substitution boxes) are recomputed for a given, randomly chosen, mask by replacing each sensitive variable X by \((X \oplus \bigoplus _{i = 1}^{t} Y_{i}, Y_{1},Y_{2},{\ldots } ,Y_{t})\), where t is some security parameter and where the \(Y_{i}\)'s are chosen randomly and independently in the same additive group \({\mathbb {F}_{2}^{k}}\) as X, and then

  2. the masked algorithm is executed with masked plaintext and masked key as inputs.

While this strategy works well from a theoretical point of view, some criticisms have emerged over time:

  • from a security point of view, it has been noted that the recomputation stage (which does not depend on the key) leaks a lot of information, which can be combined in a constructive way with the leakage of the algorithm execution (where masked sensitive data, that is, data which depend on the key and on inputs/outputs known by the attacker, are used), and that such an attack path is hard to counter [14, 47, 55],

  • from a performance point of view, the recomputation takes a longer time (Footnote 1) than the execution of the recomputed algorithm, which obviously limits the advantage of such a solution.

Therefore, solutions which are free from the preliminary recomputation stage are favored in practice in many applications (except low-cost smartcards, which do not have enough resources to get rid of the security-wise weak table recomputation stage). Historically, the data and the masking material are processed together during the execution of the algorithm. For instance, in the case where t = 1 above, the computation is organized by duplicating the state: one half contains the masking material \(Y{\in \mathbb {F}_{2}^{k}}\), whereas the second half contains the masked data \(X\oplus Y{\in \mathbb {F}_{2}^{k}}\). This is illustrated in Fig. 3. It shall be noted that the leakage is now bi-variate, hence harder to exploit by the attacker, because the latter must combine two values to recover useful information. However, some implementations manage to handle X ⊕ Y and Y side-by-side; when the non-injective leakage function φ is extensive, we thus have φ(X ⊕ Y, Y) = φ(X ⊕ Y) + φ(Y), hence it is convenient to describe the masking as an encoding of (X, Y). Namely, the sensitive variable X is encoded by a linear code of generating matrix (I ∥ 0), the mask is encoded using the repetition code of generating matrix (I ∥ I), where I is the k × k identity matrix over \(\mathbb {F}_{2}\), and these two codewords are added together.

Fig. 3 Leakage arising from the manipulation of the masked variable \(X \oplus Y\) and of its (single) mask Y, here of the same size k = 8 as that of X

Thus, we see that protection against side-channel attacks can also be expressed in terms of codes. In the previous example, the two binary codes are:

  1. C, of parameters [n = 2k, k, 1], of generating matrix (I ∥ 0), and

  2. D, of parameters [n = 2k, k, 2], of generating matrix (I ∥ I),

such that any element \(Z=(X \oplus Y, Y) {\in \mathbb {F}_{2}^{n}}\) is the direct sum of the encoding of X through C and of Y through D.
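
As a sanity check of this encoding view (our own sketch; we assume row-vector-times-matrix products over \(\mathbb {F}_{2}\) and least-significant-bit-first bit vectors), the following Python snippet builds the generating matrices (I ∥ 0) and (I ∥ I) and verifies that x G ⊕ y H is exactly the pair (x ⊕ y, y):

import random

k = 8

def bits(x, width=k):
    return [(x >> i) & 1 for i in range(width)]            # LSB-first bit vector

def from_bits(v):
    return sum(b << i for i, b in enumerate(v))

def vec_mat(v, M):
    """Row vector times matrix, over F_2."""
    return [sum(v[i] & M[i][j] for i in range(len(v))) % 2 for j in range(len(M[0]))]

I = [[1 if i == j else 0 for j in range(k)] for i in range(k)]
O = [[0] * k for _ in range(k)]
G = [I[i] + O[i] for i in range(k)]     # (I || 0): code C, parameters [2k, k, 1]
H = [I[i] + I[i] for i in range(k)]     # (I || I): code D, parameters [2k, k, 2]

x = random.randrange(2**k)              # sensitive variable
y = random.randrange(2**k)              # uniformly drawn mask
z = [a ^ b for a, b in zip(vec_mat(bits(x), G), vec_mat(bits(y), H))]

# The codeword z is exactly the pair (x XOR y, y) handled by the implementation.
assert from_bits(z[:k]) == x ^ y and from_bits(z[k:]) == y
print("z =", z)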

This coding approach is well suited to the physical leakage as represented in Fig. 1, since side-channel analysis can be reinterpreted as a decoding problem: the aim of the attacker is the recovery of X after its encoding with masks and its transformation through the non-injective (owing to φ) and noisy (owing to N) leakage function. Notice that high-order masking schemes are examined in greater detail, under the coding-theoretic viewpoint, in Section 6.1.

However, we stress that the attacker has other means to recover information on Z (X after coding):

  • with a probing station, the attacker is able to read and/or write selected bits,

  • on multicore platforms running an operating system, cache hit/miss probing (Footnote 2) can be used as an attack, especially if data are used as addresses to memories.

2.2 Design choice

In the previous section, we took the example (recall Fig. 3) of a mask (Y) and information (X) of the same bitwidth. However, we have already seen that this can be more general, with a value of t larger than 1 in traditional masking. Typically, Y can be made smaller (as small as 1 bit, e.g., in [8, 40, 54]; for instance, in [40], a 1-bit masking is used to perform a Boolean-to-arithmetic transform). But also, for enhanced security, Y can be larger than X, especially in so-called high-order masking schemes [52] (Footnote 3). The general encoding of X using linear codes is as follows:

$$ Z = X G \oplus Y H , $$
(1)

where:

  • G is the generating matrix of a code of length n and of dimension k, and

  • H is the generating matrix of a code of length n and of dimension (n − k).

Typically, k = 8 bits for AES. For high-order protections, the masks are used as multiple k-bit words. Therefore, a typical setting is that (n − k) is a multiple of k.

However, probing attacks do target individual bits.

Therefore, we will consider two kinds of codes: codes over \(\mathbb {F}_{2^{k}}\) and codes over \(\mathbb {F}_{2}\). Notice that a code over \(\mathbb {F}_{2^{k}}\) can be expanded over \(\mathbb {F}_{2}\). In MAGMA [56], this operation can easily be realized on a code C using the command

$$\texttt{C\_expanded := SubfieldRepresentationCode(C, GF(2));}$$

If C has parameters \([n, k, d]_{2^{m}}\), then C_expanded has parameters \([mn, mk, d^{\prime}]_{2}\), where \(d^{\prime} \geq d\). A concrete example will be given in Section 7.
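
For readers without access to MAGMA, the expansion can be emulated as follows (our own Python sketch; it fixes the polynomial basis {1, X, …, X^{m−1}}, whereas MAGMA may pick another basis, yielding an equivalent code):

def gf_mul(a, b, poly, m):
    """Multiply a and b in GF(2^m) defined by the irreducible polynomial 'poly' (bitmask)."""
    r = 0
    for i in range(m):
        if (b >> i) & 1:
            r ^= a << i
    for i in range(2 * m - 2, m - 1, -1):      # reduce modulo poly
        if (r >> i) & 1:
            r ^= poly << (i - m)
    return r

def expand_generator(G, poly, m):
    """Expand a k x n generator over GF(2^m) into an (mk) x (mn) binary generator."""
    rows = []
    for grow in G:
        for j in range(m):                     # binary row corresponding to X^j times grow
            row = []
            for entry in grow:
                e = gf_mul(entry, 1 << j, poly, m)
                row += [(e >> i) & 1 for i in range(m)]
            rows.append(row)
    return rows

# Example: the [2, 1, 2] code over GF(16) = F_2[X]/(X^4 + X + 1) generated by
# (1 || 1 + X) expands into a binary [8, 4] code (its parameters are discussed in Section 7).
for row in expand_generator([[0b0001, 0b0011]], poly=0b10011, m=4):
    print(row)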

3 Security models

3.1 Side-channel analysis

Masking consists in adding some randomness to the computations, which forces the attacker to perform a high-order attack, a process during which several leakage sources are combined. In turn, if the leakage samples are noisy, the combination results in a so-called noise amplification.

There are mainly two security models:

  • Probing model (cf. Section 3), as in [32] (and many other papers [9, 23, 36, 50] which stem from this seminal publication).

  • Bounded moment model (cf. Section 3), initially defined in [44, Section 4], and then reintroduced in [6].

Probing model

The probing model states the following:

Definition 1 (Probing model)

A masking scheme is secure at order t in the probing model if no tuple of t intermediate variables depends on the secret.

An unprotected implementation is secure at order t = 0 (recall Fig. 2). A protected implementation is secure at order t ≥ 1.

When the algorithm handles bitvectors (elements of \({\mathbb {F}_{2}^{k}}\)), there is an ambiguity as to whether Definition 1 refers to intermediate variables as bitvectors or as individual bits. Thus, in the sequel, we shall clarify this point when talking about the probing model.

An automated method to test for the security of an algorithm with respect to this model at bitvector-level is given in [4, 5].

Bounded moment model

The bounded moment model states the following:

Definition 2 (Bounded moment model)

A masking scheme is secure at order t in the bounded moment model if no moment of degree t in the intermediate variables depends on the secret.

With this definition, we also have that an unprotected implementation is secure at order t = 0, while a protected implementation is secure at order t ≥ 1.

Definition 2 was initially introduced in the context of low entropy masking schemes (LEMS [8, 44]). The concept has been rediscovered independently [42, 43] by noting that attacks at many orders are possible, but that in usual situations (see the exception in [14]) the lowest-order attack is the most successful.

Reductions between leakage security models are studied in [6]. When the probing and bounded moment models are considered at the bit level, they are equivalent (see Theorems 9 and 10 of [30]).
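
As a small numerical illustration of Definition 2 (our own sketch, for first-order Boolean masking under a Hamming-weight leakage; the shares are taken noiseless since independent zero-mean noise does not change how these moments depend on the secret), one can check that the first-order moments are constant in x while a degree-2 mixed central moment is not:

k = 4
wH = lambda v: bin(v).count("1")

def moments(x):
    l1 = [wH(x ^ y) for y in range(2**k)]      # leakage of the masked data
    l2 = [wH(y) for y in range(2**k)]          # leakage of the mask
    m1, m2 = sum(l1) / 2**k, sum(l2) / 2**k
    mixed = sum((a - m1) * (b - m2) for a, b in zip(l1, l2)) / 2**k
    return m1, m2, mixed

for x in range(2**k):
    m1, m2, mixed = moments(x)
    print(f"x={x:2d}  E[L1]={m1}  E[L2]={m2}  E[(L1-E[L1])(L2-E[L2])]={mixed:+.2f}")
# E[L1] and E[L2] equal k/2 for every x (security at order 1), whereas the
# degree-2 mixed moment equals (k - 2*wH(x))/4, hence depends on the secret.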

3.2 Fault injection analysis

Protection of block ciphers with codes is a topic which has been studied for a long time [2, 35]. Basically, the security metric relates to the error detection probability of the code. However, we notice that few constructions have simultaneously tackled protection against both side-channel and fault injection analyses.

3.3 Combination of side-channel and fault injection

The ODSM countermeasure (analyzed in Section 6.3) is the first joint protection against side-channel and fault injection analyses. Carlet et al. noticed that masks are not sensitive by themselves (in that they do not leak information “standalone”); thus, faults can be detected by verifying that the masks have not been altered. This strategy is all the more relevant in first-order masking schemes, where security can be attained by reusing the same mask throughout the algorithm to be protected, hence the possibility to perform the integrity check at any arbitrarily chosen time while the algorithm unfolds. A careful warning is nonetheless formulated in Section 6.3.

4 Security parameters

Security at order one is nowadays considered insufficient for most practical operational environments. Indeed, many attacks defeating first-order protections (such as second-order correlation power analysis [41], collision-correlation [26], MIA [7], etc.) are known and well mastered by most adversaries.

Regarding fault injection attacks, it is known that very powerful exploitation techniques exist for block ciphers [33]. Thus, once again, detecting a single fault is insufficient.

However, it shall be noted that some palliative countermeasures are usually implemented in addition to the two abovementioned curative countermeasures. Palliative countermeasures typically consist in the artificial insertion of horizontal noise (desynchronized start dates, random interrupts, dummy (decoy) operations, etc.), which drastically raises the bar for succeeding in higher-order attacks.

To conclude, second-order resistance (t ≥ 2) to both side-channel analysis and fault injection is, in most cases, sufficient if well complemented by other protection means, in a construction known as defense in depth.

5 Architectural options for protection

Protecting against both side-channel and fault injection attacks can rely on the mask verification strategy of ODSM. More generally, one can imagine implementing orthogonal protections on top of each other. Both approaches have pros and cons:

  • encode then mask raises no security issue. Encoding increases the data bitwidth while making the encoded data redundant, thus reducing the density of the new sensitive variable; however, this is harmless, as masking remains secure even if the variable to protect is not uniformly distributed (which is the case here, since the sensitive variable belongs to a codebook). “Encode then mask” suffers more from performance than from security issues: after encoding, the sensitive variable encounters a blow-up in size corresponding to the inverse of the code rate, and after application of the side-channel protection, this overhead is multiplied by the order of the masking scheme.

  • mask then encode is thus more efficient in terms of variable size growth. But care must be taken regarding the way the redundancy is applied. Indeed, encoding with a linear code consists in computing some redundancy on top of the masked data, and this redundancy is a linear transformation. It is well known that some linear transformations combine destructively with the masking: e.g., adding all the shares together clearly unmasks the data completely. Besides, computing on an encoded state becomes non-obvious. The only proposal in this direction is [51], which handles security at the bit level.

In addition to those considerations, it shall be noticed that verification can be performed either at word or at bit level. Further investigations are left for future work.

6 Some known constructions revisited

In this section, we present several masking schemes through the prism of coding theory. We highlight the links between their definition and their security level. The perfect additive masking (Section 6.1) is typically word-oriented.

6.1 Perfect additive masking scheme [9]

In this section, we answer the question “why is masking an encoding?”. Actually, it is straightforward to show that share-based masking schemes (e.g., [27, 50]) consist in encodings. We denote by t the order of the masking, and by d = t + 1 the number of shares, which are elements of \({\mathbb {F}_{2}^{k}}\). The protection rationale is as follows:

  • \(x{\in \mathbb {F}_{2}^{k}}\) is the clear data,

  • \(y=(y_{1}, y_{2}, \ldots , y_{t})\in ({\mathbb {F}_{2}^{k}})^{t}\) are the masks, and the protected data is:

  • \(z=(x\oplus \bigoplus _{i = 1}^{t} y_{i}, y_{1}, y_{2}, {\ldots} , y_{t})\in ({\mathbb {F}_{2}^{k}})^{d}\).

So we have n = d × k = (t + 1) × k, and z = x Gy H, where

$$ G = \left( \begin{array}{ccccc} I_{k} & 0 & 0 & {\cdots} & 0 \end{array}\right) \quad \text{and} \quad H = \left( \begin{array}{ccccc} I_{k} & I_{k} & 0 & {\cdots} & 0 \\ I_{k} & 0 & I_{k} & {\cdots} & 0 \\ {\vdots} & {\vdots} & {\vdots} & {\ddots} & {\vdots} \\ I_{k} & 0 & 0 & {\cdots} & I_{k} \end{array}\right). $$
(2)

Notice that \(G H^{T} \neq 0\), thus the codes generated by G and H are not complementary dual [22].
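
The encoding (2) can be checked mechanically; the sketch below (ours, with the same row-vector conventions as before and illustrative values of k and t) builds G and H for t masks of k bits, verifies that x G ⊕ y H is the share vector (x ⊕ y_1 ⊕ … ⊕ y_t, y_1, …, y_t), and confirms that G H^T ≠ 0:

import random

k, t = 4, 3
n = (t + 1) * k

def eye(m):
    return [[1 if i == j else 0 for j in range(m)] for i in range(m)]

def hcat(*blocks):
    return [sum((b[i] for b in blocks), []) for i in range(len(blocks[0]))]

def vec_mat(v, M):                             # row vector times matrix, over F_2
    return [sum(v[i] & M[i][j] for i in range(len(v))) % 2 for j in range(len(M[0]))]

bits = lambda x, w=k: [(x >> i) & 1 for i in range(w)]

I, O = eye(k), [[0] * k for _ in range(k)]
G = hcat(I, *([O] * t))                                        # (I 0 0 ... 0)
H = [row for i in range(t)
         for row in hcat(I, *[I if j == i else O for j in range(t)])]

x = random.randrange(2**k)
y = [random.randrange(2**k) for _ in range(t)]
z = [a ^ b for a, b in zip(vec_mat(bits(x), G),
                           vec_mat(sum((bits(yi) for yi in y), []), H))]

xor_masks = 0
for yi in y:
    xor_masks ^= yi
assert z[:k] == bits(x ^ xor_masks)                            # first share
assert all(z[(i + 1) * k:(i + 2) * k] == bits(y[i]) for i in range(t))

GHt = [[sum(G[i][l] & H[j][l] for l in range(n)) % 2 for j in range(t * k)]
       for i in range(k)]
print("G.H^T is nonzero:", any(any(row) for row in GHt))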

6.2 Inner product (IP [3]) masking scheme

The perfect masking scheme depicted in (2) presents intrinsic weaknesses (Footnote 4): for instance, it does not correspond to individual bit masking, but rather works word-wise. Individual bits in the shares can be attacked independently of one another, thereby enabling k parallel divide-and-conquer mono-bit strategies. Hence there is a need for a secondary, bit-oriented security objective. The publications dealing with inner product masking [3, 58] therefore attempt to shuffle the bits within one share. However, neither the choice to focus on one share nor the code selection method is currently discussed mathematically in the published literature. Still, there is a way to select linear functions in line with a security objective; this will be made clear in Section 6.4, devoted to the leakage squeezing countermeasure.

6.3 Orthogonal direct sum masking (ODSM [11])

ODSM refers to the masking scheme where the data to protect is represented as in (1).

Implementation

An example of an implementation of ODSM for AES (n = 2k = 16) is given in Appendix B of [12]. These indications shall suffice to reproduce a protected design. The only practical detail to be specified in the implementation is the computation of the linear transformation \(\mathbb {F}_{2}^{16}\to \mathbb {F}_{2}^{16}\). It can be implemented as a vector-matrix product, as explained in Algorithm 2 of [12]. Thus, there is no need to store a \(2^{16} \times 16\) table corresponding to all the values of \(xL\) for \(x\in \mathbb {F}_{2}^{16}\). Besides, if a table-based implementation is nonetheless desired, it is possible to split the \(2^{16} \times 16\) table into tables of size \(2^{16} \times 8\) (see Alg. 1 in [30]).

Security against fault injection attacks

In ODSM, the transformations (e.g., the calls to substitution boxes) are presented as operating in parallel on the whole state \(z{\in \mathbb {F}_{2}^{n}}\). It is described in [11] how linear and non-linear operations can be tabulated. When k | n, the ODSM scheme can be interpreted as a computation which can be carried out on k-bit words. In this case, one knows that linear operations can be safely implemented as the parallel composition of the linear operation on each of the d = n/k shares. However, this should not be understood as meaning that arbitrary linear operations can be securely implemented on the whole state. Indeed, for instance, the projection of z on x is linear and is clearly insecure. Therefore, care must be taken when implementing (linear) operations between shares. For instance, it is secure to project z on the code of generating matrix H in parallel to the code of generating matrix G, and thereby to obtain y, but the projection algorithm shall be scrutinized. Indeed, the following method:

  Step 1: z is projected on the code of generating matrix G in parallel to the code of generating matrix H to retrieve x,

  Step 2: then y is retrieved as the subtraction z − xG over \(\mathbb {F}_{2}\),

is not desirable from a security standpoint, owing to the demasking of the variable at Step 1.

6.4 Leakage squeezing

Background about leakage squeezing countermeasure

The leakage squeezing idea [28, 38] is based on masking but additionally applies some bijective functions (linear or non-linear) to the shares. A quantitative analysis of the gain in terms of the bounded moment leakage security model is carried out in [37], where it is found that the best bijections can be non-linear, in relation with non-linear codes (e.g., the Nordstrom-Robinson code for k = 8, n = 16). A comprehensive search of functions / codes suitable in the bounded moment leakage model is carried out in [20]. The suitable codes are nicknamed Complementary Information Set (CIS) codes. A survey of the usage of codes in the field of side-channel analysis is conducted by the first author in [15]. In parallel, an approach using cellular automata to build codes is proposed in [34]. Following [37], the conditions for building better codes are made precise in [18]. This journal paper also shows that the leakage squeezing countermeasure resists model imperfections. The mutual information between the sensitive data and the leakage is computed empirically in [18, 37]. In [19], it is demonstrated mathematically that this mutual information vanishes exponentially with the noise variance, at a rate which is proportional to the countermeasure's first non-constant moment (known as the HCI, or High-order Correlation Immunity). In this section, we relate bounded moments, mutual information, and attack success rate. That is, we show that attacks are all the more difficult as the first non-constant moment of the leakage is high, and that this behavior tracks that of the mutual information.

Finally, notice that leakage squeezing with more than two shares has already been studied, from a security perspective in [16] and from the code construction point of view in [25] (where HO-CIS codes are introduced as a generalization of CIS codes). The most recent survey on codes in side-channel analysis is available in [30]. In the rest of this section on leakage squeezing, we only detail leakage squeezing with two shares.

Definition and use-case

Leakage squeezing (LS) consists in masking \(X{\in \mathbb {F}_{2}^{k}}\) using representation

$$ (X\oplus Y, F(Y)), $$
(3)

where F is a bijective function from \({\mathbb {F}_{2}^{k}}\) to itself. The security order of LS is studied in [37]. We compare hereafter LS at various orders (we use indices, e.g., \(F_{t}\) for t = 0, 1, …, to distinguish the different functions F):

  • 0 (no protection): the leakage involves only one share, that is, X in the clear.

  • 1: \(F_{1} = \mathrm{Id}\), i.e., (3) represents perfect masking (\(F_{1}(y) = y\)).

  • 2: \(F_{2}\) is a linear function, whose matrix is:

    $$M_{2}= \left( \begin{array}{cccc} 0&0&1&1\\ 1&1&0&1\\ 0&1&1&1\\ 1&1&0&0 \end{array}\right). $$
  • 3: \(F_{3}\) is a linear function (which is optimal, cf. F3 in [18, Section 5.2]), of matrix:

    $$M_{3}= \left( \begin{array}{cccc} 0&1&1&1\\ 1&0&1&1\\ 1&1&0&1\\ 1&1&1&0 \end{array}\right). $$

Alternatively, the truth tables of the \(F_{t}\) are (in hexadecimal notation):

  • \(\{F_{1}(y), 0 \leq y < 2^{4}\}\) = {0,1,2,3,4,5,6,7,8,9,a,b,c,d,e,f},

  • \(\{F_{2}(y), 0 \leq y < 2^{4}\}\) = {0,a,e,4,5,f,b,1,7,d,9,3,2,8,c,6},

  • \(\{F_{3}(y), 0 \leq y < 2^{4}\}\) = {0,e,d,3,b,5,6,8,7,9,a,4,c,2,1,f}.

When the bijective functions \(F_{t}\) are linear, leakage squeezing is a special instance of ODSM, with the generating matrices G and H of (1) equal to \(G = (I_{k} \| 0)\) and \(H = (I_{k} \| M_{t})\).
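
The truth tables listed above can be recomputed from the matrices; the sketch below (ours) assumes the convention \(F_{t}(y) = M_{t} \cdot y\) with bits taken least significant first, which indeed reproduces the tables:

k = 4
M2 = [[0, 0, 1, 1], [1, 1, 0, 1], [0, 1, 1, 1], [1, 1, 0, 0]]
M3 = [[0, 1, 1, 1], [1, 0, 1, 1], [1, 1, 0, 1], [1, 1, 1, 0]]

def apply(M, y):
    """Matrix-vector product over F_2, bit i of y being the coefficient of 2^i."""
    yb = [(y >> j) & 1 for j in range(k)]
    out = [sum(M[i][j] & yb[j] for j in range(k)) % 2 for i in range(k)]
    return sum(b << i for i, b in enumerate(out))

print([format(apply(M2, y), "x") for y in range(2**k)])   # 0 a e 4 5 f b 1 7 d 9 3 2 8 c 6
print([format(apply(M3, y), "x") for y in range(2**k)])   # 0 e d 3 b 5 6 8 7 9 a 4 c 2 1 f
# With a linear F_t, the pair (x ^ y, F_t(y)) is a codeword of the ODSM encoding
# with G = (I_k || 0) and H = (I_k || M_t), up to the bit-ordering/transposition
# convention adopted here.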

Leakage distributions for leakage squeezing

The resulting (uni-variate) distributions in Hamming weight, when \(X \oplus Y\) and \(F_{t}(Y)\) are manipulated in parallel, are represented in Fig. 4a, when the noise has variance \(\sigma^{2} = 1\). The versions resisting attacks at orders 1, 2 and 3 are represented in Figs. 5a, 6a, and 7a. The scale is the same for all plots. It can be seen that:

  • distributions in Fig. 4a do not have the same mean,

  • distributions in Fig. 5a have the same mean (= n = 4), but not the same variance (informally, some distributions are wider than others),

  • distributions in Fig. 6a have the same mean (= n = 4), the same variance (= n/2 + \(\sigma^{2}\)), but not the same skewness (informally, some distributions bend to the right, others to the left, while the rest are straight),

  • distributions in Fig. 7a have the same mean (= n = 4), the same variance (= n/2 + \(\sigma^{2}\)), no skewness, but different kurtosis (informally, some distributions have smaller tails than others).

Fig. 4 Leakage distribution without countermeasure (n = 4). Due to the absence of masking, the leakage traces consist in Gaussian functions centered at 0, 1, …, k

Fig. 5 Leakage distribution with the leakage squeezing countermeasure at order 1 (k = 4, see Section 6.4)

Fig. 6 Leakage distribution with the leakage squeezing countermeasure at order 2 (k = 4, see Section 6.4)

Fig. 7 Leakage distribution with the leakage squeezing countermeasure at order 3 (k = 4, see Section 6.4)

Remark 1

The distributions represented in Figs. 4a, 5a, 6a, and 7a are the convolution of the \(2^{n}\) cosets of the weight distribution of the graph of the functions \(F_{t}\), for 0 ≤ t ≤ 3.

The bi-variate distributions, that is:

$$(w_{H}(X\oplus Y)+N, w_{H}(F_{t}(Y))+N^{\prime}) \in\mathbb{R}^{2}, \quad\text{where } N,N^{\prime}\sim\mathcal{N}(0,\sigma^{2}), $$

which represent the word-oriented case, are represented in Figs. 4b, 5b, 6b, and 7b. It can be seen that Fig. 4a is merely the value along the abscissa of the corresponding bi-variate distribution (somewhat artificial, since this implementation uses only one share; however, the representation allows one to contrast the leakage of unprotected and protected implementations) represented in Fig. 4b. Besides, Figs. 5a, 6a, and 7a are merely the diagonals of the corresponding bi-variate distributions represented in Figs. 5b, 6b, and 7b.

It is interesting to see that some distributions are identical for some values of x. We group identical distributions into classes, labeled in lexicographical order. In the uni-variate case (see (5)), the number of classes is respectively 5, 5, 6 and 3 (for bijections \(F_{0}\), \(F_{1}\), \(F_{2}\) and \(F_{3}\)), as represented in Table 1. The bi-variate case (see (4)) is represented in the bottom line of Table 1. In these tables, the layout is as given below:

$$\left[\begin{array}{cccc} \mathrm{x}=\texttt{0x0} & \mathrm{x}=\texttt{0x1} & \mathrm{x}=\texttt{0x2} & \mathrm{x}=\texttt{0x3} \\ \mathrm{x}=\texttt{0x4} & \mathrm{x}=\texttt{0x5} & \mathrm{x}=\texttt{0x6} & \mathrm{x}=\texttt{0x7} \\ \mathrm{x}=\texttt{0x8} & \mathrm{x}=\texttt{0x9} & \mathrm{x}=\texttt{0xa} & \mathrm{x}=\texttt{0xb} \\ \mathrm{x}=\texttt{0xc} & \mathrm{x}=\texttt{0xd} & \mathrm{x}=\texttt{0xe} & \mathrm{x}=\texttt{0xf} \end{array}\right].$$
Table 1 Classes of identical uni-variate and bi-variate distributions, in leakage squeezing with functions \(F_{t}\), for 0 ≤ t ≤ 3
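
The class counts of Table 1 can be recomputed: the noise being the same Gaussian for every x, two values of x yield identical leakage distributions if and only if the multisets of noiseless leakage values, taken over the uniform mask y, coincide. The following sketch (ours) counts the distinct multisets; the unprotected case \(F_{0}\) trivially gives one class per Hamming weight, i.e., 5 classes:

from collections import Counter

k = 4
wH = lambda v: bin(v).count("1")
F1 = list(range(16))
F2 = [0x0, 0xa, 0xe, 0x4, 0x5, 0xf, 0xb, 0x1, 0x7, 0xd, 0x9, 0x3, 0x2, 0x8, 0xc, 0x6]
F3 = [0x0, 0xe, 0xd, 0x3, 0xb, 0x5, 0x6, 0x8, 0x7, 0x9, 0xa, 0x4, 0xc, 0x2, 0x1, 0xf]

def nb_classes(F, bivariate):
    signatures = set()
    for x in range(2**k):
        if bivariate:           # multiset of noiseless pairs, as in (4)
            sig = Counter((wH(x ^ y), wH(F[y])) for y in range(2**k))
        else:                   # multiset of noiseless sums, as in (5)
            sig = Counter(wH(x ^ y) + wH(F[y]) for y in range(2**k))
        signatures.add(frozenset(sig.items()))
    return len(signatures)

for name, F in [("F1", F1), ("F2", F2), ("F3", F3)]:
    print(name, "uni-variate classes:", nb_classes(F, False),
                "bi-variate classes:", nb_classes(F, True))
# The counts should match the corresponding entries of Table 1.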

Uni- and bi-variate attacks on leakage squeezing

Attacks have been simulated, both in uni- and bi-variate settings. In the bi-variate setting, the attacker gets the leakages \(L^{(1)}\) and \(L^{(2)}\) corresponding to the masked data and to the mask (with the bijection \(F_{t}: {\mathbb {F}_{2}^{k}}{\to \mathbb {F}_{2}^{k}}\) applied to it):

$$\begin{array}{@{}rcl@{}} \left\{ \begin{array}{ll} L^{(1)} = w_{H}(T\oplus k^{*}\oplus Y)+N \\ L^{(2)} = w_{H}(F_{t}(Y))+N^{\prime} \end{array} \right., \end{array} $$
(4)

where \(X = T \oplus k^{*}\) is the sensitive variable (known text \(T{\in \mathbb {F}_{2}^{k}}\) and secret key \(k^{*}{\in \mathbb {F}_{2}^{k}}\)). Equation (4) is the application of the Hamming weight leakage model to the two shares of (3), followed by the addition of noise. In the uni-variate setting, the attacker gets only one leakage sample:

$$ L=L^{(1)}+L^{(2)}. $$
(5)

For the sake of a fair comparison, we focus on the optimal attack [13], that is, the attack which maximizes the probability of success in secret key recovery. Notice that the bijections used in the leakage squeezing countermeasure are assumed to be public information.

  • The uni-variate attack measures the sum \(l_{q}^{(1)}+l_{q}^{(2)}\) of the leakages for each trace q (1 ≤ q ≤ Q), hence the optimal attack estimates the correct key \(k^{*}\) as:

    $$ \hat{k^{*}} = \underset{k{\in\mathbb{F}_{2}^{k}}}{\text{argmax}} \sum\limits_{q = 1}^{Q} \log \sum\limits_{y{\in\mathbb{F}_{2}^{k}}} \exp\left( -\frac{1}{4\sigma^{2}} \left( l_{q}^{(1)}+l_{q}^{(2)} - w_{H}\left( t_{q} \oplus k \oplus y, F_{t}(y) \right) \right)^{2} \right). $$
    (6)
  • The bi-variate attack measures each leakage \(l_{q}^{(1)}\) and \(l_{q}^{(2)}\) independently, hence the optimal attack estimates the correct key \(k^{*}\) as:

    $$ \hat{k^{*}} = \underset{k{\in\mathbb{F}_{2}^{k}}}{\text{argmax}} \sum\limits_{q = 1}^{Q} \log \sum\limits_{y{\in\mathbb{F}_{2}^{k}}} \exp\left( -\frac{1}{2\sigma^{2}} \left( \left( l_{q}^{(1)} - w_{H}(t_{q} \oplus k \oplus y) \right)^{2} + \left( l_{q}^{(2)} - w_{H}(F_{t}(y)) \right)^{2} \right) \right). $$
    (7)

Notice that the noise in the uni-variate case is \(N+N^{\prime }\sim \mathcal {N}\left (0,2\sigma ^{2}\right )\), whereas in the bi-variate case it is \((N,N^{\prime })\sim \mathcal {N}\left (\left (\begin {array}{c}0\\0 \end {array}\right ),\sigma ^{2}\left (\begin {array}{cc}1&0\\0&1 \end {array}\right ) \right )\); this explains the different factors in the exponentials of expressions (6) and (7). Results in terms of success rate (\(\mathsf {SR}=\mathbb {P}(\hat {k^{*}}=k^{*})\)) are shown in Fig. 8 for σ = 1. The success rates are obtained after 100 independent attacks, and the estimation errors of the curves are superimposed (they correspond to ± the standard deviation of the \(\mathsf{SR}\) estimator; refer to [39] for their calculation).
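
For concreteness, the bi-variate optimal distinguisher (7) can be simulated as follows (our own Monte Carlo sketch; k, σ, Q and the choice of \(F_{3}\) are illustrative, and the Hamming-weight model (4) is assumed):

import numpy as np

rng = np.random.default_rng(0)
k, sigma, Q = 4, 1.0, 200
F3 = np.array([0x0, 0xe, 0xd, 0x3, 0xb, 0x5, 0x6, 0x8, 0x7, 0x9, 0xa, 0x4, 0xc, 0x2, 0x1, 0xf])
wH = np.array([bin(v).count("1") for v in range(2**k)])

k_star = int(rng.integers(2**k))                 # secret key
t_q = rng.integers(2**k, size=Q)                 # known texts
y_q = rng.integers(2**k, size=Q)                 # unknown masks
l1 = wH[t_q ^ k_star ^ y_q] + rng.normal(0, sigma, Q)    # leakage of the masked data
l2 = wH[F3[y_q]] + rng.normal(0, sigma, Q)               # leakage of the squeezed mask

y = np.arange(2**k)
scores = np.empty(2**k)
for key in range(2**k):
    # log sum_y exp( -((l1 - wH(t^key^y))^2 + (l2 - wH(F3(y)))^2) / (2 sigma^2) )
    e1 = (l1[:, None] - wH[t_q[:, None] ^ key ^ y]) ** 2
    e2 = (l2[:, None] - wH[F3[y]]) ** 2
    scores[key] = np.log(np.exp(-(e1 + e2) / (2 * sigma**2)).sum(axis=1)).sum()

# With enough traces Q, the argmax converges to k_star; the success rate as a
# function of Q is what Fig. 8 reports.
print("guessed key:", int(np.argmax(scores)), " true key:", k_star)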

Fig. 8 Attack result for σ = 1

It can be seen that the security increases (i.e., more and more traces are needed to recover the key) with the resistance order t, 0 ≤ t ≤ 3. Said differently, the larger the dual distance of the code generated by H, the more difficult the attack. Moreover, it appears clearly that bi-variate attacks are more successful than uni-variate attacks, since information is lost when the two leakages are summed up (recall that in Table 1, there are fewer classes in the uni-variate case than in the bi-variate case). This observation provides a quantitative explanation of why so-called zero-offset uni-variate attacks [57] are less efficient than their truly multi-variate counterparts. The two functions \(F_{2}\) and \(F_{3}\) seem to yield a similar security level, at least for the low noise σ = 1. However, when the noise increases, \(F_{3}\) increases the resistance of the implementation against attacks more than \(F_{2}\) does, as illustrated in Fig. 9 for σ = 2. One can see the “staggering” of the number of traces needed to succeed for a given order: the success rate curve without protection (\(F_{0}\)) is squared to obtain that with 1st-order protection (\(F_{1}\)). This fact has already been reported in [29].

Fig. 9 Attack result for σ = 2

Information leakage under leakage squeezing protection

Besides, we also evaluate the information leakage of the four levels of protection. We compute I(L;X), where \(X = T \oplus k^{*}\) is uniformly distributed over \({\mathbb {F}_{2}^{k}}\), and where the leakage L is the uni- or bi-variate leakage function.

  • In the uni-variate case, L is

    $$L^{(1)}+L^{(2)}=w_{H}(T\oplus k^{*}\oplus Y)+N+ w_{H}(F_{t}(Y))+N^{\prime}\in\mathbb{R};$$
  • In the bi-variate case, L is

    $$(L^{(1)},L^{(2)})=(w_{H}(T\oplus k^{*}\oplus Y)+N, w_{H}(F_{t}(Y))+N^{\prime})\in\mathbb{R}^{2},$$

where:

  • Y is uniformly distributed over \(\mathbb {F}_{2}^{n-k}\) (here \(={\mathbb {F}_{2}^{k}}\) since n = 2k) and

  • N and N′ are two additive (recall Fig. 1) and independent noises following a centered normal law with identical standard deviation σ.

The resulting mutual information values are given in Fig. 10 for uni- and bi-variate attacks.
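
The uni-variate mutual information can be estimated numerically; the sketch below (ours, a rough Monte Carlo estimator with illustrative parameters) uses I(L;X) = E[log₂ p(L|X) − log₂ p(L)], where p(l|x) is a uniform mixture over the mask of Gaussians of variance \(2\sigma^{2}\):

import numpy as np

rng = np.random.default_rng(1)
k, sigma, trials = 4, 1.0, 20_000
F3 = np.array([0x0, 0xe, 0xd, 0x3, 0xb, 0x5, 0x6, 0x8, 0x7, 0x9, 0xa, 0x4, 0xc, 0x2, 0x1, 0xf])
wH = np.array([bin(v).count("1") for v in range(2**k)])

y = np.arange(2**k)
means = wH[np.arange(2**k)[:, None] ^ y] + wH[F3[y]]    # noiseless leakage for every (x, y)

x_s = rng.integers(2**k, size=trials)
y_s = rng.integers(2**k, size=trials)
l_s = means[x_s, y_s] + rng.normal(0, np.sqrt(2) * sigma, trials)   # N + N' has variance 2 sigma^2

var = 2 * sigma**2
dens = np.exp(-(l_s[:, None, None] - means[None, :, :]) ** 2 / (2 * var))
p_l_given_x = dens.mean(axis=2) / np.sqrt(2 * np.pi * var)          # shape (trials, 2^k)
p_l = p_l_given_x.mean(axis=1)                                      # X is uniform
mi_bits = np.mean(np.log2(p_l_given_x[np.arange(trials), x_s] / p_l))
print(f"I(L;X) ~ {mi_bits:.4f} bit for sigma = {sigma}")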

Fig. 10 Mutual information analysis for uni- and bi-variate leakage

Interestingly, in the presence of large noise, the mutual information decreases affinely with σ (in log-log scale), with a slope − 2(t + 1) = − 2d, where:

  • t is the protection order, and

  • d = t + 1 is the minimum order of a successful attack (also denoted High-order Correlation Immunity or HCI in [19, Def. 2]).

This observation is demonstrated mathematically in [19, Theorem 1].

It can thus be stated that LS with bijection \(F_{t}\) has the same bit-level security with two shares as perfect masking with t + 1 = d shares.

Link between attacks and information leakage

It is demonstrated in [29] that, for additive distinguishers, there exists a coefficient E, called the first-order exponent, such that the number of traces q needed to extract the key \(k^{*}\) with success probability \(\mathsf{SR}\) satisfies:

$$ 1-\mathsf{SR} \approx \exp-q \cdot E , $$
(8)

where ≈ is an asymptotic equivalence (detailed in [29]).

It is hinted in [53] that such an exponent is proportional to the mutual information (as computed above in Section 6.4), provided the distinguisher is the template attack. Now, with perfect profiling, the template attack [24] coincides with the optimal distinguisher [13]. Thus, we aim in this section at validating this finding on the bi-variate (but higher-order secure) LS masking scheme, using the distinguishers (6) and (7) for the optimal attacks.

To validate experimentally that the first-order exponent E involved in (8) is proportional to I(L;X), we extract the number of measurements q needed to recover the key \(k^{*}\) with probability \(\mathsf{SR}\) = 80%. Figures 8 and 9 allow us to extract 16 values of the number of traces. The corresponding values (for σ = 1 and 2) of I(L;X) are extracted from Fig. 10. These data are represented in Fig. 11.

Fig. 11 Number of traces to extract the secret key \(k^{*}\) with probability 80% as a function of the mutual information between the leakage and the sensitive variable \(X = T \oplus k^{*}\)

In the case of the bi-variate attack, it is possible to fit these data by linear regression with the relationship:

$$\log(q) = \log(-\log(0.80)) - \log(\alpha \cdot I(L;X)) , $$

where the estimated parameter α is found to be α = 0.0396361 ± 0.0002805. This good fit with a law where q × I(L;X) is constant (curve of slope − 1 in Fig. 11) validates that, in the case of the optimal attack on bi-variate leakage, (8) holds with a first-order exponent equal to:

$$ E = \alpha \cdot I(L;X). $$
(9)

We underline that this result holds, surprisingly, for 4 different leakage scenarios (corresponding to the use of \(F_{t}\), t ∈ {0,1,2,3}). Therefore, the relationship (9) seems very general. In the uni-variate case, on the contrary, the law (8) fits less nicely (the interpolated slope of the curve is > − 1), which might be explained by the fact that summing the two leakages is an ad hoc operation.

7 A new construction for leakage squeezing and inner product masking

7.1 Rationale of the construction

In this section, we explain how to obtain CIS (and HO-CIS) codes based on code expansion from \(\mathbb {F}_{2^{k}}\) to \(\mathbb {F}_{2}\). The procedure is the following:

  1. Decide on a number m of shares of k-bit words.

  2. Search for a code of parameters \([m,1]_{2^{k}}\) of minimum distance m; basically, this means that the generating matrix of the code consists in a line of m non-zero values of \(\mathbb {F}_{2^{k}}\).

  3. Expand the code over \(\mathbb {F}_{2}\). This code is HO-CIS of order m (see Proposition 2.2 of [1]). The protection order of this code in bit-level security models is equal to its minimum distance minus one.

  4. Write its generating matrix as \((M_{1} || M_{2} || {\ldots} || M_{m})\), where the \(M_{i}\) (1 ≤ i ≤ m) are k × k matrices with entries in \(\mathbb {F}_{2}\).

  5. As explained in [17, Appendix B, page 21], the linear function to apply to share i (1 ≤ i ≤ m) is generated by the matrix \(M_{i}^{-1}\).

7.2 Example on a non-optimal code

We detail in Listing 1 an example of a masking with m = 2 shares of k = 4 bits, which has order 1 security at word level and order 2 security at the bit level (Footnote 5).

Listing 1 Example of program for obtaining codes (in MAGMA [56] language)

The construction of this code, as needed in leakage squeezing, is explained below:

  1. we opt for a leakage squeezing with a mask Y of bitwidth equal to that of the data X to protect,

  2. the code C5 over \(\mathbb {F}_{2^{k}}\) is generated by (1 || 1 + X), where \(\mathbb {F}_{2^{4}}=\mathbb {F}_{2}[X]/(1+X+X^{4})\), hence has parameters \([2,1,2]_{16}\),

  3. this code is expanded into C5_expanded, which has parameters \([8,4,3]_{2}\). Therefore, the security order at the bit level is 3 − 1 = 2, which is one more than that of C5 at the word level,

  4. the generating matrix of C5_expanded is written in systematic form as

     $$(I_{4} || \text{\texttt{M5\_inv}}) = \left( \begin{array}{ccccccccc} 1&0&0&0& &1&1&0&0\\ 0&1&0&0& &0&1&1&0\\ 0&0&1&0& &0&0&1&1\\ 0&0&0&1& &1&1&0&1 \end{array}\right)$$

  5. the sought linear bijection has matrix \(M5 = \text {\texttt {M5\_inv}}^{-1} = \left (\begin {array}{cccc} 0&1&1&1\\ 1&1&1&1\\ 1&0&1&1\\ 1&0&0&1 \end {array}\right )\).

The resulting linear function has truth table:

$$\{F_{5}(y), 0\leq y<2^{4}\} = \{\texttt{0}, \texttt{e}, \texttt{3}, \texttt{d}, \texttt{7}, \texttt{9}, \texttt{4}, \texttt{a}, \texttt{f}, \texttt{1}, \texttt{c}, \texttt{2}, \texttt{8}, \texttt{6}, \texttt{b}, \texttt{5}\}.$$
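
The construction above can be checked mechanically. The Python sketch below (ours; it fixes the polynomial basis and the least-significant-bit-first, matrix-acting-on-the-left conventions) re-derives the expanded generator, its minimum distance, M5_inv, M5 and the truth table of \(F_{5}\):

from itertools import product

m, poly = 4, 0b10011                             # GF(16) = F_2[X]/(1 + X + X^4)

def gf_mul(a, b):
    r = 0
    for i in range(m):
        if (b >> i) & 1:
            r ^= a << i
    for i in range(2 * m - 2, m - 1, -1):        # reduce modulo 1 + X + X^4
        if (r >> i) & 1:
            r ^= poly << (i - m)
    return r

# Binary expansion of the generator (1 || 1 + X): row j is the expansion of X^j * (1, 1 + X).
G = [[(gf_mul(1, 1 << j) >> i) & 1 for i in range(m)] +
     [(gf_mul(0b0011, 1 << j) >> i) & 1 for i in range(m)] for j in range(m)]

def codeword(msg):
    return [sum(msg[j] & G[j][c] for j in range(m)) % 2 for c in range(2 * m)]
dmin = min(sum(codeword(msg)) for msg in product([0, 1], repeat=m) if any(msg))
print(f"C5_expanded has parameters [8, 4, {dmin}]_2")

M5_inv = [row[m:] for row in G]                  # G is already systematic: (I_4 || M5_inv)

def inverse_gf2(M):
    n = len(M)
    A = [row[:] + [1 if i == j else 0 for j in range(n)] for i, row in enumerate(M)]
    for c in range(n):
        p = next(r for r in range(c, n) if A[r][c])
        A[c], A[p] = A[p], A[c]
        for r in range(n):
            if r != c and A[r][c]:
                A[r] = [a ^ b for a, b in zip(A[r], A[c])]
    return [row[n:] for row in A]

M5 = inverse_gf2(M5_inv)
F5 = [sum(((sum(M5[i][j] & ((y >> j) & 1) for j in range(m)) % 2) << i) for i in range(m))
      for y in range(2**m)]
print("F5 truth table:", [format(v, "x") for v in F5])   # 0 e 3 d 7 9 4 a f 1 c 2 8 6 b 5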

7.3 Example on an optimal code

In the case k = 4 and n = 2k = 8, we detail how the (self-dual, and unique of type II; Footnote 6) code with parameters \([8,4,4]_{2}\) and generating matrix

$$ \left( \begin{array}{ccccccccc} 1&0&0&0& &0&1&1&1\\ 0&1&0&0& &1&0&1&1\\ 0&0&1&0& &1&1&0&1\\ 0&0&0&1& &1&1&1&0 \end{array}\right) $$
(10)

can be derived from a linear code of parameters \([2,1]_{16}\) over \(\mathbb {F}_{2^{4}}\). We aim to find an irreducible polynomial P(X) such that \(\mathbb {F}_{16}=\mathbb {F}_{2}[X]/P(X)\) and the code over \(\mathbb {F}_{16}\) is generated by \((1 \,||\, X + X^{2} + X^{3})\). Equivalently, this means that:

  1. \(\deg(P(X)) = 4\), and

  2. the three following conditions are met:

     • \(X(X + X^{2} + X^{3}) = 1 + X^{2} + X^{3} \bmod P(X)\),

     • \(X^{2}(X + X^{2} + X^{3}) = 1 + X + X^{3} \bmod P(X)\),

     • \(X^{3}(X + X^{2} + X^{3}) = 1 + X + X^{2} \bmod P(X)\).

The three conditions in item 2 mean that P(X) is a divisor of \(\gcd (X(X+X^{2}+X^{3})+ 1+X^{2}+X^{3},\ X^{2}(X+X^{2}+X^{3})+ 1+X+X^{3},\ X^{3}(X+X^{2}+X^{3})+ 1+X+X^{2})=\gcd (1+X^{4},\ 1+X+X^{4}+X^{5},\ 1+X+X^{2}+X^{4}+X^{5}+X^{6})\).

Now, it happens indeed that \(1 + X^{4}\) divides both \(1 + X + X^{4} + X^{5}\) and \(1 + X + X^{2} + X^{4} + X^{5} + X^{6}\). However, \(\mathbb {F}_{2}[X]/(1+X^{4})\) is a ring, and not a field, because \(1 + X^{4} = (1 + X)^{4}\). Thus, the code is defined over a ring, which has never been analyzed this way in masking, and which opens the door to interesting perspectives.
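
These divisibility claims are easy to verify mechanically; in the small sketch below (ours), binary polynomials are represented as integer bitmasks (bit i is the coefficient of X^i):

def pdivmod(a, b):
    """Euclidean division of binary polynomials encoded as bitmasks."""
    q = 0
    while a and a.bit_length() >= b.bit_length():
        shift = a.bit_length() - b.bit_length()
        q ^= 1 << shift
        a ^= b << shift
    return q, a

def pgcd(a, b):
    while b:
        a, b = b, pdivmod(a, b)[1]
    return a

p1 = 0b0010001       # 1 + X^4
p2 = 0b0110011       # 1 + X + X^4 + X^5
p3 = 0b1110111       # 1 + X + X^2 + X^4 + X^5 + X^6
print(pdivmod(p2, p1)[1] == 0, pdivmod(p3, p1)[1] == 0)    # True True: 1 + X^4 divides both
print(bin(pgcd(pgcd(p1, p2), p3)))                         # 0b10001, i.e., 1 + X^4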

We have tested all 4! permutations of the four last columns in the \([8,4,4]_{2}\) code generating matrix (10), without success in lifting this code to a \([2,1,2]_{16}\) code over \(\mathbb {F}_{16}\). Actually, this code can be obtained as a binary image (through a Gray map) of a [2,1] code over \(\mathbb {F}_{4}[X]/(X^{2})\) or over \(\mathbb {F}_{2}[X]/(X^{4})\), and of a code of parameters [4,2] over \(\mathbb {F}_{2}[X]/(X^{2})\).

8 Conclusions

In this paper, we have studied the statistical distribution of uni- and multi-variate leakage functions of cryptographic implementations when countermeasures against fault injection and side-channel analyses are applied.

We have observed that the previously studied protection called leakage squeezing is a generalization of the variants of perfect masking, including inner product masking (see Table 2 for a recap). In this sense, we extend the work [48], which explores the links between inner product masking and direct sum masking. We show that leakage squeezing is all the more secure as its underlying code has a high minimum distance d. Side-channel attacks of orders 1,…,t = d − 1 are impossible. We relate this value to the slope − 2d of the mutual information between sensitive variables and the leakage (represented in log-log scale), and show that, in practice, the success rate of attacks is lower when d is large. We also reveal that the uni-variate mutual information (resp. uni-variate attack success probability) is smaller than its bi-variate counterpart.

Table 2 Hierarchy between masking styles

Finally, we propose a new method to build (HO-)CIS codes based on code expansion, which is promising in the context of leakage squeezing, i.e., when a high level of security is required both at the word and at the bit level.