Meet-in-the-Middle Attacks and Structural Analysis of Round-Reduced PRINCE

Derbez, Patrick; Perrin, Léo

doi:10.1007/s00145-020-09345-0

Meet-in-the-Middle Attacks and Structural Analysis of Round-Reduced PRINCE

Published: 04 March 2020

Volume 33, pages 1184–1215, (2020)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Cryptology Aims and scope Submit manuscript

Meet-in-the-Middle Attacks and Structural Analysis of Round-Reduced PRINCE

Download PDF

Patrick Derbez¹ &
Léo Perrin²

685 Accesses
2 Citations
Explore all metrics

Abstract

NXP Semiconductors and its academic partners challenged the cryptographic community with finding practical attacks on the block cipher they designed, PRINCE. Instead of trying to attack as many rounds as possible using attacks which are usually impractical despite being faster than brute force, the challenge invites cryptographers to find practical attacks and encourages them to actually implement them. In this paper, we present new attacks on round-reduced PRINCE including the ones which won the challenge in the 4-, 6- and 8-round categories—the highest for which winners were identified. Our first attacks rely on a meet-in-the-middle approach and break up to ten rounds of the cipher. We also describe heuristic methods we used to find practical SAT-based and differential attacks. Finally, we also present an analysis of the cycle structure of the internal rounds of PRINCE leading both to a low complexity distinguisher for 4-round PRINCE-core and an alternative representation of the cipher valid in particular contexts and which highlights, in these cases, a poor diffusion.

Meet-in-the-Middle Attacks and Structural Analysis of Round-Reduced PRINCE

Faster Key Recovery Attack on Round-Reduced PRINCE

Improving First-Order Threshold Implementations of SKINNY

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

When tasked with assessing the security of a block cipher, cryptanalysts have now a broad range of tools at their disposal: differential attack [1], linear attack [2], meet-in-the-middle attack [3], etc. The main purpose of a security analysis is usually to identify flaws in the design of a primitive and then to illustrate their gravity through the description of an attack covering as many rounds as possible. However, applicability of said attacks in a realistic situation is usually not the first objective of the cryptanalyst. A simple reason for this is that as our understanding of the design of block ciphers improved, the ease of identifying practical attacks decreased. Furthermore and in accordance with the famous maxim “attacks only get better,” an impractical attack submitted at a given time may later be improved.

While impractical attacks provide the academic community with valuable insights into the security provided by different block ciphers, their components, their design strategies, etc., cryptanalysis in the industry is more focused on practical attacks. In order to promote this view, the Technical University of Denmark (DTU), NXP Semiconductors and the Ruhr University of Bochum challenged the cryptographic community [4] with finding low data complexity attacks on the block cipher PRINCE [5]. More precisely, they accept attacks requiring only at most $2^{20}$ chosen plaintexts or $2^{30}$ known plaintexts. Furthermore, extra rewards (from 1000 to 10000€) are given for attacks on at least eight rounds which require at most $2^{45}$ bytes of memory (about 32 Terabytes) and at most $2^{64}$ encryptions of the round-reduced variant attacked.

Studying PRINCE in this setting may provide valuable data on multiple accounts. First of all, PRINCE is a lightweight block cipher, meaning that it is intended to be run on processors with little computing power to devote to security-related algorithm or on hardware where every logical gate counts. Research on this topic is intense nowadays as the need for such primitives becomes increasingly pressing, see [6] for an extensive review of the algorithms that have been proposed. Second, PRINCE implements a simplified version of the so-called FX construction: Encryption under key $(k_{0} || k_{1})$ consists in XOR-ing $k_{0}$ to the plaintext, applying a block cipher called PRINCE-core keyed with $k_{1}$, and then, output the result XOR-ed with $L(k_{0})$ where L is a simple linear bijection. This strategy allows for a greater key size without the cost of a sophisticated key schedule. However, it is impossible to make a security claim as strong as for a more classical construction. Finally, PRINCE-core has a unique property called $\alpha $-reflection. If we denote by $E^{c}_{k_{1}}$ the encryption under PRINCE-core with subkey $k_{1}$, then the corresponding decryption operation is $E^{c}_{k_{1} \oplus \alpha }$ for a constant $\alpha $. In other words, decryption is merely encryption under a related key. The consequences of this property have already been studied, and, in particular, some values of $\alpha $ different from the one used have been shown to lead to weaker algorithms [7].

PRINCE has already been the subject of several cryptanalyses, notably [8] where the security of the algorithm against multiple attacks was assessed, [7] which investigated the influence of the value of $\alpha $, [9] which described meet-in-the-middle attacks on the block cipher and, finally, [10] which proposed the best attack to date in terms of number of rounds attacked. A list of the cryptanalyses of round-reduced PRINCE is provided in Table 1 (attacks working only on PRINCE-core or for modified versions of PRINCE (different $\alpha $ and/or S-Box) are not shown in this table).

Table 1 The best attacks on round-reduced PRINCE in the single-key model

Full size table

As stated before, most of the attacks usually considered often have impractical complexities. For instance, differential attacks and linear attacks require large amounts of chosen (respectively, known) plaintexts, both of which may be impossible to gather to begin with if the algorithm is implemented on a small device with little computer and, hence, a small throughput. Therefore, we focused our efforts on meet-in-the-middle (MitM) attacks, algebraic/logic attack where the fact that a ciphertext is the encryption of a plaintext is encoded as an equation which is fed to a solver and, surprisingly, differential attack for which we found a heuristic method decreasing significantly the data complexity.

Our contribution We describe different low data complexity attacks on round-reduced PRINCE which were submitted to the PRINCE challenge and turned out [12] to be the best ones on PRINCE reduced to four, six and eight rounds. In Sect. 3, we describe our attacks obtained using the meet-in-the-middle technique and we also show a new attack on 10 rounds with practical memory and a time complexity around $2^{68}$ encryptions. Then, we describe in Sect. 4 a low data differential attack against 6-round PRINCE, which is the fastest known. In fact, the power of the filter used to discard wrong pairs in a differential attack can be raised to the power 4 when attacking 6-round PRINCE by considering groups of pairs. In Sect. 5, we show how the equation given to a SAT-solver can be modified so as to make an attack on four rounds practical, and how to simplify our 6-round differential attack by using a SAT-solver to take care of the tedious details of the key recovery. We finally present in Sect. 6 some observations about the cycle structure of the internal rounds of PRINCE and how it implies the existence of alternative representations of the cipher highlighting a poor diffusion in some subsets of the input space. While we do not use these to attack PRINCE directly, we show that the size of these subsets remains reasonable and actually find such sets for 4-round PRINCE-core.

Extra materials including codes of practical attacks can be found at:

https://seafile.cifex-dedibox.ovh/d/ccd77d431e5c4ea28de1/.

2 Specification of PRINCE

2.1 Description of PRINCE

PRINCE is a 64-bit block cipher with a 128-bit key. It is based on a variant of the FX construction which was proposed by Kilian and Rogaway as a generalization of the DESX scheme. The master key k is split into two 64-bit parts $k = k_0 \parallel k_1$, and $k_0$ is used to generate a third subkey $k'_0 = (k_0 \ggg 1) \oplus (k_0 \gg 63)$. Both $k_0$ and $k'_0$ are used as pre- and post-whitening keys, respectively. The full version of the cipher has 12 rounds and is depicted in Fig. 1.

The encryption is quite similar to the AES and consists of a nibble-based substitution layer S and a linear layer M. The operation M can be divided into a ShiftRows operation and a matrix multiplication $M'$ operating independently on each column but not nibble oriented. Furthermore, the matrix $M'$ is an involution, and combined to the fact that the round constants satisfy the relation $RC_i \oplus RC'_i = \alpha $ where $\alpha = \textsf {C0AC29B7C97C50DD}$, the decryption process $D_{k_0,k_1,k'_0}$ is equal to the encryption process $E_{k'_0,k_1 \oplus \alpha , k_0}$. For further details about PRINCE, we refer the reader to [5].

Notations In the sequel, we denote both the plaintext and the ciphertext by p and c, respectively. For the first R rounds of 2R-round PRINCE, we denote the internal state just before (resp. after) the r-th SubNibble layer by $x_r$ (resp. $y_r$), while for the last R rounds, those internal states are denoted by $y'_r$ and $x'_r$, respectively, as shown in Fig. 1. Given a collection of messages $\{p^0, \ldots , p^m, \ldots \}$, $x^m_r[i]$ represents nibble i of state $x_r$ of message $p^m$. As PRINCE is not fully nibble oriented, we use the notation $x_r[i]_b$ to refer to the bit i of the state $x_r$ and the following relation holds for all $i \in \{0, \ldots , 15\}$ (Fig. 2):

$$\begin{aligned} x_r[i] = x_r[4i+3]_b \parallel x_r[4i+2]_b \parallel x_r[4i+1]_b \parallel x_r[4i]_b. \end{aligned}$$

Finally, we use the following notations for some functions:

R::: the composition of S and M so that $R(x) = M\big ( S(x) \big ) = SR\big ( M'(S(x)) \big )$.
$E_{k_{0 ||k_{1}}}^{r}$::: PRINCE reduced to r rounds.
$E^{c}_{k_{1}}$::: full PRINCE-core.
$E_{k_{1}}^{c,r}$::: PRINCE-core reduced to r rounds.

3 Meet-in-the-Middle Attacks

In this section, we present both the 6-round attack and the 8-round attack which won the PRINCE challenge in the chosen-plaintext category together with a new attack on ten rounds. The aim of the challenge was to find the best attacks using at most $2^{20}$ chosen plaintexts, and thus, we decided to follow the strategy used by Demirci and Selçuk on AES in [3], later improved by Dunkelman et al. in [13], Derbez et al. in [14, 15] and by Li et al. in [9]. While our 10-round attack does not fit the restriction on the data complexity, it shows that this kind of attacks is one of the most powerful on SP network.

First, we give the definition of an ordered $\delta $-set which is a particular structure of messages used in our attacks.

Definition 1

Let a $\delta $-set be a set of 16 PRINCE states that are all different in one state nibble (the active nibble) and all equal in the other state nibble (the inactive nibbles). An ordered $\delta $-set is a $\delta $-set $\{x^0,\ldots ,x^{15}\}$ such that the difference in the active nibble between $x^0$ and $x^i$ is equal to i, for $0 \le i \le 15$.

In the sequel, we consider $\delta $-sets such that nibble 7 is the active one. For such a particular set, we made the following observations which are the core of our new attacks.

Observation 1

Consider the encryption of a collection $\{p^0, p^1, \ldots , p^{15}\}$ of 16 messages through 6-round PRINCE. If the set $\{y^0_2, y^1_2, \ldots , y^{15}_2\}$ is an ordered $\delta $-set, then the ordered sequence

$$\begin{aligned} \left[ y_2^{\prime 1}[7] \oplus y_2^{\prime 0}[7], y_2^{\prime 2}[7] \oplus y_2^{\prime 0}[7], \ldots , y_2^{\prime 15}[7] \oplus y_2^{\prime 0}[7] \right] \end{aligned}$$

is fully determined by the following eight nibble parameters:

$- x_3^0[0,7,10,13] \qquad \qquad \qquad - x_3^{\prime 0}[0,7,10,13]$

Consequently, there are at most $2^{8 \times 4} = 2^{32}$ possible sequences when we consider all the possible choices of keys and ordered $\delta $-sets (out of the $2^{4 \times 15} = 2^{60}$ of the theoretically possible 15-nibble sequences).

Proof

The proof is straightforward. The goal is to propagate the differences from the state $y_2$ (which are known) to the state nibble $y_2^{\prime }[7]$. At each intermediate round, each input of an S-Box is either a parameter, not required or constant (so output differences are equal to zero). $\square $

Observation 2

Consider the encryption of a collection $\{p^0, p^1, \ldots , p^{15}\}$ of 16 messages through 8-round PRINCE. If the set $\{x^0_2, x^1_2, \ldots , x^{15}_2\}$ is an ordered $\delta $-set, then the ordered sequence

$$\begin{aligned} \left[ x_2^{\prime 1}[7] \oplus x_2^{\prime 0}[7], \ldots , x_2^{\prime 15}[7] \oplus x_2^{\prime 0}[7], y_2^{\prime 1}[6] \oplus y_2^{\prime 0}[6], \ldots , y_2^{\prime 15}[6] \oplus y_2^{\prime 0}[6] \right] \end{aligned}$$

is fully determined by the following 42 nibble parameters:

$$\begin{aligned} \begin{array}{ll} -~x_2^0[7] &{}\qquad \qquad -~x_4^{\prime 0}[0..15] \\ -~x_3^0[0,7,10,13]&{}\qquad \qquad -~x_3^{\prime 0}[0,7,10,13] \\ -~x_4^0[0..15] &{}\qquad \qquad -~x_2^{\prime 0}[7] \\ \end{array} \end{aligned}$$

Furthermore, those 42 state nibbles can be directly computed from the full state $x_4$ and four nibbles of $M^{-1}(k_1)$. Consequently, there are at most $2^{4 \times (16 + 4)} = 2^{80}$ possible sequences when we consider all the possible choices of keys and ordered $\delta $-sets (out of the $2^{4 \times 30} = 2^{120}$ of the theoretically possible 30-nibble sequences).

Proof

The proof is similar to the one of Observation 1 except the parameters are related. Indeed, from the full state $x_4$ one can directly compute $x'_4$ as no keys are involved ($x'_4 = S^{-1}(M'(S(x_4)))$). Then, we note that the four nibbles $M^{-1}(k_1)[4..7]$ are enough to compute $x_3^0[0,7,10,13]$ from $x_4$ and $x_3^{\prime 0}[0,7,10,13]$ from $x'_4$. Finally, the knowledge of $M^{-1}(k_1)[7]$ allows to compute $x_2^0[7]$ and $x_2^{\prime 0}[7]$ from $x_3^0[0,7,10,13]$ and $x_3^{\prime 0}[0,7,10,13]$, respectively. $\square $

3.1 6-Round Attack

The 6-round attack is depicted in Fig. 3, and its scenario is straightforward. First, the $2^{32}$ possible sequences given in Observation 1 are computed and stored in a hash table during a preprocessing phase. Then during the online phase, we begin by asking for the encryption of a structure of $2^{16}$ chosen plaintexts such that nibbles from 4 to 7 take all the possible values, while the other ones are constant, and pick one of them denoted $p^0$. Now, the goal of the adversary is to identify an ordered $\delta $-set containing $y_2^0$. To do so, he has to guess the five nibbles $x^0_1[4..7]$ and $x^0_2[7]$ and propagate the differences from the state $y_2$ to the plaintext. Then, he gets the corresponding ciphertexts, guesses the five nibbles $x^{\prime 0}_1[4..7]$ and $x^{\prime 0}_2[7]$ and propagates the differences from the ciphertexts to $y^{\prime }_2[7]$. Finally, he discards all the guesses which do not lead to a match in the previously built hash table. The probability for a wrong guess to pass the test is $2^{32} \times 2^{-60} = 2^{-28}$, so we expect $2^{5}$ candidates to remain at the end of the attack. The wrong ones can be discarded by replaying the attack with an other choice for $p^{0}$ without increasing the overall complexity of the attack.

The data complexity of this attack is $2^{16}$ chosen plaintexts, and the memory requirement is around $2^{32} \times 4 \times 15 \times 2^{-3} \approx 2^{34.9}$ bytes. During the online phase, ten state nibbles are guessed; however, they can assume only $2^{33}$ values once the plaintext/ciphertext pair is given. Indeed, the knowledge of the 33 bits:

$$\begin{aligned} \{(k_0 \oplus k_1)[16..27]_b, (k'_0 \oplus k_1)[16..27]_b, k_1[28..31]_b, k_0[28..32]_b \}, \end{aligned}$$

is enough to compute all of them from p and c. Thus, the time complexity of the online phase is approximately $16 \times 2^{33} \times 40 / (6 \times 64) \approx 2^{33.7}$ encryptions.

Key recovery At the end of the attack, $128 - 33 = 95$ key bits are still missing. To find them, the best way is to apply several meet-in-the-middle attacks successively. For instance, one could begin by running the attack depicted in Fig. 12 in “Appendix A” which has an overall complexity below $2^{28}$ as most key bits required in the online phase are already known.

3.2 8-Round Attack

The 8-round attack is similar to the one on six rounds and is depicted in Fig. 18. It relies on Observation 2, so the memory complexity is around $2^{80} \times 15 \times 8 \times 2^{-3} \approx 2^{83.9}$ bytes. In the online phase, the data complexity remains unchanged to $2^{16}$ chosen plaintexts, but the number of state variables to guess is increased. The identification step requires to guess the four nibbles $x^0_1[4..7]$, and then, the nine nibbles $x^{\prime 0}_1[0..7]$ and $x^{\prime 0}_2[6]$ are guessed to build the sequence from the ciphertexts. Those 13 nibbles can assume only $2^{49}$ values once the plaintext/ciphertext pair $(p^0,c^0)$ given as they all can be derived from

$$\begin{aligned} \{(k_0 \oplus k_1)[16..24,28..31]_b,(k'_0 \oplus k_1)[0..23,27..31]_b,k_0[25..27]_b,k_1[24..27]_b \}. \end{aligned}$$

Thus, the time complexity of the online phase is approximately $16 \times 2^{49} \times 52 / (8 \times 64) \approx 2^{49.7}$ encryptions and we expect $2^{49} \times 2^{80} \times 2^{-120} = 2^{9}$ candidates to remain at the end of the attack (Fig. 4).

Key recovery As for the previous attack, the most efficient way to recover the missing key bits is to perform other attacks. For instance, one could run the attack depicted in Fig. 13 (“Appendix B”) which has the same complexity than the one above since there are approximately $2^{9}$ candidates for the four active nibbles of $x_1$. Then, the search space would be small enough to perform an exhaustive search without increasing the overall complexity.

Trade-off It is possible to trade some memory against time without increasing the data complexity by noticing that for a considered structure of $2^{16}$ plaintexts, the four active nibbles of $x_3$ take all the possible values. Thus, we can fix them to 0 during the offline phase and save a factor $2^{16}$ in memory. In the other hand, we now need to run the attack for all the possible choices for $p^{0}$ increasing the time complexity by the same factor of $2^{16}$.

A more sophisticated attack against 8-round PRINCE, requiring much less memory, is described in “Appendix E.”

3.3 10-Round Attack

We now investigate PRINCE reduced to ten rounds. While we were unable to find an attack requiring less than $2^{20}$ chosen plaintexts for the PRINCE challenge, we found one competitive with the current best known attack. To describe it, we first extend the definition of a $\delta $-set as it was done in [14], then we show a meet-in-the-middle attack as the two ones above and finally we apply the differential enumeration technique ([13]).

$\delta $-set. In [14], Derbez et al. showed that the notion of $\delta $-set can be extended to set of states such that some linear combinations of state bits are constant. In the sequel, we denote by $\delta $-set a set of 16 messages such that $y_2[0..4,6,8..12,14]$ and $M'(y_2)[0..4,6,8..12,14]$ are constant, exploiting the fact that the matrix operating on the columns is not MDS.

10-round attack The basis of our attack on 10 rounds is depicted in Fig. 5. The meet-in-the-middle is performed on the four-bit equations described above. The state bytes required as the parameters of the hash table can be computed from the whole state $x_5$ and eight nibbles of the equivalent subkey $M^{-1}(k_1)$, and thus, approximately $2^{96}$ 60-bit sequences are stored. In the online phase, the 24 state nibbles needed can be computed from the following 66 key bits:

$$\begin{aligned}&\{k_0[0,20..24,28..32,52..56,60..63]_b, k_1[20..23,28..31,52..55,60..63]_b,\\&\quad (k_0 \oplus k_1)[16..19,24..27,48..51,56..59]_b,\\&\quad (k'_0 \oplus k_1)[16..19,24..26,48..51,56..58]_b\}. \end{aligned}$$

Note that this attack does not actually work because the number of sequences stored is higher than the number of possible 60-bit sequences, and thus, no key candidates are filtered. The aim of the next section is to show how to reduce the memory requirement.

Differential Enumeration Technique Li et al. applied this technique against PRINCE in [9] and successfully mounted new attacks on eight and nine rounds. The idea of this technique originally introduced by Dunkelman et al. in [13] is to store in the hash table only the sequences built from a $\delta $-set containing a message $p^0$ that belongs to a pair $(p^0,p^1)$ following a well-chosen differential characteristic. In our case, the truncated differential characteristic is depicted in Fig. 5 assuming a zero difference in hatched nibbles. Thus, we expect to store only $2^{96 + 4 - 60} = 2^{40}$ sequences in the offline phase. However, generating them is not as trivial as for the basic attack. We propose the following procedure which has a time complexity around $2^{72}$ operations:

1.
Consider a pair $(p^0,p^1)$ following the differential characteristic.
2.
$S^{-1} \circ M' \circ S$ can be seen as four invertible super S-Boxes $\S _0, \ldots , S_3$ operating on 16-bit words. Build four hash tables such that one can retrieve (x, y) from $(x \oplus y, S_i(x) \oplus S_i(y))$.
3.
Guess the difference in the active nibbles of both $y_4$ and $y'4$ and retrieve the actual value of $x_5$ and $x'_5$ for both messages of the pair.
4.
Guess the difference in the two active nibbles of the first column of $y_3$ and get back the actual values of $y_4[2,5,8,15]$.
5.
Combined with the knowledge of $x_5$, this leads to the knowledge of the four key nibbles $M^{-1}(k_1)[2,5,8,15]$. Use them to partially encrypt $x'_5$ and check whether the difference in the first column of $y'_3$ is correct.
6.
Use $M^{-1}(k_1)[15]$ to partially decrypt $y_4$ and get the difference in $x_2[15]$ and check its correctness. Do the same for the difference in $x'_2[15]$.
7.
Guess the difference in the two active nibbles of the third column of $y_3$ and get back $M^{-1}(k_1)[0,7,11,13]$.
8.
Compute the value of the missing parameters and check whether the pair follows the characteristic or not. If it does, then build the 60-bit sequence from $p^0$ and store it in the hash table.

The complexity of this procedure is dominated by the complexity of steps 4–5 which is $2^{72}$ simple operations that we estimate to be equivalent to $2^{69}$ encryptions. Now that the table is built the online phase is quite similar to the one of the offline phase:

1.
Ask for a structure of $2^{32}$ chosen plaintexts and store the ciphertexts in a hash table to identify the pairs that may follow the differential characteristic.
2.
For each pair $(p^0,p^1)$:
1. (a)
  Guess the difference in the first column of $y_1$ and of $y_2$ and deduce the corresponding value of $(k_0 \oplus k_1)[12..15]$ and $k_1[15]$. Store them in a hash table $T_0$ indexed by $k_1[15], k_0[61..63]_b$.
2. (b)
  Similarly, compute $(k'_0 \oplus k_1)[12..15]$ and $k_1[15]$ from the ciphertexts and use $T_0$ and the linear relations between $k_0$ and $k'0$ to get back the $2^{2 \times 4 + 2} \cdot 2^{-7} = 2^{3}$ corresponding values of the key nibbles above. Store those $2^{13}$ key candidates in a hash table $T_1$ indexed by $(k_0 \oplus k_1)[12..15]$, $(k'_0 \oplus k_1)[12..15]$ and $k_0[55]_b \oplus k_0[60]_b$ ($ = (k_0 \oplus k_1)[55]_b \oplus \ldots \oplus (k_0 \oplus k_1)[60]_b \oplus (k'_0 \oplus k_1)[55]_b \oplus \ldots \oplus (k'_0 \oplus k_1)[59]_b \oplus k_1[60]_b$).
3. (c)
  Repeat the two steps above but now by guessing the third column of $y_2$ and use $T_1$ to obtain the $2^{2 \times 13 - 8 - 8 - 1} = 2^{9}$ and store them in a hash table $T_2$ indexed by the difference in $y_2$. (While the match is on 33 bits, $(k_0 \oplus k_1)[12..15]$ and $(k'_0 \oplus k_1)[12..15]$ only depend on four 4-bit parameters.)
4. (d)
  Repeat the three steps above but now by guessing the third column of $y_1$ and use $T_3$ to finally retrieve all the $2^{9 + 9 - 8} = 2^{10}$ key candidates.
5. (e)
  For each key candidate, identify a $\delta $-set from $p^0$, build the 60-bit sequence and check whether it belongs to the table constructed in the offline phase. If it does, then try the key candidate.
3.
Repeat the procedure until the right key is found.

As each structure contains $2^{63}$ pairs and each of these pairs follows the differential with probability $2^{-28 - 60} = 2^{-88}$, we need $2^{25}$ structures on average. Then, for each structure we have to study only $2^{63 - 32} = 2^{31}$ pairs and for each of them we have to perform $4 \times 2^{13} + 2^{10} \times 2^{4}$ simple operations estimated to approximately $2^{12}$ encryptions. Thus, this procedure has the time complexity of $2^{25 + 31 + 12} = 2^{68}$ encryptions and requires $2^{25 + 32} = 2^{57}$ chosen plaintexts. At the end of the attack, $2^{66} \times 2^{40} \times 2^{-60} = 2^{46}$ key candidates remain. As 62 key bits are also missing, performing an exhaustive search is not a valid option. Instead, the best way to recover the key is to apply several meet-in-the-middle attacks. For instance, we can assume that when a match happens, we get back the corresponding values of the red nibbles in Fig. 5 and then deduce step by step each key bits of $M^{-1}(k_1)$ by completing the first and the third columns of $y'_3$ without increasing the overall complexity of the attack.

4 Differential Attack

In this section, we describe a new differential attack against 6-round PRINCE. There have already been some differential cryptanalyses of PRINCE, see, for example, [10], which is the best attack to date, and also [16]. Our attack uses a new method to increase the power of the filter by considering groups of pairs. This allows a rather low data complexity considering that 6 rounds are attacked and that differential attacks usually demand large amounts of chosen plaintexts.

4.1 Amplified Differential Trails

Our attack relies on some differences propagating identically in different pairs. To better describe this, we introduce the following definitions:

Encryption:

We call encryption a couple plaintext/ciphertext encrypted under a fixed key.

Pair:

A pair is a set of two encryptions where the plaintexts are separated by a known difference.

Family:

A family is a group of pairs with a particular structure. They are generated from a single pair $\big \{({p[0],\ldots ,p[b-1]}), ({p'[0],\ldots ,p'[b-1]}) \big \}$, where p[i] and $p'[i]$ are nibbles. Suppose that the input difference covers the first three nibbles so that $p[3]=p'[3]=c[3],\ldots ,p[b-1]=p'[b-1]=c[b-1]$ for some constants c[i]. Then, the family corresponding to this pair is made by exchanging some nibbles between the two encryptions in the pair so as to obtain the following pairs:

$$\begin{aligned}&\left\{ \begin{array}{c} ({p[0],p[1],p[2]}{,c[3],\ldots ,c[b-1]}) \\ ({p'[0],p'[1],p'[2]}{,c[3],\ldots ,c[b-1]}) \end{array} \right. ~~ \left\{ \begin{array}{c} ({p'[0]},{p[1],p[2]}{,c[3],\ldots ,c[b-1]}) \\ ({p[0]},{p'[1],p'[2]}{,c[3],\ldots ,c[b-1]}) \end{array} \right. \\&\left\{ \begin{array}{c} ({p[0]},{p'[1]},{p[2]}{,c[3],\ldots ,c[b-1]}) \\ ({p'[0]},{p[1]},{p'[2]}{,c[3],\ldots ,c[b-1]}) \end{array} \right. ~~ \left\{ \begin{array}{c} ({p[0],p[1]},{p'[2]}{,c[3],\ldots ,c[b-1]}) \\ ({p'[0],p'[1]},{p[2]},{c[3],\ldots ,c[b-1]}) . \end{array} \right. \end{aligned}$$

Overall, if there are n nibbles with nonzero differences in the input, then a family is made of $2^{n-1}$ pairs and $2^{n}$ encryptions.

In the case of PRINCE, we consider differential trails where the input differences are only over one column and such that all the pairs in a family follow the same trail for the first three rounds. For example, the trails we describe in Sect. 4.2 are either followed by all the elements in a family or none of them. A similar heuristic is used in [17] to perform a multiset attack on the SASAS structure.

This behavior comes from the fact that the transition in the trails we study depends only on the transitions occurring during the first round, which are the same in all pairs of a family, and on the actual value of some nibbles to which the difference has not had the time to propagate, which are the same in all encryptions of the structure.

4.2 Our Trails

We consider trails which are completely specified during the first 3 rounds and then propagate with probability 1 for 2.5 rounds before having spread to the full internal state. Figure 6 shows a first trail covering 5.5 rounds in this way which we denote $\mathcal {T}_{1}$. Each array corresponds to the differences between the internal states of two encryptions under 6-round PRINCE, and each cell gives the value of the difference: Light gray corresponds to a fully specified nonzero value at the nibble level (e.g., a difference of 1), dark gray to an unknown nonzero difference and white to a zero difference. A very similar trail with a probability two times smaller, $\mathcal {T}_{2}$, is given in Fig. 14 (see “Appendix C”). To compute their probabilities, we use the difference distribution matrix of the S-Box. If we let the input difference be $(1,1,1,0,\ldots ,0)$, then $\mathcal {T}_{1}$ has a probability of $2^{-2 \cdot 3} \cdot 2^{-2} \cdot 2^{-2-2-3} = 2^{-15}$ and $\mathcal {T}_{2}$ has a probability of $2^{-2 \cdot 3} \cdot 2^{-2} \cdot 2^{-2-3-3} = 2^{-16}$.

Querying enough families at random to find one right family for any of these would require $(2^{-15} + 2^{-16})^{-1} = 2^{14.41}$ families with an input difference over three nibbles, i.e., $2^{14.41} \cdot 2^{3} = 2^{17.41}$ encryptions. However, we can use structures to decrease this complexity.

We note that the input differences which might lead to an output difference of 1 are those listed in Table 2. As we can see, the second bit from the right in little-endian notation is only involved in 0x2 and 0xb which, taken together, only have a probability of 1/4 of leading to a difference of 1. Hence, we use the following structures where ${\mathtt {b}}$ is a bit taking all possible values and c is a constant across the structure:

$$\begin{aligned} \mathtt {{bb}c{b}~{bb} c{b}~{bb}c{b}~cccc~cccc~\ldots ~cccc}. \end{aligned}$$

We found experimentally that such structures contain several^{Footnote 1} right families with probability $2^{-5.9}$ on average when we take into account all possible input differences, i.e., $(\delta ,\delta ',\delta '',0,\ldots ,0)$ where $\delta $, $\delta '$, $\delta ''$$\in \{1, 4, c, d \}$. Hence, obtaining at least two right families only requires about $2^{9+5.9} = 2^{14.9}$ queries to the encryption oracle on average.

Table 2 Input differences which might be mapped to a difference of 1 by the S-Box of PRINCE

Full size table

4.3 Filtering Right Pairs

Full diffusion has been achieved by the sixth round. Thus, we guess 12 bits to be able to partially invert the last round on one column. More precisely, we first guess the difference in one column before the last MixColumns for one pair. Each guess leads to a candidate for the corresponding column of $k_0' \oplus k_1$ that we try against the three other pairs of the family. A guess leads to the correct nibble having a zero difference in every pair of the family with probability $2^{-3 \cdot 4} = 2^{-12}$. We repeat this independently over each column and obtain either 64 bits of key material or none at all. Since there are either several right families or none at all in the structures we consider, we only return the key guesses which come from several families as well as the corresponding families.

Note this is a powerful filter: While we expect each family from the structure to yield about one 64-bit candidate, the probability to have a collision is very small.^{Footnote 2}

The same procedure is then used one round earlier to recover 48 bits of $k_1$, corresponding to its three first columns. They allow us to partially decrypt the family and check whether differences after the third MixColumns are correct for the three first columns. The probability for a wrong family to pass this test is $2^{- 6 \cdot 4 \cdot 4} = 2^{- 96}$, so we expect to be left with right families only. Finally, the missing 16 bits of $k_1$ are recovered by performing an exhaustive search.

Complexity Querying $2^{5.9}$ structures of $2^{9}$ chosen plaintexts to obtain at least two right families, the time complexity of the key recovery procedure is around $2^{5.9 + 9} \cdot 2^{12} \cdot 7$ partial encryptions, which is equivalent to $2^{29.7} \cdot \frac{4}{6 \cdot 16} \approx 2^{25.1}$ encryptions. The memory complexity is dominated by the storage of the $2^{14.9}$ required to run this attack.

5 Combining Differential Attack with a SAT-Solver

5.1 Attacking 4-Round PRINCE with a SAT-Solver

5.1.1 Encoding PRINCE as a CNF Formula

The idea is to generate a CNF formula where a set p of Boolean variables corresponds to the 64 bits of the plaintext, c to the 64 bits of the ciphertext and k to the 128 bits of the key and such that there exists a unique assignment of the variables satisfying the CNF corresponding to the case $E_k(p)=c$.

Hence, if we generate such a formula, set the variables in p and k to a chosen value and use a SAT-solver to find an assignment satisfying the CNF formula, and the variables in c will correspond to the ciphertext. Solving such a formula is easy, an observation which we can relate to the fact that the evaluation of a block cipher has to be “easy” for the block cipher to be fast for both encryption and decryption.

Another way to use such a formula is to fix the variables in p and in c according to a known plaintext/ciphertext pair, solve the CNF and recover the key from the variables corresponding to it. Unless the number of rounds is very small (at most 3 in the case of PRINCE), solving such a system is impractical. Again, we can relate this observation to the fact that recovering the key given one or several plaintext/ciphertext pair has to be “hard.” Our approach consists in using some knowledge about the internal state of the cipher to simplify the task of the SAT-solver and make such a resolution possible for a higher number of rounds.

In order to encode a PRINCE encryption as a CNF formula, we introduce several sets of 64 Boolean variables corresponding to each step of each round: one for the internal state at the beginning of the round ($x_{r}$), one for the internal state after going through the S-Box $(y_{r})$, etc. We also use Boolean variables corresponding to the key bits.

Our task is then to create a CNF formula connecting these variables in such a way as to ensure that, for instance if $k[0,\ldots ,63]$ is fixed, it has only one solution where $y_r[0,\ldots ,63]$ is indeed the image of $x[0,\ldots ,63]_r$ by S, etc.

In order to encode the linear layer, we use the alternative representation of $M'$ from [10] where it was shown that $M'$ operates on columns of four bits independently by first rotating them by a column-dependent number of bit and then XOR-ing the parity of the column in each bit. We thus add variables corresponding to the hamming weights of the columns and encode the corresponding XOR’s as CNF formulas. The SR operation is only a permutation of the bits, so we simply set the corresponding bits to be equal.

The encoding of the S-Box is less simple to obtain. In order to find the best one, we chose to look for it directly instead of using the ANF as an intermediate step. Indeed, since the S-Box is 4x4, it is small enough for us to brute force all clauses^{Footnote 3} involving input and output bits and check whether they hold for every input.

Doing this leads us to find 29 clauses with three variables. However, they are not sufficient to completely specify the S-Box, so we used a greedy algorithm to find the best clauses with four variables to add to this encoding. In the end, we have 29 clauses with three variables and nine clauses with four variables which are such that the only solutions of the CNF made of all these clauses are all the assignments corresponding to pairs $\big ( x,S(x) \big )$ for all $x \in [0, 15]$.

These clauses with three variables can be interpreted as simple implications. For example, if $o[3,\ldots ,0]_b = S(i[3,\ldots ,0]_b)$, then the following two clauses hold with probability 1:

$$\begin{aligned} \big ( i[1]_b \vee o[2]_b \vee o[3]_b \big ) \wedge \big ( i[1]_b \vee o[1]_b \vee o[2]_b \big ). \end{aligned}$$

They are logically equivalent to the following implication:

$$\begin{aligned} \overline{i[1]_b} \implies \big ( (o[2]_b \vee o[3]_b) \wedge (o[1]_b \vee o[2]_b) \big ). \end{aligned}$$

5.1.2 Differential Over-Definition

The approach consisting in using the knowledge from a differential trail to ease the task of a SAT-solver used to attack a cryptographic primitive has been explored in [18] in order to attack MD4 and MD5. The authors of this paper first use heuristic methods to find a high probability differential trail leading to a collision and then use a SAT-solver to find a pair of messages which satisfies this trail. In the same paper, we can find the following observation:

An interesting result of our experiments with SAT solvers is the importance of having a differential path encoded in the formula.

As we shall see, this also holds for block ciphers. Attacking 4- round PRINCE-core takes more than 10 hours if we simply encode as a CNF that some plaintexts are encrypted into known ciphertexts, but we can both drastically reduce this time while breaking PRINCE with its whitening keys using differential over-definition.

Definition 2

We call differential over-definition (or DOD) the following algorithm which simplifies a CNF formula knowing that the variables correspond to bits of the internal state of an encryption following a certain trail.

For all pairs of variables in the CNF, proceed as follows:

If they are assumed to be equal, replace all occurrences of the first one by the second one.
If they are assumed to be different, replace all occurrences of the first one by the negation of the second one.

While the idea behind this algorithm is simple, it is necessary for cryptographers to implement it efficiently “by hand.” Indeed, the only input of a SAT-solver is a CNF formula, i.e., merely a list of clauses from which deriving what variables are equal to each other without knowledge of the structure of the problem is far from trivial. For instance, it would be necessary for the SAT-solver to “understand” that the set of clauses used to model one S-Box call all correspond to a unique function so that identical inputs lead to identical outputs, all this without having any distinction between the input and output bits. That is why differential over-definition, an easy algorithm for the cryptographer to implement, is a valuable preprocessing step when using a SAT-solver for cryptography leading to gains in time complexity of several orders of magnitude.

This algorithm can be implemented efficiently using a hash table containing the correspondences between the variables. Once this algorithm has been run, the CNF is over-defined: The solution would have been such that the equalities hold anyway, but there are less variables and less clauses in the CNF. However, if the pair actually does not follow the trail, the CNF has become unsatisfiable. This is a difference between our work and the one described in [18]: We do not always know before hand whether the CNF has a solution. We can think of this as a trade-off between “solving one CNF known to be true” and “solving many over-defined CNF’s which may or may not be true”: The second approach loses time by requiring several calls to a SAT-solver, but these calls take less time thanks to the over-definition.

Such an over-definition can be used in different ways.

1.
Propagating only the zero differences holding with probability 1 inside a group of eight encryptions with many zero differences is enough to reduce the time complexity of an attack on four rounds from more than 10 hours to a few seconds (see below). Furthermore, such a formula is always true.
2.
Instead of implementing an algorithm recovering the key from a pair following a particular trail by peeling of layer after layer of encryption in our attack on six rounds described in the remainder of this section, we simply reused the code of our attack on four rounds and over-defined the CNF modeling the encryptions of right pairs according to the high probability trail we used.

We implemented the attack described in Algorithm 1 to attack 4-round PRINCE (with its whitening keys) using the SAT-solver Minisat [19] and obtained an average total time of 5.13s and average time spent solving the CNF of 3.06s. The designers of PRINCE did not consider SAT-based attacks, but they did investigate algebraic attacks. They manage to attack 4-round PRINCE-core in less than 2s, while our attack requires about 5s to attack 4-round PRINCE, a cipher which uses twice as much key material.

5.2 Differential Attacks on 6-Round PRINCE

Another possible use for a SAT-solver is the handling of the tedious details of an actual attack implementation, e.g., the finding of an efficient guessing strategy. We illustrate this by performing the recovery of the second chunk of the master key in our 6-round differential attack using a SAT-solver. Pseudo-code describing this attack on 6-round PRINCE is provided in Algorithm 2.

We ran this attack ten times and found that about $2^{5.75}$ structures were needed on average. The filtering step is the most time-consuming: Finding a right pair requires about 6 minutes, while the SAT-solver requires about 0.5s to recover the full key or (rarely) to discard the pair. For this reason, we approximate the complexity of this attack by the complexity of its filtering step. Memory complexity is dominated by the SAT-solver but is (well) below 1 GB, i.e., (well) below $2^{27}$ 64-bit blocks.

6 Structural Analysis of PRINCE

The $\alpha $-reflection introduced along with PRINCE [5] is the name given to the following property of a block cipher $E_{k}$: $E_{k}^{-1} = E_{{k \oplus \alpha }}$. In other words, there is a constant $\alpha $ such that decryption for a key k is the same operation as encryption under key $k \oplus \alpha $. PRINCE-core implements this property by having a three-part structure as described here:

$$\begin{aligned} E^{c}_{k_{1}} = F_{k_{1} \oplus \alpha }^{-1} \circ I\circ F_{k_{1}}, \end{aligned}$$

where $F_{k}$ corresponds to five rounds of a classical substitution–permutation network construction and where $I$ is an involution.

Since we are going to study the structure of the cycles of different functions in a fashion similar to the way Biryukov analyzed the inner rounds of some involutional ciphers in [20], we define the cycle type of a permutation.

Definition 3

The cycle type of a permutation $\pi $ is an (ordered) multiset containing the cycle lengths of the permutation. The cycle type of $\pi $ is denoted by $\mathcal {L}(\pi )$.

For instance, for $\pi = (0 ~ 1 ~ 2) (3 ~ 4) ( 5 ~ 6)$ we have $\mathcal {L}(\pi ) = \{2, 2, 3\}$, while for $\pi = (0 ~ 1 ~ 2) (3 ~ 4 ~ 5) (6)$, $\mathcal {L}(\pi ) = \{1, 3, 3\}$. In what follows, we do not represent the round constants for the sake of simplicity. However, not only do our result hold in their presence but we could actually generalize them to any key schedule preserving the fact that the subkeys of symmetric rounds have a XOR equal to $\alpha $.

6.1 Small Cycles in Round-Reduced PRINCE

The central involution is $I= S^{-1} \circ M' \circ S$. Therefore, it is isomorphic to $M'$, a linear involution operating on each column of the internal state independently. It is easy to check experimentally the result given in [7] stating that $M'$ has exactly $2^{32}$ fixed points, meaning that $I$ also has $2^{32}$ fixed points. Therefore, $I$ has $2^{32}$ cycles of length 1 and $2^{63}-2^{31}$ cycles of length 2.

The cycle type of $I^{\alpha }: x \mapsto I(x) \oplus \alpha $ is more sophisticated but still contains a fair amount of small cycles. After noting that both $I$ and $x \mapsto x \oplus \alpha $ operate on each column of the internal space independently, we denote $I^{\alpha }_{i}$ the restriction of $x \mapsto I(x) \oplus \alpha $ to column i and $I_{i}$ that of $I$. Since each of the $I^{\alpha }_{i}$’s operates only on a space of size $2^{16}$, it is easy to generate their complete cycle structures independently by searching the whole space. Each $I^{\alpha }_{i}$ has a cycle type made of many “small” cycles, the largest having a length of 2844. This is explained by the fact that both $I$ and $x \mapsto x \oplus \alpha $ are involutions and each column of $I$ has exactly $2^{8}$ fixed points. Thus, most of the cycles have a particular structure described in [21] which we recall in Fig. 7. We remark that to each cycle of $I^{\alpha }_{i}$ correspond two fixed points of $I_{i}$.

After generating the cycle type for each $I^{\alpha }_{i}$, we combine them to obtain the cycle type of $x \mapsto I(x) \oplus \alpha $ using Algorithm 3. The cycle type of this function is too complex to be printed completely, but some information extracted from it is given in Table 3. If we pick x uniformly at random, the expected length of the cycle it is on is $2^{30.7}$.

Table 3 Information about the cycle type of $I^{\alpha }$, where $\ell (x)$ is the length of the cycle on which x is

Full size table

Recall that $E_{k_{1}}^{c,4}$ is the permutation of $\{ 0,1 \}^{64}$ corresponding to an encryption under key $k_{1}$ by PRINCE-core reduced to four rounds. Then, $x \mapsto E_{k_{1}}^{c,4}(x) \oplus \alpha $ has the same cycle type as $I^{\alpha }$ due to the cancellation of the last round of one encryption with the first round of the next. Indeed, to each cycle of this function corresponds one of $I^{\alpha }$, as illustrated in Fig. 8 where a cycle $(x_{0}, x_{1}, x_{2}, x_{3})$ of length 4 of $x \mapsto E_{k_{1}}^{c,4}$ is represented along with the corresponding cycle of $I^{\alpha }$ (dashed line).

A first consequence of these observations is the existence of a distinguisher for 4-round PRINCE-core requiring about $2^{27.4}$ adaptatively chosen plaintexts. As stated in Table 2, an element picked at random is on a cycle of length at most $2^{15}$ with probability $2^{-12.4}$. Therefore, we repeat multiple times the experiment consisting in picking an element x uniformly at random and then check whether it is on a cycle of length at most $2^{15}$ by iterating $x \mapsto E_{k_{1}}^{c,4}(x) \oplus \alpha $ at most $2^{15}$ times. The experiment is a success if x is on a cycle of length at most $2^{15}$. If the permutation is $E_{k_{1}}^{c,4}$ for some $k_{1}$, then its probability of success is $2^{-12.4}$, but if the permutation is a random permutation,^{Footnote 4} then the probability of success becomes $2^{-49}$. We confirmed experimentally the success probability of this experiment for $E_{k_{1}}^{c,4}$.

A second consequence is the existence of “small” sets of plaintext/ciphertext encryptions where the set of the ciphertexts is the image of the set of the encryptions by a function significantly simpler than a PRINCE encryption. This topic is studied in the next section.

6.2 Simplifications of PRINCE’s Representation

The particular cycle types of the round-reduced versions of PRINCE studied above lead to simpler alternative representations of the encryption algorithm.

6.2.1 Consequences of the Cycle Type of $I$

Suppose that an encryption is such that the input of $I$ is one of the $2^{32}$ fixed points of this function. Then, the key addition before and after this function cancels each other so that only the addition of $\alpha $ remains. Then, since M is linear, the operations $M^{-1} \circ (\oplus \alpha ) \circ M$ become simply the addition of $M^{-1}(\alpha )$. Thus, the four center rounds— minus the first and last key addition—become a simple S-Box layer which we denote $S'$ and which is defined by

$$\begin{aligned} S'(x) = S^{-1}\big ( S(x) \oplus M^{-1}(\alpha ) \big ). \end{aligned}$$

This simplifying process is summarized in Fig. 9. Note that if $M^{-1}(\alpha )$ has any nibble equal to 0, then the function $S'$ is the identity for this nibble. However, for the value of $\alpha $ chosen by the designers of PRINCE, there is no such nibble.

The simplification goes further. Indeed, since $S'$ operates only at the nibble level, it commutes with the operations SR and $SR^{-1}$ (up to a reordering of the S-Boxes in $S'$). Therefore, if we add one round before and one round after $S'$, we can replace $SR^{-1} \circ S' \circ SR$ by $S''$ where $S''$ is another S-Box layer. Hence, six rounds of PRINCE operate on each column of the internal state independently: Each output bit depends only on 16 bits of the input, 28 bits^{Footnote 5} of $k_{1}$ and at most 18 bits of $k_{0}$. This simplification is summarized in Fig. 10.

Similar simplifications occur if instead of having a fixed point we have a particular collision between two encryptions. This setting corresponds to the so-called mirror slide attack described by Dunkelman et al. in [22]. Consider two encryptions $(p^{0},c^{0})$ and $(p^{1},c^{1})$ by PRINCE-core as follows:

$$\begin{aligned} c^{0} = E^{c}_{k_{1}}(p^{0})&= \big ( F_{k_{1} \oplus \alpha }^{-1} \circ I\circ F_{k_{1}} \big ) (p^{0}) \\ c^{1} = E^{c}_{k_{1}}(p^{1})&= \big ( F_{k_{1} \oplus \alpha }^{-1} \circ I\circ F_{k_{1}} \big ) (p^{1}) \end{aligned}$$

which are such that $F_{k_{1}}(p^{0}) = I\big ( F_{k_{1}}(p^{1}) \big )$. In this case, we have that

$$\begin{aligned} c^{0}&= \big ( F_{k_{1} \oplus \alpha }^{-1} \circ F_{k_{1}} \big ) (p^{1}) \\ c^{1}&= \big ( F_{k_{1} \oplus \alpha }^{-1} \circ F_{k_{1}} \big ) (p^{0}), \end{aligned}$$

where six rounds of $F_{k_{1} \oplus \alpha }^{-1} \circ F_{k_{1}}$ can be simplified exactly as described and therefore only operate on each column separately.

In conclusion, if an encryption is such that the input of $I$ is a fixed point of this function or if two encryptions form a mirror slide pair, then four rounds of PRINCE consist simply in 16 parallel operations on each nibble and six rounds of PRINCE in four parallel operations on each column.

6.2.2 Consequences of the Cycle Type of $I^{\alpha }$

Consider a sequence of plaintexts $(p^{0}, \ldots , p^{\ell -1})$ and their corresponding ciphertexts $(c^{0}, \ldots , c^{\ell -1})$ such that the input $x^{i}_{5} \oplus k_{1}$ of the sixth round for the plaintext $p^{i}$ is the image of $x^{i-1}_{5} \oplus k_{1}$ by $I^{\alpha }$. We call such a sequence a cycle set, and we give a representation of such a sequence on Fig. 11: If two values are equal, then they are connected by a line; red lines correspond to the cycle of $I^{\alpha }$ this set is built out of and blue lines correspond to the propagation of these equalities through identical operations, namely $x \mapsto k_{1} \oplus R^{-1}(x \oplus k_{1})$.

There is a unique function mapping $p^{i}$ to $c^{i-1}$ in every cycle set which corresponds to the encryption algorithm where the four center rounds have been removed and replaced by a simple addition of $\alpha $. This means that this function undergoes the simplifications described above except that these cover two more rounds. In particular, for 6-round PRINCE-core, the function mapping $p^{i}$ to $c^{i-1}$ only operates at the nibble level and, for 8-round PRINCE-core, it operates at the column level. At least 10 rounds are necessary to obtain full diffusion out of the 12 PRINCE has.

The cycle sets we consider cover the four center rounds of PRINCE, but it is possible to generalize this construction to an arbitrary amount of rounds. However, the cycle set sizes are abnormally small in this case because of the cycle type of $I^{\alpha }$. Indeed, a random plaintext/ciphertext pair is in a cycle set of size $2^{30.7}$ and in a cycle set of size smaller than $2^{15}$ with probability $2^{-12.4}$. In other cases, including a priori if we have a cycle covering at least six rounds, the expected size of a cycle set is the expected size of the cycle of a random permutation a random element is on, namely $2^{63}$.

Should the cycle sets of PRINCE become identifiable, the security of up to eight rounds may be compromised as the alternative versions of the cipher we described in this section are much weaker than the original cipher. Furthermore, since small cycles are not unlikely to be found, the data complexity of such an attack may remain feasible.

7 Conclusion

We looked for practical attacks which would hinder the security provided by round-reduced versions of PRINCE in a realistic framework provided by the designers of this cipher. We found that approaches based on a meet-in-the-middle, SAT-based or, surprisingly, differential framework can all lead to practical attacks on up to half of the rounds. We checked our results by actually implementing one of our attacks. As a matter of fact, our attacks were the best submitted to the PRINCE challenge for four, six and eight rounds. Furthermore, during our investigations on PRINCE we discovered a new attack on ten rounds which despite its data complexity of $2^{57}$ chosen plaintexts has a reasonable complexity and a very (very!) motivated adversary could run it.

We also identified some simplifications of the encryption occurring because of the small cycles of the inner rounds of this block cipher, thus shedding new light on the consequences of the $\alpha $-reflection as it is implemented in PRINCE.

Notes

Actually, a structure of size $2^{12}$ where the first three nibbles take all values contains 64 right families with probability about $2^{-5.9}$. If we reduce these to form the structures of $2^{9}$ plaintext/ciphertext encryptions we described, only some of these 64 families are still present, hence the presence of either 0 or several right families in a structure.
Each structure yields $2^{9-3}=2^6$ families for each of the $4^3$ interesting input differences so that we consider the families by groups of $2^{12}$. This implies that a collision has a probability of about ${2^{12} \atopwithdelims ()2} \cdot 2^{-64} \approx 2^{-41}$.
A clause is the logical OR of several variables, e.g., $a \vee b$, a, $\overline{a} \vee b \vee \overline{c}$ where $\overline{x}$ is the negation of x.
Recall that the probability for x to be on a cycle of length $\ell $ for a permutation of $[0, n-1]$ is 1/n. Indeed, let $x_0 = x$, we require that $x_1 = p(x_0) \notin \{x_0\}$, $x_2 = p(x_1) \notin \{x_0, x_1\}$, $\ldots $ so the probability that x belongs to an $\ell $-cycle is $\frac{n-1}{n} \cdot \frac{n-2}{n-1} \ldots \frac{n-\ell }{n-\ell + 1} \cdot \frac{1}{n - \ell } = 1/n$. Hence, the probability that the length is smaller than $2^{15}$ for a permutation of $[0, 2^{64}-1]$ is $\sum _{\ell =1}^{2^{15}} 2^{-64} = 2^{-49}$.
In each column, 16 bits from the corresponding column of $k_{1}$ are used as well as 16 bits from the corresponding column of $SR^{-1}(k_{1})$. Since the top nibble of these two sets is the same, we are left with $32-4=28$ bits.

References

E. Biham, A. Shamir, Differential cryptanalysis of DES-like cryptosystems. J. CRYPTOLOGY4(1), 3–72(1991).
Article MathSciNet Google Scholar
M. Matsui, Linear cryptoanalysis method for DES cipher. in Advances in Cryptology—EUROCRYPT 93, Workshop on the Theory and Application of of Cryptographic Techniques, Lofthus, Norway, May 23–27, 1993, (Proceedings, 1993), pp. 386–397
H. Demirci, A.A. Selçuk, A meet-in-the-middle attack on 8-round AES. in Fast Software Encryption, (Springer, 2008), pp. 116–126
N. Semiconductors, The PRINCE challenge (2014). https://www.emsec.rub.de/research/research_startseite/prince-challenge/
J. Borghoff, A. Canteaut, T. Güneysu, E.B. Kavun, M. Knezevic, L.R. Knudsen, G. Leander, V. Nikov, C. Paar, C. Rechberger, et al, PRINCE—a low-latency block cipher for pervasive computing applications. in Advances in Cryptology–ASIACRYPT 2012. (Springer, 2012), pp. 208–225
A. Biryukov, L. Perrin, State of the art in lightweight cryptography. http://cryptolux.org/index.php/Lightweight_Cryptography
H. Soleimany, C. Blondeau, X. Yu, W. Wu, K. Nyberg, H. Zhang, L. Zhang, Y. Wang, Reflection cryptanalysis of PRINCE-like ciphers. in S. Moriai, ed., Fast Software Encryption - 20th International Workshop, FSE 2013, Singapore, March 11-13, 2013. Revised Selected Papers. Vol. 8424 of Lecture Notes in Computer Science (Springer, 2013), pp. 71–91. https://doi.org/10.1007/978-3-662-43933-3_5
J. Jean, I. Nikolić, T. Peyrin, L. Wang, S. Wu, Security analysis of prince. in Fast Software Encryption: 20th International Workshop, FSE 2013, Singapore, March 11–13, 2013. Revised Selected Papers. Vol. 8424., (Springer, 2014), 92
L. Li, K. Jia, X. Wang, Improved meet-in-the-middle attacks on aes-192 and prince. Cryptology ePrint Archive, Report 2013/573 (2013). http://eprint.iacr.org/
A. Canteaut, T. Fuhr, H. Gilbert, M. Naya-Plasencia, J.R. Reinhard, Multiple differential cryptanalysis of round-reduced PRINCE (full version). Cryptology ePrint Archive, Report 2014/089 (2014). http://eprint.iacr.org/
P. Morawiecki, Practical attacks on the round-reduced PRINCE. IET Inf. Secur.11(3), 146–151 (2017).
Article Google Scholar
C. Rechberger, Update on the 10000 euro PRINCE cipher-breaking challenge: Results of round-1 (2014). http://crypto.2014.rump.cr.yp.to/d037206eda8f9278cef1ea26cd62e51f.pdf
O. Dunkelman, N. Keller, A. Shamir, Improved single-key attacks on 8-round AES-192 and AES-256. in Advances in Cryptology - ASIACRYPT 2010 - 16th International Conference on the Theory and Application of Cryptology and Information Security, Singapore, December 5–9, 2010. Proceedings. pp. 158–176 (2010)
P. Derbez, P. Fouque, Exhausting Demirci-Selçuk meet-in-the-middle attacks against reduced-round AES. in Fast Software Encryption - 20th International Workshop, FSE 2013, Singapore, March 11–13, 2013. Revised Selected Papers. pp. 541–560 (2013)
P. Derbez, P.A. Fouque, J. Jean, Improved Key Recovery Attacks on Reduced-Round AES in the Single-Key Setting. in T. Johansson, P.Q. Nguyen, eds., EUROCRYPT. Vol. 7881 of Lecture Notes in Computer Science., (Springer, 2013), pp. 371–387
F. Abed, E. List, S. Lucks, On the security of the core of prince against biclique and differential cryptanalysis. Cryptology ePrint Archive, Report 2012/712 (2012). http://eprint.iacr.org/
A. Biryukov, A. Shamir, Structural cryptanalysis of SASAS, in B. Pfitzmann, ed., Advances in Cryptology—EUROCRYPT 2001. Vol. 2045 of Lecture Notes in Computer Science. (Springer, Berlin, Heidelberg, 2001), pp. 395–405
I. Mironov, L. Zhang, Applications of sat solvers to cryptanalysis of hash functions. In Biere, A., Gomes, C., eds.: Theory and Applications of Satisfiability Testing—SAT 2006. Volume 4121 of Lecture Notes in Computer Science. (Springer, Berlin, Heidelberg, 2006), pp. 102–115
N. Eén, N. Sörensson, An extensible SAT-solver. in Theory and applications of satisfiability testing, (Springer, 2004), pp. 502–518
A. Biryukov, Analysis of involutional ciphers: Khazad and Anubis. in Fast Software Encryption, (Springer, 2003), pp. 45–53
J.H. Moore, G.J. Simmons, Cycle structure of the DES with weak and semi-weak keys. in Advances in Cryptology–CRYPTO’86, (Springer, 1987), pp. 9–32
O. Dunkelman, N. Keller, A. Shamir, Minimalism in cryptography: The Even-Mansour scheme revisited. in Advances in Cryptology—EUROCRYPT 2012. (Springer, 2012), pp. 336–354

Download references

Acknowledgements

The authors thank Alex Biryukov for useful discussions about the differential attack on PRINCE. We also thank NXP Semiconductors for organizing the PRINCE challenge and sending us our rewards! The work of the authors was supported by the CORE ACRYPT project (ID C12-15-4009992) and funded by the Fonds National de la Recherche (Luxembourg).

Author information

Authors and Affiliations

Univ Rennes, CNRS, IRISA, Rennes, France
Patrick Derbez
Inria, Paris, France
Léo Perrin

Authors

Patrick Derbez
View author publications
You can also search for this author in PubMed Google Scholar
Léo Perrin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Patrick Derbez.

Additional information

Communicated by Vincent Rijmen.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Patrick Derbez and Léo Perrin were supported by the CORE ACRYPT project from the Fond National de Recherche (Luxembourg).

This article is the full version of the article with the same title published at Fast Software Encryption 2015.

Appendices

A The Second 6-Round Attack

See Fig. 12.

B The Second 8-Round Attack

See Fig. 13.

C The Second 5.5-Round Trail

See Fig. 14.

D Simple Meet-in-the-Middle Attacks

In this section, we describe two attacks on round-reduced PRINCE. Both are simple meet-in-the-middle attacks requiring only few known plaintext/ciphertext pairs to work.

1.1 D.1 4-Round Attack

1.1.1 Simple Attack

We begin by presenting a simple attack on 4-round PRINCE with a complexity around $2^{40}$ which is depicted in Fig. 15. It is based on the two following equations involving few bits of the middle states y and $y'$:

$$\begin{aligned} \left\{ \begin{array}{l} y[38]_b \oplus y[46]_b = y'[38]_b \oplus y'[46]_b \\ y[39]_b \oplus y[43]_b \oplus y[47]_b = y'[47]_b \end{array} \right. \end{aligned}$$

Let $K_p$ (resp. $K_c$) be the key bits required to compute $y[38]_b \oplus y[46]_b$ and $y[39]_b \oplus y[43]_b \oplus y[47]_b$ from p (resp. $y'[38]_b \oplus y'[46]_b$ and $y'[47]_b$ from c). Then, the attacks scenario is:

1.
Ask for n known plaintext/ciphertext pairs.
2.
Let T be an empty hash table.
3.
For all possible values of $K_p$, do
1. (a)
  for j from 1 to n compute $y^{j}[38]_b \oplus y^{j}[46]_b$ and $y^{j}[39]_b \oplus y^{j}[43]_b \oplus y^{j}[47]_b$ from the j-th plaintext
2. (b)
  make the sequence $s {=} \left[ y^{1}[38]_b {\oplus } y^{1}[46]_b,y^{1}[39]_b {\oplus } y^{1}[43]_b {\oplus } y^{1}[47]_b, \ldots \right] $
3. (c)
  add the value of $K_p$ to $T[s]_b$
4.
For all possible values of $K_c$, do
1. (a)
  for j from 1 to n compute $y'^{j}[38]_b \oplus y'^{j}[46]_b$ and $y^{j}[47]_b$ from the j-th ciphertext
2. (b)
  make the sequence $s = \left[ y'^{1}[38]_b \oplus y'^{1}[46]_b,y'^{1}[47]_b, \ldots \right] $
3. (c)
  check whether T[s] is empty or not. If T[s] is empty, then the guess of $K_c$ is wrong. Otherwise, T[s] contains the possible value(s) for $K_p$ and, if n is large enough, this will happen only for the right guess.

In our case, both $K_p$ and $K_c$ can assume only $2^{40}$ values and thus 40 plaintext/ciphertext pairs are enough to get only one candidate for $K_p \cup K_c$ with high probability. Thus, the complexity of this attack is 40 known plaintexts and around $2^{40}$ for both time and memory.

Saving data and memory First, we stress that $K_P$ (resp. $K_C$) can be safely replaced by any basis of the vector space spawned by itself, so let consider it as a vector space over $\mathbb {F}_2$. Now, we are interested by the vector space $K_P \cap K_C$. Here, a basis of this vector space is:

$$\begin{aligned}&\{k_1[36..39]_b, k_1[44..47]_b, k_0[37..39]_b, k_0[45..47]_b, k_0[40]_b \oplus \ldots \\&\quad \oplus k_0[44]_b \oplus k_1[40]_b \oplus \ldots \oplus k_1[43]_b\}. \end{aligned}$$

Thus, only 33 plaintext/ciphertext pairs are needed to discard the wrong guesses and the attack scenario becomes:

1.
Ask for n known plaintext/ciphertext pairs.
2.
For all possible values of $K_p \cap K_c$, do
1. (a)
  Let T be an empty hash table.
2. (b)
  Partially encrypt/decrypt the plaintext/ciphertext pairs.
3. (c)
  For all possible values of $K_p$, do
  1. (i)
    for j from 1 to n compute $y^{j}[38]_b \oplus y^{j}[46]_b$ and $y^{j}[39]_b \oplus y^{j}[43]_b \oplus y^{j}[47]_b$ from the j-th plaintext
  2. (ii)
    make the sequence $s = \left[ y^{1}[38]_b \oplus y^{1}[46]_b,y^{1}[39]_b \oplus y^{1}[43]_b \oplus y^{1}[47]_b, \ldots \right] $
  3. (iii)
    add the value of $K_p$ to T[s]
4. (d)
  For all possible values of $K_c$, do
  1. (i)
    for j from 1 to n compute $y'^{j}[38]_b \oplus y'^{j}[46]_b$ and $y^{j}[47]_b$ from the j-th ciphertext
  2. (ii)
    make the sequence $s = \left[ y'^{1}[38]_b \oplus y'^{1}[46]_b,y'^{1}[47]_b, \ldots \right] $
  3. (iii)
    check whether T[s] is empty or not. If T[s] is empty, then the guess of $K_c$ is wrong. Otherwise, T[s] contains the possible value(s) for $K_p$ and, if n is large enough, this will happen only for the right guess.

All in all the memory requirement is approximately $25 \times 2^{25} \times 2^{-3} \approx 2^{26.7}$ bytes and the time complexity around $33 \times 2 \times 2^{40} \times 40 / (4 \times 64) \approx 2^{43.4}$ encryptions.

Key recovery At the end of the attack, we know (or have very few candidates for) $K_p \cup K_c$. But we still need $128 - 65 = 63$ key bits to fully recover $k_0$ and $k_1$. Performing an exhaustive search at this point would increase the overall complexity of the attack and thus is not a good idea. Instead, it is better to perform another meet-in-the-middle attacks successively. For instance, the attack depicted in Fig. 16 allows to recover 20 more key bits with a time complexity around $2^{20}$.

1.2 6-Round Attack

The 4-round attack can be extended to a 6-round attack as depicted in Fig. 17. Now, the dimension of both $K_p$ and $K_c$ is 96 and the dimension of the intersection is 64. Thus, it leads to an attack requiring $(2 \times 96 - 64)/2 = 64$ known plaintext/ciphertext pairs, with a memory complexity around $(96 - 64) \times 2^{96-64} \times 2^{-3} = 2^{34}$ bytes and a time complexity of approximately $2 \times 64 \times 2^{96} \times 104 / (6 \times 64) \approx 2^{101.1}$ encryptions.

Finally, an exhaustive search can be performed to retrieve $k_0$ and $k_1$ without increasing the overall complexity.

E Improved 8-Round Attack

In this section, we concisely describe a second attack against 8-round PRINCE, still based on Demirci and Selçuk technique, requiring much less memory than the attack described in Sect. 3.2.

1.1 Step 1

The first step of the attack is depicted in Fig. 18. This is a classical Demirci and Selçuk attack against 8-round PRINCE. The nibble requiblack in the online phase is in gray and can take $2^{8 \times 4} = 2^{32}$ values. In another hand, nibbles requiblack in the offline phase are in black and can assume $2^{80}$ values thanks to the (lack of) key schedule.

Here, $\delta $-sets contain $2^{4} = 16$ messages and the check is performed on sequences of $(16 - 1) = 15$ differences in one nibble, i.e., on 60-bit sequences. As more than $2^{60}$ sequences are computed during the offline phase, the attack does not filter values of gray nibbles.

1.2 Step 2

The idea is to switch the online and the offline phases (and actually they are now both performed online). Given a plaintext/ciphertext pair (P, C), we compute the $2^{32}$ possible 60-bit sequences and store them in a hash table. Then for each value of the black nibbles, we compute the corresponding 60-bit sequence and check whether it belongs to the hash table. Only $2^{80 - 60 + 32} = 2^{52}$ values should pass this test.

1.3 Step 3

We notice that, given a structure of $2^{16}$ plaintexts such that the first column is active, while the other ones are constant, each column of state $y_3$ takes all the possible values too. Hence, we can fix the value to 0 of four black nibbles of $y_4$ (shiftrows of a column), decreasing the time complexity by a factor $2^{16}$. As a result, the hash table now has to contain $2^{48}$ sequences, $2^{32}$ for each of the $2^{16}$ plaintext/ciphertext pairs.

1.4 Step 4

To decrease further the number of possible values for the black nibbles, we apply the same attack, but we change the nibble on which the check is performed as shown in Fig. 19. So we have to store an other table containing $2^{48}$ 60-bit sequences and guess the value of the black nibble of $y'_2$. Only $2^{52 + 4 - 60 + 48} = 2^{44}$ values should pass this test.

1.5 Step 5

As in step 4, we perform the two other attacks corresponding to the two other positions for the black nibble on $y'_2$. Only $2^{28}$ values should pass this test.

1.6 Step 6

We switch again online and offline phases, coming back to a classical Demirci and Selçuk attack. We run successively the four previous attacks to retrieve a unique plaintext/ciphertext pair together with a unique value for the first column of $y_1$ and the whole state $y'_1$. Indeed, we did not check whether the plaintext/ciphertext pair and the first column of $y_1$ are the same for the four attacks, while they have to. Thus, only the right ones should remain at the end of the attack.

1.7 Step 7

The missing 48 bits of the key can be exhausted.

1.8 Complexity

The data complexity is $2^{16}$ chosen plaintexts. The memory complexity is around $4 \times 2^{48}$ 60-bit sequences which is equal to $2^{52.9}$ bytes. The time complexity is dominated by steps 2/3, which is equivalent to $2^{64} \times 2^{4} \times (42 - 4)/(8 \times 16) \simeq 2^{66.25}$ encryptions.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Derbez, P., Perrin, L. Meet-in-the-Middle Attacks and Structural Analysis of Round-Reduced PRINCE. J Cryptol 33, 1184–1215 (2020). https://doi.org/10.1007/s00145-020-09345-0

Download citation

Received: 31 March 2016
Revised: 12 February 2020
Published: 04 March 2020
Issue Date: July 2020
DOI: https://doi.org/10.1007/s00145-020-09345-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Meet-in-the-Middle Attacks and Structural Analysis of Round-Reduced PRINCE

Abstract

Similar content being viewed by others

Meet-in-the-Middle Attacks and Structural Analysis of Round-Reduced PRINCE

Faster Key Recovery Attack on Round-Reduced PRINCE

Improving First-Order Threshold Implementations of SKINNY

1 Introduction

2 Specification of PRINCE

2.1 Description of PRINCE

3 Meet-in-the-Middle Attacks

Definition 1

Observation 1

Proof

Observation 2

Proof

3.1 6-Round Attack

3.2 8-Round Attack

3.3 10-Round Attack

4 Differential Attack

4.1 Amplified Differential Trails

4.2 Our Trails

4.3 Filtering Right Pairs

5 Combining Differential Attack with a SAT-Solver

5.1 Attacking 4-Round PRINCE with a SAT-Solver

5.1.1 Encoding PRINCE as a CNF Formula

5.1.2 Differential Over-Definition

Definition 2

5.2 Differential Attacks on 6-Round PRINCE

6 Structural Analysis of PRINCE

Definition 3

6.1 Small Cycles in Round-Reduced PRINCE

6.2 Simplifications of PRINCE’s Representation

6.2.1 Consequences of the Cycle Type of \(I\)

6.2.2 Consequences of the Cycle Type of \(I^{\alpha }\)

7 Conclusion

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendices

A The Second 6-Round Attack

B The Second 8-Round Attack

C The Second 5.5-Round Trail

D Simple Meet-in-the-Middle Attacks

1.1 D.1 4-Round Attack

1.1.1 Simple Attack

1.2 6-Round Attack

E Improved 8-Round Attack

1.1 Step 1

1.2 Step 2

1.3 Step 3

1.4 Step 4

1.5 Step 5

1.6 Step 6

1.7 Step 7

1.8 Complexity

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation