Abstract
Lattice-based cryptography relies on generating random bases which are difficult to fully reduce. Given a lattice basis (such as the private basis for a cryptosystem), all other bases are related by multiplication by matrices in \(GL(n,\mathbb {Z})\). We compare the strengths of various methods to sample random elements of \(GL(n,\mathbb {Z})\), finding some are stronger than others with respect to the problem of recognizing rotations of the \(\mathbb {Z}^n\) lattice. In particular, the standard algorithm of multiplying unipotent generators together (as implemented in Magma’s RandomSLnZ command) generates instances of this last problem which can be efficiently broken, even in dimensions nearing 1,500. Likewise, we find that the random basis generation method in one of the NIST Post-Quantum Cryptography competition submissions (DRS) generates instances which can be efficiently broken, even at its 256-bit security settings. Other random basis generation algorithms (some older, some newer) are described which appear to be much stronger.
T. L. Blanks—Supported by a National Science Foundation Graduate Research Fellowship.
S. D. Miller—Supported by National Science Foundation Grants CNS-1526333 and CNS-1815562.
1 Introduction
In cryptography one often encounters problems which are easy to solve using a secret private basis of a lattice \({\varLambda }\subset {{\mathbb {R}}}^n\), but are expected to be difficult to solve using suitably-chosen public bases. Famous examples include the Shortest Vector Problem (SVP) and Closest Vector Problem (CVP).
In [17] Lenstra and Silverberg posed the challenge of whether highly-symmetric lattices have hard bases, and proved several interesting results along these lines (related to earlier work of Gentry-Szydlo [10]; see also [16, 18]). One particularly beautiful question they posed is:
To be more precise, this problem can be stated in two different group-theoretic ways (the second being the formulation in [17, §2]). Let \(\{b_1,\ldots ,b_n\}\) denote a basis for \(\varLambda \) and let B denote the \(n\times n\) matrix whose i-th row is \(b_i\):
Alternatively, following [9] and [17, §2] we may suppose one is given a positive-definite symmetric matrix \(G\in SL(n,{\mathbb {Z}})\) (which we think of as the Gram matrix \(G=BB^t\) of \(\varLambda \)):
Clearly, Problem 1 reduces to Problem 2 with \(G=BB^t\). Conversely, one can orthogonally diagonalize the matrix G in Problem 2 as \(G=PDP^t\) for some \(P\in O(n)\) and diagonal matrix D with positive diagonal entries. Then \(B=PD^{1/2}\) solves the equation \(G=BB^t\), and Problem 2 therefore reduces to Problem 1 (modulo technicalities we will not delve into, such as that the entries of P, D, and B may in general be irrational).
In particular, by orthogonal diagonalization it is trivial to find a non-integral solution \(M\in GL(n,{\mathbb {R}})\) to Problem 2. However, imposing the constraint that \(M\in GL(n,{\mathbb {Z}})\) adds an intricate dose of number theory, since Problem 2a then becomes a class number problem: indeed, in large dimensions n there is a combinatorial explosion of possible \(GL(n,{\mathbb {Z}})\)-equivalence classes.Footnote 1
Both Problems 1 and 2 have inefficient solutions using sufficiently strong lattice basis reduction. For example, the given information is sufficient to determine whether or not all lattice vector norms are square-roots of integers, and an SVP solver can determine the shortest nonzero norm \(\lambda _1(\varLambda )\). If \(\lambda _1(\varLambda )\ne 1\), the lattice \(\varLambda \) is definitely not a rotation of \({\mathbb {Z}}^n\) and the answers to Problems 1a and 2a are negative. However, if one finds a vector of norm 1 and all lattice norms are square-roots of integers, it is then easy to see (by subtracting multiples of this vector to obtain an orthogonal complement) that the dimension in Problems 1b and 2b reduces from n to \(n-1\). It was recently shown in [13] that Problem 2a is in the class NP\(\cap \)co-NP, using results of Elkies [8] on characteristic vectors of lattices (see also [11, §9.6]).
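The dimension-reduction step just described can be made concrete at the level of Gram matrices: if the i-th basis vector has norm 1, subtracting integer multiples of it from the other basis vectors makes them orthogonal to it, and deleting row and column i of the Gram matrix yields the \((n-1)\)-dimensional problem. A minimal sketch (the helper name is ours):

```python
def peel_unit_vector(G, i):
    # G is an integral Gram matrix with G[i][i] == 1.  Replacing each
    # b_j (j != i) by b_j - G[j][i]*b_i makes it orthogonal to b_i; the
    # new pairwise inner products are G[j][k] - G[j][i]*G[i][k].
    assert G[i][i] == 1
    n = len(G)
    keep = [j for j in range(n) if j != i]
    return [[G[j][k] - G[j][i] * G[i][k] for k in keep] for j in keep]

# Example: the basis (1,0), (3,1) of Z^2 has Gram matrix [[1,3],[3,10]];
# peeling off the first (unit) vector leaves the Gram matrix [[1]] of Z^1.
assert peel_unit_vector([[1, 3], [3, 10]], 0) == [[1]]
```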
This paper primarily concerns Problem 2b, i.e., one is handed a matrix of the form \(MM^t\) and wishes to efficiently recover M. Of course permuting the columns of M does not change \(MM^t\), nor does multiplying any subset of columns by \(-1\); thus we look for solutions up to such signed permutations of the columns. (For this reason it is equivalent to insist that \(M\in SL(n,{\mathbb {Z}})\).) We find that the choice of procedure to randomly generate instances of M has a drastic impact on the difficulty of the problem. We state this in terms of a probability density function \(p:GL(n,{\mathbb {Z}})\rightarrow {\mathbb {R}}_{\ge 0}\) (i.e., \(\sum _{M\in GL(n,{\mathbb {Z}})}p(M)=1\)):
In Sect. 2 we compare various methods of generating random bases of a lattice, corresponding to different probability densities p (generalizing [3, §5.1.2]; see also Sect. 4). Here one seeks distributions for which Problem 3 is hard on average, much like SIS and LWE are average-case hard instances of variants of SVP and CVP, respectively. We then perform experiments on them in Sect. 3. Some of the methods we describe, such as the long-known Algorithm 4 (see, for example, [5]), give relatively hard instances of Problem 3. However, our main finding is that a certain well-known existing method, namely generating matrices by multiplying unipotents (e.g., Magma’s RandomSLnZ command), is cryptographically weak: we were able to recover M in instances in dimensions nearly 1500 (in some measurable ways these instances are comparable to NTRU lattices having purported 256-bit quantum cryptographic strength). That gives an example of an average-case easy distribution. In Sect. 4 we similarly find that the random basis generation method used in the DRS NIST Post-Quantum Cryptography submission [21] also gives weak instances of Problem 3: in 708 hours we could recover M generated using DRS’s 256-bit security settings.
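Before turning to the generation methods themselves, it is worth making the signed-permutation symmetry from the discussion of Problem 2b concrete: right-multiplying M by a signed permutation matrix P leaves \(MM^t\) unchanged, since \(PP^t=I_n\). A minimal sketch (all helper names are ours):

```python
import random

def mat_mul(A, B):
    # Plain integer matrix product.
    rows, inner, cols = len(A), len(B), len(B[0])
    return [[sum(A[i][k] * B[k][j] for k in range(inner)) for j in range(cols)]
            for i in range(rows)]

def transpose(A):
    return [list(col) for col in zip(*A)]

def gram(M):
    # Gram matrix G = M M^t.
    return mat_mul(M, transpose(M))

def signed_permutation(n, rng):
    # Random signed permutation matrix: exactly one +-1 entry per row and column.
    perm = list(range(n))
    rng.shuffle(perm)
    P = [[0] * n for _ in range(n)]
    for i, j in enumerate(perm):
        P[i][j] = rng.choice([1, -1])
    return P

rng = random.Random(0)
M = [[rng.randrange(-5, 6) for _ in range(4)] for _ in range(4)]
P = signed_permutation(4, rng)
# P P^t = I, so (M P)(M P)^t = M M^t: the Gram matrix cannot
# distinguish M from M P.
assert gram(mat_mul(M, P)) == gram(M)
```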
2 Choosing Random Elements of \(GL(n,{\mathbb {Z}})\)
We consider the problem of uniformly sampling matrices in a large boxFootnote 2
inside \(GL(n,{\mathbb {Z}})\). For large T one has \(\#\varGamma _T \sim c_n T^{n^2-n}\), for some positive constant \(c_n\).Footnote 3 We now consider a series of algorithms to sample matrices in \(GL(n,{\mathbb {Z}})\). The most naive way to uniformly sample \(\varGamma _T\) is prohibitively slow:
Though we do not analyze it here, the determinant of such a randomly chosen matrix M is a very large integer, and it is highly improbable that it equals \(\pm 1\) as required for membership in \(GL(n,{\mathbb {Z}})\). One minor improvement is to first check that the entries of each row (and likewise of each column) share no common factor, which is a necessary condition for the determinant to be \(\pm 1\). Nevertheless, this fails to seriously improve the extreme unlikelihood of randomly producing an integral matrix of determinant \(\pm 1\).
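The rarity of unimodular matrices among uniform samples is easy to observe empirically. The sketch below (a minimal illustration, not the paper's code) computes exact integer determinants by fraction-free Bareiss elimination and counts how often a uniformly sampled matrix from a small box has determinant \(\pm 1\):

```python
import random

def det_int(M):
    # Exact integer determinant via fraction-free (Bareiss) elimination.
    A = [row[:] for row in M]
    n, sign, prev = len(A), 1, 1
    for k in range(n - 1):
        if A[k][k] == 0:
            swap = next((i for i in range(k + 1, n) if A[i][k] != 0), None)
            if swap is None:
                return 0
            A[k], A[swap] = A[swap], A[k]
            sign = -sign
        for i in range(k + 1, n):
            for j in range(k + 1, n):
                # Bareiss guarantees this division is exact.
                A[i][j] = (A[i][j] * A[k][k] - A[i][k] * A[k][j]) // prev
        prev = A[k][k]
    return sign * A[-1][-1]

rng = random.Random(1)
T, n, trials = 10, 5, 2000
hits = sum(abs(det_int([[rng.randint(-T, T) for _ in range(n)]
                        for _ in range(n)])) == 1
           for _ in range(trials))
print(hits, "of", trials, "samples had determinant +-1")
```

Even in dimension 5 with entries in \([-10,10]\), determinant \(\pm 1\) essentially never occurs; the situation only worsens as n and T grow.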
We note that some computer algebra packages include commands for generating random elements of \(GL(n,{\mathbb {Z}})\). In addition to its command RandomSLnZ which we shall shortly come to in Algorithm 2, Magma’s documentation includes the command RandomUnimodularMatrix for fairly rapidly generating matrices in \(GL(n,{\mathbb {Z}})\) (not \(SL(n,{\mathbb {Z}})\) as the name indicates) having “most entries” inside a prescribed interval, but provides no further explanation. Even after accounting for a typo which switches the role of the command’s arguments, we found that in fact most of the entries were outside the prescribed interval (the documentation’s claims notwithstanding). Furthermore, the lattices constructed using this command appear to be much easier to attack than those generated by the closest analog considered here (Algorithm 4). SageMath’s random_matrix command has a unimodular constructor (designed for teaching purposes) which does produce matrices in \(GL(n,{\mathbb {Z}})\) whose entries are bounded by a given size, but it is not as fast as other alternatives and its outputs must satisfy further constraints. For these reasons we did not seriously examine RandomUnimodularMatrix and random_matrix.
Because Algorithm 1 is so slow, the rest of this section considers faster algorithms which do not uniformly sample \(\varGamma _T\), some coming closer than others.Footnote 4 For \(1\le i\ne j \le n\) let \(E_{i,j}\) denote the elementary \(n\times n\) matrix whose entries are all 0 aside from a 1 in the (i, j)-th position. Here as elsewhere the abbreviation “i.i.d.” stands for “independent and identically distributed”.
As we shall later see, the matrices produced by Algorithm 2 have a very special form, creating a cryptographic weakness.
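Although the formal statement of Algorithm 2 is not reproduced here, its mechanism of multiplying random unipotent generators \(I_n + xE_{i,j}\) (with \(i\ne j\) and \(0<|x|\le b\)) can be sketched as follows; Magma's RandomSLnZ follows the same pattern. The helper names are ours:

```python
import random
from fractions import Fraction

def det(M):
    # Exact determinant via Gaussian elimination over the rationals.
    A = [[Fraction(x) for x in row] for row in M]
    n, d = len(A), Fraction(1)
    for k in range(n):
        piv = next((i for i in range(k, n) if A[i][k] != 0), None)
        if piv is None:
            return Fraction(0)
        if piv != k:
            A[k], A[piv] = A[piv], A[k]
            d = -d
        d *= A[k][k]
        for i in range(k + 1, n):
            f = A[i][k] / A[k][k]
            A[i] = [a - f * b for a, b in zip(A[i], A[k])]
    return d

def random_unipotent_product(n, length, b, rng):
    # Multiply `length` random unipotent generators I + x*E_{i,j}
    # (i != j, 0 < |x| <= b).  Left-multiplying by such a factor just
    # adds x times row j to row i, so the factors are never formed
    # explicitly.  Each factor has determinant 1, hence so does M.
    M = [[int(i == j) for j in range(n)] for i in range(n)]
    for _ in range(length):
        i, j = rng.sample(range(n), 2)
        x = rng.choice([k for k in range(-b, b + 1) if k != 0])
        M[i] = [m + x * e for m, e in zip(M[i], M[j])]
    return M

rng = random.Random(2)
M = random_unipotent_product(8, 40, 1, rng)
assert det(M) == 1
```

Note that each factor touches a single row, so short products leave much of the matrix equal to the identity, foreshadowing the weakness discussed in Sect. 3.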
Algorithm 2 can be thought of as a counterpart to the LLL algorithm [15], which applies successive unipotent matrices and vector swaps to reduce lattices. Although Algorithm 2 does not literally contain vector swaps, they are nevertheless present in the background because conjugates of \(\gamma _j\) by permutation matrices have the same form \(I_n+xE_{i,j}\) as \(\gamma _k\). In that light, the following algorithm can then be thought of as an analog of BKZ reduction [23], since it utilizes block matrices of size much smaller than n. Its statement involves the embedding maps \(\varPhi _{k_1,\ldots ,k_d}:GL(d,{\mathbb {R}})\hookrightarrow GL(n,{\mathbb {R}})\) for size-d subsets \(\{k_1,\ldots ,k_d\}\subset \{1,\ldots ,n\}\),
where \(h=(h_{ij})\in GL(d,{\mathbb {R}})\).Footnote 5 The image of \(\varPhi _{k_1,\ldots ,k_d}\) is a subgroup of \(GL(n,{\mathbb {R}})\) isomorphic to \(GL(d,{\mathbb {R}})\). (Of course we will only apply the map \(\varPhi _{k_1,\ldots ,k_d}\) to elements of \(GL(d,{\mathbb {Z}})\).)
We expect Algorithm 3 produces more-uniformly distributed matrices as d increases. The role of the parameter d is essentially to interpolate between Algorithm 1 (which is the case \(d=n\)) and Algorithm 2 (which is close to the case \(d=2\), but not exactly: \(\gamma ^{(2)}\) need not be unipotent).
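A sketch of the embedding \(\varPhi _{k_1,\ldots ,k_d}\) and the resulting product is below. For illustration only, each \(d\times d\) block here is generated by a short product of determinant-one row operations rather than by uniform sampling from a box (which is what Algorithm 3 actually calls for, via Algorithm 1); all helper names are ours:

```python
import random

def mat_mul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def embed(h, idx, n):
    # The embedding Phi_{k_1,...,k_d}: copy the d x d block h into the
    # rows and columns indexed by idx; identity elsewhere.
    M = [[int(i == j) for j in range(n)] for i in range(n)]
    for a, i in enumerate(idx):
        for b, j in enumerate(idx):
            M[i][j] = h[a][b]
    return M

def algorithm3_style(n, d, ell, rng):
    # Multiply ell embedded d x d blocks at random index subsets.
    M = [[int(i == j) for j in range(n)] for i in range(n)]
    for _ in range(ell):
        # Illustrative d x d factor: a short product of det-1 row ops.
        h = [[int(i == j) for j in range(d)] for i in range(d)]
        for _ in range(3 * d):
            i, j = rng.sample(range(d), 2)
            s = rng.choice([-1, 1])
            h[i] = [a + s * b for a, b in zip(h[i], h[j])]
        idx = sorted(rng.sample(range(n), d))
        M = mat_mul(embed(h, idx, n), M)
    return M

rng = random.Random(3)
M = algorithm3_style(10, 3, 20, rng)
```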
Next we turn to the following method, which among the algorithms we considered seems the best at rapidly creating uniformly-distributed entries of matrices in \(GL(n,{\mathbb {Z}})\). This algorithm was originally suggested to us by Joseph Silverman in a slightly different form, in which more coprimality conditions needed to be checked. It relies on the fact that an integral \(n\times n\) matrix \(M =(m_{ij})\) lies in \(GL(n,{\mathbb {Z}})\) if and only if the n determinants of \((n-1)\times (n-1)\) minors
share no common factors.
Remarks on Algorithm 4: The n large integers in (2.4) are unlikely to share a common factor: for example, the most probable common factor is 2, which occurs only with probability \({\approx }2^{-n}\). The top row of M is chosen differently from the others, and its entries are typically larger than size T, because the Euclidean algorithm can produce large coefficients (the minors in (2.4) being themselves so enormous). Also, it is likely that the first two or three minors will already be coprime, and hence that most of the entries in \([m_{11}\,m_{12}\,\cdots \,m_{1n}]\) will vanish. The use of rounding and least-squares cuts down this size and further randomizes the top row, while keeping the determinant equal to one.
One could instead try a different method to find an integral combination of the bottom \(n-1\) rows closer to the initial guess for the top row. One extreme possibility involves appealing to the Closest Vector Problem (CVP) itself, which is thought to be very difficult. We found Algorithm 4 gave good randomness properties in that nearly all of the matrix is equidistributed, and it is fairly fast to execute. In comparison, we will see that using Algorithm 2 requires many matrix multiplications to achieve random entries of a similar size, which are not as well distributed anyhow.
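Based on the description above (bottom \(n-1\) rows uniform in \([-T,T]\); top row obtained from an extended-gcd combination of the signed cofactors in (2.4), then shrunk by rounded least squares), a small-dimension sketch of Algorithm 4 might look as follows. All helper names are ours; this is an illustration rather than the authors' implementation:

```python
import random
from fractions import Fraction

def det_frac(M):
    # Exact determinant via Gaussian elimination over the rationals.
    A = [[Fraction(x) for x in row] for row in M]
    n, d = len(A), Fraction(1)
    for k in range(n):
        piv = next((i for i in range(k, n) if A[i][k] != 0), None)
        if piv is None:
            return Fraction(0)
        if piv != k:
            A[k], A[piv] = A[piv], A[k]
            d = -d
        d *= A[k][k]
        for i in range(k + 1, n):
            f = A[i][k] / A[k][k]
            A[i] = [a - f * b for a, b in zip(A[i], A[k])]
    return d

def bezout(a, b):
    # s, t with a*s + b*t equal to a gcd of a and b.
    s0, s1, t0, t1 = 1, 0, 0, 1
    while b:
        q, a, b = a // b, b, a % b
        s0, s1 = s1, s0 - q * s1
        t0, t1 = t1, t0 - q * t1
    return s0, t0

def ext_gcd_list(nums):
    # Coefficients c with sum(c_i * nums_i) == gcd(nums) >= 0.
    g, coeffs = nums[0], [1]
    for x in nums[1:]:
        s, t = bezout(g, x)
        g, coeffs = g * s + x * t, [c * s for c in coeffs] + [t]
    if g < 0:
        g, coeffs = -g, [-c for c in coeffs]
    return g, coeffs

def solve_normal_equations(R, t):
    # Exact least-squares coefficients x with (R R^t) x = R t^T.
    m = len(R)
    G = [[Fraction(sum(a * b for a, b in zip(R[i], R[j])))
          for j in range(m)] for i in range(m)]
    v = [Fraction(sum(a * b for a, b in zip(row, t))) for row in R]
    for k in range(m):
        piv = next(i for i in range(k, m) if G[i][k] != 0)
        G[k], G[piv], v[k], v[piv] = G[piv], G[k], v[piv], v[k]
        inv = 1 / G[k][k]
        G[k], v[k] = [a * inv for a in G[k]], v[k] * inv
        for i in range(m):
            if i != k and G[i][k]:
                f = G[i][k]
                G[i] = [a - f * b for a, b in zip(G[i], G[k])]
                v[i] -= f * v[k]
    return v

def algorithm4_sketch(n, T, rng):
    while True:
        R = [[rng.randint(-T, T) for _ in range(n)] for _ in range(n - 1)]
        # Signed cofactors: for any top row t, det([t] + R) = sum_j t_j C_j.
        C = [int((-1) ** j *
                 det_frac([row[:j] + row[j + 1:] for row in R]))
             for j in range(n)]
        g, t = ext_gcd_list(C)
        if g == 1:  # retry in the (rare) event the minors share a factor
            break
    # Shrink the typically huge top row: subtract the rounded
    # least-squares combination of the bottom rows (determinant unchanged).
    for c, row in zip(solve_normal_equations(R, t), R):
        k = round(c)
        t = [a - k * b for a, b in zip(t, row)]
    return [t] + R

rng = random.Random(5)
M = algorithm4_sketch(5, 10, rng)
assert det_frac(M) == 1
```

In the paper's setting the bottom rows occupy the box (2.1) while the reduced top row is typically somewhat larger, matching the remarks above.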
The following algorithm is folklore and has appeared in various guises in many references (for example [5], which uses Gaussian sampling and has provable hardness guarantees,Footnote 6 though not necessarily for Problem 3). As we shall see just below, it shares some similarities with Algorithm 4.
A surprising connection between Algorithms 4 and 5: Even though Algorithms 4 and 5 appear to be very different, they are actually extremely similar (in fact, arguably nearly identical) in practice. Algorithms for Hermite Normal Form (such as HermiteDecomposition in Mathematica) proceed by building the matrix M directly out of the rows of B whenever possible. For example, it is frequently the case that the first \(n-1\) rows of U agree with those of the identity matrix \(I_n\), or at least differ only very slightly; in other words, the first \(n-1\) rows of B and M are expected to coincide or nearly coincide.Footnote 7 Also, the last row of M is an integral combination of the first n rows of B. In contrast with Algorithm 4 this last combination, however, is mainly determined by arithmetic considerations, and in particular depends on the n-th row of B; thus more random information is used than in Algorithm 4, which uses only \(n^2-n\) random integers instead of the \(n^2\) here.Footnote 8
To summarize, in fairly typical cases both Algorithms 4 and 5 populate the matrix M by first generating all but one row uniformly at random, and then using integral combinations to create a final row having relatively small entries. The practical distinction is essentially how this final row is created, which utilizes further random information in Algorithm 5 but not in Algorithm 4. The final row also appears to be typically smaller (that is, closer to fitting in the box defined in (2.1)) when using Algorithm 4 than when using Algorithm 5; consequently, we did not perform any experiments with Algorithm 5.
Note that the Hermite decomposition as stated above is not unique, since there are lower triangular matrices in \(GL(n,{\mathbb {Z}})\). Thus there can be no immediate guarantee on the entry sizes of M unless this ambiguity is resolved. Algorithm 5 can be thought of as a p-adic analog of the following method of producing random rotations in O(n): apply the Gram-Schmidt orthogonalization process to a matrix chosen according to a probability density function (e.g., Gaussian) which is invariant under multiplication by O(n).
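The \(O(n)\) analogy can be made concrete: the rows of an i.i.d. Gaussian matrix have a rotation-invariant joint density, so Gram-Schmidt orthonormalization of such a matrix produces a rotation whose distribution inherits that invariance. A minimal sketch:

```python
import math
import random

def gram_schmidt(B):
    # Orthonormalize the rows of B (assumed linearly independent).
    Q = []
    for v in B:
        w = list(v)
        for q in Q:
            c = sum(a * b for a, b in zip(w, q))
            w = [a - c * b for a, b in zip(w, q)]
        norm = math.sqrt(sum(a * a for a in w))
        Q.append([a / norm for a in w])
    return Q

rng = random.Random(7)
n = 5
# The Gaussian density is invariant under O(n), so the orthonormal
# matrix Q below is (numerically) a uniformly random rotation, up to
# the sign conventions fixed by Gram-Schmidt.
B = [[rng.gauss(0, 1) for _ in range(n)] for _ in range(n)]
Q = gram_schmidt(B)
```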
Remarks on an Algorithm in [22]: Igor Rivin makes the proposal in [22, §6.1] to generate matrices in \(GL(n,{\mathbb {Z}})\) by applying complete lattice basis reduction to a basis of \({\mathbb {R}}^n\) chosen inside a large ball. Let \(B\in GL(n,{\mathbb {R}})\) denote the \(n\times n\) matrix whose rows consist of this basis. Complete lattice reduction produces a random element \(\gamma \in GL(n,{\mathbb {Z}})\) of constrained size for which \(\gamma B\) lies in a fixed fundamental domain for \(GL(n,{\mathbb {Z}})\backslash GL(n,{\mathbb {R}})\).
This procedure is extremely slow, since complete lattice reduction is impractical in large dimensions. Rivin thus considers instead using weaker lattice basis reduction methods (such as LLL [15]) to speed this up, but at the cost of less-uniform distributions. For example, the results of LLL are thought to be skewed towards certain favored outputs avoiding “dark bases” [14]. Since our interest in generating random bases is to see how long incomplete lattice reduction takes on them, using lattice reduction to produce the basis itself is too slow for our purposes (hence we did not consider this algorithm in our experiments).
3 Experiments on Recognizing \({\mathbb {Z}}^n\)
In this section we report on attempts to solve Problem 2b on instances of matrices M generated using some of the algorithms from Sect. 2 for sampling \(GL(n,{\mathbb {Z}})\). We first note that Geissler and Smart [9] reported on attempts to solve Problem 2b on NTRU lattices using LLL [15] (as well as their own modification, for which they report up to a factor of four speedup), and concluded from lattice reduction heuristics that LLL itself is insufficient for NTRU instances with dimensions and matrix entry size far smaller than those considered in (3.2) below (see Appendix C). Nevertheless LLL performs fairly well on rotations of the \({\mathbb {Z}}^n\) lattice as compared to on a random lattice, which is not unexpected since the latter has shortest vector on the order of \(\sqrt{n}\) (as opposed to 1 for rotations of the \({\mathbb {Z}}^n\) lattice). Given that LLL typically outperforms its provable guarantees, it is not surprising it is fairly effective on Problem 2b.
Our main emphasis is that LLL and BKZ perform better on certain distributions with respect to Problem 2b than on others. Instead of LLL alone, we try the following:
We chose to use Magma’s built-in lattice basis reduction routines, partly because of slow running times with other implementations (such as fplll in SageMath) on matrices with very large integer entries. In step 2 one can of course continue further with block sizes larger than 5, but we fixed this as a stopping point in order to be systematic.
Our main finding is that Algorithm 2 in Sect. 2 (as implemented in Magma’s RandomSLnZ) is insecure for generating hard instances of Problem 2b. Algorithms 3, 4, and 5 fare much better. It is not surprising that Algorithm 5 (and the nearly-equivalent Algorithm 4) give harder instances, since there are provable guarantees attached to Algorithm 5 in a different context [5]; there is a serious difference between these and Algorithm 2 described below and in Appendices A and B.
3.1 Experiments with Algorithm 2 (Magma’s RandomSLnZ Command)
We begin with some comments on entropy and generating random products with a constrained number of bits. To mimic random elements of \(GL(n,{\mathbb {Z}})\), one may desire that the product matrix has as many nonzero entries as possible per random bit. For this reason, our experiments set the parameter \(b=1\) in Algorithm 2 in order to take longer products (thereby further increasing the number of nonzero entries of the matrix), while keeping the number of random bits constant. When the product length is less than n, one expects to have rows or columns of the product matrix which are unchanged by the successive matrix multiplications. (This is much less likely to be the case for the Gram matrices, however.)
Thus each random factor has at most a single nonzero off-diagonal entry, which is \(\pm 1 \). It is prohibitive to pack in as many random bits as the total number of entries this way, since multiplication of large matrices is slow. As an extreme example, as part of a comparison with the last row of (C.3) we generated a random matrix in \(GL(1486,{\mathbb {Z}})\) using products of length 55,000, again with \(b=1\). Generating the product alone took about half a day. Its row lengths were between \(2^{14}\) and \(2^{20}\) in size. For comparison, an NTRU matrix with similar row lengths (as in Table C.3) uses 8,173 random bits. The comparison with NTRU is made here simply because concrete bit-strengths have been asserted for NTRU lattices; this is why we took the particular values of n in (3.2) (see Appendix C for more details). One might hypothesize that having more random bits in the matrix makes solving Problem 2b more difficult, but as we shall see this in fact turns out to not always be the case: the structure of the matrix plays a very important role, and the product structure from Algorithm 2 seems to be a contributing weakness. In particular, the larger the value of the parameter b, the more unusual properties the product matrix possesses.
From the success of our trials one immediately sees the Lenstra-Silverberg Problem 2b is fairly easy for matrices M generated by Magma’s RandomSLnZ command. (Of course it is well known to be impossible to solve Problem 2b using LLL or BKZ with small block sizes on NTRU matrices of the comparable size listed in (3.2) and (C.3), or even those much smaller.)
3.2 Experiments with Algorithm 3 (Random \(GL(d,{\mathbb {Z}})\) Matrices)
Next we consider matrices generated by Algorithm 3 (random \(GL(d,{\mathbb {Z}})\)’s), and find that for small d they are also cryptographically weak for the Lenstra-Silverberg problem, but stronger than those generated by Algorithm 2. Furthermore, we see their strength increases with increasing d.
The tables in Appendix A list the outcomes of several experiments attacking instances of Problem 2b for matrices M generated by Algorithm 3. One sees the dramatic effect of the product length \(\ell \). For example, if \(\ell \) is too short there may be rows and columns of the matrix not touched by the individual multiplications by the embedded random \(d\times d\) matrices; if \(\ell \) is too long, the matrix entries become large and lattice basis reduction becomes difficult.
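The first effect is easy to quantify: an index missed by all \(\ell \) random d-element subsets leaves the corresponding row and column of the product equal to those of the identity, and the expected number of such indices is roughly \(n(1-d/n)^{\ell }\). A small counting sketch (an illustration, not the experimental code):

```python
import random

def untouched_indices(n, d, ell, rng):
    # Count indices missed by every one of the ell random d-subsets;
    # the corresponding rows and columns of the product agree with the
    # identity matrix.
    touched = set()
    for _ in range(ell):
        touched.update(rng.sample(range(n), d))
    return n - len(touched)

rng = random.Random(13)
# With ell = 50 blocks of size d = 2, at most 100 of the 200 indices
# can possibly be touched; with ell = 4000 essentially all are.
short = untouched_indices(200, 2, 50, rng)
long_ = untouched_indices(200, 2, 4000, rng)
print(short, long_)
```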
3.3 Experiments with Algorithm 4
Finally, we turn to the opposite extreme of random elements of \(GL(n,{\mathbb {Z}})\) generated by Algorithm 4, in which the bottom \(n-1\) rows are uniformly distributed among entries in the range \([-T,T]\). Here we were able to solve Problem 2b with instances having \(n=100\), even with entry sizes up to \(T=50\) (again, using the testing procedure in (3.1)). However, none of our experiments with \(n\ge 110\) were successful at all, even with \(T=1\) (i.e., all entries below the top row are \(-1\), 0, or 1). See the tables in Appendix B for more details.
4 Random Basis Generation in the DRS NIST Post-Quantum Cryptography Competition Submission
In [3, §5.1.2] some examples of methods for generating random lattice bases are described, which are closely related to Algorithms 2, 3, and 5. The authors reported that their experiments on those methods resulted in similar outcomes in practice. Our experiments, however, do show a difference (as explained in Sect. 3).
In this section we wish to make further comments about one method highlighted in [3], which is from the DRS NIST Post-Quantum competition submission [21, §2.2]. Random elements of \(GL(n,{\mathbb {Z}})\) there are constructed as products of length \(2R+1\) of the form
where \(P_1,\ldots ,P_{R+1}\) are chosen uniformly at random among permutation matrices in \(GL(n,{\mathbb {Z}})\) and \(\gamma _1,\ldots ,\gamma _R\) are elements in \(SL(n,{\mathbb {Z}})\) produced by the following random process. Let \(A_+\) and \(A_{-}\) denote the two fixed \(2\times 2\) integer matrices specified in [21, §2.2]. Then each \(\gamma _i\) is a block diagonal matrix with \(\frac{n}{2}\) \(2\times 2\) blocks chosen uniformly at random from \(\{A_+,A_{-}\}\). This construction has some similarities with Algorithm 3 for \(d=2\), but note that here many of the SL(2) matrices commute (being diagonal blocks of the same matrix). In fact, since \(A_+\) is conjugate to \(A_{-}\) by a signed permutation matrix, one may replace each \(\gamma _j\) with the block diagonal matrix
at the cost of allowing the \(P_i\)’s to be signed permutation matrices. Alternatively, by rearranging the permutation matrices and applying an extra rotation on the right, Problem 2b on matrices of the form (4.1) is equivalent to it on products of the form
in which each \(M_i\) is a conjugate of D by a random signed permutation matrix.
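A sketch of the product shape (4.1) is below. Since the specific \(2\times 2\) matrices \(A_{\pm }\) are given in [21] and not reproduced here, they are passed in as parameters; the unipotent blocks instantiated at the bottom are hypothetical stand-ins for illustration only:

```python
import random

def mat_mul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def random_permutation_matrix(n, rng):
    perm = list(range(n))
    rng.shuffle(perm)
    return [[int(j == perm[i]) for j in range(n)] for i in range(n)]

def block_diagonal(blocks):
    # Assemble 2 x 2 blocks along the diagonal of a 2*len(blocks) matrix.
    n = 2 * len(blocks)
    M = [[0] * n for _ in range(n)]
    for k, blk in enumerate(blocks):
        for i in range(2):
            for j in range(2):
                M[2 * k + i][2 * k + j] = blk[i][j]
    return M

def drs_style_product(n, R, A_plus, A_minus, rng):
    # P_1 * gamma_1 * P_2 * ... * gamma_R * P_{R+1} as in (4.1): each
    # gamma_i is block diagonal with n/2 blocks from {A_plus, A_minus}.
    M = random_permutation_matrix(n, rng)
    for _ in range(R):
        blocks = [rng.choice([A_plus, A_minus]) for _ in range(n // 2)]
        M = mat_mul(M, block_diagonal(blocks))
        M = mat_mul(M, random_permutation_matrix(n, rng))
    return M

# Placeholder unipotent blocks: stand-ins only, NOT the actual DRS
# matrices from [21].
A_plus, A_minus = [[1, 1], [0, 1]], [[1, -1], [0, 1]]
rng = random.Random(11)
M = drs_style_product(8, 4, A_plus, A_minus, rng)
```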
Since Algorithm 3 with \(d=2\) performed relatively weakly in the experiments of Sect. 3, we suspect Problem 2b is relatively easy to solve on matrices generated using (4.1) (as compared to those, say, generated using Algorithm 4). The experiments described below bear this out. (All of our remaining comments in this section pertain solely to (4.1) in the context of Problem 2b, and not to any other aspect of [21].)
The parameters listed in [21, §3.2] assert 128-bit security for their scheme when \((n,R)=(912,24)\), 192-bit security when \((n,R)=(1160,24)\), and 256-bit security when \((n,R)=(1518,24)\). Our main finding is that the testing procedure (3.1) was able to recover M chosen with the 256-bit security parameters in 708 hours of running time. We could also recover M chosen with the 192-bit security parameters in 222 hours of running time but (as we describe below) could not fully recover M with the 128-bit security parameters.
The testing procedure (3.1) also easily solves Problem 2b when n or R are smaller yet still relatively large. For example, it took roughly an hour to recover M from \(MM^t\) when \((n,R)=(180,24)\) using BKZ with block sizes up to 26. In Fig. 1 we show the results of several experiments for the parameter choice of \(n=912\) and increasing values of R up to the recommended choice of \(R=24\) for 128-bit security. The results were strikingly successful, in that each trial for \(R\le 22\) successfully recovered M from \(MM^t\) using only LLL (without requiring BKZ). We additionally tried \(R=23\) and nearly recovered M using LLL this way: the longest vector in the LLL output had length \(\sqrt{7}\), and subsequently applying BKZ reduction with block size 3 for less than five minutes then fully recovered M. However, we were unsuccessful in the \(R=24\) case suggested in [21].
Again, these results are only for Problem 2b applied to the random basis construction used in the DRS digital signature scheme [21]; nevertheless, this may indicate a weakness in the digital signature scheme as well. Somewhat counterintuitively, our experiments for fixed values of the product length parameter R sometimes fared better for larger values of n. For example, we were successful with \((n,R)=(912,22)\) despite not being successful for \((n,R)=(200,22)\), and we were successful with \((n,R)=(1160,24)\) and (1518, 24) despite not being successful for \((n,R)=(912,24)\). Our explanation is that as n grows it may become harder for the construction to randomly fill out the full matrix M (a similar phenomenon occurs in Algorithms 2 and 3 for small \(\ell \)). Indeed, matrices of the form (4.1) seem to have a very special form: Fig. 2 shows the entry sizes in \(MM^t\) have a banded structure.
5 Conclusions
We have considered the role of generating random elements in \(GL(n,{\mathbb {Z}})\) in the difficulty of lattice problems, and have found that it can have a profound influence. Concretely, Magma’s RandomSLnZ command (Algorithm 2) gives easy instances of Lenstra-Silverberg’s “Recognizing \({\mathbb {Z}}^n\) Decision” Problem 2b from (1.2). We were able to successfully attack lattices of dimension up to 1,486, which are in some measurable ways comparable to NTRU lattices having claimed 256-bit quantum security. On the other hand, using the apparently stronger methods of Algorithms 3 and 4 make Problem 2b much more difficult to solve (as expected).
We would thus recommend not using Algorithm 2 to generate random bases for cryptographic applications. We also recommend not using the random basis algorithm from the NIST Post-Quantum Competition submission DRS [21], because we were similarly able to solve Problem 2b on instances of its random basis generation method with its recommended parameters for 256-bit security.
We have not fully understood the weaknesses of these algorithms. It seems plausible that the failure to quickly fill out the matrix entries in a uniform way is at least partly to blame, since many do not get sufficiently randomized. The construction of Algorithm 2 in some sense reverses the steps of an LLL basis reduction, which might explain why LLL is particularly effective against it. More generally one might expect the block sizes in Algorithm 3 to be related to the block sizes in the BKZ algorithm. It is natural from this point of view to expect Algorithms 4 and 5 to be the strongest lattice basis generation algorithms considered in this paper, consistent with the results of our experiments.
Notes
- 1.
For example, the \(E_8\) lattice has a Gram matrix G in \(SL(8,{\mathbb {Z}})\), but is not isometric to the \({\mathbb {Z}}^8\) lattice. In general the number of \(GL(n,{\mathbb {Z}})\)-equivalence classes of such integral unimodular lattices grows faster than exponentially in n [6, Chapter 16].
- 2.
One can consider other shapes, such as balls; boxes are convenient for our applications and for making more concise statements. The same problem for \(SL(n,{\mathbb {Z}})\) is of course equivalent.
- 3.
- 4.
Unfortunately it is prohibitively complicated here to describe particular parameter choices matching the bound in (2.1).
- 5.
The role of \(GL(\cdot ,\cdot )\) as opposed to \(SL(\cdot ,\cdot )\) here is again purely cosmetic.
- 6.
- 7.
In our experiments, for example, the top \(n-2\) rows agreed most of the time for \(m=n\ge 10\).
- 8.
Note the order of magnitude of the set \(\varGamma _T\) from (2.1) is \(T^{n^2-n}\), naturally matching the \(n^2-n\) random integers picked in Algorithm 4.
References
Ajtai, M.: Generating hard instances of the short basis problem. In: Wiedermann, J., van Emde Boas, P., Nielsen, M. (eds.) ICALP 1999. LNCS, vol. 1644, pp. 1–9. Springer, Heidelberg (1999). https://doi.org/10.1007/3-540-48523-6_1
Alwen, J., Peikert, C.: Generating shorter bases for hard random lattices. Theory Comput. Syst. 48, 535–553 (2011)
Aono, Y., Espitau, T., Nguyen, P.Q.: Random lattices: theory and practice. Preprint. https://espitau.github.io/bin/random_lattice.pdf
Begelfor, E., Miller, S.D., Venkatesan, R.: Non-abelian analogs of lattice rounding. Groups Complex. Cryptol. 7(2), 117–133 (2015)
Cash, D., Hofheinz, D., Kiltz, E., Peikert, C.: Bonsai trees, or how to delegate a lattice basis. In: Gilbert, H. (ed.) EUROCRYPT 2010. LNCS, vol. 6110, pp. 523–552. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-13190-5_27
Conway, J.H., Sloane, N.J.A.: Sphere Packings, Lattices, and Groups. Grundlehren der mathematischen Wissenschaften, vol. 290, 3rd edn. Springer, New York (1999). https://doi.org/10.1007/978-1-4757-6568-7
Duke, W., Rudnick, Z., Sarnak, P.: Density of integer points on affine homogeneous varieties. Duke Math. J. 71, 143–179 (1993)
Elkies, N.D.: A characterization of the \(\mathbb{Z}^{n}\) lattice. Math. Res. Lett. 2, 321–326 (1995)
Geißler, K., Smart, N.P.: Computing the \(M=UU^t\) integer matrix decomposition. In: Paterson, K.G. (ed.) Cryptography and Coding 2003. LNCS, vol. 2898. Springer, Heidelberg (2003). https://doi.org/10.1007/978-3-540-40974-8_18
Gentry, C., Szydlo, M.: Cryptanalysis of the revised NTRU signature scheme. In: Knudsen, L.R. (ed.) EUROCRYPT 2002. LNCS, vol. 2332, pp. 299–320. Springer, Heidelberg (2002). https://doi.org/10.1007/3-540-46035-7_20. http://www.szydlo.com/ntru-revised-full02.pdf
Gerstein, L.: Basic Quadratic Forms. Graduate Studies in Mathematics, vol. 90. American Mathematical Society, Providence (2008)
Gorodnik, A., Nevo, A.: The Ergodic Theory of Lattice Subgroups. Annals of Mathematics Studies, vol. 172. Princeton University Press, Princeton (2010)
Hunkenschröder, C.: Deciding whether a lattice has an orthonormal basis is in co-NP. arXiv:1910.03838
Kim, S., Venkatesh, A.: The behavior of random reduced bases. Int. Math. Res. Notices 20, 6442–6480 (2018)
Lenstra, A.K., Lenstra Jr., H.W., Lovász, L.: Factoring polynomials with rational coefficients. Mathematische Annalen 261, 515–534 (1982)
Lenstra Jr., H.W., Silverberg, A.: Revisiting the Gentry-Szydlo algorithm. In: Garay, J.A., Gennaro, R. (eds.) CRYPTO 2014. LNCS, vol. 8616, pp. 280–296. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-662-44371-2_16
Lenstra Jr., H.W., Silverberg, A.: Lattices with symmetry. J. Cryptol. 30, 760–804 (2017). https://doi.org/10.1007/s00145-016-9235-7
Lenstra Jr., H.W., Silverberg, A.: Testing isomorphism of lattices over CM-orders. SIAM J. Comput. 48(4), 1300–1334 (2019)
Micciancio, D., Peikert, C.: Trapdoors for lattices: simpler, tighter, faster, smaller. In: Pointcheval, D., Johansson, T. (eds.) EUROCRYPT 2012. LNCS, vol. 7237, pp. 700–718. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-29011-4_41
Nguyen, P.Q., Stehlé, D.: An LLL algorithm with quadratic complexity. SIAM J. Comput. 39, 874–903 (2009)
Plantard, T., Sipasseuth, A., Dumondelle, C., Susilo, W.: DRS: Diagonal dominant Reduction for lattice-based Signature. NIST Post-Quantum Digital Signature Competition entry. https://csrc.nist.gov/Projects/post-quantum-cryptography/Round-1-Submissions
Rivin, I.: How to pick a random integer matrix? (and other questions). Math. Comp. 85, 783–797 (2016)
Schnorr, C.P.: A hierarchy of polynomial time lattice basis reduction algorithms. Theor. Comput. Sci. 53, 201–224 (1987)
Whyte, W., Wilson, L.: Quantum Safety In Certified Cryptographic Modules. https://icmconference.org/wp-content/uploads/A21c-Whyte.pdf
Acknowledgements
It is a pleasure to thank Huck Bennett, Leo Ducas, Nicholas Genise, Craig Gentry, Shai Halevi, Nadia Heninger, Jeff Hoffstein, Hendrik Lenstra, Amos Nevo, Phong Nguyen, Chris Peikert, Oded Regev, Ze’ev Rudnick, Alice Silverberg, Damien Stehlé, Noah Stephens-Davidowitz, and Berk Sunar for very helpful discussions. We are particularly indebted to Joe Silverman for kindly suggesting an earlier variant of Algorithm 4, which is very similar to the one we suggest here, and to Daniel J. Bernstein for important comments about the poor equidistribution provided by Algorithm 2. We are also grateful to Galen Collier of the Rutgers University Office of Advanced Research Computing for his assistance, and to the Simons Foundation for providing Rutgers University with Magma licenses.
Appendices
A Experiments with Algorithm 3 (Random \(GL(d,{\mathbb {Z}})\) Matrices)
Below we list tables of the experimental results mentioned in Sect. 3 on Algorithm 3, performed using the testing procedure (3.1).
n | d | T | \(\ell \) | shortest row length (in bits) | longest row length (in bits) | found M? |
---|---|---|---|---|---|---|
200 | 2 | 1 | 4000 | 6.03607 | 12.7988 | \(\times \) |
200 | 2 | 2 | 1500 | 1.29248 | 18.5329 | |
200 | 2 | 2 | 2000 | 7.86583 | 22.2151 | \(\times \) |
200 | 2 | 3 | 1000 | 0.5 | 27.0875 | \(\times \) |
200 | 2 | 3 | 2000 | 23.521 | 41.5678 | \(\times \) |
200 | 2 | 10 | 500 | 2.04373 | 38.7179 | |
200 | 2 | 10 | 700 | 7.943 | 49.0346 | \(\times \) |
200 | 3 | 1 | 1000 | 2.04373 | 11.3283 | |
200 | 3 | 1 | 1500 | 7.66619 | 17.1312 | \(\times \) |
200 | 3 | 1 | 2000 | 13.0661 | 20.8768 | \(\times \) |
200 | 3 | 2 | 500 | 3.27729 | 18.4087 | |
200 | 3 | 2 | 600 | 4.89232 | 24.111 | \(\times \) |
200 | 3 | 2 | 1000 | 13.0585 | 34.0625 | \(\times \) |
200 | 4 | 1 | 500 | 3.66096 | 12.2277 | |
200 | 4 | 2 | 300 | 0.5 | 24.2424 | |
200 | 4 | 2 | 400 | 1.79248 | 26.6452 | \(\times \) |
key: n = lattice dimension, d = size of smaller embedded matrices, T = bound on embedded matrix entries, \(\ell \) = length of the product of smaller matrices; a \(\times \) in the last column indicates M was not found, a blank entry that it was.
n | d | T | \(\ell \) | shortest row length (in bits) | longest row length (in bits) | found M? |
---|---|---|---|---|---|---|
500 | 2 | 1 | 4000 | 0 | 5.90085 | |
500 | 2 | 1 | 8000 | 3.41009 | 10.7467 | |
500 | 2 | 1 | 10000 | 7.08508 | 12.7447 | |
500 | 2 | 1 | 15000 | 12.6617 | 18.5326 | |
500 | 2 | 1 | 20000 | 18.0246 | 24.5732 | \(\times \) |
500 | 2 | 2 | 4000 | 4.21731 | 18.587 | |
500 | 2 | 2 | 6000 | 12.3467 | 28.7882 | \(\times \) |
500 | 2 | 2 | 8000 | 18.87 | 35.7267 | \(\times \) |
500 | 2 | 2 | 10000 | 28.5508 | 45.8028 | \(\times \) |
500 | 2 | 3 | 2000 | 0 | 19.0752 | |
500 | 2 | 3 | 3000 | 7.38752 | 32.9895 | |
500 | 2 | 3 | 4000 | 16.9325 | 40.9656 | \(\times \) |
500 | 2 | 10 | 1000 | 0 | 30.3755 | |
500 | 2 | 10 | 2000 | 11.9964 | 61.5006 | \(\times \) |
500 | 3 | 1 | 1000 | 0 | 5.39761 | |
500 | 3 | 1 | 2000 | 1.29248 | 9.164 | |
500 | 3 | 1 | 3000 | 2.37744 | 13.9903 | |
500 | 3 | 1 | 4000 | 8.43829 | 17.4593 | |
500 | 3 | 1 | 5000 | 14.1789 | 21.528 | |
500 | 3 | 1 | 6000 | 18.3878 | 25.2578 | \(\times \) |
500 | 3 | 1 | 7000 | 20.5646 | 29.287 | \(\times \) |
500 | 3 | 2 | 1000 | 0 | 15.551 | |
500 | 3 | 2 | 2000 | 3.24593 | 33.0945 | |
500 | 3 | 2 | 3000 | 23.5966 | 43.7986 | \(\times \) |
500 | 3 | 3 | 1000 | 0 | 28.1575 | |
500 | 3 | 3 | 2000 | 16.6455 | 53.1806 | \(\times \) |
500 | 3 | 3 | 3000 | 41.3371 | 83.9486 | \(\times \) |
500 | 4 | 1 | 1000 | 0 | 9.85319 | |
500 | 4 | 1 | 2000 | 8.11356 | 18.9434 | |
500 | 4 | 1 | 3000 | 19.1019 | 26.9836 | |
500 | 4 | 1 | 4000 | 24.4869 | 35.6328 | \(\times \) |
500 | 4 | 1 | 5000 | 26.6804 | 44.3982 | \(\times \) |
500 | 4 | 1 | 6000 | 40.5944 | 53.3654 | \(\times \) |
500 | 4 | 2 | 1000 | 6.29272 | 33.4373 | |
500 | 4 | 2 | 2000 | 33.6181 | 63.3469 | \(\times \) |
key: n = lattice dimension, d = size of smaller embedded matrices, T = bound on embedded matrix entries, \(\ell \) = length of the product of smaller matrices; a \(\times \) in the last column indicates M was not found, a blank entry that it was.
n | d | T | \(\ell \) | shortest row length (in bits) | longest row length (in bits) | found M? |
---|---|---|---|---|---|---|
886 | 2 | 1 | 3000 | 0 | 3.49434 | |
886 | 2 | 1 | 4000 | 0 | 3.80735 | |
886 | 2 | 1 | 5000 | 0 | 4.40207 | |
886 | 2 | 1 | 6000 | 0 | 5.30459 | |
886 | 2 | 1 | 7000 | 0 | 6.16923 | |
886 | 2 | 1 | 8000 | 0 | 6.90754 | |
886 | 2 | 1 | 9000 | 1 | 7.58371 | |
886 | 2 | 1 | 10000 | 2.37744 | 8.05954 | |
886 | 2 | 1 | 15000 | 5.46942 | 11.2176 | |
886 | 2 | 1 | 20000 | 8.6594 | 14.5837 | |
886 | 2 | 1 | 25000 | 10.884 | 18.035 | |
886 | 2 | 1 | 30000 | 15.0082 | 21.0333 | |
886 | 2 | 1 | 35000 | 17.6964 | 24.8408 | |
886 | 2 | 1 | 40000 | 20.7706 | 28.3888 | |
886 | 2 | 1 | 45000 | 24.484 | 30.6745 | |
886 | 2 | 1 | 50000 | 25.7401 | 34.0742 | \(\times \) |
key: n = lattice dimension, d = size of smaller embedded matrices, T = bound on embedded matrix entries, \(\ell \) = length of the product of smaller matrices; a \(\times \) in the last column indicates M was not found, a blank entry that it was.
Comments
Each sequence of experiments (for fixed values of n, d, and T) eventually fails when \(\ell \) is sufficiently large. For \(\ell \) too small the random product will not involve all the rows and columns of the matrix, meaning that the dimension of the lattice problem is effectively reduced to a smaller value of n, so the most interesting cases are for intermediate values of \(\ell \) (e.g., \(10000\le \ell \le 50000\) in this last table). There is some correlation between a successful trial and having a short vector in M (the fifth column), especially in the trials for \(n=200\). For \(n=500\) one sees more successful trials with longer shortest rows, especially as d (and to a lesser extent, T) increase. Note that each entry in these tables corresponds to a single experiment; we did not attempt to average over several experiments since we wanted to report on the range of the row lengths.
We did not take values of \(d>4\), since it is difficult to use Algorithm 1 to generate larger random elements of \(GL(d,{\mathbb {Z}})\).
The table for \(n=886\) is in some sense an elaboration of the middle entry of (3.2), the difference being that the latter uses unipotents (instead of embedded \(GL(2,{\mathbb {Z}})\) matrices).
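The generation step of Algorithm 3 described above (a product of \(\ell \) random \(GL(d,{\mathbb {Z}})\) matrices, each embedded at d randomly chosen coordinates of the \(n\times n\) identity) can be sketched as follows. This is our own minimal reconstruction, not the paper's code: `random_gl_d` is a crude stand-in for Algorithm 1 that bounds the entries of the elementary factors by T (a simplification; Algorithm 1 may instead bound the final entries), and the parameter `steps` is our choice.

```python
import random

def random_gl_d(d, T, steps=20):
    # Crude stand-in for Algorithm 1: a product of `steps` elementary
    # (unipotent) d x d matrices, with off-diagonal entries drawn from
    # [-T, T].  The result is unimodular by construction, although its
    # entries may exceed T.
    M = [[int(i == j) for j in range(d)] for i in range(d)]
    for _ in range(steps):
        i, j = random.sample(range(d), 2)
        x = random.randint(-T, T)
        M[i] = [a + x * b for a, b in zip(M[i], M[j])]  # row_i += x*row_j
    return M

def embed_and_multiply(n, d, T, ell):
    # Sketch of Algorithm 3: apply ell random GL(d,Z) matrices, each
    # acting on d randomly chosen coordinates of Z^n.
    B = [[int(i == j) for j in range(n)] for i in range(n)]
    for _ in range(ell):
        coords = random.sample(range(n), d)
        g = random_gl_d(d, T)
        rows = [B[c][:] for c in coords]
        for a, c in enumerate(coords):
            B[c] = [sum(g[a][b] * rows[b][k] for b in range(d))
                    for k in range(n)]
    return B
```

Since every factor is unimodular, the product is again in \(GL(n,{\mathbb {Z}})\); the experiments above then ask whether the testing procedure (3.1) succeeds on the resulting basis.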
B Experiments with Algorithm 4
Below we list tables of the experiments mentioned in Sect. 3 on Algorithm 4, performed using the testing procedure (3.1).
n | T | shortest row length (in bits) | longest row length (in bits) | found M? |
---|---|---|---|---|
100 | 1 | 2.91645 | 4.65757 | |
100 | 3 | 4.14501 | 5.81034 | |
100 | 4 | 4.50141 | 6.20496 | |
100 | 10 | 5.64183 | 7.15018 | |
100 | 50 | 7.99332 | 9.77546 | |
100 | 1 | 2.91645 | 4.65757 | |
110 | 1 | 2.98864 | 4.54902 | \(\times \) |
120 | 1 | 3.03304 | 4.77441 | \(\times \) |
125 | 1 | 3.09491 | 4.93979 | \(\times \) |
150 | 1 | 3.12396 | 5.09738 | \(\times \) |
200 | 1 | 3.42899 | 5.32597 | \(\times \) |
200 | 2 | 4.23584 | 6.42421 | \(\times \) |
200 | 3 | 4.72766 | 6.82899 | \(\times \) |
200 | 4 | 5.06529 | 7.41803 | \(\times \) |
key: n = lattice dimension, T = bound on matrix entries in bottom \(n-1\) rows; a \(\times \) in the last column indicates M was not found, a blank entry that it was.
Comments
In general, matrices in \(GL(n,{\mathbb {Z}})\) have determinant \(\pm 1\), which is very small relative to their entry size when the entries are large, so such matrices are already very close to singular matrices. However, the rank of those nearby singular matrices is important. The matrices produced by Algorithm 4 are perturbations of matrices having rank \(n-1\) (which is as large as possible for singular \(n\times n\) matrices). In contrast, one numerically sees that matrices produced by Algorithm 2 are instead nearly rank-one matrices (i.e., up to a small overall perturbation relative to the size of the entries). We expect that Algorithm 3's matrices, which are produced by taking products of random \(GL(d,{\mathbb {Z}})\) matrices, have intermediate behavior (but we have not systematically analyzed this).
A related fact is that matrices produced by Algorithm 2 frequently have a very large row or column (if b is sufficiently large), typically coming from the first or last factor in the matrix multiplication, respectively. That serves as a possible hint for recovering the spelling of the word in the random product, along the lines of the length-based attack in [4, §4]. However, we were unable to turn this into a direct, general attack. For example, it is unclear what to do when the value of \(x\in {\mathbb {Z}}\cap [-b,b]\) is small, say in the regime \(b\le \ell \). (The situation is clearer when b is extremely large relative to \(\ell \), in which case we expect a bias effect in random words similar to the underlying device used in [4, §4].)
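Both phenomena, the near-rank-one shape of Algorithm 2's output and the outsized rows contributed by extremal factors, are easy to observe numerically. The sketch below is our own illustration (not the paper's code) of an Algorithm-2-style product of random elementary matrices \(I + xE_{ij}\) with \(|x|\le b\); the parameter choices are arbitrary, kept small enough that the integer entries remain exactly representable in double precision.

```python
import random
import numpy as np

def unipotent_product(n, b, ell, seed=0):
    # Product of ell random elementary matrices I + x*E_{ij} (i != j),
    # with x drawn uniformly from [-b, b].
    rng = random.Random(seed)
    B = np.eye(n)
    for _ in range(ell):
        i, j = rng.sample(range(n), 2)
        x = rng.randint(-b, b)
        B[i] += x * B[j]  # left-multiplication by I + x*E_{ij}
    return B

B = unipotent_product(20, 10, 120)
s = np.linalg.svd(B, compute_uv=False)
row_norms = np.linalg.norm(B, axis=1)
# One typically observes a dominant top singular value (near-rank-one
# behavior) and a few rows far longer than the median row.
print("s[0]/s[1]:", s[0] / s[1])
print("max/median row norm:", row_norms.max() / np.median(row_norms))
```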
C A Reference Point for the Bit-Strength of Lattice Problems: NTRU
In this appendix we explain how we measured when the product lengths in Algorithms 2 and 3 were long enough to ensure that the Gram matrix entries have an appropriately large size. The security of lattices against LLL and BKZ is an active area in which no general consensus has been reached despite many competing suggestions, reflecting the notorious difficulty of the underlying problem.
One type of lattice for which bit strengths have been suggested is the NTRU lattice. We mention this as an attempt to quantify the notion that lattice problems in high dimensions are hard, as well as to provide a point of comparison, though there are of course many differences between NTRU lattices and rotations of the \({\mathbb {Z}}^n\) lattice (we say nothing about the security of NTRU itself).
NTRU matrices have the form
\[ \begin{pmatrix} I_{n/2} &{} X \\ 0 &{} q\,I_{n/2} \end{pmatrix}, \tag{C.1} \]
with n even, q an integer greater than one, and X randomly chosen from a certain distribution on integral circulant matrices.
The rows of an NTRU matrix span an “NTRU lattice” \(\varLambda \subset {{\mathbb {R}}}^n\). In [24] and in earlier NIST Post-Quantum Cryptography submissions, quantum bit-security levels are suggested for NTRU at various parameter settings.
These estimates are not directly relevant to the lattice bases we examine, which have different determinants and a very different structure. Nevertheless, they are consistent with the general expectation that lattice problems in dimensions 500 or more (and especially 1,000 or more) become cryptographically difficult.
The choice of length \(\ell \) in the experiments in (3.2) was determined as follows. The vector lengths of the rows in the NTRU matrix (C.1) are either roughly \(\sqrt{\frac{n}{2}}\frac{q}{2}\) (for the first n/2 rows), or exactly q (for the last n/2 rows). We took \(\ell \) large enough so that the resulting product had comparable row lengths, and made sure to use at least as many random bits as go into constructing an NTRU lattice (which is \(\frac{n}{2}\log _2(q)\)).
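For concreteness, the row lengths of (C.1) and the entropy count \(\frac{n}{2}\log _2(q)\) are simple to evaluate. The sketch below uses hypothetical parameters \(n=1024\), \(q=2048\), chosen purely for illustration (they are not parameters from this paper).

```python
import math

def ntru_row_lengths(n, q):
    # First n/2 rows of (C.1) have length roughly sqrt(n/2) * q/2;
    # the last n/2 rows have length exactly q.
    return math.sqrt(n / 2) * q / 2, q

def ntru_entropy_bits(n, q):
    # Number of random bits used to construct the lattice: (n/2) * log2(q).
    return (n / 2) * math.log2(q)

n, q = 1024, 2048  # hypothetical example parameters
top, bottom = ntru_row_lengths(n, q)
print("first-half row length: about 2^%.1f" % math.log2(top))
print("last-half row length:", bottom)
print("random bits consumed:", ntru_entropy_bits(n, q))
```

Matching \(\ell \) then amounts to taking the product long enough that its rows reach comparable bit-lengths while consuming at least this many random bits.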
Copyright information
© 2021 Springer Nature Switzerland AG
Cite this paper
Blanks, T.L., Miller, S.D. (2021). Generating Cryptographically-Strong Random Lattice Bases and Recognizing Rotations of \(\mathbb {Z}^n\). In: Cheon, J.H., Tillich, JP. (eds) Post-Quantum Cryptography. PQCrypto 2021. Lecture Notes in Computer Science(), vol 12841. Springer, Cham. https://doi.org/10.1007/978-3-030-81293-5_17