New Efficient Algorithms for Multiplication Over Fields of Characteristic Three

Cenk, Murat; Zadeh, Farhad Haghighi; Hasan, M. Anwar

doi:10.1007/s11265-017-1234-x

New Efficient Algorithms for Multiplication Over Fields of Characteristic Three

Published: 06 March 2017

Volume 90, pages 285–294, (2018)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Signal Processing Systems Aims and scope Submit manuscript

New Efficient Algorithms for Multiplication Over Fields of Characteristic Three

Download PDF

Murat Cenk¹,
Farhad Haghighi Zadeh² &
M. Anwar Hasan³

309 Accesses
2 Citations
Explore all metrics

Abstract

In this paper, we first present an enhancement of the well-known Karatsuba 2-way and 3-way algorithms for characteristic three fields, denoted by $\mathbb {F}_{3^{n}}$ where n≥1. We then derive a 3-way polynomial multiplication algorithm with five 1/3 sized multiplications that use interpolation in $\mathbb {F}_{9}$. Following the computation of the arithmetic and delay complexity of the proposed algorithm, we provide the results of our hardware implementation of polynomial multiplications over $\mathbb {F}_{3}$ and $\mathbb {F}_{9}$. The final proposal is a new 3-way polynomial multiplication algorithm over $\mathbb {F}_{3}$ that uses three polynomial multiplications of 1/3 of the original size over $\mathbb {F}_{3}$ and one polynomial multiplication of 1/3 of the original size over $\mathbb {F}_{9}$. We show that this algorithm represents about 15% reduction of the complexity over previous algorithms for the polynomial multiplications whose sizes are of practical interest.

Some new results on binary polynomial multiplication

Article 16 May 2015

Improved Polynomial Multiplication Algorithms over Characteristic Three Fields and Applications to NTRU Prime

Efficient Polynomial Multiplication via Modified Discrete Galois Transform and Negacyclic Convolution

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Multiplication in characteristic three fields, denoted by $\mathbb {F}_{3^{n}}$, where n≥1, is employed in curve-based cryptography. The use of these fields in elliptic curve cryptography has been discussed in [11, 14, 16, 17, 19, 21]. Examples of work related to efficient arithmetic in characteristic three fields can be found in [1–3, 20]. A common method for multiplication in $\mathbb {F}_{3^{n}}$ is to use polynomial basis representation, in which the elements of $\mathbb {F}_{3^{n}}$ are represented by polynomials of degree up to (n−1) over $\mathbb {F}_{3}$, the finite field with three elements. To perform multiplication in $\mathbb {F}_{3^{n}}$, the polynomials are first multiplied, and the result is then reduced modulo an irreducible polynomial of degree n over $\mathbb {F}_{3}$. The arithmetic cost of the reduction step is linear in input size; on the other hand, the polynomial multiplication step requires a sub-quadratic complexity. The multiplication step is thus more costly than the reduction step. As a result, reducing the cost of polynomial multiplication over $\mathbb {F}_{3}$ directly affects the cost of multiplication in $\mathbb {F}_{3^{n}}$.

Our Contributions

For practical cryptographic applications, polynomial multiplication schemes with low arithmetic complexity are essentially Karatsuba-like algorithms. For example, recursive uses of 2-way and 3-way algorithms have a total arithmetic complexity of 7n ^1.58−8n+2 and 6.8n ^1.63−8n+2.2, respectively. In this paper, after introducing an improved version of the 2-way and 3-way algorithms, we propose a 3-way polynomial multiplication algorithm with five multiplications using interpolation in $\mathbb {F}_{9}$. In contrast to the formula presented in [13], we show that the recursive use of the algorithm yields an arithmetic complexity of $15n^{1.46}-4.85n\log _{3}n-14n$. In addition, the time delay complexity that are useful when the algorithm is mapped on to bit parallel hardware are also derived. The final proposal is a new 3-way algorithm for multiplication of polynomials over $\mathbb {F}_{3}$ which uses three multiplications of polynomials of 1/3 of the input size over $\mathbb {F}_{3}$ and one multiplication of polynomial of 1/3 of the input size over $\mathbb {F}_{9}$ with a total complexity less than 15n ^1.46 which is, to our knowledge, superior arithmetic complexity than that available with previously known algorithms.

Organization of the Paper

The remainder of the paper is organized as follows. Notations and preliminaries used in the rest of the paper are provided in Section 2. Relevant known algorithms along with our suggestions for their improvements are presented in Section 3. The next section introduces the proposed 3-way algorithm with five 1/3 sized polynomial multiplications over $\mathbb {F}_{3}$ and $\mathbb {F}_{9}$. Section 5 reports the results of our hardware implementation. Further improvements are discussed in Section 6 and concluding remarks are made in the final section.

2 Notations and Preliminaries

This section explains the notations used in the remainder of the paper and describes a few basic algorithms. Unless otherwise stated, the fields employed in the work are assumed to be of characteristic three. The following notations are used:

M _3,⊕(n): number of $\mathbb {F}_{3}$ additions (or subtractions) required for the multiplication of two degree n−1 polynomials over $\mathbb {F}_{3}$.
M _3,⊗(n): number of $\mathbb {F}_{3}$ multiplications required for the multiplication of two degree n−1 polynomials over $\mathbb {F}_{3}$.
M ₃(n): number of total $\mathbb {F}_{3}$ operations required for the multiplication of two degree n−1 polynomials over $\mathbb {F}_{3}$, i.e., M ₃(n) = M _3,⊕(n) + M _3,⊗(n).
M _9,⊕(n): number of $\mathbb {F}_{3}$ additions (or subtractions) required for the multiplication of two degree n−1 polynomials over $\mathbb {F}_{9}$.
M _9,⊗(n): number of $\mathbb {F}_{3}$ multiplications required for the multiplication of two degree n−1 polynomials over $\mathbb {F}_{9}$.
M ₉(n): number of total $\mathbb {F}_{3}$ operations required for the multiplication of two degree n−1 polynomials over $\mathbb {F}_{9}$, i.e., M ₉(n) = M _9,⊕(n) + M _9,⊗(n).
D ₃(n): delay complexity associated with the multiplication of two degree n−1 polynomials over $\mathbb {F}_{3}$.
D ₉(n): delay complexity associated with the multiplication of two degree n−1 polynomials over $\mathbb {F}_{9}$.
D _⊕: latency of an $\mathbb {F}_{3}$ addition (or subtraction).
D _⊗: latency of an $\mathbb {F}_{3}$ multiplication.

We represent the elements of $\mathbb {F}_{3^{n}}$ as polynomials over $\mathbb {F}_{3}$ with a degree less than n. Moreover, we construct $\mathbb {F}_{9}\cong \mathbb {F}_{3}[X]/(X^{2}+1)$ and assume that ω ²+1=0, where $\omega \in \mathbb {F}_{9}.$

A further assumption is that multiplication by −1 of a polynomial is cost free and that addition and subtraction have identical complexity. It should also be noted that the cost of the multiplication in $\mathbb {F}_{9}$ can be assumed to be equivalent to four multiplications and two additions in $\mathbb {F}_{3}$, based on the following formula:

$$(a+b\omega)(c+d\omega)=ac-bd+(bc+ad)\omega.$$

In addition, no cost is incurred for the multiplication of an element in $\mathbb {F}_{9}$ by ω since (a + b ω)ω=−b + a ω.

Throughout the paper we make use of the solution to the following recurrence equation.

Lemma 1

Let a,b,ℓ be positive integers, n=b ^ℓ , a≠1, and

$$M(n)=aM(n/b)+cn+d+fn^{\delta}, \;\; M(1)=e$$

(i)
If a≠b and f=0 then the solution of M(n) is
$$M(n)=\left( e+{\frac{bc}{a-b}}+{\frac{d}{a-1}} \right)n^{\log_{b}a}-{\frac{bc}{a-b}} n+{\frac{d}{a-1}}.$$
(ii)
If a=b then the solution of M(n) is
$$M(n)={\frac{fb^{\delta}}{b^{\delta}-a}}n^{\delta}+\left( e-{\frac{fb^{\delta}}{b^{\delta}-a}}+{\frac{d}{a-1}} \right)n+cn\log_{b}n-{\frac{d}{a-1}}.$$
(iii)
If a≠b then the solution of M(n) is
$$\begin{array}{@{}rcl@{}} M(n)&=&{\frac{fb^{\delta}}{b^{\delta}-a}}n^{\delta}+\left( e+{\frac{bc}{a-b}}{~}-{\frac{fb^{\delta}}{b^{\delta}-a}}{~}+\frac{d}{a-1} \right)n^{\log_{b}a}\\ &&-\left( {\frac{bc}{a-b}}\right)n-{~}{\frac{d}{a-1}}. \end{array} $$

Proof

Proofs of (i) and (ii) are in [10] and [7]. For (iii), we substitute the value of M(n/b) into M(n). Then, we have

$$M(n)=a(aM(n/b^{2})+cn/b+d+fn^{\delta}/b^{\delta})+cn+d+fn^{\delta}.$$

This equation yields

$$M(n)=a^{2}M(n/b^{2})+(cn+acn/b)+(d+ad)+(fn^{\delta}+afn^{\delta}/b^{\delta}).$$

When we substitute the value of M(n/b ²) into the last equation and continue this process, we obtain

$$\begin{array}{@{}rcl@{}} M(n)&=&a^{\ell} M(1)+cn(1+a/b+\ldots+(a/b)^{\ell-1})\\ &&+d(1+a+\ldots+a^{\ell-1})+fn^{\delta}(1+(a/b^{\delta})+\ldots\\ &&+(a/b^{\delta})^{\ell-1}). \end{array} $$

After computing the expressions in the parenthesis and using $a^{\ell }=a^{\log _{b}n}=n^{\log _{b}a}$, we get

$$\begin{array}{@{}rcl@{}} M(n)\!&=&\!\frac{fb^{\delta}}{b^{\delta}-a}n^{\delta}\,+\,\left( e\,+\,\frac{bc}{a-b\!}-\!\frac{fb^{\delta}}{b^{\delta}-a}+\frac{d}{a-1} \right)n^{\log_{b}a}\\ &&-\left( \frac{bc}{a-b}\right)n-\frac{d}{a-1}. \end{array} $$

□

3 Known Algorithms and their Improvements

This section presents Karatsuba 2-way and 3-way algorithms for characteristic three fields. Following the work in [4] and [23] for improving the corresponding algorithms in characteristic two, we introduce improvements for characteristic three.

Remark 1

To apply recursive 2-way and 3-way algorithms, the polynomials are split in two and three parts, respectively. If n is not divisible by two or three, we pad the polynomial with one or two zeros so that the sizes become divisible by two or three. This adjustment has a negligible effect on the complexity.

3.1 Karatsuba 2-Way Algorithm

Let $A={\sum }_{i=0}^{n-1} a_{i} X^{i}$ and $B={\sum }_{i=0}^{n-1} b_{i} X^{i}$. We can divide A and B into two parts as follows: A(X) = A ₀ + A ₁ X ^n/2 and B(X) = B ₀ + B ₁ X ^n/2 where A _i and B _i are polynomials of degree less than n/2. Let $C={\sum }_{i=0}^{2} C_{i} X^{ni/2}$ be the product of A and B. The Karatsuba 2-way algorithm [12, 15] is the following:

$$ \left\{\begin{array}{l} P_{0}=A_{0}B_{0},\;\; P_{1}=(A_{0}+A_{1})(B_{0}+B_{1}),\;\;P_{2}=A_{1}B_{1}, \\ C=P_{0}+(P_{1}-P_{0}-P_{2})X^{n/2}+P_{2}X^{n}. \end{array} \right. $$

(1)

The algorithm given in Eq. 1 requires three multiplications of two degree n/2−1 polynomials plus 4n−4 additions. On the other hand, the delay complexity of the algorithm is D ₃(n/2)+3D _⊕. The recursive use of this algorithm is thus associated with the following complexities:

$$ \left\{ \begin{array}{lcl} M_{3,\otimes}(n)& \leq &3M_{3,\otimes}(n/2), \, M_{3,\otimes}(1)=1, \\ M_{3,\oplus}(n)&\leq &3M_{3,\oplus}(n/2)+4n-4, \, M_{3,\oplus}(1)=0,\\ M_{3}(n)&\leq &3M_{3}(n/2)+4n-4, \, M_{3}(1)=1,\\ D_{3}(n)&\leq &D_{3}(n/2)+3D_{\oplus}, D_{3}(1)=D_{\otimes} . \end{array} \right. $$

(2)

Using Lemma 1, we obtain the following bounds:

$$ \left\{ \begin{array}{lcl} M_{3,\otimes}(n)&\leq &n^{\log_{2}3}, \\ M_{3,\oplus}(n)&\leq &6n^{\log_{2}3}-8n+2,\\ M_{3}(n)&\leq &7n^{\log_{2}3}-8n+2,\\ D_{3}(n)&\leq &3(\log_{2}n)D_{\oplus}+D_{\otimes}. \end{array} \right. $$

(3)

3.2 Improved Karatsuba 2-Way Algorithm

Using the algorithm given in [4] enables us to reconstruct part of the algorithm given in Eq. 1 as follows:

$$ \left\{\begin{array}{l} P_{0}=A_{0}B_{0},\;\; P_{1}=(A_{0}+A_{1})(B_{0}+B_{1}),\;\;P_{2}=A_{2}B_{2}, \\ C=(X^{n/2}-1)(X^{n/2}P_{2}-P_{0})+P_{1}X^{n/2}. \end{array}\right. $$

(4)

The algorithm given in Eq. 4 requires three multiplications of two degree n/2−1 polynomials plus 7n/2−3 additions. The delay complexity of the algorithm is D ₃(n/2)+3D _⊕. Therefore, using this algorithm recursively yields the following complexity:

$$ \left\{ \begin{array}{lcl} M_{3,\otimes}(n)&\leq&3M_{3,\otimes}(n/2), \, M_{3,\otimes}(1)=1, \\ M_{3,\oplus}(n)&\leq&3M_{3,\oplus}(n/2)+7n/2-3, \, M_{3,\oplus}(1)=0,\\ M_{3}(n)&\leq&3M_{3}(n/2)+7n/2-3, \, M_{3}(n)=1,\\ D_{3}(n)&\leq&D_{3}(n/2)+3D_{\oplus}, D_{3}(1)=D_{\otimes}. \end{array} \right. $$

(5)

Using Lemma 1 gives the following bounds:

$$ \left\{ \begin{array}{lcl} M_{3,\otimes}(n)&\leq& n^{\log_{2}3}, \\ M_{3,\oplus}(n)&\leq& 5.5n^{\log_{2}3}-7n+1.5,\\ M_{3}(n)&\leq&6.5n^{\log_{2}3}-7n+1.5,\\ D_{3}(n)&\leq& 3(\log_{2}n)D_{\oplus}+D_{\otimes}. \end{array} \right. $$

(6)

Compared to Eq. 3, using Eq. 6 results in about 7% reduction in the number of arithmetic operations. The delay complexities of Eqs. 3 and 6 are the same.

3.3 Karatsuba Like 3-Way Algorithm

As before, let $A={\sum }_{i=0}^{n-1} a_{i} X^{i}$ and $B={\sum }_{i=0}^{n-1} b_{i} X^{i}$. This time we divide A and B into three parts as follows: A(X) = A ₀ + A ₁ X ^n/3 + A ₂ X ^2n/3 and B(X) = B ₀ + B ₁ X ^n/3 + B ₂ X ^2n/3 where A _i and B _i are polynomials of degree less than n/3. To compute the product C = A B, a Karatsuba-like 3-way algorithm, which can be obtained using the Chinese remainder theorem [22] and from [18], can be expressed as follows:

$$ \left\{\begin{array}{l} P_{0}=A_{0}B_{0},\;\; P_{1}\,=\,A_{1}B_{1},\;\;P_{2}\,=\,A_{2}B_{2},\;\;P_{3}\,=\,(A_{0}+\!A_{1})(B_{0}+B_{1}),\\[-.5pt] P_{4}=(A_{0}+A_{2})(B_{0}+B_{2}),\;\;P_{5}=(A_{1}+A_{2})(B_{1}+B_{2}).\\[-.5pt] C=P_{0}+(P_{3}-P_{0}-P_{1})X^{n/3}+(P_{4}+P_{1}-P_{0}-P_{2})X^{2n/3}+\\[-.5pt] (P_{5}-P_{1}-P_{2})X^{3n/3}+P_{2}X^{4n/3}. \end{array}\right. $$

(7)

The algorithm given in Eq. 7 requires six multiplications of two degree n/3−1 polynomials plus 2n additions for P _i’s, 14n/3−7 additions for the coefficients of C and 4n/3−4 additions for overlaps. On the other hand, the delay complexity of the algorithm is D ₃(n/3)+4D _⊕. The recursive use of this algorithm is therefore associated with the following complexity:

$$ \left\{ \begin{array}{lcl} M_{3,\otimes}(n)&\leq&6M_{3,\otimes}(n/3), \, M_{3,\otimes}(1)=1, \\[-.5pt] M_{3,\oplus}(n)&\leq&6M_{3,\oplus}(n/3)+8n-11, \, M_{3,\oplus}(1)=0,\\ M_{3}(n)&\leq&6M_{3}(n/3)+8n-11, \, M_{3}(1)=1,\\[-.5pt] D_{3}(n)&\leq&D_{3}(n/3)+4D_{\oplus}, D_{3}(1)=D_{\otimes}. \end{array} \right. $$

(8)

Applying Lemma 1 gives the following bounds:

$$ \left\{ \begin{array}{lcl} M_{3,\otimes}(n)&\leq& n^{\log_{3}6}, \\[-.5pt] M_{3,\oplus}(n)&\leq&5.8n^{\log_{3}6}-8n+2.2,\\[-.5pt] M_{3}(n)&\leq&6.8n^{\log_{3}6}-8n+2.2,\\[-.5pt] D_{3}(n)&\leq&4(\log_{3})nD_{\oplus}+D_{\otimes}. \end{array} \right. $$

(9)

3.4 Improved Karatsuba Like 3-Way Algorithm

This section presents our redesign of the reconstruction part of the algorithm in Eq. 7 with the use of a technique similar to that reported in [6, 23]. It should be noted that the degree of P _i products for 0≤i≤5 is 2n/3−2. We divide each P _i into two parts as P _i = P _{i
L} + x ^n/3 P _{i
H} where P _{i
L} is a degree n/3−1 polynomial and P _{i
H} is a degree n/3−2 polynomial. Substituting those representations of the products into the reconstruction part of Eq. 7 gives the following:

$$\begin{array}{@{}rcl@{}} C&=&P_{0L}+X^{n/3}(P_{0H}-P_{0L}-P_{1L}+P_{3L})+X^{2n/3}\\ &&\times(-P_{0L}-P_{0H}+P_{1L}-P_{1H}-P_{2L}+P_{3H}+P_{4L})\\ &&+X^{3n/3}(-P_{0H}+P_{1H}-P_{2L}-P_{2H}+P_{4H}+P_{5L}-P_{1L})\\ &&+X^{4n/3}(-P_{1H}+P_{2L}-P_{2H}+P_{5H})+X^{5n/3}P_{2H}. \end{array} $$

(10)

It should be noted that Eq. 10 contains no overlaps so that we compute only the cost of the coefficients. The algorithm can be improved through the observation of some common terms R ₁ = P _0H−P _1L and R ₂ = P _1H−P _2L in Eq. 10. We then have,

$$\begin{array}{@{}rcl@{}} C&=&P_{0L}+X^{n/3}(R_{1}-P_{0L}+P_{3L})+X^{2n/3}\\ &&\times(-R_{1}-P_{0L}-P_{1H}-P_{2L}+P_{3H}+P_{4L})\\ &&+X^{3n/3}(R_{2}-P_{0H}-P_{1L}-P_{2H}+P_{4H}+P_{5L})\\ &&+X^{4n/3}(-R_{2}-P_{2H}+P_{5H})+X^{5n/3}P_{2H}. \end{array} $$

(11)

The number of additions required in Eq. 11 is computed as follows: Computing (A ₀ + A ₁),(B ₀ + B ₁),(A ₀ + A ₂),(B ₀ + B ₂),(A ₁ + A ₂) and (B ₁ + B ₂) requires 2n additions. Computing R ₁ and R ₂ requires n/3−1 additions each. On the other hand, we need 2n/3 additions for (R ₁−P _0L + P _3L), 5n/3−2 additions for −R ₁−P _0L−P _1H−P _2L + P _3H + P _4L, 5n/3−3 additions for R ₂−P _0H−P _1L−P _2H + P _4H + P _5L and 2n/3−2 additions for −R ₂−P _2H + P _5H. The delay complexity of the algorithm is the same as that of the previous one. The recursive use of this algorithm results in the following complexity:

$$ \left\{ \begin{array}{lcl} M_{3,\otimes}(n)&\leq&6M_{3,\otimes}(n/3), \, M_{3,\otimes}=1, \\ M_{3,\oplus}(n)&\leq&6M_{3,\oplus}(n/3)+22n/3-9, \, M_{3,\oplus}(1)=0,\\ M_{3}(n)&\leq&6M_{3}(n/3)+22n/3-9, \, M_{3}(n)=1,\\ D_{3}(n)&\leq&D_{3}(n/3)+4D_{\oplus}, \, D_{3}(1)=D_{\otimes}. \end{array} \right. $$

(12)

Applying Lemma 1 gives the following solutions:

$$ \left\{ \begin{array}{lcl} M_{3,\otimes}(n)&\leq&n^{\log_{3}6}, \\ M_{3,\oplus}(n)&\leq&5.53n^{\log_{3}6}-7.33n+1.8,\\ M_{3}(n)&\leq&6.53n^{\log_{3}6}-7.33n+1.8,\\ D_{3}(n)&\leq& 4(\log_{3}n)D_{\oplus}+D_{\otimes}. \end{array} \right. $$

(13)

The results in Eq. 13 represents a reduction of the complexity of Eq. 9 by approximately 4%. The delay complexities are the same.

4 Proposed 3-Way Algorithm with Five Multiplications

In this section, we introduce a 3-way algorithm for multiplying polynomials of degree n−1 over $\mathbb {F}_{3}$ with five multiplications using interpolation in $\mathbb {F}_{9}$. Let $A={\sum }_{i=0}^{n-1} a_{i} X^{i}$ and $B={\sum }_{i=0}^{n-1} b_{i} X^{i}$. We can divide A and B into three parts as follows: A(X) = A ₀ + A ₁ X ^n/3 + A ₂ X ^2n/3 and B(X) = B ₀ + B ₁ X ^n/3 + B ₂ X ^2n/3 where A _i and B _i are polynomials of degree less than n/3. Let $C={\sum }_{i=0}^{4} C_{i} X^{in/3}$ be the product of A and B. Recall that $\mathbb {F}_{9}=\mathbb {F}_{3}[X]/(X^{2}+1)$ and ω is a root of X ²+1=0 in $\mathbb {F}_{9}$.

To obtain an algorithm for the product C = A B with five multiplications, we use the interpolation method which yields Toom-Cook like formulas [5, 7–9, 13, 22]. Since $\mathbb {F}_{3}$ has insufficient points for the interpolation method, we use an element from $\mathbb {F}_{9}$, i.e., we use the points $0,1,2,\infty $ and ω as evaluation points. Evaluation of A B = C at those points then gives us the following system of linear equations in $\mathbb {F}_{9}$:

$$\begin{array}{@{}rcl@{}} &&{}\text{Evaluation at}~X=0 \Longrightarrow P_{0}=A_{0}B_{0}=C_{0} \\ &&{}\text{Evaluation at}~X=1 \Longrightarrow P_{1}=(A_{0}+A_{1}+A_{2})\\ &&\times(B_{0}+B_{1}+B_{2})=C_{0}+C_{1}+{\cdots} +C_{4} \\ &&{}\text{Evaluation at}~X=-1 \Longrightarrow P_{2}=(A_{0}-A_{1}+A_{2})\\ &&\times(B_{0}-B_{1}+B_{2})=C_{0}-C_{1}+{\cdots} +C_{4} \\ &&{}\text{Evaluation at}~X=\omega \Longrightarrow P_{3}=(A_{0}+A_{1}\omega-A_{2})\\ &&\times(B_{0}+B_{1}\omega-B_{2})=C_{0}+C_{1}\omega-{\cdots} +C_{4} \\ &&{}\text{Evaluation at}~X=\infty \Longrightarrow P_{4}=A_{2}B_{2}=C_{4}. \end{array} $$

Solving this system of linear equations yields the following algorithm for computing the product C = A B:

$$ \left\{\begin{array}{l} C_{0}=P_{0},\\ C_{1}=(P_{1}-P_{2})-(-P_{0}+P_{1}+P_{2}-P_{3}-P_{4})\omega,\\ C_{2}=-(P_{0}+P_{1}+P_{2}+P_{4}),\\ C_{3}=(P_{1}-P_{2})+(-P_{0}+P_{1}+P_{2}-P_{3}-P_{4})\omega,\\ C_{4}=P_{4}. \end{array}\right. $$

(14)

We now compute the cost of the recursive use of this algorithm. Assume that A and B are degree n−1 polynomials. A ₀,A ₁,A ₂,B ₀,B ₁ and B ₂ are therefore degree (n/3−1) polynomials. The cost associated with the operations are listed in Table 1.

Table 1 Cost of multi-evaluation and reconstruction for the new three-way split formulas.

Full size table

Remark 2

For the $\mathbb {F}_{3}$ computations indicated in Table 1, the cost of both U ₄ and U ₅ is zero since these are related to the ω-free part of the results .

To compute the delay complexity for the multiplication in $\mathbb {F}_{9}[X]$, we have drawn the multi-evaluation and reconstruction data flow shown in Fig. 1. As can be seen from the figure, the critical path for the evaluation requires two additions. It begins at A ₀ and ends at R ₂. On the other hand, the critical path for the reconstruction needs four additions that starts from P ₁ and continues through U ₂, U ₄, and U ₅, ending at C ₁. It should be noted that multiplication by ω is cost free and not counted in the complexity analysis. The last consideration is the requirement for one addition in the final overlap, so we obtain the complexity in Eq. 15.

$$ \left\{ \begin{array}{lll} M_{9,\otimes}(n)&\leq&5M_{9,\otimes}(n/3), \, M_{9,\otimes}(1)=4, \\ M_{9,\oplus}(n)&\leq&5M_{9,\oplus}(n/3)+20n-24, \, M_{9,\oplus}(1)=2,\\ M_{9}(n)&\leq&5M_{9}(n/3)+20n-24, \, M_{9}(1)=6,\\ D_{9}(n)&\leq&D_{9}(n/3)+7D_{\oplus}, D_{9}(1)=D_{\oplus}+D_{\otimes}. \end{array} \right. $$

(15)

Applying Lemma 1 leads to these bounds:

$$ \left\{ \begin{array}{lcl} M_{9,\otimes}(n)&\leq&4n^{\log_{3}5}, \\ M_{9,\oplus}(n)&\leq&26n^{\log_{3}5}-30n+6,\\ M_{9}(n)&\leq&30n^{\log_{3}5}-30n+6,\\ D_{9}(n)&\leq&(7\log_{3}n+1)D_{\oplus}+D_{\otimes}. \end{array} \right. $$

(16)

We now obtain the following complexity for M ₃(n) by substituting the result from Eq. 15 into M ₃(n) and applying Lemma 1:

$$ \left\{ \begin{array}{lcl} M_{3,\otimes}(n)&\leq&4M_{\otimes}(n/3)+M_{9,\otimes}(n/3), \, M_{3,\otimes}(1)=1,\\ M_{3,\oplus}(n)&\leq&4M_{3,\oplus}(n/3)+M_{9,\oplus}(n/3)+8n-10, \, M_{3,\oplus}(1)=0,\\ M_{3}(n)&\leq&4M_{3}(n/3)+M_{9}(n/3)+8n-10, \, M_{3}(1)=6,\\ D_{3}(n)&\leq &D_{9}(n/3)+7D_{\oplus}. \end{array} \right. $$

(17)

$$ \left\{ \begin{array}{lcl} M_{3,\otimes}(n)&\leq&4n^{\log_{3}5}-3n^{\log_{3}4}, \\ M_{3,\oplus}(n)&\leq&26n^{\log_{3}5}-33.33n^{\log_{3}4}+6n+1.33,\\ M_{3}(n)&\leq&30n^{\log_{3}5}-36.33n^{\log_{3}4}+6n+1.33,\\ D_{3}(n)&\leq&(7\log_{3}n+1)D_{\oplus}+D_{\otimes}, \, D_{3}(1)=D_{\otimes}. \end{array} \right. $$

(18)

When we compare the total number of arithmetic operations in Eq. 18 with that in Eq. 13, which is the best known 3-way algorithm, we see that the algorithm presented in this section becomes cost effective after the size of polynomials is more than 729. The percentage of cost reduction increases significantly when the polynomial size increases. For example, the reduction is about 4% for n=3⁷, 14% for n=3⁸, 25% for n=3⁹, and 55% for n=3¹⁰. The delay complexity in Eq. 18 is about 75% higher than that in Eq. 13 and this relative difference remains the same for all practical values of n. This is because the dominant coefficients for the delay complexities in Eqs. 18 and 13 are $7\log _{3} n$ and $4\log _{3} n$, respectively.

5 Complexity Analysis and Implementation Results for Practical n Values

This section presents the arithmetic complexity analysis and hardware implementation of polynomial multiplications over $\mathbb {F}_{3}$ and $\mathbb {F}_{9}$ for n=167,193,239,317 and 353. It should first be noted that we use 2-way or 3-way splits. If n is not divisible by two or three, we pad the polynomial with one or two zeros so that the sizes become divisible by three. The adjustment has a negligible effect on the complexity. Another note is that using the same algorithm in every recursion until the size becomes unity fails to produce the best results and that employing the schoolbook method after the size becomes small enough yields a better outcome. We recall that the schoolbook method requires n ² multiplications and (n−1)² additions in order to multiply two degree (n−1) polynomials leading to

$$ M_{3}(n+1)\leq M_{3}(n)+4n. $$

(19)

When we refer to the schoolbook method it is implied that we are computing M ₃(n+1) in terms of M ₃(n).

To demonstrate the effect of this approach, we can consider, for example, M ₃(8). Using the improved Karatsuba method in each recursion gives

$$M\!_{3}(8)\,=\,3M_{3}(4)\,+\,25\,=\,9M_{3}(2)+58=27M_{3}(1)+94=123. $$

On the other hand, using the schoolbook method after n=4 gives

$$M_{3}(8)=3M_{3}(4)+25=3\cdot25+25=100. $$

We use the same strategy for the multiplication in $\mathbb {F}_{9}[X]$. It can be observed that using the schoolbook method and the improved Karatsuba 2-way method together for multiplication in $\mathbb {F}_{9}[X]$ yields better results for small values of n. Recalling M ₉(1)=6 from Section 2 leads to an easy determination that the improved Karatsuba 2-way method for multiplication in $\mathbb {F}_{9}[X]$ gives

$$ M_{9}(n)\leq 3M_{9}(n/2)+7n-6, $$

(20)

and that the schoolbook method in $\mathbb {F}_{9}[X]$ gives

$$ M_{9}(n+1)\leq M_{9}(n)+16n+4. $$

(21)

As a further refinement, we use an additional strategy and proceed as follows: We split the degree (n−1) polynomials that will be multiplied into two parts by extracting ω part and ω-free part, i.e., we write $A,B \in \mathbb {F}_{9}[X]$ as A = A ₀ + A ₁ ω and B = B ₀ + B ₁ ω where $A_{0},A_{1},B_{0},B_{1} \in \mathbb {F}_{3}[X]$ of degree (n−1). Then

$$\begin{array}{@{}rcl@{}} AB&=&(A_{0}+A_{1}\omega)(B_{0}+B_{1}\omega)=A_{0}B_{0}-A_{1}B_{1}\\ &&+((A_{0}+A_{1})(B_{0}+B_{1})-A_{0}B_{0}-A_{1}B_{1}) \omega, \end{array} $$

(22)

i.e., M ₉(n)≤3M ₃(n)+8n−3 and M ₉(3)≤60.

We are now ready to compute the arithmetic cost of multiplication for n=167, 193, 249, 317, and 353. We have employed the following abbreviations for the algorithm names:

KA for the improved Karatsuba 2-way in $\mathbb {F}_{3}[X]$ as presented in Section 3.2
SB for the schoolbook method in $\mathbb {F}_{3}[X]$
K A ₉ for the improved Karatsuba 2-way in $\mathbb {F}_{9}[X]$ as given by Eq. 20
A1₉ for the new 3-way algorithm for multiplication in $\mathbb {F}_{9}[X]$, as explained in Section 4
A2₉ for multiplication in $\mathbb {F}_{9}[X]$ as given by Eq. 22

It should be noted that when we recursively use an algorithm A ℓ times, we write (A)^ℓ. The recursions listed in Tables 2 are also used in our hardware implementations.

Table 2 Complexities for polynomial multiplication over $\mathbb {F}_{3}$ and $\mathbb {F}_{9}$ for special n values.

Full size table

An additional note is that the classical approach for polynomial multiplication over $\mathbb {F}_{9}$ is the method expressed in Eq. 22, which has

$$ M_{9}(n)/ M_{3}(n)\approx 3. $$

(23)

Our proposed algorithms reduce this ratio to 2.57 or lower for the values indicated in Table 2, representing an improvement of about 15%.

For hardware implementation using digital technologies, we represent the elements of $\mathbb {F}_{3}$ as (x ₁,x ₂) where x ₁, x ₂∈{0,1} and the elements of $\mathbb {F}_{3}$, 0, 1, and 2 are represented by (0,0), (1,0) and (0,1), respectively. The addition of the elements of $\mathbb {F}_{3}$ is performed using the method reported in [20]. Let (x ₁,x ₂), (y ₁,y ₂), $(z_{1},z_{2}) \in \mathbb {F}_{3}$ such that (x ₁,x ₂)+(y ₁,y ₂)=(z ₁,z ₂). The addition can be implemented as follows:

$$\begin{array}{@{}rcl@{}} &&{}t = (x_{1}\, | \, y_{2})\,\, \hat{} \,\,(x_{2} \, | \, y_{1});\\ &&{}z_{1}= (x_{2} \, | \, y_{2}) \, \hat{} \, t;\\ &&{}z_{2}= (x_{1}\, | \, y_{1}) \, \hat{} \, t; \end{array} $$

It should also be noted that the negation is as follows: 2(x ₁,x ₂)=−(x ₁,x ₂)=(x ₂,x ₁), i.e., multiplication of an element of $\mathbb {F}_{3}$ by −1 is essentially free of cost.

Assume that (x ₁,x ₂), (y ₁,y ₂), $(z_{1},z_{2}) \in \mathbb {F}_{3}$ such that (x ₁,x ₂)(y ₁,y ₂)=(z ₁,z ₂). We use the following method for multiplication in $\mathbb {F}_{3}$.

$$\begin{array}{@{}rcl@{}} z_{1} = (x_{1}\, \& \,y_{1})\,\, |\,\, (x_{2} \,\& \,y_{2});\\ z_{2} = (x_{1}\, \& \,y_{2}) \,\,| \,\,(x_{2} \,\& \,y_{1}); \end{array} $$

Now, we show the representation and operations for $\mathbb {F}_{9}$. Recall that we represen

$$\mathbb{F}_{9} \cong \mathbb{F}_{3}[\omega]/(\omega^{2}+1)\,=\,\{0,1,2,\omega,\omega+1,\omega+2,2\omega,2\omega+1,2\omega+2\}.$$

The elements of $\mathbb {F}_{9}$ are therefore represented by a ₀ + a ₁ ω, where $a_{1},a_{2} \in \mathbb {F}_{3}$. For $a,b,c,d \in \mathbb {F}_{3}$, the addition and multiplication in $\mathbb {F}_{9}$ are:

$$\left\{ \begin{array}{l} (a+b\omega)+(c+d\omega)=(a+c)+(b+d)\omega,\\ (a+b\omega)(c+d\omega)=ac-bd+(bc+ad)\omega. \end{array} \right. $$

Since the elements of $\mathbb {F}_{3}$ are each represented as a two-tuple, we need two two-tuples for representing the elements of $\mathbb {F}_{9}$, i.e., for $a=a_{0}+a_{1}\omega \in \mathbb {F}_{9}$, we have a=(a _0,1,a _0,2)+(a _1,1,a _1,2)ω or simply a=[(a _0,1,a _0,2),(a _1,1,a _1,2)], where the entries are from {0,1}.

Let $a,b \in \mathbb {F}_{9}$ such that a=[(a ₁,a ₂),(a ₃,a ₄)] and b=[(b ₁,b ₂),(b ₃,b ₄)]. Then a + b is obtained as follows:

$$a+b=[\underbrace{(a_{1},a_{2})+(b_{1},b_{2})}_{\text{Call}~\mathbb{F}_{3}~\text{addition}},\underbrace{(a_{3},a_{4})+(b_{3},b_{4})}_{\text{Call}~\mathbb{F}_{3}~\text{addition}}]. $$

On the other hand, multiplication of a=[(a ₁,a ₂),(a ₃,a ₄)] and b=[(b ₁,b ₂),(b ₃,b ₄)] can be performed as follows:

$$\begin{array}{@{}rcl@{}} ab&=&[(a_{1},a_{2})(b_{1},b_{2})-(a_{3},a_{4})(b_{3},b_{4}),(a_{3},a_{4})(b_{1},b_{2})\\ &&+(a_{1},a_{2})(b_{3},b_{4})]. \end{array} $$

It should be noted that multiplication in $\mathbb {F}_{9}$ requires multiplication, addition and negation in $\mathbb {F}_{3}$.

We have implemented the polynomial multiplication of degree (n−1) for n=167, 239, 317, and 353. For each of these values, we have used the proposed algorithms for $\mathbb {F}_{3}$ and $\mathbb {F}_{9}$ for a number of recursions at the beginning and as the size of polynomials became smaller we have switched to other algorithms so that the overall arithmetic complexity could be kept at a minimum. Table 2 lists the sequence of algorithms used for each of the values of n. For example, for the multiplication of polynomials of degree 166 (or size 167) over $\mathbb {F}_{3}$, KA is used for the first six recursions, and SB is then used for the multiplication of polynomials of degree two. For computing the multiplication of polynomials of degree 166 over $\mathbb {F}_{9}$, we used A1₉ twice, K A ₉ once, A2₉ once, KA once and SB five times.

We have implemented the proposed algorithms at the Register Transfer Level (RTL) using Verilog HDL. For each algorithm, the sequence of operations described in Table 2 has been realized as pure combinational circuits. As an example, a high level block diagram of our circuits for the multiplication algorithm A1₉ is given in Appendix. In our implementation using Verilog, the schoolbook multiplication, $\mathbb {F}_{3}$ addition and $\mathbb {F}_{9}$ addition have each been coded to be configurable so that they can be instantiated in the recursive tree simply by passing the size parameter. The gate level synthesis has been performed in a Synopsys Design Compiler Version E-2010.12 using the TSMC 65 nm standard cell library at the worst case corner. The synthesis has been targeted to optimize for the area. The total areas and critical path delays achieved in the post-synthesis simulation for a variety of values of n are listed in Table 3. We note that there seems to be no previous ASIC implementation of characteristic three polynomial or field multiplication algorithms for values of n similar to those reported in Table 3. Readers interested in FPGA implementation results for multiplication of elements over characteristic fields like $\mathbb {F}_{3^{97}}$ are referred to [20].

Table 3 Implementation results for polynomial multiplication over $\mathbb {F}_{3}$ and $\mathbb {F}_{9}$ for special values of n.

Full size table

6 Further Improvements

M ₃(n) can be improved about 50% if we design a new algorithm as follows: We use the evaluation points 0,1,ω,−ω and $\infty $. Then we have

$$\begin{array}{@{}rcl@{}} &&{}\text{Evaluation at}~X=0 \Longrightarrow P_{0}=A_{0}B_{0}=C_{0} \\ &&{}\text{Evaluation at}~X=1 \Longrightarrow P_{1}=(A_{0}+A_{1}+A_{2})\\ &&\times(B_{0}+B_{1}+B_{2})=C_{0}+C_{1}+{\cdots} +C_{4} \\ &&{}\text{Evaluation at}~X=\omega \Longrightarrow P_{2}=(A_{0}+A_{1}\omega-A_{2})\\ &&\times(B_{0}+B_{1}\omega-B_{2})=C_{0}+C_{1}\omega-{\cdots} +C_{4} \\ &&{}\text{Evaluation at}~X=-\omega \Longrightarrow P_{3}=(A_{0}-A_{1}\omega-A_{2})\\ &&\times(B_{0}-B_{1}\omega-B_{2})=C_{0}-C_{1}\omega+{\cdots} +C_{4} \\ &&{}\text{Evaluation at}~X=\infty \Longrightarrow P_{4}=A_{2}B_{2}=C_{4}. \end{array} $$

Let P ₂ = P _2,0 + ω P _2,1 and P ₃ = P _3,0 + ω P _3,1. It can be observed that P _2,0 = P _3,0 and P _2,1=−P _3,1, which shows that P ₃ can be obtained from P ₂, thus avoiding the requirement to compute P ₃. The following formula and recursions can easily be obtained with the use of the method described previously in this section:

$$ \left\{\begin{array}{l} C_{0}=P_{0},\\ C_{1}=-P_{0}-P_{1}-P_{2,0}-P_{4}+P_{2,1}\omega,\\ C_{2}=-P_{0}-P_{2,0}+P_{4},\\ C_{3}=-P_{0}-P_{1}-P_{2,0}-P_{4}-P_{2,1}\omega,\\ C_{4}=P_{4}. \end{array}\right. $$

(24)

$$ \left\{ \begin{array}{lcl} M_{3,\otimes}(n)&\leq&3M_{\otimes}(n/3)+M_{9,\otimes}(n/3), \, M_{3,\otimes}(1)=1,\\ M_{3,\oplus}(n)&\leq&3M_{3,\oplus}(n/3)+M_{9,\oplus}(n/3)+14n/3-6, \, M_{3,\oplus}(1)=0,\\ M_{3}(n)&\leq&3M_{3}(n/3)+M_{9}(n/3)+14n/3-6, \, M_{3}(1)=6,\\ D_{3}(n)&\leq &D_{9}(n/3)+7D_{\oplus}. \end{array} \right. $$

(25)

$$ \left\{ \begin{array}{lcl} M_{3,\otimes}(n)&\leq&2n^{\log_{3}5}-n, \\ M_{3,\oplus}(n)&\leq&13n^{\log_{3}5}-4.85n{\log_{3}n}-13n,\\ M_{3}(n)&\leq&15n^{\log_{3}5}-4.85n{\log_{3}n}-14n,\\ D_{3}(n)&\leq&(7\log_{3}n+1)D_{\oplus}+D_{\otimes}. \end{array} \right. $$

(26)

The complexity of each algorithm is summarized in Table 4. It should be noted that the new 3-way algorithm outperforms the improved Karatsuba algorithm when n>400.

Table 4 Complexities of the different approaches for multiplication over $\mathbb {F}_{3}$.

Full size table

Example 1

This example illustrates our design of an area efficient algorithm for n=709. Using the recursion in Eq. 25 and the values for M ₃(239), M ₉(239) and M ₃(355) from Table 2 yields the following results:

$$\begin{array}{@{}rcl@{}} M_{3}(709)&<&M_{3}(717)\leq 3M_{3}(239)\\&&+M_{9}(239)+14\cdot 239-6=191870. \end{array} $$

On the other hand, improved Karatsuba gives

$$M_{3}(709)<M_{3}(710)\leq 3M_{3}(355)+7\cdot355-3. $$

Even if we use M ₃(355) = M ₃(353)≤67761 from Table 2, we get

$$M_{3}(709)\leq 205765. $$

The improved Karatsuba thus yields M ₃(709)≤205765 while the new algorithm results in M ₃(709)≤191870, i.e., an approximately 7% reduction in the complexity

Remark 3

The classical approach for performing multiplication in $\mathbb {F}_{3^{2n}}$ is to use the Karatsuba algorithm in the first recursion so that the complexity becomes approximately M ₃(2n)≤3M ₃(n). The other possible method relies on the extension field representation of the elements based on the use of $\mathbb {F}_{3^{2n}} \cong \mathbb {F}_{9^{n}}$. The elements of $\mathbb {F}_{3^{2n}}$ can then be represented by the polynomials over $\mathbb {F}_{9}$ of degree less than n. This method requires approximately M ₃(2n)≤M ₉(n). Recall that M ₉(n)≈2.5M ₃(n) (see Table 2). The use of the 3-way algorithm for multiplication of polynomials over $\mathbb {F}_{9}$ is therefore superior to the classical approach by about 15%.

7 Conclusion

In this paper, we have proposed improved algorithms for multiplication in $\mathbb {F}_{3^{n}}$. As a first step, we introduced improvements to the classical Karatsuba algorithm, which can also be employed for characteristic three fields, and we also indicated the computational cost of the improved Karatsuba 2-way and 3-way algorithms. Next, we explained our derivation of a new 3-way polynomial multiplication algorithm with five 1/3 sized multiplications using interpolation in $\mathbb {F}_{9}$ and determined the arithmetic and delay complexity associated with the recursive use of this algorithm. We then described ASIC implementation of multiplication of polynomials that are of practical interest. The final contribution of this work is another efficient algorithm for multiplication in $\mathbb {F}_{3^{n}}$ that uses polynomial multiplication over $\mathbb {F}_{9}$ and produces superior results. This algorithm leads to about 15% reduction in terms of the number of basic $\mathbb {F}_{3}$ operations needed for fields considered in Section 5.

References

Ahmadi, O., Hankerson, D., & Menezes, A. (2007). Formulas for cube roots in F$_{3^{\mathrm {m}}}$. Discrete Applied Mathematics, 155(3).
Ahmadi, O., Hankerson, D., & Menezes, A. (2007). Software implementation of arithmetic in F$_{3^{\mathrm {m}}}$. In WAIFI (pp. 85–102).
Barbulescu, R., Detrey, J., Estibals, N., & Zimmermann, P. (2012). Finding optimal formulae for bilinear maps. In WAIFI (pp. 168–186).
Bernstein, D.J. (2009). Batch binary Edwards. In Advances in cryptology - CRYPTO 2009, volume 5677 of LNCS (pp. 317–336).
Cenk, M., Koç, Ç. K., & Özbudak, F. (2009). Polynomial multiplication over finite fields using field extensions and interpolation. In IEEE symposium on computer arithmetic (pp. 84–91).
Cenk, M., Hasan, M.A., & Negre, C. (2014). Efficient subquadratic space complexity binary polynomial multipliers based on block recombination. IEEE Transactions on Computers, 63(9), 2273–2287.
Article MathSciNet MATH Google Scholar
Cenk, M., Negre, C., & Hasan, M. A. (2011). Improved three-way split formulas for binary polynomial multiplication. In Selected areas in cryptography (pp. 384–398).
Cenk, M., Negre, C., & Anwar Hasan, M. (2013). Improved three-way split formulas for binary polynomial and toeplitz matrix vector products. IEEE Transactions on Computers, 62(7), 1345–1361.
Article MathSciNet MATH Google Scholar
Cenk, M., & Özbudak, F. (2008). Efficient multiplication in $\mathbb {F}_{3^{\ell m}}$, m≥1 and 5≤ℓ≤18. In AFRICACRYPT (pp. 406–414).
Fan, H., & Hasan, M. A. (2007). A new approach to subquadratic space complexity parallel multipliers for extended binary fields. IEEE Transactions on Computers, 56(2), 224–233.
Article MathSciNet Google Scholar
Farashahi, R. R., Wu, H., & Zhao, C. (2013). Efficient arithmetic on elliptic curves over fields of characteristic three. In Selected areas in cryptography (pp. 135–148).
Von Zur Gathen, J., & Gerhard, J. (2013). Modern computer algebra, 3rd edn. Cambridge: Cambridge University Press.
Book MATH Google Scholar
Gorla, E., Puttmann, C., & Shokrollahi, J. (2007). Explicit formulas for efficient multiplication in $F_{3^{6m}}$. In Selected areas in cryptography (pp. 173–183).
Hisil, H., Carter, G., & Dawson, E. (2007). New formulae for efficient elliptic curve arithmetic. In Progress in cryptology–INDOCRYPT 2007 (pp. 138–151).
Karatsuba, A. A., & Ofman, Y. (1963). Multiplication of multidigit numbers on automata. Soviet Physics Doklady, 7(2), 595–596.
Google Scholar
Kim, K. H., Choe, J. S., & Kim, S. I. (2007). New fast algorithms for arithmetic on elliptic curves over finite fields of characteristic three. Cryptology ePrint Archive, Technical Report 2007/179.
Koblitz, N. (1998). An elliptic curve implementation of the finite field digital signature algorithm. In Advances in cryptology–CRYPTO’98 (pp. 327–337). Springer.
Montgomery, P. L. (2005). Five, six, and seven-term karatsuba-like formulae. IEEE Transactions on Computers, 54(3), 362–369.
Article MATH Google Scholar
Negre, C. (2005). Scalar multiplication on elliptic curves defined over fields of small odd characteristic. In Progress in cryptology–INDOCRYPT 2005 (pp. 389–402). Springer.
Page, D., & Smart, N. P. (2003). Hardware implementation of finite fields of characteristic three. In Cryptographic hardware and embedded systems-CHES 2002 (pp. 529–539). Springer.
Smart, N. P., & Westwood, E. J. (2003). Point multiplication on ordinary elliptic curves over fields of characteristic three. Applicable Algebra in Engineering, Communication and Computing, 13(6), 485–497.
Article MathSciNet MATH Google Scholar
Winograd, S. (1980). Arithmetic complexity of computations. Society For Industrial & Applied Mathematics, U.S.
Zhou, G., & Michalik, H. (2010). Comments on a new architecture for a parallel finite field multiplier with low complexity based on composite field. IEEE Transactions on Computers, 59(7), 1007–1008.
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Applied Mathematics at Middle East Technical University, Ankara, Turkey
Murat Cenk
Synopsys Inc., Toronto, Ontario, Canada
Farhad Haghighi Zadeh
Department of Electrical, Computer Engineering at University of Waterloo, Waterloo, Ontario, Canada
M. Anwar Hasan

Authors

Murat Cenk
View author publications
You can also search for this author in PubMed Google Scholar
Farhad Haghighi Zadeh
View author publications
You can also search for this author in PubMed Google Scholar
M. Anwar Hasan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Murat Cenk.

Appendix

A high level block diagram of our circuits for the multiplication algorithm of A1₉ is presented in Fig. 2. A and B are two degree (n−1) polynomials over $\mathbb {F}_{3}$. They are split into three parts as A(X) = A ₀ + A ₁ X ^n/3 + A ₂ X ^2n/3 and B(X) = B ₀ + B ₁ X ^n/3 + B ₂ X ^2n/3 where A _i and B _i are polynomials of degree less than n/3. For the details of the algorithm, we refer the reader to Section 4.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cenk, M., Zadeh, F.H. & Hasan, M.A. New Efficient Algorithms for Multiplication Over Fields of Characteristic Three. J Sign Process Syst 90, 285–294 (2018). https://doi.org/10.1007/s11265-017-1234-x

Download citation

Received: 13 May 2014
Revised: 24 November 2016
Accepted: 20 February 2017
Published: 06 March 2017
Issue Date: March 2018
DOI: https://doi.org/10.1007/s11265-017-1234-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

New Efficient Algorithms for Multiplication Over Fields of Characteristic Three

Abstract

Similar content being viewed by others

Some new results on binary polynomial multiplication

Improved Polynomial Multiplication Algorithms over Characteristic Three Fields and Applications to NTRU Prime

Efficient Polynomial Multiplication via Modified Discrete Galois Transform and Negacyclic Convolution

1 Introduction

Our Contributions

Organization of the Paper

2 Notations and Preliminaries

Lemma 1

Proof

3 Known Algorithms and their Improvements

Remark 1

3.1 Karatsuba 2-Way Algorithm

3.2 Improved Karatsuba 2-Way Algorithm

3.3 Karatsuba Like 3-Way Algorithm

3.4 Improved Karatsuba Like 3-Way Algorithm

4 Proposed 3-Way Algorithm with Five Multiplications

Remark 2

5 Complexity Analysis and Implementation Results for Practical n Values

6 Further Improvements

Example 1

Remark 3

7 Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Appendix

Rights and permissions

About this article

Cite this article

Keywords

Navigation

New Efficient Algorithms for Multiplication Over Fields of Characteristic Three

Abstract

Similar content being viewed by others

Some new results on binary polynomial multiplication

Improved Polynomial Multiplication Algorithms over Characteristic Three Fields and Applications to NTRU Prime

Efficient Polynomial Multiplication via Modified Discrete Galois Transform and Negacyclic Convolution

1 Introduction

Our Contributions

Organization of the Paper

2 Notations and Preliminaries

Lemma 1

Proof

3 Known Algorithms and their Improvements

Remark 1

3.1 Karatsuba 2-Way Algorithm

3.2 Improved Karatsuba 2-Way Algorithm

3.3 Karatsuba Like 3-Way Algorithm

3.4 Improved Karatsuba Like 3-Way Algorithm

4 Proposed 3-Way Algorithm with Five Multiplications

Remark 2

5 Complexity Analysis and Implementation Results for Practical n Values

6 Further Improvements

Example 1

Remark 3

7 Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Appendix

Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation