4 COMPLEXITY COMPARISON
In this section we evalutate the complexity of the
modified form of Lagrange multiplication of Bajard et
al.. First we focus on the theoretic complexity by
evaluating the number of field operations F
p
(addi-
tions and multiplications). In the table 4 we give
the cost of each step of the Algorithm 2 used with
Change
Rep of section 3. For an explicit evaluation
of the cost of the FFT we refer to (von zur Gathen and
Gerhard, 1999).
Multiplication Additions
Step 1 2r 0
Step 2 rlog
2
(r) 2rlog
2
(r)
Step 3 3r r
Step 4 (2rlog
2
(r) + r − 1) 4rlog
2
(r)
The global cost of the algorithm is thus equal to
(4rlog
2
(r)+7r−2) multiplications and (8rlog
2
(r)+
r) additions. We get a clearly improvement compared
to original LR multiplication, since the complexity
was equal to (2r(r − 1) + r) addtions and 2r
2
+ 5r
multiplications in F
p
.
Hardware implementation. For hardware imple-
mentations, the FFT have the nice property to be
parallelizable. Specially, we can compute F F T
r
(A)
with r/2 multipliers and r adders in parallel. The de-
lay of the architecture to perform one F F T
r
is then
equal to log
2
(r)T
M
+log
2
(r)T
A
. For a precise expla-
nation of this fact we refer to (Johnson et al., 2000) .
But the other computations in the Algorithm 2 re-
quire also at most only r multipliers in parallel and r
adders in parallel. This is clear for step 1 and step 3,
for step 2 and 4, we need r adders and multipliers for
the FFT parts, and also r multipliers for the compu-
tation of
e
Q and R(X). We can use at each time the
same r adders and multipliers.
Consequently the Algorithm 2 can be implemented
in hardware with an architecture using r multipliers
and r adders in parallel.
Let us evaluate the delay of such architecture. The
total delay is equal to the sum of the delay of each
step of Algorithm 2. If we note T
M
the time for a
multiplication in F
p
, the step 1 has a delay of 2T
M
and the step 2 has a delay of 3T
M
+ T
A
. In step 2
and 4 the delay is equal to the delay of two F F T
r
plus the delay of two multiplications ,i.e. (one T
M
for the multiplications by r
−1
and a second for the
computation of
e
A), each step has a delay of (log
2
(r)+
2)T
M
+ (log
2
(r)T
A
.
Finally, the global delay is equal to (log
2
(r) +
5)T
M
+ (log
2
(r) + 1)T
A
.
5 CONCLUSION
In this paper we have presented a modified version of
the Algorithm of Bajard et al. (Bajard et al., 2003)
computes the product of two elements of F
p
n
. We
have modified only a part of the algorihtm: precisely
we modified the changes of representation in a way
to use FFT algorithm. We thus obtain an algorithm
which have a sub-quadratic complexity: a multipli-
cations requires (4rlog
2
(r) + 7r − 2) multiplica-
tions and (8rlog
2
(r) + r) additions in F
p
instead of
(2r(r−1)+r) addtions and 2r
2
+5r multiplications in
the original work of Bajard et al. (Bajard et al., 2003).
We are greatefull to N. Louvet for helpfull com-
ments on a preliminary version of this paper.
REFERENCES
Bailey, D. and Paar, C. (1998). Optimal Extension Fields
for Fast Arithmetic in Public-Key Algorithms. Lecture
Notes in Computer Science, 1462:472.
Bajard, J.-C., Imbert, L., Negre, C., and Plantard, T. (2003).
Efficient Multiplication in GF(p
k
) for Elliptic Curve
Cryptography. In ARITH 16, 16th IEEE Symposium
on Computer Arithmetic June 15-18, 2003 Santiago
de Compostela, SPAIN.
Berlekamp, E. (1982). Bit-serial Reed-Solomon encoder.
IEEE Transaction on Information Theory, IT-28(6).
Johnson, J., Kumhom, P., and Nagvajara, P. (2000). De-
sign, optimization, and implementation of a universal
fft processor. In 13th IEEE International ASIC/SOC
Conference, Washington, DC.
Koblitz, N. (1987). Elliptic curve cryptosystems. Mathe-
matics of Computation, 48(177):203–209.
Miller, V. (1986). Use of elliptic curves in cryptography.
Advances in Cryptology, proceeding’s of CRYPTO’85,
218:417–426.
Montgomery, P. (1985). Modular multiplication without
trial division. Mathematic of computation, 44(170).
Ning, P. and Yin, Y. (2001). Efficient Software Implemen-
tation for Finite Field Multiplication in Normal Basis.
Lecture Notes in Computer Science, 2229:177.
Sunar, B. and Koc, C. (1999). Mastrovito Multiplier for All
Trinomials. IEEE Transaction on Computers.
von zur Gathen, J. and Gerhard, J. (1999). Modern com-
puter algebra. Cambridge University Press, New
York, NY, USA.
FINITE FIELD MULTIPLICATION IN LAGRANGE REPRESENTATION USING FAST FOURRIER TRANSFORM
323