Professional Documents
Culture Documents
3, APRIL 2008
I. INTRODUCTION
A. CP
Based on the cellular-array power-sum circuit and the cel-
lular-array multiplier [8], a CP is constructed. The CP is pro-
vided to perform and in the finite field ,
which includes an array of identity cells, as shown in
Fig. 2. Each identity cell includes three two-input AND gates,
one two-input XOR gate, one three-input XOR gate, and a mul-
tiplexer, as shown in Fig. 3. In this CP, what arithmetic oper-
ation which this CP wants to perform is decided by a control
signal . Assume two input elements, and , are, re-
spectively, expressed as
(2)
Fig. 3. Circuit of (i; j ) identity cell in CP.
(3)
Then, according to the irreducible polynomial and its modulo
polynomial, we have
(4)
830 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I: REGULAR PAPERS, VOL. 55, NO. 3, APRIL 2008
(5) ;
for to ,
By this, a CP is designed to perform the operation and
operation. A signal is used to control the operations. for to ,
When the control signal ,
;
and operation is performed. This CP performs the
operation when . At this time, end;
for and . The outcome of this
CP is ;
for to ,
;
end;
(6)
end;
For the operation, the calculation result can be repre- .
sented as follows [8]:
For the operation, the algorithm can be represented
as follows:
(7)
(10)
(8)
(9) (12)
CHEN et al.: AU FOR FINITE FIELD 831
In summary, the algorithm of is as follows. in each identity cell. Accordingly, for the
identity cell
;
for to , (13)
for to ,
; (14)
(15)
end;
;
Then the identity cells in the th column of the CP perform
; the computations of and . Finally, the output of
the CP is . The CP performs the
for to ,
operation when the control signal = 1. At this time, the
; multiplexer MUX of each identity cell outputs .
Accordingly
end;
end;
(16)
.
(17)
The complete algorithm of CP can then be obtained by just
combining the algorithms of and , that is, as shown
For the identity cell
in the following algorithm.
;
for to , (18)
for to , (19)
;
end; Therefore, the identity cells in the th column of the CP per-
form the computations of and . Then the output
if , then of the CP is .
The CP mentioned above can be modified to be a general CP
;
of a finite field (see Fig. 4). The same as the above
for to , CP, the general CP is also an array of identity cells. Assuming
this general CP is structured of identity cells, then this
; general CP can perform the and operations in all
end; finite fields for , where and are elements
of the finite field . Furthermore, to adapt the different-
else sized finite fields , each identity cell is further provided
; with two two-input multiplexers MUX1, MUX2 and a control
signal . The control signal is determined by the size of
; the finite field , for controlling the multiplexers MUX1
and MUX2. The control signal only for the th
for =2 to ,
row of identity cells, so that the multiplexer MUX1 can pass
; to , in the same row, and the other
multiplexer MUX2 can pass to , in
end;
the same row. The other control signals for ,
end; so that the multiplexer MUX1 in all identity cells for
can receive of the upper identity cell to its
.
, and the other multiplexer MUX2 in all identity cells
Figs. 2 and 3 show the circuit diagram of the CP and it per- for can receive of the upper
forms the operation when the control signal . identity cell to its . Thus, the identity cells in
At this time, the input signal . As a result, the lower left part of this general CP perform the same arithmetic
the gate AND3 outputs a 0’s, and the multiplexer MUX outputs operations as the above mentioned CP. Furthermore, the
832 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I: REGULAR PAPERS, VOL. 55, NO. 3, APRIL 2008
(21)
B. AP
The AP is structured on the CP, for performing all arithmetic
operations except addition. These arithmetic operations can be
combined by four basic operations: loading, multiplication, ex-
ponentiation, and inverse multiplication. For example, division
is implemented by combining multiplication and inverse mul-
tiplication. The detailed structural diagram of the AP is shown
in Fig. 6, which includes a CP and additional control circuits
and storage memories. For a finite field , these control
Fig. 4. General CP. (a) Structural diagram. (b) Circuit of (i; j ) identity cell. circuits and storage memories includes five -bit multiplexers,
two groups of -bit D-type flip flops, an -bit switch and some
logic gates generating control signals for multiplexers. The
output , are redundancy since input of the AP includes: ,
input signal for , that is control signal ,
determined by the maximum size of the finite field
Here is an element in the finite field , which is multiplication , where . In fact, for
input from the input , where the finite field . Therefore, to
is between 0 and and can be divided as perform is to perform the exponentiation with
, then can
be expressed as . Clearly, inverse mul-
tiplication , like exponentiation , is implemented by
operations of the CP. Therefore, the control signal
(21) for cycles, so that the CP performs the
operation for times. The outcome of the th cycle
Clearly, exponentiation can be implemented by is stored in the register Register1 so as to feedback to the CP for
operations of the CP. Therefore, the control signal the next operation. The control signal of the multiplexer
for cycles so that the CP performs the operations MUX5 is for cycles, the control signal of
for times. The outcome of the th cycle is stored in the multiplexer MUX1 is for the first cycle, the control
the register Register1 so as to feedback to the CP for the next signal of the multiplexer MUX2 is
operation for cycles, the control signal of the multiplexer MUX3
is 1 for cycles, the control signal of the multiplexer
(22) MUX4 is . Thus, the outcome of the
inverse multiplication operation can be obtained in
Furthermore, the outcome of the exponentiation is se- cycles and stored in Register1. Furthermore, when the inverse
lected from or according to . The control signal multiplication operation is executing, the outcome for each
of the multiplexer MUX5 is for cycle is stored in Register1, and so the data of the previous
cycles, the control signal of the multiplexer MUX1 is instruction stored in Register1 has to be transferred to Register2
for the first cycle, the control signal of the multi- (controlled by the signal ) for later use.
plexer MUX2 is for Since it is able to perform loading, multiplication, exponen-
cycles, the control signal of the multiplexer MUX3 is 0 for tiation and inverse multiplication, the AP can perform all arith-
cycles, the control signal of the multiplexer MUX4 metic operations in the finite field except addition (ac-
is . Thus, the outcome of the cumulation), which can be implemented by the ALU.
exponentiation operation can be obtained in cycles
and stored in the register Register1. Moreover, when the expo- C. ALU
nentiation operation is executing, the outcome for each cycle Addition in the finite field can be simply imple-
is stored in the register Register1. Therefore, the data of the mented by XOR gates, and another register is provided to
previous instruction stored in the register Register1 has to be store the previous data when performing accumulation. When
transferred to the register Register2 (controlled by the signal the accumulation is completed, the register is also refreshed.
) for later use. The overall ALU can be seen in Fig. 7. This circuit is designed
4) Inverse Multiplication: When the control signal to perform one accumulation in each cycle, which adds the data
, the AP performs inverse from the AP and the data stored in the register and outputs back
CHEN et al.: AU FOR FINITE FIELD 835
to the register. Whether or not the AP performs accumulation is tion. When , a zero element (0) in the finite field
determined by the control signal . When , is sent to the ALU, so the output of the ALU remains
the ALU receives the output of the AP and performs accumula- the same.
836 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I: REGULAR PAPERS, VOL. 55, NO. 3, APRIL 2008
TABLE I
OPERATION PROCEDURE OF EXAMPLE
[10] W. C. Tsai and S. J. Wang, “Two systolic architecture for multiplication Shyue-Win Wei (S’85–M’86–SM’97) was born in
in GF(2 ) ,” Proc. IEE Comput. Digit. Tech., vol. 147, pp. 375–382, Changhua,Taiwan, R.O.C., on June 9, 1958. He re-
Nov. 2000. ceived the M.S. degree in communications and Ph.D.
[11] C. H. Kim, C. P. Hong, and S. Kwon, “A digit-serial multiplier for finite degree in electronics from National Chiao Tung Uni-
field GF(2 ) ,” IEEE Trans. Very Large Scale Integr. (VLSI) Circuits versity, Hsinchu, Taiwan, R.O.C., in 1986 and 1990,
Syst., vol. 13, no. 4, pp. 476–483, Apr. 2005. respectively.
[12] H. Wu, “Bit-parallel finite field multiplier and squarer using polynomial From 1980 to 1984 he was with the Institute
basis,” IEEE Trans. Comput., vol. 51, no. 7, pp. 750–758, Jul. 2002. of Police Telecommunications, Taiwan, R.O.C. In
[13] C. Y. Lee, “Low complexity bit parallel systolic multiplier over
GF(2 ) using irreducible trinomials,” Proc. IEE Comput. Digit.
1990, he joined Telecommunication Laboratories,
Taiwan, R.O.C., where he worked on the devel-
Tech., vol. 150, no. 1, pp. 39–42, Jan. 2003. opment of a high-bit-rate digital subscriber line
[14] K. Y. Chang, D. Hong, and H. S. Cho, “Low complexity bit-par-
allel multiplier for GF(2 ) defined by all-one polynomials using
transmission system. Form 1992 to 2000, he was with the Department of
Electrical Engineering, Chung Hua University, Taiwan, R.O.C. Since 2000, he
redundant representation,” IEEE Trans. Comput., vol. 54, no. 12, pp. was been a Professor in the Department of Electrical Engineering, National
1628–1630, Dec. 2005.
Chi-Nan University, Nantou, Taiwan, R.O.C. His research interests include
[15] C. L. Wang and J. L. Lin, “A systolic architecture for computing in-
verses and divisions in finite field GF(2 ) ,” IEEE Trans. Comput.,
digital transmission system, digital subscriber lines, coding theory, and related
VLSI circuits design.
vol. 42, no. 9, pp. 1141–1146, Sep. 1993.
[16] A. V. Dinh, R. J. Bolton, and R. Mason, “A low latency architecture
for computing multiplicative inverses and divisions in GF(2 ) ,” IEEE
Trans. Circuits Syst. II, Analog Digit. Signal Process., vol. 48, no. 8,
pp. 789–793, Aug. 2001. Hung-Jen Tsai was born on April 28, 1968. He re-
[17] Z. Yan and D. V. Sarwate, “New systolic architectures for inversion ceived the M.S. degree in electrical engineering from
and division in GF(2 ) ,” IEEE Trans. Comput., vol. 52, no. 11, pp. Chung Hua University, Hsinchu, Taiwan, R.O.C., in
1514–1519, Nov. 2003. 1997.
[18] C. C. Wang, T. K. Truong, H. M. Shao, L. J. Dentsch, J. K. Omura, Since 1989, he has been a member of Beam Dy-
and I. S. Reed, “VLSI architectures for computing multiplications
GF(2 )
namics Group, National Synchrotron Radiation Re-
and inverses in ,” IEEE Trans. Comput., vol. C-34, no. 7, pp. search Center, Hsinchu, Taiwan, R.O.C.
709–716, Jul. 1985.
[19] P. A. Scott, S. J. Simmons, S. E. Tavares, and L. E. Peppard, “Archi-
tectures for exponentiation in GF(2 ) ,” IEEE J. Sel. Areas Commun.,
vol. 6, no. 3, pp. 578–586, Apr. 1988.
[20] S. W. Wei, “A systolic power-sum circuit for GF(2 ) ,” IEEE Trans.
Comput., vol. 43, no. 2, pp. 226–229, Feb. 1994.