CN103186360A - Fast arithmetic multi-bit serial pulse dual-base binary finite field multiplier - Google Patents

Fast arithmetic multi-bit serial pulse dual-base binary finite field multiplier Download PDF

Info

Publication number
CN103186360A
CN103186360A CN2013101154017A CN201310115401A CN103186360A CN 103186360 A CN103186360 A CN 103186360A CN 2013101154017 A CN2013101154017 A CN 2013101154017A CN 201310115401 A CN201310115401 A CN 201310115401A CN 103186360 A CN103186360 A CN 103186360A
Authority
CN
China
Prior art keywords
module
result
input
individual
xor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2013101154017A
Other languages
Chinese (zh)
Other versions
CN103186360B (en
Inventor
潘正祥
杨春生
白忠海
李秋莹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Graduate School Harbin Institute of Technology
Original Assignee
Shenzhen Graduate School Harbin Institute of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Graduate School Harbin Institute of Technology filed Critical Shenzhen Graduate School Harbin Institute of Technology
Priority to CN201310115401.7A priority Critical patent/CN103186360B/en
Publication of CN103186360A publication Critical patent/CN103186360A/en
Application granted granted Critical
Publication of CN103186360B publication Critical patent/CN103186360B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Complex Calculations (AREA)

Abstract

The invention relates to a fast arithmetic multi-bit serial pulse dual-base binary finite field multiplier, comprising an input end B, k PE modules, an FRRP module and an R3 module. The k PE modules are connected in series, the k PE modules pass through k cycles, in the first cycle, the input of A is that B is directly input, and the calculation result is restored and input into a temporary register C through the FRRP module; in the second cycle, the input of A is that B is input through the R3 module, the calculation result is also restored through the FRRP module, and is added to the calculation result of the first cycle and stored in the temporary register C; so, in the k cycle, the input of A is that B is input after passing through the R3 module for (k-1) times, the calculation result is restored through the FRRP module, added to the accumulation result of the previous (k-1) times and stored in the temporary register C, and the temporary register C outputs the result.

Description

Scale-of-two Galois field multiplier at the bottom of the first series connection pulsation of the quick computing multidigit double-basis
Technical field
The present invention relates to a kind of scale-of-two Galois field multiplier, relate in particular to scale-of-two Galois field multiplier at the bottom of a kind of quick computing multidigit unit series connection pulsation double-basis.
Background technology
In recent years, Elliptic Curve Cryptography (ECC, Elliptic curve cryptography) [1], [2] are connected with cryptographic research.Along with the appearance of Elliptic Curve Cryptography in common key cryptosystem, some hard-wired problems have been carried in the application of ECC.NIST has recommended 5 two bit fields, and GF (2 163), GF (2 233), GF (2 283), GF (2 409), and GF (2 571).In the cipher protocol based on the ECC substrate, it is the requisite element that calculation level becomes that on-the-spot multiplication is arranged.The common influence area of the validity of cryptographic system hardware, energy consumption, and performance performance.
For the realization of high speed lsi (VLSI, very-large-scale integration), the heart contraction array structure is better selection.In two bit fields of expansion, multiple effective heart contraction array multiplier has been designed and can be classified as bit parallel and has been serial mechanism.Effectively bit parallel heart contraction multiplier adopts the preferential or MSB priority algorithm of LSB usually.The major advantage of bit parallel heart contraction multiplier is the connectivity in the whole computation process.Yet these structures need O (m to the polynomial expression based on two bit fields 2) XOR, O (m 2) AND, O (m 2) one latch and the delay complexity of O (m).For minimizing time and space complexity, LEE[8], [9], [13] algorithm has been showed has on-the-spot multiplication for some special polynomial expressions, a full polynomial expression for example, five polynomial expressions, three polynomial expressions, can use Toeplitz matrix-vector multiplication (TMVP, Toeplitz matrix-vector product) to remove to set up the full parallel heart contraction multiplier that is.Bit serial heart contraction array multiplier needs the space complexity of O (m), but they have caused longer computing relay.
For a compromise of time complexity and space complexity, be side by side and be that digital tandem heart contraction multiplier is disclosed between the series connection multiplier.Multiplier at the bottom of the numeral tandem conversion polynomial basis is numeral based on inside, and the outside is that the structure that walks abreast is suggested in [20].In such multiplier, the m position can be divided into again during element fields was long
Figure GDA00003008658000011
The son section that individual d position is long.In each clock period, the word string of d position is calculated and the multiplication of a m position has calculated.The parallel Hunk vector matrix of an intrinsic d*d position of an extendible and systaltic multiplier use is in [15], and the delay that [16] put forward it is
Figure GDA00003008658000021
The individual clock period.The multidigit unit different structure of the series connection pulsation inside and outside use of multiplier presents in the literature.The delay of these multipliers is Clock period.As previously mentioned, the design of the heart contraction Galois field multiplier of low complex degree is fixed against the selection of irreducible function and the selection of performance substrate, and these numeral series connection multipliers need high time-delay to go to realize that multiplication calculates.
Summary of the invention
The technical matters that the present invention solves is: make up scale-of-two Galois field multiplier at the bottom of a kind of quick computing multidigit unit series connection pulsation double-basis, overcoming existing multiplier needs high time-delay to go the technical matters that realizes that multiplication calculates.
Technical scheme of the present invention is: make up scale-of-two Galois field multiplier at the bottom of a kind of quick computing multidigit unit series connection pulsation double-basis, comprise input end B, k PE module, FRRP module, R3 module, described k PE module series connection, described k PE module is through k cycle, and the input of the 1st cycle A is A 0, A 1..., A K-1, B directly imports, and result of calculation is input among the working storage C through described FRRP module reduction; The input A of the 2nd cycle A k, A K+1..., A 2k-1, B is through described R3 module input, and result of calculation also through the reduction of FRRP module, with the result of calculation addition in the 1st cycle, is kept among the working storage C; So, in k cycle, the input of A is B imports through after (k-1) inferior described R3 module, and result of calculation, is saved among the working storage C with described (k-1) inferior accumulation result addition through described FRRP module reduction, and by working storage C output result, described R3 module realizes Bx again KdThe calculating of modF (x), described PE module comprise R1 module, CMP module, CVP module, PWM module,
Figure GDA00003008658000023
Individual XOR gate and Individual latch, described R3 module output to described R1 module and carry out the coefficient conversion by described CMP module, and the coefficient conversion that described CVP module is carried out the segmentation of A is imported in the segmentation of A, and the result of calculation of CMP module and CVP module all is input to the PWM module, realizes B InCalculate process with A segmentation product
Figure GDA00003008658000025
Individual XOR gate adds up, and the result is kept at In the individual latch, by
Figure GDA00003008658000027
Latch output result
Figure GDA00003008658000028
Wherein, A is by three polynomial expression F (x)=1+x n+ x m, be expressed as A=a 0+ a 1X+...+a M-1x M-1, total m coefficient, i.e. (a 0, a 1..., a M-1).Use the segmentation patterning method, the A of m position is cut into
Figure GDA000030086580000211
Every section d position, always total k 2Therefore individual segmentation has
Figure GDA00003008658000029
B can be expressed as B=b at the bottom of by double-basis 0β 0+ b 1β 1+ ...+b M-1β M-1, as another input of multiplier; C is the output result.
Further technical scheme of the present invention is: described FRRP module comprises FR module, R2 module, and described R2 module realizes Cmod (x m+ 1) calculating, the input of described FR module are the result of calculation of k series connection PE module, and the result is reduced, and output to the R2 module.
Further technical scheme of the present invention is: described CMP module comprises XOR gate XOR_1 and XOR_2, described XOR gate XOR_1 and XOR_2 parallel connection.
Further technical scheme of the present invention is: described CVP module is XOR gate XOR_3.
Further technical scheme of the present invention is: described PWM module comprise three parallel connections with door AND_1, AND_2 and AND_3.The result of described CMP module and the output of described CVP module is carried out point-to-point multiplying each other.
Further technical scheme of the present invention is: described FR module comprises XOR gate XOR_4 and the XOR_5 of two parallel connections.
Technique effect of the present invention is: make up scale-of-two Galois field multiplier at the bottom of a kind of quick computing multidigit unit series connection pulsation double-basis, comprise input end B, k PE module, FRRP module, R3 module, described k PE module series connection, described k PE module is through k cycle, and the input of the 1st cycle A is (A 0, A 1... A K-1), B directly imports, and result of calculation is input among the working storage C through described FRRP module reduction; Input (the A of the 2nd cycle A k, A K+1..., A 2k-1), B is through described R3 module input, and result of calculation also through the reduction of FRRP module, with the result of calculation addition in the 1st cycle, is kept among the working storage C; So, in k cycle, the input of A is
Figure GDA00003008658000032
B imports through after (k-1) inferior described R3 module, result of calculation, is saved among the working storage C with front (k-1) inferior accumulation result addition through described FRRP module reduction, again by working storage C output result, the present invention in conjunction with polynomial basis at the bottom of and MPB remove to set up multiplication at the bottom of the double-basis.Some have on-the-spot multiplication can access in the parallel organization in place to obtain by inferior subspace TMVP.At two bit field GF (2 m), undecomposable three polynomial expressions and five polynomial expressions are widely used in the password field, and are long bigger usually at such field meta.By multiplier is by using time secondary TMVP formula at the bottom of a kind of new numeral series connection new website contraction double-basis, in case the Toeplitz multiplication of a d*d has been selected, it is low-down that the structure that is suggested can be gone among the present invention
Figure GDA00003008658000031
Clock period.
Description of drawings
Fig. 1 is structural representation of the present invention.
Fig. 2 is the multidigit series connection pulsation multiplier architecture figure of unit of the present invention.
Fig. 3 is the structural drawing of processing unit PE of the present invention.
Fig. 4 is the physical circuit figure of PE module of the present invention.
Embodiment
Below in conjunction with specific embodiment, technical solution of the present invention is further specified.
As shown in Figure 2, the specific embodiment of the present invention is: make up scale-of-two Galois field multiplier at the bottom of a kind of quick computing multidigit unit series connection pulsation double-basis, comprise input end B, k PE module, FRRP module, R3 module, described k PE module series connection, described k PE module is through k cycle, and the input of the 1st cycle A is A 0, A 1..., A K-1, B directly imports, and result of calculation is input among the working storage C through described FRRP module reduction; The input A of the 2nd cycle A k, A K+1..., A 2k-1, B is through described R3 module input, and result of calculation also through the reduction of FRRP module, with the result of calculation addition in the 1st cycle, is kept among the working storage C; So, in k cycle, the input of A is
Figure GDA00003008658000048
B imports through after (k-1) inferior described R3 module, and result of calculation, is saved among the working storage C with described (k-1) inferior accumulation result addition through described FRRP module reduction, and by working storage C output result, described R3 module realizes Bx again KdThe calculating of modF (x), described PE module comprise R1 module, CMP module, CVP module, PWM module,
Figure GDA00003008658000041
Individual XOR gate and Individual latch, described R3 module output to described R1 module and carry out the coefficient conversion by described CMP module, and the coefficient conversion that described CVP module is carried out the segmentation of A is imported in the segmentation of A, and the result of calculation of CMP module and CVP module all is input to the PWM module, realizes B InCalculate process with A segmentation product
Figure GDA00003008658000043
Individual XOR gate adds up, and the result is kept at
Figure GDA00003008658000044
In the individual latch, by
Figure GDA00003008658000045
Latch output result
Figure GDA00003008658000046
Wherein, A is by three polynomial expression F (x)=1+x n+ x m, be expressed as A=a 0+ a 1X+...+a M-1x M-1, total m coefficient, i.e. (a 0, a 1..., a M-1).Use the segmentation patterning method, the A of m position is cut into
Figure GDA00003008658000049
Every section d position, always total k 2Therefore individual segmentation has
Figure GDA00003008658000047
B can be expressed as B=b at the bottom of by double-basis 0β 0+ b 1β 1+ ...+b M-1β M-1, as another input of multiplier; C is the output result.
Preferred implementation of the present invention is: described FRRP module comprises FR module, R2 module, and described R2 module realizes Cmod (x m+ 1) calculating, the input of described FR module are the result of calculation of k series connection PE module, and the result is reduced, and output to the R2 module.
The input of CMP module and CVP module is respectively B InWith
Figure GDA00003008658000051
Its output result is as the input of PWM module, and the output of PWM module is passed through Individual XOR gate and
Figure GDA00003008658000053
Individual latch, the output result
Figure GDA00003008658000054
The input of R1 module is B In, its output is through m latch, and output is B as a result OutThe input of CMP module is Bx Dk (i+1)+jd, output is [B (p+q), B ( P+q+1)..., B (p+q+d-1)], the input of CVP module is A Ik+j, output be [a q, a Q+1..., a Q+d-1] T, wherein
Figure GDA00003008658000055
Expression
Figure GDA000030086580000510
Be arranged in line number and the columns of matrix, i, j=0,1 ..., k-1, the i of i representing matrix is capable, the j row of j representing matrix, p represents dk (i+1)+jd, and q represents (ik+j) d, and T represents [a q, a Q+1..., a Q+d-1] transpose of a matrix.The result of its output result and a last FRRP module adds up, and outputs to next FRRP module.
The structure of having showed multiplication at the bottom of the whole double-basis at the bottom of Fig. 1 systolic arrays double-basis in the multiplier architecture, A, B, C be three at GF (2 m) in element, by undecomposable three polynomial expression F (x)=1+x n+ x mForm, wherein, n≤m/2.Elements A is represented that by the polynomial basis radix notation B and C represent that with the double-basis radix notation whole multiplier is realized C=ABmodF (x) function, and wherein A, B are as input, and C is the output result.A is by three polynomial expression F (x)=1+x n+ x m, be expressed as A=a 0+ a 1X+...+a M-1x M-1, total m coefficient, i.e. (a 0, a 1..., a M-1).Use the segmentation patterning method, the A of m position is cut into Every section d position, always total k 2Therefore individual segmentation has Each segmentation Ai can be expressed as A i=a Id+ a Id+1X+ ... + a Id+d-1x D-1, all segmentations
Figure GDA00003008658000058
Replace A as the input of whole multiplier.B can be expressed as B=b at the bottom of by double-basis 0β 0+ b 1β 1+ ...+b M-1β M-1, as another input of multiplier.C is calculated by C=ABmodF (x) for the output result, i.e. the function of whole multiplier realization.
Because A is divided into So A can be expressed as A = A 0 + A 1 x d + . . . + A k 2 - 1 x ( k 2 - 1 ) d . Therefore A among the C=ABmodF (x) is launched and can obtain:
Wherein C = AB mod F ( x ) = B ( A 0 + A 1 x d + · · · + A k 2 - 1 x ( k 2 - 1 ) d ) mod F ( x ) = ( B ( A 0 + A 1 x d + · · · + A k - 1 x ( k - 1 ) d ) + Bx dk ( A k + A k + 1 x d + · · · + A 2 k - 1 x ( k - 1 ) d ) + · · · + Bx dk ( k - 1 ) ( A k ( k - 1 ) + A k ( k - 1 ) + 1 x d + · · · + A k 2 - 1 x ( k - 1 ) d ) ) mod F ( x ) = ( C 0 + C 1 + · · · + C k - 1 ) mod F ( x ) C 0 = B ( A 0 + A 1 x d + · · · + A k - 1 x ( k - 1 ) d ) C 1 = Bx dk ( A k + A k + 1 x d + · · · + A 2 k - 1 x ( k - 1 ) d ) · · · C k - 1 = Bx dk ( k - 1 ) ( A k ( k - 1 ) + A k ( k - 1 + 1 ) x d + · · · + A k 2 - 1 x ( k - 1 ) d )
In the whole multiplier architecture of Fig. 1, that the 1st row calculates is C 0=B (A 0+ A 1x d+ ... + A K-1x (k-1) d), its 1st processing unit PE 0,0Calculate BA 0Result of product, the 2nd processing unit PE 0,1Calculate BA 1x dResult of product, by that analogy, k processing unit PE 0, k-1Calculate BA K-1x (k-1) dResult of product.Whole k processing unit result of calculation adds up and finally obtains C 0, be input to the 1st FRRP (Final Reconstruction-Reduction-Polynomial) module.That similarly, the 2nd of whole multiplier architecture the row calculates is C 1=Bx Dk(A k+ A K+1x d+ ... + A 2k-1x (k-1) d), the R3 modular of increase calculates Bx DkModF (x), its input is B.Its 1st processing unit PE 1,0Calculate Bx DxA 0Result of product, follow-up similar with the 1st row, calculate gained C as a result 1, be input to the 2nd FRRP module, adding up with the 1st FRRP module obtains (C 0+ C 1) modF (x).Similar calculating is carried out in every provisional capital of whole multiplier, and to calculate k capable always, and the output result of its R3 module is Bx Dk (k-1)ModF (x), k FRRP module is input as C K-1, be output as (C 0+ C 1+ ... + C K-1) modF (x), be whole multiplier operation result C=(C 0+ C 1+ ... + C K-1) modF (x).
Each processing unit PEi, the detailed circuit of j are used for calculating Bx as shown in Figure 2 Dk (i+1)+jdA Ik+jResult of product.A In, B InWith
Figure GDA00003008658000062
As input, B OutWith As output.The 1st processing unit PE to every row I, 0, its A InThat import is A Ik, B InBe the output by i+1 R3 module, be Bx Dk (i+1)ModF (x), and
Figure GDA00003008658000064
Be initialized as 0.B OutAs the output of R1, also be the 2nd processing unit PE I, 1Input, the result of output is Bx Dk (i+1)+dModF (x).
Figure GDA00003008658000071
What export is
Figure GDA00003008658000072
The result, namely calculate Bx Dk (i+1)A IkResult of product.The 2nd processing unit PE of every row I, 1, its A InThat import is A Ik+1, B InThat import is Bx Dk (i+1)+dModF (x),
Figure GDA00003008658000073
That import is the 1st processing unit PE I, 0Result of calculation is Bx Dk (i+1)A Ik, as the 3rd processing unit PE I, 1Input
Figure GDA00003008658000074
B OutThat export is Bx Dk (i+1)+2dModF (x) result of calculation is as the 3rd processing unit PE I, 1Input B In,
Figure GDA00003008658000075
That export is Bx Dk (i+1)+dA Ik+1Result of product.By that analogy, j+1 processing unit PE of every row I, jThat calculate is Bx Dk (i+1)+jdA Ik+jResult of product, its A InThat import is A Ik+j, B InThat import is Bx Dk (i+1)+jdModF (x),
Figure GDA00003008658000076
What import is j module
Figure GDA00003008658000077
The output result is Bx Dk (i+1)+(j-1) dA Ik+ (j-1), B OutThat export is Bx Dk (i+1)+(j+1) dModF (x) result of calculation,
Figure GDA00003008658000078
That export is Bx Dk (i+1)+jdA Ik+jResult of product.
With Bx Dk (i+1)+jdAnd A Ik+jLaunch respectively, i.e. Bx Dk (i+1)+jd=(b 0β 0+ b 1β 1+ ... + b M-1β M-1) x Dk (i+1)+jd, A Ik+j=a (ik+j) d+ a (ik+j) d+1X+ ... + a (ik+j) d+d-1x D-1 ,According to multiplication rule at the bottom of the double-basis, then can obtain:
Bx dk(i+1)+jdA ik+j
=(b 0β 0+b 1β 1+…+b m-1β m-1)x dk(i+1)+jdA ik+j
=(b 0 (p)β 0+b 1 (p)β 1+…b m-1 (p)β m-1)A ik+j
=(a (ik+j)d+a (ik+j)d+1x+…+a (ik+j)d+d-1x d-1)B (p)
=a qB (p)+a q+1xB (p)+…+a q+d-1x d-1B (p)
=a qB (p+q)+a q+1B( p+q+1)+…+a q+d-1B (p+q+d-1)
=[B (p+q),B (p+q+1),...,B (p+q+d-1)][a q,a q+1,...,a q+d-1] T
p=dk(i+1)+jd
Wherein, q=(ik+j) d
B (p)=b 0 (p)β 0+b 1 (p)β 1+…+b m-1 (p)β m-1
Fig. 3 processing unit PE I, jDetailed circuit in, the input of CMP module is Bx Dk (i+1)+jd, output is [B (p+q), B (p+q+1)..., B (p+q+d-1)], the input of CVP module is A Ik+j, output be [a q, a Q+1..., a Q+d-1] T, the PWM module is used for calculating [B (p+q), B (p+q+1)..., B (p+q+d-1)] [a q, a Q+1..., a Q+d-1] TResult of product, again with
Figure GDA00003008658000079
Addition, the result is input among the working storage L, exports from working storage L again The input of R1 module is B In, realize x dB InModF (x) computing, the result is saved among the working storage L, again from working storage L as B OutOutput.
Calculating [B (p+q), B (p+q+1)..., B (p+q+d-1)] [a q, a Q+1..., a Q+d-1] T, owing to be Toeplitz matrix-vector product, be divided into t 1 t 2 t 0 t 1 v 0 v 1 , ( t 1 t 2 t 0 t 1 Expression is with Toeplitz matrix [B (p+q), B (p+q+1)..., B (p+q+d-1)] be divided into four, wherein two is the same t that is 1, two is t in addition 0And t 2, v 0 v 1 With vector [a q, a Q+1..., a Q+d-1] TBe divided into two sections, T representing matrix transposition wherein can obtain
= [ B ( p + q ) , B ( p + q + 1 ) , . . . , B ( p + q + d - 1 ) ] [ a q , a q + 1 , . . . , a q + d - 1 ] T
= t 1 t 2 t 0 t 1 v 0 v 1 = t 1 ( v 0 + v 1 ) + v 1 ( t 2 + t 1 ) t 1 ( v 0 + v 1 ) + v 0 ( t 0 + t 1 )
= c 0 c 1
Fig. 4 has shown the CMP of processing unit PE, CVP and PWM physical circuit.The input of CMP module is (t 0, t 1, t 2), through XOR gate XOR_1 and XOR_2, input (t 0+ t 1, t 1, t 1+ t 2); That the CVP module is imported is (v 0, v 1), through XOR gate XOR_3, input (v 0, v 0+ v 1, v 1); The PWM module is that the result with the output of CMP module and CVP module carries out point-to-point multiplying each other, through 3 with door AND_1, AND_2 and AND_3, output (v 0(t 0+ t 1), t 1(v 0+ v 1), v 1(t 2+ t 1)); The FR recovery module is utilized 2 XOR gate XOR_4 and XOR_5, calculates c 0=t 1(v 0+ v 1)+v 1(t 2+ t 1) and c 1=t 1(v 0+ v 1)+v 0(t 0+ t 1), output (c 0, c 1).
Fig. 2 has provided the multidigit unit series connection pulsation multiplier architecture that the present invention proposes, and is the structure that Fig. 1 provides to be folded obtain.Used k among Fig. 1 2Individual arithmetic element PE, and the 26S Proteasome Structure and Function of every capable k arithmetic element PE is the same, so can substitute remaining k arithmetic element PE with k arithmetic element PE of the 1st row, needs k cycle like this.The input of the 1st cycle A is (A 0, A 1..., A K-1), B directly imports, and result of calculation is input among the working storage C through the FRRP recovery module; Input (the A of the 2nd cycle A k, A K+1..., A 2k-1), B is through the input of R3 module, and result of calculation is also passed through the FRRP recovery module, with the result of calculation addition in the 1st cycle, is kept among the working storage C; So, know k cycle, the input of A is
Figure GDA00003008658000088
B imports through after (k-1) inferior R3 module, and result of calculation, is saved among the working storage C with front (k-1) inferior accumulation result addition through the FRRP recovery module, by working storage C output result, is C=ABmodF (x) again.
Above content be in conjunction with concrete preferred implementation to further describing that the present invention does, can not assert that concrete enforcement of the present invention is confined to these explanations.For the general technical staff of the technical field of the invention, without departing from the inventive concept of the premise, can also make some simple deduction or replace, all should be considered as belonging to protection scope of the present invention.

Claims (6)

1. scale-of-two Galois field multiplier at the bottom of the quick computing multidigit unit series connection pulsation double-basis is characterized in that, comprises input end B, kIndividual PE module, FRRP module, R3 module, described kIndividual PE module series connection, described kIndividual PE module warp kThe individual cycle, the 1st cycle AInput be
Figure 893260DEST_PATH_IMAGE001
, B directly imports, and result of calculation is input to working storage through described FRRP module reduction CIn; The 2nd cycle AInput , BThrough described R3 module input, also through the reduction of FRRP module, the result of calculation addition with the 1st cycle is kept at working storage to result of calculation CIn; So, kThe individual cycle, AInput be
Figure 199531DEST_PATH_IMAGE003
, BThrough ( k-1) import after the inferior described R3 module, result of calculation is through the reduction of described FRRP module, with described ( k-1) inferior accumulation result addition is saved in working storage CIn, again by working storage CThe output result, described R3 module realizes
Figure 275107DEST_PATH_IMAGE004
Calculating, described PE module comprise R1 module, CMP module, CVP module, PWM module, Individual XOR gate and Individual latch, described R3 module output to described R1 module and carry out the coefficient conversion by described CMP module, and the coefficient conversion that described CVP module is carried out the segmentation of A is imported in the segmentation of A, and the result of calculation of CMP module and CVP module all is input to the PWM module, realizes
Figure 870277DEST_PATH_IMAGE007
With AThe segmentation product calculates, process
Figure 730917DEST_PATH_IMAGE005
Individual XOR gate adds up, and the result is kept at
Figure 476894DEST_PATH_IMAGE005
In the individual latch, by
Figure 422984DEST_PATH_IMAGE005
Latch output result
Figure 414074DEST_PATH_IMAGE008
Wherein, ABy three polynomial expressions
Figure 690072DEST_PATH_IMAGE009
, be expressed as
Figure 108415DEST_PATH_IMAGE010
, total mIndividual coefficient, namely
Figure 338539DEST_PATH_IMAGE011
,
Use the segmentation patterning method, will mThe position ACut into
Figure 133320DEST_PATH_IMAGE012
, every section dThe position, total total k 2 Therefore individual segmentation has
Figure 201508DEST_PATH_IMAGE013
BBy being expressed as at the bottom of the double-basis
Figure 728435DEST_PATH_IMAGE014
, as another input of multiplier; CBe the output result.
2. according to scale-of-two Galois field multiplier at the bottom of the first series connection pulsation of the described quick computing multidigit of claim 1 double-basis, it is characterized in that described FRRP module comprises FR module, R2 module, described R2 module realizes
Figure 773752DEST_PATH_IMAGE015
Calculating, the input of described FR module is the result of calculation of k series connection PE module, and the result is reduced, and outputs to the R2 module.
3. according to scale-of-two Galois field multiplier at the bottom of the first series connection pulsation of the described quick computing multidigit of claim 1 double-basis, it is characterized in that described CMP module comprises XOR gate XOR_1 and XOR_2, described XOR gate XOR_1 and XOR_2 parallel connection.
4. according to scale-of-two Galois field multiplier at the bottom of the first series connection pulsation of the described quick computing multidigit of claim 1 double-basis, it is characterized in that described CVP module is XOR gate XOR_3.
5. according to scale-of-two Galois field multiplier at the bottom of the first series connection pulsation of the described quick computing multidigit of claim 1 double-basis, it is characterized in that, described PWM module comprise three parallel connections with door AND_1, AND_2 and AND_3, the result of described CMP module and the output of described CVP module is carried out point-to-point multiplying each other.
6. according to scale-of-two Galois field multiplier at the bottom of the first series connection pulsation of the described quick computing multidigit of claim 1 double-basis, it is characterized in that described FR module comprises XOR gate XOR_4 and the XOR_5 of two parallel connections.
CN201310115401.7A 2013-04-03 2013-04-03 Binary system Galois field multiplier at the bottom of rapid computations many bits series connection pulsation double-basis Expired - Fee Related CN103186360B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310115401.7A CN103186360B (en) 2013-04-03 2013-04-03 Binary system Galois field multiplier at the bottom of rapid computations many bits series connection pulsation double-basis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310115401.7A CN103186360B (en) 2013-04-03 2013-04-03 Binary system Galois field multiplier at the bottom of rapid computations many bits series connection pulsation double-basis

Publications (2)

Publication Number Publication Date
CN103186360A true CN103186360A (en) 2013-07-03
CN103186360B CN103186360B (en) 2016-08-03

Family

ID=48677539

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310115401.7A Expired - Fee Related CN103186360B (en) 2013-04-03 2013-04-03 Binary system Galois field multiplier at the bottom of rapid computations many bits series connection pulsation double-basis

Country Status (1)

Country Link
CN (1) CN103186360B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104252332A (en) * 2014-08-20 2014-12-31 哈尔滨工业大学深圳研究生院 Multiplier and multiplier processing element for ellipse cipher apparatus

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW527561B (en) * 2001-11-02 2003-04-11 Chiou-Ying Lee Low-complexity bit-parallel systolic multiplier over GF (2m)
TW200710716A (en) * 2006-11-24 2007-03-16 Univ Lunghwa Sci & Technology Low-complexity finite field GF(2m) bit-parallel systolic array dual-basis multiplier
CN102073477A (en) * 2010-11-29 2011-05-25 北京航空航天大学 Implementation method of finite field multiplying unit with functions of detecting, correcting and locating error
CN102929574A (en) * 2012-10-18 2013-02-13 复旦大学 Pulse multiplying unit design method on GF (Generator Field) (2163) domain

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW527561B (en) * 2001-11-02 2003-04-11 Chiou-Ying Lee Low-complexity bit-parallel systolic multiplier over GF (2m)
TW200710716A (en) * 2006-11-24 2007-03-16 Univ Lunghwa Sci & Technology Low-complexity finite field GF(2m) bit-parallel systolic array dual-basis multiplier
CN102073477A (en) * 2010-11-29 2011-05-25 北京航空航天大学 Implementation method of finite field multiplying unit with functions of detecting, correcting and locating error
CN102929574A (en) * 2012-10-18 2013-02-13 复旦大学 Pulse multiplying unit design method on GF (Generator Field) (2163) domain

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
CHIOU-YNG LEE: "《Low-Complexity Bit-Parallel Sysolic Montgomery Multipliers for Special Classes of GF(2/sup m)》", 《IEEE TRANSACTION ON COMPUTERS》, vol. 54, no. 9, 25 July 2005 (2005-07-25), pages 1061 - 1070 *
HAINING FAN ET AL.: "Subquadratic Computational Complexity Schemes for Extended Binary Field Multiplication Using Optimal Normal Bases", 《IEEE TRANSACTION ON COMPUTERS》, vol. 56, no. 10, 25 October 2007 (2007-10-25), pages 1435 - 1437, XP011191962, DOI: doi:10.1109/TC.2007.1076 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104252332A (en) * 2014-08-20 2014-12-31 哈尔滨工业大学深圳研究生院 Multiplier and multiplier processing element for ellipse cipher apparatus
CN104252332B (en) * 2014-08-20 2018-09-18 哈尔滨工业大学深圳研究生院 A kind of multiplier processing unit and multiplier for elliptic curves cryptosystem device

Also Published As

Publication number Publication date
CN103186360B (en) 2016-08-03

Similar Documents

Publication Publication Date Title
Kim et al. A digit-serial multiplier for finite field GF (2/sup m/)
Lee Low complexity bit-parallel systolic multiplier over GF (2m) using irreducible trinomials
Namin et al. A word-level finite field multiplier using normal basis
CN103793199A (en) Rapid RSA cryptography coprocessor capable of supporting dual domains
Choi et al. Efficient systolic modular multiplier/squarer for fast exponentiation over GF (2m)
Meher et al. Low-Latency, Low-Area, and Scalable Systolic-Like Modular Multipliers for $ GF (2^{m}) $ Based on Irreducible All-One Polynomials
Xie et al. High-throughput finite field multipliers using redundant basis for FPGA and ASIC implementations
Chen et al. Scalable and systolic dual basis multiplier over GF (2m)
Talapatra et al. Unified digit serial systolic Montgomery multiplication architecture for special classes of polynomials over GF (2m)
Nayeem et al. Efficient reversible Montgomery multiplier and its application to hardware cryptography
Reyhani-Masoleh A new bit-serial architecture for field multiplication using polynomial bases
CN205721742U (en) It is applicable to mould and removes new architecture and the non-interwoven one-dimensional pulsation framework of algorithm
CN103186360A (en) Fast arithmetic multi-bit serial pulse dual-base binary finite field multiplier
Xie et al. Novel bit-parallel multiplier for GF (2m) defined by all-one polynomial using generalized Karatsuba algorithm
Pawale et al. High speed Vedic multiplier design and implementation on FPGA
Feng et al. A high-speed and spa-resistant implementation of ecc point multiplication over gf (p)
Tsai et al. Two systolic architectures for multiplication in GF (2m)
CN102929574A (en) Pulse multiplying unit design method on GF (Generator Field) (2163) domain
Kadu et al. Hardware implementation of efficient elliptic curve scalar multiplication using vedic multiplier
Mozhi et al. Efficient bit-parallel systolic multiplier over GF (2 m)
Meher Systolic formulation for low-complexity serial-parallel implementation of unified finite field multiplication over GF (2 m)
Tujillo-Olaya et al. Hardware architectures for elliptic curve cryptoprocessors using polynomial and Gaussian normal basis over GF (2 233)
Sargunam et al. High speed bit-parallel systolic multiplier over GF (2 m) for cryptographic application
Selma et al. A survey of GF (2 m) multipliers on FPGA
Fournaris et al. Low area elliptic curve arithmetic unit

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160803

Termination date: 20180403