CN108052307A

CN108052307A - The advanced operation method and system of processor floating point unit leading zero quantity

Info

Publication number: CN108052307A
Application number: CN201711207258.9A
Authority: CN
Inventors: 杨雪; 庄伟�; 于立新; 彭和平; 侯国伟; 张梅梅
Original assignee: Beijing Microelectronic Technology Institute; Mxtronics Corp
Current assignee: Beijing Microelectronic Technology Institute; Mxtronics Corp
Priority date: 2017-11-27
Filing date: 2017-11-27
Publication date: 2018-05-18
Anticipated expiration: 2037-11-27
Also published as: CN108052307B

Abstract

The invention discloses a kind of advanced operation methods and system of processor floating point unit leading zero quantity.Wherein, this method includes decoding operation, obtains the leading zero number of every 8 data：It is 8n data A [8n 1 by data bit:0] 8 one group is divided into according to the order from a high position to low level, translates the number B of leading zero in n 8 data by n 84 decoders respectively_m[3:0]；Wherein, B_mRepresent the leading zero number of 8 data of m groups, m=1~n, n=1~8；Data A [8n 1 are obtained by the advanced computing and logic judgment of every level-one in three-level:0] leading zero number per can two-by-two be divided input data in level-one pair, carries out computing parallel between each pair；Wherein, when n is odd number, last is to only there are one input datas.It is time-consuming longer that the present invention solves the problems, such as that multi-group data adds up, and has achieved the effect that quickly to provide leading zero quantity.

Description

The advanced operation method and system of processor floating point unit leading zero quantity

Technical field

The invention belongs to floating-point operation field more particularly to a kind of advanced computings of processor floating point unit leading zero quantity Method and system.

Background technology

Leading zero refers to scan 0 occurred between until first 1 since the highest order of binary data Number.Determine leading zero number effect be to maintain character length it is consistent, facilitate character string sorting.Leading zero prediction passes through The leading zero of mantissa's subtraction result is predicted to reduce the normalization shift computing required time, it is necessary to use leading zero count Component.

China Patent Publication No. CN 102664637A, publication date are September in 2012 12, entitled " to determine binary number According to the method and device of leading zero number " in disclose, the leading zero computing for 32 data, it is a kind of it is more common, Easy method is it to be divided into 4 groups by a high position to low level, 8 digits are one group, and every group of data are compiled by an encoder The output of code device then represents the number of the leading zero of each 8 data, and next judging the output of this four encoders successively is The no number for being 8, leading zero then being drawn by accumulating operation.Accumulating operation is the side more generally used in leading zero computing Method, shortcoming are that multiple data accumulations need to consume longer time.

The content of the invention

Present invention solves the technical problem that it is：It is leading to overcome the deficiencies of the prior art and provide a kind of processor floating point unit It is time-consuming longer to solve the problems, such as that multi-group data adds up, has reached before quickly providing for the advanced operation method and system of zero quantity Lead the effect of zero quantity.

The object of the invention is achieved by the following technical programs：According to an aspect of the invention, there is provided a kind of place The advanced operation method of device floating point unit leading zero quantity is managed, the described method includes the following steps：Step 100：Decoding operation obtains To the leading zero number of every 8 data：By the data A [8n-1 that data bit is 8n:0] according to the order from a high position to low level according to It is secondary to be divided into 8 one group, the number B of leading zero in n 8 data is translated by n 8-4 decoder respectively_m[3:0]；Wherein, B_m Represent the leading zero number of 8 data of m groups, m=1~n, n=1~8；Step 200：Pass through the advanced of every level-one in three-level Computing and logic judgment obtain data A [8n-1:0] leading zero number, per can two-by-two be divided input data in level-one pair, Carry out computing parallel between each pair；Wherein, when n is odd number, last is to only there are one input datas.

In the advanced operation method of above-mentioned processor floating point unit leading zero quantity, step 200 comprises the following steps：Step 210：First order input is the result B of decoding operation_m[3:0], export as the leading zero number B of every 16 or 8 data_jk[4: 0], logic judgment is according to the highest order B for being each pair input data_m[3], result and/or input number from advanced computing are exported According to low three B_m[2:0]；Wherein, k=j+1, j=1,3,5,7 and j≤n；Step 220：Second level input exports for the first order Result B_jk[4:0], export as the leading zero number B of every 32,16 or 8 data_jkrs[5:0], logic judgment foundation is The highest order B of each pair input data_jk[4], low four B of result and/or input data from advanced computing are exported_jk[3: 0]；Wherein, k=j+1, r=j+2, s=j+3, j=1,5 and j≤n；Step 230：The knot that third level input exports for the second level Fruit B_jkrs[5:0], export as the leading zero number B of 8n data；Logic judgment is according to the highest order B for being each pair input data_jkrs [5], low four B of result and/or input data from advanced computing are exported_jkrs[4:0]；Wherein, n=1~8.

In the advanced operation method of above-mentioned processor floating point unit leading zero quantity, the decoding principle of 8-4 decoders is, when When input data is 8 ' b00000000,4 ' b1000 of output data；When input data is 8 ' b00000001, output data For 4 ' b0111；When input data is 8 ' b0000001x (x=0 or 1), 4 ' b0110 of output data；When input data is 8 ' During b000001xx, 4 ' b0101 of output data；When input data is 8 ' b00001xxx, 4 ' b0100 of output data；When When input data is 8 ' b0001xxxx, 4 ' b0011 of output data；When input data is 8 ' b001xxxxx, output data For 4 ' b0010；When input data is 8 ' b01xxxxxx, 4 ' b0001 of output data；When input data is 8 ' b1xxxxxxx When, 4 ' b0000 of output data.

In the advanced operation method of above-mentioned processor floating point unit leading zero quantity, data B_m[3:0] operational formula is：

B_m[3]=~(A [7] | A [6] | A [5] | A [4] | A [3] | A [2] | A [1] | A [0])；

B_m[2]=~(A [7] | A [6] | A [5] | A [4] | A [3] | A [2] | A [1] |~A [0]) ＆ (A [3] | A [2] |~A [1]) ＆ (A [3] |~A [2]) ＆ (~A [3])；

B_m[1]=~(A [7] | A [6] | A [5] | A [4] | A [3] | A [2] | ((A [1] |~A [0]) ＆~A [1])) ＆ (A [5] |~A [4]) ＆ (~A [5])；

B_m[0]=~(A [7] | (and A [6] | A [5] | (A [4] | A [3] | (A [2] | A [1] |~A [0]) ＆~A [2]) ＆~A [4]) ＆~A [6])；

Wherein, A [p] be data A pth+1, B_m[q] (q=0~3) are data B_mQ+1；Wherein, p=0~ 7。

In the advanced operation method of above-mentioned processor floating point unit leading zero quantity, in every level-one logic judgment, to data Divide two-by-two to method be：As n=1, B₁As the leading zero quantity of computing, computing terminate；When 1<During n≤8, by B_mBy height Position to low level is divided into a pair two-by-two, carries out first order computing parallel between each pair, when n for odd number when last to less than two numbers According to then retaining B_nIt is not computing, B_nValue be transferred directly to next stage, high position zero padding during digit deficiency；Or as n=2, B₁With B₂Operation result B₁₂As the leading zero quantity of computing, computing terminate；When 2<It, will into the second level logic judgment during n≤8 The output result B of first order logic judgment_jk(wherein k=j+1；J=1,3,5,7 and j≤n) big-endian is divided into one two-by-two It is right, carry out second level logic judgment between each pair parallel, if last retains a data and do not transport to less than two data Calculation is directly passed to next stage, high position zero padding, wherein k=j+1 during digit deficiency；J=1,3,5,7 and j≤n；Or when 2<n≤4 When, B₁₂₃₄As the leading zero quantity of computing, computing terminate；When 4<During n≤8, into third level logic judgment, the second level is patrolled It collects two operation results judged and carries out computing.

In the advanced operation method of above-mentioned processor floating point unit leading zero quantity, include per the advanced computing of level-one： In the advanced computing of level-one, if high-order numerical value B_j<8, then this is B to data leading zero number_jk=5 ' b { 00, B_j[2:0]}；It is if high Bit value B_j=8, and low level numerical value B_k<8, then this is B to data leading zero number_jk=5 ' b { 01, B_k[2:0]}；If seniority top digit Value B_j=8, and low level numerical value B_k=8, then this is B to data leading zero number_jk=5 ' b { 10,000 }；When n is odd number, most It is latter to there was only B_n, only need to be in high-order zero padding, that is, exportable B_jk=5 ' b { 0, B_n[3:0]}；Wherein, k=j+1, j=1,3,5,7 And j≤n；In the advanced computing in the second level, if high-order numerical value B_jk<16, then this is B to data leading zero number_jkrs=6 ' b 00, B_jk[3:0]}；If high-order numerical value B_jk=16, and low level numerical value B_rs<16, then this is B to data leading zero number_jkrs=6 ' b {01,B_rs[3:0]}；If seniority top digit B_jk=16, and low level numerical value B_rs=16, then this is B to data leading zero number_jkrs=6 ' b {100,000}；When last is to there was only a data B_jkWhen, it only need to be in high-order zero padding, that is, exportable B_jkrs=6 ' b { 0, B_jk[4: 0]}；Wherein, k=j+1, r=j+2, s=j+3, j=1,5 and j≤n；In the advanced computing of the third level, if high-order numerical value B₁₂₃₄< 32, then this is B=7 ' b { 00, B to data leading zero number₁₂₃₄[4:0]}；If high-order numerical value B₁₂₃₄=32, and low level numerical value B₅₆₇₈<32, then this is B=7 ' b { 01, B to data leading zero number₅₆₇₈[4:0]}；If seniority top digit B₁₂₃₄=32, and low level numerical value B₅₆₇₈=32, then this is B=7 ' b { 1,000,000 } to data leading zero number.

In the advanced operation method of above-mentioned processor floating point unit leading zero quantity, include per level-one logic judgment： In level-one logic judgment, if B_jHighest order B_j[3]=0, then B is exported_jk=5 ' b { 00, B_j[2:0]}；If B_j[3]=1, B_k [3]=0, then B is exported_jk=5 ' b { 01, B_k[2:0]}；If B_jAnd B [3]=1,_k[3]=1, then B is exported_jk=5 ' b { 10,000 }； When n is odd number, last is to there was only B_n, export B_jk=5 ' b { 0, B_n[3:0]}；Wherein, k=j+1, j=1,3,5,7 and j≤ n；In the logic judgment of the second level, if B_jk[4]=0, then B is exported_jkrs=6 ' b { 00, B_jk[3:0]}；If B_jk[4]=1, B_rs [4]=0, then B is exported_jkrs=6 ' b { 01, B_rs[3:0]}；If B_jkAnd B [4]=1,_rs[4]=1, then B is exported_jkrs=6 ' b 100, 000}；When last is to there was only a data B_jkWhen, export B_jkrs=6 ' b { 0, B_jk[4:0]}；Wherein, k=j+1, r=j+2, s =j+3, j=1,5 and j≤n；In third level logic judgment, if B₁₂₃₄[5]=0, then B=7 ' b { 00, B are exported₁₂₃₄[4: 0]}；If B₁₂₃₄[5]=1 B₅₆₇₈[5]=0, then B=7 ' b { 01, B are exported₅₆₇₈[4:0]}；If B₁₂₃₄And B [5]=1₅₆₇₈[5]= 1, then export B=7 ' b { 1,000,000 }.When only existing B₁₂₃₄, without B₅₆₇₈When, leading zero number B=B₁₂₃₄。

According to another aspect of the present invention, a kind of advanced computing system of processor floating point unit leading zero quantity is additionally provided System, including：First module obtains the leading zero number of every 8 data for decoding operation：By the data A that data bit is 8n [8n-1:0] 8 one group is divided into according to the order from a high position to low level, translates n 8 by n 8-4 decoder respectively The number B of leading zero in data_m[3:0]；Wherein, B_mRepresent the leading zero number of 8 data of m groups, m=1~n, n=1~8； Second module obtains data A [8n-1 for passing through the advanced computing of every level-one in three-level and logic judgment:0] leading zero Number per can two-by-two be divided input data in level-one pair, carries out computing parallel between each pair；Wherein, when n is odd number, most It is latter to only there are one input data.

In the advanced computing method, system of above-mentioned processor floating point unit leading zero quantity, the decoding principle of 8-4 decoders is, When input data is 8 ' b00000000,4 ' b1000 of output data；When input data is 8 ' b00000001, number is exported According to for 4 ' b0111；When input data is 8 ' b0000001x, 4 ' b0110 of output data；When input data is 8 ' During b000001xx, 4 ' b0101 of output data；When input data is 8 ' b00001xxx, 4 ' b0100 of output data；When When input data is 8 ' b0001xxxx, 4 ' b0011 of output data；When input data is 8 ' b001xxxxx, output data For 4 ' b0010；When input data is 8 ' b01xxxxxx, 4 ' b0001 of output data；When input data is 8 ' b1xxxxxxx When, 4 ' b0000 of output data；Wherein, x=0 or 1.

In the advanced computing method, system of above-mentioned processor floating point unit leading zero quantity, data B_m[3:0] operational formula For：

B_m[3]=~(A [7] | A [6] | A [5] | A [4] | A [3] | A [2] | A [1] | A [0])；

The present invention has the advantages that compared with prior art：

(1) the 8-4 decoders that the present invention uses can directly provide the control bit of selector compared to common 8-3 decoders And carry-out bit, efficient arithmetic speed has been exchanged for the area cost of very little.

(2) it is a kind of supplementary means that the present invention reduces arithmetic speed to computing data to be divided two-by-two, can allow every level-one In data press to concurrent operation, 8n data point are log to the series of computing needs₂N, and the operation stage needed for computing one by one Number is n-1, and the operation time per level-one is essentially identical, so the bigger advantage of the data bit of computing is more apparent.

(3) present invention is using the advantage of advanced computing, when obtaining the decoding result of 8-4 decoders, input data Leading zero quantity just it has been determined that the result output that need to only coordinate control bit selection definite, without doing add operation again, is The present invention is obtained where the core of efficient arithmetic speed.

Description of the drawings

By reading the detailed description of hereafter preferred embodiment, it is various other the advantages of and benefit it is common for this field Technical staff will be apparent understanding.Attached drawing is only used for showing the purpose of preferred embodiment, and is not considered as to the present invention Limitation.And throughout the drawings, the same reference numbers will be used to refer to the same parts.In the accompanying drawings：

Fig. 1 is the functional block of the advanced operation method of processor floating point unit leading zero quantity provided in an embodiment of the present invention Figure.

Specific embodiment

The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although the disclosure is shown in attached drawing Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure without should be by embodiments set forth here It is limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the scope of the present disclosure Completely it is communicated to those skilled in the art.It should be noted that in the case where there is no conflict, embodiment in the present invention and Feature in embodiment can be mutually combined.The present invention will be described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.

Fig. 1 is the functional block of the advanced operation method of processor floating point unit leading zero quantity provided in an embodiment of the present invention Figure.As described in Figure 1, the advanced operation method of the processor floating point unit leading zero quantity comprises the following steps：

Step 100：Decoding operation obtains the leading zero number of every 8 data：It is 8n data A [8n-1 by data bit: 0] 8 one group is divided into according to the order from a high position to low level, is translated respectively by n 8-4 decoder in n 8 data The number B of leading zero_m[3:0]；Wherein, B_mRepresent the leading zero number of 8 data of m groups, m=1~n, n=1~8；

Step 200：Data A [8n-1 are obtained by the advanced computing and logic judgment of every level-one in three-level:0] leading Zero number per can two-by-two be divided input data in level-one pair, carries out computing parallel between each pair；Wherein, when n is odd number, Last is to only there are one input datas.

Step 200 comprises the following steps：

Step 210：First order input is the result B of decoding operation_m[3:0], export as the leading of every 16 or 8 data Zero number B_jk[4:0], logic judgment is according to the highest order B for being each pair input data_m[3], the result from advanced computing is exported And/or low three B of input data_m[2:0]；Wherein, k=j+1, j=1,3,5,7 and j≤n；

Step 220：The result B that second level input exports for the first order_jk[4:0], export as every 32,16 or 8 digits According to leading zero number B_jkrs[5:0], logic judgment is according to the highest order B for being each pair input data_jk[4], output is from advanced The result of computing and/or low four B of input data_jk[3:0]；Wherein, k=j+1, r=j+2, s=j+3, j=1,5 and j≤ n；

Step 230：The result B that third level input exports for the second level_jkrs[5:0], export as the leading zero of 8n data Number B；Logic judgment is according to the highest order B for being each pair input data_jkrs[5], the result from preceding computing and/or input are exported Low four B of data_jkrs[4:0]；Wherein, n=1~8.

In above-described embodiment, the decoding principle of 8-4 decoders is, when input data is 8 ' b00000000, output data For 4 ' b1000；When input data is 8 ' b00000001,4 ' b0111 of output data；When input data is 8 ' b0000001x When (x=0 or 1), 4 ' b0110 of output data；When input data is 8 ' b000001xx, 4 ' b0101 of output data；When When input data is 8 ' b00001xxx, 4 ' b0100 of output data；When input data is 8 ' b0001xxxx, output data For 4 ' b0011；When input data is 8 ' b001xxxxx, 4 ' b0010 of output data；When input data is 8 ' b01xxxxxx When, 4 ' b0001 of output data；When input data is 8 ' b1xxxxxxx, 4 ' b0000 of output data.Data B_m[3:0] Operational formula be：

B_m[3]=~(A [7] | A [6] | A [5] | A [4] | A [3] | A [2] | A [1] | A [0])；

Wherein, A [p] (p=0~7) be data A pth+1, B_m[q] (q=0~3) are data B_mQ+1.

In above-described embodiment, per level-one logic judgment in, data are divided two-by-two to method be：

As n=1, B₁As the leading zero quantity of computing, computing terminate；When 1<During n≤8, by B_mBig-endian two Two are divided into a pair, carry out first order computing parallel between each pair, when n be odd number when last to less than two data, then retain B_nIt is not computing, B_nValue be transferred directly to next stage, high position zero padding during digit deficiency；Or

As n=2, B₁And B₂Operation result B₁₂As the leading zero quantity of computing, computing terminate；When 2<During n≤8, Into second level logic judgment, by the output result B of first order logic judgment_jk(wherein k=j+1；J=1,3,5,7 and j≤n) Big-endian is divided into a pair two-by-two, carries out second level logic judgment parallel between each pair, if last is to less than two numbers According to, then retain a data and do not do computing and be directly passed to next stage, high position zero padding, wherein k=j+1 during digit deficiency；J=1, 3,5,7 and j≤n；Or

When 2<During n≤4, B₁₂₃₄As the leading zero quantity of computing, computing terminate；When 4<During n≤8, patrolled into the third level It collects and judges, computing is carried out to two operation results of second level logic judgment.

In above-described embodiment, include per the advanced computing of level-one：

In the advanced computing of the first order, if high-order numerical value B_j<8, then this is B to data leading zero number_jk=5 ' b { 00, B_j [2:0]}；If high-order numerical value B_j=8, and low level numerical value B_k<8, then this is B to data leading zero number_jk=5 ' b { 01, B_k[2: 0]}；If high-order numerical value B_j=8, and low level numerical value B_k=8, then this is B to data leading zero number_jk=5 ' b { 10,000 }；Work as n For odd number when, last is to there was only B_n, only need to be in high-order zero padding, that is, exportable B_jk=5 ' b { 0, B_n[3:0]}；Wherein, k=j+1, J=1,3,5,7 and j≤n；

In the advanced computing in the second level, if high-order numerical value B_jk<16, then this is B to data leading zero number_jkrs=6 ' b {00,B_jk[3:0]}；If high-order numerical value B_jk=16, and low level numerical value B_rs<16, then this is B to data leading zero number_jkrs=6 ' b{01,B_rs[3:0]}；If seniority top digit B_jk=16, and low level numerical value B_rs=16, then this is B to data leading zero number_jkrs=6 ' b{100,000}；When last is to there was only a data B_jkWhen, it only need to be in high-order zero padding, that is, exportable B_jkrs=6 ' b { 0, B_jk[4: 0]}；Wherein, k=j+1, r=j+2, s=j+3, j=1,5 and j≤n；

In the advanced computing of the third level, if high-order numerical value B₁₂₃₄<32, then this to data leading zero number for B=7 ' b 00, B₁₂₃₄[4:0]}；If high-order numerical value B₁₂₃₄=32, and low level numerical value B₅₆₇₈<32, then this is B=7 ' b to data leading zero number {01,B₅₆₇₈[4:0]}；If seniority top digit B₁₂₃₄=32, and low level numerical value B₅₆₇₈=32, then this is B=7 ' to data leading zero number b{1,000,000}。

In above-described embodiment, include per level-one logic judgment：

In first order logic judgment, if B_jHighest order B_j[3]=0, then B is exported_jk=5 ' b { 00, B_j[2:0]}；If B_j [3]=1, B_k[3]=0, then B is exported_jk=5 ' b { 01, B_k[2:0]}；If B_jAnd B [3]=1,_k[3]=1, then B is exported_jk=5 ' b{10,000}；When n is odd number, last is to there was only B_n, export B_jk=5 ' b { 0, B_n[3:0]}；Wherein, k=j+1, j=1, 3,5,7 and j≤n；

In the logic judgment of the second level, if B_jk[4]=0, then B is exported_jkrs=6 ' b { 00, B_jk[3:0]}；If B_jk[4]= 1, and B_rs[4]=0, then B is exported_jkrs=6 ' b { 01, B_rs[3:0]}；If B_jkAnd B [4]=1,_rs[4]=1, then B is exported_jkrs= 6’b{100,000}；When last is to there was only a data B_jkWhen, export B_jkrs=6 ' b { 0, B_jk[4:0]}；Wherein, k=j+1, R=j+2, s=j+3, j=1,5 and j≤n；

In third level logic judgment, if B₁₂₃₄[5]=0, then B=7 ' b { 00, B are exported₁₂₃₄[4:0]}；If B₁₂₃₄[5] =1 and B₅₆₇₈[5]=0, then B=7 ' b { 01, B are exported₅₆₇₈[4:0]}；If B₁₂₃₄And B [5]=1₅₆₇₈[5]=1, then B=is exported 7’b{1,000,000}.When only existing B₁₂₃₄, without B₅₆₇₈When, leading zero number B=B₁₂₃₄。

Below with 64 data instances, leading zero calculation that the present invention uses is first passes through 8-4 decoders and translates often The number of leading zero in 8, then using the highest order of decoding value as judgment condition, the low level of decoding value is as the election select it is defeated Go out value, the data of output valve also advanced computing simultaneously provide final result after judgement two-by-two step by step, are of leading zero Number.

Embodiment：

As shown in Figure 1, it is provided during 64 data of processing, it is necessary to use eight decoders of B1~B8 by three-level logic judgment As a result.By 64 data A [63:0] it is divided into 8 one group, respectively by eight 8-4 decoders, translates leading in eight 8 data Zero number B_m[3:0] (m=1~8).Decoding principle is as shown in the table：

It is as shown in the table after completing decoding, into first order logic judgment：If 8 data A [63 of highest:56] translate Data B₁Numerical value is less than 8, i.e. B₁[3]=0, then B₁₂=5 ' b { 00, B₁[2:0]}.If B₁[3]=1 B₂[3]=0, then B₁₂=5 ' b{01,B₂[2:0]}.If B₁And B [3]=1₂[3]=1, then B₁₂=5 ' b { 10,000 }.

In detection B₁While value, B is detected₃Whether numerical value is less than 8, if B₃[3]=0, then B₃₄=5 ' b { 00, B₃[2:0]}； If B₃[3]=1 B₄[3]=0, in computing B₁₂While computing B₃₄, B₃₄=5 ' b { 01, B₄[2:0]}；If B₃And B [3]=1₄ [3]=1, then in computing B₁₂While draw B₃₄=5 ' b { 10,000 }.B₅₆、B₇₈Logic judgment and the same B of output principle₁₂And B₃₄。

In the logic judgment of the second level, highest order input B is detected first₁₂Value, if B₁₂[4]=0, then B₁₂₃₄=6 ' b {00,B₁₂[3:0]}.If B₁₂[4]=1 B₃₄[4]=0, then B₁₂₃₄=6 ' b { 01, B₃₄[3:0]}.If B₁₂And B [4]=1₃₄[4] =1, then B₁₂₃₄=6 ' b { 100,000 }.B₅₆₇₈Logic judgment and the same B of output principle₁₂₃₄。

In third level logic judgment, highest order input B is detected first₁₂₃₄Value, if B₁₂₃₄[5]=0, then leading zero Number B=7 ' b { 00, B₁₂₃₄[4:0]}.If B₁₂₃₄[5]=1 B₅₆₇₈[5]=0, then number B=7 ' b { 01, B of leading zero₅₆₇₈ [4:0]}.If B₁₂₃₄And B [5]=1₅₆₇₈[5]=1, then B=7 ' b { 1,000,000 } i.e. 32.Computing terminates.

Above-mentioned logic discrimination condition can be clearly corresponding in the following table with exporting the relation of result.

In summary often during level-one logic judgment, corresponding input value or advanced is selected simply by control signal for step Operation values are as output, without doing other computings (such as add operation) again, so as to save operation time.It, will at different levels A pair of of concurrent operation, 64 data only need that by 3 grades of logic judgments leading zero quantity can be obtained, are less than two-by-two for data division The time loss by 7 grades of logic judgments is needed from a high position to low level sequential operation.

The 8-4 decoders that the present embodiment uses can directly provide the control bit of selector compared to common 8-3 decoders And carry-out bit, efficient arithmetic speed has been exchanged for the area cost of very little；It is the present embodiment drop to computing that data are divided two-by-two A kind of supplementary means of low arithmetic speed, can allowing the data in every level-one, 8n data point need computing by concurrent operation The series wanted is log₂N, and the computing series needed for computing is n-1 one by one, the operation time per level-one is essentially identical, so fortune The bigger advantage of data bit of calculation is more apparent；The present embodiment is using the advantage of advanced computing, is obtaining the decoding of 8-4 decoders When as a result, the leading zero quantity of input data just it has been determined that the result output that need to only coordinate control bit selection definite, without Add operation is done again, is the core place that the present invention obtains efficient arithmetic speed.

Device embodiment

The present embodiment additionally provides a kind of advanced arithmetic system of processor floating point unit leading zero quantity, including：First Module and the second module.Wherein.

First module obtains the leading zero number of every 8 data for decoding operation：It is 8n data A by data bit [8n-1:0] 8 one group is divided into according to the order from a high position to low level, translates n 8 by n 8-4 decoder respectively The number B of leading zero in data_m[3:0]；Wherein, B_mRepresent the leading zero number of 8 data of m groups, m=1~n, n=1~8；

Second module obtains data A [8n-1 for passing through the advanced computing of every level-one in three-level and logic judgment:0] Leading zero number, per can two-by-two be divided input data in level-one pair, carry out computing parallel between each pair；Wherein, n is strange During number, last is to only there are one input datas.

In above-described embodiment, the decoding principle of 8-4 decoders is, when input data is 8 ' b00000000, output data For 4 ' b1000；When input data is 8 ' b00000001,4 ' b0111 of output data；When input data is 8 ' b0000001x When, 4 ' b0110 of output data；When input data is 8 ' b000001xx, 4 ' b0101 of output data；When input data is During 8 ' b00001xxx, 4 ' b0100 of output data；When input data is 8 ' b0001xxxx, 4 ' b0011 of output data； When input data is 8 ' b001xxxxx, 4 ' b0010 of output data；When input data is 8 ' b01xxxxxx, number is exported According to for 4 ' b0001；When input data is 8 ' b1xxxxxxx, 4 ' b0000 of output data；Wherein, x=0 or 1.

In above-described embodiment, data B_m[3:0] operational formula is：

B_m[3]=~(A [7] | A [6] | A [5] | A [4] | A [3] | A [2] | A [1] | A [0])；

Embodiment described above is the present invention more preferably specific embodiment, and those skilled in the art is in this hair The usual variations and alternatives carried out in the range of bright technical solution should all include within the scope of the present invention.

Claims

1. a kind of advanced operation method of processor floating point unit leading zero quantity, which is characterized in that the described method includes following Step：

Step 100：Decoding operation obtains the leading zero number of every 8 data：By the data A [8n-1 that data bit is 8n:0] 8 one group is divided into according to the order from a high position to low level, before being translated respectively by n 8-4 decoder in n 8 data Lead zero number B_m[3:0]；Wherein, B_mRepresent the leading zero number of 8 data of m groups, m=1~n, n=1~8；

Step 200：Data A [8n-1 are obtained by the advanced computing and logic judgment of every level-one in three-level:0] leading zero Number per can two-by-two be divided input data in level-one pair, carries out computing parallel between each pair；Wherein, when n is odd number, finally It is a pair of that only there are one input datas.

2. the advanced operation method of processor floating point unit leading zero quantity according to claim 1, it is characterised in that：Step Rapid 200 comprise the following steps：

Step 210：First order input is the result B of decoding operation_m[3:0], export as the leading zero of every 16 or 8 data Number B_jk[4:0], logic judgment is according to the highest order B for being each pair input data_m[3], export result from advanced computing and/or Low three B of input data_m[2:0]；Wherein, k=j+1, j=1,3,5,7 and j≤n；

Step 220：The result B that second level input exports for the first order_jk[4:0], export as every 32,16 or 8 data Leading zero number B_jkrs[5:0], logic judgment is according to the highest order B for being each pair input data_jk[4], output is from advanced computing Result and/or input data low four B_jk[3:0]；Wherein, k=j+1, r=j+2, s=j+3, j=1,5 and j≤n；

Step 230：The result B that third level input exports for the second level_jkrs[5:0], export as the leading zero number B of 8n data； Logic judgment is according to the highest order B for being each pair input data_jkrs[5], result and/or input data from advanced computing are exported Low four B_jkrs[4:0]；Wherein, n=1~8.

3. the advanced operation method of floating point unit leading zero quantity according to claim 1, it is characterised in that：8-4 is decoded The decoding principle of device is, when input data is 8 ' b00000000,4 ' b1000 of output data；When input data is 8 ' During b00000001,4 ' b0111 of output data；When input data is 8 ' b0000001x, 4 ' b0110 of output data；When When input data is 8 ' b000001xx, 4 ' b0101 of output data；When input data is 8 ' b00001xxx, output data For 4 ' b0100；When input data is 8 ' b0001xxxx, 4 ' b0011 of output data；When input data is 8 ' b001xxxxx When, 4 ' b0010 of output data；When input data is 8 ' b01xxxxxx, 4 ' b0001 of output data；When input data is During 8 ' b1xxxxxxx, 4 ' b0000 of output data；Wherein, x=0 or 1.

4. the advanced operation method of floating point unit leading zero quantity according to claim 1 or 2, it is characterised in that：Data B_m [3:0] operational formula is：

B_m[3]=~(A [7] | A [6] | A [5] | A [4] | A [3] | A [2] | A [1] | A [0])；

B_m[0]=~(A [7] | (and A [6] | A [5] | (A [4] | A [3] | (A [2] | A [1] |~A [0]) ＆~A [2]) ＆~A [4]) ＆~ A[6])；

Wherein, A [p] be data A pth+1, B_m[q] (q=0~3) are data B_mQ+1；Wherein, p=0~7.

5. the advanced operation method of floating point unit leading zero quantity according to claim 1 or 2, it is characterised in that：It is each Grade logic judgment in, data are divided two-by-two to method be：

As n=1, B₁As the leading zero quantity of computing, computing terminate；When 1<During n≤8, by B_mBig-endian divides two-by-two For a pair, carry out first order computing parallel between each pair, last then retains B to less than two data when n is odd number_nNo It is computing, B_nValue be transferred directly to next stage, high position zero padding during digit deficiency；Or

As n=2, B₁And B₂Operation result B₁₂As the leading zero quantity of computing, computing terminate；When 2<During n≤8, into Two-level logic judges, by the output result B of first order logic judgment_jk(wherein k=j+1；J=1,3,5,7 and j≤n) by a high position It is divided into a pair two-by-two to low level, carries out second level logic judgment parallel between each pair, if last protects less than two data A data is stayed not do computing and are directly passed to next stage, high position zero padding, wherein k=j+1 during digit deficiency；J=1,3,5,7 and j≤n；Or

When 2<During n≤4, B₁₂₃₄As the leading zero quantity of computing, computing terminate；When 4<During n≤8, sentence into third level logic It is disconnected, computing is carried out to two operation results of second level logic judgment.

6. the advanced operation method of floating point unit leading zero quantity according to claim 5, it is characterised in that：Surpass per level-one Preceding computing includes：

In the advanced computing of the first order, if high-order numerical value B_j<8, then this is B to data leading zero number_jk=5 ' b { 00, B_j[2: 0]}；If high-order numerical value B_j=8, and low level numerical value B_k<8, then this is B to data leading zero number_jk=5 ' b { 01, B_k[2:0]}； If high-order numerical value B_j=8, and low level numerical value B_k=8, then this is B to data leading zero number_jk=5 ' b { 10,000 }；When n is strange During number, last is to there was only B_n, only need to be in high-order zero padding, that is, exportable B_jk=5 ' b { 0, B_n[3:0]}；Wherein, k=j+1, j= 1,3,5,7 and j≤n；

In the advanced computing in the second level, if high-order numerical value B_jk<16, then this is B to data leading zero number_jkrs=6 ' b { 00, B_jk [3:0]}；If high-order numerical value B_jk=16, and low level numerical value B_rs<16, then this is B to data leading zero number_jkrs=6 ' b 01, B_rs[3:0]}；If seniority top digit B_jk=16, and low level numerical value B_rs=16, then this is B to data leading zero number_jkrs=6 ' b {100,000}；When last is to there was only a data B_jkWhen, it only need to be in high-order zero padding, that is, exportable B_jkrs=6 ' b { 0, B_jk[4: 0]}；Wherein, k=j+1, r=j+2, s=j+3, j=1,5 and j≤n；

In the advanced computing of the third level, if high-order numerical value B₁₂₃₄<32, then this is B=7 ' b { 00, B to data leading zero number₁₂₃₄ [4:0]}；If high-order numerical value B₁₂₃₄=32, and low level numerical value B₅₆₇₈<32, then this to data leading zero number for B=7 ' b 01, B₅₆₇₈[4:0]}；If seniority top digit B₁₂₃₄=32, and low level numerical value B₅₆₇₈=32, then this to data leading zero number for B=7 ' b 1, 000,000}。

7. the advanced operation method of floating point unit leading zero quantity according to claim 5, it is characterised in that：It is patrolled per level-one It collects and judges to include：

In first order logic judgment, if B_jHighest order B_j[3]=0, then B is exported_jk=5 ' b { 00, B_j[2:0]}；If B_j[3] =1, and B_k[3]=0, then B is exported_jk=5 ' b { 01, B_k[2:0]}；If B_jAnd B [3]=1,_k[3]=1, then B is exported_jk=5 ' b {10,000}；When n is odd number, last is to there was only B_n, export B_jk=5 ' b { 0, B_n[3:0]}；Wherein, k=j+1, j=1, 3,5,7 and j≤n；

In the logic judgment of the second level, if B_jk[4]=0, then B is exported_jkrs=6 ' b { 00, B_jk[3:0]}；If B_jk[4]=1, and B_rs[4]=0, then B is exported_jkrs=6 ' b { 01, B_rs[3:0]}；If B_jkAnd B [4]=1,_rs[4]=1, then B is exported_jkrs=6 ' b {100,000}；When last is to there was only a data B_jkWhen, export B_jkrs=6 ' b { 0, B_jk[4:0]}；Wherein, k=j+1, r= J+2, s=j+3, j=1,5 and j≤n；

In third level logic judgment, if B₁₂₃₄[5]=0, then B=7 ' b { 00, B are exported₁₂₃₄[4:0]}；If B₁₂₃₄[5]=1 and B₅₆₇₈[5]=0, then B=7 ' b { 01, B are exported₅₆₇₈[4:0]}；If B₁₂₃₄And B [5]=1₅₆₇₈[5]=1, then export B=7 ' b 1, 000,000}.When only existing B₁₂₃₄, without B₅₆₇₈When, leading zero number B=B₁₂₃₄。

8. a kind of advanced arithmetic system of processor floating point unit leading zero quantity, it is characterised in that including：

First module obtains the leading zero number of every 8 data for decoding operation：By the data A [8n- that data bit is 8n 1:0] 8 one group is divided into according to the order from a high position to low level, n 8 data is translated by n 8-4 decoder respectively The number B of middle leading zero_m[3:0]；Wherein, B_mRepresent the leading zero number of 8 data of m groups, m=1~n, n=1~8；

Second module obtains data A [8n-1 for passing through the advanced computing of every level-one in three-level and logic judgment:0] before Zero number is led, per can two-by-two be divided input data in level-one pair, carries out computing parallel between each pair；Wherein, n is odd number When, last is to only there are one input datas.

9. the advanced arithmetic system of floating point unit leading zero quantity according to claim 8, it is characterised in that：8-4 is decoded The decoding principle of device is, when input data is 8 ' b00000000,4 ' b1000 of output data；When input data is 8 ' During b00000001,4 ' b0111 of output data；When input data is 8 ' b0000001x, 4 ' b0110 of output data；When When input data is 8 ' b000001xx, 4 ' b0101 of output data；When input data is 8 ' b00001xxx, output data For 4 ' b0100；When input data is 8 ' b0001xxxx, 4 ' b0011 of output data；When input data is 8 ' b001xxxxx When, 4 ' b0010 of output data；When input data is 8 ' b01xxxxxx, 4 ' b0001 of output data；When input data is During 8 ' b1xxxxxxx, 4 ' b0000 of output data；Wherein, x=0 or 1.

10. the advanced arithmetic system of floating point unit leading zero quantity according to claim 9, it is characterised in that：Data B_m [3:0] operational formula is：

B_m[3]=~(A [7] | A [6] | A [5] | A [4] | A [3] | A [2] | A [1] | A [0])；