CN103092561A - Goldschmidt division implementation method based on divisor mapping - Google Patents

Goldschmidt division implementation method based on divisor mapping Download PDF

Info

Publication number
CN103092561A
CN103092561A CN201310019685XA CN201310019685A CN103092561A CN 103092561 A CN103092561 A CN 103092561A CN 201310019685X A CN201310019685X A CN 201310019685XA CN 201310019685 A CN201310019685 A CN 201310019685A CN 103092561 A CN103092561 A CN 103092561A
Authority
CN
China
Prior art keywords
divisor
mapping
dividend
division
range
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310019685XA
Other languages
Chinese (zh)
Other versions
CN103092561B (en
Inventor
陈禾
闫雯
于文月
谢宜壮
曾涛
龙腾
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Institute of Technology BIT
Original Assignee
Beijing Institute of Technology BIT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Institute of Technology BIT filed Critical Beijing Institute of Technology BIT
Priority to CN201310019685.XA priority Critical patent/CN103092561B/en
Publication of CN103092561A publication Critical patent/CN103092561A/en
Application granted granted Critical
Publication of CN103092561B publication Critical patent/CN103092561B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Complex Calculations (AREA)

Abstract

本发明公开了一种基于除数映射的Goldschmidt除法实现方法,首先,把浮点形式的被除数Nf和除数Df规格化为f×2e的形式,规格化后的被除数和除数记为N和D;根据给定的最小相对误差E以及迭代次数M求出分界值p;若规格化后的除数D落在[1,p]区间内,则直接进行M次迭代;如果除数D落在[p,2)区间内,则将D映射到[1,p]区间内,然后,再进行M次迭代。迭代时,初始值F0=2-D0。M次迭代得到f部分的相除结果,最后将f部分的相除结果与2e部分的相减组合起来,得到最终的除法运算结果。该方法不需要初始估计值,从而能够节省大量的存储资源。

Figure 201310019685

The invention discloses a Goldschmidt division realization method based on divisor mapping. First, the dividend N f and divisor D f in floating point form are normalized into the form of f×2 e , and the normalized dividend and divisor are recorded as N and D; Calculate the boundary value p according to the given minimum relative error E and the number of iterations M; if the normalized divisor D falls within the [1,p] interval, then directly perform M iterations; if the divisor D falls in [ p, 2) interval, map D to the interval [1, p], and then perform M iterations. During iteration, the initial value F 0 =2-D 0 . The division result of the f part is obtained by M iterations, and finally the division result of the f part is combined with the subtraction of the 2e part to obtain the final division operation result. This method does not require an initial estimate, thus saving a lot of storage resources.

Figure 201310019685

Description

A kind of Goldschmidt division implementation method based on the divisor mapping
Technical field
The invention belongs to the signal process field, relate to a kind of Goldschmidt division implementation method based on the divisor mapping.
Background technology
Floating-point division is applied in the middle of the processor design more and more, becomes the pith that signal is processed.Although divider does not have the frequency of utilization of adder-subtractor high, the performance of divider has directly affected the effect that signal is processed, so accelerate the arithmetic speed of divider and reduce hardware resource to become a key issue.
The modern commerce processor mostly adopts parallel multiplier to improve the speed of division, utilize the iteration of multiplication to realize division arithmetic, and required precision is higher, and the number of times of iteration is more.Now the main iterative division based on multiplication has the Goldschmidt algorithm that Newton-Raphson algorithm that M.J.Flynn proposes and R.E.Goldschmidt propose in Applications of division by convergence one literary composition in On division by functional iteration one literary composition.
The Newton-Raphson algorithm is converted into division the inverse of asking divisor, then obtains the business by multiplying each other with dividend, and then the Goldschmidt algorithm directly tries to achieve the business by normalization divisor and dividend.These two each iteration of algorithm all relate to twice multiply operation, but this twice multiply operation can not parallel work-flow in the Newton-Raphson algorithm, can only carry out serial operation, and these two multiplication can parallel processing in the Goldschmidt algorithm, so adopt the processing speed of Goldschmidt algorithm can be than comparatively fast.
The Goldschmidt algorithm when carrying out division arithmetic, Q=N/D, wherein N is dividend, D is divisor, Q is the business, by constructing a group factor F 1, F 2..., F M, make molecule and denominator multiply each other with this group factor respectively, when denominator levels off to 1 the time, molecule is desired result of division.Molecule and denominator and this group factor multiply each other and realize by M iteration.
The iterative formula of traditional Goldschmidt division arithmetic can be write as following form:
F i + 1 = 2 - D i + 1 N i + 1 = N i F i D i + 1 = D i F i - - - ( 1 )
Wherein, i=1,2 ... M-1, M represents iterations, N iAnd D iRepresent respectively dividend and the divisor after iteration the i time, F iRepresent the factor.Here N 0And D 0Be dividend and the divisor that initially will calculate, and F 0Be that inverse by divisor estimates, but the precision of estimating is limited, the error of establishing initial estimate is s, F 0=(1-s)/D can know that through deriving the error of calculation of traditional Goldschmidt division algorithm is
Figure BDA00002750431200022
Can be found out by upper surface analysis, the error of traditional Goldschmidt algorithm iteration result depends on the evaluated error s of initial value, and namely the height of initial value estimated accuracy has determined the size of iteration resultant error.
For this problem, D.DasSarma and D.W.Matula be at Faithful Bipartite ROM Reciprocal Tables one literary composition, and M.J.Schulte and J.E.Stine have proposed multiple loop up table and reduce the initial estimate error to improve the precision of iteration result in Symmetric Bipartite Tables for Accurate Function Approximation one literary composition.
The principle of this method is the initial value of storage 1/D in ROM, in order to improve the precision of computing, just needs to increase the figure place of look-up table, and establishing the look-up table figure place is n, and the resource of look-up table is n2 n, namely needed resource will increase with exponential manner along with the figure place of look-up table.
In sum, in the situation that accuracy requirement is higher, the storage resources of look-up table will sharply increase along with figure place, in order to improve the performance of divider, reduce storage resources and will become a problem demanding prompt solution.
Summary of the invention
In view of this, the present invention proposes a kind of improved Goldschmidt division implementation method in order to solve the large problem of ROM storage resources, and this division algorithm does not need initial estimate, thereby can save a large amount of storage resources.
In order to solve the problems of the technologies described above, the present invention is achieved in that
A kind of Goldschmidt division implementation method based on the divisor mapping, the method comprises the steps:
Step 1, the dividend N of relocatable fWith divisor D fBe normalized to f * 2 eForm, wherein, f be magnitude portion and f ∈ [1,2), e is exponential part; Dividend after normalization and divisor are designated as N and D; Floating data be divided by be converted into magnitude portion be divided by and exponential part subtract each other;
Step 2, when magnitude portion is divided by, according to given minimum relative error E and in the situation that time delay meets the demands the value of the minimum iterations M of resource requirement, employing formula (I) is obtained cut off value p;
p = exp ( ln E 2 M ) + 1 - - - ( I )
Divisor D after step 3, judgement normalization and the size of cut off value p; If the divisor D after normalization drops in [1, p] interval, directly make N 0=N, D 0=D, execution in step 4; If normalized divisor D drop on [p, 2) in the interval, D is mapped to [1, p] interval by multiply by mapping coefficient, simultaneously dividend N also be multiply by described mapping coefficient, with the D after mapping and N assignment to D 0And N 0, and then execution in step 4;
Step 4, through type (IV) carry out the division arithmetic result that M iteration obtains magnitude portion;
F i = 2 - D i N i + 1 = N i F i D i + 1 = D i F i - - - ( IV )
Wherein, i=0,2 ..., M-1, N iAnd D iRepresent respectively dividend and the divisor after iteration the i time;
Step 5, the magnitude portion phase division result that step 4 is obtained and exponential part are subtracted each other and are combined, and obtain final division arithmetic result.
Preferably, in step 3, describedly D is mapped to [1, p] interval by multiply by mapping coefficient, the step that simultaneously dividend N also be multiply by described mapping coefficient is:
Step 31, obtain the value of minimum mapping segments T according to formula (II), and [p, 2) be divided into the T section;
T=ceil(log p2-1) (II)
Wherein, the ceil representative rounds up;
Step 32, through type (III) are obtained border and the mapping coefficient of the scope of respectively shining upon, and find the mapping scope at divisor D place, extract corresponding mapping coefficient; With divisor D and dividend N all with the correspondence mappings multiplication of extracting;
Range ( j , max ) = 2 p ( T - 1 - j ) Range ( j , min ) = 2 p T - j c ( j ) = p T - j 2 - - - ( III )
Range (j, min)And Range (j, max)Be respectively lower limit and the upper limit of j mapping scope, c (j) is the mapping coefficient of j mapping scope; J=0,1 ..., T-1.
Preferably, in advance mapping coefficient is carried out the CSD coding, and storage; During with divisor D and dividend N and correspondence mappings multiplication, adopt CSD to encode to be fixed the multiplier multiplication.
Beneficial effect:
(1) this improved Goldschmidt division algorithm of the present invention's proposition, do not need the estimation to the F initial value, can save a large amount of storage resources.
(2) error of this method iteration result is also just relevant with iterations with divisor, makes the operation result error controlled.
(3) this method also utilizes fixed number least resource multiplication algorithm to reduce the resource occupation of divisor mapping algorithm.
(4) this method is easy to the Parallel Implementation of hardware.
Description of drawings
Fig. 1 (a) and Fig. 1 (b) are divisor mapping process schematic diagram.
Fig. 2 is for improving Goldschmidt algorithm realization flow figure.
Embodiment
Below in conjunction with the accompanying drawing embodiment that develops simultaneously, describe the present invention.
At first the iterative formula of division arithmetic is modified, is write as following form:
F i = 2 - D i ( 2 ) N i + 1 = N i F i ( 3 ) D i + 1 = D i F i ( 4 )
Wherein, i=1,2 ... M-1, M represents iterations, N iAnd D iRepresent respectively dividend and the divisor after iteration the i time.Here N 0=N represents initial dividend, D 0=D represents initial divisor, and the initial value F of the factor 0Do not need to estimate, can directly be calculated by formula (2), so just can save storing initial estimated value F 0The ROM space.
Bring formula (2) into formula (4), the row iteration of going forward side by side is derived, and can obtain
D i + 1 - 1 = D i ( 2 - D i ) - 1 = - ( D i - 1 ) 2 = - ( D - 1 ) 2 i + 1 - - - ( 5 )
According to formula (3) and formula (4), can derive D iAnd N iIterative relation
N i + 1 = N D D i + 1 - - - ( 6 )
Wushu (5) substitution formula (6) can get
N i + 1 = N D [ 1 - ( D - 1 ) 2 i + 1 ] - - - ( 7 )
After M iteration of process, denominator D M≈ 1, therefore
Q M = N M D M ≈ D M = N D [ 1 - ( D - 1 ) 2 M ] - - - ( 8 )
Wherein, Q iBe the business after after the i time iteration.
The definition relative error
ϵ Q i = abs ( Q i - ( N / D ) N / D ) - - - ( 9 )
Wherein, the result of division relative error under M iteration is
ϵ Q M = ( D - 1 ) 2 M - - - ( 10 )
Can find out from formula (10), the relative error of improved division arithmetic result is only relevant with size and the iterations of divisor.
For floating-point system, N and D are represented with scientific notation, namely dividend and divisor all are normalized to f * 2 eForm, for example 2.5 is 1.25 * 2 after being normalized 1, wherein f be magnitude portion and f ∈ [1,2), e is exponential part, the division arithmetic of floating data is that mantissa is divided by, index subtracts each other, so a division arithmetic that only needs here to consider magnitude portion gets final product.The value scope through D and N after normalization be [1,2), in this scope, fixedly the time, relative error function is along with the monotonically increasing function that is increased to of D as iterations M, divisor D is more larger close to 2 relative errors.
Suppose when iterations is fixed as M, make relative error be not more than E for the parameter E of provisioning request, namely
( D - 1 ) 2 M ≤ E - - - ( 11 )
Can release
D ≤ exp ( ln E 2 M ) + 1 = p - - - ( 12 )
Divisor maximal value when p satisfies the relative error requirement exactly so, and, p ∈ [1,2).That is to say, only have when divisor D ∈ [l, p) time, could satisfy the requirement of relative error.
Therefore [1, requirement p) is in order to allow the divisor in all scopes can both satisfy error requirements but not all divisor D can satisfy D ∈, for the data of D ∈ (p, 2), take it is mapped to [1, p] method in scope satisfies accuracy requirement, as shown in Figure 1.
As p〉3/2 the time, only need a unified mapping coefficient can be mapped to the number in (p, 2) scope in [1, p] interval.For once mapping, when D ∈ (p, 2), define a mapping coefficient γ ∈ (0.5,1), D '=D * γ ∈ [1, p] is arranged, D ' is the D after shining upon. Respectively as dividend and divisor after mapping, divisor is mapped in [1, p] scope with N * γ and D * γ.At this moment, the relative error after M iteration of process will guarantee satisfy accuracy requirement by formula (11).
But when p<3/2, aforementioned unified mapping coefficient can not be mapped to [1 with the number in all (3/2,2) scopes, p] in the interval, therefore need (3/2,2) is divided into the T section, set a mapping coefficient c, choose suitable mapping coefficient according to the place section of D and shine upon for every section.Suppose (p, 2) are divided into the T section, wherein adjacent two sections can have common factor, and the mapping scope of note j section is (Range (j, min), Range (j, max)], j=0,1 ..., T-1, the mapping scope of T-1 section is (Range (T-1, min), 2), Range represents the border of the scope of shining upon, and following characteristic is arranged
1 ≤ Range ( 0 , min ) ≤ p Range ( j - 1 , max ) ≥ Range ( j , min ) Range ( T - 1 , max ) = 2 - - - ( 13 )
Note C={c (0), c (1) ..., c (T-1) } be the mapping coefficient of each section mapping scope, through following relation is arranged after mapping
c ( j ) Range ( j , max ) ≤ p c ( j ) Range ( j , min ) ≥ 1 - - - ( 14 )
When all mapping scopes were all nonoverlapping, needed mapping segments T was minimum, therefore order
c ( j ) Range ( j , max ) = p c ( j ) Range ( j , min ) = 1 - - - ( 15 )
Formula (13) can turn to
Range ( 0 , min ) = p Range ( j - 1 , max ) = Range ( j , min ) Range ( T - 1 , max ) = 2 - - - ( 16 )
In conjunction with formula (15) and (16), can be in the hope of each mapping scope and mapping coefficient, as follows
Range ( j , max ) = 2 p ( T - 1 - j ) Range ( j , min ) = 2 p T - j c ( j ) = p T - j 2 - - - ( 17 )
Can be obtained by foregoing formula (13) again,
1 ≤ Range ( 0 , min ) = 2 p T ≤ p - - - ( 18 )
Have
log p2-1≤T≤log p2 (19)
Therefore, the value of mapping segments T is determined by (19).Consider that T is integer, in order to choose minimum T, get here simultaneously
T=ceil(log p2-1) (20)
Wherein the ceil representative rounds up.
Derivation by above-mentioned divisor algorithm can be seen, under M time fixing iterated conditional, as long as before iteration, divisor and dividend multiply by different mapping coefficients in different scopes, just can make the operation result of all scope divisors all reach accuracy requirement by a mapping operations.
For the improvement Goldschmidt division algorithm that will carry out M iteration, according to formula (2,3,4) as can be known, each iteration needs 2M multiplication and M add operation (thinking that here the resource that plus-minus method takies is identical), and in hardware is realized the floating-point operation system, particularly high precision computation, with respect to multiplier, totalizer can be ignored, and therefore only considers 2M multiplier here.
Equally also need to consider in the divisor mapping algorithm multiplication to the mapping coefficient of each mapping scope, because molecule and denominator all need to multiply each other with mapping coefficient, therefore need twice multiplication.But consider in this situation it is multiplication to fixed coefficient, by mapping coefficient being carried out CSD(Canonical Signed-Digit Code, the code symbol code) coding can reduce multiplication hardware resource occupation amount, definition β (β ∈ [0,1]) for fixing multiplier multiplication, general multiplication is taken the resource ratio, represent one fixedly the multiplier multiplication unit to take hardware resource be β times that a general multiplication unit takies resource.
Define this algorithm under M iteration, relative error during less than E needed multiplier number be R (M),
R(M)=2M+2β (21)
When adopting CSD to encode to be fixed the multiplier multiplication, the mean value of β is 0.3.Macleod proposes a kind of fixedly multiplier least resource multiplication algorithm in Use of minimum-adder multiplier blocks in FIR digital filters one literary composition, reduce by 25% resource than CSD multiplication, and this moment, the mean value of β was 0.25.
Can be found out by (21) formula, when given accuracy requires, can be in the hope of function R(M) minimum value, that is to say after the assigned error precision, the algorithm that the present invention derives can be in the hope of satisfying the needed minimum iterations of this error precision, and take needed all parameters in the situation of resource minimum.
Based on above-mentioned derivation, the core concept that the present invention improves the Goldschmidt division arithmetic is at first, the dividend of relocatable and divisor to be normalized to f * 2 eForm, the dividend after normalization and divisor are designated as N and D; Obtain cut off value p, if D drops in [1, p] interval, directly utilize formula (2)~(4) to carry out iteration M time; If D drop on [p, 2) in the interval, D is mapped to [1, p] interval by multiply by mapping coefficient, simultaneously dividend N also be multiply by described mapping coefficient, then carries out iteration M time.M iteration obtains the phase division result of f part, at last with f phase division result and 2 partly eSubtracting each other of part combines, and obtains final division arithmetic result.
Referring to Fig. 2, its specific implementation process is as follows:
Step 1, the dividend N of relocatable fWith divisor D fBe normalized to f * 2 eForm, floating data is divided by and is converted into that magnitude portion is divided by and exponential part is subtracted each other, the span that needs to carry out the dividend N of division arithmetic and divisor D be [1,2).
N f = N × 2 e 1 D f = D × 2 e 2
Step 2, given minimum relative error E and in the situation that time delay meets the demands the value of the minimum iterations M of resource requirement.
Step 3, obtain cut off value p according to formula (12), interval [1,2) be divided into [1, p] and [p, 2), then judge the size of normalized divisor D and cut off value p.If the divisor after normalization drops in [1, p] interval, make N 0=N, D 0=D jumps to step 6; If normalized divisor drop on [p, 2) in the interval, order is carried out downwards.
Step 4, obtain the value of minimum mapping segments T according to formula (20), and [p, 2) be divided into the T section.
Step 5, through type (17) are obtained border and the mapping coefficient of the scope of respectively shining upon, and find the mapping scope at divisor D place, extract corresponding mapping coefficient; With divisor D and dividend N all with the correspondence mappings multiplication of extracting.With the mapping after divisor D and the N assignment to D 0And N 0, change step 6 over to.
Step 6, through type (2)~(4) are carried out M iteration and are obtained the division arithmetic result.
Step 7, the magnitude portion phase division result that step 6 is obtained and exponential part are subtracted each other and are combined, and obtain final division arithmetic result.
So far, this flow process finishes.
It is not overlapping that above flow process is based on all mapping scopes, the situation of formula (16) namely, and this moment, needed mapping segments T was minimum.In practice, also can not limit so strictly, only need to satisfy formula (13) and get final product, mapping coefficient is chosen the numerical value that satisfies formula (14) and is got final product at this moment.
The iteration that the present invention has derived this improvement algorithm is relative error as a result, and has obtained the relation between required hardware resource and accuracy requirement, makes in satisfying the scope of accuracy requirement, obtains taking the divider structure of resource minimum.
In sum, these are only preferred embodiment of the present invention, is not for limiting protection scope of the present invention.Within the spirit and principles in the present invention all, any modification of doing, be equal to replacement, improvement etc., within all should being included in protection scope of the present invention.

Claims (3)

1.一种基于除数映射的Goldschmidt除法实现方法,其特征在于,该方法包括如下步骤:1. a Goldschmidt division method realization method based on divisor mapping, is characterized in that, the method comprises the steps: 步骤1、把浮点形式的被除数Nf和除数Df规格化为f×2e的形式,其中,f为尾数部分且f∈[1,2),e为指数部分;规格化后的被除数和除数记为N和D;则浮点数据相除转化为尾数部分相除和指数部分相减;Step 1. Normalize the dividend N f and the divisor D f in floating point form into the form of f×2 e , where f is the mantissa part and f∈[1,2), and e is the exponent part; the normalized dividend The sum and divisor are recorded as N and D; the division of floating-point data is transformed into the division of the mantissa and the subtraction of the exponent; 步骤2、在尾数部分相除时,根据给定最小相对误差E以及在时延满足要求的情况下所需资源最少的迭代次数M的值,采用式(I)求出分界值p;Step 2. When dividing the mantissa part, according to the given minimum relative error E and the value of the number of iterations M with the least resource required when the time delay meets the requirements, use formula (I) to find the cut-off value p; pp == expexp (( lnln EE. 22 Mm )) ++ 11 -- -- -- (( II )) 步骤3、判断规格化后的除数D和分界值p的大小;若规格化后的除数D落在[1,p]区间内,则直接令N0=N,D0=D,执行步骤4;若规格化的除数D落在[p,2)区间内,则通过乘以映射系数将D映射到[1,p]区间,同时将被除数N也乘以所述映射系数,将映射后的D和N赋值给D0和N0,然后再执行步骤4;Step 3. Determine the size of the normalized divisor D and the boundary value p; if the normalized divisor D falls within the interval [1,p], directly set N 0 =N, D 0 =D, and execute step 4 ; If the normalized divisor D falls within the [p, 2) interval, D is mapped to the [1, p] interval by multiplying the mapping coefficient, and the dividend N is also multiplied by the mapping coefficient, and the mapped Assign D and N to D 0 and N 0 , and then perform step 4; 步骤4、通过式(IV)进行M次迭代得到尾数部分的除法运算结果;Step 4, performing M iterations through formula (IV) to obtain the division result of the mantissa part; Ff ii == 22 -- DD. ii NN ii ++ 11 == NN ii Ff ii DD. ii ++ 11 == DD. ii Ff ii -- -- -- (( IVIV )) 其中,i=0,2,…,M-1,Ni和Di分别代表第i次迭代之后的被除数和除数;Among them, i=0,2,...,M-1, N i and D i respectively represent the dividend and divisor after the ith iteration; 步骤5、将步骤4得到的尾数部分相除结果与指数部分相减组合起来,得到最终的除法运算结果。Step 5. Combining the division result of the mantissa part obtained in step 4 with the subtraction of the exponent part to obtain the final result of the division operation. 2.如权利要求1所述的方法,其特征在于,步骤3中,所述通过乘以映射系数将D映射到[1,p]区间,同时将被除数N也乘以所述映射系数的步骤为:2. The method according to claim 1, characterized in that, in step 3, said D is mapped to [1, p] interval by multiplying the mapping coefficient, and the dividend N is also multiplied by the step of the mapping coefficient for: 步骤31、根据式(II)求出最小映射分段数T的值,则[p,2)被分为了T段;Step 31, calculate the value of the minimum mapping segment number T according to formula (II), then [p, 2) is divided into T segments; T=ceil(logp2-1)  (II)T=ceil(log p 2-1) (II) 其中,ceil代表向上取整;Among them, ceil represents rounding up; 步骤32、通过式(III)求出各映射范围的边界以及映射系数,找到除数D所在的映射范围,提取对应的映射系数;将除数D和被除数N均与提取的对应映射系数相乘;Step 32. Calculate the boundaries and mapping coefficients of each mapping range through formula (III), find the mapping range where the divisor D is located, and extract the corresponding mapping coefficients; multiply the divisor D and the dividend N by the extracted corresponding mapping coefficients; RangeRange (( jj ,, maxmax )) == 22 pp (( TT -- 11 -- jj )) RangeRange (( jj ,, minmin )) == 22 pp TT -- jj cc (( jj )) == pp TT -- jj 22 -- -- -- (( IIIIII )) Range(j,min)和Range(j,max)分别为第j个映射范围的下限和上限,c(j)为第j个映射范围的映射系数;j=0,1,…,T-1。Range (j, min) and Range (j, max) are the lower limit and upper limit of the jth mapping range, respectively, and c(j) is the mapping coefficient of the jth mapping range; j=0,1,...,T-1 . 3.如权利要求1或2所述的方法,其特征在于,预先对映射系数进行CSD编码,并存储;将除数D和被除数N与对应映射系数相乘时,采用CSD编码来进行固定乘数乘法。3. The method according to claim 1 or 2, characterized in that, the mapping coefficients are pre-CSD coded and stored; when the divisor D and the dividend N are multiplied with the corresponding mapping coefficients, the CSD coding is used to carry out the fixed multiplier multiplication.
CN201310019685.XA 2013-01-18 2013-01-18 A kind of Goldschmidt division implementation method mapped based on divisor Active CN103092561B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310019685.XA CN103092561B (en) 2013-01-18 2013-01-18 A kind of Goldschmidt division implementation method mapped based on divisor

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310019685.XA CN103092561B (en) 2013-01-18 2013-01-18 A kind of Goldschmidt division implementation method mapped based on divisor

Publications (2)

Publication Number Publication Date
CN103092561A true CN103092561A (en) 2013-05-08
CN103092561B CN103092561B (en) 2015-11-25

Family

ID=48205184

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310019685.XA Active CN103092561B (en) 2013-01-18 2013-01-18 A kind of Goldschmidt division implementation method mapped based on divisor

Country Status (1)

Country Link
CN (1) CN103092561B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105389157A (en) * 2015-10-29 2016-03-09 中国人民解放军国防科学技术大学 Goldschmidt algorithm-based floating-point divider
CN107562408A (en) * 2016-06-30 2018-01-09 德州仪器公司 Support can interrupt the framework and instruction set of floating-point division
CN108241868A (en) * 2016-12-26 2018-07-03 浙江宇视科技有限公司 Image objective similarity to subjective similarity mapping method and device
CN111104092A (en) * 2019-12-06 2020-05-05 北京多思安全芯片科技有限公司 Fast divider and division operation method
CN111610960A (en) * 2018-08-30 2020-09-01 合肥工业大学 A LMS-based divider module and its division method
CN112732223A (en) * 2020-12-31 2021-04-30 上海安路信息科技股份有限公司 Data processing method and system for half-precision floating-point number divider

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101326486A (en) * 2005-12-02 2008-12-17 富士通株式会社 Computing device and computing method for performing division operation or square root operation of floating-point numbers
US20090216825A1 (en) * 2008-02-26 2009-08-27 International Business Machines Corporation Method, system and computer program product for detecting errors in fixed point division operation results
CN102460424A (en) * 2009-06-10 2012-05-16 新思科技有限公司 Multiplicative division circuit with reduced area

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101326486A (en) * 2005-12-02 2008-12-17 富士通株式会社 Computing device and computing method for performing division operation or square root operation of floating-point numbers
US20090216825A1 (en) * 2008-02-26 2009-08-27 International Business Machines Corporation Method, system and computer program product for detecting errors in fixed point division operation results
CN102460424A (en) * 2009-06-10 2012-05-16 新思科技有限公司 Multiplicative division circuit with reduced area

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
DANIEL PISO等: "《Variable Latency Goldschmidt Algorithm Based on a New Rounding Method and a Remainder Estimate》", 《IEEE TRANSACTION ON COMPUTERS》, vol. 60, no. 11, 30 November 2011 (2011-11-30) *
MILOS D.ERCEGOVAC等: "《Improving Goldschmidt Division,Square Root,and Square Root Reciprocal 》", 《IEEE TRANSACTION ON COMPUTERS》, vol. 49, no. 7, 31 July 2000 (2000-07-31) *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105389157A (en) * 2015-10-29 2016-03-09 中国人民解放军国防科学技术大学 Goldschmidt algorithm-based floating-point divider
CN107562408A (en) * 2016-06-30 2018-01-09 德州仪器公司 Support can interrupt the framework and instruction set of floating-point division
CN107562408B (en) * 2016-06-30 2023-09-12 德州仪器公司 Implementation method and system supporting interruptible floating point division
CN108241868A (en) * 2016-12-26 2018-07-03 浙江宇视科技有限公司 Image objective similarity to subjective similarity mapping method and device
CN108241868B (en) * 2016-12-26 2021-02-02 浙江宇视科技有限公司 Method and device for mapping objective similarity of images to subjective similarity
CN111610960A (en) * 2018-08-30 2020-09-01 合肥工业大学 A LMS-based divider module and its division method
CN111610960B (en) * 2018-08-30 2022-09-27 合肥工业大学 Divider module based on LMS and division method thereof
CN111104092A (en) * 2019-12-06 2020-05-05 北京多思安全芯片科技有限公司 Fast divider and division operation method
CN111104092B (en) * 2019-12-06 2022-10-11 北京多思安全芯片科技有限公司 Fast divider and division operation method
CN112732223A (en) * 2020-12-31 2021-04-30 上海安路信息科技股份有限公司 Data processing method and system for half-precision floating-point number divider
CN112732223B (en) * 2020-12-31 2024-04-30 上海安路信息科技股份有限公司 Semi-precision floating point divider data processing method and system

Also Published As

Publication number Publication date
CN103092561B (en) 2015-11-25

Similar Documents

Publication Publication Date Title
CN103092561A (en) Goldschmidt division implementation method based on divisor mapping
Barrois et al. The hidden cost of functional approximation against careful data sizing—A case study
CN103294446A (en) Fixed-point multiply-accumulator
CN106325756B (en) Data storage method, data calculation method and equipment
CN103970720A (en) Embedded reconfigurable system based on large-scale coarse granularity and processing method of system
CN103677737A (en) Method and device for achieving low delay CORDIC trigonometric function based on carry-save summator
Shi et al. Spectral-collocation method for Volterra delay integro-differential equations with weakly singular kernels
Soares et al. Approximate adder synthesis for area-and energy-efficient FIR filters in CMOS VLSI
CN104215962B (en) Chirp Scaling formation method based on monolithic FPGA
Mohanty et al. Modified PEB formulation for hardware-efficient fixed-width booth multiplier
Murillo et al. A suite of division algorithms for posit arithmetic
CN107423026A (en) The implementation method and device that a kind of sin cos functionses calculate
Goyal et al. Computation-distortion characteristics of block transform coding
Sarbishei et al. On the fixed-point accuracy analysis and optimization of FFT units with CORDIC multipliers
US20150363170A1 (en) Calculation of a number of iterations
US20160041947A1 (en) Implementing a square root operation in a computer system
CN111694543B (en) Approximate multiplier design method, approximate multiplier and image sharpening circuit
CN108334305A (en) A kind of division operation method, device, equipment and storage medium
JP5816383B2 (en) Interleaver index generation apparatus and method
Yan et al. Improved Goldschmidt division method using mapping of divisors
CN105278923B (en) A kind of signal sources number detection hardware circuit and its implementation based on AIC information criterions
CN105846873B (en) Triangle systolic array architecture QR decomposer and decomposition method based on advanced iterative
Guardia Implementation of a fully pipelined BCD multiplier in FPGA
Stamenković et al. Constant-coefficient FIR filters based on residue number system arithmetic
KR20170030327A (en) Apparatus and method for controlling operation

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant