JP2002244843A

JP2002244843A - Method and device for calculating reciprocal

Info

Publication number: JP2002244843A
Application number: JP2001386674A
Authority: JP
Inventors: Alexei Krouglov; クロウグラブアレクセイ; Jie Zhou; ゾウジエ; Daniel Gudmunson; ガドマンソンダニエル
Original assignee: Sicon Video Corp
Current assignee: Sicon Video Corp
Priority date: 2000-12-20
Filing date: 2001-12-19
Publication date: 2002-08-30
Also published as: GB0129122D0; FR2818407A1; GB2372353A

Abstract

PROBLEM TO BE SOLVED: To provide a method and a device for calculating the reciprocal of normalized decimal part in a floating-point input number D. SOLUTION: An expression for determining the minimum size of a look-up table according to necessary precision and an expression for calculating a look-up entry are provided. The look-up table stores an initial approximation and a correction factor. The initial approximation and the correction factor are designated in address by a number corresponding to the highest position bit of a decimal part and are used for calculating a reciprocal of the initial approximate by a linear interpolation requiring a subtraction and a multiplication. A result of the linear interpolation can be supplied to a Newton-Raphson reiteration device requiring two multiplications and a complement calculation of 2 for each iteration to be able to double the precision of the reciprocal.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、信号処理に関し、
詳細には、逆数を計算するための方法および装置に関す
る。TECHNICAL FIELD The present invention relates to signal processing.
In particular, it relates to a method and apparatus for calculating a reciprocal.

【０００２】[0002]

【従来の技術】逆数を計算することは、特に浮動小数点
１０進数での除算演算で重要な役割を果たす。逆数を使
用することにより、２つの数の除算の結果は、被除数に
除数の逆数を掛けることによって得ることができる。こ
の除算方法を使用して、コンピュータなどのデジタル処
理装置や、デジタル信号処理（ＤＳＰ）プロセッサなど
のアプリケーション特有の集積回路で、複雑な計算を処
理する速度を向上することができる。2. Description of the Related Art Calculating a reciprocal plays an important role especially in a division operation with a floating-point decimal number. By using the reciprocal, the result of dividing the two numbers can be obtained by multiplying the dividend by the reciprocal of the divisor. This division method can be used to increase the speed at which complex calculations are processed by digital processing devices such as computers and application-specific integrated circuits such as digital signal processing (DSP) processors.

【０００３】参照により本明細書に組み込まれる２進浮
動小数点計算のためのＩＥＥＥ標準０７５４Ｐ−１９８
５によれば、浮動小数点フォーマット中の浮動小数点標
準数は、単精度の仮数（小数部）２４ビット長を有する
３２ビット内にパックされ、または倍精度の仮数５３ビ
ット長を有する６４ビット内にパックされる。[0003] IEEE Standard 0754P-198 for Binary Floating Point Computation, which is incorporated herein by reference.
According to 5, the floating-point standard number in the floating-point format is packed into 32 bits with a single-precision mantissa (decimal part) of 24 bits or 64 bits with a double-precision mantissa of 53 bits. Will be packed.

【０００４】直接近似、線形補間、２次（ｓｑｕａｒ
ｅ）補間、３次（ｃｕｂｉｃ）補間などを含むいくつか
の補間方法および反復方法が、逆数を計算するために開
発者によって広く使用されている。[0004] Direct approximation, linear interpolation, quadratic (square)
e) Several interpolation and iterative methods, including interpolation, cubic interpolation, etc., are widely used by developers to calculate reciprocals.

【０００５】逆数を得る直接近似方法では、逆数につい
てのすべての可能な小数部が、ＲＯＭテーブル中に格納
される。この方法を使用すれば結果を直ちに得ることが
できるが、この方法は、非常に大きいメモリ容量を必要
とする。例えば、ＩＥＥＥ標準７５４単精度浮動小数点
フォーマットによる逆数を得るためには、２²³×２３＝
１８４Ｍビットのメモリを必要とする。In the direct approximation method for obtaining the reciprocal, all possible fractions for the reciprocal are stored in a ROM table. Although the results can be obtained immediately using this method, this method requires a very large amount of memory. For example, to obtain the reciprocal in the IEEE standard 754 single precision floating point format, 2 ²³ × 23 =
Requires 184 Mbits of memory.

【０００６】線形補間方法は、計算からの平均値の定理
に基づき、逆数の計算に関して以下のように要約するこ
とができる。[0006] The linear interpolation method is based on the mean value theorem from the calculation and can be summarized as follows for the calculation of the reciprocal.

【０００７】[0007]

【数４】 (Equation 4)

【０００８】上式で、ξ∈［ｘ₀，ｘ］かつｘ≧ｘ₀であ
る。In the above equation, ξ∈ [x ₀ , x] and x ≧ x ₀ .

【０００９】２次補間、３次補間、および他の補間方法
を使用して、必要な精度を有する逆数を得ることも可能
である。しかし、これらの方法はすべて、追加の乗算演
算を必要とし、補正係数を格納するための追加のメモリ
を必要とする。補間方法の主な欠点は、所望の精度が向
上するにつれて、必要なデータを格納するために必要な
メモリ量も増大することである。It is also possible to use quadratic, cubic and other interpolation methods to obtain the reciprocal with the required precision. However, all of these methods require additional multiplication operations and additional memory to store correction factors. A major drawback of the interpolation method is that as the desired accuracy increases, so does the amount of memory required to store the required data.

【００１０】デジタルコンピュータにおいて、ニュート
ンラフソン反復法は、逆数を計算するために広く使用さ
れている。この方法により、方程式ｆ（ｚ）＝０（２）の解が、以下の再帰式を使用することに基づいて与えら
れる。In digital computers, the Newton-Raphson iteration method is widely used to calculate the reciprocal. In this way, a solution to the equation f (z) = 0 (2) is provided based on using the following recursive equation:

【００１１】[0011]

【数５】 (Equation 5)

【００１２】反復ｉの後に得られる値ｚ_iは、２次にｚ
に向かって収束し、その結果反復ｉおよびｉ＋１の後の
対応する誤差εは、以下の式によって関係付けられる。 ε（ｚ_i+1）≦ε²（ｚ_i）（４）The value z _i obtained after iteration i is quadratic
And consequently the corresponding error ε after iterations i and i + 1 is related by: ε (z _{i + 1} ) ≦ ε ² (z _i ) (4)

【００１３】逆数ｘ＝（１／ａ）を計算するためにニュ
ートンラフソン法を使用することにより、以下の式が生
成される。ｘ_i+1＝ｘ_i＊（２−ａ＊ｘ_i）（５）By using the Newton-Raphson method to calculate the reciprocal x = (1 / a), the following equation is generated. _{_{x i + 1 = x i *}} (2-a * x i) (5)

【００１４】方程式（５）からわかるように、この方法
のあらゆる反復ステップは、連続して実行される２つの
乗算演算と、１つの「２の補数」演算とを含む。したが
って、逆数の精度は各反復ステップの後に２倍になる。
ニュートンラフソン反復方法自体の欠点は、必要とする
精度を有する逆数を得るために複数の反復ステップを必
要とする可能性があることである。As can be seen from equation (5), every iteration step of the method involves two multiplication operations performed in succession and one "two's complement" operation. Thus, the accuracy of the reciprocal is doubled after each iteration step.
A disadvantage of the Newton-Raphson iteration method itself is that it may require multiple iteration steps to obtain a reciprocal with the required accuracy.

【００１５】上記の欠点を克服するために、あるタイプ
の補間方法を使用して逆数の初期近似を得、次いでこの
近似に基づいて反復方法を使用する方法が発展した。一
例として、逆テーブルを使用して連続する反復について
の初期値を得ることが提案された。To overcome the above drawbacks, methods have been developed that use some type of interpolation method to obtain an initial approximation of the reciprocal, and then use an iterative method based on this approximation. As an example, it has been proposed to use an inverted table to obtain initial values for successive iterations.

【００１６】[0016]

【発明が解決しようとする課題】本発明は、逆数を直ち
に、かつ高精度で送達することのできる、値を除算する
ための方法および装置を提供する。SUMMARY OF THE INVENTION The present invention provides a method and apparatus for dividing values that can deliver reciprocals immediately and with high precision.

【００１７】[0017]

【課題を解決するための手段】本発明の方法によれば、
逆数の近似を得るために線形補間が使用される。次い
で、この近似をニュートンラフソン反復のための入力値
として使用し、高精度で逆数を計算することができる。According to the method of the present invention,
Linear interpolation is used to obtain an inverse approximation. This approximation can then be used as input for Newton-Raphson iterations to calculate the reciprocal with high accuracy.

【００１８】従来技術の方法とは異なり、本発明の方法
は、必要な精度を有する逆数の近似を得るために、ルッ
クアップテーブル中のエントリの最小数を計算するため
の式を提供する。本発明の方法は、初期近似を計算する
ための式と、ルックアップテーブル中のエントリを構成
するための補正係数も提供する。本発明の方法を実装す
るための装置は、これらの値を格納するためのルックア
ップテーブルメモリ、整数乗算器、および減算器を備え
る。Unlike the prior art methods, the method of the present invention provides an equation for calculating the minimum number of entries in the look-up table to obtain an approximation of the reciprocal with the required precision. The method of the present invention also provides an equation for calculating the initial approximation and a correction factor for constructing an entry in the look-up table. An apparatus for implementing the method of the present invention comprises a look-up table memory for storing these values, an integer multiplier, and a subtractor.

【００１９】したがって本発明は、入力信号によって表
される、正規化された小数部Ｍ（ただし１≦Ｍ＜２）を
有する入力値Ｄの逆数を近似する出力値を表す出力信号
を生成するための方法を提供する。この方法は、入力信
号が、１組のＮ₀個の最上位ビットを含み、出力信号
が、Ｎ≦Ｎ₀として所望の精度ε＝２^-Nを有する逆数を
近似し、以下のステップを含む。Accordingly, the present invention provides an output signal representing an output value, represented by an input signal, that approximates the reciprocal of an input value D having a normalized fractional part M, where 1 ≦ M <2. To provide a way. The method includes an input signal including a set of N ₀ most significant bits and an output signal approximating a reciprocal having a desired accuracy ε = 2 ^−N as N ≦ N ₀ , and includes the following steps: .

【００２０】ａ．入力信号の１組のＰ個の最上位ビッ
トに対して、ｎ＝２^Pとして、複数のルックアップテー
ブル中のエントリの数ｎを生成するステップであって、
ｉ．ｉ＝０，．．．，ｎ−１として、第１ルックアップ
テーブル中の、１組のＮ個の有効ビットを含む１組の入
力エントリｙ_iを生成するサブステップと、ｉｉ．ｉ＝
０，．．．，ｎ−１として、第２ルックアップテーブル
中の、１組の（Ｎ−Ｐ）個の有効ビットを含む１組の入
力エントリＫ_iを生成するサブステップとを含むステッ
プと、ｂ．入力信号のＰ個の最上位ビットの組に対応
するルックアップテーブル中のエントリｙ_iおよびＫ_iを
見つけるステップと、ｃ．Ｋ_iに、入力信号のＰ個の
最上位ビットの組に続く１組の（Ｎ−Ｐ）個の有効ビッ
トを含む信号を掛けるステップと、ｄ．エントリｙ_i
のＮ個の有効ビットの組から、（Ｎ−Ｐ）個の最上位ビ
ットの組を引くステップ。A. Generating a number n of entries in a plurality of lookup tables, where n = 2 ^P for a set of P most significant bits of the input signal,
i. i = 0,. . . , N-1 generating a set of input entries y _i in the first look-up table, including a set of N significant bits, as a first lookup table; ii. i =
0,. . . , N-1 to generate a set of input entries K _i in the second look-up table including a set of (N−P) significant bits; b. Finding entries y _i and K _i in a look-up table corresponding to the set of P most significant bits of the input signal; c. Multiplying K _i by a signal comprising a set of (N−P) significant bits following the set of P most significant bits of the input signal; d. Entry y _i
Subtracting the (N−P) most significant bits from the N significant bits.

【００２１】本発明の方法の別の態様では、ルックアッ
プテーブル中のｎ個のエントリを生成するステップは以
下のサブステップを含む。ｉｉｉ．所望の精度よりも
高い精度を得るために必要なルックアップテーブルエン
トリの最小数ｌを計算するサブステップであって、In another aspect of the method of the present invention, the step of generating n entries in the look-up table includes the following sub-steps. iii. Calculating a minimum number l of look-up table entries required to obtain an accuracy higher than the desired accuracy,

【００２２】[0022]

【数６】 (Equation 6)

【００２３】であるサブステップと、ｉｖ．２^P-1＜
ｌかつ２^P≧ｌであるとして、ｎ＝２^Pに対するルックア
ップテーブルエントリの必要な最小数ｎを見つけるサブ
ステップ。第１ルックアップテーブル中の１組の入力エ
ントリを生成するステップが以下のサブステップを含
む。Ａ．Iv. 2 ^P-1 <
substep of finding the required minimum number n of look-up table entries for n = 2 ^P , where l and 2 ^P ≧ l. Generating a set of input entries in the first look-up table includes the following sub-steps. A.

【００２４】[0024]

【数７】 (Equation 7)

【００２５】ここで、ｉ＝０，．．．，ｎ−１、ｘ₀＝
１、および、ｘ_i+1＝ｘ_i＋（１／ｎ）であるとして計算
するサブステップと、Ｂ．１組のＮ個の有効ビットを
含み、ｉ＝０，．．．，ｎ−１に対するｙ＾_iの小数部
を近似するエントリｙ_iを見つけるサブステップであ
る。かつ／または第２ルックアップテーブル中の１組の
入力エントリを生成するステップが以下のサブステップ
を含む。Here, i = 0,. . . , N−1, x ₀ =
B.1, and calculating as x _{i + 1} = x _i + (1 / n); It contains a set of N valid bits, i = 0,. . . A sub-step of finding an entry y _i approximating the fractional portion of the y ^ _i for n-1. And / or generating a set of input entries in the second lookup table includes the following sub-steps.

【００２６】[0026]

【数８】 (Equation 8)

【００２７】を計算するサブステップと、１組の（Ｎ−
Ｐ）個の有効ビットを含むエントリＫ _iを見つけ、ｉ＝
０，．．．，ｎ−１に対するＫ＾_iの整数部を近似する
サブステップ。And a set of (N−
P) Entry K containing valid bits _iAnd find i =
0,. . . , N−1}_iApproximate the integer part of
Substep.

【００２８】本発明は、Ｎ₀≧Ｎとして１組のＮ₀個の最
上位ビットを含む正規化された小数部Ｍ（ただし１≦Ｍ
＜２）を有する入力値Ｄの精度ε＝２^-Nを有する逆数Ｉ
を計算するための装置をさらに提供する。この装置は、
少なくとも１つのプロセッサと、小数部ＭのＰ個の最上
位ビットの関数としてアドレス指定されるルックアップ
テーブルを形成し、１組のＮ個の有効ビットを含む出力
Ｉ₀を有する第１メモリと、小数部ＭのＰ個の最上位ビ
ットの関数としてアドレス指定されるルックアップテー
ブルを形成し、１組の（Ｎ−Ｐ）個の有効ビットを含む
出力Ｋを有する第２メモリと、小数部Ｍおよび出力Ｋの
Ｐ個の最上位ビットの組に続く１組の（Ｎ−Ｐ）個の有
効ビットの２つの入力と、１組の（Ｎ−Ｐ）×（Ｎ−
Ｐ）有効ビットを含む出力ＭＵとを有する、サイズ（Ｎ
−Ｐ）×（Ｎ−Ｐ）の乗算器と、出力Ｉを有し、出力Ｉ
₀と出力ＭＵの（Ｎ−Ｐ）個の最上位ビットの組とをそ
れぞれ受け取るように接続される２つの入力を有する加
算器／減算器とを備える。The present invention provides a normalized fraction M (where 1 ≦ M) containing a set of N ₀ most significant bits, where N ₀ ≧ N.
Reciprocal I with precision ε = 2- ^N of input value D having <2)
There is further provided an apparatus for calculating This device is
At least one processor and a first memory forming a look-up table addressed as a function of the P most significant bits of the fractional part M and having an output I ₀ comprising a set of N significant bits; A second memory forming a look-up table addressed as a function of the P most significant bits of the fraction M and having an output K comprising a set of (NP) significant bits; And two inputs of a set of (NP) valid bits following the set of P most significant bits of output K, and a set of (NP) * (N-
P) with the output MU containing the significant bits
−P) × (NP) multiplier and an output I.
An adder / subtractor having two inputs connected to receive ₀ and the (NP) most significant bit set of the output MU, respectively.

【００２９】本発明の装置の別の態様では、第１メモリ
および第２メモリが、Ｉ₀およびＫのどちらも格納する
記憶装置に組み合わされ、小数部ＭのＰ個の最上位ビッ
トの関数としてアドレス指定される。装置が、Ｉに基づ
いて、プログラムされたニュートンラフソン反復を実行
するための装置をさらに備える。第１メモリが読取り専
用メモリ（ＲＯＭ）を備える。第２メモリが読取り専用
メモリ（ＲＯＭ）を備える。記憶装置が少なくとも１つ
の読取り専用メモリ（ＲＯＭ）を備える。および／また
は装置がデジタル信号処理装置中に含まれる。In another aspect of the apparatus of the present invention, the first memory and the second memory are combined with a storage device that stores both I ₀ and K, and as a function of the P most significant bits of the fractional part M. Addressed. The apparatus further comprises an apparatus for performing a programmed Newton-Raphson iteration based on I. The first memory comprises a read-only memory (ROM). The second memory comprises a read-only memory (ROM). The storage device includes at least one read-only memory (ROM). And / or a device is included in the digital signal processing device.

【００３０】図面は、本発明の好ましい実施形態を例示
的にのみ示す。The drawings show by way of example only preferred embodiments of the invention.

【００３１】[0031]

【発明の実施の形態】本発明は、浮動小数点フォーマッ
ト中の２進数Ｄの小数部Ｍを使用する計算に適用可能で
ある。入力数の小数部Ｍは、あらかじめ正規化されてい
る、すなわち１以上かつ２未満、１≦Ｍ＜２であると仮
定する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The present invention is applicable to calculations using the fractional part M of a binary number D in floating point format. It is assumed that the decimal part M of the number of inputs is normalized in advance, that is, 1 or more and less than 2, and 1 ≦ M <2.

【００３２】図１は、逆数を近似するために本発明の方
法で使用される線形補間方法の好ましい実施形態を示
す。直接近似方法と比べて、線形補間は、ルックアップ
テーブル中に格納する必要のあるエントリ数を著しく低
減する。直接近似方法を使用すると共に、逆数の小数部
Ｍに対してＮビットの精度ε＝２^-Nを達成するために
は、ルックアップテーブルは、２^N−１個の、等しく間
隔を置いて配置されたエントリを必要とするが、線形補
間方法を使用して、逆数の小数部に対して同じＮビット
の精度ε＝２^-Nを得るためには、Ｐ≦Ｎとして、ルック
アップテーブル中に２^P個のエントリを有することで十
分である。各エントリは前のエントリから２^- ^Pだけ異な
るので、小数部ＭのＰ個の最上位ビット（ＭＳＢ）は、
ルックアップテーブルアドレスを構成する。FIG. 1 shows a preferred embodiment of the linear interpolation method used in the method of the present invention to approximate reciprocals. Compared to the direct approximation method, linear interpolation significantly reduces the number of entries that need to be stored in the look-up table. In order to use the direct approximation method and to achieve N-bit precision ε = 2- ^N for the reciprocal fraction M, the look-up table is 2 ^N −1 equally spaced requires a entries, using the linear interpolation method, in order to obtain the accuracy epsilon = 2 ^-N of the same N bit for the decimal part of the reciprocal, as P ≦ N, while the look-up table It is sufficient to have 2 ^P entries. Since each entry differs from the previous entry by 2 ^- ^P , the P most significant bits (MSB) of the fractional part M are
Configure the lookup table address.

【００３３】図１の値ｘ_iおよびｘ_i+1は、ルックアップ
テーブル中の２つの連続するエントリを表す。値ｙ
_iは、ルックアップテーブル中に格納される値（１／
ｘ_i）の近似を表す。値（ｘ−ｘ_i）は、小数部の最初の
Ｐ個のＭＳＢのすぐ後ろのビットによって表される。The values x _i and x _{i + 1 in} FIG. 1 represent two consecutive entries in the look-up table. Value y
_i is the value stored in the lookup table (1 /
x _i ). The value (x-x _i) is represented by the bit immediately after the first of the P MSB of the fractional part.

【００３４】必要な逆数の近似は、以下の式によって表
される。ｙ＝ｙ_i−ｋ_i（ｘ−ｘ_i）（６）上式で、ｋ_iは補正係数であり、ｘ_i≦ｘ＜ｘ_i+1であ
る。The required reciprocal approximation is represented by the following equation: y = In _{_{y i -k i (x-x}} i) (6) the above equation, k _i is a correction factor, which is _{_{x i ≦ x <x i +}} 1.

【００３５】Ｎビットの精度を有する逆数の小数部Ｍを
得るためには、小数部ＭのＰ個のＭＳＢの後ろの少なく
とも（Ｎ−Ｐ）ビットが、図１に示す補正係数ｋ_iの少
なくとも（Ｎ−Ｐ）個の有効ビットと一致しなければな
らない。これらの係数は、整数形Ｋ_i＝２^N-P×ｋ_iでル
ックアップテーブル中に格納される。したがって、Ｎビ
ットの精度を有する逆数の小数部Ｍを得ることは、サイ
ズ（Ｎ−Ｐ）×（Ｎ−Ｐ）の整数の乗算器中で、小数部
Ｍの（Ｎ−Ｐ）ビットに係数Ｋ_iの（Ｎ−Ｐ）ビットを
掛け、結果を２^N-Pで割り、次いでその商をｙ_iから引く
ステップを含む。In order to obtain a reciprocal fraction M with N-bit precision, at least (NP) bits after the P MSBs of the fraction M are at least (NP) bits of the correction coefficient k _i shown in FIG. Must match (NP) valid bits. These coefficients are stored in a look-up table in the integer form K _i = 2 ^NP × k _i . Therefore, to obtain a reciprocal fraction M having N-bit precision, a coefficient is added to the (NP) bits of the fraction M in an integer multiplier of size (NP) × (NP). Multiplying the ( ^NP ) bits of K _i , dividing the result by 2 ^NP , and then subtracting the quotient from y _i .

【００３６】ｙ_iに関する小数部ＭのＮビットと、整数
Ｋ_iに関する（Ｎ−Ｐ）ビットとは、ルックアップテー
ブル中に格納される。値ｙ_iおよびＫ_iは、以下の式に従
って、それぞれＮおよび（Ｎ−Ｐ）の精度で計算され
る。The N bits of the decimal part M for y _i and the (N−P) bits for integer K _i are stored in a look-up table. The values y _i and K _i are calculated with N and (N−P) precision, respectively, according to the following equations:

【００３７】[0037]

【数９】 (Equation 9)

【００３８】上式で、ｎはルックアップテーブル中のエ
ントリ数である。In the above equation, n is the number of entries in the look-up table.

【００３９】小数部Ｍの逆数を見つけるための線形補間
方法の最大誤差εは、ｎに応じて１から２の間にあるよ
うに正規化され、以下の式によって表される。The maximum error ε of the linear interpolation method for finding the reciprocal of the decimal part M is normalized to be between 1 and 2 depending on n, and is expressed by the following equation.

【００４０】[0040]

【数１０】 (Equation 10)

【００４１】必要とする最大誤差εが周知である場合、
ルックアップテーブル中のエントリの必要数ｎを式
（９）から決定することができる。If the required maximum error ε is known,
The required number n of entries in the lookup table can be determined from equation (9).

【００４２】例えば、ｎ＝６４に対しては、逆数を得る
ための線形補間方法の最大誤差εはε≒２．９８＊１０
^-5＞２^-16に等しく、ｎ＝１２８に対しては、線形補間
方法による最大誤差はε≒７．５４＊１０^-6＜２^-16に
等しい。For example, for n = 64, the maximum error ε of the linear interpolation method for obtaining the reciprocal is ε ≒ 2.98 * 10
^-5 > ^2-16 , and for n = 128, the maximum error by the linear interpolation method is equal to ε ≒ 7.54 * 10 ^-6 <2 ^-16 .

【００４３】図２は、上述のような方法を実装する、本
発明による逆数を計算するための装置１０を示す。入力
数Ｄの小数部ＭのＰ個のＭＳＢは、２^P個のエントリを
有するＲＯＭ１２のアドレス線を形成する。ＲＯＭ
１２は、線形補間を実行するために、小数部のＰ個のＭ
ＳＢの逆数ｙ_iに関するＮビットを格納し（しかし、逆
数の先頭ビットは常に「０」であるので、（Ｎ−１）ビ
ットを格納することで十分である）、補正係数Ｋ_iの
（Ｎ−Ｐ）ビットを格納することが好ましい。FIG. 2 shows an apparatus 10 for calculating a reciprocal according to the invention, implementing a method as described above. The P MSBs of the fractional part M of the input number D form an address line of the ROM 12 having 2 ^P entries. ROM
12 performs P fractional M executions to perform linear interpolation.
Stores N bits for reciprocal y _i of SB (However, since the first bit of the inverse is always "0", (N-1) it is sufficient to store a bit), the correction coefficient K _i (N -P) It is preferable to store bits.

【００４４】補正係数Ｋの（Ｎ−Ｐ）ビットは、サイズ
（Ｎ−Ｐ）×（Ｎ−Ｐ）を有する整数乗算器１４の１つ
の入力に供給される。乗算器１４の別の入力には、入力
小数部ＭのＰ個のＭＳＢのすぐ後の（Ｎ−Ｐ）ビットが
供給される。（Ｎ−Ｐ）×（Ｎ−Ｐ）ビット長を掛ける
ことによって得られる積の（Ｎ−Ｐ）個のＭＳＢＭＵ
は、サイズＮを有する整数減算器１６の入力に供給され
る。入力ＭＵのＰ個のＭＳＢはすべて「０」であり、積
の（Ｎ−Ｐ）個の最下位ビット（ＬＳＢ）は廃棄され
る。減算器１６の別の入力には、ＲＯＭ１２からの逆数
の近似ｙ_iのＮビット（図２にＩ₀として示す）が供給さ
れる。減算の結果は、装置１０のＮビット出力（図２に
Ｉとして示す）を形成する。The (NP) bits of the correction coefficient K are supplied to one input of an integer multiplier 14 having a size (NP) × (NP). Another input of the multiplier 14 is provided with the (NP) bits immediately following the P MSBs of the input fraction M. (NP) MSB MUs of the product obtained by multiplying by (NP) x (NP) bit length
Is supplied to the input of an integer subtractor 16 having size N. The P MSBs of the input MU are all "0" and the (NP) least significant bits (LSB) of the product are discarded. Another input of the subtractor 16 is supplied with N bits (shown as I _{0 in} FIG. 2) of the reciprocal approximation y _i from the ROM 12. The result of the subtraction forms the N-bit output of device 10 (shown as I in FIG. 2).

【００４５】入力小数部ＭのＰ個のＭＳＢが「１」に等
しい（すなわち、最上位ビットが「１」であり、他の
（Ｐ−１）ビットが「０」である）場合、出力ＩはＭＵ
の（Ｎ−Ｐ）ビットの１の補数として表すことができ、
計算が簡単になることに留意されたい。If the P MSBs of the input fraction M are equal to "1" (ie, the most significant bit is "1" and the other (P-1) bits are "0"), the output I Is MU
(N−P) bits of one's complement,
Note that the calculations are simplified.

【００４６】図３は、結果の精度をＮビットから２Ｎビ
ット精度に向上させるために、線形補間の結果のニュー
トンラフソン反復を実行するための装置２０を示す。補
間装置１０からの出力ＩのＮビットは、サイズＮ×２Ｎ
を有する整数乗算器２２の入力に供給される。乗算器２
２の別の入力には、マルチプレクサ２４からの出力の２
Ｎビットが供給される。マルチプレクサ２４は、（必要
な場合追加の「０」で埋められる）入力小数部Ｍの２Ｎ
個のＭＳＢと、２の補数装置２６の出力の２Ｎビットと
を交互に選択する。FIG. 3 shows an apparatus 20 for performing a Newton-Raphson iteration of the result of a linear interpolation to increase the precision of the result from N bits to 2N bits. The N bits of the output I from the interpolation device 10 are of size N × 2N
Is provided to the input of the integer multiplier 22. Multiplier 2
Another input of 2 is the output 2 of multiplexer 24.
N bits are provided. Multiplexer 24 provides 2N of input fraction M (filled with additional "0" if necessary).
The MSBs and the 2N bits of the output of the two's complement device 26 are alternately selected.

【００４７】乗算器２２は、３Ｎビット長の結果を生成
する。この乗算の積のＮ個の最下位の組が廃棄される。
積の２Ｎ個の最上位ビットＭＵ１が２の補数装置２６に
供給される。２Ｎビット長の２の補数装置２６の出力が
マルチプレクサ２４に供給される。乗算器２２を２回目
に通過する際、乗算の積の２Ｎ個のＭＳＢは、装置２０
の２Ｎビット出力（図３にＩ₁として示す）を形成す
る。The multiplier 22 generates a result having a length of 3N bits. The N least significant sets of products of this multiplication are discarded.
The 2N most significant bits MU1 of the product are provided to a two's complement unit 26. The output of the 2N-bit two's complement device 26 is supplied to the multiplexer 24. On the second pass through the multiplier 22, the 2N MSBs of the product of the multiplication are
2N bit output (shown as I _{1 in} FIG. 3).

【００４８】本発明の好ましい実施形態を例示的にのみ
示し、説明したが、頭記の特許請求の範囲で述べるよう
な本発明の範囲から逸脱することなく、変更および修正
を行うことができることは当業者には明らかであろう。While the preferred embodiment of the invention has been illustrated and described by way of example only, it is to be understood that changes and modifications can be made without departing from the scope of the invention as set forth in the claims below. It will be clear to those skilled in the art.

[Brief description of the drawings]

【図１】本発明で使用する線形補間方法を示すグラフで
ある。FIG. 1 is a graph showing a linear interpolation method used in the present invention.

【図２】逆数の小数部についてＮビットの精度を得るた
めの、本発明による線形補間装置を示すブロック図であ
る。FIG. 2 is a block diagram showing a linear interpolation device according to the present invention for obtaining N-bit precision for a reciprocal fraction part.

【図３】逆数の小数部について２Ｎビットの精度を得る
ための、ニュートンラフソン反復装置を示すブロック図
である。FIG. 3 is a block diagram showing a Newton-Raphson iterator for obtaining 2N-bit precision for the reciprocal fraction part.

[Explanation of symbols]

１２ＲＯＭ１４整数乗算器１６整数減算器２２乗算器２４マルチプレクサ２６２の補数装置 12 ROM 14 Integer Multiplier 16 Integer Subtractor 22 Multiplier 24 Multiplexer 26 Two's Complement Device

───────────────────────────────────────────────────── フロントページの続き (71)出願人 501488860 1550 16ｔｈＡｖｅｎｕｅ，Ｂｌｄｇ. Ｆ，２ｎｄＦｌｏｏｒＲｉｃｈｍｏｎｄＨｉｌｌ，ＯｎｔａｒｉｏＣａｎａｄａＬ４Ｂ３Ｋ９ (72)発明者アレクセイクロウグラブカナダエム８ワイ４シー５オンタリオ州エトビコーククラウンヒルプレイス９ナンバー310 (72)発明者ジエゾウカナダエム２エム３エー５オンタリオ州ノースヨークロウバンクコート 29 (72)発明者ダニエルガドマンソンカナダエル３エックス１エス６オンタリオ州ニューマーケットジェリーアベニュー 200 Ｆターム(参考） 5B016 AA05 BA07 CD01 EA15 5B022 BA01 FA06 ──────────────────────────────────────────────────続き Continuation of the front page (71) Applicant 501488860 1550 16th Avenue, Bldg. F, 2nd Floor Richmond d Hill, Ontario Cana da L4B 3K9 (72) Inventor Alexei Crowgrab Canada M8 W 4 Sea 5 Ontario Etobicoke Crown Hill Place 9 No. 310 (72) Inventor Jie Zou Canada M2M 3A5 Ontario North York Rowbank Court 29 (72) Inventor Daniel Gadmanson Canada El 3X 1S6 Ontario Newmarket Jerry Avenue 200 F term (reference) 5B016 AA05 BA07 CD01 EA15 5B022 BA01 FA06

Claims

[Claims]

1. A method for generating an output signal representing an output value approximating a reciprocal of an input value D having a normalized fractional part M (where 1 ≦ M <2) represented by the input signal. Thus, the input signal includes a set of N ₀ most significant bits, and the output signal has the desired precision ε = 2, where N ≦ N ₀
Approximate the reciprocal with ^-N , a. For a set of P most significant bits of the input signal,
generating the number n of entries in the plurality of look-up tables, where n = 2 ^P , i. i = 0,. . . , N-1 generating a set of input entries y _i in the first look-up table, including a set of N significant bits, as a first lookup table; ii. i = 0,. . . , N-1 to generate a set of input entries K _i in the second look-up table including a set of (N−P) valid bits, b. Finding entries y _i and K _i in a look-up table corresponding to the set of P most significant bits of the input signal; c. Multiplying K _i by a signal comprising a set of (N−P) significant bits following the set of P most significant bits of the input signal; d. From the set of N valid bits of entry y _i , (N
-P) drawing a set of the most significant bits.

2. Creating N entries in a look-up table, comprising: iii. A sub-step of calculating the minimum number l of look-up table entries required to obtain an accuracy higher than the desired accuracy, And iv. Assuming that 2 ^P−1 <l and 2 ^P ≧ l, n = 2 ^P
Sub-step of finding the required minimum number n of look-up table entries for.

3. The step of generating a set of input entries in said first look-up table comprises: (Equation 2) Here, i = 0,. . . B., n-1, x ₀ = 1, and x _{i + 1} = x _i + (1 / n); Find an entry y _i that contains a set of N significant bits, i = 0,. . . , N-1 for approximating the fractional part of y ＾ _i .

4. The step of generating a set of input entries in said second look-up table comprises: (Equation 3) Here, i = 0,. . . B., n-1, x ₀ = 1, and x _{i + 1} = x _i + (1 / n); Contains a set of (NP) valid bits, i =
0,. . . , N-1 to find an entry K _i approximating the integer part of K ＾ _i .

5. The precision ε = 2- ^N of an input value D having a normalized fraction M (1 ≦ M <2) including a set of N ₀ most significant bits, where N ₀ ≧ N. An apparatus for calculating a reciprocal I having at least one processor and a look-up table addressed as a function of the P most significant bits of a fraction M, comprising a set of N valid bits. a first memory having an output I ₀ including bit, to form a look-up table which is addressed as a function of the P most significant bits of the fractional part M, 1 set of (N-
A second memory having an output K containing P) significant bits, and two inputs of a set of (NP) valid bits following the fraction M and the set of P most significant bits of the output K. And a multiplier of size (NP) * (NP) having an output MU comprising a set of (NP) * (NP) significant bits; An apparatus, comprising: an adder / subtractor having two inputs connected to receive I ₀ and a set of (N−P) most significant bits of an output MU, respectively.

6. The memory of claim 1, wherein the first memory and the second memory are combined with a storage device that stores both I ₀ and K and are addressed as a function of the P most significant bits of the fractional part M. An apparatus according to claim 5, characterized in that:

7. The apparatus of claim 5, further comprising an apparatus for performing a programmed Newton-Raphson iteration based on I.

8. The apparatus according to claim 5, wherein said first memory comprises a read only memory (ROM).

9. The apparatus of claim 5, wherein said second memory comprises a read only memory (ROM).

10. The apparatus of claim 6, wherein said storage device comprises at least one read only memory (ROM).

11. A digital signal processing device comprising the device of claim 5.