JPH0887400A

JPH0887400A - Multiplier, adder and subtracter

Info

Publication number: JPH0887400A
Application number: JP6246989A
Authority: JP
Inventors: Yukihiro Ide; 出進博井; Harumune Matsubara; 原玄宗松
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1994-09-14
Filing date: 1994-09-14
Publication date: 1996-04-02
Anticipated expiration: 2019-02-03
Also published as: JP3493064B2

Abstract

PURPOSE: To constitute the multiplier, adder and subtracter in a compact device and speed up the operation of multiplication, addition, and subtraction by performing addition of a 3rd operand C simultaneously with partial multiplication of 1st and 2nd operands A and B. CONSTITUTION: The 1st and 2nd operands A and B are inputted to a multiplication array 301 and multiplied. When a secondary Booth's algorithm is used as multiplication algorithm, 12 partial products are added by a Wallace tree which is composed of full-adders and obtains the sum of partial products of the same weight (same digit). The 3rd operand C is also inputted to the multiplication array 301 and the values of the respective digits of the operand C are added directly to the partial products of the same weight by the adders of a Wallace tree corresponding to the respective digits of the operand C. Consequently, the multiplication array 301 calculates an arithmetic operation result A×B+C of multiplication, addition, and subtraction in carry save form. A final adder converts the arithmetic operation result A×B+C of multiplication, addition, and subtraction, found in the carry save form into a binary number.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、Ａ×Ｂ＋Ｃの演算を行
う乗加減算器の改良に関する。また、他の発明は、演算
装置等においてデータを指定された量だけ桁シフトする
バレルシフタの改良に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an improvement of a multiplier / subtractor for performing A * B + C calculation. Another invention relates to an improvement in a barrel shifter that shifts data by a specified amount in an arithmetic unit or the like.

【０００２】[0002]

[Prior art]

（乗加減算器）科学技術計算、コンピュータ・グラフィ
ックス、信号処理等では、膨大な量の演算を高速に処理
する必要がある。このような応用では行列計算が頻繁に
実行される。演算の多くは、Ａ×Ｂ＋Ｃを行う乗加減算
に帰着させることができる。そのため高速な乗加減算演
算器が要求されている。特に、高速な演算性能が要求さ
れるデジタル信号処理プロセッサ（ＤＳＰ）は、ハード
ウェアで構成された乗加減算演算器を実装している。(Multiplier / Adder / Subtractor) In scientific and technological calculations, computer graphics, signal processing, etc., it is necessary to process enormous amounts of operations at high speed. Matrix calculations are frequently performed in such applications. Many of the operations can be reduced to multiplicative addition / subtraction that performs A × B + C. Therefore, a high-speed multiplying / subtracting arithmetic unit is required. In particular, a digital signal processor (DSP), which requires high-speed arithmetic performance, is equipped with a multiplication / subtraction arithmetic unit composed of hardware.

【０００３】図８は、従来の乗加減算演算器の一例を示
すブロック図である。同図において、２進数の被乗数Ａ
と乗数Ｂとの積を求める乗算器は、例えば、公知のブー
ス（Booth ）のアルゴリズムによる、乗算を実行する乗
算アレイ１０１と、乗算アレイ１０１の和出力群Ｆｓｓ
及び桁上げ出力群Ｆｃｃ（乗算中間値）を２進数の形に
整える最終加算器１０２と、によって構成される。最終
加算器１０２の出力に得られる乗算器の出力Ｐ（＝（Ａ
×Ｂ））と、加算（あるいは減算）すべき２進数の加算
値（あるいは減算値）Ｃとを加算器１０３によって加算
（あるいは減算）して、（Ａ×Ｂ）＋Ｃを得る構成とな
っている。FIG. 8 is a block diagram showing an example of a conventional multiplication / subtraction arithmetic unit. In the figure, the binary multiplicand A
The multiplier that finds the product of the multiplier B and the multiplier B is, for example, a multiplication array 101 that executes multiplication by a known Booth algorithm, and a sum output group Fss of the multiplication array 101.
And a final adder 102 for arranging the carry output group Fcc (multiplication intermediate value) in the form of a binary number. The output P (= (A
× B)) and the binary addition value (or subtraction value) C to be added (or subtracted) are added (or subtracted) by the adder 103 to obtain (A × B) + C. There is.

【０００４】図９は、ブース（Booth ）のアルゴリズム
による乗算を行う乗算アレイ１０１の構成を示してお
り、被乗数Ａのリコード（再符号化）を行うブースのリ
コーダ１０１ａ、被乗数Ａと乗数Ｂとの部分積を生成す
る部分積生成回路１０１ｂ、重み付された部分積を累積
的に加算する部分積加算用桁上げ保存加算器群（例え
ば、ワラス（Wallace ）・ツリー）１０１ｃ、によって
構成される。FIG. 9 shows the configuration of a multiplication array 101 for performing multiplication by Booth's algorithm. It consists of a booth recorder 101a for recoding (re-encoding) a multiplicand A, a multiplicand A and a multiplier B. A partial product generation circuit 101b that generates a partial product and a carry save adder group for partial product addition (for example, Wallace tree) 101c that cumulatively adds the weighted partial products are configured.

【０００５】部分積生成回路１０１ｂは、例えば、一次
のブースのアルゴリズムを実行する場合には、次式に従
う部分積を生成する。The partial product generation circuit 101b generates a partial product according to the following equation, for example, when executing the primary Booth algorithm.

【０００６】ここで、被乗数Ａはｎビットの２進数（＝ａ_n，
ａ_n-1，…，ａ₁，ａ₀）、乗数Ｂはｎビットの２進数
（＝ｂ_n，ｂ_n-1，…，ｂ₁，ｂ₀）である。ｉはｉ番
目の乗算ステップを意味する。[0006] Here, the multiplicand A is an n-bit binary number (= a _n ,
_{a n-1, ..., a} 1, a 0), the multiplier B is an n-bit binary number _{_{(= b n, b n-}} 1, ..., a b _1, b _0). i means the i-th multiplication step.

【０００７】また、二次のブースのアルゴリズムの場合
は、次式で示されるような部分積を生成する。Further, in the case of the secondary Booth's algorithm, a partial product represented by the following equation is generated.

【０００８】Ｐ＝Ａ・Ｂ二次のブースのアルゴリズムでは、ｎがｎ／２に減少
し、部分積の加算量が減る。また、部分積の加算回数を
ｎ／３にした、三次のブースのアルゴリズムも知られて
いる。なお、ブースのアルゴリズムは良く知られてお
り、部分積を生成する乗算のアルゴリズムとして他のア
ルゴリズムを使用することもできる、等の理由により、
ブースのアルゴリズムについては詳述しない。[0008] P = A / B In the secondary Booth's algorithm, n is reduced to n / 2 and the addition amount of partial products is reduced. A cubic Booth algorithm is also known in which the number of partial product additions is n / 3. Note that Booth's algorithm is well known, and other algorithms can be used as an algorithm for multiplication to generate a partial product.
The Booth algorithm is not detailed.

【０００９】図１０は、部分積生成回路１０１ｂ及び部
分積加算用桁上げ保存加算器群１０１ｃを説明するもの
である。部分積生成回路１０１ｂが出力する、例えば、
１２ビットの被乗数Ａ及び乗数Ｂについての、部分積を
数値としての重み付をして配列した例を示している。同
図において、黒丸はある部分積を示している。部分積の
配列は、１２ビットの被乗数Ａ、１２ビットの乗数Ｂ、
により、行方向に２４ビット、列方向に１２ビットであ
り、同一列は同一桁を示す。なお、同図に示される部分
積ｂ0 ａ0 ，ｂ0 ａ1 ，…，ｂ11ａ11は、説明の便宜上
付されたものであり、上述した式の値に付される添字に
関連したものではない点に注意されたい。FIG. 10 illustrates the partial product generation circuit 101b and the partial product addition carry save adder group 101c. The partial product generation circuit 101b outputs, for example,
An example is shown in which partial products of the 12-bit multiplicand A and multiplier B are weighted and arranged as numerical values. In the figure, a black circle indicates a partial product. The array of partial products is a 12-bit multiplicand A, a 12-bit multiplier B,
According to the above, there are 24 bits in the row direction and 12 bits in the column direction, and the same column indicates the same digit. Note that the partial products b0 a0, b0 a1, ..., B11 a11 shown in the figure are added for convenience of explanation, and are not related to the subscripts added to the values of the above expressions. I want to.

【００１０】桁上げ保存加算器群１０１ｃは、生じた桁
上げを次段に繰上げるようにして桁上げを保存しなが
ら、各列の部分積を累積的に加算する。この累積加算に
はいわゆる木（ツリー）構造に配置された桁上げ保存加
算器（CPA:carry-save adder）、例えば、２進木やワラ
ス（Wallace ）ツリーを用いる。The carry save adder group 101c cumulatively adds the partial products of the respective columns while carrying the carried carry to the next stage and saving the carry. A carry save adder (CPA) arranged in a so-called tree structure, for example, a binary tree or a wallace tree is used for this cumulative addition.

【００１１】図１１は、部分積加算用桁上げ保存加算器
群１０１ｃにおいて、図１０に点線で示す一列分（ｂ0
ａ11，ｂ1 ａ10，…，ｂ11ａ0 ）についての加算を担
う、ワラス・ツリー構造の加算器群を示している。ツリ
ーを構成する、桁上げ保存加算器４ｗは、４つの入力
と、１つの桁上げ入力との、５つの入力に対応して、１
つの和出力、２つの桁上げ出力を備える。第１段の、３
つの桁上げ保存加算器４ｗの各入力には、同一桁（同じ
重み）の部分積ｂ0 ａ11，ｂ1 ａ11，…，ｂ11ａ0が入
力される。各部分積の和は、ツリー構造の第２段、第３
段の複数の加算器４ｗを伝搬し、図示しない桁上げ伝搬
加算器（CPA:carry-propagate adder ）に出力される。
同図中、各加算器４ｗへの１つ下位の桁の図示しないワ
ラス・ツリーからの桁上げ入力及び１つ上の桁の図示し
ないワラス・ツリーへの桁上げ出力は、夫々三角の矢印
によって示されている。他の桁（列）のワラス・ツリー
も部分積の入力数に応じた加算器を備えて同様に、配列
分だけ構成される。FIG. 11 shows the carry-save adder group 101c for partial product addition, which corresponds to one column (b0) indicated by the dotted line in FIG.
a11, b1 a10, ..., B11a0) is shown, which is a group of adders having a Wallace tree structure and which is responsible for addition. The carry save adder 4w forming the tree corresponds to four inputs and one carry input, and five 1
It has one sum output and two carry outputs. 1st stage, 3
Partial products b0 a11, b1 a11, ..., B11 a0 of the same digit (same weight) are input to the respective inputs of the carry save adder 4w. The sum of each partial product is the second and third stages of the tree structure.
The signal propagates through a plurality of adders 4w of stages and is output to a carry propagation adder (CPA: carry-propagate adder) not shown.
In the figure, the carry input from the Wallace tree (not shown) of the one lower digit to each adder 4w and the carry output to the Wallace tree (not shown) of the digit one above are shown by triangular arrows. It is shown. The Wallace trees of other digits (columns) also include adders according to the number of inputs of partial products, and are similarly configured by the array.

【００１２】図１２は、桁上げ保存加算器４ｗの構成例
を示している。加算器４ｗは、４つの入力ｄ0 〜ｄ3 か
ら桁上げ出力Ｃout を得る第１の論理回路グループと、
４つの入力ｄ0 〜ｄ3 の和と、下位桁からの桁上げ入力
とから和Ｓを得る第２の論理回路グループと、４つの入
力ｄ0 〜ｄ3 から生じる桁上げと、４つの入力ｄ0 〜ｄ
3 の和と、下位桁からの桁上げ入力Ｃinとから桁上げ出
力ＣＣを得る第３の論理回路グループと、によって構成
される。FIG. 12 shows a configuration example of the carry save adder 4w. The adder 4w includes a first logic circuit group for obtaining a carry output Cout from four inputs d0 to d3,
A second logic circuit group which obtains the sum S from the sum of the four inputs d0 to d3 and the carry input from the lower digit, the carry resulting from the four inputs d0 to d3 and the four inputs d0 to d.
And a third logic circuit group for obtaining a carry output CC from the carry input Cin from the lower digit.

【００１３】このようにして、部分積生成回路１０１ｂ
によって生成された被乗数Ａ及び乗数Ｂについての複数
の部分積は、部分積加算用桁上げ保存加算器群（例え
ば、ワラス・ツリー）１０１ｃによって桁上がりが上位
の桁に伝搬するように加算処理される。各桁の和出力、
桁上がり出力が乗算アレイ１０１から桁上げ伝搬加算器
によって構成される最終加算器１０２に与えられ、桁上
がりが上位桁に伝搬する最終の加算が行われ、２進数の
形の積Ｐに整えられる。そして、この積Ｐと加算数Ｃと
の加算が加算器１０３によって行われ、Ａ×Ｂ＋Ｃの演
算結果が得られる。In this way, the partial product generation circuit 101b
The plurality of partial products for the multiplicand A and the multiplier B generated by are subjected to addition processing by the carry save adder group for partial product addition (for example, Wallace tree) 101c so that the carry propagates to the upper digit. It Sum output of each digit,
The carry output is given from the multiplication array 101 to the final adder 102 constituted by the carry propagation adder, and the final addition is carried out so that the carry propagates to the upper digit, and the product P is arranged in the form of a binary number. . Then, the product P and the number of additions C are added by the adder 103, and the calculation result of A × B + C is obtained.

【００１４】図１３は、他の乗算器の構成例を示してお
り、同図において、図８と対応する部分には同一符号を
付し、かかる部分の説明は省略する。この例では、図８
に示される乗加減算器が乗算器と加算器とを連結して構
成されるのに対し、乗算器、加算器をバスやバイパス線
を介して接続することによって、独立した乗算器、加算
器を用いて乗加減算の演算を行う構成としている。この
構成では、乗加減算以外に、加算、乗算を独立に実行で
きるので演算の自由度が大きい利点がある。このため、
多くの汎用ＤＳＰをはじめ、汎用マイクロプロセッサの
殆どが、このような構成の演算器を実装している。FIG. 13 shows an example of the configuration of another multiplier. In FIG. 13, parts corresponding to those in FIG. 8 are designated by the same reference numerals, and the description of those parts will be omitted. In this example, FIG.
The multiplier / adder shown in (1) is configured by connecting the multiplier and the adder, while the multiplier and the adder are connected via the bus or the bypass line, so that the independent multiplier and the adder can be connected. The configuration is used to perform multiplication, addition, and subtraction operations. With this configuration, addition and multiplication can be performed independently in addition to multiplication, addition and subtraction, so that there is an advantage that the degree of freedom in calculation is large. For this reason,
Most general-purpose microprocessors including many general-purpose DSPs are equipped with an arithmetic unit having such a configuration.

【００１５】上述した乗算器と加算器とを直列に接続し
た形式の乗加減算器において不具合な点は、乗算終了後
に加算を行うため、独立した乗算器、加算器で夫々演算
を行った場合と、演算の実行時間に大差がないことであ
る。A problem with the above-described multiplier adder / subtractor of the type in which the multiplier and the adder are connected in series is that the addition is performed after the end of the multiplication, so that the operation is performed by an independent multiplier and the adder respectively. , There is no big difference in the execution time of the calculation.

【００１６】図１４は、かかる不具合を改善した乗加減
算器（浮動小数点型）の一例を示している。この構成に
よれば、乗加減算の演算を乗算あるいは加算を行うのに
必要な実行時間と同等な時間で実行することができるも
のであり、以下に説明する。FIG. 14 shows an example of a multiplier / adder / subtractor (floating point type) in which such a problem is improved. According to this configuration, the multiplication, addition, and subtraction operations can be executed in a time equivalent to the execution time required for multiplication or addition, which will be described below.

【００１７】図１４において、乗算アレイ２０１は、供
給される第１のオペランドＡの仮数部Ｆａと、第２のオ
ペランドＢの仮数部Ｆｂとの乗算を行う。乗算アレイ２
０１は、前述したように、木状に接続された桁上げ保存
加算器（ＣＳＡ）群によって構成されており、部分積が
キャリー・セーブ・フォーム（和成分Ｆｓｓ，キャリー
成分Ｆｃｃ）で得られる。この例を図１５に示す。桁合
わせシフタ２０２は、加算すべき第３のオペランドＣの
仮数部Ｆｃの桁合わせを行うためのバレルシフタであ
る。最終加算器９０３は、複数ビットの和成分Ｆｓｓ、
キャリー成分Ｆｃｃ、オペランドＣの仮数部Ｆｃの３つ
の入力を桁上げ伝搬加算する、桁上げ伝搬加算器であ
る。キャリー・セーブ・フォーム（和成分Ｆｓｓ，キャ
リー成分Ｆｃｃ）で求められた第１、第２のオペランド
の仮数部の積Ｆａ×Ｆｂを２進数に変換すると共に、第
３のオペランドの仮数部Ｆｃの加算を行う。先行１検出
回路２０４は、正規化の為に必要なシフト数を求める。
正規化シフタ２０５は、演算結果Ｆｂｂの正規化用のバ
レルシフタ、丸め回路９０６は演算結果Ｆｂｂに丸め処
理を行う。In FIG. 14, the multiplication array 201 multiplies the supplied mantissa part Fa of the first operand A and the supplied mantissa part Fb of the second operand B. Multiplication array 2
As described above, 01 is composed of a group of carry save adders (CSA) connected in a tree shape, and a partial product is obtained in a carry save form (sum component Fss, carry component Fcc). This example is shown in FIG. The digit alignment shifter 202 is a barrel shifter for performing digit alignment of the mantissa part Fc of the third operand C to be added. The final adder 903 has a multi-bit sum component Fss,
A carry propagation adder for carrying carry addition of three inputs of carry component Fcc and mantissa Fc of operand C. The product Fa × Fb of the mantissa parts of the first and second operands obtained by the carry save form (sum component Fss, carry component Fcc) is converted into a binary number, and the mantissa part Fc of the third operand is converted. Add. The leading 1 detection circuit 204 determines the number of shifts required for normalization.
The normalization shifter 205 is a barrel shifter for normalizing the calculation result Fbb, and the rounding circuit 906 rounds the calculation result Fbb.

【００１８】次に、この乗加減算器における演算処理動
作について説明する。まず、第１、第２のオペランド
Ａ，Ｂの仮数部Ｆａ，Ｆｂが、乗算アレイ２０１に入力
され乗算が行われる。積はキャリー・セーブ・フォーム
（和成分Ｆｓｓ，キャリー成分Ｆｃｃ）で求められる。
オペランドＣの仮数部Ｆｃは桁合わせシフタに入力さ
れ、乗算処理に並行して桁合わせが行われる。桁合わせ
は、｜Ｅａ＋Ｅｂ−Ｅｃ｜だけ、シフトすることに
よって行われる。ここで、Ｅａ，Ｅｂ，Ｅｃは、オペラ
ンドＡ，Ｂ，Ｃの指数部である。積（和成分Ｆｓｓ，キ
ャリー成分Ｆｃｃ）とオペランドＦｃは、３入力加算器
で加算され、２進数の乗加減算結果Ｆｂｂが求められ
る。乗加減算結果Ｆｂｂは、正規化され、さらに丸め処
理が行われる。Next, the operation processing operation in this multiplier / subtractor will be described. First, the mantissa parts Fa and Fb of the first and second operands A and B are input to the multiplication array 201 and multiplication is performed. The product is obtained by the carry save form (sum component Fss, carry component Fcc).
The mantissa part Fc of the operand C is input to the digit alignment shifter, and digit alignment is performed in parallel with the multiplication process. Digit alignment is performed by shifting by | Ea + Eb-Ec |. Here, Ea, Eb, Ec are the exponent parts of the operands A, B, C. The product (sum component Fss, carry component Fcc) and operand Fc are added by a 3-input adder to obtain a binary addition / subtraction result Fbb. The multiplication / subtraction result Fbb is normalized and further rounded.

【００１９】このように乗算アレイ２０１における乗算
と桁合わせシフタ２０２における桁合わせシフトとを並
行して実行し、積を求める最終加算（Ｆｓｓ＋Ｆｃｃ）
とオペランドＦｃとの加算を３入力加算器２０３を用い
て一度に実行することによって、乗加減算の演算を加
算、乗算と同じ実行時間内に実行することができる。In this way, the final addition (Fss + Fcc) for performing multiplication in the multiplication array 201 and digit shift in the digit shifter 202 in parallel to obtain a product.
By executing the addition of the and the operand Fc at once using the 3-input adder 203, the multiplication / subtraction operation can be executed within the same execution time as the addition and the multiplication.

【００２０】しかしながら、上記構成の乗加減算の演算
器においても別の不具合がある。すなわち、桁合わせを
行う桁合わせシフタ２０２、積の和成分Ｆｓｓ，キャリ
ー成分Ｆｃｃ、及びオペランドＦｃを加算する３入力加
算器２０３、正規化シフトの為のシフト数を算出する先
行１検出回路２０４に、約１６０ビット（ＩＥＥＥ７５
４規格浮動小数点倍精度の場合）のビット幅を必要とす
る。このため、ハードウェア量が増大し、実行速度の高
速化も妨げられている。（バレルシフタ）ところで、こ
のような演算装置においては、データを必要な量（桁
数）だけシフトするためにバレルシフタが用いられる。
バレルシフタにおいては、一般に、シフト量は２のべき
乗の数にて指定される。また、配線面積を最小とするた
めに、図２７に示すような構成となっている。However, there is another problem in the multiplier / adder / subtractor having the above configuration. That is, the digit shifter 202 that performs digit alignment, the three-input adder 203 that adds the product sum component Fss, the carry component Fcc, and the operand Fc, and the preceding 1 detection circuit 204 that calculates the shift number for the normalization shift. , About 160 bits (IEEE75
4 standard floating point double precision) is required. For this reason, the amount of hardware is increased, and the increase in execution speed is also hindered. (Barrel shifter) By the way, in such an arithmetic unit, a barrel shifter is used to shift data by a necessary amount (digit number).
In the barrel shifter, the shift amount is generally designated by a power of two. Further, in order to minimize the wiring area, the structure is as shown in FIG.

【００２１】すなわち、データ入力におけるビット数を
２ⁿとした場合、図２５に示すような２入力１出力のセ
レクタをｎ段使用して、バレルシフタを実現している。
このシフタが動作すると、信号の通過経路にあたるトラ
ンジスタ対がＯＮとなることから、ｎ×２ⁿ対のトラン
ジスタにおいて電力が消費されることになる。また、バ
レルシフタの動作速度は信号が通過する経路上にあるト
ランジスタ対の数によって大きく支配されることから、
従来の構成においてはｎ段分のトランジスタ対による遅
延時間によって動作速度が決定づけられている。That is, when the number of bits in the data input is 2 ⁿ , the barrel shifter is realized by using n stages of 2-input 1-output selectors as shown in FIG.
When this shifter operates, the transistor pair corresponding to the signal passing path is turned on, so that power is consumed in n × 2 ⁿ pairs of transistors. Also, since the operating speed of the barrel shifter is largely controlled by the number of transistor pairs on the path through which the signal passes,
In the conventional configuration, the operation speed is determined by the delay time due to the n-stage transistor pair.

【００２２】一般に、計算機システムでは、いわゆるデ
ータのビットシフト演算のために右方向シフトと左方向
シフトの両方向のシフト機能を具備している必要があ
る。そこで、図２８に示すような、左方向へのバレルシ
フタ及び右方向へのバレルシフタを別々に備え、データ
の左右へのシフト方向をセレクタによって選択してい
る。Generally, a computer system is required to have a rightward shift function and a leftward shift function for a so-called data bit shift operation. Therefore, as shown in FIG. 28, a barrel shifter to the left and a barrel shifter to the right are separately provided, and the left and right shift directions of data are selected by the selector.

【００２３】しかしながら、この構成においては配線量
の増加が顕著である。このため、図２９に示すような、
左あるいは右への単一方向にのみシフトを行うバレルシ
フタの前後に、ビット順を逆転するビット順逆転回路を
付加することによって、ハード量の増加を抑えながら左
右両方向へのビットシフトを実現するものが提案されて
いる。この場合には、ビット順逆転回路による動作速度
の低下が大きな問題となる。However, in this structure, the amount of wiring is remarkably increased. Therefore, as shown in FIG.
By adding a bit order inversion circuit that reverses the bit order before and after the barrel shifter that only shifts to the left or right in a single direction, bit shifting in both left and right directions is realized while suppressing an increase in the amount of hardware. Is proposed. In this case, the decrease of the operation speed due to the bit order inversion circuit becomes a big problem.

【００２４】従って、高速、低消費電力、省面積といっ
た、大規模集積回路におけるニーズを全て満たすことは
困難であり、使用個所に応じた回路の使い分けがなされ
ている。Therefore, it is difficult to meet all the needs of a large-scale integrated circuit such as high speed, low power consumption, and area saving, and the circuit is properly used according to the place of use.

【００２５】[0025]

[Problems to be Solved by the Invention]

（乗加減算器）このように、従来の乗加減算方式では演
算の実行に長時間を要する。また、乗加減算演算を高速
に実行するには非常にビット幅の大きい加算器、先行１
検出器等多くのハードウェアを必要とし、また、この点
が高速化の妨げとなっていた。(Multiplication / subtraction device) As described above, in the conventional multiplication / subtraction system, it takes a long time to execute an operation. Also, in order to execute the multiplication / subtraction operation at high speed, an adder with a very large bit width
A lot of hardware such as a detector is required, and this point hinders the speedup.

【００２６】よって、本発明は、比較的に少ないハード
ウェア構成で乗加減算の演算速度の高速化を実現し得る
乗加減算演算器を提供することを目的とする。（バレルシフタ）上述したバレルシフタの従来技術にお
いては、高速、低消費電力、省面積、という、一般的に
相矛盾するような要求については、ある程度の妥協を余
儀なくされてている。すなわち、従来の構成におけるバ
レルシフタでは、他の加算器のような回路とは異なり、
シフタを構成しているセレクタ回路のほとんど全てが動
作することから消費電力が大きくなる傾向がある。動作
速度についても、高速であることが要求される。更に、
バレルシフタでは配線領域の占める面積が大きく、マイ
クロプロセッサ等における演算回路の回路面積について
決定的な要因となり得る。また、上述した乗加減算器の
コンパクト化、動作の高速化の実現のために、そこで用
いられるバレルシフタについてもコンパクトな構成、動
作の高速化の実現が求められている。Therefore, it is an object of the present invention to provide a multiplication / addition / subtraction arithmetic unit which can realize a high-speed multiplication / subtraction arithmetic operation with a relatively small hardware configuration. (Barrel Shifter) In the above-described conventional barrel shifter technology, the requirements that are generally contradictory to each other, such as high speed, low power consumption, and area saving, have to be compromised to some extent. That is, in the barrel shifter in the conventional configuration, unlike the circuit like other adders,
Since almost all of the selector circuits constituting the shifter operate, power consumption tends to increase. The operating speed is also required to be high. Furthermore,
In the barrel shifter, the wiring area occupies a large area, which can be a decisive factor for the circuit area of the arithmetic circuit in a microprocessor or the like. In addition, in order to make the above-mentioned multiplier / subtractor compact and realize high-speed operation, it is required that the barrel shifter used therein also has a compact structure and high-speed operation.

【００２７】よって、他の発明は、このバレルシフタに
対する相矛盾する省面積、高速、低消費電力という３つ
の要求を、高い次元で両立させることを目的とする。Therefore, another object of the present invention is to satisfy the three requirements of the barrel shifter, which are contradictory area saving, high speed, and low power consumption, at a high level.

【００２８】[0028]

[Means for Solving the Problems]

（乗加減算器）上記目的を達成するために本発明の固定
小数点の乗加減算器は、入力される第１及び第２のオペ
ランドを乗算した乗算値と、入力される第３のオペラン
ドとの、加算あるいは減算を行う固定小数点演算の乗加
減算器において、所定乗算アルゴリズムに従って、上記
第１及び第２のオペランドに対応する複数の部分積を生
成する部分積生成回路と、上記複数の部分積の加算と、
上記第３のオペランドの加算若しくは減算とを、木構造
に配列された複数の加算器群によって行って減算して重
み付された複数の乗算中間値を出力する乗算アレイと、
上記複数の乗算中間値を加算して乗加減算値を求める桁
上げ伝搬型加算器と、を備えることを特徴とする。(Multiply-adder / subtractor) In order to achieve the above object, the fixed-point multiplier-adder / subtractor of the present invention comprises a multiplication value obtained by multiplying input first and second operands, and an input third operand. A fixed-point multiplication-adder / subtractor that performs addition or subtraction, and a partial product generation circuit that generates a plurality of partial products corresponding to the first and second operands according to a predetermined multiplication algorithm, and addition of the plurality of partial products When,
A multiplication array that performs addition or subtraction of the third operand by a plurality of adder groups arranged in a tree structure, subtracts the output, and outputs a plurality of weighted multiplication intermediate values;
A carry propagation type adder for adding the plurality of intermediate values for multiplication to obtain a multiplication / subtraction value.

【００２９】また、上記目的を達成するため本発明の浮
動小数点の乗加減算器は、入力される第１及び第２のオ
ペランドを乗算した乗算値と、入力される第３のオペラ
ンドとの、加算あるいは減算を行う浮動小数点演算の乗
加減算器において、所定乗算アルゴリズムに従って、上
記第１及び第２のオペランド各々の仮数部の各値に対応
する複数の部分積を生成する部分積生成回路と、上記第
３のオペランドの指数部の値が上記第１及び第２のオペ
ランドの積の指数部の値と等しくなるように、上記第３
のオペランドの仮数部の値の桁合わせを行うシフト回路
と、上記複数の部分積の加算と、桁合わせされた上記第
３のオペランドの仮数部の値のうち、上記第１及び第２
のオペランドの仮数部の積の演算に割当てられた桁数に
よって表される値内の、下位桁の各ディジットの値の加
算若しくは減算とを、木構造に配列された複数の加算器
群によって行って重み付された複数の乗算中間値を出力
する乗算アレイと、上記複数の乗算中間値を加算して下
位桁の乗加減算値を求める桁上げ伝搬型加算器と、桁合
わせされた上記第３のオペランドの仮数部の値のうち、
上記第１及び第２のオペランドの仮数部の積の演算に割
当てられた桁数によって表される値を超える、上位桁の
値と、上記桁上げ伝搬型加算器の桁上げ出力とを、加算
して上位桁出力値を得る増分加算器と、上記上位桁出力
値及び上記下位桁の乗加減算値をビット結合して最終乗
加減算値を得る手段と、を備えることを特徴とする。Further, in order to achieve the above object, the floating-point multiplying / subtracting device of the present invention adds the multiplication value obtained by multiplying the input first and second operands with the input third operand. Alternatively, in a multiplicative adder / subtractor of a floating-point operation that performs subtraction, a partial product generation circuit that generates a plurality of partial products corresponding to respective values of the mantissa part of each of the first and second operands according to a predetermined multiplication algorithm; The value of the exponent part of the third operand is equal to the value of the exponent part of the product of the first and second operands.
A shift circuit that performs digit alignment of the mantissa value of the operand, an addition of the plurality of partial products, and digit alignment of the first and second digit values of the mantissa value of the third operand.
The addition or subtraction of the value of each digit of the lower digit in the value represented by the number of digits assigned to the product of the mantissa part of the operand of is performed by a group of adders arranged in a tree structure. A multiplying array for outputting a plurality of weighted intermediate intermediate values, a carry propagation adder for adding the plurality of intermediate intermediate values to obtain a multiplying / subtracting value of a lower digit, and the third digit aligned Of the mantissa value of the operand of
Add the value of the upper digit exceeding the value represented by the number of digits assigned to the operation of the mantissa product of the first and second operands and the carry output of the carry propagation adder. And an increment adder for obtaining an upper digit output value, and means for bit-combining the upper digit output value and the lower digit multiplication, addition, and subtraction values to obtain a final multiplication, addition, and subtraction value.

【００３０】上記乗算アレイを構成する加算器群とし
て、和を和成分、桁上げ成分に分けて求める桁上げ保存
型加算器、冗長数を用いて加算するＳＤ加算器、ＰＤ加
算器、全加算器等を用いることが可能である。（バレルシフタ）上記目的を達成するために本発明のバ
レルシフタは、少なくともデータ入力、データ出力、制
御入力を持ち、入力されるデータを制御入力によって指
定されたビット数だけシフトして出力するバレルシフタ
において、供給されるｌ（整数）ビットからなる入力デ
ータから、４つのビット信号を出力するｌ個の信号シフ
ト手段と、各信号シフト手段が出力する４つのビット信
号の中から１つを選択するｌ個のセレクト手段と、供給
される制御入力をデコードして、各セレクト手段に選択
すべきビット信号を指令する制御信号を与える制御信号
生成手段と、を備え、上記ｌ個の信号シフト手段のうち
ｉ番目の信号シフト手段は、入力データのｉ番目のビッ
ト信号に対し、０ビット、ｎ（整数）ビット、ｍ（整
数）ビット、ｍ＋ｎビット、ビット位置をシフトした位
置にある４つのビット信号を出力するようになされる、
ことを特徴とする。上記信号シフト手段及びセレクト手
段は、入力データのビット数、取扱うシフトビット数の
範囲に応じて所要段数だけ縦列接続される。なお、通常
はｎ及びｍは、２のべき乗の数を用いる。As a group of adders constituting the above-mentioned multiplication array, a carry-save adder for dividing a sum into a sum component and a carry component to obtain it, an SD adder for adding using a redundant number, a PD adder, and a full addition It is possible to use a container or the like. (Barrel shifter) To achieve the above object, the barrel shifter of the present invention has at least a data input, a data output, and a control input, and a barrel shifter that shifts the input data by the number of bits designated by the control input and outputs the shifted data, From the supplied input data consisting of 1 (integer) bits, 1 signal shift means for outputting 4 bit signals and 1 for selecting 1 from the 4 bit signals output by each signal shift means. Selecting means and decoding means for decoding the supplied control input to give a control signal for instructing each selecting means a bit signal to be selected. The th signal shift means is 0 bit, n (integer) bit, m (integer) bit, m + n for the i th bit signal of the input data. Tsu DOO is adapted to output four-bit signal at a position obtained by shifting the bit position,
It is characterized by the following. The signal shift means and the select means are cascade-connected by a required number of stages according to the number of bits of input data and the range of the number of shift bits to be handled. It should be noted that normally n and m use powers of two.

【００３１】また、本発明の両方向シフトのバレルシフ
タは、少なくともデータ入力、データ出力、制御入力を
持ち、入力されるデータを制御入力によって指定された
ビット数だけシフトして出力するバレルシフタにおい
て、供給されるｌ（整数）ビットからなる入力データか
ら、４つのビット信号を出力するｌ個の第１の信号シフ
ト／ビット順逆転手段と、上記第１の信号シフト／ビッ
ト順逆転手段各々が出力する４つのビット信号の中から
１つを選択するｌ個の第１のセレクト手段と、上記ｌ個
の第１のセレクト手段から入力されるｌ個の出力から、
４つのビット信号を出力するｌ個の信号シフト手段と、
各信号シフト手段が出力する４つのビット信号の中から
１つを選択するｌ個の第２のセレクト手段と、上記ｌ個
の第２のセレクト手段から入力されるｌ個の出力から、
４つのビット信号を出力するｌ個の第２の信号シフト／
ビット順逆転手段と、前記第２の信号シフト／ビット順
逆転手段各々が出力する４つのビット信号の中から１つ
を選択するｌ個の第３のセレクト手段と、供給される制
御入力をデコードして、各セレクト手段に選択すべきビ
ット信号を指令する制御信号を与える制御信号生成手段
と、を備え、上記ｌ個の第１の信号シフト／ビット順逆
転手段のうちｉ番目の第１の信号シフト／ビット順逆転
手段は、入力データのｉ番目のビット信号に対し、変化
なし、ビット順逆転、ｏ（整数）ビットシフト、ビット
順逆転及びｏ（整数）ビットシフト、となる関係にある
４つのビット信号を出力し、上記ｌ個の信号シフト手段
のうちｉ番目の信号シフト手段は、入力されるｉ番目の
ビット信号に対し、シフトなし、ｐ（整数）＋ｑ（整
数）ビット、ｐ（整数）ビット、ｑ（整数）ビット、ビ
ット位置をシフトした位置にある４つのビット信号を出
力し、上記ｌ個の第２の信号シフト／ビット順逆転手段
のうちｉ番目の第２の信号シフト／ビット順逆転手段
は、入力データのｉ番目のビット信号に対し、変化な
し、ビット順逆転、ｒ（整数）ビットシフト、ｒ（整
数）ビットシフト及びビット順逆転、となる関係にある
４つのビット信号を出力する、ことを特徴とする。信号
シフト手段及びセレクト手段は所要段数だけ縦続接続す
ることができる。Further, the bidirectional shift barrel shifter of the present invention is provided in a barrel shifter which has at least a data input, a data output, and a control input, and shifts and outputs the input data by the number of bits designated by the control input. From the input data consisting of 1 (integer) bits, 1 first signal shift / bit order inversion means for outputting four bit signals and 4 output by each of the first signal shift / bit order inversion means. From the l first selecting means for selecting one from one bit signal and the l outputs inputted from the l first selecting means,
L signal shift means for outputting four bit signals,
From the l second select means for selecting one out of the four bit signals output by each signal shift means and the l output input from the l second select means,
L second signal shifts / outputting 4 bit signals /
Bit order inversion means, l third select means for selecting one from the four bit signals output by each of the second signal shift / bit order inversion means, and decoding of the supplied control input And a control signal generation means for giving a control signal for instructing a bit signal to be selected to each selection means, and the i-th first signal shift / bit order inversion means of the above l first signal shift / bit order inversion means. The signal shift / bit order inversion means has a relationship of no change, bit order inversion, o (integer) bit shift, bit order inversion and o (integer) bit shift with respect to the i-th bit signal of the input data. Four bit signals are output, and the i-th signal shift means among the l signal shift means does not shift the input i-th bit signal, p (integer) + q (integer) bits, p (Form ) Bits, q (integer) bits, and four bit signals at the bit position shifted positions are output, and the i-th second signal shift / of the l second signal shift / bit sequence inversion means is output. The bit order inversion means has four bits in a relationship of no change, bit order inversion, r (integer) bit shift, r (integer) bit shift and bit order inversion with respect to the i-th bit signal of the input data. It is characterized by outputting a signal. The signal shift means and the select means can be connected in cascade for the required number of stages.

【００３２】[0032]

[Action]

（乗加減算器）上記構成において、この発明では、まず
オペランドＡ、及びＢの仮数部Ｆａ，Ｆｂが、乗算アレ
イに入力され乗算が行われる。乗算アレイは木状に接続
されたキャリー・セーブ・アダーで構成されており、積
はキャリー・セーブ・フォームで求められる。オペラン
ドＣの仮数部Ｆｃは桁合わせシフタに入力され、Ｆａ，
Ｆｂの乗算処理に並行して桁合わせが行われる。桁合わ
せは、｜Ｅａ＋Ｅｂ−Ｅｃ｜だけ、シフトすることに
よって行われる。ここで、Ｅａ，Ｅｂ，Ｅｃは、オペラ
ンドＡ，Ｂ，Ｃの指数部である。桁合わせシフト結果Ｆ
c shift は、（Ｆａ×Ｆｂ）の各デジットと同じ桁にな
る部分Ｆc low とそれより上位の部分Ｆchighとに分割
される。Ｆc low は乗算アレイに入力され、部分積とと
もに加算され、積がキャリー・セーブ・フォーム（和成
分Ｆｓｓ，キャリー成分Ｆｃｃ）で求められる。和成分
Ｆｓｓ，キャリー成分Ｆｃｃは、最終加算器で加算さ
れ、乗加減算演算結果の下位側Ｆbb lowが求まる。ま
た、Ｆc highは、最終加算のキャリーの値によってイン
クリメントされ、乗加減算演算結果の上位側Ｆbb high
が求まる。また、インクリメント結果は、先行１検出回
路に入力され、正規化シフトに必要なシフト・ステップ
数を算出する。この結果をもとに、乗加減算演算結果Ｆ
ｂｂ（Ｆbb high とＦbb lowのビット結合）は正規化さ
れ、丸め処理が施される。（バレルシフタ）入力データのビット長がＬであると
き、本発明のバレルシフタにおける第１の構成において
は信号シフト手段ならびにセレクト手段を所要段数だけ
縦続接続したものを入力データの各ビットに対応する数
Ｌだけ並列に設けておき、与えられたシフトビット数か
らセレクト手段を動作させるための制御信号生成手段か
らの制御信号を、並列に並んでいる各ビット位置のセレ
クト手段に供給する。このとき、並列に並んでいるセレ
クト手段については、それぞれ同一の制御信号によって
動作することから、制御信号生成手段は垂直方向の各セ
レクト手段毎に一つあれば足りることになる。これらの
手段により、入力されたデータは各々の信号シフト手段
＋セレクト手段においてビット位置を順次変更されてい
き、最後に所望のビット数だけシフトした結果が得られ
る。ここで、Ｌ＝２ⁿ（ｎは整数）である場合について
考えると、従来の２入力１出力のセレクト手段を用いた
構成では、信号が入力されてから出力に現われるまで
に、合計ｎ段のセレクト手段を通過することになるが、
本発明による構成ではｎ／２段（ｎが奇数である場合は
ｎ／２＋１段）のセレクト手段を通過するだけで済むこ
とになり、高速化ならびに消費電力の低減を図ることが
できる。さらには、通常信号シフト手段とセレクト手段
とは入力データのビット列に対して垂直方向に段組みさ
れていることから、これらの手段を結合する配線におい
て、従来の２入力１出力セレクタを用いた方式では２組
必要となるのに対し、本発明による構成では１組で済
む。しかも、トランジスタのスイッチング確立を平均で
２５％低減できることから、更に消費電力を低減でき
る。また、セレクト手段に使用される素子の数は、２入
力１出力のセレクト手段を用いた場合に対して、８入力
１出力等の、４入力１出力以外の構成とした場合には増
加してしまうのに対し、４入力１出力のセレクト手段を
用いた場合には全く同じとなることから、ハードウエア
の増加量は最小限に抑えられることになる。(Multiply Adder / Subtractor) In the present invention, in the present invention, first, the mantissa parts Fa and Fb of the operands A and B are input to the multiplication array and multiplication is performed. The multiplication array is composed of carry save adders connected in a tree shape, and the product is obtained by the carry save form. The mantissa part Fc of the operand C is input to the digit adjustment shifter, and Fa,
Digit alignment is performed in parallel with the multiplication process of Fb. Digit alignment is performed by shifting by | Ea + Eb-Ec |. Here, Ea, Eb, Ec are the exponent parts of the operands A, B, C. Digit alignment shift result F
The c shift is divided into a portion Fc low having the same digit as each digit of (Fa × Fb) and a higher portion Fc high. Fc low is input to the multiplication array and added together with the partial product, and the product is obtained in the carry save form (sum component Fss, carry component Fcc). The sum component Fss and the carry component Fcc are added by the final adder to obtain the lower side Fbb low of the multiplication / subtraction operation result. Further, Fc high is incremented by the carry value of the final addition, and the higher-order side Fbb high of the multiplication / subtraction calculation result is obtained.
Is required. Further, the increment result is input to the leading 1 detection circuit, and the number of shift steps required for the normalization shift is calculated. Based on this result, the multiplication / subtraction operation result F
bb (bit combination of Fbb high and Fbb low) is normalized and rounded. (Barrel shifter) When the bit length of the input data is L, in the first configuration of the barrel shifter of the present invention, a number L corresponding to each bit of the input data is obtained by cascade-connecting the required number of signal shift means and select means. However, the control signal from the control signal generating means for operating the selecting means from the given number of shift bits is supplied to the selecting means at each bit position arranged in parallel. At this time, since the selecting means arranged in parallel are operated by the same control signal, only one control signal generating means is required for each selecting means in the vertical direction. By these means, the bit position of the input data is sequentially changed in each signal shift means + select means, and finally the result obtained by shifting the desired number of bits is obtained. Here, considering the case where L = 2 ⁿ (n is an integer), in the configuration using the conventional 2-input 1-output select means, there are n stages in total from the input of a signal to the output. It will pass the selection means,
With the configuration according to the present invention, it suffices to pass through n / 2 stages (n / 2 + 1 stages when n is an odd number) of the selection means, so that speeding up and reduction in power consumption can be achieved. Furthermore, since the normal signal shift means and the select means are vertically stacked with respect to the bit string of the input data, the conventional 2-input 1-output selector is used in the wiring connecting these means. In contrast, two sets are required, whereas the configuration according to the present invention requires only one set. Moreover, since the switching probability of the transistor can be reduced by 25% on average, the power consumption can be further reduced. Further, the number of elements used in the selecting means is increased when the configuration other than 4 inputs and 1 output such as 8 inputs and 1 output is used as compared with the case where the selecting means having 2 inputs and 1 output is used. On the other hand, when the 4-input 1-output select means is used, the same result is obtained, and the increase in hardware can be minimized.

【００３３】また、本発明のバレルシフタにおける第２
の構成においては、データ入力に近い側に、ビット順逆
転を行った後にシフトビット数の大きなシフトを行える
ような信号シフト／ビット順逆転手段ならびにセレクト
手段を縦続接続したものを入力データのビット長Ｌだけ
並列に設けておき、これに引き続いて、第１の構成にお
ける信号シフト手段＋セレクト手段を所要段数だけ縦続
接続してあり、最後にシフトビット数の大きなシフトを
行った後にビット順逆転を行えるような信号シフト／ビ
ット順逆転手段を接続する。このとき、制御信号生成手
段においては、異なる２つの量のシフトビット数に応じ
て制御信号を生成するものに加えて、あるシフトビット
量のシフト及び、ビット順逆転の有無に応じて制御信号
を生成するものを、この構成におけるシフトの最初と最
後のセレクタ段に対して用いる。まず、ビット順逆転を
行わない場合には、この最初と最後のセレクタ段におい
ては、ビット位置のシフトのみが行われ、ある方向への
ビットシフトが全体として行われることになる。そし
て、最初と最後の段において、ビット順逆転を行うと、
入力されたデータはまずビット順を逆転されたのちに所
定の量だけのビットシフト操作を受け、最後にもう一度
ビット順を逆転されることによって、逆方向へのシフト
動作が実現される。このとき、第１の構成におけると同
様の理由によって、従来の２入力１出力セレクタを用い
たシフタの前後にビット逆転機構を設けた構成に対し
て、信号が通過するセレクタの段数が大幅に減少する。
よって、シフト動作の高速化ならびに低消費電力化が達
成できる。さらには、入力データのビット数を２ⁿ（ｎ
は整数）とした時には、通常ビット順逆転を行うために
は２ⁿ本の配線が入力データのビット列に対して並行に
配置され、配線領域の増加によるシフタ回路面積の増大
が問題となるが、本発明による構成では、この２ⁿ本分
の配線を、２^n-1及び２^n-2ビットシフトを行うための
配線として共用することができるため、配線領域の面積
を２^n-1及び２^n-2本分低減することができる。また、
このとき入力データビット列に平行して敷設される配線
の最大長は、入力データビット列の横方向の長さに一致
するが、従来の構成ではビット順逆転を行ってから最大
ビット長のシフトを行うという最悪のケースでは、信号
が通過しなければならない配線長は、入力データビット
列の横方向の長さに、シフトによってビット位置が移動
する長さを加えたものとなることから、本発明により配
線による負荷容量を低減できる。なお、ビット順を逆転
させるための配線を具備していることから、ビット順の
変更をシフトの前後にて行う、あるいは単純にビット順
を逆転するような演算も可能である。The second aspect of the barrel shifter of the present invention
In the above configuration, a signal shift / bit order inversion means and a select means that are connected in cascade so that a large shift bit number can be shifted after performing bit order inversion on the side closer to the data input is the bit length of the input data. Only L are provided in parallel, and subsequently, the signal shift means + select means in the first configuration are cascade-connected by a required number of stages, and finally, a bit order inversion is performed after a large shift bit number shift. Connect signal shift / bit order inversion means as possible. At this time, the control signal generation means generates a control signal in accordance with two different amounts of shift bits, and also generates a control signal in accordance with the shift of a certain shift bit amount and the presence / absence of bit order inversion. What is produced is used for the first and last selector stages of the shift in this configuration. First, when bit order inversion is not performed, only the bit positions are shifted in the first and last selector stages, and the bit shift in a certain direction is performed as a whole. Then, when bit order inversion is performed in the first and last stages,
The input data is first bit-reversed and then subjected to a bit shift operation by a predetermined amount, and finally the bit order is re-inverted once again, whereby a shift operation in the reverse direction is realized. At this time, for the same reason as in the first configuration, the number of selector stages through which the signal passes is significantly reduced compared to the configuration in which the bit inversion mechanism is provided before and after the shifter using the conventional 2-input 1-output selector. To do.
Therefore, speeding up of the shift operation and low power consumption can be achieved. Furthermore, the number of bits of input data is 2 ⁿ (n
Is an integer), 2 ⁿ wirings are normally arranged in parallel to the bit string of the input data in order to perform bit order inversion, and an increase in the wiring area causes an increase in the shifter circuit area. in the arrangement according to the present invention, the wiring of the 2 ⁿ duty, 2 ^n-1 and 2 it is possible to share as a wiring for performing the ^n-2 bit shifting, the area of the wiring region 2 ^n-1 and 2 It can be reduced by ^n-2 lines. Also,
At this time, the maximum length of the wiring laid parallel to the input data bit string matches the horizontal length of the input data bit string, but in the conventional configuration, the bit order is reversed before the maximum bit length is shifted. In the worst case, the wiring length through which the signal must pass is the horizontal length of the input data bit string plus the length by which the bit position moves due to the shift. Can reduce the load capacity. Since the wiring for reversing the bit order is provided, it is possible to perform the operation of changing the bit order before and after the shift or simply reversing the bit order.

【００３４】[0034]

【実施例】以下、本発明の乗加減算器の実施例について
図面を参照して説明する。図１は、本発明の第１の実施
例である固定小数点乗加減算器を示すブロック図であ
り、同図において、図８と対応する部分には同一符号を
付している。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the multiplier / subtractor of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing a fixed-point multiplication / addition device according to a first embodiment of the present invention. In FIG. 1, parts corresponding to those in FIG.

【００３５】この実施例の固定小数点乗加減算器は、乗
算アレイ３０１において、オペランドＡ及びＢの重み付
された部分積を累積的に加算する木（ツリー）構造の加
算器群を利用してオペランドＣを加算若しくは減算し、
加算結果を桁上げ伝搬加算器である最終加算器１０２に
よって２進数の形に整える。The fixed point multiplication adder / subtractor of this embodiment uses an adder group of a tree structure for cumulatively adding the weighted partial products of the operands A and B in the multiplication array 301. Add or subtract C,
The addition result is arranged in the form of a binary number by the final adder 102 which is a carry propagation adder.

【００３６】図２は、乗算アレイ３０１の構成例を示し
ており、まず、図９に示す従来例とと同様に、供給され
る２つのオペランドＡ及びＢの乗算を行う。乗算アレイ
３０１は、例えば、２次のブースのアルゴリズムを用い
ており、２次のブースのリコーダ３０１ａ、部分積生成
回路３０１ｂ、部分積加算用桁上げ保存加算器群（例え
ば、ワラス・ツリー）３０１ｃ、により構成される。２
４ビット固定小数点の演算を行う場合には、２次のブー
スのアルゴリズムにより、１２個の部分積が生成され、
ツリー構造の桁上げ保存加算器群で加算される。積は、
キャリー・セーブ・フォーム（和成分Ｆｓｓとキャリー
成分Ｆｃｃ）で算出される。FIG. 2 shows an example of the structure of the multiplication array 301. First, as in the conventional example shown in FIG. 9, two supplied operands A and B are multiplied. The multiplication array 301 uses, for example, a quadratic Booth algorithm, and has a quadratic Booth recorder 301a, a partial product generation circuit 301b, and a carry save adder group for partial product addition (for example, Wallace tree) 301c. ,,. Two
When performing 4-bit fixed point arithmetic, 12 partial products are generated by the second order Booth's algorithm,
It is added by a carry save adder group having a tree structure. The product is
It is calculated by the carry save form (sum component Fss and carry component Fcc).

【００３７】図３は、部分積生成回路３０１ｂ及び部分
積加算用桁上げ保存加算器群３０１ｃを説明するもので
あり、同図において図１０と対応する部分には同一符号
を付している。この図は、部分積生成回路１０１ｂが出
力する、例えば、１２ビットの被乗数Ａ及び乗数Ｂにつ
いての、部分積を数値としての重み付をして配列した例
を概念的に示している。同図において、黒丸はある部分
積を示している。部分積の配列は、１２ビットの被乗数
Ａ、１２ビットの乗数Ｂ、により、行方向に２４ビッ
ト、列方向に１２ビットであり、同一列は同一桁を示
す。なお、同図に示される部分積ｂ0 ａ0 ，ｂ0 ａ1 ，
…，ｂ11ａ11は、説明の便宜上付されたものである。こ
の部分積の配列の中に、オペランドＣ（＝ｃ11，ｃ10，
…，ｃ1 ，ｃ0 ）が桁合せして配置される。減算の場合
はオペランドＣの補数の加算となる（以下、同様であ
る）。FIG. 3 illustrates the partial product generation circuit 301b and the carry save adder group 301c for partial product addition. In FIG. 3, parts corresponding to those in FIG. 10 are designated by the same reference numerals. This drawing conceptually shows an example in which partial products of, for example, a 12-bit multiplicand A and a multiplier B output from the partial product generation circuit 101b are weighted as numerical values and arranged. In the figure, a black circle indicates a partial product. The array of the partial products is 24 bits in the row direction and 12 bits in the column direction according to the multiplicand A of 12 bits and the multiplier B of 12 bits, and the same column indicates the same digit. The partial products b0 a0, b0 a1,
..., b11a11 are added for convenience of explanation. In the array of this partial product, the operand C (= c11, c10,
,, c1, c0) are aligned and arranged. In the case of subtraction, the complement of operand C is added (the same applies hereinafter).

【００３８】桁上げ保存加算器群３０１ｃは、生じた桁
上げを次段に繰上げるようにして桁上げを保存しなが
ら、各列の部分積を累積的に加算する。この累積加算に
はいわゆる木構造に配置された桁上げ保存加算器（CPA
）、例えば、２進木やワラス・ツリーを用いる。The carry save adder group 301c cumulatively adds the partial products of the respective columns while carrying the carried carry to the next stage and saving the carry. For this cumulative addition, a carry save adder (CPA) arranged in a so-called tree structure is used.
), For example, a binary tree or a Wallace tree is used.

【００３９】図４は、部分積加算用桁上げ保存加算器群
３０１ｃにおいて、図３に点線で示す一列分（ｂ0 ａ1
1，ｂ1 ａ10，…，ｂ11ａ0 ）についての加算を担う、
ワラス・ツリー構造の加算器群を示している。ツリーを
構成する、桁上げ保存加算器４ｗは、図１２に示される
ものと同じであり、４つの入力と、１つの桁上げ入力と
の、５つの入力に対応して、１つの和出力、２つの桁上
げ出力を備える。また、桁上げ保存加算器３ｗは、３つ
の入力に対応して、１つの和出力、１つの桁上げ出力を
備える全加算器である。第１段の、３つの桁上げ保存加
算器４ｗの各入力には、同一桁（同じ重み）の部分積ｂ
0 ａ11，ｂ1 ａ11，…，ｂ11ａ0 が加えられる。第２段
の加算器３ｗには、オペランドＣのｃ11あるいはその補
数が入力される。各部分積の和は、ツリー構造の第２
段、第３段の複数の加算器４ｗを伝搬し、オペランドＣ
の１ビット分ｃ11が加算されて、図示しない桁上げ伝搬
加算器（CPA ）である最終加算器１０２に出力される。
同図中、各加算器４ｗ，３ｗへの１つ下位の桁の図示し
ないワラス・ツリーからの桁上げ入力及び１つ上の桁の
図示しないワラス・ツリーへの桁上げ出力は、夫々三角
の矢印によって示されている。他の桁（列）のワラス・
ツリーも部分積の入力数に応じた数の加算器、オペラン
ドＣを部分積に加え合わせる加算器を備えて同様に、配
列分（桁数分）だけ構成される。FIG. 4 shows the carry-save adder group 301c for partial product addition, which corresponds to one column (b0 a1) indicated by the dotted line in FIG.
, B1 a10, ..., b11 a0)
The adder group of a Wallace tree structure is shown. The carry save adder 4w forming the tree is the same as that shown in FIG. 12, and one sum output corresponding to five inputs of four inputs and one carry input, It has two carry outputs. The carry save adder 3w is a full adder having one sum output and one carry output corresponding to the three inputs. For each input of the three carry save adders 4w in the first stage, partial products b of the same digit (same weight)
0 a11, b1 a11, ..., B11 a0 are added. The c11 of the operand C or its complement is input to the adder 3w in the second stage. The sum of each partial product is the second of the tree structure.
Stage and third stage multiple adders 4w propagate to operand C
Is added to the final adder 102, which is a carry propagation adder (CPA) (not shown).
In the figure, the carry input to the adders 4w and 3w from the Wallace tree (not shown) of the next lower digit and the carry output to the Wallace tree (not shown) of the next upper digit are triangular. It is indicated by an arrow. Wallas of other digits
The tree is also provided with an adder of a number corresponding to the number of inputs of the partial product and an adder for adding the operand C to the partial product, and is similarly configured by the array portion (number of digits).

【００４０】このように、本乗算アレイ３０１で最も重
要かつ特徴的な点は、図４に示すように外部より他の値
Ｃi を入力することができ、この値Ｃi を部分積と同様
にワラス・アレイの中で部分積の加算と同時に加算（あ
るいはｃの補数の加算による減算）をすることができる
ことである。図４では全加算器をツリー構造の加算器群
（ワラス・ツリー）に挿入して外部からの入力を実現し
た。全加算器を挿入することが、本乗算アレイのクリテ
ィカル・パスに全く影響を与えない利点に着目すべきで
ある。As described above, the most important and characteristic point of the multiplication array 301 is that it is possible to input another value Ci from the outside as shown in FIG. It is possible to perform addition (or subtraction by addition of complement of c) at the same time as addition of partial products in the array. In FIG. 4, the full adder is inserted into a tree-structured adder group (Walrus tree) to realize external input. It should be noted that inserting the full adder has no effect on the critical path of the present multiplying array.

【００４１】最終加算器１０２は、キャリー・セーブ・
フォーム（桁上げ成分Ｆｃｃ、和成分Ｆｓｓ）で出力さ
れたアレイ出力を２進数に変換する、例えば、２４ビッ
トのオペランドＡ及びＢの入力に対応して４８ビットを
出力する桁上げ伝搬加算器である。なお、乗算アレイを
構成する加算器群として、上述した和を和成分、桁上げ
成分に分けて求める桁上げ保存型加算器の他、冗長数を
用いて加算するＳＤ加算器、ＰＤ加算器、全加算器等を
用いることが可能である。The final adder 102 is a carry save
A carry propagation adder that converts the array output output in the form (carry component Fcc, sum component Fss) into a binary number, for example, outputs 48 bits corresponding to the inputs of 24-bit operands A and B is there. As a group of adders constituting the multiplication array, in addition to the carry-save adder that finds the above sum by dividing it into a sum component and a carry component, an SD adder, a PD adder that adds using a redundant number, It is possible to use a full adder or the like.

【００４２】次に、本乗加減算器の動作を説明する。ま
ず、第１及び第２のオペランドＡ，Ｂは、乗算アレイ３
０１に入力され乗算が行われる。乗算アルゴリズムとし
て２次のブースのアルゴリズムを用いた場合、１２個の
部分積が、全加算器４ｗにより構成される、同じ重み
（同じ桁）の部分積の和を得るワラス・ツリーで加算さ
れる。ワラス・ツリーは、各桁毎に設けられ、ある桁の
ツリーにおける桁上げ分が次段のツリーで加算されるよ
うにして逐次的に加算される。一方、第３のオペランド
Ｃも乗算アレイ３０１に入力され、オペランドＣの各桁
に対応するワラス・ツリーの加算器によってオペランド
Ｃの各桁の値が同じ重みの部分積と直接加算される。こ
の結果、乗算アレイからは、キャリー・セーブ・フォー
ムで乗加減算の演算結果Ａ×Ｂ＋Ｃが算出される。最終
加算器では、キャリー・セーブ・フォームで求められた
乗加減算の演算結果Ａ×Ｂ＋Ｃが２進数に変換される。Next, the operation of the main adder / subtractor will be described. First, the first and second operands A and B are the multiplication array 3
01 is input and multiplication is performed. When the secondary Booth's algorithm is used as the multiplication algorithm, 12 partial products are added by the Wallace tree configured by the full adder 4w to obtain the sum of partial products of the same weight (same digit). . The Wallace tree is provided for each digit, and the carry portion in the tree of a certain digit is added sequentially in the tree of the next stage. On the other hand, the third operand C is also input to the multiplication array 301, and the value of each digit of the operand C is directly added to the partial product of the same weight by the adder of the Wallace tree corresponding to each digit of the operand C. As a result, the multiplication / subtraction operation result A × B + C is calculated in the carry save form from the multiplication array. The final adder converts the multiplication / subtraction operation result A × B + C obtained by the carry save form into a binary number.

【００４３】このように第３のオペランドＣの加算を乗
算アレイ３０１で（Ａ×Ｂ）を求める部分積の加算と同
時に行うことによって、第１、第２のオペランドの積
（Ａ×Ｂ）と第３のオペランドＣを加算する加算器１０
３、２０３を省略することができる。これによって、高
速化かつ省ハードウェアである乗加減算の演算器を実現
することが可能となる。Thus, the addition of the third operand C is performed at the same time as the addition of the partial products for obtaining (A × B) in the multiplication array 301, so that the product (A × B) of the first and second operands is obtained. Adder 10 for adding the third operand C
3, 203 can be omitted. As a result, it is possible to realize a multiplier / adder / subtractor arithmetic unit that is faster and saves hardware.

【００４４】次に、第２の実施例について説明する。上
述した固定小数点乗加減算器を浮動小数点乗加減算器の
仮数部演算器に用いることによって、同様に高速化、省
ハードウェア化を図った浮動小数点乗加減算器を得るこ
とが可能となる。図５は、このような例を示す浮動小数
点乗加減算器の仮数部演算器を示したブロック図であ
る。Next, the second embodiment will be described. By using the above-mentioned fixed-point multiplication / subtraction device as the mantissa arithmetic unit of the floating-point multiplication / subtraction device, it is possible to obtain a floating-point multiplication / subtraction device that also has high speed and reduced hardware. FIG. 5 is a block diagram showing the mantissa arithmetic unit of the floating point adder / subtractor showing such an example.

【００４５】同図において、乗加減算器は、乗算アレイ
４０１、最終加算器４０２、桁合せシフタ４０３、イン
クリメンタ（増分加算器）４０４、先行１検出回路４０
５、正規化シフタ４０６、丸め回路４０７によって構成
される。Ａ×Ｂ＋Ｃの演算を行う浮動小数点乗加減算器
の場合、積（Ａ×Ｂ）の値よりも、Ｃの値が大きい場合
がある。そこで、この構成では、オペランドＡ，Ｂの仮
数部の積（Ｆａ×Ｆｂ）を表す２進値とオペランドＣの
仮数部Ｆｃを表わす２進値とが共通する桁幅（下位ビッ
トＦc low ）の部分について、乗算アレイ４０１を用い
て加算処理（Ｆａ×Ｆｂ＋Ｆc low ）を行う。Ｆｃが
（Ｆａ×Ｆｂ）を表す２進値の桁幅を超える桁幅の部分
（上位ビットＦc high）については、そのままＦc high
を出力する。ただし、加算処理（Ｆａ×Ｆｂ＋Ｆc low
）において桁上げが生じた場合、インクリメンタ４０
４によってＦc highに「１」を付加える。そして、正規
化シフタに、（Ｆａ×Ｆｂ＋Ｆc low ）と、Ｆc highあ
るいは桁上げによってインクリメントされたＦc high＋
１とを、入力して、桁を揃えたＦａ×Ｆｂ＋Ｆｃの２進
値表示を得、Ｆc highにおいて「１」となる先頭のビッ
ト位置（桁）を判別する先行１の検出結果に対応して正
規化する。そして、丸め回路４０７によって規定の丸め
処理を行う。In the figure, the multiplier / adder is a multiplication array 401, a final adder 402, a digit alignment shifter 403, an incrementer (incremental adder) 404, a leading 1 detection circuit 40.
5, a normalization shifter 406, and a rounding circuit 407. In the case of a floating-point multiplication adder / subtractor that performs the operation of A × B + C, the value of C may be larger than the value of the product (A × B). Therefore, in this configuration, the binary value representing the product (Fa × Fb) of the mantissa parts of the operands A and B and the binary value representing the mantissa part Fc of the operand C have a common digit width (lower bit Fc low). For the portion, addition processing (Fa × Fb + Fc low) is performed using the multiplication array 401. For the part of the digit width (higher-order bit Fc high) where Fc exceeds the digit width of the binary value representing (Fa × Fb), it is Fc high as it is.
Is output. However, addition processing (Fa x Fb + Fc low
), The incrementer 40
Add "1" to Fc high by 4. Then, in the normalization shifter, (Fa × Fb + Fc low) and Fc high or Fc high + incremented by carry
By inputting 1 and, the binary value display of Fa × Fb + Fc with aligned digits is obtained, and corresponding to the detection result of the preceding 1 which determines the leading bit position (digit) which becomes “1” in Fc high. Normalize. Then, the rounding circuit 407 performs a prescribed rounding process.

【００４６】次に、各部について説明する。乗算アレイ
４０１は、入力される正規化された２つのオペランドＡ
及びＢの仮数部Ｆａ及びＦｂの乗算を行う。乗算アレイ
４０１は、図１に示される乗算アレイ３０１と同様の構
成である。例えば、２次のブースのアルゴリズムを用い
ており、図２に示したような、２次のブースのリコー
ダ、部分積生成回路、部分積加算用桁上げ保存加算器群
（例えば、ワラス・ツリー）、によって構成される。倍
精度の演算を行う場合には、２７個の部分積が生成さ
れ、これらの各部分積はワラス・ツリーで加算され、キ
ャリー・セーブ・フォーム（桁上げ成分Ｆｃｃ，和成分
Ｆｓｓ）で積が算出される。Next, each part will be described. The multiplication array 401 receives two normalized operands A
And the mantissa parts Fa and Fb of B are multiplied. The multiplication array 401 has the same configuration as the multiplication array 301 shown in FIG. For example, a secondary booth algorithm is used, and a secondary booth recorder, a partial product generation circuit, and a carry save adder group for partial product addition (for example, Wallace Tree) as shown in FIG. , Composed by. When performing a double-precision operation, 27 partial products are generated, each of these partial products is added by the Wallace tree, and the product is carried by the carry save form (carry component Fcc, sum component Fss). It is calculated.

【００４７】図６は、第２の実施例における桁上げ保存
加算器４ｗを用いたワラス・ツリーの構成図である。４
段に配列された桁上げ保存加算器４ｗと、その間に挿入
された桁上げ保存加算器３ｗにより構成されている。前
述したように、桁上げ保存加算器４ｗは、４つの入力
と、１つの桁上げ入力との、５つの入力に対応して、１
つの和出力、２つの桁上げ出力を備える。また、桁上げ
保存加算器３ｗは、３つの入力に対応して、１つの和出
力、１つの桁上げ出力を備える。FIG. 6 is a block diagram of a Wallace tree using the carry save adder 4w in the second embodiment. Four
It is composed of a carry save adder 4w arranged in stages and a carry save adder 3w inserted therebetween. As described above, the carry save adder 4w corresponds to four inputs, one carry input, and five inputs.
It has one sum output and two carry outputs. The carry save adder 3w has one sum output and one carry output corresponding to the three inputs.

【００４８】この乗算アレイ４０１で最も重要かつ特徴
的な点は、図４に示されるワラス・ツリーと同様に、外
部より他の値を入力することができ、部分積と同様にツ
リー構造（例えば、ワラス・ツリー）に配列された加算
器群のアレイの中で部分積の加算と同時に加算すること
ができることである。図６では、ワラス・ツリーに、３
つの入力に対応して、１つの和出力、１つの桁上げ出力
を備える全加算器３ｗを挿入して外部からの入力を可能
とした。ワラス・ツリーに全加算器３ｗを挿入しても、
乗算アレイ３０１の演算時間に関するクリティカル・パ
スに全く影響を与えていない。The most important and characteristic point of this multiplication array 401 is that other values can be input from the outside like the Wallace tree shown in FIG. , Wallace Tree) can be added at the same time as the addition of partial products in an array of adders. In Figure 6, the Wallace tree has 3
Corresponding to one input, a full adder 3w having one sum output and one carry output was inserted to enable input from the outside. Even if the full adder 3w is inserted in the Wallace tree,
It has no effect on the critical path for the computation time of the multiplication array 301.

【００４９】桁合わせシフタ４０３は、正規化された第
３のオペランドＣの仮数部Ｆｃと、乗算アレイ４０１内
に得られる積（Ｆａ×Ｆｂ）との、桁合わせを行う、例
えば、１０６ビットのバレル・シフタである。桁合わせ
シフタ４０３には、シフト・ステップ数として３つのオ
ペランドの指数部より求められた値｜Ｅａ＋Ｅｂ−Ｅ
ｃ｜が入力され、その分だけオペランドＣの仮数部Ｆ
ｃがシフトされる。前述したように、Ｅａ，Ｅｂ，Ｅｃ
は、夫々オペランドＡ，Ｂ，Ｃの指数部である。The digit alignment shifter 403 performs digit alignment between the normalized mantissa part Fc of the third operand C and the product (Fa × Fb) obtained in the multiplication array 401, for example, of 106 bits. It is a barrel shifter. In the digit alignment shifter 403, the value obtained from the exponent part of the three operands as the number of shift steps | Ea + Eb−E
c | is input, and the mantissa part F of the operand C is input correspondingly.
c is shifted. As mentioned above, Ea, Eb, Ec
Are the exponents of operands A, B, and C, respectively.

【００５０】仮数部Ｆｃの桁合わせシフト結果Ｆc shif
t は、積（Ｆａ×Ｆｂ）の各デジットと同じ桁になる部
分Ｆc low と、それより上位の部分Ｆc highとに分割さ
れる。Ｆc low は乗算アレイ４０１に入力され、Ｆc hi
ghはインクリメンタ４０４に入力される。Digit adjustment shift result of mantissa Fc Fc shif
t is divided into a portion Fc low having the same digit as each digit of the product (Fa × Fb) and a higher portion Fc high. Fc low is input to the multiplication array 401, and Fc hi
gh is input to the incrementer 404.

【００５１】最終加算器４０２は、キャリー・セーブ・
フォームで出力されたアレイ出力（和成分Ｆｓｓ，桁上
げ成分Ｆｃｃ）を２進数Ｆbb lowに変換する１０６ビッ
トの桁上げ伝搬加算器である。Ｆbb lowは、正規化シフ
タ１０６に入力される。また、キャリーｃｃは、インク
リメンタ４０４に入力される。The final adder 402 is a carry save
It is a 106-bit carry propagation adder that converts the array output (sum component Fss, carry component Fcc) output in the form into a binary number Fbb low. Fbb low is input to the normalization shifter 106. The carry cc is also input to the incrementer 404.

【００５２】インクリメンタ４０４は、乗加減算の演算
結果の上位部分Ｆbb high を計算する５３ｂビットのイ
ンクリメンタである。インクリメンタ４０４では、Ｆc
high＋１を求め、最終加算器４０２のキャリーＣＣの値
によって、Ｆc high＋１，Ｆc highのいずれかを選択、
Ｆbb high として出力する。先行１検出回路４０５に、
Ｆbb high が入力され先行「１」の位置が検出される。
出力値Ｓnormは、正規化シフトのステップ数として、正
規化シフタ４０６に入力される。The incrementer 404 is a 53b-bit incrementer for calculating the upper part Fbb high of the multiplication / subtraction operation result. In incrementer 404, Fc
high + 1 is obtained, and either Fc high + 1 or Fc high is selected according to the carry CC value of the final adder 402,
Output as Fbb high. In the leading 1 detection circuit 405,
Fbb high is input and the position of the leading "1" is detected.
The output value Snorm is input to the normalization shifter 406 as the number of normalization shift steps.

【００５３】正規化シフタ４０６は、乗加減算の演算結
果Ｆｂｂ（Ｆbb high ，Ｆbb lowのビット結合）の正規
化を行うバレル・シフタである。５３ビット幅の先行１
検出回路４０５の出力Ｓnormだけ左にシフトされた出力
Ｆｎを得る。丸め回路４０７は、例えば、ＩＥＥＥ７
５４規格に従って、出力Ｆｎの丸め処理を行い、最終演
算結果Ｆｍａｃを得る。The normalization shifter 406 is a barrel shifter for normalizing the multiplication / subtraction operation result Fbb (bit combination of Fbb high and Fbb low). 53 bits wide leading 1
The output Fn shifted to the left by the output Snorm of the detection circuit 405 is obtained. The rounding circuit 407 is, for example, IEEE 7
According to the H.54 standard, the output Fn is rounded to obtain the final calculation result Fmac.

【００５４】次に、第２の実施例における（浮動小数
点）乗加減算器のより詳細な動作について、図５及び図
７を参照して説明する。図７は、乗加減算器におけるデ
ータ信号の流れを表している。Next, a more detailed operation of the (floating point) multiplier adder / subtractor in the second embodiment will be described with reference to FIGS. FIG. 7 shows the flow of data signals in the multiplier / subtractor.

【００５５】まず、正規化されている第１のオペランド
Ａの仮数部Ｆａ及び第２のオペランドＢの仮数部Ｆｂ
は、乗算アレイ４０１に入力され乗算が行われる。本実
施例では、２次のブースのアルゴリズムを用いているの
で、２７個の部分積が４段の４−２コンパクタ（加算器
４ｗ）により構成されるワラス・ツリー（図６）で加算
される。一方、正規化されている第３のオペランドＣの
仮数部Ｆｃは、桁合わせシフタ４０３に入力され、乗算
アレイ４０１におけるＦａ，Ｆｂの乗算処理に並行して
桁合わせが行われる。桁合わせは、オペランドＡ，Ｂ及
びＣ各々の指数部である、Ｅａ，Ｅｂ及びＥｃに基づい
て、｜Ｅａ＋Ｅｂ−Ｅｃ｜だけシフトすることによ
って行われる。桁合わせシフト結果Ｆc shift は、積
（Ｆａ×Ｆｂ）の各デジットと同じ桁になる部分Ｆc lo
w と、積（Ｆａ×Ｆ）の桁幅を超える上位の部分Ｆc hi
ghとに分割される。First, the normalized mantissa part Fa of the first operand A and the normalized mantissa part Fb of the second operand B are normalized.
Is input to the multiplication array 401 and multiplication is performed. Since the second-order Booth's algorithm is used in this embodiment, 27 partial products are added by the Wallace tree (FIG. 6) constituted by four stages of 4-2 compactors (adders 4w). . On the other hand, the normalized mantissa part Fc of the third operand C is input to the digit alignment shifter 403, and digit alignment is performed in parallel with the multiplication processing of Fa and Fb in the multiplication array 401. The digit alignment is performed by shifting by | Ea + Eb−Ec | based on Ea, Eb, and Ec which are exponents of the operands A, B, and C, respectively. The digit alignment shift result Fc shift is a portion Fc lo that has the same digit as each digit of the product (Fa × Fb).
w and the upper part Fc hi that exceeds the digit width of the product (Fa × F)
split into gh and.

【００５６】Ｆc low は乗算アレイ４０１に入力され、
部分積と共にワラス・ツリーで加算され、乗加減算の演
算結果の下位部分がキャリー・セーブ・フォーム（和成
分Ｆｓｓ，キャリー成分Ｆｃｃ）で求められる。これら
の和成分Ｆｓｓ，キャリー成分Ｆｃｃは、最終加算器４
０２で加算され、２進数の乗加減算の演算結果の下位部
分Ｆbb lowが算出される。この際、キャリーオーバが発
生すると、このオーバ分をＦc highのビット幅に担わせ
るべく、最終加算器４０２のキャリー出力ＣＣは、イン
クリメンタ４０４に入力される。Fc low is input to the multiplication array 401,
The partial product is added in the Wallace tree, and the lower part of the multiplication / subtraction operation result is obtained in the carry save form (sum component Fss, carry component Fcc). These sum component Fss and carry component Fcc are added to the final adder 4
02 is added to calculate the lower part Fbb low of the operation result of binary addition / subtraction. At this time, when a carry-over occurs, the carry output CC of the final adder 402 is input to the incrementer 404 so that the bit width of Fc high can be covered by the carry-over.

【００５７】一方、桁合わせシフト結果Ｆc shift の上
位部分Ｆc highはインクリメンタ４０４に入力される。
Ｆc highからＦc high＋１が生成される。インクリメン
タ４０４において、最終加算器４０２のキャリーＣＣの
値が「１」のとき、Ｆc high＋１が選択され、キャリー
ＣＣの値が「０」のとき、Ｆc highが選択される。選択
されたＦc high＋１あるいはＦc highが、乗加減算の演
算結果の上位部分Ｆbbhigh として出力される。また、
インクリメンタ４０４の出力Ｆbb high は先行１検出回
路４０５に入力され、正規化シフトに必要なシフト・ス
テップ数Ｓnormが算出される。乗加減算の演算結果Ｆbb
high 及びＦbb lowはビット結合され、正規化シフタ１
０６においてシフト・ステップ数Ｓnorm分だけシフトさ
れ、正規化される。正規化された正規化結果Ｆｎは、丸
め回路１０７において、有効桁数に満たない下位ビット
（sticky）について、例えば、ＩＥＥＥ−７５４に準じ
た丸めが施され、乗加減算の演算結果Ｆｍａｃを得る。On the other hand, the upper part Fc high of the digit alignment shift result Fc shift is input to the incrementer 404.
Fc high + 1 is generated from Fc high. In the incrementer 404, Fc high + 1 is selected when the carry CC value of the final adder 402 is "1", and Fc high is selected when the carry CC value is "0". The selected Fc high + 1 or Fc high is output as the upper part Fbbhigh of the multiplication / subtraction operation result. Also,
The output Fbb high of the incrementer 404 is input to the preceding 1 detection circuit 405, and the number of shift steps Snorm required for the normalized shift is calculated. Multiplication addition / subtraction operation result Fbb
high and Fbb low are bit-combined and normalized shifter 1
At 06, the number of shift steps is shifted by Snorm and normalized. The normalized normalization result Fn is rounded by the rounding circuit 107 for lower bits (sticky) less than the number of significant digits according to, for example, IEEE-754, and a multiplication / subtraction operation result Fmac is obtained.

【００５８】このように、第３のオペランドＣの加算を
乗算アレイ４０１で行うことにより、従来、約１６０ビ
ットのビット幅を必要とした、乗加減算器を構成する桁
合せシフタ２０２、加算器２０３、先行１検出器２０４
等の演算モジュールのビット幅を減らした構成を得るこ
とが可能となる。これによって、乗加減算の演算速度の
高速化を実現することができる。（バレルシフタ）次に、乗加減算器等の演算装置におい
て、データをシフトするために用いられるバレルシフタ
の改良について説明する。第１の実施例を図１６を参照
して説明する。同図は、入出力データ長が１６ビットで
あり、シフト量を４ビット長の２進数値ｂ8 ｂ4 ｂ2 ｂ
1 にて指定する単一方向バレルシフタの機能ブロック図
を示している。この例では、８、４、２、１ビットのシ
フトを行うかどうかを選択することによって、指定され
たシフト量だけのシフトを実現する。８ビット及び４ビ
ットのシフトは、１組の信号シフト手段と、４入力１出
力のセレクト手段とにて行われる。同様に、２ビット及
び１ビットのシフトは、１組のシフト段と、４入力１出
力のセレクト手段とにて実現される。As described above, by performing the addition of the third operand C by the multiplication array 401, conventionally, a digit shifter 202 and an adder 203, which constitute a multiplication / subtraction device, which required a bit width of about 160 bits. , Leading one detector 204
It is possible to obtain a configuration in which the bit width of the arithmetic module etc. is reduced. As a result, it is possible to realize a higher calculation speed for multiplication, addition and subtraction. (Barrel Shifter) Next, the improvement of the barrel shifter used for shifting data in an arithmetic unit such as a multiplier / subtractor will be described. A first embodiment will be described with reference to FIG. In this figure, the input / output data length is 16 bits, and the shift amount is a 4-bit binary value b8 b4 b2 b.
The functional block diagram of the unidirectional barrel shifter specified in 1 is shown. In this example, by selecting whether or not the shift of 8, 4, 2, 1 bits is performed, the shift by the designated shift amount is realized. 8-bit and 4-bit shifts are performed by a pair of signal shift means and 4-input 1-output select means. Similarly, 2-bit and 1-bit shifts are realized by a pair of shift stages and 4-input 1-output select means.

【００５９】図１７は、図１６に示された、初段の８ビ
ット＋４ビットシフトを行う４入力１出力のセレクト手
段ならびにセレクト手段を適切に動作させるための制御
信号を生成するセレクタ制御信号生成手段の構成例を示
している。図１６に示される１６ビット長のシフタを構
成するためには、もう一段、２ビット＋１ビットシフト
に対応する機構を縦続接続する。ここでは、セレクタ制
御信号生成手段に入力された２ビットのシフト量指示信
号をデコードして、４入力１出力のトランスミッション
ゲート形セレクタのトランジスタのうち、１組のｐチャ
ネル及びｎチャネルトランジスタを導通させることによ
り、信号シフト手段によって供給された信号の内の適切
なものを次の段あるいは出力に伝えるようになってい
る。このシフタについて、シフト量制御入力によって１
３ビットのシフトが指示された場合について考えて見
る。１３ビットというシフト量を２進数値にて表現する
と、（１１０１）となることから、８ビット、４ビット
及び１ビットシフトを行えば良いことになる。このこと
を図１６にて考えると、まず、初段の８ビット＋４ビッ
トシフト段においては、合計１２ビットのシフトが行わ
れることになり、続く２ビット＋１ビットのシフト段に
おいては、１ビットのシフトが行われることから、両段
合わせて１３ビットのシフトが実現される。例えば、デ
ータ入力においてビット１６にあったデータは、最初の
８ビット＋４ビットシフト段において、ビット位置が１
２ビット右方向にシフトされ、ビット位置が４ビット目
に相当する位置にまで移動する。その後、２ビット＋１
ビットシフト段において、１ビットだけ右方向にシフト
される結果、最終的には入力されたビット位置（１６）
から１３ビットだけ右にシフトした３ビット目のビット
位置にデータが現われる。FIG. 17 is a 4-input 1-output selecting means for shifting 8 bits + 4 bits in the first stage shown in FIG. 16 and a selector control signal generating means for generating a control signal for properly operating the selecting means. The example of composition of is shown. In order to form the 16-bit length shifter shown in FIG. 16, another mechanism corresponding to 2 bits + 1 bit shift is connected in cascade. Here, the 2-bit shift amount instruction signal input to the selector control signal generation means is decoded to make one set of p-channel and n-channel transistors conductive among the transistors of the 4-input 1-output transmission gate type selector. Thus, the appropriate one of the signals supplied by the signal shifting means is transmitted to the next stage or output. About this shifter, 1 by the shift amount control input
Consider the case where a 3-bit shift is instructed. When the shift amount of 13 bits is expressed by a binary value, it becomes (1101), so it is sufficient to perform 8-bit, 4-bit and 1-bit shifts. Considering this in FIG. 16, first, in the first 8-bit + 4-bit shift stage, a total of 12 bits are shifted, and in the subsequent 2-bit + 1-bit shift stage, 1-bit shift is performed. Therefore, a 13-bit shift is realized in both stages. For example, the data in bit 16 at the data input has a bit position of 1 in the first 8 bit + 4 bit shift stage.
It is shifted to the right by 2 bits and the bit position is moved to a position corresponding to the 4th bit. Then 2 bits + 1
In the bit shift stage, as a result of shifting rightward by 1 bit, the bit position (16) finally input
The data appears at the third bit position, which is shifted to the right by 13 bits from.

【００６０】図１８は、図１７において示した８ビット
＋４ビットシフト段における信号シフト手段ならびにセ
レクト手段の構成を示している。FIG. 18 shows the structure of the signal shift means and the select means in the 8-bit + 4-bit shift stage shown in FIG.

【００６１】また、図２４は、従来の構成によって、８
ビットシフト及び４ビットシフトを独立して実行する場
合の信号シフト手段ならびにセレクト手段の接続形態を
示している。この場合、信号シフト手段の配線が縦方向
に占める面積は８本（図中配線８本分の高さとして示さ
れている）＋４本（同じく、配線４本分の高さとして示
されている）＝１２本の配線によって決定されるが、本
発明による図１８の構成においても同じく１２本分の配
線面積となることから、構成を変更することによる信号
シフト手段の占有面積の変化（増加）はないことにな
る。また、図１９に示す、本発明における４入力１出力
セレクト手段をトランスミッションゲートにて構成した
場合の回路と、図２５に示す、これと等価な従来の構成
による回路とを比較すると、セレクタを動作させるため
の制御信号線の本数が従来の４本から本発明における構
成では８本と増加しはするものの、トランジスタの数と
しては全く同じであり、制御信号線を除いてはハードウ
エアの増加はない。In addition, FIG. 24 shows a conventional configuration of 8
The connection form of the signal shift means and the select means in the case of independently performing the bit shift and the 4-bit shift is shown. In this case, the area occupied by the wirings of the signal shift means in the vertical direction is 8 (shown as the height of 8 wirings in the figure) + 4 (also shown as the height of 4 wirings). ) = 12 wirings, but the wiring area for 12 wirings is also obtained in the configuration of FIG. 18 according to the present invention. Therefore, the area occupied by the signal shift means is changed (increased) by changing the configuration. There will be no. Further, comparing the circuit shown in FIG. 19 in which the 4-input 1-output selecting means of the present invention is configured with a transmission gate with the circuit shown in FIG. 25 having the equivalent conventional structure, the selector operates. Although the number of control signal lines for increasing the number is increased from eight in the conventional configuration to eight in the configuration of the present invention, the number of transistors is exactly the same, and the hardware is not increased except for the control signal lines. Absent.

【００６２】図１９及び図２５で、データ信号がそれぞ
れ上部から入力されて、出力されるまでの経路について
考えてみると、従来の構成では２つのトランジスタ対を
通過するのに対して、本発明による構成では１つのトラ
ンジスタ対を通過するだけであることから、信号経路に
おけるインピーダンスが半分に低下し、信号通過時の遅
延時間が減少するのと同時に、トランジスタ対における
電力消費量が減少することになる。19 and 25, considering the paths from the input of the data signal to the output of the data signal, the data signal passes through two transistor pairs in the conventional configuration. Since only one transistor pair is passed in the configuration according to, the impedance in the signal path is reduced to half, and the delay time at the time of passing the signal is reduced, and at the same time, the power consumption in the transistor pair is reduced. Become.

【００６３】すなわち、本発明のバレルシフタの第１の
実施例においては、セレクタの制御信号線の本数が若干
増加するが、制御信号を生成するためのハードウエアが
増加する代償として、シフタの高速化ならびに低消費電
力化を図れることになる。That is, in the first embodiment of the barrel shifter of the present invention, although the number of control signal lines of the selector is slightly increased, the speed of the shifter is increased at the cost of increased hardware for generating the control signal. In addition, low power consumption can be achieved.

【００６４】図２０はこの第１の実施例について、１段
のセレクタ段において実行されるビットシフト量の組み
合わせを変更したものであり、８ビット／１ビットシフ
トと、４ビット／２ビットシフトとを、それぞれ足し合
わせた数、すなわち図１８に示したような信号シフト手
段における横方向の配線の長さがほぼ等しくなるような
構成とすることによって、それぞれのセレクト手段にお
ける負荷容量を均等化させ、セレクタ段の前後に信号強
化用のバッファ回路を用いるような場合において、負荷
容量分担の設計を容易にするといった応用が可能であ
る。FIG. 20 shows a modification of the combination of bit shift amounts executed in one selector stage in the first embodiment, that is, 8 bit / 1 bit shift and 4 bit / 2 bit shift. By equalizing the load capacities in the respective selecting means by making the total number of wirings, that is, the lengths of the horizontal wirings in the signal shifting means as shown in FIG. In the case where a buffer circuit for signal enhancement is used before and after the selector stage, the application of facilitating the design of load capacity sharing is possible.

【００６５】図２１に、本発明のバレルシフタの第２の
実施例を示す。この実施例は、入出力データ長が１６ビ
ットであり、シフト量が４ビット長の２進数値にて指定
され、シフト方向が１ビットの信号にて指定される、両
方向バレルシフタである。この例では、データ入力側か
らデータ出力側に向って順に、ビット順逆転＋８ビット
シフト、２ビットシフト＋１ビットシフト、４ビットシ
フト＋ビット順逆転を、それぞれ実行するセレクタ段
が、信号シフト／ビット順逆転手段あるいは信号シフト
手段に引き続いて接続されている。FIG. 21 shows a second embodiment of the barrel shifter of the present invention. This embodiment is a bidirectional barrel shifter in which the input / output data length is 16 bits, the shift amount is designated by a 4-bit binary value, and the shift direction is designated by a 1-bit signal. In this example, selector stages that perform bit order inversion + 8 bit shift, 2 bit shift + 1 bit shift, 4 bit shift + bit order inversion in order from the data input side to the data output side are signal shift / bit. It is connected subsequently to the forward / reverse means or the signal shift means.

【００６６】図２２は、図２１に示されるビット順逆転
を含むセレクト手段ならびにセレクタ制御信号生成手段
の構成例である。本発明のバレルシフタの第１の実施例
において２種類のシフト量を組み合わせる代わりに、１
種類のシフト量と、ビット順逆転操作とを組み合わせて
１段としているのが特徴である。そして、図２１に示す
構成においては、その初段に図２２に示すビット順逆転
＋８ビットシフトを行うブロックを設置した後、第１の
実施例における図１７に示すような構成を用いた２ビッ
ト＋１ビットシフトを実行するブロックを縦続接続す
る。そして、最後に図２１に示す構成において４ビット
シフトを行ってからビット順を逆転させる機能を持たせ
た機能ブロックを設置することによって、全体の両方向
バレルシフタを構成している。すなわち、４入力１出力
のセレクト手段を用いることによる特質は、第１の実施
例におけるものと同一であることになる。FIG. 22 shows an example of the configuration of the selecting means and selector control signal generating means including the bit order inversion shown in FIG. Instead of combining two types of shift amounts in the first embodiment of the barrel shifter of the present invention,
The feature is that the shift amount of each kind and the bit order inversion operation are combined to form one stage. Then, in the configuration shown in FIG. 21, after the block for performing the bit order inversion + 8 bit shift shown in FIG. 22 is installed at the first stage, 2 bits + 1 using the configuration shown in FIG. 17 in the first embodiment. Cascade the blocks that perform bit shifting. Finally, in the configuration shown in FIG. 21, the entire bidirectional barrel shifter is configured by installing a functional block having a function of performing a 4-bit shift and then reversing the bit order. That is, the characteristics of using the 4-input 1-output select means are the same as those in the first embodiment.

【００６７】ここで、図２１に示す両方向シフタにおけ
るデータの流れについて説明する。例えば、１３ビット
の右方向へのシフトを行う場合について考える。図２１
に示したシフタにおいて、ビット順を逆転させない場合
に右方向へのシフトが行われるように信号シフト手段が
構成されているものとすると、まず、初段のビット順逆
転＋８ビットシフト段においては、８ビット右シフトの
みが行われる。そして、次の２ビット＋１ビットシフト
段において１ビットの右シフトが行われたのち、最後の
４ビットシフト＋ビット順逆転段においては、４ビット
右シフトのみを実行することによって、全体で１３ビッ
トの右シフトが実現される。すなわち、このシフタの初
段と最終段に設置されているビット順逆転機構を使用し
ないことによって、このシフタのディフォルトのシフト
方向である右方向へのシフトであることを指示し、入力
データのシフト量は、８ビットシフト＋４ビットシフト
＋１ビットシフトにより、１３ビットシフトとなる。The data flow in the bidirectional shifter shown in FIG. 21 will be described. For example, consider the case of shifting 13 bits to the right. Figure 21
In the shifter shown in FIG. 3, assuming that the signal shift means is configured so that the shift is performed to the right when the bit order is not reversed, first, in the first bit order inversion + 8 bit shift stage, Only bit shift right is performed. Then, after the right shift of 1 bit is performed in the next 2-bit + 1-bit shift stage, only the 4-bit right shift is executed in the last 4-bit shift + bit forward inversion stage, so that a total of 13 bits is obtained. The right shift of is realized. In other words, by not using the bit order inversion mechanism installed in the first stage and the last stage of this shifter, it is indicated that the shift is to the right, which is the default shift direction of this shifter. Becomes a 13-bit shift by 8 bit shift + 4 bit shift + 1 bit shift.

【００６８】ここで、入力データにおけるビット位置が
１６ビット目にあったデータが、シフタの１３ビット右
シフトの操作によってどのように流れていくのかについ
て説明すると、初段のビット順逆転＋８ビットシフト段
においては８ビットシフトのみの操作を受けて、１６ビ
ット目にあったデータのビット位置が８ビット目に移動
する。続いて、２ビット＋１ビットシフト段にて１ビッ
ト右シフトされて、ビット位置が７ビット目となる。そ
して、最後の４ビットシフト＋ビット順逆転手段におい
ては、４ビットシフトのみが行われて、最終的にはデー
タは３ビット目の位置に現われる。当初の１６ビット目
の位置から、１３ビットシフトした３ビット目の位置に
データがシフトされることがわかる。Here, it will be described how the data whose bit position in the input data is at the 16th bit flows by the operation of shifting the shifter 13 bits to the right. In, the bit position of the 16th bit data is moved to the 8th bit by the operation of only the 8 bit shift. Subsequently, the data is shifted right by 1 bit in the 2 bit + 1 bit shift stage, and the bit position becomes the 7th bit. Then, in the last 4-bit shift + bit order inversion means, only 4-bit shift is performed, and finally the data appears at the position of the third bit. It can be seen that the data is shifted from the initial position of the 16th bit to the position of the 3rd bit which is shifted by 13 bits.

【００６９】１３ビット左シフトの操作について説明す
る。この場合には、初段と最終段におけるビット順逆転
機構が両方動作する。まず、初段のビット順逆転＋８ビ
ットシフト段においては、入力データのビット順を逆転
した後に８ビット右シフトする。これにより、例えば、
当初０ビット目にあったデータは、７ビット目の位置に
まで移動する。そして、２ビット＋１ビットシフト段に
て１ビット右シフトされることによって６ビット目にビ
ット位置が移動する。最後の４ビットシフト＋ビット順
逆転段においては、６ビット目のデータを４ビット右シ
フトした後にビット順を逆転させる。これにより、６ビ
ット目の位置から論理的にはまず２ビット目の位置に移
動した後、ビット順逆転によって１３ビット目の位置に
移動することになる。０ビット位置のデータの１３ビッ
ト位置への左シフトが達成される。The operation of 13-bit left shift will be described. In this case, both the bit order inversion mechanism at the first stage and the last stage operate. First, in the bit order inversion + 8 bit shift stage of the first stage, the bit order of the input data is inverted and then right shifted by 8 bits. This gives, for example,
The data originally in the 0th bit moves to the position of the 7th bit. Then, the bit position is moved to the 6th bit by right shifting by 1 bit in the 2 bits + 1 bit shift stage. In the final 4-bit shift + bit order inversion stage, the bit order is reversed after right-shifting the sixth bit data by 4 bits. As a result, the position of the 6th bit is logically moved to the position of the 2nd bit and then to the position of the 13th bit by bit order inversion. A left shift of the 0 bit position data to the 13 bit position is achieved.

【００７０】図２３は、図２２に示す信号シフト／ビッ
ト順逆転手段の構成例を示している。縦方向へは１６本
分の配線スペースが必要となることが分かる。このと
き、配線容量が最大となる信号パスは、同図におけるｂ
0 から入力されて、ｘ15に出力されるものであり、ちょ
うど入力データ列の横幅に相当する距離となる。FIG. 23 shows a configuration example of the signal shift / bit order inversion means shown in FIG. It can be seen that 16 wiring spaces are required in the vertical direction. At this time, the signal path having the maximum wiring capacitance is b in FIG.
It is input from 0 and output to x15, which is exactly the width of the input data string.

【００７１】図２６は、同じ機能を有する信号シフト／
ビット順逆転手段を、従来の２入力１出力セレクト手段
を用いて構成した例を示している。この場合、ビット順
逆転のみを行う信号逆転手段においてすでに１６本分の
縦方向への配線領域が必要となっている。さらには８ビ
ットシフトを行うための８本分の配線スペースが加わる
ことから、合わせて２４本分の配線スペースが必要とな
る。すなわち、信号シフト／ビット順逆転手段に要する
配線領域の面積という観点では、本発明のバレルシフタ
を用いたものの方が小さくて済む。また、図２６に示さ
れる従来の構成では、信号が入力されてから出力される
までの最長の信号パスは、同図中のｂ0 から入力されて
ｙ15を経由してｚ7 に至るという経路であるが、本発明
による図２３に示すものと比べると、ｙ15からｚ7にい
たる配線長が余分に加わっている。従って、本発明のバ
レルシフタを用いることによって、信号の伝搬距離を短
縮することができるから、シフト動作の高速化を図るこ
とが可能となる。FIG. 26 shows a signal shift / function having the same function.
An example in which the bit-order inversion means is configured by using the conventional 2-input 1-output select means is shown. In this case, the signal reversing means for performing only the bit order reversal already requires 16 wiring areas in the vertical direction. Furthermore, since a wiring space for 8 lines is added for performing 8-bit shift, a total of 24 wiring spaces are required. That is, from the viewpoint of the area of the wiring region required for the signal shift / bit order inversion means, the one using the barrel shifter of the present invention can be made smaller. Further, in the conventional configuration shown in FIG. 26, the longest signal path from the input of a signal to the output thereof is the path from b0 in the figure to z7 via y15. However, as compared with the one shown in FIG. 23 according to the present invention, an additional wiring length from y15 to z7 is added. Therefore, by using the barrel shifter of the present invention, the propagation distance of the signal can be shortened, and the shift operation can be speeded up.

【００７２】[0072]

【The invention's effect】

（乗加減算器）以上説明したように、本発明の乗加減算
器においては、オペランドＡ，Ｂ及びＣの乗加減算（Ａ
×Ｂ＋Ｃ）の演算を行うに際し、第３のオペランドＣの
加算を、第１及び第２のオペランドＡ，Ｂの乗算を行う
乗算アレイにおける部分積の加算と同時に行うようにし
たので、従来の乗加減算器に比較して、乗加減算器を構
成するシフタ、加算器、先行１検出器等の演算モジュー
ルのビット幅を減らしてコンパクトに構成することが可
能となり、乗加減算の演算速度をより一層高速化するこ
とが可能となる。（バレルシフタ）シフトを行うセレクタを従来の２入力
１出力のものに対して４入力１出力のものを使用するこ
とによって、入力データ信号がシフトされて出力される
までに通過する信号経路のトランジスタ対の数を従来構
成に比して略１／２にまで減少させることが可能とな
り、信号の通過に要する時間が短縮される。よって、シ
フト動作の高速化及びがトランジスタの消費電力の低減
が達成される。(Multiply-adder / subtractor) As described above, in the multiplier-adder / subtractor of the present invention, the multiplier-adder / subtractor (A
XB + C), the third operand C is added at the same time as the partial product addition in the multiplication array for multiplying the first and second operands A and B. Compared with the adder / subtractor, it is possible to reduce the bit width of the arithmetic modules such as the shifter, adder, and leading 1 detector that compose the adder-subtractor, and to make the arithmetic module more compact. Can be converted. (Barrel shifter) By using a selector for performing shift, which has four inputs and one output, as compared with the conventional one having two inputs and one output, a transistor pair of a signal path through which an input data signal is shifted and output is output. It is possible to reduce the number of H.sub.2 to about 1/2 as compared with the conventional configuration, and the time required for passing a signal is shortened. Therefore, speeding up of the shift operation and reduction of power consumption of the transistor can be achieved.

【００７３】また、ビット順の逆転機構を利用した左右
両方向へのシフトにも同様に適用でき、配線面積の低減
と同時に低消費電力ならびに高速動作を達成することが
可能となる。勿論、本発明のバレルシフタを用いること
によって、乗算器等の演算装置のビット逆転演算を高速
かつ低消費電力にて実行することが可能となる。Further, the invention can be similarly applied to the shift in both the left and right directions using the bit-order inversion mechanism, and it is possible to reduce the wiring area and at the same time achieve low power consumption and high-speed operation. Of course, by using the barrel shifter of the present invention, it becomes possible to execute the bit inversion operation of an arithmetic unit such as a multiplier at high speed and low power consumption.

[Brief description of drawings]

【図１】本発明に係る乗加減算の構成を示すブロック
図。FIG. 1 is a block diagram showing a configuration of multiplication, addition and subtraction according to the present invention.

【図２】本発明に係る乗加減算器における乗算アレイの
構成を示すブロック図。FIG. 2 is a block diagram showing a configuration of a multiplication array in the multiplication / subtraction device according to the present invention.

【図３】ワラス・ツリーによる部分積と第３のオペラン
ドとの加算を説明する説明図。FIG. 3 is an explanatory diagram illustrating addition of a partial product by a Wallace tree and a third operand.

【図４】加算器群によるワラス・ツリーの構成例を示す
説明図。FIG. 4 is an explanatory diagram showing a configuration example of a Wallace tree by an adder group.

【図５】本発明に係る浮動小数点乗加減算器の構成を示
すブロック図。FIG. 5 is a block diagram showing a configuration of a floating point multiplication adder / subtractor according to the present invention.

【図６】加算器群によるワラス・ツリーの構成例を示す
説明図。FIG. 6 is an explanatory diagram showing a configuration example of a Wallace tree by an adder group.

【図７】本発明に係る浮動小数点乗加減算器の動作を説
明する説明図。FIG. 7 is an explanatory diagram for explaining the operation of the floating point multiplication adder / subtractor according to the present invention.

【図８】従来の乗加減算器の構成例を示すブロック図。FIG. 8 is a block diagram showing a configuration example of a conventional multiplying / subtracting device.

【図９】従来の乗加減算器における乗算アレイの構成を
示すブロック図。FIG. 9 is a block diagram showing a configuration of a multiplication array in a conventional multiplication / subtraction device.

【図１０】ワラス・ツリーによる部分積の加算を説明す
る説明図。FIG. 10 is an explanatory diagram illustrating addition of partial products by a Wallace tree.

【図１１】加算器群によるワラス・ツリーの構成例を示
す説明図。FIG. 11 is an explanatory diagram showing a configuration example of a Wallace tree by an adder group.

【図１２】加算器４ｗ（４−２コンパクタ）の構成例を
示す論理回路図。FIG. 12 is a logic circuit diagram showing a configuration example of an adder 4w (4-2 compactor).

【図１３】従来の乗加減算演算器の他の構成例を示すブ
ロック図。FIG. 13 is a block diagram showing another configuration example of a conventional multiply-add / subtract arithmetic unit.

【図１４】従来の浮動小数点乗加減算器の構成例を示す
ブロック図。FIG. 14 is a block diagram showing a configuration example of a conventional floating point multiplication adder / subtractor.

【図１５】乗算アレイ２０１におけるワラス・ツリーの
構成例を示すブロック図。FIG. 15 is a block diagram showing a configuration example of a Wallace tree in the multiplication array 201.

【図１６】本発明のバレルシフタの第１の実施例を示
す、１６ビット長単一方向バレルシフタの構成図。FIG. 16 is a configuration diagram of a 16-bit length unidirectional barrel shifter showing the first embodiment of the barrel shifter of the present invention.

【図１７】第１の実施例において使用されるセレクト手
段、セレクタ制御信号生成手段及び周辺回路の構成例を
示すブロック図。FIG. 17 is a block diagram showing a configuration example of a selection unit, a selector control signal generation unit and a peripheral circuit used in the first embodiment.

【図１８】本発明による１６ビット長バレルシフタの、
８ビット＋４ビット信号シフト手段の構成例を示すブロ
ック図。FIG. 18 shows a 16-bit length barrel shifter according to the present invention,
The block diagram which shows the structural example of 8-bit + 4-bit signal shift means.

【図１９】本発明における４入力１出力セレクト手段の
構成例を示すブロック図。FIG. 19 is a block diagram showing a configuration example of 4-input 1-output selecting means according to the present invention.

【図２０】上記バレルシフタの第１の実施例を、負荷分
散を考慮して構成変更した例を示すブロック図。FIG. 20 is a block diagram showing an example in which the configuration of the first embodiment of the barrel shifter is changed in consideration of load distribution.

【図２１】第２の実施例である、１６ビット長両方向バ
レルシフタの構成例を示すブロック図。FIG. 21 is a block diagram showing a configuration example of a 16-bit long bidirectional barrel shifter which is a second embodiment.

【図２２】第２の実施例において使用されるセレクト手
段、セレクタ制御信号生成手段及び周辺回路の構成例を
示すブロック図。FIG. 22 is a block diagram showing a configuration example of a selection unit, a selector control signal generation unit, and a peripheral circuit used in the second embodiment.

【図２３】第２の実施例における１６ビット長両方向バ
レルシフタの８ビット信号シフト／ビット順逆転手段の
構成例を示すブロック図。FIG. 23 is a block diagram showing a configuration example of 8-bit signal shift / bit order inversion means of a 16-bit length bidirectional barrel shifter in the second embodiment.

【図２４】バレルシフタの従来構成における、１６ビッ
ト長バレルシフタの８ビット及び４ビット信号シフト手
段の構成例を示すブロック図。FIG. 24 is a block diagram showing a configuration example of 8-bit and 4-bit signal shifting means of a 16-bit length barrel shifter in a conventional barrel shifter configuration.

【図２５】従来構成における２入力１出力セレクト手段
の構成例を示すブロック図。FIG. 25 is a block diagram showing a configuration example of a 2-input 1-output selecting unit in a conventional configuration.

【図２６】従来構成における１６ビット長両方向バレル
シフタのビット順逆転手段ならびに８ビット信号シフト
手段の構成例を示すブロック図。FIG. 26 is a block diagram showing a configuration example of bit order inversion means and 8-bit signal shift means of a 16-bit long bidirectional barrel shifter in a conventional configuration.

【図２７】従来の単一方向バレルシフタの構成例を示す
ブロック図。FIG. 27 is a block diagram showing a configuration example of a conventional unidirectional barrel shifter.

【図２８】独立した左及び右シフタの出力を選択するこ
とによって両方向バレルシフタを構成した例を示すブロ
ック図。FIG. 28 is a block diagram showing an example in which a bidirectional barrel shifter is configured by selecting outputs of independent left and right shifters.

【図２９】従来の構成における、単一方向バレルシフタ
の前後に入力データのビットの並びを逆に配列するビッ
ト順逆転機構を設けた両方向バレルシフタの構成例を示
すブロック図。FIG. 29 is a block diagram showing a configuration example of a bidirectional barrel shifter in which a bit order inversion mechanism that reversely arranges bits of input data is arranged before and after the unidirectional barrel shifter in the conventional configuration.

[Explanation of symbols]

１０１，２０１，３０１，４０１乗算アレイ２０２, ４０３桁合わせシフタ１０２，２０３，４０２，最終加算器４０４インクリメンタ２０４，４０５先行１検出回路２０５，４０６正規化シフタ２０６，４０７丸め回路 101, 201, 301, 401 Multiplying array 202, 403 Digit adjustment shifter 102, 203, 402, Final adder 404 Incrementer 204, 405 Leading 1 detection circuit 205, 406 Normalization shifter 206, 407 Rounding circuit

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁶ 識別記号庁内整理番号ＦＩ技術表示箇所Ｇ０６Ｆ 17/10 ─────────────────────────────────────────────────── ─── Continuation of the front page (51) Int.Cl. ⁶ Identification code Internal reference number FI technical display location G06F 17/10

Claims

[Claims]

1. A multi-point adder / subtractor of fixed-point arithmetic for performing addition or subtraction between a multiplication value obtained by multiplying input first and second operands and an input third operand, which is a predetermined multiplication. According to an algorithm, a partial product generation circuit that generates a plurality of partial products corresponding to the first and second operands, an addition of the plurality of partial products, and an addition or subtraction of the third operand are performed in a tree structure. A multiplying array for outputting a plurality of weighted intermediate values of multiplication performed by a plurality of adder groups arranged in a plurality of groups, and a carry propagation type adder for adding the plurality of intermediate values of multiplication to obtain a multiplication / subtraction value A multiplicative adder / subtractor, comprising:

2. A multiplicative adder / subtractor of a floating point operation for adding or subtracting a multiplication value obtained by multiplying an input first and second operand and an input third operand, the mantissa thereof A partial arithmetic unit for generating a plurality of partial products corresponding to respective values of the mantissa of each of the first and second operands according to a predetermined multiplication algorithm; and addition of the plurality of partial products, A multiplication array that performs addition or subtraction of the mantissa value of the third operand by a plurality of adder groups arranged in a tree structure, and outputs a plurality of weighted multiplication intermediate values; A carry-propagation adder for adding a multiplication intermediate value to obtain a multiplication addition / subtraction value, and a multiplication addition / subtraction device.

3. A multiply-adder / subtractor for floating-point arithmetic, which performs addition or subtraction between a multiplication value obtained by multiplying input first and second operands and an input third operand, the multiplication being a predetermined multiplication. A partial product generation circuit for generating a plurality of partial products corresponding to respective mantissa values of the first and second operands according to an algorithm; and a value of an exponent part of the third operand for the first and second operands. Two
A shift circuit that performs digit adjustment of the mantissa value of the third operand so as to be equal to the exponent value of the product of the operands, the addition of the partial products, and the digit-adjusted third operation. A multiplication array that performs addition or subtraction with a part or all of the mantissa value of the operand of by a plurality of adder groups arranged in a tree structure and outputs a plurality of weighted intermediate intermediate values; A carry propagation adder for adding the plurality of intermediate values for multiplication to obtain a multiplication / subtraction operation value, and a multiplication / subtraction device.

4. A multiply-adder / subtractor of a floating-point operation for performing addition or subtraction between a multiplication value obtained by multiplying input first and second operands and an input third operand, wherein the predetermined multiplication is performed. A partial product generation circuit for generating a plurality of partial products corresponding to respective mantissa values of the first and second operands according to an algorithm; and a value of an exponent part of the third operand for the first and second operands. Two
A shift circuit that performs digit adjustment of the mantissa value of the third operand so as to be equal to the exponent value of the product of the operands, the addition of the partial products, and the digit-adjusted third operation. Addition or subtraction of the value of each digit of the lower digit in the value represented by the number of digits assigned to the operation of the product of the mantissas of the first and second operands of the value of the mantissa of the operand And a multiplication array that outputs a plurality of weighted multiplication intermediate values by a plurality of adder groups arranged in a tree structure, and the plurality of multiplication intermediate values are added to obtain the addition, subtraction value of the lower digit. The carry propagation adder to be obtained, and the number of digits assigned to the operation of the product of the mantissa parts of the first and second operands among the mantissa values of the third operand that have been aligned. The value of the upper digit that exceeds the value And the carry output of the carry propagation type adder,
An increment adder for adding to obtain an upper digit output value, and a means for bit-combining the upper digit output value and the lower and upper digit addition and subtraction values to obtain a final multiplication, addition and subtraction value. vessel.

5. A carry-save adder that divides a sum into a sum component and a carry component as an adder group that constitutes the multiplication array, an SD adder that adds using a redundant number, a PD adder, 6. The adder / subtractor according to claim 1, wherein any one of full adders is used.

6. A barrel shifter which has at least a data input, a data output and a control input, shifts the input data by the number of bits designated by the control input, and outputs the shifted data, from the supplied l (integer) bits. From the input data
L signal shift means for outputting four bit signals, l select means for selecting one from the four bit signals output by each signal shift means, and decoding the supplied control input , A control signal generation means for giving a control signal for instructing a bit signal to be selected to each selection means, wherein the i-th signal shift means of the l signal shift means is the i-th bit of the input data. The barrel shifter is characterized in that it outputs 0 bits, n (integer) bits, m (integer) bits, m + n bits, and 4 bit signals at positions where bit positions are shifted with respect to the signal.

7. A barrel shifter which has at least a data input, a data output, and a control input, and shifts and outputs the input data by the number of bits designated by the control input, and from the supplied l (integer) bits. From the input data
L first signal shifts / outputting 4 bit signals /
Bit order inversion means, and l first first signals selecting one from four bit signals output by each of the first signal shift / bit order inversion means
Selecting means, l signal shifting means for outputting four bit signals from the l outputs inputted from the l first selecting means, and four bit signals outputted by each signal shifting means. From the l second select means for selecting one from among the 1 second outputs, and the l second outputs for outputting four bit signals from the l outputs inputted from the l second select means. A signal shift / bit order inversion means, and one third third one selecting one from the four bit signals output by each of the second signal shift / bit order inversion means.
Selection means and control signal generation means for decoding the supplied control input to give a control signal for instructing each selection means of a bit signal to be selected. The i-th first signal shift / bit order inversion means of the bit order inversion means does not change, bit order inversion, o (integer) bit shift, bit order inversion, and i-th bit signal of the input data. o
(Integer) bit shift, which outputs four bit signals, and the i-th signal shift means of the l signal shift means does not shift the input i-th bit signal, p (integer) + q (integer) bits, p (integer) bits, q (integer) bits, and outputs four bit signals at positions where bit positions are shifted, and the l second signal shift / bit order The i-th second signal shift / bit order inversion means of the inversion means does not change, bit order inversion, r (integer) bit shift, r (integer) bit shift with respect to the i-th bit signal of the input data. A bidirectional shift barrel shifter, which outputs four bit signals that are in a relationship of: and bit order inversion.

8. The barrel shifter according to claim 6, wherein the signal shift means and the select means are connected in cascade for a required number of stages.