JPH06274318A

JPH06274318A - High speed arithmetic unit

Info

Publication number: JPH06274318A
Application number: JP5058168A
Authority: JP
Inventors: Hideho Sawada; 秀穂澤田; Yoriko Minoda; 依子箕田; Yuka Takizawa; ユカ滝沢; Fumihiro Maruyama; 文宏丸山
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1993-03-18
Filing date: 1993-03-18
Publication date: 1994-09-30

Abstract

PURPOSE:To simultaneously execute an arithmetic operation including the arithmetic operator of more than two on a high speed arithmetic unit in a processor in a computer system. CONSTITUTION:Registers 21, 22, 22,..., and 2n temporarily hold numerical data D1, D2,..., and Dm constituting bit data of four bits and the like, for example, received from a storage means 10. An allocation means 30 allocates bit data BD for respective pieces of numerical data D1, D2,..., and D, outputted from the registers 21, 22,... and 2n. An addition means 40 executes addition at every bit data BD allocated to the allocation means 30, and outputs an arithmetic result AD.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は計算機システムの高速演
算装置に関し、特に中央処理装置（ＣＰＵ）内で算術演
算を高速に行う高速演算装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a high speed arithmetic unit for a computer system, and more particularly to a high speed arithmetic unit for performing arithmetic operations in a central processing unit (CPU) at high speed.

【０００２】ワークステーション及びパーソナルコンピ
ュータ等の計算機システムでは、数値計算処理のみなら
ず様々な処理においても、例えば相対アドレス演算等の
ような算術演算が必要になる。In computer systems such as workstations and personal computers, arithmetic operations such as relative address operations are required not only in numerical calculation processing but also in various processing.

【０００３】このような算術演算の中には、四則演算子
（＋，−，×，÷）が２以上現れるものも少なくない。
そして、計算機システム全体の高速化のためには、２以
上の四則演算子を含む算術演算の高速化が要請されてい
る。Among such arithmetic operations, there are many cases where two or more arithmetic operators (+,-, x, ÷) appear.
In order to increase the speed of the entire computer system, it is required to increase the speed of arithmetic operations including two or more arithmetic operators.

【０００４】[0004]

【従来の技術】従来の計算機システムにおけるプロセッ
サ内の演算装置は、一度に一つの算術演算しかできなか
った。このため、２以上の四則演算子を含む算術演算を
行う場合には、制御装置のマイクロプログラムにより、
四則演算子ごとに分割して算術演算を行なっていた。2. Description of the Related Art An arithmetic unit in a processor in a conventional computer system can perform only one arithmetic operation at a time. Therefore, when performing an arithmetic operation including two or more four arithmetic operators, the microprogram of the control device
Arithmetic operations were performed by dividing each of the four arithmetic operators.

【０００５】例えば３つの加算演算子を含む算術演算
「ａ＋ｂ＋ｃ＋ｄ」を行う場合、まず「ａ＋ｂ」を行な
って「ｅ」とし、次に「ｅ＋ｃ」を行なって「ｆ」と
し、最後に「ｆ＋ｄ」を行なって当該算術演算の結果を
求めていた。For example, when performing an arithmetic operation "a + b + c + d" including three addition operators, first "a + b" is performed to "e", then "e + c" is performed to "f", and finally "f + d". To obtain the result of the arithmetic operation.

【０００６】[0006]

【発明が解決しようとする課題】しかし、四則演算子ご
とに分割して算術演算を行う方法では、四則演算子が増
加するにつれて時間がかかるため、計算機システム全体
を高速化することができないという問題点があった。However, in the method of performing arithmetic operations by dividing each of the four arithmetic operators, it takes time as the number of arithmetic operators increases, so that the speed of the entire computer system cannot be increased. There was a point.

【０００７】また、大型並列計算機システムでは、２以
上の四則演算子を含む算術演算を一度に行うことができ
る反面、高価であるという問題点があった。本発明はこ
のような点に鑑みてなされたものであり、安価で、２以
上の四則演算子を含む算術演算を同時に行なって高速化
を図った高速演算装置を、提供することを目的とする。Further, the large-scale parallel computer system has a problem that it is expensive although it can perform arithmetic operations including two or more arithmetic operators at once. The present invention has been made in view of the above circumstances, and an object of the present invention is to provide a high-speed arithmetic device that is inexpensive and speeds up by simultaneously performing arithmetic operations including two or more four arithmetic operators. .

【０００８】[0008]

【課題を解決するための手段】図１は上記目的を達成す
る本発明の原理説明図である。本発明の高速演算装置
は、記憶手段１０、レジスタ２１，２２，・・・，２
ｎ、割付手段３０及び加算手段４０の各要素から構成さ
れる。FIG. 1 is a diagram for explaining the principle of the present invention for achieving the above object. The high-speed arithmetic device of the present invention comprises a storage means 10, registers 21, 22, ..., 2
n, allocation means 30 and addition means 40.

【０００９】レジスタ２１，２２，・・・，２ｎは、記
憶手段１０から受けた数値データＤ１，Ｄ２，・・・，
Ｄｍを一時的に保持する。割付手段３０は、上記レジス
タ２１，２２，・・・，２ｎから出力された数値データ
Ｄ１，Ｄ２，・・・，Ｄｍごとに、ビットデータＢＤを
割り付ける。加算手段４０は、上記割付手段３０で割り
付けられたビットデータＢＤごとに加算を行なって、算
術結果ＡＤを出力する。Registers 21, 22, ..., 2n receive numerical data D1, D2 ,.
Hold Dm temporarily. The allocating means 30 allocates the bit data BD to each of the numerical data D1, D2, ..., Dm output from the registers 21, 22 ,. The adding means 40 performs addition for each bit data BD allocated by the allocating means 30 and outputs the arithmetic result AD.

【００１０】なお、必要に応じて、記憶手段１０とレジ
スタ２１，２２，・・・，２ｎとの間にはメモリ管理手
段を、レジスタ２１，２２，・・・，２ｎと割付手段３
０との間には補数回路又は部分積算出手段を、さらに設
けてもよい。If necessary, a memory management means is provided between the storage means 10 and the registers 21, 22, ..., 2n, and the registers 21, 22 ,.
A complement circuit or a partial product calculation means may be further provided between 0 and 0.

【００１１】メモリ管理手段は、レジスタ２１，２２，
・・・，２ｎの数に応じて数値データＤ１，Ｄ２，・・
・，Ｄｍを出力するとともに、加算手段４０から出力さ
れた算術結果ＡＤを記憶手段１０へ格納する等の算術演
算に必要なデータの入出力を管理する。補数回路はレジ
スタ２１，２２，・・・，２ｎごとに対応して、２の補
数を求める。部分積算出手段は、ブースの方法（Booth
algorithm ）による部分積を求める。The memory management means are registers 21, 22,
... Numerical data D1, D2, ... according to the number of 2n
, Dm, and manages the input / output of data necessary for arithmetic operations such as storing the arithmetic result AD output from the adding means 40 in the storage means 10. The complement circuit obtains a two's complement corresponding to each of the registers 21, 22, ..., 2n. The partial product calculation method is Booth's method (Booth
algorithm)) to obtain the partial product.

【００１２】[0012]

【作用】記憶手段１０から受けた数値データＤ１，Ｄ
２，・・・，Ｄｍをレジスタ２１，２２，・・・，２ｎ
が一時的に保持し、割付手段３０が数値データＤ１，Ｄ
２，・・・，Ｄｍごとに、ビットデータＢＤを割り付け
る。その後、加算手段４０は、割付手段３０で割り付け
られたビットデータＢＤごとに加算を行なって、算術結
果ＡＤを出力する。Operation: Numerical data D1, D received from the storage means 10
2, ..., Dm to registers 21, 22, ..., 2n
Is temporarily held, and the allocating means 30 sets the numerical data D1, D
Bit data BD is allocated to each of 2, ..., Dm. After that, the adding means 40 performs addition for each bit data BD allocated by the allocating means 30 and outputs the arithmetic result AD.

【００１３】なお、メモリ管理手段はレジスタ２１，２
２，・・・，２ｎの数に応じて数値データＤ１，Ｄ２，
・・・，Ｄｍを出力するとともに、加算手段４０から出
力された算術結果ＡＤを記憶手段１０へ格納する等の算
術演算に必要なデータの入出力を管理する。このため、
レジスタ２１，２２，・・・，２ｎの数に制限されるこ
となく、数値データＤ１，Ｄ２，・・・，Ｄｍの数に応
じて算術演算を行うことができる。The memory management means is composed of the registers 21 and 2.
2, ..., 2n according to the number of numerical data D1, D2
, Dm are output, and the input / output of data necessary for arithmetic operation such as storing the arithmetic result AD output from the adding means 40 in the storage means 10 is managed. For this reason,
The arithmetic operation can be performed according to the number of numerical data D1, D2, ..., Dm without being limited by the number of registers 21, 22 ,.

【００１４】また、補数回路はレジスタ２１，２２，・
・・，２ｎごとに対応して２の補数を求めるので、数値
データＤ１，Ｄ２，・・・，Ｄｍが負の数の場合でも所
望の算術演算を行うことができる。部分積算出手段はブ
ースの方法による部分積を求めるので、積和演算を高速
に行うことができる。The complement circuit is composed of registers 21, 22, ...
.., 2n is calculated corresponding to every 2n, so that desired arithmetic operation can be performed even when the numerical data D1, D2, ..., Dm are negative numbers. Since the partial product calculating means obtains the partial product by Booth's method, the product-sum calculation can be performed at high speed.

【００１５】ここで、ブースの方法について説明する。
まず、２の補数によって数値が表現される二つの変数
Ｘ，Ｙについて、変数Ｘは被乗数とし、変数Ｙは乗数と
する。なお、、ｍ＝２ｎ（ｎは自然数）、ｙ_iは「０」
又は「１」の整数値（０＜ｉ≦ｍ）、ｙ_i＝０（ｉ≦
０）、ｙ_i＝１（ｉ＞ｍ）であることを条件として、次
式（１ａ）乃至式（１ｃ）において変数Ｙを定義する。The booth method will be described below.
First, regarding two variables X and Y whose numerical values are represented by two's complements, the variable X is a multiplicand and the variable Y is a multiplier. Note that m = 2n (n is a natural number) and y _i is “0”.
Alternatively, an integer value of “1” (0 <i ≦ m), y _i = 0 (i ≦ m
0) and y _i = 1 (i> m), the variable Y is defined in the following formulas (1a) to (1c).

【００１６】[0016]

【数１】 [Equation 1]

【００１７】ここで、式（１ａ）は２進数のビット列で
表す場合の式、式（１ｂ）は式（１ａ）を１０進数で表
す場合の式、式（１ｃ）は式（１ｂ）を数学記号シグマ
（Σ）で表した場合の式を示す。Here, the expression (1a) is an expression when the bit string of a binary number is represented, the expression (1b) is an expression when the expression (1a) is represented by a decimal number, and the expression (1c) is an expression of the expression (1b). The following shows the formula when expressed by the symbol sigma (Σ).

【００１８】このとき、式（１ｃ）における変数Ｙは、
次式（２ａ）乃至式（２ｃ）のように式変形できる。At this time, the variable Y in the equation (1c) is
The equations can be transformed into the following equations (2a) to (2c).

【００１９】[0019]

【数２】 [Equation 2]

【００２０】また、式（２ｃ）において、ｉ＝ｊ−１と
おくと次式（３）が成り立つ。When i = j-1 in the equation (2c), the following equation (3) is established.

【００２１】[0021]

【数３】 [Equation 3]

【００２２】さらに、式（３）において、ｙ_i＝０（ｉ
≦０）を考慮すると、次式（４ａ）乃至式（４ｃ）のよ
うに式変形できる。Further, in the equation (3), y _i = 0 (i
Considering ≦ 0, the equations can be transformed into the following equations (4a) to (4c).

【００２３】[0023]

【数４】 [Equation 4]

【００２４】したがって、変数Ｘと変数Ｙとの乗算結果
は、次式（５ａ）及び式（５ｂ）のようになる。Therefore, the multiplication result of the variable X and the variable Y is expressed by the following equations (5a) and (5b).

【００２５】[0025]

【数５】 [Equation 5]

【００２６】こうして、上記式（５ｂ）における変数
（ｙ_2k-1，ｙ_2k，ｙ_2k+1）のそれぞれに「０」又は
「１」の整数値を入れた結果は、図１５（Ｂ）に示すブ
ースの方式による真理値表になる。Thus, the result of putting the integer value of "0" or "1" into each of the variables (y _2k-1 , y _2k , y _{2k + 1} ) in the above equation (5b) is shown in FIG. 15 (B). The truth table is based on the booth method shown in.

【００２７】[0027]

【実施例】以下、本発明の一実施例を図面に基づいて説
明する。図２は本発明を実施する計算機システムにおけ
るプロセッサの基本構成を示すブロック図である。プロ
セッサは大きく制御装置１００と高速演算装置２００と
から構成される。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings. FIG. 2 is a block diagram showing the basic configuration of the processor in the computer system that implements the present invention. The processor is roughly composed of a control device 100 and a high speed arithmetic device 200.

【００２８】制御装置１００は、シーケンサ（sequence
r ）１０１、マイクロプログラムメモリ（micro progra
m memory）１０２及びマイクロ命令レジスタ（micro in
struction resister）１０３から構成される。シーケン
サ１０１はデータバスＤＢに接続された図１に示す記憶
手段１０としての主記憶装置や他の各種装置からの指令
データＣＤや後述する高速演算装置２００からの演算結
果ＡＤを受けて、マイクロプログラムメモリ１０２のア
ドレス信号をクロック信号ＣＬＫに同期して発生する機
能モジュールである。このシーケンサ１０１が発生する
アドレス信号のビット幅によって、マイクロプログラム
メモリ１０２の記憶容量が規制される。また、アドレス
信号を受けたマイクロプログラムメモリ１０２はマイク
ロ命令（すなわち、アドレス信号に対応するメモリアド
レスのデータ）を出力する。さらに、マイクロ命令を受
けたマイクロ命令レジスタ１０３は、クロック信号ＣＬ
Ｋに同期して次の制御処理のための制御信号ＣＳ１をシ
ーケンサ１０１へ送るとともに、後述する高速演算装置
２００へ算術演算のための制御信号ＣＳ２や制御バスＣ
Ｂに接続された各種装置への制御信号ＣＳ３を出力す
る。The control device 100 includes a sequencer.
r) 101, micro program memory (micro progra
m memory) 102 and micro instruction register (micro in
(struction resister) 103. The sequencer 101 receives the command data CD from the main storage device as the storage means 10 shown in FIG. 1 connected to the data bus DB and other various devices, and the operation result AD from the high-speed operation device 200 described later, and the microprogram It is a functional module that generates the address signal of the memory 102 in synchronization with the clock signal CLK. The storage capacity of the microprogram memory 102 is restricted by the bit width of the address signal generated by the sequencer 101. Further, the micro program memory 102 receiving the address signal outputs a micro instruction (that is, data of a memory address corresponding to the address signal). Further, the microinstruction register 103 that has received the microinstruction changes the clock signal CL.
The control signal CS1 for the next control processing is sent to the sequencer 101 in synchronization with K, and the control signal CS2 for the arithmetic operation and the control bus C are sent to the high-speed arithmetic device 200 described later.
It outputs a control signal CS3 to various devices connected to B.

【００２９】高速演算装置２００は、レジスタ群２０
１、割付手段２０２及び加算手段２０３から構成され
る。レジスタ群２０１は図１に示すレジスタ２１，２
２，・・・，２ｎに相当し、データバスＤＢを介して受
けた数値データについて例えば４ビット等の所定のデー
タビット幅と、制御装置１００からの制御信号ＣＳ２に
ついて例えば２ビット等の所定の制御ビット幅とを有
し、一時的に保持する。また、割付手段２０２は図１に
示す割付手段３０に相当し、レジスタ群２０１で保持さ
れたデータを受けて、データのビットごとにビットデー
タを割り付ける。さらに、加算手段２０３は図１に示す
加算手段４０に相当し、割付手段２０２で割り付けられ
たビットデータごとに加算を行なって、算術結果をアド
レスバスＡＢ及びデータバスＤＢを介して所望の装置へ
出力する。The high-speed arithmetic unit 200 includes a register group 20.
1, an allocating unit 202 and an adding unit 203. The register group 201 is the registers 21 and 2 shown in FIG.
2, ..., 2n, a predetermined data bit width of, for example, 4 bits for the numerical data received via the data bus DB, and a predetermined data bit width of, for example, 2 bits for the control signal CS2 from the control device 100. Control bit width and hold temporarily. The allocating means 202 corresponds to the allocating means 30 shown in FIG. 1, receives the data held in the register group 201, and allocates bit data for each bit of the data. Further, the adding means 203 corresponds to the adding means 40 shown in FIG. 1, performs addition for each bit data allocated by the allocating means 202, and outputs the arithmetic result to a desired device via the address bus AB and the data bus DB. Output.

【００３０】次に、本発明の高速演算装置の具体的な構
成及び動作について説明する。なお、説明を簡単にする
ために、図２に示すレジスタ群２０１のレジスタ数を４
個とし、データバスＤＢを介して受ける数値データは４
ビットのデータビット幅とし、制御装置１００から受け
る制御信号ＣＳ２は２ビットの制御ビット幅として説明
する。Next, a specific structure and operation of the high speed arithmetic device of the present invention will be described. In order to simplify the explanation, the number of registers in the register group 201 shown in FIG.
Numerical data received via the data bus DB is 4
The description will be given assuming that the data bit width of the bit is set, and the control signal CS2 received from the control device 100 is the control bit width of 2 bit.

【００３１】図３は、本発明の第１の実施例を示す図で
ある。図において、高速演算装置は、レジスタ２１ａ，
２２ａ，２３ａ，２４ａ、割付手段３０ａ及び加算手段
４０ａの各要素から構成される。FIG. 3 is a diagram showing a first embodiment of the present invention. In the figure, the high-speed arithmetic unit is shown as a register 21a,
22a, 23a, 24a, allocation means 30a, and addition means 40a.

【００３２】レジスタ２１ａは図１に示すレジスタ２１
に相当し、データビット幅が４ビットの数値データＤ１
ａと、制御ビット幅が２ビットの制御信号ＣＳ２１とを
一時的に保持する。同様に、レジスタ２２ａ，２３ａ，
２４ａは図１に示すレジスタ２２，２３，２４に相当
し、データビット幅が４ビットの数値データＤ２ａ，Ｄ
３ａ，Ｄ４ａと、制御ビット幅が２ビットの制御信号Ｃ
Ｓ２２，ＣＳ２３，ＣＳ２４とをそれぞれ一時的に保持
する。The register 21a is the register 21 shown in FIG.
And numerical data D1 having a data bit width of 4 bits
a and a control signal CS21 having a control bit width of 2 bits are temporarily held. Similarly, registers 22a, 23a,
Reference numeral 24a corresponds to the registers 22, 23 and 24 shown in FIG. 1 and has numerical data D2a and D having a data bit width of 4 bits.
3a, D4a and a control signal C having a control bit width of 2 bits
Each of S22, CS23, and CS24 is temporarily held.

【００３３】割付手段３０ａは図１に示す割付手段３０
に相当し、上記レジスタ２１ａ，２２ａ，２３ａ，２４
ａで保持されたそれぞれの数値データＤ１ａ，Ｄ２ａ，
Ｄ３ａ，Ｄ４ａ及び制御信号ＣＳ２１，ＣＳ２２，ＣＳ
２３，ＣＳ２４等のデータを受けて、数値データＤ１
ａ，Ｄ２ａ，Ｄ３ａ，Ｄ４ａのビットごとに８本のビッ
トデータＢＤ１，ＢＤ２，・・・，ＢＤ８を割り付け
る。なお、割付手段３０ａの具体的な構成は図４におい
て詳述する。The allocating means 30a is the allocating means 30 shown in FIG.
Corresponding to the registers 21a, 22a, 23a, 24
Each of the numerical data D1a, D2a, held in a
D3a, D4a and control signals CS21, CS22, CS
Numerical data D1 in response to data such as 23 and CS24
Eight bit data BD1, BD2, ..., BD8 are allocated for each bit of a, D2a, D3a, and D4a. The specific configuration of the allocating means 30a will be described in detail with reference to FIG.

【００３４】加算手段４０ａは図１に示す加算手段４０
に相当し、割付手段３０ａで割り付けられた８本のビッ
トデータＢＤ１，ＢＤ２，・・・，ＢＤ８ごとに加算を
行なって、９ビットからなる算術結果ＡＤ１，ＡＤ２，
・・・，ＡＤ９を出力する。なお、加算手段４０ａの具
体的な構成は図５において詳述する。The adding means 40a is the adding means 40 shown in FIG.
, BD8, which is allocated by the allocating means 30a, is added for each of the 8 bit data BD1, BD2, ..., BD8, and arithmetic results AD1, AD2 each consisting of 9 bits are added.
..., AD9 is output. The specific configuration of the adding means 40a will be described in detail with reference to FIG.

【００３５】図４は図３に示す割付手段３０ａの構成を
示す図である。図において、割付手段３０ａは、ＭＵＸ
（MUltipleXer;マルチプレクサ）３１，３２，３３，３
４の各要素から構成される。なお、ＭＵＸ３１，３２，
３３，３４はいずれも同一構成であるので、ここではＭ
ＵＸ３１について説明する。FIG. 4 is a diagram showing the structure of the allocating means 30a shown in FIG. In the figure, the allocating means 30a is a MUX.
(MUltipleXer; Multiplexer) 31, 32, 33, 3
4 elements. In addition, MUX 31, 32,
Since both 33 and 34 have the same structure, M is used here.
The UX31 will be described.

【００３６】ＭＵＸ３１は数値データＤ１ａ及び制御信
号ＣＳ２１を受けて、データビット幅が４ビットである
８本のビットデータＢＤ１，ＢＤ２，・・・，ＢＤ８を
割り付ける。The MUX 31 receives the numerical data D1a and the control signal CS21 and allocates eight bit data BD1, BD2, ..., BD8 having a data bit width of 4 bits.

【００３７】具体的には、データビット幅が４ビットの
数値データＤ１ａについて、上位４ビットを「０」する
数値データＤ１ａ４、最上位２ビットを「０」とすると
ともに数値データＤ１ａを上位ビット方向へ２ビットシ
フトした数値データＤ１ａ６及び数値データＤ１ａを上
位ビット方向へ４ビットシフトした数値データＤ１ａ８
を受ける。例えば、数値データＤ１ａを「１１０１」と
すると、それぞれ数値データＤ１ａ４は「００００１１
０１」に、数値データＤ１ａ６は「００１１０１００」
に、数値データＤ１ａ８は「１１０１００００」にな
る。Specifically, for the numerical data D1a having a data bit width of 4 bits, the numerical data D1a4 whose upper 4 bits are "0", the upper 2 bits are "0", and the numerical data D1a is in the upper bit direction. Numerical data D1a6 shifted by 2 bits to and numerical data D1a8 obtained by shifting numerical data D1a by 4 bits in the upper bit direction
Receive. For example, if the numerical data D1a is "1101", the numerical data D1a4 is "000011".
01 ”, the numerical data D1a6 is“ 00110100 ”
In addition, the numerical data D1a8 becomes “11010000”.

【００３８】また、制御ビット幅が２ビットの制御信号
ＣＳ２１を受けて、この制御信号ＣＳ２１に応じて数値
データＤ１ａ４、数値データＤ１ａ６及び数値データＤ
１ａ８の中で、いずれかの数値データを選択して、８本
のビットデータＢＤ１，ＢＤ２，・・・，ＢＤ８の内の
それぞれ１ビットを出力する。Further, the control signal CS21 having a control bit width of 2 bits is received, and the numerical data D1a4, the numerical data D1a6 and the numerical data D are received in accordance with the control signal CS21.
One of the numerical data is selected from 1a8 and 1 bit of each of the 8 bit data BD1, BD2, ..., BD8 is output.

【００３９】こうしてＭＵＸ３１，３２，３３，３４の
それぞれが８本のビットデータＢＤ１，ＢＤ２，・・
・，ＢＤ８の内のそれぞれ１ビットを出力し、同一位の
ビットを合わせてビットデータＢＤ１，ＢＤ２，・・
・，ＢＤ８として出力する。例えば、ＭＵＸ３１，３
２，３３，３４のそれぞれ最下位ビットを合わせてビッ
トデータＢＤ１として出力し、第２位ビットを合わせて
ビットデータＢＤ２として出力する。以下、同様にして
最上位ビットを合わせてビットデータＢＤ８として出力
する。Thus, each of the MUXs 31, 32, 33, 34 has eight bit data BD1, BD2, ...
.., each one bit of BD8 is output, and bit data BD1, BD2, ..
., Output as BD8. For example, MUX31,3
The least significant bits of 2, 33 and 34 are combined and output as bit data BD1, and the second least significant bits are combined and output as bit data BD2. Thereafter, similarly, the most significant bits are combined and output as bit data BD8.

【００４０】次に、図３に示す加算手段４０ａについ
て、図５乃至図８を用いて説明する。図５は加算手段４
０ａの構成を示す図である。図６は入力ビット数と１ビ
ット全加算器の個数等との関係を示す図である。図７
は、図６の入力ビット数に対応する１ビット全加算器の
接続関係を示す図であって、図７（Ａ）には３ビット入
力の場合を、図７（Ｂ）には４ビット入力の場合を、図
７（Ｃ）には６ビット入力の場合を、図７（Ｄ）には７
ビット入力の場合を、図７（Ｅ）には９ビット入力の場
合を、図７（Ｆ）には１０ビット入力の場合をそれぞれ
示す。図８は１ビット全加算器の真理値表を示す図であ
る。Next, the adding means 40a shown in FIG. 3 will be described with reference to FIGS. FIG. 5 shows addition means 4
It is a figure which shows the structure of 0a. FIG. 6 is a diagram showing the relationship between the number of input bits and the number of 1-bit full adders. Figure 7
7A and 7B are diagrams showing connection relations of 1-bit full adders corresponding to the number of input bits in FIG. 6, where 3-bit input is shown in FIG. 7A and 4-bit input is shown in FIG. 7B. 7C, 6-bit input is shown in FIG. 7C, and 7-bit input is shown in FIG. 7D.
In the case of bit input, FIG. 7E shows the case of 9-bit input, and FIG. 7F shows the case of 10-bit input. FIG. 8 is a diagram showing a truth table of the 1-bit full adder.

【００４１】図５において、加算手段４０ａは、加算器
列４１，４２及び桁上げ先見加算器４３の各要素から構
成される。加算器列４１は各４ビットのビットデータＢ
Ｄ１，ＢＤ２，・・・，ＢＤ８を受けて１ビット全加算
を行い、２ビットの第１加算データと１ビットの第１桁
上げデータを出力する８個のビット加算器４１１，４１
２，・・・，４１８から構成される。In FIG. 5, the adding means 40a is composed of the elements of the adder arrays 41 and 42 and the carry lookahead adder 43. The adder string 41 is bit data B of 4 bits each.
Eight bit adders 411, 41 that receive D1, BD2, ..., BD8 and perform 1-bit full addition to output 2-bit first addition data and 1-bit first carry data
2, ..., 418.

【００４２】加算器列４２は、８個のビット加算器４２
１，４２２，・・・，４２８から構成される。具体的に
は、ビット加算器４２１はビット加算器４１１から出力
された２ビットの第１加算データを受けて１ビット全加
算を行い、１ビットの第２加算データと１ビットの第２
桁上げデータを出力する。なお、このビット加算器４２
１から出力される１ビットの第２加算データは、１ビッ
トの算術結果ＡＤ１になる。The adder train 42 includes eight bit adders 42.
1, 422, ..., 428. Specifically, the bit adder 421 receives the 2-bit first addition data output from the bit adder 411, performs 1-bit full addition, and performs the 1-bit second addition data and the 1-bit second addition data.
Output carry data. The bit adder 42
The 1-bit second addition data output from 1 becomes the 1-bit arithmetic result AD1.

【００４３】また、ビット加算器４２２はビット加算器
４１１から出力された１ビットの第１桁上げデータとビ
ット加算器４１２から出力された２ビットの第１加算デ
ータとの３ビットデータを受けて１ビット全加算を行
い、１ビットの第２加算データと１ビットの第２桁上げ
データを出力する。さらに、ビット加算器４２３はビッ
ト加算器４１２から出力された１ビットの第１桁上げデ
ータとビット加算器４１３から出力された２ビットの第
１加算データとの３ビットデータを受けて１ビット全加
算を行い、１ビットの第２加算データと１ビットの第２
桁上げデータを出力する。The bit adder 422 receives the 3-bit data of the 1-bit first carry data output from the bit adder 411 and the 2-bit first addition data output from the bit adder 412. 1-bit full addition is performed and 1-bit second addition data and 1-bit second carry data are output. Further, the bit adder 423 receives the 3-bit data of the 1-bit first carry data output from the bit adder 412 and the 2-bit first addition data output from the bit adder 413, and receives the entire 1-bit data. Addition is performed and 1-bit second addition data and 1-bit second addition data
Output carry data.

【００４４】同様に、ビット加算器４２４，４２５，・
・・，４２８は、それぞれビット加算器４１３，４１
４，・・・，４１７から出力された１ビットの第１桁上
げデータとビット加算器４１４，４１５，・・・，４１
８から出力された２ビットの第１加算データとの３ビッ
トデータを受けて１ビット全加算を行い、１ビットの第
２加算データと１ビットの第２桁上げデータを出力す
る。Similarly, the bit adders 424, 425, ...
.., 428 are respectively bit adders 413, 41
, ..., 417 1-bit first carry data and bit adders 414, 415 ,.
It receives 3-bit data from the 2-bit first addition data output from 8 and performs 1-bit full addition, and outputs 1-bit second addition data and 1-bit second carry data.

【００４５】桁上げ先見加算器４３は、加算器列４２か
ら出力された１ビットの加算データ及び１ビットの桁上
げデータを受けてビットスライスを行い、８ビットの算
術結果ＡＤ２，ＡＤ３，・・・，ＡＤ９を出力する。具
体的には、ビット加算器４２１から出力された１ビット
の第２桁上げデータとビット加算器４２２から出力され
た１ビットの第２加算データを受けてビットスライスを
行い、１ビットの算術結果ＡＤ２を出力する。また、ビ
ット加算器４２２から出力された１ビットの第２桁上げ
データとビット加算器４２３から出力された１ビットの
第２加算データを受けてビットスライスを行い、１ビッ
トの算術結果ＡＤ３を出力する。The carry look-ahead adder 43 receives the 1-bit addition data and the 1-bit carry data output from the adder array 42 and performs bit slicing to obtain 8-bit arithmetic results AD2, AD3 ,. ., Outputs AD9. Specifically, the 1-bit second carry data output from the bit adder 421 and the 1-bit second addition data output from the bit adder 422 are bit-sliced to obtain a 1-bit arithmetic result. Output AD2. Further, it receives the 1-bit second carry data output from the bit adder 422 and the 1-bit second addition data output from the bit adder 423, performs bit slicing, and outputs a 1-bit arithmetic result AD3. To do.

【００４６】同様に、ビット加算器４２３，４２４，・
・・，４２８から出力された１ビットの第２桁上げデー
タとビット加算器４２３，４２４，・・・，４２８から
出力された１ビットの第２加算データを受けてビットス
ライスを行い、それぞれ１ビットの算術結果ＡＤ４，Ａ
Ｄ５，・・・，ＡＤ９を出力する。Similarly, the bit adders 423, 424, ...
.., 428, 1-bit second carry data and 1-bit second adder data output from bit adders 423, 424 ,. Bit arithmetic result AD4, A
D5, ..., AD9 are output.

【００４７】このように、論理回路で構成された桁上げ
先見加算器４３を用いることによって、桁上げのための
伝播時間が短縮されるため、より高速に算術演算を行う
ことができる。As described above, by using the carry look-ahead adder 43 composed of the logic circuit, the propagation time for carry is shortened, and thus the arithmetic operation can be performed at higher speed.

【００４８】図６において、関係テーブル４００は図５
に示す１６個のビット加算器４１１乃至４２８等のビッ
ト加算器における入力ビット数と、１ビット全加算器の
個数、加算データのビット数及び桁上げのビット数との
関係を示すテーブルである。In FIG. 6, the relationship table 400 is shown in FIG.
6 is a table showing the relationship between the number of input bits in the 16 bit adders 411 to 428 and the like shown in FIG. 1, the number of 1-bit full adders, the number of addition data bits, and the number of carry bits.

【００４９】関係テーブル４００において、図面左側か
ら第１列には入力ビット数４０１を、第２列には１ビッ
ト全加算器の個数４０２を、第３列には加算データのビ
ット数４０３を、第４列には桁上げのビット数４０４を
それぞれ示す。In the relationship table 400, from the left side of the drawing, the number of input bits 401 is shown in the first column, the number of 1-bit full adders 402 is shown in the second column, and the number of bits of added data 403 is shown in the third column. The number of carry bits 404 is shown in the fourth column.

【００５０】例えば、図５に示す２ビット入力のビット
加算器４２１は４０５行に対応し、３ビット入力のビッ
ト加算器４２２，４２３，・・・，４２８は４００ａ行
に対応する。いずれも１ビット全加算器の個数４０２は
「１」であり、加算データのビット数４０３は「１」で
あり、桁上げのビット数４０４は「１」である。For example, the 2-bit input bit adder 421 shown in FIG. 5 corresponds to 405 rows, and the 3-bit input bit adders 422, 423, ..., 428 correspond to 400a row. In both cases, the number of 1-bit full adders 402 is "1", the number of bits 403 of addition data is "1", and the number of carry bits 404 is "1".

【００５１】同様に、図５に示す４ビット入力のビット
加算器４１１，４１２，・・・，４１８は４００ｂ行に
対応し、１ビット全加算器の個数４０２は「１」であ
り、加算データのビット数４０３は「２」であり、桁上
げのビット数４０４は「１」である。こうして、関係テ
ーブル４００によって、入力ビット数に対応するビット
加算器の出力ビット数が分かる。Similarly, the 4-bit input bit adders 411, 412, ..., 418 shown in FIG. 5 correspond to the 400b row, and the number 402 of 1-bit full adders is "1". The number of bits 403 of is "2", and the number of carry bits 404 is "1". In this way, the relation table 400 shows the number of output bits of the bit adder corresponding to the number of input bits.

【００５２】図７は、上記関係テーブル４００の４００
ａ行，４００ｂ行，４００ｃ行，４００ｄ行，４００ｅ
行，４００ｆ行に対応する１ビット全加算器の接続関係
を示す図である。以下、図７（Ａ）乃至図７（Ｆ）の各
図について説明する。FIG. 7 shows 400 of the relationship table 400.
Row a, 400b, 400c, 400d, 400e
It is a figure which shows the connection relation of the 1-bit full adder corresponding to a row and 400f row. Hereinafter, each drawing of FIGS. 7A to 7F will be described.

【００５３】図７（Ａ）では、関係テーブル４００の４
００ａ行に対応して、１ビット全加算器５０１の接続が
なされる。具体的には、ビットデータｉ０，ｉ１，ｉ２
の３ビットデータを１ビット全加算器５０１が受けて、
和データｓ及び桁上げデータｃｏを出力する。なお、入
力ビット数が２ビットの場合は例えばビットデータｉ２
に強制的に「０」を入力すればよい。In FIG. 7A, 4 of the relation table 400 is used.
The 1-bit full adder 501 is connected corresponding to the 00a row. Specifically, the bit data i0, i1, i2
1-bit full adder 501 receives the 3-bit data of
The sum data s and the carry data co are output. If the number of input bits is 2 bits, for example, the bit data i2
Forcibly enter "0".

【００５４】図７（Ｂ）では、関係テーブル４００の４
００ｂ行に対応して、１ビット全加算器５０２の接続が
なされる。具体的には、ビットデータｉ０，ｉ１，ｉ２
の３ビットデータを１ビット全加算器５０２が受けて、
和データｓ０及び桁上げデータｃ０を出力し、ビットデ
ータｉ３はそのまま和データｓ１として出力する。In FIG. 7B, 4 in the relation table 400
The 1-bit full adder 502 is connected corresponding to the 00b row. Specifically, the bit data i0, i1, i2
1-bit full adder 502 receives the 3-bit data of
The sum data s0 and the carry data c0 are output, and the bit data i3 is output as it is as the sum data s1.

【００５５】図７（Ｃ）では、関係テーブル４００の４
００ｃ行に対応して、１ビット全加算器５０３，５０４
の接続がなされる。具体的には、ビットデータｉ０，ｉ
１，ｉ２の３ビットデータを１ビット全加算器５０３が
受けて和データｓ０及び桁上げデータｃ０を出力し、ビ
ットデータｉ３，ｉ４，ｉ５の３ビットデータを１ビッ
ト全加算器５０４が受けて和データｓ１及び桁上げデー
タｃ１を出力する。なお、入力ビット数が５ビットの場
合は例えばビットデータｉ５に強制的に「０」を入力す
ればよい。In FIG. 7C, 4 of the relation table 400 is used.
1-bit full adders 503 and 504 corresponding to row 00c
Is connected. Specifically, the bit data i0, i
The 1-bit full adder 503 receives the 3-bit data 1 and i2 and outputs the sum data s0 and the carry data c0. The 1-bit full adder 504 receives the 3-bit data i3, i4 and i5. The sum data s1 and the carry data c1 are output. When the number of input bits is 5, for example, “0” may be forcibly input to the bit data i5.

【００５６】図７（Ｄ）では、関係テーブル４００の４
００ｄ行に対応して、１ビット全加算器５０５，５０６
の接続がなされる。具体的には、ビットデータｉ０，ｉ
１，ｉ２の３ビットデータを１ビット全加算器５０５が
受けて和データｓ０及び桁上げデータｃ０を出力し、ビ
ットデータｉ３，ｉ４，ｉ５の３ビットデータを１ビッ
ト全加算器５０６が受けて和データｓ１及び桁上げデー
タｃ１を出力し、ビットデータｉ７はそのまま和データ
ｓ２として出力する。In FIG. 7D, 4 of the relation table 400 is used.
1-bit full adders 505 and 506 corresponding to row 00d
Is connected. Specifically, the bit data i0, i
The 1-bit full adder 505 receives the 3-bit data 1 and i2 and outputs the sum data s0 and the carry data c0, and the 1-bit full adder 506 receives the 3-bit data i3, i4 and i5. The sum data s1 and the carry data c1 are output, and the bit data i7 is output as it is as the sum data s2.

【００５７】図７（Ｅ）では、関係テーブル４００の４
００ｅ行に対応して、１ビット全加算器５０７，５０
８，５０９の接続がなされる。具体的には、ビットデー
タｉ０，ｉ１，ｉ２の３ビットデータを１ビット全加算
器５０７が受けて和データｓ０及び桁上げデータｃ０を
出力し、ビットデータｉ３，ｉ４，ｉ５の３ビットデー
タを１ビット全加算器５０８が受けて和データｓ１及び
桁上げデータｃ１を出力し、ビットデータｉ６，ｉ７，
ｉ８の３ビットデータを１ビット全加算器５０９が受け
て和データｓ２及び桁上げデータｃ２を出力する。な
お、入力ビット数が８ビットの場合は例えばビットデー
タｉ８に強制的に「０」を入力すればよい。In FIG. 7E, 4 of the relation table 400 is used.
1-bit full adders 507 and 50 corresponding to row 00e
8,509 connections are made. Specifically, the 1-bit full adder 507 receives the 3-bit data of the bit data i0, i1, i2, outputs the sum data s0 and the carry data c0, and outputs the 3-bit data of the bit data i3, i4, i5. The 1-bit full adder 508 receives the sum data s1 and the carry data c1, and outputs the bit data i6, i7,
The 1-bit full adder 509 receives the 3-bit data of i8 and outputs the sum data s2 and the carry data c2. When the number of input bits is 8 bits, for example, “0” may be forcibly input to the bit data i8.

【００５８】図７（Ｆ）では、関係テーブル４００の４
００ｆ行に対応して、１ビット全加算器５１０，５１
１，５１２の接続がなされる。具体的には、ビットデー
タｉ０，ｉ１，ｉ２の３ビットデータを１ビット全加算
器５１０が受けて和データｓ０及び桁上げデータｃ０を
出力し、ビットデータｉ３，ｉ４，ｉ５の３ビットデー
タを１ビット全加算器５１１が受けて和データｓ１及び
桁上げデータｃ１を出力し、ビットデータｉ６，ｉ７，
ｉ８の３ビットデータを１ビット全加算器５１２が受け
て和データｓ２及び桁上げデータｃ２を出力し、ビット
データｉ９はそのまま和データｓ３として出力する。In FIG. 7F, 4 of the relationship table 400 is used.
1-bit full adders 510, 51 corresponding to 00f row
1, 512 connections are made. Specifically, the 1-bit full adder 510 receives the 3-bit data of the bit data i0, i1, i2, outputs the sum data s0 and the carry data c0, and outputs the 3-bit data of the bit data i3, i4, i5. The 1-bit full adder 511 receives the sum data s1 and the carry data c1, and outputs the bit data i6, i7,
The 1-bit full adder 512 receives the 3-bit data of i8 and outputs the sum data s2 and the carry data c2. The bit data i9 is directly output as the sum data s3.

【００５９】図８には図７に示す１ビット全加算器５０
１，５０２，・・・，５１２の真理値表６００を示す。
真理値表６００は図面左欄の入力６０１と図面右欄の出
力６０２からなる。入力６０１には、図面左側から第１
列に入力端子ａに入力されるビットデータを、第２列に
入力端子ｂに入力されるビットデータを、第３列に入力
端子ｃｉに入力されるビットデータを示す。また、出力
６０２には、図面左側から第１列に和データｓのビット
データを、第２列に桁上げデータｃｏのビットデータを
示す。FIG. 8 shows the 1-bit full adder 50 shown in FIG.
A truth table 600 of 1,502, ..., 512 is shown.
The truth table 600 includes an input 601 in the left column of the drawing and an output 602 in the right column of the drawing. Input 601 is first from the left side of the drawing.
The column shows the bit data inputted to the input terminal a, the second column shows the bit data inputted to the input terminal b, and the third column shows the bit data inputted to the input terminal ci. In the output 602, the bit data of the sum data s is shown in the first column from the left side of the drawing, and the bit data of the carry data co is shown in the second column.

【００６０】例えば、６０３行では、入力端子ａにビッ
トデータ「０」が入力され、入力端子ｂ及び入力端子ｃ
ｉにビットデータ「１」が入力された場合、和データｓ
のビットデータ「０」を、桁上げデータｃｏのビットデ
ータ「１」を出力することを示す。For example, in line 603, bit data "0" is input to the input terminal a, and the input terminal b and the input terminal c are input.
When bit data “1” is input to i, sum data s
Bit data "0" of the carry data co is output.

【００６１】図９は図５に示す加算手段４０の他の構成
を示す図であって、パイプラインレジスタ（pipeline r
egister ）を設けた場合の構成を示す図である。図にお
いて、加算手段４０ａは、加算器列４１，４２及び桁上
げ先見加算器４３、パイプラインレジスタ５１，５２，
５３，５４の各要素から構成される。なお、図３及び図
５と同一の要素には同一番号を付し、説明を省略する。FIG. 9 is a diagram showing another configuration of the adding means 40 shown in FIG. 5, which is a pipeline register (pipeline r).
It is a figure which shows the structure at the time of providing egister). In the figure, the adding means 40a includes adder strings 41 and 42, a carry look-ahead adder 43, pipeline registers 51 and 52,
It is composed of 53 and 54 elements. The same elements as those in FIGS. 3 and 5 are designated by the same reference numerals, and the description thereof will be omitted.

【００６２】ここで、パイプラインレジスタ５１は図示
するように、図３に示す割付手段３０ａと加算器列４１
との間に接続する。同様に、パイプラインレジスタ５２
は加算器列４１と加算器列４２との間に接続し、パイプ
ラインレジスタ５３は加算器列４２と先見加算器４３と
の間に接続し、パイプラインレジスタ５４は先見加算器
４３と出力端との間に接続する。Here, the pipeline register 51 is, as shown in the figure, the allocating means 30a and the adder array 41 shown in FIG.
Connect between and. Similarly, the pipeline register 52
Is connected between the adder train 41 and the adder train 42, the pipeline register 53 is connected between the adder train 42 and the look-ahead adder 43, and the pipeline register 54 is connected to the look-ahead adder 43 and the output end. Connect between and.

【００６３】このように、パイプラインレジスタ５１，
５２，５３，５４を加算器列等の間に設けることにより
パイプライン処理を行うことができ、より算術演算を高
速化することができる。In this way, the pipeline registers 51,
By providing 52, 53, and 54 between the adder arrays or the like, pipeline processing can be performed, and the arithmetic operation can be further speeded up.

【００６４】以上の説明では、本発明の高速演算装置に
ついて図２に示すレジスタ群２０１のレジスタ数を４個
とし、データバスＤＢを介して受ける数値データは４ビ
ットのデータビット幅とし、制御装置１００から受ける
制御信号ＣＳ２は２ビットの制御ビット幅としたが、算
術演算の対象に応じてレジスタ群２０１のレジスタ数や
数値データのデータビット幅及び制御信号ＣＳ２の制御
ビット幅を増減することもできる。この場合に、数値デ
ータの数と加算段数との関係は図１０のようになる。In the above description, regarding the high-speed arithmetic device of the present invention, the number of registers of the register group 201 shown in FIG. 2 is 4, and the numerical data received via the data bus DB has a data bit width of 4 bits. The control signal CS2 received from 100 has a control bit width of 2 bits, but the number of registers in the register group 201, the data bit width of numerical data, and the control bit width of the control signal CS2 may be increased or decreased according to the target of the arithmetic operation. it can. In this case, the relationship between the number of numerical data and the number of addition stages is as shown in FIG.

【００６５】すなわち、上記の説明では、図１０の関係
テーブル７００における７０１行に示すように、図面右
列の数値データの数が「４」であるとき、図面左列の加
算段数は「２」、つまり図５に示す加算器列４１，４２
になる。例えば、数値データの数が「８」の場合は、７
０２行に示すように加算段数は４段必要になる。That is, in the above description, as shown in row 701 of the relationship table 700 of FIG. 10, when the number of numerical data in the right column of the drawing is “4”, the number of addition stages in the left column of the drawing is “2”. That is, the adder arrays 41 and 42 shown in FIG.
become. For example, if the number of numerical data is "8", 7
As shown in line 02, the number of addition stages is required to be four.

【００６６】次に、本発明の他の実施例について説明す
る。図１１は、本発明の第２の実施例を示す図である。
図１と異なるのは、記憶手段１０とレジスタ２１，２
２，・・・，２ｎとの間に、さらにメモリ管理手段６０
を設けた点である。なお、図１と同一の要素には同一番
号を付し、説明を省略する。Next, another embodiment of the present invention will be described. FIG. 11 is a diagram showing a second embodiment of the present invention.
The difference from FIG. 1 is that the storage means 10 and the registers 21, 2 are
2, ..., 2n between the memory management means 60
That is the point. The same elements as those in FIG. 1 are designated by the same reference numerals and the description thereof will be omitted.

【００６７】図において、メモリ管理手段６０は、レジ
スタ２１，２２，・・・，２ｎの数に応じて数値データ
Ｄ１，Ｄ２，・・・，Ｄｍを出力するとともに、加算手
段４０から出力された算術結果ＡＤを記憶手段１０へ格
納する等の算術演算に必要なデータの入出力を管理す
る。In the figure, the memory management means 60 outputs numerical data D1, D2, ..., Dm according to the number of registers 21, 22 ,. It manages the input / output of data required for arithmetic operations such as storing the arithmetic result AD in the storage means 10.

【００６８】例えば、図３のレジスタ２１ａ，２２ａ，
２３ａ，２４ａのようにレジスタが４個であって、６つ
の加算演算子を含む算術演算「Ｄ１＋Ｄ２＋Ｄ３＋Ｄ４
＋Ｄ５＋Ｄ６＋Ｄ７」を行う場合、メモリ管理手段６０
は次のような処理手順により上記算術演算を行う。For example, the registers 21a, 22a,
The arithmetic operation "D1 + D2 + D3 + D4" which has four registers like 23a and 24a and includes six addition operators.
+ D5 + D6 + D7 ”, the memory management means 60
Performs the above arithmetic operation according to the following processing procedure.

【００６９】まず、記憶手段１０に格納されている数値
データＤ１，Ｄ２，Ｄ３，Ｄ４，Ｄ５，Ｄ６，Ｄ７の中
で、レジスタ２１ａに数値データＤ１を、レジスタ２２
ａに数値データＤ２を、レジスタ２３ａに数値データＤ
３を、レジスタ２４ａに数値データＤ４を、それぞれ出
力して算術演算「Ｄ１＋Ｄ２＋Ｄ３＋Ｄ４」を行う。そ
して、得られた算術結果ＡＤ１を記憶手段１０へ格納す
る。First, of the numerical data D1, D2, D3, D4, D5, D6, D7 stored in the storage means 10, the numerical data D1 is stored in the register 21a and the register 22 is stored.
The numerical data D2 is stored in a and the numerical data D is stored in the register 23a.
3 is output to the register 24a, and the numerical data D4 is output to the arithmetic operation "D1 + D2 + D3 + D4". Then, the obtained arithmetic result AD1 is stored in the storage means 10.

【００７０】次に、記憶手段１０へ格納した算術結果Ａ
Ｄを受けて、新たにレジスタ２１ａに数値データＡＤ１
として出力し、レジスタ２２ａに数値データＤ５を、レ
ジスタ２３ａに数値データＤ６を、レジスタ２４ａに数
値データＤ７を、それぞれ出力して算術演算「ＡＤ＋Ｄ
５＋Ｄ６＋Ｄ７」を行う。こうして得られた算術結果Ａ
Ｄが、求める算術演算「Ｄ１＋Ｄ２＋Ｄ３＋Ｄ４＋Ｄ５
＋Ｄ６＋Ｄ７」の結果である。Next, the arithmetic result A stored in the storage means 10
Upon receiving D, the numerical data AD1 is newly added to the register 21a.
To output the numerical data D5 to the register 22a, the numerical data D6 to the register 23a, and the numerical data D7 to the register 24a.
5 + D6 + D7 ”. Arithmetic result A thus obtained
D is the desired arithmetic operation "D1 + D2 + D3 + D4 + D5
+ D6 + D7 ”.

【００７１】このため、上記処理手順を繰り返すことに
よって、レジスタ２１，２２，・・・，２ｎの数に制限
されることなく、数値データＤ１，Ｄ２，・・・，Ｄｍ
の数に応じて算術演算を行うことができる。Therefore, by repeating the above processing procedure, the numerical data D1, D2, ..., Dm are not limited to the numbers of the registers 21, 22 ,.
Arithmetic operations can be performed according to the number of.

【００７２】図１２は、本発明の第３の実施例を示す図
である。図１と異なるのは、レジスタ２１，２２，・・
・，２ｎと割付手段３０との間に、さらにレジスタ２
１，２２，・・・，２ｎに対応して補数回路７１，７
２，・・・，７ｎを設けた点である。なお、図１と同一
の要素には同一番号を付し、説明を省略する。FIG. 12 is a diagram showing a third embodiment of the present invention. The difference from FIG. 1 is that the registers 21, 22, ...
.., 2n and the allocating means 30 between the register 2
Complement circuits 71 and 7 corresponding to 1, 2 ...
2, ..., 7n are provided. The same elements as those in FIG. 1 are designated by the same reference numerals and the description thereof will be omitted.

【００７３】補数回路７１，７２，・・・，７ｎはレジ
スタ２１，２２，・・・，２ｎごとに対応して２の補数
を求めるので、数値データＤ１，Ｄ２，・・・，Ｄｍが
負の数の場合でも所望の算術演算を行うことができる。The complement circuits 71, 72, ..., 7n obtain the two's complement corresponding to each of the registers 21, 22, .., 2n, so that the numerical data D1, D2 ,. The desired arithmetic operation can be performed even with the number of.

【００７４】図１３は、本発明の第４の実施例を示す図
である。図１と異なるのは、レジスタ２１，２２，・・
・，２ｎと割付手段３０との間に、さらに部分積算出手
段８０を設けた点である。なお、図１と同一の要素には
同一番号を付し、説明を省略する。FIG. 13 is a diagram showing a fourth embodiment of the present invention. The difference from FIG. 1 is that the registers 21, 22, ...
.., 2n and the allocating means 30 are further provided with a partial product calculating means 80. The same elements as those in FIG. 1 are designated by the same reference numerals and the description thereof will be omitted.

【００７５】例えば、４ビットのビットデータ幅をなす
数値データＤ１，Ｄ２，Ｄ３，Ｄ４，Ｄ５，Ｄ６につい
て、積和演算を含む算術演算「Ｄ１×Ｄ２＋Ｄ３×Ｄ４
＋Ｄ５×Ｄ６」を行う場合、部分積算出手段８０は「Ｄ
１×Ｄ２」，「Ｄ３×Ｄ４」，「Ｄ５×Ｄ６」の部分積
ＺＤを求めて出力する。For example, for the numerical data D1, D2, D3, D4, D5, D6 having a bit data width of 4 bits, an arithmetic operation "D1.times.D2 + D3.times.D4" including a product sum operation is performed.
+ D5 × D6 ”, the partial product calculating means 80 uses“ D
The partial product ZD of 1 × D2 ”,“ D3 × D4 ”, and“ D5 × D6 ”is obtained and output.

【００７６】部分積ＺＤを受けた割付手段３０が部分積
ＺＤのビットデータごとに、ビットデータＢＤを割り付
け、加算手段４０がビットデータＢＤごとに加算を行う
ことにより、積和演算を含む算術演算「Ｄ１×Ｄ２＋Ｄ
３×Ｄ４＋Ｄ５×Ｄ６」の算術結果ＡＤが出力される。The allocating means 30 receiving the partial product ZD allocates the bit data BD for each bit data of the partial product ZD, and the adding means 40 performs addition for each bit data BD, thereby performing an arithmetic operation including a product-sum operation. "D1 x D2 + D
The arithmetic result AD of “3 × D4 + D5 × D6” is output.

【００７７】図１４は部分積算出手段８０の一構成を示
す図であって、積和演算を含む算術演算「Ｄ１×Ｄ２＋
Ｄ３×Ｄ４＋Ｄ５×Ｄ６」のうち、算術演算「Ｄ１×Ｄ
２」を求める場合を示す。なお、数値データＤ１を被乗
数、数値データＤ２を乗数とする。FIG. 14 is a diagram showing a configuration of the partial product calculating means 80, which is an arithmetic operation "D1 × D2 +" including a product sum operation.
D3 × D4 + D5 × D6 ”, the arithmetic operation“ D1 × D
2 ”is shown. The numerical data D1 is the multiplicand and the numerical data D2 is the multiplier.

【００７８】図において、ブース回路８１，８２には、
４ビットのビットデータ幅をなす数値データＤ１が並列
して入力される。また、ブース回路８１には４ビットの
ビットデータ幅をなす数値データＤ２の上位３ビットが
入力され、ブース回路８１には数値データＤ２の下位２
ビットが入力される。そして、ブース回路８１からは部
分積ｚ１及び桁上げデータｃ１が出力され、ブース回路
８２からは部分積ｚ２及び桁上げデータｃ２が出力され
る。なお、他の算術演算「Ｄ３×Ｄ４」，「Ｄ５×Ｄ
６」についても同様な構成をとることによって、積和演
算を含む算術演算「Ｄ１×Ｄ２＋Ｄ３×Ｄ４＋Ｄ５×Ｄ
６」を同時に行うことができる。In the figure, the booth circuits 81 and 82 include
Numerical data D1 having a 4-bit bit data width is input in parallel. The upper 3 bits of the numerical data D2 having a 4-bit bit data width are input to the booth circuit 81, and the lower 2 of the numerical data D2 are input to the booth circuit 81.
Bit is input. The booth circuit 81 outputs the partial product z1 and the carry data c1, and the booth circuit 82 outputs the partial product z2 and the carry data c2. In addition, other arithmetic operations “D3 × D4”, “D5 × D”
6 "also has the same configuration, whereby the arithmetic operation" D1 × D2 + D3 × D4 + D5 × D "including the product-sum operation is performed.
6 ”can be performed simultaneously.

【００７９】したがって、部分積算出手段８０は部分積
ＺＤを求めるので、積和演算を含む算術演算を高速に行
うことができる。この部分積ＺＤを求める方法は上述し
たブースの方法が知られており、図１５にブース回路の
ブロック図及び真理値表を示す。Therefore, since the partial product calculating means 80 obtains the partial product ZD, the arithmetic operation including the product-sum operation can be performed at high speed. As the method of obtaining this partial product ZD, the above-mentioned Booth method is known, and FIG. 15 shows a block diagram of the Booth circuit and a truth table.

【００８０】図１５はブース回路の構成を示す図であ
り、図１５（Ａ）にはブロック図を、図１５（Ｂ）には
真理値表をそれぞれ示す。図１５（Ａ）において、ブー
ス回路８０ａは例えば４ビット等の所定のデータビット
幅データをなす被乗数ｘと、３ビットの乗数（ｙ_2k-1，
ｙ_2k，ｙ_2k+1）とを受けて、部分積ｚ及び２ビットの桁
上げc(0,0)，c(1,1)を出力する。FIG. 15 is a diagram showing the configuration of the booth circuit. FIG. 15A shows a block diagram and FIG. 15B shows a truth table. In FIG. 15A, the Booth circuit 80a has a multiplicand x forming a predetermined data bit width data of, for example, 4 bits, and a multiplier of 3 bits (y _2k-1 ,
y _2k , y _{2k + 1} ) and outputs a partial product z and a 2-bit carry c (0,0), c (1,1).

【００８１】図１５（Ｂ）において、入出力テーブル８
００は上記ブース回路８０ａの真理値表である。例え
ば、８０１行では乗数（ｙ_2k-1，ｙ_2k，ｙ_2k+1）として
（０，１，１）をブース回路８０ａが受けると、部分積
ｚは被乗数ｘの−１倍した−ｘを、桁上げc(0,0)は
「０」を、桁上げc(1,1)は「１」をそれぞれ出力する。In FIG. 15B, the input / output table 8
00 is a truth table of the booth circuit 80a. For example, the multiplier is 801 rows _{_{(y 2k-1, y 2k}} , y 2k + 1) when the (0,1,1) booth circuit 80a receives, -1 times the -x partial product z, multiplicand x , Carry c (0,0) outputs "0", and carry c (1,1) outputs "1".

【００８２】上記の説明では、記憶手段１０にはアドレ
スバスＡＢ等を介して接続される主記憶装置を適用した
が、プロセッサ内に構成するキャッシュメモリを適用し
てもよい。キャッシュメモリを用いることによってバス
アクセスがなくなるので、より高速に算術演算を行うこ
とができる。In the above description, the main storage device connected via the address bus AB or the like is applied to the storage means 10, but a cache memory configured in the processor may be applied. Since the bus access is eliminated by using the cache memory, the arithmetic operation can be performed at a higher speed.

【００８３】また、割付手段３０には一以上のマルチプ
レクサで構成したが、ダイレクト結線によりビットデー
タの割り付けを行うように構成してもよい。この場合、
マルチプレクサを駆動するためのクロックが不要になる
ので、より高速に算術演算を行うことができる。Further, although the allocating means 30 is composed of one or more multiplexers, it may be composed so as to allocate bit data by direct connection. in this case,
Since a clock for driving the multiplexer is unnecessary, it is possible to perform arithmetic operation at higher speed.

【００８４】[0084]

【発明の効果】以上説明したように本発明では、記憶手
段から受けた数値データをレジスタが一時的に保持し、
割付手段が数値データのビットごとにビットデータを割
り付け、加算手段が割り付けられたビットデータごとに
加算を行うように構成したので、２以上の四則演算子を
含む算術演算を同時に行うことができる。このため、算
術演算を高速化することができる。As described above, according to the present invention, the register temporarily holds the numerical data received from the storage means,
Since the allocating means allocates bit data for each bit of the numerical data and the adding means performs addition for each allocated bit data, arithmetic operations including two or more arithmetic operators can be simultaneously performed. Therefore, the arithmetic operation can be speeded up.

【００８５】このような構成の高速演算装置を備えたプ
ロセッサを計算機システムに用いることによって、安価
ながら、２以上の四則演算子を含む算術演算を同時に行
うことができる。By using a processor equipped with a high-speed arithmetic unit having such a configuration in a computer system, it is possible to simultaneously perform arithmetic operations including two or more four arithmetic operators at a low cost.

[Brief description of drawings]

【図１】本発明の原理説明図である。FIG. 1 is a diagram illustrating the principle of the present invention.

【図２】計算機システムにおけるプロセッサの基本構成
を示すブロック図である。FIG. 2 is a block diagram showing a basic configuration of a processor in a computer system.

【図３】第１の実施例を示す図である。FIG. 3 is a diagram showing a first embodiment.

【図４】割付手段の構成を示す図である。FIG. 4 is a diagram showing a configuration of an allocation unit.

【図５】加算手段の構成を示す図である。FIG. 5 is a diagram showing a configuration of an addition unit.

【図６】入力ビット数と１ビット全加算器の個数等との
関係を示す図である。FIG. 6 is a diagram showing a relationship between the number of input bits and the number of 1-bit full adders and the like.

【図７】図７の入力ビット数に対応する１ビット全加算
器の接続関係を示す図であり、（Ａ）には３ビット入力
の場合を、（Ｂ）には４ビット入力の場合を、（Ｃ）に
は６ビット入力の場合を、（Ｄ）には７ビット入力の場
合を、（Ｅ）には９ビット入力の場合を、（Ｆ）には１
０ビット入力の場合をそれぞれ示す。7 is a diagram showing a connection relationship of 1-bit full adders corresponding to the number of input bits in FIG. 7, where (A) shows a case of 3-bit input, and (B) shows a case of 4-bit input. , (C) for 6-bit input, (D) for 7-bit input, (E) for 9-bit input, and (F) for 1
The case of 0-bit input is shown.

【図８】１ビット全加算器の真理値表を示す図である。FIG. 8 is a diagram showing a truth table of a 1-bit full adder.

【図９】加算手段の他の構成を示す図である。FIG. 9 is a diagram showing another configuration of the adding means.

【図１０】数値データの数と加算段数との関係を示す図
である。FIG. 10 is a diagram showing a relationship between the number of numerical data and the number of addition stages.

【図１１】第２の実施例を示す図である。FIG. 11 is a diagram showing a second embodiment.

【図１２】第３の実施例を示す図である。FIG. 12 is a diagram showing a third embodiment.

【図１３】第４の実施例を示す図である。FIG. 13 is a diagram showing a fourth embodiment.

【図１４】部分積算出手段の一構成を示す図である。FIG. 14 is a diagram showing a configuration of a partial product calculating means.

【図１５】ブース回路の構成を示す図であり、（Ａ）に
はブロック図を、（Ｂ）には真理値表をそれぞれ示す。FIG. 15 is a diagram showing a configuration of a Booth circuit, in which (A) is a block diagram and (B) is a truth table.

[Explanation of symbols]

１０記憶手段２１，２２，・・・，２ｎレジスタ３０割付手段４０加算手段 10 storage means 21, 22, ..., 2n register 30 allocation means 40 addition means

───────────────────────────────────────────────────── フロントページの続き (72)発明者丸山文宏神奈川県川崎市中原区上小田中1015番地富士通株式会社内 ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Fumihiro Maruyama 1015 Kamiodanaka, Nakahara-ku, Kawasaki-shi, Kanagawa Fujitsu Limited

Claims

[Claims]

1. A high-speed arithmetic device for high-speed arithmetic operation in a computer system, comprising: numerical data (D1, D2, D2) received from a storage means (10)
..., Dm) are temporarily stored in the registers (21, 2).
2, ..., 2n), and for each of the numerical data (D1, D2, ..., Dm) output from the registers (21, 22, ..., 2n),
Allocation means (30) for allocating bit data (BD)
And an addition unit (4) for performing addition for each of the allocated bit data (BD) and outputting an arithmetic result (AD).
0), and a high-speed arithmetic device characterized by the following.

2. The high-speed arithmetic unit according to claim 1, wherein said storage means (10) is composed of a cache memory.

3. The storage means (10) and the registers (21, 22, ..., 2n) are provided between the storage means (10) and the registers (21, 22 ,. For arithmetic operations such as outputting the numerical data (D1, D2, ..., Dm) and storing the arithmetic result (AD) output from the adding means (40) in the storage means (10). The high-speed arithmetic unit according to claim 1, further comprising a memory management unit that manages input / output of necessary data.

4. The registers (21, 22, ..., 2)
n) and the allocating means (30), and 2 corresponding to each of the registers (21, 22, ..., 2n).
2. The high-speed arithmetic device according to claim 1, further comprising a complement circuit for obtaining the complement of the.

5. The registers (21, 22, ..., 2)
n) and the allocating means (30), further comprising a partial product calculating means for calculating a partial product by Booth's method (Booth algorithm).
The described high speed computing device.

6. The allocating means (30) includes numerical data (D1, D2, ..., Dm) output from the registers (21, 22, ..., 2n) and the numerical data (D1). , D2, ..., Dm) having at least one numerical value data shifted by a predetermined bit width, and selecting and outputting any one of the numerical value data according to the selection signal. The high-speed arithmetic device according to claim 1.

7. The high-speed arithmetic unit according to claim 1, wherein said adding means (40) has a pipeline register.

8. The high-speed arithmetic unit according to claim 1, wherein the adding means (40) has a carry look ahead adder.