JPS6037034A

JPS6037034A - Multiplying circuit for pipeline formation with mode

Info

Publication number: JPS6037034A
Application number: JP58146034A
Authority: JP
Inventors: Takashi Kanazawa; 敬金沢
Original assignee: NEC Corp; Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1983-08-09
Filing date: 1983-08-09
Publication date: 1985-02-26

Abstract

PURPOSE:To secure the application of a multiplying circuit to the data processing which requires high accuracy by switching in mode the multiplication having high performance and a high accuracy mode, and therefore making use of a fact that the hardware quantity is small together with high performance. CONSTITUTION:When an operation (AXB) is carried out (where A: multiplicand and B: multiplier), A and B are stored to a multiplicand register 10 and a multiplier register 11 respectively. Both the final sum and the final carry are supplied to a result production means 16 via signal lines 1500 and 1501 in a high performance mode and after a prescribed operation. Then the final sum and the final carry are supplied to registers 16-2 and 16-3 in high accuracy mode. The outputs of these registers are supplied to a carry foresighted adder 16-7 via selection circuits 16-5 and 16-6 and added together with the output of a transmission carry producing circuit 16-4. Then 56 bits less than a decimal point are stored to a result register 17 through a signal line 1600. In this mode the carry can be correctly transmitted to secure the multiplication of high accuracy unlike a case where the multiplication of a high performance mode does not transmit carry.

Description

【発明の詳細な説明】〔発明の属する技術分野〕本発明は、データ処理装置におけるパイプライン化乗算
回路に関する。DETAILED DESCRIPTION OF THE INVENTION [Field of the Invention] The present invention relates to a pipelined multiplication circuit in a data processing device.

[Description of prior art]

従来、この種のパイプライン化された浮動小数点乗算回
路は、第１図に示すように、被乗数の仮数部を格納する
レジスタ１と乗数の仮数部を格納するレジスタ２とを備
え、さらに被乗数と乗数ととを入力し被乗数と乗数との
部分積群の上位Ｍｌ十Ｎ１　（ただしＭｌは仮数部のビ
ット長、Ｎ１は乗数精度を規定する値）ビットのみを作
成する部分積群作成回路３と、この部分積群作成回路３
の出力を加算し最終和および最終桁上げを出力する多入
力桁上げ保存加算器４と、この最終和および最終桁上げ
を格納する最終和レジスタ５と、最終桁上げレジスタ６
と、この最終和レジスタ５、最終桁上げレジスタ６の２
人力を加算する桁上げ先見加算器７と、演算結果レジス
タ８とを備え、これらを制御する制御回路９とにより構
成されていた。たとえば、仮数部が５６ビツト（Ｍｌ）
、乗算精度を規定する値８（Ｎｌ）の場合は、乗算をリ
コード乗算器で実行すると、第２図に示すように２９個
の小数点以下６４ピッｌ−（Ｍｌ　＋Ｎ１ビット）より
以下が切り捨てられた部分積が必要である。Conventionally, this type of pipelined floating-point multiplication circuit, as shown in FIG. A partial product group creation circuit 3 that inputs the multiplier and and creates only the upper Ml + N1 (where Ml is the bit length of the mantissa part and N1 is the value that specifies the multiplier precision) bits of the partial product group of the multiplicand and the multiplier. , this partial product group creation circuit 3
a multi-input carry save adder 4 that adds the outputs of and outputs the final sum and final carry; a final sum register 5 that stores the final sum and final carry; and a final carry register 6.
and this final sum register 5, final carry register 6, 2
It was comprised of a carry look-ahead adder 7 for adding human power, an arithmetic result register 8, and a control circuit 9 for controlling these. For example, the mantissa is 56 bits (Ml)
, in the case of the value 8 (Nl) that specifies the multiplication precision, when the multiplication is performed with a recode multiplier, the 29 decimal points below 64 bits l-(Ml + N1 bits) are truncated, as shown in Figure 2. A partial product is required.

この２９個の倍数を第３図に示す多入力桁上げ保存加算
器で加算すると２７個の３人力加算器が必要である。し
かし、一般的なパイプライン化乗算回路に比べ、３人力
加算器のビット幅は最大で６６ビツト、または桁上げ先
見加算器のビット幅も６４ビツトと大幅に少なく、その
結果として、ベクトル演算プロセッサにおける１エレメ
ント／マシンサイクルの高速な浮動小数点乗算の性能を
低下させることなく、大幅な金物量の削減を実現した。When these 29 multiples are added using the multi-input carry save adder shown in FIG. 3, 27 three-man power adders are required. However, compared to general pipelined multiplication circuits, the bit width of the three-power adder is 66 bits at maximum, and the bit width of the carry look-ahead adder is 64 bits, which is significantly smaller. A significant reduction in the amount of hardware was achieved without degrading the performance of high-speed floating-point multiplication of 1 element/machine cycle.

ところが、乗算の精度には問題があり、精度の要求され
るデータ処理にはこの乗算器は使い難い欠点があった。However, there is a problem with the accuracy of multiplication, and this multiplier is difficult to use for data processing that requires accuracy.

[Purpose of the invention]

本発明は、少ない金物量で高性能な乗算を実現する利点
を生かしつつ、かつ高精度なデータ処理の利用も可能に
するパイプライン化乗算回路を提供することを目的とす
る。SUMMARY OF THE INVENTION An object of the present invention is to provide a pipelined multiplication circuit that takes advantage of the advantage of realizing high-performance multiplication with a small amount of hardware while also making it possible to use highly accurate data processing.

[Features of the invention]

本発明は、２人力浮動小数点データを乗算するパイプラ
イン化乗算回路において、Ｍ　（Ｍは仮数部のビット長
）ビットの被乗数とＭビットの乗数の上位Ｋ　（Ｋは高
精度モード乗算幅規定ビット長）ビットとの上位部分積
群と上記被乗数と上記乗数の下位Ｍ−にピントの下位部
分積群の上位Ｍ−に＋Ｎ（Ｎは高性能モード乗算精度規
定値）ビット部分とを作成する部分積群作成回路と、こ
の部分積群作成回路の出力を加算し最終和および最終桁
上げを出力する多入力桁上げ保存加算器と、乗算器上位
部分積群の小数点以下Ｍ＋Ｎ＋１からＭ十にビット部分
の部分積群の総和出力を零とする上位部分積群局所クリ
ア手段と、上記下位部分積群の総和出力を零とする下位
部分積群クリア手段と、上記桁上げ保存加算器の出力を
入力としＭビットの被乗数とにビットの乗数の乗算を繰
り返すことにより所要演算結果を出力する所要結果作成
手段と、高性能モード演算と高精度モード演算を切換え
るモード切換手段とを備えたことを特徴とする。The present invention provides a pipelined multiplication circuit that multiplies floating-point data by two operators, in which a multiplicand of M (M is the bit length of the mantissa) bits and a multiplier of M bits (K is the high-precision mode multiplication width regulation bit) is used. A part that creates an upper partial product group with the above multiplicand and the above multiplier, and a +N (N is the specified value of multiplication accuracy in high performance mode) bit part in the upper M- of the lower partial product group that focuses on the multiplicand and the lower M- of the multiplier. A product group creation circuit, a multi-input carry save adder that adds the outputs of this partial product group creation circuit and outputs the final sum and final carry, and a multiplier that adds bits from M+N+1 to M0 below the decimal point of the upper partial product group. upper partial product group local clearing means for making the summation output of the partial product group of the part zero, lower partial product group clearing means for making the summation output of the lower partial product group zero, and the output of the carry save adder. The present invention is characterized by comprising a required result creation means for outputting a required operation result by repeatedly multiplying an input M-bit multiplicand by a bit multiplier, and a mode switching means for switching between high-performance mode operation and high-precision mode operation. shall be.

[Explanation based on examples]

第４図は本発明実施例装置の構成を示すブロック構成図
である。第５図は第４図の一部分である部分積群作成回
路１３の詳細構成を示すブロック構成図であり、第６図
は第４図の一部分である所要結果作成手段１６の詳細構
成を示すブロック構成図である。FIG. 4 is a block configuration diagram showing the configuration of an apparatus according to an embodiment of the present invention. FIG. 5 is a block diagram showing the detailed configuration of the partial product group generation circuit 13, which is a part of FIG. 4, and FIG. 6 is a block diagram showing the detailed configuration of the required result creation means 16, which is a part of FIG. FIG.

以下に第４図、第５図および第６図に基づいて本装置の
構成を説明する。まず第４図において、１０　ハ５６　
ヒツト（ＭＵＭは仮数部のビット長）の被乗数を格納す
る被乗数レジスタである。１１は５６ビツト（Ｍビット
）の乗数を格納する乗数レジスタである。１２は演算乗
数切換手段である。高性能モードの演算乗数と高精度モ
ード乗算幅規定ビット長２８ビット（Ｋビット）より定
まる乗算数分の２つの演算乗数は第１表のように設定さ
れる。The configuration of this device will be explained below based on FIGS. 4, 5, and 6. First, in Figure 4, 10 ha56
This is a multiplicand register that stores the multiplicand of the number of bits (MUM is the bit length of the mantissa). 11 is a multiplier register that stores a 56-bit (M bit) multiplier. 12 is an arithmetic multiplier switching means. Two arithmetic multipliers for the number of multiplications determined by the arithmetic multiplier in the high performance mode and the prescribed bit length of 28 bits (K bits) for the multiplication width in the high precision mode are set as shown in Table 1.

演算乗数選択手段は、高性能モード時には、高性能モー
ド演算乗数を高精度モードときには、１回目の乗算では
高精度モード第一演算乗数を、２回目の乗算では高精度
モード第二演算乗数を選択し出力とする。演算乗数の下
位２８ピツＩ・（Ｍ−にビット）を零とすることは、被
乗数と乗数の下位２８ビツト（Ｍ−にビット）の下位部
分積群の総和を零とすることと等価で下位部分積群クリ
ア手段に相当する。１３は被乗数と乗数との倍数を作成
する部分積群作成回路である。被乗数の倍数を発生する
アルゴリズムは、リコード手法を用いている。The calculation multiplier selection means selects the high-performance mode calculation multiplier in the high-performance mode, the high-precision mode first calculation multiplier in the high-precision mode, and the high-precision mode second calculation multiplier in the second multiplication. and output it. Setting the lower 28 bits of the multiplier (bits in M-) to zero is equivalent to setting the sum of the lower partial product group of the lower 28 bits (bits in M-) of the multiplicand and the multiplier to zero; This corresponds to partial product group clearing means. 13 is a partial product group creation circuit that creates multiples of the multiplicand and the multiplier. The algorithm for generating multiples of the multiplicand uses the recoding method.

リコード手法は、乗数ａｘ　ａ２　ａ３　・・・ａ　５
４　ａ　５５ａｓ６を１ビツト重複して第２表に示すよ
うに３ピントずつＭＯ〜Ｍ２８の２９組に分割する。各
組の３ビツトの重み付けは、（−２、■、１）であり３
ビツトパターンにより倍率は第３表に示すように設定さ
れ、この３ビツトのＭＯ−Ｍ２Ｓに対応する部分積２９
個が作成される。The recoding method is multiplier ax a2 a3 ... a 5
4a55as6 is divided into 29 groups MO to M28 of 3 focuses each with one bit overlapping as shown in Table 2. The weighting of the 3 bits in each set is (-2, ■, 1), which is 3
The magnification is set according to the bit pattern as shown in Table 3, and the partial product 29 corresponding to this 3-bit MO-M2S is
pieces are created.

第３表１４は被乗数と乗数の上位２８ビツト（Ｋビ、ト）との
上位部分積群の小数点以下６５ピッｌ−（Ｍ＋高性能モ
ード乗算精度規定値Ｎ（８ビツト）＋１ビツト）から８
４ビツト（Ｍ＋にビット）部分の総和出力を零とする上
位部分積群局所クリア手段である。この手段は、高性能
モード時には、下位部分積群の小数点以下６４ビツトよ
り以下を切り捨て、部分積群の残りの部分はそのまま通
過させ、高精度モード時には、部分積群の全てをそのま
ま通過させる手段である。１５は下位部分積群局所クリ
ア手段からの２９本の出力を加算し最終和および最終桁
上げを作成する多入力桁上げ保存加算器である。Table 3 14 shows the upper partial product group of the multiplicand and the upper 28 bits (K bits, t) of the multiplier, from 65 bits below the decimal point (M + high-performance mode multiplication accuracy specification value N (8 bits) + 1 bit) to 8
This is an upper partial product group local clearing means that makes the total output of the 4-bit (bits in M+) part zero. In the high performance mode, this means discards the 64 bits below the decimal point of the lower partial product group and passes the remaining part of the partial product group as is, and in the high precision mode, means to pass the entire partial product group as is. It is. Reference numeral 15 is a multi-input carry save adder that adds 29 outputs from the lower partial product group local clearing means to create a final sum and a final carry.

１６は多入力桁上げ保存加算器の最終和および最終桁上
げ出力を入力とし所要演算結果を出力する所要結果作成
手段である。１７は演算結果を格納する結果レジスタで
ある。１８は指数部の処理等の制御を行う制御回路であ
る。１９は高性能モードと高精度モードとを切換えるモ
ード切換え手段である。Reference numeral 16 denotes a required result generating means which inputs the final sum and final carry output of the multi-input carry save adder and outputs the required operation result. Reference numeral 17 is a result register for storing the calculation result. Reference numeral 18 denotes a control circuit that controls processing of the exponent part and the like. 19 is a mode switching means for switching between high performance mode and high precision mode.

次に第５図に示す部分積群作成回路１３について構成を
説明する。被乗数と演算乗数を入力とし、被乗数の上位
２８ピツ）　（Ｋビット）の上位部分積群に相当するビ
ット幅５８ビツトの部分積１３−１５から１３−２９と
、被乗数と演算乗数の下位２８ビツト（Ｍ−にビット）
との下位部分積群の小数点以下６４ビツト（Ｍ＋Ｎビッ
ト）より以下を切り捨てたものに相当し、ビット幅が１
０．１２．１４、・・・、３６ヒツトと順次法がってい
く部分積１３−１から１３−１４とを作成する。Next, the configuration of the partial product group generation circuit 13 shown in FIG. 5 will be explained. Input the multiplicand and the operation multiplier, and input the partial products 13-15 to 13-29 with a bit width of 58 bits corresponding to the upper partial product group of the upper 28 bits of the multiplicand (K bits), and the lower 28 bits of the multiplicand and the operation multiplier. (M-bit)
It corresponds to the 64 bits below the decimal point (M+N bits) of the lower partial product group of the lower partial product group, and the bit width is 1.
0.12.14, . . . , 36 partial products 13-1 to 13-14 are created sequentially.

次に第６図は所要結果作成手段１６の詳細回路図である
。１６−１は４人力桁上げ保存加算器である。Next, FIG. 6 is a detailed circuit diagram of the required result generating means 16. 16-1 is a four-person carry save adder.

１６−２．１６−３は４人力桁上げ保存加算器の最終相
および最終桁上げを格納する最終相レジスタ、最終桁上
げレジスタである。これらのレジスタの出力は２８ビッ
ト（Ｋビット）シフトされて４人力桁上げ保存加算器１
６−１の入力に戻されている。16-2 and 16-3 are final phase registers and final carry registers for storing the final phase and final carry of the four-manpower carry save adder. The outputs of these registers are shifted 28 bits (K bits) to the four-man carry-save adder 1.
It is returned to the input of 6-1.

１６−４は繰り返し乗算における切り捨てられる最終お
よび最終桁上げより発生する伝播桁上げを取り出す伝播
桁上げ生成回路である。１６−５．１６−６は多入力桁
上げ保存加算器１５からの最終相および最終桁上げ入力
と最終相レジスク１６−２および最終桁上げレジスタ１
６−３からの入力とを切換える選択回路である。１６−
７は選択回路１６−５．１６−６の出力を入力とし加算
する桁上げ先見加算器である。16-4 is a propagation carry generation circuit which extracts a propagation carry generated from the final and final carry to be truncated in repeated multiplication. 16-5.16-6 is the final phase and final carry input from the multi-input carry save adder 15, the final phase register 16-2 and the final carry register 1
This is a selection circuit that switches the input from 6-3. 16-
7 is a carry look-ahead adder which inputs the outputs of the selection circuits 16-5 and 16-6 and adds them.

次に、被乗数をＡ、乗数をＢとして、高性能モードと高
精度モードとにおけるＡＸＢの演算が行われる動作を説
明する。まずＡ、Ｂがそれぞれ被乗数レジスタ１０、乗
数レジスタ１１に格納される。Next, assuming that A is the multiplicand and B is the multiplier, the operation of calculating AXB in the high performance mode and the high precision mode will be described. First, A and B are stored in the multiplicand register 10 and the multiplier register 11, respectively.

高性能モードにおいて、Ａは信号線１０００を通り、部
分積群作成回路１３に入力する。一方、Ｂは信号線１１
００を通り演算切換手段１２に入りＢそのものが演算乗
数として選ばれ信号線１２００から部分積群作成回路１
３に送られる。部分積群作成回路１３で第５図に示した
ような部分積Ｚ１から２２９の２９個が作成される。Ｚ
ｌから２２９は信号線１３００から上位群局所クリア手
段１４に入力され、Ｚｌｓから２２９の各部分積は小数
点以下６４ピッｌ−（Ｍ＋Ｎビット）より以下が切り捨
てられＹ１５からＹ２９となる。ＺｌからＺｌ４、Ｙｔ
ｓからＹ２１１は、第１図の部分積群作成回路３の出力
と実質的に等価であり、信号線１４００を通り、第３図
に示すような多入力桁上げ保存加算器１５に入力され加
算されて最終相Ｓおよび最終桁上げＣとなる。ＳとＣは
信号線１５００と１５０１を経由して所要結果作成手段
１６に入る。所要結果作成手段１６に入力されたＳとＣ
はそれぞれ選択回路１６−５．１６−６を通過し桁上げ
先見加算器１６−７に入り加算されて小数点以下５６ビ
ツトが信号線１６００から結果レジスタ１７に格納され
、第１図の乗算器と同精度、同じベクトル性能のＡＸＢ
の乗算が終了する。高精度モードにおいて、Ｂの上位側
ビットをＢ１、下位詔ビットをＢ２とすると、ＡＸＢミ
Ａｘ　（Ｂ工＋Ｂ２）＝　Ａ　Ｘ　Ｂ　１　＋　Ａ　Ｘ　Ｂ　２となり、ＡＸ
Ｂは、ＡＸＢＩとＡＸＢ２＜７）２回の乗算の繰り返し
が必要である。Ａは信号線１０００を介して部分積群作
成回路１３に入力される。Ｂは信号線１１００を介して
演算乗数切換手段１２に入力され、まず、高精度モード
第一演算乗数Ｂ１が１マシンサイクル後に高精度モード
第二演算乗算Ｂ２が順次選択され信号線１２００がら部
分積群作成回路１３に送られる。始めに、ＡとＢ１の部
分積Ｚ１０ｉがらＺｌ２９が、ｌマシンサイクル遅れて
ＡとＢ２の部分積Ｚ　２０　ｔからｚ２２９が高性能モ
ードと同様に作成されて１マシンサイクルずれて信号線
１３００がら上位部分積群局所クリア手段１４に送られ
る。上位部積群局所クリア手段１４では、Ｚｌｏｌから
Ｚｌ２９とＺ　２０１からＺ２２９は何もされず、１マ
シンサイクル異なって信号線１４００を介して第３図と
同様な多入力桁上げ保存加算器１５に入力され、Ｚｌｏ
ｌがらＺｌ２９に対しＳｌ、Ｃ１が、Ｚ２ｏｚからＺ２
２９に対しＳ　２　、Ｃ２の最終相および最終桁上げが
１マシンサイクルずれて作成される。３１、Ｃ２とＢ２
、Ｃ２は１マシンサイクルの差を持って、所要結果作成
手段１６に入る。始めに送られてきたＳ□と０１は、最
終相レジスタ１６−２と最終桁上げレジスタ１６−３＜
共に初期値は０）の内容と２８ビツトシフトされた形で
４人力桁上げ保存加算器１６−１で加算され最終相およ
び最終桁上げが最終相レジスタ１６−２および最終桁上
げレジスタ１６−３に格納される。１マシンサイクル遅
れて送られてきたＢ２とＣ２はＳｌとＣ１同様４人力桁
上げ保存加算器１６−１で最終相および最終桁上げレジ
スタ１６−２．１６−３に入る。これらのレジスタの出
力は、選択回路１６−５．１６−６を介して桁上げ先見
加算器１６−７に入り、伝播桁上げ生成回路１６−４の
出力と共に加算され小数点以下５６ビツトが信号線１６
００から結果レジスタ１７に格納されＡＸＢの演算を終
了する。このモードでは、高性能モードの乗算が桁上げ
を伝播しないのに対して正しく桁上げを伝播できるため
高精度な乗算が可能である。In the high performance mode, A passes through the signal line 1000 and is input to the partial product group generation circuit 13. On the other hand, B is the signal line 11
00 and enters the arithmetic switching means 12, where B itself is selected as the arithmetic multiplier, and is connected to the partial product group generating circuit 1 from the signal line 1200.
Sent to 3. The partial product group generation circuit 13 generates 29 partial products Z1 to 229 as shown in FIG. Z
229 from l is inputted to the upper group local clearing means 14 from the signal line 1300, and each partial product from Zls to 229 is rounded down to 64 digits below the decimal point l-(M+N bits) to become Y15 to Y29. Zl to Zl4, Yt
Y211 from s is substantially equivalent to the output of the partial product group generation circuit 3 shown in FIG. 1, and is input to the multi-input carry save adder 15 as shown in FIG. 3 through the signal line 1400 and is added. This results in the final phase S and final carry C. S and C enter the required result generating means 16 via signal lines 1500 and 1501. S and C input into the required result creation means 16
pass through the selection circuits 16-5 and 16-6, enter the carry look-ahead adder 16-7, are added together, and the 56 bits below the decimal point are stored in the result register 17 from the signal line 1600, and are connected to the multiplier in FIG. AXB with the same precision and vector performance
The multiplication of is completed. In high precision mode, if the upper bit of B is B1 and the lower edict bit is B2, then AXBmiAx (B+B2) = A X B 1 + A X B 2, and AX
B requires two repetitions of multiplication (AXBI and AXB2<7). A is input to the partial product group generation circuit 13 via the signal line 1000. B is input to the arithmetic multiplier switching means 12 via the signal line 1100, and first, after one machine cycle of the high precision mode first arithmetic multiplier B1, the high precision mode second arithmetic multiplier B2 is sequentially selected and the partial product is changed from the signal line 1200. It is sent to the group creation circuit 13. First, Zl29 is created from the partial product Z10i of A and B1, and z229 is created from the partial product Z20t of A and B2 with a delay of 1 machine cycle in the same way as in the high performance mode, and is created from the signal line 1300 with a delay of 1 machine cycle. It is sent to the partial product group local clearing means 14. In the upper part product group local clearing means 14, nothing is done from Zlol to Zl29 and from Z201 to Z229, and the signals are sent to the multi-input carry save adder 15 similar to that shown in FIG. 3 via the signal line 1400 with a difference of one machine cycle. entered and Zlo
From l to Zl29, Sl, C1 from Z2oz to Z2
29, the final phase and final carry of S 2 and C2 are created with a shift of one machine cycle. 31, C2 and B2
, C2 enter the required result generating means 16 with a difference of one machine cycle. S□ and 01 sent at the beginning are stored in the final phase register 16-2 and final carry register 16-3<
Both initial values are 0), and the contents are shifted by 28 bits and added by a four-man carry save adder 16-1, and the final phase and final carry are stored in the final phase register 16-2 and final carry register 16-3. Stored. B2 and C2, which are sent with a delay of one machine cycle, enter the final phase and final carry register 16-2, 16-3 in the four-man carry save adder 16-1 in the same way as Sl and C1. The outputs of these registers enter the carry look-ahead adder 16-7 via selection circuits 16-5 and 16-6, are added together with the output of the propagation carry generation circuit 16-4, and 56 bits below the decimal point are added to the signal line. 16
00 is stored in the result register 17, and the AXB operation ends. In this mode, high-precision multiplication is possible because carries can be propagated correctly, whereas multiplication in high-performance mode does not propagate carries.

いままで説明してきた高性能モードと高精度モードの乗
算をモード切換手段例えば、命令オペランドによって切
換えることにより、混在している高速で比較的精度のよ
い乗算と比較的高速で精度の良い乗算とを実行できる。By switching the multiplication between the high-performance mode and the high-precision mode described so far using a mode switching means, for example, an instruction operand, it is possible to switch between the mixed high-speed and relatively accurate multiplication and the relatively high-speed and high-precision multiplication. Can be executed.

〔Effect of the invention〕

本発明は、高性能モードと高精度モードの乗算をモード
切換手段で切換え可能にしたことにより、従来のパイプ
ライン化乗算回路の少ない金物量で高性能であるという
利点を生かし、さらに精度に問題があり精度の要求され
るデータ処理への利用が可能になった。The present invention makes it possible to switch between high-performance mode and high-accuracy mode multiplication using a mode switching means, thereby taking advantage of the advantages of conventional pipelined multiplier circuits, such as high performance with a small amount of hardware, while also having problems with accuracy. This makes it possible to use it for data processing that requires precision.

[Brief explanation of drawings]

第１図は従来のパイプライン化乗算回路の構成を示すブ
ロック図。第２図は第１図に示した部分積群作成回路３部分の詳細
ブロック図。第３図は第１図に示した多入力桁上げ保存加算器の詳細
ブロック図。第４図は本発明の実施例回路の構成を示すブロック図。第５図は第４図の部分積群作成回路１３の詳細回路図。第６図は第４図の所要結果作成手段１６の詳細なブロッ
ク図。１、ＩＯ・・・被乗数レジスタ、２．１１・・・乗数レ
ジスタ、３．１３・・・部分積群作成回路、４．１５・
・・多入力桁上げ保存加算器、５・・・最終相レジスタ
、６・・・最終桁上げレジスタ、７・・・桁上げ先見加
算器、８．１７・・・結果レジスタ、９．１訃・・制御
回路、１２・・・演算乗数選択手段、１４・・・上位部
分積群局所クリア手段、１６・・・所要結果作成手段、
１９・・・モード切換手段。第　３ｒｉＵＭ　４　回＋０００　＋２００篤　５　図Ｍ　６　ロFIG. 1 is a block diagram showing the configuration of a conventional pipelined multiplication circuit. FIG. 2 is a detailed block diagram of three parts of the partial product group generation circuit shown in FIG. 1. FIG. 3 is a detailed block diagram of the multi-input carry save adder shown in FIG. 1. FIG. 4 is a block diagram showing the configuration of a circuit according to an embodiment of the present invention. FIG. 5 is a detailed circuit diagram of the partial product group generation circuit 13 of FIG. 4. FIG. 6 is a detailed block diagram of the required result creation means 16 shown in FIG. 4. 1. IO... Multiplicand register, 2.11... Multiplier register, 3.13... Partial product group creation circuit, 4.15.
...Multi-input carry save adder, 5...Final phase register, 6...Final carry register, 7...Carry look ahead adder, 8.17...Result register, 9.1 . . . Control circuit, 12 . . . Operation multiplier selection means, 14 . . . Upper partial product group local clearing means, 16 . . . Required result creation means,
19...Mode switching means. 3rd riU M 4 times +000 +200 Atsushi 5 Figure M 6 Ro

Claims

[Claims]

(1) It is inserted between each output side of the multiplicand register and multiplier register of a pipelined multiplication circuit that multiplies floating-point data by hand, and the input side of the result register, and performs high-performance mode operation and high-precision mode operation. a mode switching means for switching a multiplicand of M (where M is the bit length of the mantissa) bits;
The upper M of the upper partial product group of the upper K of the bit multiplier (however, K is the specified bit length of the high-precision mode multiplication width) and the pinto, and the lower partial product group of the above multiplicand and the lower M- bits of the above multiplier
- +N (However, N is the specified value of multiplication accuracy in high performance mode)
a partial product group creation circuit that creates a focused part; a multi-input carry save adder that adds the outputs of this partial product group creation circuit and outputs the final phase and final carry; An upper partial product group local clearing means that sets the summation output of the partial product group of the bit part from M+N+1 to M+ to zero, a lower partial product group clearing means that makes the summation output of the lower partial product group zero, and the above-mentioned carry save addition. By taking the output of the device as input and repeating the multiplication of the M-bit multiplicand by the bit multiplier,
1. A pipelined multiplier circuit with a mode, comprising a required result creation means for outputting a required operation result.