JPH06282418A

JPH06282418A - Arithmetic unit

Info

Publication number: JPH06282418A
Application number: JP5071326A
Authority: JP
Inventors: Yoshinori Miki; 義則三木; Toshio Miki; 俊雄三木; Tomoyuki Oya; 智之大矢; Yukihiko Okumura; 幸彦奥村
Original assignee: Nippon Telegraph and Telephone Corp; NTT Mobile Communications Networks Inc
Current assignee: NTT Docomo Inc; Nippon Telegraph and Telephone Corp
Priority date: 1993-03-30
Filing date: 1993-03-30
Publication date: 1994-10-07

Abstract

PURPOSE: To allow the output of a multiplier to be used as an input to the same multiplier immediately in the next machine cycle. CONSTITUTION: In an arithmetic unit that cascades a multiplier 1, which has two inputs and an output latch 4, with a two-input data ALU circuit 5 and performs data transfer, multiplication, and ALU operations in a pipelined manner, the output latch 4 of the multiplier 1 is connected to the selector 7 on one input of the data ALU circuit 5 and also to the selectors 2 and 3 on the two inputs of the multiplier 1, so that either the register file 8 or the output latch 4 of the multiplier 1 can be selected as an input to the multiplier 1. When products of three, four, five, or more terms are computed in succession, the output of the output latch 4 can be supplied directly to the inputs of the multiplier 1 through the selectors 2 and 3, and the number of execution steps can be greatly reduced.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates generally to processors that process electrical signals, and more particularly to processors that perform digital signal processing such as speech coding and image coding, that is, digital signal processors (hereinafter referred to as DSPs).

[0002]

[Prior Art] The main (most frequently occurring) operation in digital signal processing, including the coding and filtering of speech and images, is the sum-of-products (multiply-accumulate) operation. FIG. 3 shows a typical configuration of the core of the arithmetic circuit of a conventional DSP. This example has two data buses, so that two data items can be transferred simultaneously between the memory 10 and the register file 8. As shown in the figure, to perform the sum-of-products operation at high speed, the circuit has, in addition to a data ALU circuit (OMPY) 5 with two inputs, a dedicated multiplier (OMLT) 1 with two inputs and an output latch 4. The multiplier 1 and the ALU circuit 5 are connected in cascade so that data transfer, multiplication, and ALU operations are executed in a pipelined manner.

[0003] The two inputs of the data ALU circuit 5 have selectors 6 and 7, respectively, and its output is sent to the register file 8. The two inputs of the multiplier 1 likewise have selectors 2 and 3, respectively. The output of the output latch 4 of the multiplier 1 is sent to one input of the data ALU circuit 5 through the selector 7. In the configuration shown in FIG. 3, the procedure for computing the sum of products x1 × y1 + x2 × y2 + ... + xN × yN for xn and yn (n = 1, 2, 3, ..., N) stored in the memory 10 and returning the result to the memory 10 is as follows. Each line corresponds to one step.

[0004]
R1 = x1; R2 = y1; (1.1)
OMLT = R1 × R2; R1 = x2; R2 = y2; (1.2)
For n = 3, ..., N, repeat the following one step:
    R3 = R3 + OMPY; OMLT = R1 × R2; R1 = xn; R2 = yn; (1.3)
R3 = R3 + OMPY; OMLT = R1 × R2; (1.4)
R3 = R3 + OMPY; (1.5)
R3 → MEM; (1.6)
That is, (N + 3) steps are required.

[0005] Here, = denotes the transfer of a value into registers R1, R2, ... of the register file 8, × denotes an operation by the multiplier, and → denotes the transfer of a result held in a register to the memory (MEM) 10. In step (1.3), the product x(n-2) × y(n-2) is added into register R3, the multiplication x(n-1) × y(n-1) is executed at the same time, and xn and yn are simultaneously transferred from the memory 10. This is the flow of pipeline processing for the sum-of-products operation on a DSP.
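The pipelined procedure of steps (1.1) to (1.6) can be sketched as a small Python simulation. This is only an illustrative model, not part of the patent: the function name `mac_conventional` is hypothetical, the multiplier output latch is modelled as a single variable `latch` (the listing reads it as OMPY), and R3 is assumed to start at zero, which the listing does not state. Tuple assignment is used so that every operation within one step reads the values from before that step, mimicking operations that execute in parallel.

```python
def mac_conventional(x, y):
    """Simulate steps (1.1)-(1.6) for sum(x[i] * y[i]).

    Returns (value written to memory, number of steps).
    Assumes len(x) == len(y) >= 2 and that R3 starts at 0.
    """
    n_terms = len(x)
    if n_terms != len(y) or n_terms < 2:
        raise ValueError("need two equally long inputs with at least 2 terms")

    steps = 0
    r3 = 0  # assumption: accumulator register cleared beforehand

    # (1.1) load the first operand pair
    r1, r2 = x[0], y[0]
    steps += 1

    # (1.2) multiply the first pair while loading the second pair
    latch, r1, r2 = r1 * r2, x[1], y[1]
    steps += 1

    # (1.3) for n = 3..N: accumulate, multiply, and load in the same step
    for n in range(3, n_terms + 1):
        r3, latch, r1, r2 = r3 + latch, r1 * r2, x[n - 1], y[n - 1]
        steps += 1

    # (1.4) accumulate and perform the last multiplication
    r3, latch = r3 + latch, r1 * r2
    steps += 1

    # (1.5) accumulate the last product
    r3 = r3 + latch
    steps += 1

    # (1.6) write the result back to memory
    mem = r3
    steps += 1

    return mem, steps
```

Calling `mac_conventional([1, 2, 3, 4], [5, 6, 7, 8])` exercises the loop twice; the step counter then matches the N + 3 figure given in the text for N = 4.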

[0006] In the sum-of-products operation x1 × y1 + x2 × y2 + ... + xN × yN above, N multiplications occur. A step count on the order of N therefore means that, apart from the overhead at the start and end of processing, the multiplier 1 is always operating and no cycles are wasted. As is clear from the above description, the conventional DSP circuit configuration shown in FIG. 3 can process sum-of-products operations at high speed through pipelining.

[0007]

[Problem to Be Solved by the Invention] However, speech and image coding involves not only sum-of-products operations but also many operations, such as three-term and four-term products, that use the multiplier repeatedly without using the ALU circuit. For such operations, the conventional configuration shown in FIG. 3 is not necessarily effective. Because the output latch of the multiplier is not connected directly to the inputs of the multiplier, a multiplication result cannot be used as a multiplication operand in the very next machine cycle, and machine cycles that perform only register-to-register transfers are unavoidable.

[0008] This point is explained using the DSP configuration shown in FIG. 3 as an example. Computing N consecutive three-term products of xn, yn, and zn (n = 1, 2, 3, ..., N) stored in the memory and returning the results to the memory requires the following procedure. Each line corresponds to one step.
R1 = x1; R2 = y1; (2.1)
OMLT = R1 × R2; R2 = z1; (2.2)
R1 ← OMLT; (2.3)
For n = 1, 2, 3, ..., N-1, repeat the following three steps.

[0009]
    OMLT = R1 × R2; R1 = x(n+1); R2 = y(n+1); (2.4)
    R1 ← OMLT; OMLT = R1 × R2; R2 = z(n+1); (2.5)
    R1 → MEM; R1 ← OMLT; (2.6)
OMLT = R1 × R2; (2.7)
R1 ← OMLT; (2.8)
R1 → MEM; (2.9)
That is, (3N + 3) steps are required.

[0010] Here, = denotes the transfer of a value into a register, ← denotes the transfer of a value into a register via the data ALU, × denotes an operation by the multiplier, and → denotes the transfer of a result held in a register to the memory. In this example 2N multiplications occur, so if the processing could be executed without waste, a step count on the order of 2N should suffice. However, N + 2 steps appear in which the multiplier cannot be used and only data is transferred, so a step count on the order of 3N is required.
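The conventional three-term-product procedure of steps (2.1) to (2.9) can be traced with a short Python model. This is a sketch under stated assumptions rather than anything specified in the patent: the function name `triple_products_conventional` is hypothetical, the output latch is a single variable `latch`, and tuple assignment stands in for operations that execute in parallel within one step (each reads the values held before the step).

```python
def triple_products_conventional(x, y, z):
    """Simulate steps (2.1)-(2.9) for the products x[i] * y[i] * z[i].

    Returns (values written to memory, total steps, steps that use
    the multiplier). Assumes three equally long, non-empty inputs.
    """
    n_items = len(x)
    if n_items == 0 or not (n_items == len(y) == len(z)):
        raise ValueError("need three equally long, non-empty inputs")

    mem = []
    steps = 0
    mult_steps = 0

    r1, r2 = x[0], y[0]               # (2.1) load x1, y1
    steps += 1
    latch, r2 = r1 * r2, z[0]         # (2.2) x1*y1, load z1
    steps += 1
    mult_steps += 1
    r1 = latch                        # (2.3) move product to R1 via the ALU
    steps += 1

    for n in range(1, n_items):       # n = 1 .. N-1 (x[n] is x(n+1))
        # (2.4) finish product n, load the next x and y
        latch, r1, r2 = r1 * r2, x[n], y[n]
        # (2.5) fetch product n, start product n+1, load the next z
        r1, latch, r2 = latch, r1 * r2, z[n]
        # (2.6) store product n, then fetch the partial product
        mem.append(r1)
        r1 = latch
        steps += 3
        mult_steps += 2

    latch = r1 * r2                   # (2.7) finish the last product
    steps += 1
    mult_steps += 1
    r1 = latch                        # (2.8) move it to R1 via the ALU
    steps += 1
    mem.append(r1)                    # (2.9) store it
    steps += 1

    return mem, steps, mult_steps
```

For three items the model returns a total of 12 steps, of which 6 use the multiplier, in line with the 3N + 3 and 2N figures in the text; the remaining steps only move data.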

[0011] As described above, in a conventional DSP, when executing operations that use only the multiplier repeatedly without using the ALU circuit, such as three-term products of consecutive data, steps occur in which the multiplier cannot be used, that is, steps that only transfer data. This increases the number of steps and therefore lowers the computational efficiency of the processor as a whole.

[0012]

[Means for Solving the Problem] To solve the above problem, the present invention provides, in an arithmetic unit such as a DSP in which a multiplier and an ALU circuit are connected in cascade and high-speed arithmetic processing is performed in a pipelined manner, a path through which the output of the multiplier can be input directly to the inputs of that multiplier, so that the output of the multiplier can be used as an input to the multiplier immediately in the next machine cycle.

[0013]

[Operation] With the present invention, when executing operations that use only the multiplier repeatedly without using the ALU circuit, such as three-term and four-term products, the output of the multiplier can be used as a multiplier input immediately in the next machine cycle. Steps that only transfer data therefore no longer appear, and the increase in the number of steps is suppressed. As a result, the processing capability of the processor as a whole can be markedly improved, particularly when executing operations such as three-term and four-term products.

[0014]

[Embodiments] Embodiments of the present invention are described in detail below with reference to the drawings. FIG. 1 is a circuit configuration diagram showing a first embodiment of the arithmetic unit according to the present invention. It shows the invention applied to a DSP that, like the DSP shown in FIG. 3, has two data buses and can transfer two data items simultaneously between the memory 10 and the register file 8. Components and elements corresponding to those in FIG. 3 are therefore given the same reference numerals, and their description is omitted unless necessary.

[0015] In this embodiment, the output latch 4 of the multiplier 1 is connected to the selector 7 on one input of the data ALU circuit 5 and, at the same time, directly to the selectors 2 and 3 on the two inputs of the multiplier 1. The output of the output latch 4 can thus be supplied directly to the two inputs of the multiplier 1 through the selectors 2 and 3, so that the output of the multiplier 1 can be used as an input to the multiplier 1 immediately in the next machine cycle.

[0016] That is, a path is provided from the output latch 4 of the multiplier 1 to each of the two inputs of the multiplier 1, so that either the register file 8 or the output latch 4 can be selected as an input to the multiplier 1. With the configuration of this embodiment shown in FIG. 1, computing N consecutive three-term products of xn, yn, and zn (n = 1, 2, 3, ..., N) stored in the memory and returning the results to the memory requires the following procedure. Each line corresponds to one step.

[0017]
R1 = x1; R2 = y1; (3.1)
OMLT = R1 × R2; R2 = z1; (3.2)
OMLT = OMLT × R2; R1 = x2; R2 = y2; (3.3)
For n = 2, 3, ..., N, repeat the following two steps:
    R1 = OMLT; OMLT = R1 × R2; R2 = zn; (3.4)
    R1 → MEM; OMLT = OMLT × R2; R1 = x(n+1); R2 = y(n+1); (3.5)
That is, (2N + 1) steps suffice. Compared with the conventional configuration, this reduces the number of steps by at least N, the number of products.
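The effect of the feedback path in steps (3.1) to (3.5) can be traced with the following Python sketch. It is an illustrative model with several assumptions that are not in the patent: the function name `triple_products_feedback` is hypothetical; the loads of x(n+1) and y(n+1) are skipped once the data is exhausted; and, because the listing read literally ends with the last product still in the output latch, two hypothetical drain steps are appended to write it to memory. The listed step count and the count including the assumed drain steps are reported separately.

```python
def triple_products_feedback(x, y, z):
    """Simulate steps (3.1)-(3.5) with the latch fed back to the multiplier.

    Returns (values written to memory, steps in the listing,
    steps including two assumed drain steps).
    Assumes three equally long, non-empty inputs.
    """
    n_items = len(x)
    if n_items == 0 or not (n_items == len(y) == len(z)):
        raise ValueError("need three equally long, non-empty inputs")

    mem = []
    steps = 0

    r1, r2 = x[0], y[0]                 # (3.1) load x1, y1
    steps += 1
    latch, r2 = r1 * r2, z[0]           # (3.2) x1*y1, load z1
    steps += 1
    latch = latch * r2                  # (3.3) latch fed straight back
    if n_items > 1:
        r1, r2 = x[1], y[1]             #       load x2, y2 in the same step
    steps += 1

    for n in range(2, n_items + 1):     # n = 2 .. N
        # (3.4) fetch product n-1, start product n, load zn
        r1, latch, r2 = latch, r1 * r2, z[n - 1]
        steps += 1
        # (3.5) store product n-1, finish product n via the feedback path
        mem.append(r1)
        latch = latch * r2
        if n < n_items:                 # assumption: no load past the data
            r1, r2 = x[n], y[n]
        steps += 1

    listed_steps = steps

    # Assumed drain, not in the listing: move the last product out and store it.
    r1 = latch
    steps += 1
    mem.append(r1)
    steps += 1

    return mem, listed_steps, steps
```

In this model no step inside the loop is spent purely on moving a product back to a multiplier input, which is the saving the text attributes to the direct path from the output latch 4 to the selectors 2 and 3.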

[0018] Thus, according to this embodiment, when executing operations that use only the multiplier 1 repeatedly without using the ALU circuit 5, such as three-term and four-term products, the output of the multiplier 1 can be used as an input to the multiplier 1 immediately in the next machine cycle. Steps that only transfer data no longer appear, and the increase in the number of steps is suppressed. The processing capability of the processor as a whole can therefore be markedly improved.

[0019] Next, a second embodiment of the present invention is described with reference to FIG. 2. In the first embodiment, the output latch 4 of the multiplier 1 is connected directly to the selectors 2 and 3 on the two inputs of the multiplier 1 to provide a path from the output latch 4 to the inputs of the multiplier 1. In this embodiment, as shown in FIG. 2, the output of the multiplier 1 is connected directly to the register file 8, and the register file 8 is given the function of latching the output of the multiplier 1, so that the output of the multiplier 1 can be supplied directly to the two inputs of the multiplier 1 through the selectors 2 and 3.

[0020] That is, in this embodiment, supplying the output of the multiplier 1 directly to the register file 8 removes the need for an output latch in the multiplier 1, while still allowing the output of the multiplier 1 to be used as an input to the multiplier 1 immediately in the next machine cycle. With the configuration of this embodiment shown in FIG. 2, computing N consecutive three-term products of xn, yn, and zn (n = 1, 2, 3, ..., N) stored in the memory and returning the results to the memory requires the following procedure. Each line corresponds to one step.

[0021]
R1 = x1; R2 = y1; (4.1)
OMLT = R1 × R2; R2 = z1; (4.2)
For n = 2, 3, ..., N, repeat the following two steps:
    OMLT → MEM; OMLT = R1 × R2; R2 = zn; (4.3)
    OMLT = OMLT × R2; R1 = x(n+1); R2 = y(n+1); (4.4)
That is, 2N steps suffice.
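The step counts quoted in the text for N consecutive three-term products (3N + 3 for the conventional configuration, 2N + 1 for the first embodiment, and 2N for the second) can be compared side by side with a small helper. The function name `step_counts` and the dictionary keys are hypothetical; the formulas are simply the closed-form counts stated in the text, not the result of simulating the hardware.

```python
def step_counts(n_items):
    """Return the step counts stated in the text for N three-term products."""
    if n_items < 1:
        raise ValueError("n_items must be at least 1")
    return {
        "conventional": 3 * n_items + 3,      # steps (2.1)-(2.9)
        "first_embodiment": 2 * n_items + 1,  # steps (3.1)-(3.5)
        "second_embodiment": 2 * n_items,     # steps (4.1)-(4.4)
    }


def steps_saved(n_items):
    """Steps saved by each embodiment relative to the conventional count."""
    counts = step_counts(n_items)
    base = counts["conventional"]
    return {
        "first_embodiment": base - counts["first_embodiment"],
        "second_embodiment": base - counts["second_embodiment"],
    }
```

By these formulas the saving grows linearly with N (N + 2 steps for the first embodiment and N + 3 for the second), so the relative gain approaches one third of the conventional step count for long data sequences.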

[0022] Thus, in this embodiment as well, when executing operations that use only the multiplier 1 repeatedly without using the ALU circuit 5, such as three-term and four-term products, the output of the multiplier 1 can be used as an input to the multiplier 1 immediately in the next machine cycle. Steps that only transfer data no longer appear, and the number of execution steps can be greatly reduced. The processing capability of the processor as a whole can therefore be markedly improved.

[0023] The above embodiments describe the case of computing three-term products in succession. Needless to say, the present invention is not limited to three-term products and can equally be applied to computing multi-term products such as four-term and five-term products in succession, with the same effects. The present invention can also be applied to an apparatus that uses the data bus in a time-division manner, and to an apparatus in which a shifter is inserted between the output latch of the multiplier and the input to the data ALU circuit. Various other modifications and changes can of course be made as necessary.

[0024]

[Effects of the Invention] As described above, according to the present invention, in an arithmetic unit such as a DSP in which a multiplier and an ALU circuit are connected in cascade and high-speed arithmetic processing is performed in a pipelined manner, a path is provided through which the output of the multiplier can be input directly to the inputs of that multiplier, so the output of the multiplier can be used as an input to the multiplier immediately in the next machine cycle. Consequently, in arithmetic processing that uses only the multiplier repeatedly, steps that only transfer data can be eliminated. The number of execution steps can thus be greatly reduced and the processing capability of the processor as a whole markedly improved. The effect is particularly pronounced in the successive computation of three-term products, four-term products, and so on.

[Brief description of drawings]

FIG. 1 is a circuit configuration diagram showing a first embodiment of the arithmetic unit according to the present invention.

FIG. 2 is a circuit configuration diagram showing a second embodiment of the arithmetic unit according to the present invention.

FIG. 3 is a circuit configuration diagram showing an example of a conventional arithmetic unit.

Continuation of front page: (72) Inventor Yukihiko Okumura, c/o NTT Mobile Communications Networks Inc, 2-10-1 Toranomon, Minato-ku, Tokyo

Claims

1. An arithmetic unit comprising: a multiplier having two inputs, each provided with a selector, and having an output latch; an ALU circuit having two inputs, each provided with a selector; and a register file that has a plurality of registers and an input/output interface to a data bus, receives the output of the ALU circuit as an input, and can supply the value of any register as an input to the multiplier and to the ALU circuit through the respective selectors of the multiplier and the ALU circuit, characterized in that a path is provided from the output latch of the multiplier to each selector of the multiplier, so that either the register file or the output latch of the multiplier can be selected as an input to the multiplier.

2. An arithmetic unit comprising: a multiplier having two inputs, each provided with a selector; an ALU circuit having two inputs, each provided with a selector; and a register file that has a plurality of registers and an input/output interface to a data bus, receives the output of the ALU circuit as an input, and can supply the value of any register as an input to the multiplier and to the ALU circuit through the respective selectors of the multiplier and the ALU circuit, characterized in that a path is provided that gives the register file a function of latching the output of the multiplier, so that the output of the multiplier latched in the register file can be selected as an input to the multiplier.