JPS6320536A

JPS6320536A - Information processor

Info

Publication number: JPS6320536A
Application number: JP61164748A
Authority: JP
Inventors: Hiroshi Matsumoto; 寛松本
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1986-07-15
Filing date: 1986-07-15
Publication date: 1988-01-28
Anticipated expiration: 2011-06-05
Also published as: JP2503984B2

Abstract

PURPOSE:To execute the processing at a high speed by executing preferentially that which requires a result as soon as possible, in case of a competition between a computing element output in the same computing element and an added pipeline use register output, or otherwise, executing preferentially an output from the latter stage of the pipeline use register. CONSTITUTION:At the time of a selective control of a switching circuit 12, when there is a competition of outputs of the maximum 4 cases of a result of an adder and subtracter circuit system, which is to be executed preferentially is determined, and unless that which desires to output a result as soon as possible especially exists, the degree of priority becomes higher as to that of the latter stage of the pipeline. Also, when that which desires to output the result as soon as possible especially exists, for instance, when an instruction for using its result as an operand is waiting for the execution, the result is stored in an arithmetic operation register 16 immediately. Unless it is read out and executed, a fall of the performance is caused, therefore, even when other result is not transferred yet to the arithmetic operation register 16 but left in the pipeline use register of the latter stage side than said one, this result overruns it and stored in the arithmetic operation register 16 if it is allowed. A switching circuit 13 is also constituted in the same way as the switching circuit 12.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明はパイプライン化した情報処理装置に関し、特に
複数の命令を並列に実行するパイプライン化した情報処
理装置に関する。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a pipelined information processing device, and particularly to a pipelined information processing device that executes a plurality of instructions in parallel.

[Conventional technology]

一般に、高性能を追求した情報処理装置に於ては、独立
した各糧演算器をノにイブライン化し、同種の演算につ
いては、同じ演算器だ対して連続的にオペランドを送シ
込んでノやイグライン処理し。In general, in information processing equipment that pursues high performance, each independent computing unit is made into an online system, and operands are continuously sent to the same computing unit for similar operations. Igraine processing.

異なった演算については別の演算器に対して連続的にオ
ペランドを送り込んで並列に処理することが行なわれて
いる。また、各種演算器は、性能向上の為、それぞれ最
も少ないパイプライン段数で実現している。For different operations, operands are continuously sent to different arithmetic units and processed in parallel. In addition, each type of arithmetic unit is realized with the least number of pipeline stages to improve performance.

[Problem that the invention seeks to solve]

ところが、同一演算器については、・クイシライン処理
の為、演算結果が入力順に順次出力されるが・異なった
演算器からの結果の出力は、各演算器へのオペランドの
投入タイミング、各演算器のパイプライン段数の違いに
ニジ、同一タイミングに競合する可能性がある。演算結
果は一般にソフトウェア可視レジスタ群（以下、演算し
・ゾスタと呼ぶ）の中の−って格納されるが、これは普
通メモリ化されている為、同一タイミングに複数の結果
を格納することはできない。However, for the same arithmetic unit, - due to quisi-line processing, the arithmetic results are output sequentially in the order of input, - results from different arithmetic units are output depending on the input timing of operands to each arithmetic unit, and the timing of inputting operands to each arithmetic unit. Due to the difference in the number of pipeline stages, there may be conflicts at the same timing. The calculation results are generally stored in the software visible register group (hereinafter referred to as the calculation register), but since this is usually stored in memory, it is not possible to store multiple results at the same timing. Can not.

これを避ける為、各演算器のパイプラインの段数から逆
算して、結果の出力タイミングが競合する場合には、各
演算器へのオペランドの投入タイミング、即ち命令の実
行起＠を適当に遅らせている。命令の実行起動が、与え
られた命令列の順にしか出来ない制御方式を採用して込
る装置では。To avoid this, calculate backwards from the number of pipeline stages of each arithmetic unit, and if there is a conflict in the output timing of the result, appropriately delay the input timing of operands to each arithmetic unit, that is, the start of instruction execution. There is. In devices that employ a control method that allows instructions to be executed only in the order of a given instruction sequence.

ｌ命令の実行起動を遅らせることによシ後続の命令も順
次遅れることになシ、性能の低下を招いている。By delaying the start of execution of the 1 instruction, subsequent instructions are also sequentially delayed, resulting in a decrease in performance.

[Means for solving problems]

本発明の情報処理装置はパイプライン化した機能の異な
る演算器を複数個持り清報処理装置において、各演算に
最低必要な・ｇイブライン段数が異なる場合、各演算器
て対してその後段に装置結果の転送のみを行なう・ぐイ
ゾライン用のレジスタを各演算系ノ？イブラインの段数
が等しくなるように必要な段数だけ付加し、前記付加し
たノ４イデライン用のレジスタの任意のレジスタからも
演算結果を出力出来る切替手段と、全結果出力のうちか
ら１つを選択しようとして同−演算器知於ける演算器出
力及び付加したパイプライン用レジスタ出力の間で競合
する場合には、結果を早く必要とするものがあればこれ
を優先し、さもなければパイプライン用レジスタのよゆ
後段からの出力を優先させる制御手段とを含んで構成さ
れる。The information processing device of the present invention has a plurality of pipelined arithmetic units with different functions, and when the minimum number of stages required for each operation is different, the following stage is used for each arithmetic unit. Is there a register for each calculation system that only transfers device results? Add the necessary number of stages so that the number of stages of the ideal line is equal, and select one of the switching means that can output the operation result from any register of the added registers for the ideal line, and the output of all results. If there is a conflict between the arithmetic unit output of the same arithmetic unit and the added pipeline register output, if there is one that requires the result quickly, this will take priority, otherwise the pipeline register and a control means for prioritizing the output from the subsequent stage.

〔Example〕

次に本発明について図面を参照して説明する。 Next, the present invention will be explained with reference to the drawings.

第１図及び第２図は本発明の一実施例のブロック図及び
その一部詳細図である。第１図に於て。1 and 2 are block diagrams and partially detailed views of one embodiment of the present invention. In Figure 1.

乗算回路１．加減算回路２．論理演算回路３はそれぞれ
７．各段が１マシンサイクルで動作・するパイプライン
化された演算器であシ、１マシンサイクル毎に異なった
第４ランドの演算をノクイゾライン式に処理することが
出来る。また各演算に必要なオペランドが入力オベラン
ドノクス１００１，１００２で与えられると、それぞれ
７マシンサイクル、４マシンサイクル、２マシンサイク
ル後忙演算結果を出力することが出来る。Multiplication circuit 1. Addition/subtraction circuit 2. Each logic operation circuit 3 has 7. Each stage is a pipelined arithmetic unit that operates in one machine cycle, and it is possible to process a different fourth land calculation every machine cycle in a noquizoline manner. Furthermore, when the operands necessary for each operation are given through the input obelandnoxes 1001 and 1002, the busy operation results can be output after 7 machine cycles, 4 machine cycles, and 2 machine cycles, respectively.

加減算回路２及び論理演算回路３のノやイブライン段数
はそれぞれ４段、２段であシ２乗算回路ｌ！多のパイプライン段数１７段であるから、その差分。The number of stages of the addition/subtraction circuit 2 and logic operation circuit 3 is 4 stages and 2 stages, respectively. The number of pipeline stages is 17, so there is a difference.

つま）３段及び５段分の／ぐイブライン用レジスタ４〜
６，７〜１１が加減算回路２及び論理演算回路３に接続
されている。(Tsum) 3-stage and 5-stage /give line registers 4~
6, 7 to 11 are connected to the addition/subtraction circuit 2 and the logic operation circuit 3.

切替回路１２は加減算回路２の出力、パイプライン用レ
ジスタ４〜６の出力のうち、いずれかを選択して出力す
る回路であシ、切替回路１３は論理演算回路３の出力、
ツクイブライン用レジスタ７〜１１の出力のうち、いず
れかを選択して出力する回路である。また切替回路１４
は乗算回路１゜切替回路１２．１３の出力のうちいずれ
かを選択して、演算レジスタ１６への書込み・ぐス１０
０３に出力する回路である。The switching circuit 12 is a circuit that selects and outputs either the output of the addition/subtraction circuit 2 or the output of the pipeline registers 4 to 6, and the switching circuit 13 selects and outputs the output of the logic operation circuit 3.
This circuit selects and outputs one of the outputs of the twin line registers 7 to 11. Also, the switching circuit 14
selects one of the outputs of the multiplier circuit 1° switching circuit 12 and 13 and writes it to the calculation register 16.
This is a circuit that outputs to 03.

本実施例には乗算回路ｌにはパイプライン用レジスタを
設けていないが、これは説明を簡単にする為であシ、加
減算回路２．論理演算回路３と同様に後段にパイプライ
ン用レジスタを適当な段数だけ設けても良い。この場合
乗算回路に設けたパイプライン用レジスタの段数だけ、
加減算回路。In this embodiment, the multiplier circuit 1 is not provided with a pipeline register, but this is for the purpose of simplifying the explanation. Similar to the logic operation circuit 3, an appropriate number of pipeline registers may be provided at the subsequent stage. In this case, the number of stages of pipeline registers provided in the multiplication circuit is
Addition/subtraction circuit.

論理演算回路のパイプライン用レジスタの段数を共に増
やすことになる。This also increases the number of stages of pipeline registers in the logical operation circuit.

制御回路１５は切替回路１２，１３．１４の選択を制御
する回路である。まず切替回路１４の選択制御は、各演
算回路系の間で結果の出力の競合が発生したときにどの
演算回路系からの出力を優先するかを決めるものであシ
、優先順は乗算回路系、加減算回路系、論理演算回路系
の順である。The control circuit 15 is a circuit that controls selection of the switching circuits 12, 13, and 14. First, the selection control of the switching circuit 14 is to determine which arithmetic circuit system will give priority to the output when a conflict occurs in the output of the result between the arithmetic circuit systems, and the priority order is the multiplier circuit system. , addition/subtraction circuit system, and logic operation circuit system.

これは例えば乗算回路にはツクイブライン用レジスタが
無いので結果が求まった時点ですぐに演算レジスタ１６
に転送しなければ結果が失なわれてしまうので最優先に
するものであシ、以下同様の考え方で演算に最低必要な
パイプライン段数が多込ものをより優先的に巳ている。For example, the multiplier circuit does not have a register for the multiplier line, so as soon as the result is obtained, the arithmetic register 16
Since the result will be lost if it is not transferred, it is given the highest priority.The same idea is used to give priority to the one with the minimum number of pipeline stages required for the calculation.

次だ切替回路１２の選択制御は、加減算回路系の結果の
最大４ケースの出力の競合がある場合に。The selection control of the next switching circuit 12 is performed when there is a maximum of four cases of conflicting outputs of the results of the addition/subtraction circuit system.

どれを優先するかを決めるものであり、特に結果を早く
出力したいものが無い場合は、優先度はパイプラインの
より後段のものを高くしている。これは例えば最後段の
・卆イゾライン用レジスタ６に結果がある場合は、これ
を優先して演算レジスタ１６に転送しなければ、結果が
失なわれてしまうので最優先にするものであシ、以下同
様の考え方で後段側をよシ優先度を高くしている。また
１％に結果を早く出力したいものがある場合２例えばそ
の結果をオペランドとして使う命令が実行待ちになって
いる場合にはすぐに結果を演算レジスタ１６に格納し、
読み出して実行に入らなければ性能の低下を招くので、
これより後段側のパイプライン用レジスタに別の結果が
まだ演算レジスタ１６に転送されなくて残っている場合
でも、差しつかえない限シこれを追い越して演算レジス
タ１６に格納するという制御を行なっている。切替回路
１３についても切替回路１２と同様である。It is used to decide which one to give priority to, and if there is no particular result that you want to output quickly, the priority is given to the one in the later stages of the pipeline. For example, if there is a result in the last-stage isoline register 6, it must be given priority and transferred to the arithmetic register 16, otherwise the result will be lost, so it should be given top priority. Below, using the same idea, higher priority is given to the later stages. In addition, if there is something that you want to output the result quickly in 1%2, for example, if an instruction that uses the result as an operand is waiting for execution, immediately store the result in the calculation register 16,
If it is not read and executed, performance will deteriorate, so
Even if another result remains in the pipeline register on the downstream side without being transferred to the arithmetic register 16, control is performed to overtake this result and store it in the arithmetic register 16 as far as possible. . The switching circuit 13 is also similar to the switching circuit 12.

フリツノフロップ１０１〜１０７．２０１〜２０７．３
０１〜３０７はそれぞれ乗算回路系。Fritsuno flop 101~107.201~207.3
01 to 307 are multiplication circuit systems, respectively.

加減算回路系、論理演算回路系の各パイプラインステー
ジだ有効データが存在することを表わす有効ビットすな
わち有効フラッグを表わすものであシ、これと制御回路
１５の詳細を示すと第２図のようになっている。Each pipeline stage of the addition/subtraction circuit system and the logic operation circuit system represents a valid bit, that is, a valid flag, which indicates the presence of valid data.The details of this and the control circuit 15 are shown in FIG. It has become.

第２図は主に加減算回路２に注目して詳細が記されてい
る。加減算回路２にオＲランドが与えられると、・ヤイ
プライン式に演算が進むにつれ、フリップフロップ２０
１，２０２，２０３が順に１１となシ、演算結果が求ま
ると同時にフリップフロラｆ２０４が１”になる。この
時に求まった結果を演算レジスタ１６に転送できる場合
は、転送が行なわれてしまうため、フリップフロップ２
０４は％ｌ”→＠０”になう、フリップフロップ２０５
は１にならずに終了する。他との競合があり優先度が低
くて転送できない場合は、結果をパイプライン用レジス
タ４に入れると同時にフリップ７０ノブ２０４は１１”
→“０″、フリップフロップ２０５は′０ｍ−４″ｌ”
になシ、結果が演算レジスタ１６に転送されずにノ？イ
ブライン用レジスタ４に移ったことを示す。以下ツクイ
ブライン用レジスタ５，６゜フリップフロラｆ２０６，
２０７も同様である。FIG. 2 mainly focuses on the addition/subtraction circuit 2 and describes the details thereof. When OR land is given to the adder/subtracter circuit 2, as the operation progresses in the Yaipline style, the flip-flop 20
1, 202, and 203 become 11 in order, and at the same time as the calculation result is obtained, the flip roller f204 becomes 1''.If the result obtained at this time can be transferred to the calculation register 16, the transfer will be performed. flip flop 2
04 becomes %l"→@0", flip-flop 205
ends without becoming 1. If there is a conflict with another and the priority is low and cannot be transferred, the result is put into the pipeline register 4 and at the same time the flip 70 knob 204 is set to 11".
→"0", flip-flop 205 is '0m-4"l"
No, the result is not transferred to the calculation register 16? Indicates that the data has moved to register 4 for Eveline. Below are the Tsukuibline registers 5, 6° flip flora f206,
The same applies to 207.

格納アドレスレジスタ２２１〜２２６は加戒算結果を演
算レジスタ１６に格納する場合の格納アドレスを持ち回
るレジスタであり、オー？ランドアドレスレジスタ２３
１．２３２はそれぞれ今から演算を開始しようとしてい
る命令の２オぜランドの演算レジスタアドレスを保持す
るレジスタで。The storage address registers 221 to 226 are registers that rotate the storage address when storing the result of addition and subtraction in the calculation register 16. Land address register 23
1.232 is a register that holds the 2 Oz operation register address of the instruction that is about to start operation.

現在実行中の命令の結果を待っている場合は実行忙入れ
ずこのレジスタに保持され続けるものである。When waiting for the result of an instruction currently being executed, it continues to be held in this register regardless of execution.

オ被ランドアドレスレジスタ２３１．２３２の内容は格
納アドレスレジスタ２２６，２２５゜２２４と比較され
、一致するものがあれば各・ンイブラインステージ対応
に一致信号１６０１，１６０２゜１６０３が”１＃とな
る。これらの信号から優先度変更信号１７０１．１７０
２が作られる。有効ビット２０１〜２０７に対応するノ
Ｊ？イブラインステージをＡ−Ｇと呼ぶと、この論理テ
ーブルは次の様になっている。The contents of the land address registers 231 and 232 are compared with the storage address registers 226, 225, and 224, and if there is a match, the match signals 1601, 1602, and 1603 become "1#" for each blank stage. .Priority change signal 1701.170 from these signals
2 is made. NoJ? corresponding to valid bits 201-207? If the Eveline stages are called A-G, this logical table is as follows.

つまシＦステージにある演算結果よりり、Ｅステージに
ある結果が早く欲しい場合は、Ｆステージの有効ビット
を抑える信号１７０１を０”にし、Ｅステージにある演
算結果よシロステージにある結果が早く欲しい場合はＥ
ステージの有効ビラトラ抑える信号１７０２を′０＃に
する論理となっている。If you want the result in the E stage to be faster than the result in the front F stage, set the signal 1701 that suppresses the valid bit of the F stage to 0'', so that the result in the white stage is faster than the result in the E stage. E if you want
The logic is such that the signal 1702 for suppressing the effective playback of the stage is set to '0#.

切替回路１２の制御は、有効ビットをあられすフリッグ
フロッｆ２０４〜２０７と優先度変更信号１７０１，１
７０２　、加減算回路系の結果を優先して出力できるこ
とを指示する信号１４０２１’ｌ：よって行なわれる。The switching circuit 12 is controlled by flip-flops f204 to 207 that change the valid bits and priority change signals 1701 and 1.
702, signal 14021'l instructing that the result of the addition/subtraction circuit system can be output with priority: Therefore, this is done.

先ず優先度変更信号１７０１，１７０２か共に“１″の
場合、つまシ便先度変更が無い場合についてみると、信
号１４０２か“１＃であればフリップフロップ２０７．
２０６．２０５．２０４の順に見て例えば最初に１″の
あるフリップフロップが２０６であるとするとこれに対
応するパイプライン用レジスタ５の結果を演算レジスタ
１６に転送するため、　ＡＮＤ’ｌ’−）　２１７〜２
１４０うち対応する椰ゲート２１６の出力信号１２０６
が１１”となり、切替回路１２ではパイプライン用し・
ゾスタ５が選択されて結果の転送が行なわれることにな
る。First, if the priority change signals 1701 and 1702 are both "1" and there is no priority change, if the signal 1402 is "1#", the flip-flop 207.
206, 205, and 204. For example, if the first flip-flop with 1" is 206, then in order to transfer the result of the corresponding pipeline register 5 to the operation register 16, AND'l'-) 217-2
140, the output signal 1206 of the corresponding palm gate 216
becomes 11", and the switching circuit 12
Zostar 5 will be selected and the results will be transferred.

信号１４０２が”０′であれば加減算回路系の結果は演
算レジスタ１６に転送できないのでＡＮＤ）ｒ′″−ト
２１７〜２１４のいずれの出力（１２０７〜１２０４）
もＯ”となシ、従って信号１３０６〜１３０４は全て″
１”となるので７リツプフロツプ２０４〜２０６の内容
がそのままフリップフロップ２０５〜２０７に移される
。If the signal 1402 is "0", the result of the addition/subtraction circuit system cannot be transferred to the arithmetic register 16, so AND) r'" - whichever output from 217 to 214 (1207 to 1204)
Also, the signals 1306 to 1304 are all "O".
1'', the contents of the 7 flip-flops 204-206 are transferred as they are to the flip-flops 205-207.

次に優先度変更がある場合についてみると５例えばフリ
ップフロップ２０７，２０６，２０５゜２０４のうち２
０６と２０５が”１″であり、格納アドレスレジスタ２
２６の内容とオペランドアドレスレジスタ２３１．２３
２とは一致せず（信号１６０１＝”Ｏ″）、格納アドレ
スレジスタ２２５の内容とオペランドアドレスレジスタ
２３１ｉｕ２３２が一致している（信号１６０２＝“１
”）とすると８次に実行に入ろうとしている命令はＥス
テージにある演算結果を待っている状態であシ、Ｆステ
ージの結果よりも先にＥステージの結果を優先して演算
レジスタ１６に格納した方が待合せ時間が少なくて済む
。この場合、優先度変更信号１７０１＝−”Ｏ＝となる
ことばよって７リツプフロツゾ２０６の出力信号を抑え
、ＡＮＤゲート２１６の出力信号は”Ｏ”、　ＡＮＤダ
ート２１５の出力信号−′１＃となって２．切替回路１
２ではノクイゾライン用レジスタ４が選択され、Ｆステ
ージにある結果よシＥステージにある結果が転送される
ことになる。Next, if we consider the case where there is a priority change, 5, for example, 2 of the flip-flops 207, 206, 205°204
06 and 205 are "1", and storage address register 2
26 contents and operand address registers 231.23
2 does not match (signal 1601="O"), and the contents of storage address register 225 and operand address register 231iu232 match (signal 1602="1").
”), the instruction that is about to be executed next is waiting for the operation result in the E stage, and the result of the E stage is given priority over the result of the F stage and stored in the operation register 16. In this case, the priority change signal 1701=-"O= suppresses the output signal of the 7-lip float 206, the output signal of the AND gate 216 becomes "O", and the AND dart 215 The output signal of -'1# becomes 2. Switching circuit 1
In step 2, the register 4 for the noise line is selected, and the results in the F stage and the results in the E stage are transferred.

この場合追い越されて転送されなかったＦステージ上の
結果はＧステージに移される。In this case, the results on the F stage that have been overtaken and not transferred are transferred to the G stage.

次に切替回路１４の選択制御たついて説明する。Next, selection control of the switching circuit 14 will be explained.

各演算回路系から演算レジスタ１６に転送すべき結果が
１つでも存在することを表わす信号１５０１゜１５０２
．１５０３はそれぞれフリップフロップ１０７の出力、
フリップフロップ２０４〜２０７の出力の論理和、フリ
ップフロップ３０２〜３０７の出力の論理和である。Signals 1501 and 1502 indicating that there is at least one result to be transferred from each arithmetic circuit system to the arithmetic register 16.
．． 1503 are the outputs of the flip-flop 107,
These are the logical sum of the outputs of flip-flops 204 to 207 and the logical sum of the outputs of flip-flops 302 to 307.

この３本の信号を入力し、優先順位つまシ乗算回路系、
加減算回路系、論理演算系の順に従って。Input these three signals, prioritize the multiplier circuit system,
Follow the order of addition/subtraction circuit system and logical operation system.

乗算回路系の出力を選ぶ信号１４０１　、加減算回路系
の出力を選ぶ信号１４０２．論理演算系の出力全還ぶ信
号１４０３を作成し、これによって切替回路１４の選択
制御を行なっている。A signal 1401 for selecting the output of the multiplication circuit system, a signal 1402 for selecting the output of the addition/subtraction circuit system. A signal 1403 that returns all the outputs of the logic operation system is generated, and selection control of the switching circuit 14 is performed using this signal.

第３図は結果待ちを行なわない場合の高速化の一例を説
明する為の命令列である。■の命令は演意味する。Ｏ−
Ｏについても同様である。FIG. 3 is an instruction sequence for explaining an example of speeding up when results are not waited. ■The command means performance. O-
The same applies to O.

第４−Ａ図は、第３図の命令列に於ける従来の装置、即
ち第１図に於ける・ぐイゾライン用レジスタ４〜１１．
切替回路１２．１３が無い場合の命令実行のタイムチャ
ートである。命令の実行起動がマシンサイクル毎に一１
命令で、与えられた命令列順にのみ行なえる場合、■〜
■の命令について？ｉｌマシンサイクルずれて乗算回路
ｌ、加減算回路２．加減算回路２に演算を指示し、それ
ぞれ７サイクル目、４サイクル目、４サイクル目に演算
結果が求まるので演算レジスタ１６への書込ミハス１０
３には■■■の命令類に結果が得られる。FIG. 4-A shows the conventional device in the instruction sequence of FIG.
It is a time chart of instruction execution when there is no switching circuit 12 or 13. Instruction execution starts once per machine cycle
If an instruction can only be executed in the order of the given instruction sequence, ■~
■About the command? il machine cycle offset multiplication circuit 1, addition/subtraction circuit 2. The addition/subtraction circuit 2 is instructed to perform calculations, and the calculation results are obtained in the 7th cycle, the 4th cycle, and the 4th cycle, respectively, so the writing to the calculation register 16 is completed.
In 3, results are obtained for the instructions of ■■■.

結果の命令類が変わってもこの場合何ら問題はない。と
ころが■の命令については、タイミング４から実行開始
させたとすると（第４−Ａ図の点線）。In this case, there is no problem even if the resulting instructions change. However, assuming that the instruction (3) is started to be executed at timing 4 (dotted line in FIG. 4-A).

加減算である為４マシンサイクル後のタイミング７で結
果か求まシ、■の命令の結果が求まるタイミングと一致
し、不具合が生じる。従って■の命令の実行開始を１マ
シンサイクル遅らせ、タイミング５から実行させること
によってこれを避けるようにする（第４−Ａ図の実線）
。■の命令についても同様である。こうして７命令を実
行させると、全体で１６マシンサイクルを必要とする。Since this is an addition/subtraction, the result is found at timing 7 after four machine cycles, which coincides with the timing at which the result of the instruction (2) is found, causing a problem. Therefore, this can be avoided by delaying the start of execution of the instruction (■) by one machine cycle and having it executed from timing 5 (solid line in Figure 4-A).
. The same applies to the instruction (2). Executing seven instructions in this way requires a total of 16 machine cycles.

第４−Ｂ図は第３図の命令列て於ける本発明の装置、即
ち第１図でのタイムチャートである。この場合■〜■の
命令すべてについて、前記の従来の場合とは異なり、演
算結果の競合を気にせず。FIG. 4-B is a time chart of the apparatus of the present invention in the instruction sequence of FIG. 3, ie, in FIG. 1. In this case, for all the instructions from ■ to ■, unlike the conventional case described above, there is no need to worry about conflicts in operation results.

１マシンサイクルずれて実行開始させる。タイミング６
で■と■の命令の結果か競合するが、優先順に従って、
■の命令の結果が切替回路１２゜１４で選択され、演算
レジスタ１６に転送される。Start execution with a one-machine cycle delay. timing 6
The results of the instructions ■ and ■ conflict, but according to the priority order,
The result of the instruction (2) is selected by the switching circuit 12-14 and transferred to the arithmetic register 16.

■の命令の結果は演算レジスタ１６には転送されず、パ
イプライン用レジスタ７に移される。次にタイミング７
では■と■と■と■の命令の結果が競合するが、優先順
に従って■の命令の結果が切替回路１４で選択され、演
算レジスタ１６に転送される。■の命令の結果は・やイ
ブライン用レジスタ４に移され、■、■の命令の結果は
それぞれパイプライン用レジスタ８，７に移される。次
にタイミング８では■と■と■の命令の結果が競合する
が、優先順に従って■の命令の結果がパイプライン用レ
ジスタ４から切替回路１２．１４を通して演算レジスタ
１６に転送される。■と■の命令の結果はそれぞれパイ
プライン用レジスタ９，８に移される。次にタイミング
９では■と■の命令の結果が競合するが、優先ｊ頂に従
って■の命令の結果が・やイブライン用レジスタ９から
切替回路１３．１４を通して演算レジスタ１６に転送さ
れる。■の命令の結果はパイプライン用しノスタ９に啓
される。タイミング１ｏでは競合なく■の命令の結果が
パイプライン用レジスタ９から切替回路１３．１４を通
して演算レジスタ１６に転送される。■の命令の結果は
タイミング１３で競合なく結果の格納か行なわれ、結果
的には７命令の実行に１３マシンサイクルで済むので、
高速化されることが理解されよう。The result of the instruction (2) is not transferred to the operation register 16 but is transferred to the pipeline register 7. Next, timing 7
In this case, the results of the instructions ■, ■, ■, and ■ conflict with each other, but the result of the instruction ■ is selected by the switching circuit 14 and transferred to the arithmetic register 16 according to the priority order. The result of the instruction (2) is moved to the line register 4, and the results of the instructions (2) and (2) are moved to the pipeline registers 8 and 7, respectively. Next, at timing 8, the results of the instructions ■, ■, and ■ conflict, but the result of the instruction ■ is transferred from the pipeline register 4 to the arithmetic register 16 through the switching circuit 12.14 in accordance with the priority order. The results of the instructions (2) and (2) are transferred to pipeline registers 9 and 8, respectively. Next, at timing 9, the results of the instructions ■ and ■ compete with each other, but the result of the instruction ■ is transferred from the line register 9 to the arithmetic register 16 through the switching circuits 13 and 14 in accordance with the priority j. The result of the instruction (2) is used for the pipeline and is disclosed to Nosta 9. At timing 1o, the result of the instruction (3) is transferred from the pipeline register 9 to the operation register 16 through the switching circuits 13 and 14 without any conflict. The result of the instruction (①) is stored without conflict at timing 13, and as a result, it only takes 13 machine cycles to execute 7 instructions.
It will be understood that the speed will be increased.

尚、・ヤイプラインの長さを同じ長さえ合わせているの
は、ノやイブラインの長さがすべて同じであれば１マシ
ンサイクル毎の入力に対し必ず出力の競合なしに１マシ
ンサイクル毎に出力を得ることが出来るためであシ、単
に転送の為のパイプライン用レジスタの任意の出力から
結果を取シ出す構成にしているのは、競合さえなければ
出来るだけ早く、演算レジスタを更新し、この結果をオ
イランドとして使用する命令が後続の命令の中にあれば
早く実行に移らせるためである。Furthermore, the reason why the lengths of the lines are the same is that if the lengths of the lines and lines are all the same, then the input for each machine cycle will always have an output for each machine cycle without contention. This is because the result can be obtained from any output of the pipeline register for transfer.The reason why the result is simply taken from the arbitrary output of the pipeline register for transfer is that the calculation register is updated as soon as possible unless there is a conflict, and this This is because if there is an instruction that uses the result as an oiland in a subsequent instruction, it can be executed quickly.

第５図は結果待ちを行なう場合の一例を説明する為の命
令列である。記述方法は第３図と同様の為、説明は省略
する。FIG. 5 is a sequence of instructions for explaining an example of waiting for results. Since the description method is the same as that in FIG. 3, the explanation will be omitted.

第６−Ａ図は、第５図の命令列に於けるタイムチャート
を優先変更回路が無いと仮定した場合のタイムチャート
であシ、■、■、■の命令は順次タイミング９，１０．
１１で結果の転送が行なわれる。■の命令は■の命令の
結果が演算し・ゾスタ１６へ転送され読み出されて（タ
イミング１２）はじめてタイミング１３から実行開始さ
れる。これに対して９本発明の装置に於ける優先変更回
路【より、第６−Ｂ図のタイムチャートに示すように、
■、■、■の命令の結果の転送はタイミング９でＤステ
ージのアドレス一致信号１６０３が１１ｍ忙なるため■
、■、■の順に行なわれ、■の命令はタイミング１１か
ら実行開始され、高速化が計られているのが理解されよ
う。FIG. 6-A is a time chart for the instruction sequence in FIG. 5 assuming that there is no priority change circuit. The instructions ①, ②, ① are sequentially executed at timings 9, 10, .
At step 11, the results are transferred. The instruction (2) starts execution at timing 13 only after the result of the instruction (2) is calculated, transferred to the ZOSTER 16, and read out (timing 12). In contrast, the priority change circuit in the device of the present invention [9] As shown in the time chart of Fig. 6-B,
The results of the instructions ■, ■, and ■ are transferred because the address match signal 1603 of the D stage is busy for 11 m at timing 9.■
, ■, ■ are executed in this order, and the execution of the instruction (■) starts at timing 11, so that it can be understood that speeding up is achieved.

〔Effect of the invention〕

以上説明したように２本発明はパイプライン化した機能
の異なる演算器を複数個持つ情報処理装置に於て、各演
算に最低必要なツクイブライン段数が異なる場合、各演
算器に対してその後段に単に結果の転送のみを行なうパ
イプライン用のレジスタを必要な段数だけ付加すること
によって各演算系パイプラインの段数を等しくシ、付加
したパイプライン用レジスタの任意のレジスタからも演
算結果を出力出来るように構成し、全納果出カのうちか
ら１つを選択しようとして同一演算器に於はル演算器出
力及び付加したパイプライン用レジスタ出力の間で競合
する場合には、結果を早く必要とするものがあればこれ
を優先し、さもなければ／’Ｐイブライン用レジスタの
より後段からの出方を優先させて出力させることによっ
て必要なマシンサイクルを減少し、動作を高速化させる
効果がある。As explained above, in an information processing device having a plurality of pipelined arithmetic units with different functions, when the minimum number of twig line stages required for each operation is different, the following stage for each arithmetic unit is provided. By adding the necessary number of pipeline registers that simply transfer results to the system, the number of stages in each calculation system pipeline can be made equal, and the calculation result can be output from any of the added pipeline registers. If you try to select one of the full outputs and there is a conflict between the outputs of the two arithmetic units and the outputs of the added pipeline registers in the same arithmetic unit, you may need the result quickly. If there is something to do, give priority to this, otherwise give priority to the output from the later stage of the /'P e-line register, which has the effect of reducing the required machine cycles and speeding up the operation. .

る。Ru.

[Brief explanation of the drawing]

第１図は本発明の一実施例を示すブロック図。第２図は第１図の一部全詳細に示した図、第３図。第５図は本発明の詳細な説明する為の命令列例。第４−Ａ図、第４−Ｂ図、第６−Ａ図、第６−Ｂ図は本
発明の詳細な説明する為のタイムチャートである。記号の説明：１・・・乗算回路、２・・・加減算回路。３・・・論理演算回路、４〜１１・・・ノクイゾライン
用しゾスタ、１２〜１４・・・切替回路、１５・・・制
御回路。効７：７／’）、　２１４〜２１　’Ｉ−ＡＮＤ回路、
　７２１〜２２３・・・結果アドレスレジスタ、２２４
〜２２６・・・格納アドレスレジスタ、２３１，２３２
−・・オイランドアドレスレジスタ、２４１・・−比較
器、１００１゜１００２・・・オペランド供給ノ４ス、
１００３・・・結果格納パス、１２０４〜１２０７・・
・選択回路１２に於ける選択信号、１４０１〜１４０３
・・・選択回路１４に於ける選択信号、１５０１・・・
１５０３・・・各演算系に格納すべき結果があることを
示す制御信号、　１６０１−１６０３・・・結果アドレ
ス／片にランドアドレス一致信号。１７０１．１７０２・・・優先度変更信号。范３図ＯＲ３−１’？１ＸＲ２ ■　Ｒ６−Ｒ４すＲ５ ■　Ｒ９−Ｒ’７　＋　Ｒ８ ■　Ｒ１２−Ｒ７０ｆＲＪＩ ■　尺フ５−Ｒ７３・Ｒ１４ ■Ｒ７ε−Ｒ７６・Ｒ７７ＯＲ；２７−　　Ｒ７９・Ｒ２０范４−Ａ図第４−８図第５図 ■　Ｒ＋０３−　ＲＩＯＩ　Ｘ　Ｒ＋０２■　Ｒ１０６
−Ｒ＋０４ＸＲ＋Ｑ５ ■　Ｒ１０９−Ｒ＋０７　十ＲＩ０８ ■　　ＲＩ＋２　−　ＲＩＩＣ１＋ＲＩＩｌ■　尺１＋
５　←ＲＩ＋３十尺１１４第６−Ｂ図FIG. 1 is a block diagram showing one embodiment of the present invention. FIG. 2 is a diagram showing a part of FIG. 1 in full detail, and FIG. FIG. 5 is an example of an instruction sequence for explaining the present invention in detail. 4-A, 4-B, 6-A, and 6-B are time charts for explaining the present invention in detail. Explanation of symbols: 1...multiplication circuit, 2...addition/subtraction circuit. 3...Logic operation circuit, 4-11...Shizostar for Noquizo line, 12-14...Switching circuit, 15...Control circuit. 7:7/'), 214-21' I-AND circuit,
721-223...Result address register, 224
~226...Storage address register, 231, 232
-...Eiland address register, 241...-Comparator, 1001゜1002...Operand supply node,
1003...Result storage path, 1204-1207...
- Selection signal in selection circuit 12, 1401 to 1403
... Selection signal in selection circuit 14, 1501...
1503...Control signal indicating that there is a result to be stored in each calculation system. 1601-1603...Land address match signal for result address/piece. 1701.1702...Priority change signal. Fan 3 figure OR3-1'? １ Figure 5■ R+03- RIOI X R+02■ R106
-R+04XR+Q5 ■ R109-R+07 10RI08 ■ RI+2 - RIIC1+RIIl■ Shaku 1+
5 ←RI+30 shaku114 Figure 6-B

Claims

[Claims]

1. In an information processing device that has multiple pipelined arithmetic units with different functions, if the minimum number of pipeline stages required for each operation is different, it is possible to simply transfer the results for each arithmetic unit to the subsequent stage. a switching means that adds a necessary number of pipeline registers to perform the above processing so that the number of stages of each calculation system pipeline is equal, and outputs a calculation result from any of the added pipeline registers; If you try to select one of all result outputs and there is a conflict between the arithmetic unit output of the same arithmetic unit and the added pipeline register output, if there is one that requires the result quickly, select it. 1. An information processing device comprising: control means for giving priority to output from a later stage of a pipeline register.