JPH0252301B2

JPH0252301B2 -

Info

Publication number: JPH0252301B2
Application number: JP10523684A
Authority: JP
Inventors: Shoji Nakatani; Nobuo Uchida
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1984-05-24
Filing date: 1984-05-24
Publication date: 1990-11-13
Also published as: JPS60247783A

Description

【発明の詳細な説明】 (a) 発明の技術分野１つ乃至複数個のエレメントを、同時にアクセ
ス可能とするベクトルレジスタと、該ベクトルレ
ジスタ間で演算を行う１つ乃至複数個の演算パイ
プラインと、記憶装置と上記ベクトルレジスタ間
でデータ転送を行う１つ乃至複数個のアクセスパ
イプラインとを備えたベクトル処理装置におい
て、該ベクトルレジスタに対する書き込みと、読
み出し動作の間にレジスタ連鎖がある場合の、該
読み出し動作パイプラインに対して停止制御を行
う制御回路に関する。[Detailed Description of the Invention] (a) Technical Field of the Invention A vector register that allows one or more elements to be accessed simultaneously, and one or more operation pipelines that perform operations between the vector registers. In a vector processing device equipped with a storage device and one or more access pipelines for transferring data between the vector registers, when there is a register chain between write and read operations to the vector registers, The present invention relates to a control circuit that performs stop control on the read operation pipeline.

(b) 技術の背景本発明に関連するベクトル処理装置の構成を第
１図に示す。(b) Technical background The configuration of a vector processing device related to the present invention is shown in FIG.

一般に、ベクトル処理装置では、処理能力を上
げる為に、記憶装置１との転送を少なくする意味
で、記憶装置１と演算パイプライン４との間にベ
クトルレジスタ（VR）３を設け、演算は該ベク
トルレジスタ間で実行し、演算に必要なソースデ
ータ，或いは演算終了後の結果のデータを該ベク
トルレジスタ（VR）３と記憶装置１との間で転
送している。 Generally, in a vector processing device, a vector register (VR) 3 is provided between the storage device 1 and the calculation pipeline 4 in order to reduce the number of transfers to and from the storage device 1 in order to increase the processing capacity. The calculation is executed between vector registers, and source data necessary for the calculation or data resulting from the calculation after completion of the calculation is transferred between the vector register (VR) 3 and the storage device 1.

この記憶装置１と該ベクトルレジスタ（VR）
３との間でのデータ転送の役目をするのがアクセ
スパイプライン（ロード，ストア）２である。 This storage device 1 and the vector register (VR)
The access pipeline (load, store) 2 plays the role of data transfer between the access pipeline 3 and the access pipeline 2.

例えば、Ａ×Ｂ＋ＣＤなるベクトル計算を行
うような場合、上記Ａ，Ｂ，Ｃ，Ｄのオペランド
が記憶装置１のある領域にあるとすると、該Ａ，
Ｂ，Ｃのオペランドがアクセスパイプライン（ロ
ード）２１によりベクトルレジスタ（VR）３
〔例えば、ベクトルレジスタＡ（VRa），Ｂ
（VRb），Ｃ（VRc）〕にロードされる。 For example, when performing vector calculation A×B+CD, if the operands A, B, C, and D are located in a certain area of the storage device 1, then the A,
Operands B and C are transferred to vector register (VR) 3 by access pipeline (load) 21
[For example, vector registers A (VRa), B
(VRb), C(VRc)].

ベクトルレジスタVRa，VRbの乗算を実行後、
作業用レジスタとしてベクトルレジスタＸ
（VRx）に格納した後、ベクトルレジスタVRxと
ベクトルレジスタVRcとの加算を行い、その結
果をベクトルレジスタＤ（VRd）に格納した後、
記憶装置１のオペランドＤの領域にアクセスパイ
プライン（ストア）２２によつてストアされる。
そして、上記ベクトル処理における命令の順序制
御全体を命令処理部（IP）５が司る。 After performing multiplication of vector registers VRa and VRb,
Vector register X as a working register
(VRx), perform addition between vector register VRx and vector register VRc, and store the result in vector register D (VRd).
It is stored in the operand D area of the storage device 1 by the access pipeline (store) 22.
The instruction processing unit (IP) 5 controls the entire order of instructions in the vector processing.

この様子をタイムチヤートで示したものが第２
図である。 The second time chart shows this situation.
It is a diagram.

第１図のアクセスパイプライン２において、例
えばロードパイプライン２１が２個と、ストアパ
イプライン２２が１個備えられているものとす
る。 Assume that the access pipeline 2 in FIG. 1 includes, for example, two load pipelines 21 and one store pipeline 22.

第２図において、ベクトルロードＡ，Ｂの処理
La，Lbにおいて、ベクトルレジスタVRa乃至ベ
クトルレジスタVRbに書き込みが完了した後、
乗算パイプライン処理Ｍでの、Ａ×Ｂの乗算を実
行していたのでは、本ベクトル処理装置の性能を
上げることはできない。 In Figure 2, processing of vector loads A and B
After writing to vector register VRa to vector register VRb is completed in La and Lb,
If A×B multiplication is executed in the multiplication pipeline processing M, the performance of the present vector processing device cannot be improved.

従つて、通常は本図に示すように、アクセスパ
イプライン（ロード）処理La，Lbと演算パイプ
ライン処理Ｍとを並列に動作させることが行われ
ている。 Therefore, as shown in this figure, access pipeline (load) processing La, Lb and arithmetic pipeline processing M are normally operated in parallel.

つまり、ベクトルロードＡ，ベクトルロードＢ
の処理La，Lbにおいて、ベクトルレジスタ
VRa，VRb３への書き込みが開始されたことを
認識すると、乗算パイプライン処理Ｍに対して起
動を行う。 In other words, vector load A, vector load B
In the processing La, Lb, vector register
When it is recognized that writing to VRa and VRb3 has started, the multiplication pipeline processing M is activated.

この場合、ベクトルロードＡ、ベクトルロード
Ｂの処理La，Lbにおける書き込みベクトルレジ
スタVRa，VRb３から、乗算の為の読み出しベ
クトルレジスタVRa，VRb３へ、レジスタ連鎖
がある。 In this case, there is a register chain from write vector registers VRa, VRb3 in processing La, Lb of vector load A and vector load B to read vector registers VRa, VRb3 for multiplication.

又、加算パイプライン処理ADにおけるＸ＋Ｃ
については、乗算パイプライン処理Ｍとの間で
は、ベクトルレジスタVRx３を介して、又ベク
トルロードＣの処理Lcにおける書き込みベクト
ルレジスタVRc３から、加算の為の読み出しベ
クトルレジスタ（VR）ｃ３へレジスタの連鎖が
あることを示している。 Also, X+C in addition pipeline processing AD
, between the multiplication pipeline processing M and the vector register VRx3, and from the write vector register VRc3 in the vector load C processing Lc to the read vector register (VR) c3 for addition. It shows that there is.

このような、アクセスパイプライン（ロード）
のベクトルレジスタへの書き込みから乗算，或い
は加算パイプライン、又はアクセスパイプライン
（ストア）に対してレジスタ連鎖が行われていな
い場合においては、ロードアクセスパイプライン
を意識することなく、それぞれのパイプラインが
独立に動作可能であることは云う迄もないことで
ある。 Access pipeline (load) like this
If register chaining is not performed from the write to the vector register to the multiplication or addition pipeline, or the access pipeline (store), each pipeline can be executed without being aware of the load access pipeline. Needless to say, they can operate independently.

上記レジスタ連鎖がある場合において、ロード
アクセスパイプライン処理La，Lbによるベクト
ルレジスタ（VR）３へのデータ転送は、記憶装
置１に対する他のアクセスに起因するバンク衝
突，或いはバス衝突等により、連続して記憶装置
１よりデータが転送されるとは限らない。 In the case where there is the above register chain, the data transfer to the vector register (VR) 3 by the load access pipeline processing La, Lb may be interrupted due to bank collision or bus collision caused by other accesses to the storage device 1. Data is not necessarily transferred from the storage device 1.

従つて、記憶装置１からデータが転送されなく
なつた場合には、ベクトルレジスタ（VR）３に
対する書き込み（即ち、ロードアクセスによるベ
クトルレジスタへの書き込み）を、演算の為の読
み出しが追い越さないようにする為、該ベクトル
レジスタへの書き込みが無くなつた時点で、該読
み出し動作に関連するパイプライン，或いはスト
アパイプライン等の全体の動作を停止していた。 Therefore, when data is no longer transferred from the storage device 1, it is necessary to prevent the read for calculation from overtaking the write to the vector register (VR) 3 (that is, the write to the vector register by load access). Therefore, when there are no more writes to the vector register, the entire operation of the pipeline related to the read operation, the store pipeline, etc. is stopped.

このことは、ハードウエア量は比較的少なくで
きるが、当該演算パイプライン等の全体を停止す
る為、ベクトル処理装置の処理能力を低下させる
要因となつていた。 Although this can reduce the amount of hardware, it has been a factor in reducing the processing capacity of the vector processing device because the entire calculation pipeline is stopped.

(c) 従来技術と問題点第３図は、従来技術によるベクトル処理装置に
おける各パイプラインの流れを詳細に示した図で
あり、第２図で示したように、La，Lb，Lcはロ
ードアクセスパイプライン処理，Ｍは乗算パイプ
ライン処理，AD，AD′は加算パイプライン処理，
Sdはストアアクセスパイプライン処理，Ｄは除
算パイプライン処理を示している。以下第１図を
参照しながら説明する。(c) Prior art and problems Figure 3 is a diagram showing in detail the flow of each pipeline in a vector processing device according to the prior art.As shown in Figure 2, La, Lb, and Lc are load Access pipeline processing, M is multiplication pipeline processing, AD, AD′ is addition pipeline processing,
Sd indicates store access pipeline processing, and D indicates division pipeline processing. This will be explained below with reference to FIG.

先ず、Ａ×Ｂの乗算を行う為に、ロードアクセ
スパイプライン処理La，Lbにより、ベクトルレ
ジスタVRa，VRb３に記憶装置１よりベクトル
データがロードされる。 First, in order to perform A×B multiplication, vector data is loaded from the storage device 1 into vector registers VRa and VRb3 by load access pipeline processing La and Lb.

，のポイントは、それぞれベクトルレジス
タ（VR）ａ，VRb３にベクトルデータが転送
（書き込み）され始めるタイミングである。 , are the timings at which vector data starts to be transferred (written) to vector registers (VR) a and VRb3, respectively.

Twa，Twbはそれぞれベクトルレジスタ
（VR）ａ，VRb３に書き込んでいる区間を示し
ており、Twc，Twdについても同じである。 Twa and Twb indicate the sections written in vector registers (VR) a and VRb3, respectively, and the same applies to Twc and Twd.

Twx1は乗算パイプライン処理Ｍが動作中で、
そのパイプライン処理に、レジスタ連鎖する命令
（即ち、ベクトル加算命令）による加算パイプラ
イン処理ADが、未だ読み出し中でない区間を示
し、Twx2は乗算パイプライン処理Ｍがベクトル
レジスタVRx３に対して書き込み中であり、且
つ加算パイプライン処理ADがベクトルレジスタ
VRx，VRc３から読み出し中の区間であること
を示している。 Twx1 is running multiplication pipeline processing M,
In the pipeline processing, addition pipeline processing AD using a register chain instruction (i.e. vector addition instruction) indicates an interval in which reading is not yet in progress, and Twx2 indicates an interval in which multiplication pipeline processing M is writing to vector register VRx3. Yes, and addition pipeline processing AD is vector register
This indicates that the section is being read from VRx and VRc3.

該Ａ×Ｂの乗算の開始は，が揃つたことに
より行われるが、若し記憶装置１からのデータロ
ードが、他のアクセスとの競合等により、行われ
なくなつた場合には、当該Ａ×Ｂの乗算パイプラ
イン処理Ｍを停止する必要がある。 The start of the multiplication of A×B is performed when the It is necessary to stop the multiplication pipeline processing M of ×B.

この場合、従来方式においては、ベクトルレジ
スタ（VR）３に対する読み出し動作パイプライ
ン処理全体（即ち、乗算，加算，除算，ストアパ
イプライン処理等）を停止するように制御してい
た為、上記データロードに関係しない演算パイプ
ライン〔例えば、別の加算(1)，別の除算(2)等〕動
作も停止されることになり、本ベクトル処理装置
の処理能力を低下させると云う問題があつた。 In this case, in the conventional method, the entire read operation pipeline processing for the vector register (VR) 3 (i.e., multiplication, addition, division, store pipeline processing, etc.) was controlled to be stopped. Operations in the arithmetic pipeline (for example, another addition (1), another division (2), etc.) that are not related to the above are also stopped, which causes a problem in that the processing capacity of the vector processing device is reduced.

(d) 発明の目的本発明は上記従来の欠点に鑑み、記憶装置１か
らのデータ転送が涸渇されているロードアクセス
パイプライン処理La，Lb，Lc，或いは乗算パイ
プライン処理Ｍに関連する読み出し動作パイプラ
イン処理のみを停止させる方法を提供することに
よつて、ベクトル処理装置の処理速度を向上させ
ることを目的とするものである。(d) Purpose of the Invention In view of the above conventional drawbacks, the present invention provides a read operation related to load access pipeline processing La, Lb, Lc or multiplication pipeline processing M in which data transfer from the storage device 1 is exhausted. The purpose of this invention is to improve the processing speed of a vector processing device by providing a method for stopping only pipeline processing.

(e) 発明の構成そしてこの目的は、本発明によれば、１つ乃至
複数個のエレメントを同時にアクセス可能とする
ベクトルレジスタと、該ベクトルレジスタ間で演
算を行う１つ乃至複数個の演算パイプラインと、
記憶装置と上記ベクトルレジスタ間でデータ転送
を行う１つ乃至複数個のアクセスパイプラインと
を備えたベクトル処理装置であつて、上記アクセ
スパイプライン毎に上記記憶装置からベクトルレ
ジスタへデータを転送する際、該アクセスパイプ
ラインと、ベクトルレジスタの読み出し動作を行
うパイプライン間でベクトルレジスタのレジスタ
連鎖を検出する手段と、ベクトルレジスタの読み
出し動作を行う各パイプライン毎にパイプライン
ストツプ識別信号を設け、上記レジスタ連鎖の元
となるアクセスパイプラインで上記記憶装置から
ベクトルレジスタへデータが転送されなくなつた
ことによつて、上記パイプラインストツプ識別信
号で指定されるところの読み出し動作を行うパイ
プラインのみを停止する方法を提供することによ
つて達成され、比較的に少ないハードウエア量
で、ベクトル処理装置の処理能力の低下を防ぐこ
とができる利点がある。(e) Structure of the Invention According to the present invention, the present invention provides a vector register that allows simultaneous access to one or more elements, and one or more arithmetic pipes that perform operations between the vector registers. line and
A vector processing device comprising one or more access pipelines for transferring data between a storage device and the vector register, wherein data is transferred from the storage device to the vector register for each access pipeline. , a means for detecting a register chain of vector registers between the access pipeline and a pipeline that performs a vector register read operation, and a pipeline stop identification signal provided for each pipeline that performs a vector register read operation; Since data is no longer transferred from the storage device to the vector register in the access pipeline that is the source of the register chain, only the pipeline that performs the read operation specified by the pipeline stop identification signal is stopped. This has been achieved by providing a method to do this, and has the advantage of being able to prevent a reduction in the processing power of a vector processing device with a relatively small amount of hardware.

(f) 発明の実施例先ず、本発明の主旨を要約すると、本発明は、
ロードアクセスパイプライン処理において、記憶
装置からベクトルレジスタに対するロードデータ
が転送されなくなつた時、当該ベクトルレジスタ
に対する読み出し動作パイプライン処理全体（乗
算，加算，除算，ストア等）を停止するのではな
く、当該ロードアクセスパイプライン処理に連鎖
する演算パイプライン，或いはストアパイプライ
ン等の該当するパイプライン処理のみを停止する
ようにして、ハードウエア量を比較的少なくして
処理能力の低下を防ぐようにしたものである。(f) Examples of the invention First, to summarize the gist of the present invention, the present invention includes the following:
In load access pipeline processing, when load data is no longer transferred from the storage device to a vector register, instead of stopping the entire read operation pipeline processing (multiplication, addition, division, store, etc.) for the vector register, By stopping only the corresponding pipeline processing such as the calculation pipeline or store pipeline that is chained to the load access pipeline processing, the amount of hardware is kept relatively small to prevent a decline in processing performance. It is something.

以下本発明の実施例を図面によつて詳述する。
第４図は本発明を適用したベクトル処理装置の主
要部分を模式的に示した図であり、第５図は第４
図で示されているパイプラインストツプ制御部の
一実施例を示した図であり、第６図，第７図は第
４図で示されているレジスタ連鎖検出回路５１の
一実施例を示した図である。 Embodiments of the present invention will be described in detail below with reference to the drawings.
FIG. 4 is a diagram schematically showing the main parts of a vector processing device to which the present invention is applied, and FIG.
6 is a diagram showing an embodiment of the pipeline stop control section shown in the figure, and FIGS. 6 and 7 show an embodiment of the register chain detection circuit 51 shown in FIG. 4. It is a diagram.

第４図において、１，２，２２，３，４，５は
第１図で説明したものと同じものであり、２１
０，２１１はそれぞれロードアクセスパイプライ
ン１LP１，ロードアクセスパイプライン２LP
２，を示し、４１は加算パイプライン（AP），４
２は乗算パイプライン（MP），４３は除算パイ
プライン（DP），そして２１０１，２１１１及び
５１が本発明を実施するのに必要な機能ブロツク
で、２１０１，２１１１はそれぞれパイプライン
ストツプ制御部（PSC１，PSC２），５１はレジ
スタ連鎖検出回路（RCDET）である。 In Figure 4, 1, 2, 22, 3, 4, and 5 are the same as those explained in Figure 1, and 21
0 and 211 are load access pipeline 1LP1 and load access pipeline 2LP, respectively.
2, 41 is an addition pipeline (AP), 4
2 is a multiplication pipeline (MP), 43 is a division pipeline (DP), and 2101, 2111, and 51 are functional blocks necessary to implement the present invention. , PSC2), 51 is a register chain detection circuit (RCDET).

又、第５図において、１０は加算連鎖状態フリ
ツプフロツプSTP Ａ１，１１は乗算連鎖状態フ
リツプフロツプSTP Ｍ１，１２は除算連鎖状態
フリツプフロツプSTP Ｄ１，１３はストア連鎖
状態フリツプフロツプSTP Ｓ１，１４は書き込
み状態フリツプフロツプ，１５０〜１５３，１６
０はアンド回路(A)，１６１は否定回路（Ｎ）であ
る。 5, 10 is an addition chain state flip-flop STP A1, 11 is a multiplication chain state flip-flop STP M1, 12 is a division chain state flip-flop STP D1, 13 is a store chain state flip-flop STP S1, 14 is a write state flip-flop, 150 ~153,16
0 is an AND circuit (A), and 161 is a NOT circuit (N).

そして、第６図，第７図において、Ｃは一致回
路を示し、その出力信号において、例えばＬ―Ａ
はロードアクセスパイプラインでの書き込みベク
トルレジスタが、加算パイプラインでの読み出し
ベクトルレジスタに連鎖していることを示してい
る。同じようにして、例えばＡ―Ｍ―Ｄは加算パ
イプラインでの書き込みベクトルレジスタが、乗
算パイプラインでの読み出しベクトルレジスタに
連鎖し、更に該乗算パイプラインでの書き込みベ
クトルレジスタが除算パイプラインでの読み出し
ベクトルレジスタに連鎖していることを示してお
り、以下同じである。 In FIGS. 6 and 7, C indicates a coincidence circuit, and in its output signal, for example, L-A
indicates that the write vector register in the load access pipeline is chained to the read vector register in the add pipeline. In the same way, for example, in A-MD, the write vector register in the add pipeline is chained to the read vector register in the multiply pipeline, and the write vector register in the multiply pipeline is chained to the read vector register in the divide pipeline. This shows that it is chained to the read vector register, and the same applies hereafter.

第６図における入力信号において、例えば「ロ
ード（LOAD）のライトオペランド」はベクト
ルロード命令で指定する所のベクトルレジスタ
（VR）のレジスタ番号を示しており、同様にし
て「加算（ADD）のリードオペランド」はベク
トル加算命令で指定する所のベクトルレジスタ
（VR）のレジスタ番号を示しており、以下同じ
である。但し、MULTIはベクトル乗算命令の場
合であり、DIVはベクトル除算命令の場合であ
り、STOREはベクトルストア命令の場合である
ことを示している。 In the input signals in Figure 6, for example, the "write operand of load (LOAD)" indicates the register number of the vector register (VR) specified by the vector load instruction, and similarly, the "read operand of addition (ADD)" indicates the register number of the vector register (VR) specified by the vector load instruction. "Operand" indicates the register number of the vector register (VR) specified by the vector addition instruction, and the same applies hereinafter. However, MULTI indicates a vector multiplication instruction, DIV indicates a vector division instruction, and STORE indicates a vector store instruction.

第６図の出力信号については、ADXX，
MUXX，DIXX，STXXで示しているが、XXは
添字の番号を示しており、ADは加算パイプライ
ン，MUは乗算パイプライン，DIは除算パイプラ
イン，STはストアパイプラインを、それぞれ意
味している。 Regarding the output signal in Figure 6, ADXX,
They are indicated by MUXX, DIXX, STXX, where XX indicates the subscript number, AD means addition pipeline, MU means multiplication pipeline, DI means division pipeline, and ST means store pipeline. There is.

そして、６０１〜６４９はアンド回路を示し、
７０１〜７０５はオア回路を示している。 And 601 to 649 indicate AND circuits,
701 to 705 indicate OR circuits.

以下、第１図，第３図を参照しながら、第４図
〜第７図によつて、本発明の実施例を説明する。 Embodiments of the present invention will be described below with reference to FIGS. 4 to 7 while referring to FIGS. 1 and 3.

先ず、第４図に示されているレジスタ連鎖検出
回路（RCDET）５１について、第３図，第４図
を参照しながら、第６図，第７図によつて説明す
る。 First, the register chain detection circuit (RCDET) 51 shown in FIG. 4 will be explained with reference to FIGS. 6 and 7 while referring to FIGS. 3 and 4.

本レジスタ連鎖検出回路（RCDET）５１はロ
ードアクセスパイプライン１（LP１）２１０，
ロードアクセスパイプライン２（LP２）２１１
毎に設けられており、ここでは、例えばロードア
クセスパイプライン１（LP１）２１０について
説明する。 This register chain detection circuit (RCDET) 51 includes a load access pipeline 1 (LP1) 210,
Load access pipeline 2 (LP2) 211
Here, for example, the load access pipeline 1 (LP1) 210 will be explained.

ベクトルロードＡをロードアクセスパイプライ
ン処理Laで実行する際、ベクトルレジスタVRa
の書き込みから、乗算パイプライン処理Ｍにおい
て、乗算（Ａ×Ｂ）を実行する為のベクトルレジ
スタVRaの読み出しが連鎖している。 When executing vector load A in load access pipeline processing La, vector register VRa
In the multiplication pipeline processing M, the reading of the vector register VRa for executing multiplication (A×B) is chained from the writing of .

該レジスタ連鎖の期間〜迄の間において
は、LOADのライトオペランド（VRa）と、乗
算（MULTI）のリードオペランド（VRa）とが
同じである為、一致回路ＣＬ―Ｍの出力が論理
“１”になる。上記〜の期間中は乗算
（MULTI）に連鎖する読み出し動作パイプライ
ンがない為、後述するパイプラインストツプ制御
回路（PSC１）２１０１からの書き込み開始信号
（LWS），又は書き込み中信号（LWACK）によ
つて、MU１のみが論理“１”となり、第７図の
オア回路７０３を通して乗算パイプラインストツ
プ識別信号のみが論理“１”になり、後述する乗
算連鎖状態フリツプフロツプ（STP Ｍ１）１１
をオンとする。 During the register chain period, the write operand (VRa) of LOAD and the read operand (VRa) of multiplication (MULTI) are the same, so the output of the matching circuit CLM is logic "1". "become. During the period ~ above, there is no read operation pipeline chained to the multiplication (MULTI), so the write start signal (LWS) or write in progress signal (LWACK) from the pipeline stop control circuit (PSC1) 2101, which will be described later, is used. As a result, only MU1 becomes logic "1", and only the multiplication pipeline stop identification signal becomes logic "1" through the OR circuit 703 in FIG.
Turn on.

この時、上記ロード書き込み開始信号（LWS）
によつて、書き込み状態フリツプフロツプ１４が
セツトされる。 At this time, the above load write start signal (LWS)
The write state flip-flop 14 is set by .

従つて、上記ロードアクセスパイプライン処理
Laでのロード書き込み信号（LW）が論理“０”
になると、否定回路（Ｎ）１６１の出力信号は論
理“１”となり、アンド回路１６０を介して、ア
ンド回路(A)１５１の出力が論理“１”になる。ア
ンド回路(A)１５１の出力が論理“１”になると、
乗算パイプライン（MP）４２へのクロツクスト
ツプ信号（CLKS MP）をオンにするように動作
する。 Therefore, the above load access pipeline processing
Load write signal (LW) at La is logic “0”
Then, the output signal of the NOT circuit (N) 161 becomes logic "1", and the output of the AND circuit (A) 151 becomes logic "1" via the AND circuit 160. When the output of the AND circuit (A) 151 becomes logic “1”,
It operates to turn on the clock stop signal (CLKS MP) to the multiplication pipeline (MP) 42.

次にベクトルロードＣをロードアクセスパイプ
ライン処理Lcで実行する場合においては、期間
Twc中ではベクトルレジスタVRcに対する書き
込みに、加算パイプライン処理ADの読み出しが
連鎖している為、の時点から該加算パイプライ
ン処理ADがクロツクストツプの対象となる。 Next, when vector load C is executed by load access pipeline processing Lc, the period
In Twc, writing to the vector register VRc is chained with reading from the addition pipeline processing AD, so that the addition pipeline processing AD becomes the target of the clock stop from the point in time.

更には、該加算パイプライン処理ADが書き込
みを始めた時点からベクトルストアＤを処理す
るストアアクセスパイプライン処理Sdがスター
トするので、加算パイプライン処理ADにおける
演算処理（Ｘ＋Ｃ）によるベクトルレジスタ
VRdへの書き込みから、ストアアクセスパイプ
ライン処理Sdにおけるベクトルレジスタ（VR）
ｄの読み出しが連鎖していることになる。 Furthermore, since the store access pipeline process Sd that processes the vector store D starts from the time when the addition pipeline process AD starts writing, the vector register by the calculation process (X+C) in the addition pipeline process AD starts.
Vector register (VR) in store access pipeline processing Sd from writing to VRd
This means that the readings of d are chained.

ここで、ストアアクセスパイプライン処理Sd
のスタートは、ベクトルレジスタ（VR）ｄが加
算パイプライン処理ADに連鎖していることか
ら、加算パイプライン処理ADにおける書き込み
が始まらないとできないことは云う迄もないこと
である。 Here, store access pipeline processing Sd
Since the vector register (VR) d is chained to the addition pipeline processing AD, it goes without saying that this cannot be started until writing in the addition pipeline processing AD starts.

次に、上記ロードアクセスパイプライン処理
Lcにおける、ベクトルロードＣで加算パイプラ
イン処理ADに、更にストアアクセスパイプライ
ン処理Sdにレジスタ連鎖している場合において、
当該レジスタ連鎖検出回路（RCDET）５１の動
作を説明する。 Next, the above load access pipeline processing
In Lc, when vector load C is register-chained to addition pipeline processing AD and further to store access pipeline processing Sd,
The operation of the register chain detection circuit (RCDET) 51 will be explained.

第３図のの時点では、ロードアクセスパイプ
ライン処理LcにおけるベクトルレジスタVRcの
書き込みは、加算パイプライン処理ADの読み出
しのベクトルレジスタVRcにレジスタ連鎖して
いる。 At the time point in FIG. 3, the writing to the vector register VRc in the load access pipeline processing Lc is register-chained to the vector register VRc for reading in the addition pipeline processing AD.

従つて、ロード（LOAD）のライトオペラン
ド（VRc）が加算（ADD）のリードオペランド
（VRc）に連鎖している為、一致回路(C)Ｌ―Ａが
論理“１”となり、結果としてアンド回路６０１
の出力AD１が論理“１”になるので、第７図の
オア回路７０２が論理“１”となり、加算パイプ
ラインストツプ識別信号を論理“１”として、加
算連鎖状態フリツプフロツプ（STPA１）１０を
オンにする。 Therefore, since the write operand (VRc) of the load (LOAD) is chained to the read operand (VRc) of the addition (ADD), the coincidence circuit (C)LA becomes logic "1", and as a result, the AND circuit 601
Since the output AD1 becomes logic "1", the OR circuit 702 in FIG. do.

この状態で、上記ロードアクセスパイプライン
処理Lcでのロード書き込み信号（LW）が論理
“０”になると、〜の期間の説明と同様に、
加算パイプライン（AP）４１に対するクロツク
ストツプ信号（CLKS AP）を論理“１”とする
ように動作する。 In this state, when the load write signal (LW) in the load access pipeline processing Lc becomes logic "0", similar to the explanation for the period of ~,
It operates to set the clock stop signal (CLKS AP) to the addition pipeline (AP) 41 to logic "1".

そして、の時点においては、上記加算パイプ
ライン処理ADにおけるベクトルレジスタ（VR）
ｄがストアパイプライン処理Sdにおけるベクト
ルレジスタ（VR）ｄにレジスタ連鎖している
為、一致回路(C)Ａ―STがオンとなるので、アン
ド回路６０７によつて、上記AD１信号と論理積
がとられ、ST２０信号がオンとなり、第７図の
オア回路７０５の出力が論理“１”となつて、ス
トアアクセスパイプラインストツプ識別信号をオ
ンにし、ストア連鎖状態フリツプフロツプ
（STP Ｓ１）１３をオンにする。 Then, at the point of time, the vector register (VR) in the addition pipeline processing AD
Since d is register-chained to the vector register (VR) d in the store pipeline processing Sd, the coincidence circuit (C) A-ST is turned on, so the AND circuit 607 performs the AND with the AD1 signal. The ST20 signal turns on, and the output of the OR circuit 705 in FIG. do.

この状態で、上記ロードアクセスパイプライン
処理Lcでのロード書き込み信号（LW）が論理
“０”になると、〜の期間の説明と同様に、
ストアアクセスパイプライン（SP）２２に対す
るクロツクストツプ信号（CLKS SP）をもオン
とするように動作する。 In this state, when the load write signal (LW) in the load access pipeline processing Lc becomes logic "0", similar to the explanation for the period of ~,
It also operates to turn on the clock stop signal (CLKS SP) for the store access pipeline (SP) 22.

即ち、レジスタ連鎖が続いている場合には、最
初に起動されている読み出しパイプライン（本例
では、加算パイプライン）が停止すれば、後続す
る読み出しパイプライン（本例では、ストアアク
セスパイプライン）をも停止するように機能す
る。 In other words, if the register chain continues, if the first activated read pipeline (in this example, the addition pipeline) stops, the subsequent read pipeline (in this example, the store access pipeline) will start. It also functions to stop.

即ち、加算パイプライン処理ADとストアアク
セスパイプライン処理Sdとの間にレジスタ連鎖
があつても、最初の読み出し動作パイプライン
（ここでは、加算パイプライン）に停止が起こら
なければ、後続する読み出し動作パイプライン
（ここでは、ストアアクセスパイプライン）は停
止させる必要がない事になる。 That is, even if there is a register chain between the addition pipeline processing AD and the store access pipeline processing Sd, unless a stop occurs in the first read operation pipeline (here, the addition pipeline), the subsequent read operation There is no need to stop the pipeline (here, the store access pipeline).

次に、第４図に示されているロードアクセスパ
イプライン１（LPI）２１０におけるパイプライ
ンストツプ制御部（PSC１）２１０１の動作を第
５図によつて説明する。 Next, the operation of the pipeline stop control section (PSC1) 2101 in the load access pipeline 1 (LPI) 210 shown in FIG. 4 will be explained with reference to FIG.

第５図において、各加算，乗算，除算，ストア
連鎖状態フリツプフロツプ（STP Ａ１）１０，
（STP Ｍ１）１１，（STP Ｄ１）１２，（STP
Ｓ１）１３は、前記レジスタ連鎖検出回路
（RCDET）５１から送出されるロードアクセス
パイプラインLPI２１０，ロードアクセスパイプ
ライン２（LP２）２１１に対する加算，乗算，
除算，ストアパイプラインストツプ識別信号によ
つてセツトされ、当該ロードアクセスパイプライ
ン１（LPI）２１０，ロードアクセスパイプライ
ン２（LP２）２１１でのロード書き込み終了信
号（LWE）によつてリセツトされる。 In FIG. 5, each addition, multiplication, division, and store chain state flip-flop (STP A1) 10,
(STP M1) 11, (STP D1) 12, (STP
S1) 13 performs addition, multiplication, and
It is set by the division and store pipeline stop identification signal, and is reset by the load write end signal (LWE) in the load access pipeline 1 (LPI) 210 and load access pipeline 2 (LP2) 211.

又、書き込み状態フリツプフロツプ１４は、ロ
ード書き込み開始信号（LWS）によつてセツト
され、上記ロード書き込み終了信号（LWE）に
よつてリセツトされる。 Also, the write state flip-flop 14 is set by the load write start signal (LWS) and reset by the load write end signal (LWE).

そして、ある連鎖状態フリツプフロツプ〔例え
ば、乗算連鎖状態フリツプフロツプ（STP Ｍ
１）１１〕がオンになつている時に、ロード書き
込み信号（LW）がオフになると、否定回路
（Ｎ）１６１の出力が論理“１”となつて、アン
ド回路(A)１６０の出力が論理“１”となる。 and some chained-state flip-flops [e.g., multiply chained-state flip-flops (STP M
1) 11] is on and the load write signal (LW) is turned off, the output of the NOT circuit (N) 161 becomes logic "1" and the output of the AND circuit (A) 160 becomes logic "1". It becomes “1”.

その結果としてアンド回路(A)１５１の出力が論
理“１”となり、乗算パイプライン（MP）４２
に対してクロツクストツプ信号（CLKS MP）を
送出し、該乗算パイプライン（MP）４２での動
作を停止するように機能する。 As a result, the output of the AND circuit (A) 151 becomes logic "1", and the multiplication pipeline (MP) 42
It functions to send a clock stop signal (CLKS MP) to the multiplication pipeline (MP) 42 and stop the operation in the multiplication pipeline (MP) 42.

本発明の主眼は、上記レジスタ連鎖検出回路
（RCDET）５１と、本パイプラインストツプ制
御部（PSC１）２１０１，（PSC２）２１１１の
動作に集約される。 The main focus of the present invention is concentrated on the operations of the register chain detection circuit (RCDET) 51 and the pipeline stop control sections (PSC1) 2101 and (PSC2) 2111.

以下、第４図を主体にして、第３図を参照しな
がら、本発明を実施した時の全体的な動作を説明
する。 Hereinafter, the overall operation when implementing the present invention will be explained with reference to FIG. 3, with FIG. 4 as the main subject.

先ず、命令処理部（IP）５でベクトルロード
命令，ベクトル演算命令，ベクトルストア命令等
が処理されると、アクセスパイプライン２に対し
ては、起動信号，オペレーシヨンコード
（OPC），ベクトルデータレングス（VL），メモ
リ先頭アドレス，ベクトルデータデイスタンス等
が、演算パイプラインに対しては、起動信号，オ
ペレーシヨンコード（OPC），ベクトルデータレ
ングス（VL）等が送出される。 First, when a vector load instruction, a vector operation instruction, a vector store instruction, etc. are processed in the instruction processing unit (IP) 5, a start signal, an operation code (OPC), a vector data length, etc. are sent to the access pipeline 2. (VL), memory start address, vector data distance, etc., and a start signal, operation code (OPC), vector data length (VL), etc. are sent to the calculation pipeline.

そして、最初に第５図に示されている書き込み
状態フリツプフロツプ１４が、ロードアクセスパ
イプライン１（LPI）２１０，ロードアクセスパ
イプライン２（LP２）２１１内で生起される、
前記ロード書き込み開始信号（LWS）によつて
セツトされると同時に、該ロード書き込み開始信
号（LWS），及び上記書き込み状態フリツプフロ
ツプ１４のロード書き込み中信号（LWACK）(a)
が、レジスタ連鎖検出回路（RCDET）５１に送
出され、第６図，第７図で示した論理によつて、
ロードアクセスパイプライン１（LPI）２１０，
ロードアクセスパイプライン２（LP２）２１１
毎に、又それぞれの読み出し動作パイプライン
（第４図では、乗算，加算，除算，ストアパイプ
ライン）毎に４本宛の計８本のパイプラインスト
ツプ識別信号(b)が、レジスタ連鎖検出回路
（RCDET）５１から、各ロードアクセスパイプ
ラインに送出される。 First, the write state flip-flop 14 shown in FIG. 5 is generated in the load access pipeline 1 (LPI) 210 and the load access pipeline 2 (LP2) 211.
At the same time as being set by the load write start signal (LWS), the load write start signal (LWS) and the load write in progress signal (LWACK) of the write state flip-flop 14 (a)
is sent to the register chain detection circuit (RCDET) 51, and according to the logic shown in FIGS. 6 and 7,
Load access pipeline 1 (LPI) 210,
Load access pipeline 2 (LP2) 211
A total of 8 pipeline stop identification signals (b), 4 for each read operation pipeline (in Figure 4, multiplication, addition, division, and store pipelines), are sent to the register chain detection circuit. (RCDET) 51, and is sent to each load access pipeline.

即ち、ロードアクセスパイプライン１（LP１）
２１０，ロードアクセスパイプライン２（LP２）
２１１において、第３図で示したタイミング
で、ベクトルレジスタVRa，VRbにロードデー
タが書き込まれたことを認識すると、次のベクト
ル乗算を開始する為に、該ロードアクセスパイプ
ライン１（LPI）２１０，ロードアクセスパイプ
ライン２（LP２）２１１のパイプラインストツ
プ制御部（PSC１）２１０１，（PSC２）２１１
１において、上記乗算パイプラインストツプ識別
信号によつて、乗算連鎖状態フリツプフロツプ
（STP Ｍ１，STP Ｍ２）１１をオンにする。こ
の時点から該乗算パイプラインストツプ識別信号
は、該乗算連鎖状態フリツプフロツプ（STP Ｍ
１，STP Ｍ２）１１により、それぞれベクトル
レジスタ（VR）ａ，VRbに書き込みが終了する
迄（即ち、Twa，或いはTwbの間）保持される。 That is, load access pipeline 1 (LP1)
210, Load access pipeline 2 (LP2)
At step 211, when it is recognized that the load data has been written to the vector registers VRa and VRb at the timing shown in FIG. 3, the load access pipeline 1 (LPI) 210, Pipeline stop control unit (PSC1) 2101, (PSC2) 211 of load access pipeline 2 (LP2) 211
1, the multiplication chain state flip-flops (STP M1, STP M2) 11 are turned on by the multiplication pipeline stop identification signal. From this point on, the multiplier pipeline stop identification signal is applied to the multiplier chain state flip-flop (STP
1, STP M2) 11 until the writing is completed in vector registers (VR) a and VRb (that is, during Twa or Twb).

この時点で、ベクトルレジスタ（VR）ａ，或
いはVRbの書き込みが、記憶装置でのバンク衝
突等により行われなくなつた場合には、上記パイ
プラインストツプ識別信号がオンとなつているパ
イプライン〔本実施例においては、乗算パイプラ
イン（MP）４２〕のみを停止するように動作す
る。 At this point, if writing to vector register (VR) a or VRb is no longer performed due to a bank collision in the storage device, etc., if the pipeline stop identification signal is on, In the embodiment, it operates to stop only the multiplication pipeline (MP) 42].

具体的には、第５図で説明した論理に従つて、
乗算パイプライン（MP）４２に対するクロツク
ストツプ信号（CLKS MP）を送出することによ
り機能する。 Specifically, according to the logic explained in Figure 5,
It functions by sending a clock stop signal (CLKS MP) to the multiplication pipeline (MP) 42.

この場合、上記ロードアクセスパイプライン１
（LP１）２１０，ロードアクセスパイプライン２
（LP２）２１１の何れが止まるかは予測できない
ので、それぞれの論理和条件（つまり、どちらか
が止まれば、対象のパイプラインを停止させる）
によつて、当該乗算パイプライン（MP）４２が
ストツプするように構成されている。 In this case, the above load access pipeline 1
(LP1) 210, Load access pipeline 2
(LP2) Since it is impossible to predict which of 211 will stop, each logical sum condition (in other words, if either stops, the target pipeline will be stopped)
Accordingly, the multiplication pipeline (MP) 42 is configured to be stopped.

上記乗算連鎖状態フリツプフロツプ（STP Ｍ
１，STP Ｍ２）１１は、Twa，Twbの最後の
タイミング〔即ち、ロード書き込み終了信号
（LWE）〕でリセツトされるように構成されてい
るので、それ以降のロード書き込みによる停止に
は影響なく、又Twa，乃至Twbの区間における
別の加算１，或いは別の除算２については、パイ
プラインストツプ識別信号をオンにしないように
構成されている為、該パイプライン処理AD′，Ｄ
〔別の加算(1)，別の除算(2)〕が停止することはな
い。 The above multiplication chained state flip-flop (STP M
1. STP M2) 11 is configured to be reset at the last timing of Twa and Twb [i.e., the load write end signal (LWE)], so there is no effect on subsequent stops due to load writes. Also, for another addition 1 or another division 2 in the interval Twa, Twb, the pipeline stop identification signal is not turned on, so the pipeline processing AD', D
[Another addition (1), another division (2)] never stops.

更に、時点から次のレジスタ連鎖となる加算
パイプライン処理ADが始まる迄の区間Twx１に
ついても、上記別の加算(1)，或いは別の除算(2)の
パイプライン処理は、ベクトルロードＡ La，
ベクトルロードＢ Lbを意識することなく動作
できることは前述の通りである。 Furthermore, regarding the interval Twx1 from the point in time to the start of the addition pipeline processing AD, which is the next register chain, the pipeline processing of the above-mentioned another addition (1) or another division (2) is performed using the vector load A La,
As mentioned above, the operation can be performed without being aware of the vector load B Lb.

次に、時点においては、上記乗算連鎖状態フ
リツプフロツプ（STP Ｍ１，STP Ｍ２）１１
は既に無効状態となつているが、ロードアクセス
パイプライン１（LP１）２１０におけるベクト
ルロードＣを処理するロードアクセスパイプライ
ン処理Lcについては、加算パイプライン処理AD
についてのみレジスタ連鎖している為、該加算パ
イプライン処理ADが始まる時点で、上記加算
パイプラインストツプ識別信号によつて、加算連
鎖状態フリツプフロツプ（STP Ａ１）１０をオ
ンにするように動作する。 Next, at the time point, the multiplication chain state flip-flops (STP M1, STP M2) 11
is already in an invalid state, but for load access pipeline processing Lc that processes vector load C in load access pipeline 1 (LP1) 210, addition pipeline processing AD
Since only registers are chained, the addition chain state flip-flop (STP A1) 10 is turned on by the addition pipeline stop identification signal at the time when the addition pipeline processing AD starts.

該フリツプフロツプ（STP Ａ１）１０がオン
になると、ロードアクセスパイプライン１（LP
１）２１０のベクトルレジスタVRcに対する書
き込み動作が終了する迄（即ち、区間Twcの間）
保持され、当該加算パイプライン（AP）４１は
ロードアクセスパイプライン１（LP１）２１０
からのクロツクストツプの対象となる。 When the flip-flop (STP A1) 10 is turned on, the load access pipeline 1 (LP
1) Until the write operation to the vector register VRc of 210 is completed (that is, during the interval Twc)
The addition pipeline (AP) 41 is the load access pipeline 1 (LP1) 210.
It is subject to clock stops from .

時点においては、レジスタ連鎖検出回路
（RCDET）５１において、第６図，第７図で説
明した論理に従つて、ベクトルストアＤを処理す
るストアアクセスパイプライン処理Sdが加算パ
イプライン処理ADに対して、ベクトルレジスタ
（VR）ｄでレジスタ連鎖をしていることが検出
されているので、ストアアクセスパイプラインス
トツプ識別信号により、ストア連鎖状態フリツプ
フロツプ（STP Ｓ１）１３をオンするように機
能する。 At this point, in the register chain detection circuit (RCDET) 51, the store access pipeline processing Sd that processes the vector store D is processed by the addition pipeline processing AD in accordance with the logic explained in FIGS. 6 and 7. , the vector register (VR) d is detected to be chained, so the store chain state flip-flop (STP S1) 13 is turned on by the store access pipeline stop identification signal.

こうして本発明を実施することにより、タイミ
ングから（即ち、Twx１の間）はロードア
クセスパイプライン１（LPI）２１０，ロードア
クセスパイプライン２（LP２）２１１に対して、
乗算パイプライン（MP）４２のみが、時点か
ら迄の間は、ロードアクセスパイプライン１
（LP１）２１０に対して加算パイプライン（AP）
４１のみがパイプラインストツプの対象となり、
の時点からは、ロードアクセスパイプライン１
（LP１）２１０のベクトルレジスタVRcに対する
書き込み動作が終了する迄、加算パイプライン
（AP）４１とストアパイプライン（SP）２２と
が、パイプラインストツプの対象となることにな
る。 By implementing the present invention in this manner, from the timing (that is, during Twx1), for the load access pipeline 1 (LPI) 210 and the load access pipeline 2 (LP2) 211,
Only the multiplication pipeline (MP) 42 is connected to the load access pipeline 1 from
(LP1) Addition pipeline (AP) for 210
Only 41 are subject to pipeline stop,
From the point onwards, load access pipeline 1
The addition pipeline (AP) 41 and the store pipeline (SP) 22 are subject to pipeline stop until the write operation to the vector register VRc of the (LP1) 210 is completed.

このように、本発明においては、それぞれのロ
ードアクセスパイプライン，或いは演算パイプラ
インに対して、レジスタ連鎖のある読み出し動作
パイプラインのみが、パイプラインストツプの対
象となるように機能し、従来方式のように、レジ
スタ連鎖のないパイプライン迄動作を停止させる
ことがないように制御される所に特徴がある。 In this way, in the present invention, for each load access pipeline or arithmetic pipeline, only the read operation pipeline with register chain functions as the target of pipeline stop, unlike the conventional method. The feature is that it is controlled so that the operation does not stop even in the pipeline where there is no register chain.

(g) 発明の効果以上、詳細に説明したように、本発明のパイプ
ライン制御回路は、ロードアクセスパイプライン
処理において、記憶装置からベクトルレジスタに
対するロードデータがこなくなつた時、当該ベク
トルレジスタに対する読み出し動作パイプライン
処理全体（乗算，加算，除算，ストア等）を停止
するのではなく、当該ロードアクセスパイプライ
ン処理にレジスタ連鎖する演算パイプライン，或
いはストアパイプライン等の該当する読み出しパ
イプライン処理のみを停止するようにしたもので
あるので、比較的少ないハードウエア量で、ベク
トル処理装置の処理能力の低下を防ぐことができ
る効果がある。(g) Effects of the Invention As explained above in detail, the pipeline control circuit of the present invention is capable of controlling the vector register when load data to the vector register is no longer received from the storage device in load access pipeline processing. Rather than stopping the entire read operation pipeline process (multiplication, addition, division, store, etc.), only the corresponding read pipeline process such as the arithmetic pipeline or store pipeline that has register chaining to the load access pipeline process Since it is designed to stop the processing, it is possible to prevent a decrease in the processing capacity of the vector processing device with a relatively small amount of hardware.

[Brief explanation of the drawing]

第１図はベクトル処理装置の構成を示した図，
第２図はベクトル処理装置におけるパイプライン
の流れを概念的に示した図，第３図は従来技術に
よるベクトル処理装置における各パイプラインの
流れを詳細に示した図，第４図は本発明を適用し
たベクトル処理装置の主要部分を模式的に示した
図，第５図は第４図に示されているパイプライン
ストツプ制御部の一実施例を示した図，第６図，
第７図は第４図で示されているレジスタ連鎖検出
回路の一実施例を示した図である。図面において、１は記憶装置，２はアクセスパ
イプライン，２１はロードアクセスパイプライ
ン，２２はストアアクセスパイプライン，３はベ
クトルレジスタ（VR），４は演算パイプライン，
５は命令処理部（IP），La，Lb，Lcはロードア
クセスパイプライン処理，Ｍは乗算パイプライン
処理，AD，AD′は加算パイプライン処理，Sdは
ストアアクセスパイプライン，Ｄは除算パイプラ
イン処理，〜はタイミングポイント，Twa，
Twb，Twc，Twx１，Twx２，Twdはベクト
ル処理区間，２１０，２１１はロードアクセスパ
イプライン１（LPI），ロードアクセスパイプラ
イン２（LP２），４１は加算パイプライン
（AP），４２は乗算パイプライン（MP），４３は
除算パイプライン（DP），２１０１，２１１１は
パイプラインストツプ制御部（PSC１，PSC２），
５１はレジスタ連鎖検出回路（RCDET），１０
は加算連鎖状態フリツプフロツプ（STP Ａ１），
１１は乗算連鎖状態フリツプフロツプ（STP Ｍ
１），１２は除算連鎖状態フリツプフロツプ
（STP Ｄ１），１３はストアアクセスパイプライ
ン連鎖状態フリツプフロツプ（STP Ｓ１），１
４は書き込み状態フリツプフロツプ，１５０〜１
５３，１６０，６０１〜６４９はアンド回路(A)，
１６１は否定回路（Ｎ），７０１〜７０５はオア
回路，CLKS AP，CLKS MP，CLKS DP，
CLKS SPは各読み出し動作を行うパイプライン
に対するクロツクストツプ信号，をそれぞれ示
す。 Figure 1 shows the configuration of the vector processing device.
FIG. 2 is a diagram conceptually showing the flow of pipelines in a vector processing device, FIG. 3 is a diagram showing in detail the flow of each pipeline in a vector processing device according to the prior art, and FIG. FIG. 5 is a diagram schematically showing the main parts of the applied vector processing device, and FIG. 5 is a diagram showing an embodiment of the pipeline stop control section shown in FIG. 4.
FIG. 7 is a diagram showing an embodiment of the register chain detection circuit shown in FIG. 4. In the drawing, 1 is a storage device, 2 is an access pipeline, 21 is a load access pipeline, 22 is a store access pipeline, 3 is a vector register (VR), 4 is an arithmetic pipeline,
5 is an instruction processing unit (IP), La, Lb, and Lc are load access pipeline processing, M is multiplication pipeline processing, AD, AD' are addition pipeline processing, Sd is store access pipeline, and D is division pipeline processing. processing, ~ is the timing point, Twa,
Twb, Twc, Twx1, Twx2, Twd are vector processing sections, 210, 211 are load access pipeline 1 (LPI), load access pipeline 2 (LP2), 41 is addition pipeline (AP), 42 is multiplication pipeline (MP), 43 is a division pipeline (DP), 2101, 2111 are pipeline stop control units (PSC1, PSC2),
51 is a register chain detection circuit (RCDET), 10
is an additive chain state flip-flop (STP A1),
11 is a multiplication chain state flip-flop (STP M
1), 12 is a division chain state flip-flop (STP D1), 13 is a store access pipeline chain state flip-flop (STP S1), 1
4 is a write state flip-flop, 150 to 1
53, 160, 601 to 649 are AND circuits (A),
161 is a negative circuit (N), 701 to 705 are OR circuits, CLKS AP, CLKS MP, CLKS DP,
CLKS SP indicates a clock stop signal for the pipeline that performs each read operation.

Claims

[Scope of Claims] 1. A vector register that allows one or more elements to be accessed simultaneously, one or more arithmetic pipelines that perform operations between the vector registers, and a link between a storage device and the vector register. A vector processing device is provided with one or more access pipelines that perform data transfer, and when data is transferred from the storage device to the vector register for each access pipeline, the access pipeline and Means for detecting a register chain of vector registers between pipelines that perform a vector register read operation, and a pipeline stop identification signal for each pipeline that performs a vector register read operation, are provided to detect the access that is the source of the register chain. Since data is no longer transferred from the storage device to the vector register in the pipeline, only the pipeline that performs the read operation specified by the pipeline stop identification signal is stopped. vector processing device. 2 A holding unit is provided to hold a pipeline stop identification signal provided for each access pipeline, and when a pipeline that performs a read operation from the vector register is activated, the pipeline stop identification signal is A line stop identification signal is set in the holding section and reset when a series of operations are completed in the access pipeline, and only when the holding section is in the holding state, a read operation at the location indicated by the holding section is performed. 2. The vector processing device according to claim 1, wherein a pipeline that performs the processing is targeted for pipeline stop. 3. The pipeline that performs each read is configured to stop the pipeline if any of the pipeline stop signals from all the access pipelines indicates a stop state. A vector processing device according to claim 1. 4. A means for detecting a register chain is provided between the pipelines that perform each read operation, and if the first activated read operation pipeline stops, if there is a register chain between the read operation pipelines, A vector processing device according to any one of claims 1, 2, and 3, characterized in that a function is provided to also stop a subsequent read operation pipeline.