JPS61267174A

JPS61267174A - Vector processor

Info

Publication number: JPS61267174A
Application number: JP10811685A
Authority: JP
Inventors: Koichiro Omoda; 面田　耕一郎; Shigeo Nagashima; 長島　重夫; Yasuhiro Inagami; 稲上　泰弘; Takayuki Nakagawa; 貴之中川; Teruo Tanaka; 輝雄田中
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1985-05-22
Filing date: 1985-05-22
Publication date: 1986-11-26
Anticipated expiration: 2011-12-18
Also published as: JP2564264B2

Abstract

PURPOSE:To execute efficiently the vector processing of a processing including the recursion calculation and to execute the processing at high speed by keeping a waiting condition until the previous processing is completed when the writing/ reading conflict presence/absence information is present, and executing successively the processing. CONSTITUTION:A conflict information preparing circuit 900 writes the vector data based upon the list type for the same area on a main memory 100, investigates, after that, whether the data are read or not, for respective elements of the vector data and prepares the reading/writing conflict presence/absence information. The reading/writing conflict presence/absence information is stored in a vector executing control register 700, based upon the information, an executing control circuit 800 controls a series of the vector processing to which the vector data are related, and when the conflict is present, the circuit controls to process successively as the waiting condition until the previous processing is completed. On the other hand, when the conflict is absent, the waiting condition is not obtained, the control is executed to process continuously.

Description

【発明の詳細な説明】〔発明の利用分野〕本発明はベクトル処理を高速に実行可能なベクトルプロ
セッサに関し、特に、リカージョン演算を効率良く処理
するベクトルプロセッサに関する。DETAILED DESCRIPTION OF THE INVENTION [Field of Application of the Invention] The present invention relates to a vector processor that can perform vector processing at high speed, and particularly to a vector processor that can efficiently process recursion operations.

[Background of the Prophecy]

ベクトル処理の適用範囲を拡大させるために、Ｃｒａｙ
社のＣｒａＹ−１、日立の８−８１０．富士通のＶＰ−
１００／２００等のベクトルプロセッサには各種機能が
組込まれている。この中には、インンイレクトアドレッ
シングに基づくリスト処理をベクトル処理により高速実
行する機能がある。In order to expand the scope of vector processing, Cray
Hitachi's CraY-1, Hitachi's 8-810. Fujitsu VP-
Vector processors such as 100/200 are equipped with various functions. This includes a function that performs list processing based on in-direct addressing at high speed using vector processing.

しかし、ＤＯ１０Ｉ＝１．Ｎ。but, DO10I=1. N.

１０　　Ａ（ＬＸ（Ｉ））＝Ａ（ＬＹ（Ｉ））＋Ｂ（Ｉ
）のような処理に於て、異なるＩ、Ｉ’に対して。10 A(LX(I))=A(LY(I))+B(I
) for different I, I'.

インデックスＬＸ（Ｉ）とＬＹ（Ｉ’）あるいはＬＸ（
Ｉ勺とＬＹ（Ｉ）がもし等しくなる場合には、上記のＤ
Ｏ文を従来のように制御変数工を単純に１〜Ｎｏまで連
続してベクトル処理しようとすると正しい結果が得られ
ない場合がある。なお、このようなケースの演算をリカ
ージョン演算と呼ぶ。このため、ベクトル命令を用いて
連続実行するのではなく、汎用プロセッサで処理してい
るように一般命令を用いて制御変数工の各位について逐
次処理を行なっていたため高い性能が得られないという
問題点があった。Indexes LX(I) and LY(I') or LX(
If I and LY(I) are equal, then the above D
If an attempt is made to simply perform vector processing on the O statement in a continuous manner using control variables from 1 to No in the conventional manner, correct results may not be obtained. Note that an operation in such a case is called a recursion operation. For this reason, the problem is that high performance cannot be obtained because each part of the control variable engineering is processed sequentially using general instructions, as if it were processed by a general-purpose processor, instead of sequentially executing it using vector instructions. was there.

〔発明の目的〕本発明はこのような問題点に鑑み、リカージョン演算を
含む処理を出来るだけ効率良くベクトル処理させて高速
実行させることによシ高い処理性能のベクトルプロセッ
サを実現することを目的としている。[Object of the Invention] In view of these problems, an object of the present invention is to realize a vector processor with high processing performance by performing vector processing including recursion operations as efficiently as possible and executing them at high speed. It is said that

[Summary of the invention]

主記憶上の同一領域に対してリスト形式に基づくベクト
ルデータを書込み、その後に読出すかどうかを、上記ベ
クトルデータの各要素毎に調べ、この書込み／読出しコ
ンフリクト有／無情報に基づき、上記ベクトルデータが
関連する一連のベクトル処理を、コンフリクト有シの場
合はそれ以前の処理が終了するまで待ち状態として逐時
処理させるように制御し、無しの場合は待ち状態とせず
従来通シ連続処理させるよう制御させる。Write vector data based on a list format to the same area on the main memory, and check whether to read it after that for each element of the vector data, and based on this write/read conflict existence/non-existence information, the vector data If there is a conflict, a series of related vector processes are controlled to be processed sequentially in a waiting state until the previous process is completed, and if there is no conflict, they are not placed in a waiting state and are processed continuously as usual. Let it be controlled.

[Embodiments of the invention]

以下、本発明を実施例を参照して詳細に説明するが、初
めに本発明の詳細な説明を行う。Hereinafter, the present invention will be explained in detail with reference to Examples, but first a detailed explanation of the present invention will be given.

第１図は本発明における動作例を説明するためのプログ
ラム例である。FIG. 1 is an example of a program for explaining an example of operation in the present invention.

第２図（ａ）、　（ｂ）は、第１図のプログラム例に於
けるインデックスデータＬＸ（Ｉ）とＬＹ（Ｉ）の数値
例を示したものである。なお、処理すべき要素数Ｎをこ
こでは１５と仮定し、また、各インデックス値はベクト
ルデータＡの要素番号を示すも−のとする。FIGS. 2(a) and 2(b) show numerical examples of index data LX(I) and LY(I) in the program example of FIG. 1. It is assumed here that the number N of elements to be processed is 15, and each index value indicates an element number of vector data A.

ＬＹ（５）は１５であるが、ＬＸ（滲も同じ１５である
ため、ＬＸ（４）を用いたリストベクトルデータＡ　（
１５）への書込み（リストベクトルデータＡは主記憶上
にある）の終了を待って、ＬＹ（５）を用いたリストベ
クトルデータＡ（１５）を主記憶から読出す必要がある
。即ち、制御変数Ｉ＝１〜４までは、主記憶への書込み
／読出しを意識する必要がなく、従来のベクトル処理が
可能である。しかし、Ｉ＝５以降の処理はＩ＝４の処理
の終了を待って開始させるものとし、また、Ｉ＝１３以
降の処理は工＝１２の処理の終了を待って開始させるも
のとする。LY (5) is 15, but LX (Yuri is also 15, so list vector data A (
15) (list vector data A is on the main memory), it is necessary to read out the list vector data A(15) using LY(5) from the main memory. That is, for control variables I=1 to 4, there is no need to be aware of writing/reading to/from the main memory, and conventional vector processing is possible. However, the processing after I=5 is started after waiting for the completion of the processing for I=4, and the processing after I=13 is started after waiting for the completion of the processing for I=12.

なお、Ｉ＝−１１の処理の終了を待って工＝１２以降の
処理を開始させるように制御することも可能であるが、
特に差異はなく、どちらでも良い。Note that it is also possible to perform control to wait for the end of the processing for I=-11 and then start the processing for I=12 and after.
There is no particular difference, and either is fine.

第２図（Ｃ）は第３図の数値例を基に作成した本発明の
特徴である読出し／書込みコンフリクト有／無情報を示
す。この情報は制御変数工の１〜１５に対応して作成さ
れるものであり、１１０の値は待ち状態の要／不要を指
定する。FIG. 2(C) shows read/write conflict presence/absence information, which is a feature of the present invention, created based on the numerical example shown in FIG. This information is created corresponding to control variables 1 to 15, and the value 110 specifies whether a wait state is necessary or not.

即ち、１の値が立っている対応する制御変数工の処理を
待ち状態とし、それ以前の処理の終了を待って再開する
ように制御するものである。That is, the process of the corresponding control variable whose value is set to 1 is placed in a waiting state, and control is performed such that it waits for the previous process to finish before restarting.

第３図は本発明の一実施例を示す図である。FIG. 3 is a diagram showing an embodiment of the present invention.

図中、１００は主記憶、２００はスカラ処理装置％３０
０はベクトル処理装置、４００は命令制御部％５００は
メモリアクセス制御部、６００はベクトルレジスタ部、
７００は前述の読出し／書込みコンフリクト有／無情報
を格納するベクトル実行制御レジスタ（ｖＥＣＲ）、８
００はＶＥＣＲ，。In the figure, 100 is the main memory, 200 is the scalar processing unit %30
0 is a vector processing unit, 400 is an instruction control unit, 500 is a memory access control unit, 600 is a vector register unit,
700 is a vector execution control register (vECR) that stores the above-mentioned read/write conflict presence/absence information;
00 is VECR,.

値に基づいてベクトル処理の実行を制御する実行制御回
路、９００はインデックスデータを入力して読出し／書
込みコンフリクト有／無情報を作成するコンフリクト情
報作成回路、１０００は演算処理部である。An execution control circuit controls the execution of vector processing based on a value, 900 is a conflict information creation circuit that inputs index data and creates read/write conflict presence/absence information, and 1000 is an arithmetic processing unit.

次に、第４図に示した処理２（第１図参照）のベク）ｙ
命令列を基に、前述したコンフリクト有／無情報の作成
方法を記す。なお、ベクトル処理の概略手原は次の通シ
である。まず、ベクトル命令を実行するための前処理を
スカラ命令を用いて行なう。この前処理としては、メモ
リアクセス制御部５００にあるアドレスレジスタ群への
アドレス情報（主記憶１ｏｏをアクセスする際のアドレ
ス情報）のセット等がある。次に、スカラ処理装置２０
０がＢＸＶＰ命令を解読すると、命令制御部４００にベ
クトル命令列の処理開始の起動をかける。命令制御部４
００は、ＥＸＶＰ命令で指定された然りトル命令列をス
カラ処理装置２００を介して取込む。そして、取込んだ
命令を順次解読し、ＶＥＮＤ命令が出現するまでベクト
ル命令の処理を行なう。第５図にコンフリクト有／無情
報を作成するコンフリクト情報作成回路を示す。図中、
６１０／６２０は１６要素のデータを格納できるペク）
　Ａ／　レジスタＶＲ（）／ＶＲＩ、Ｒ，１００〜Ｒ１
１５は１６要素のインデックスデータを格納できるベク
トルインデックスレジスタ（ＶＸＲ）Ｌ１００〜Ｌ１１
５はＲ１００〜ＲＩＪ５に対するラッチ、Ｃ１００ＮＣ
１１５は′ＰＬ１０ｏ〜Ｒ１１５に対する一致回路、Ｇ
１００〜Ｇ１１５はＣ１００〜Ｃ１１５に対応して設け
られるＡＮＤゲート“、Ｒ１１６はレジスタ、Ｇ１１６
はＯＲゲート、７００はベクトル実行制御レジスタ（Ｖ
ＥＣＲ）である。Next, the vector of process 2 (see FIG. 1) shown in FIG.
A method of creating the above-mentioned conflict presence/absence information will be described based on the instruction sequence. The outline of vector processing is as follows. First, preprocessing for executing a vector instruction is performed using a scalar instruction. This preprocessing includes setting address information (address information when accessing the main memory 1oo) to a group of address registers in the memory access control unit 500. Next, the scalar processing device 20
When 0 decodes the BXVP instruction, it activates the instruction control unit 400 to start processing the vector instruction sequence. Command control unit 4
00 takes in the exact instruction string specified by the EXVP instruction via the scalar processing device 200. Then, the fetched instructions are sequentially decoded and vector instructions are processed until a VEND instruction appears. FIG. 5 shows a conflict information creation circuit that creates conflict presence/absence information. In the figure,
610/620 can store 16 elements of data)
A/ Register VR()/VRI, R, 100~R1
15 is a vector index register (VXR) L100 to L11 that can store index data of 16 elements.
5 is a latch for R100 to RIJ5, C100NC
115 is a matching circuit for 'PL10o to R115, G
100 to G115 are AND gates provided corresponding to C100 to C115, R116 is a register, and G116
is an OR gate, 700 is a vector execution control register (V
ECR).

最初に、第４図の命令１で主記憶１００に格納されてい
るインデックスベクトルＬＸを１５要素分順次ＶＢ、Ｏ
，命令２でインデックスベク）ｙＬＹを１５要素分順次
ＶＲＩへ読出す。次に、命令３でＶＴＬＯに読出した１
５要素のインデックスベクトルＬＸをレジスタＲ１００
〜Ｒ１１４に転送する。なお、この場合、主記憶１００
がら直接レジスタＲ１００〜Ｒ１１４へ読出す方法も考
えられる。First, the index vector LX stored in the main memory 100 is sequentially converted into 15 elements VB, O by instruction 1 in FIG.
, instruction 2 sequentially reads 15 elements of index vector)yLY to the VRI. Next, the 1 read to VTLO by instruction 3
The 5-element index vector LX is stored in register R100.
~Transfer to R114. In this case, the main memory 100
However, a method of directly reading the data to the registers R100 to R114 is also conceivable.

最後に、命令４の処理を行なうが、処理に先立ってラッ
チＬ１００〜Ｌ１１５を全て０にクリアするものとする
。第６図は第２図の数値例に従って命令４の処理過程の
１部分を示したものであシ、第６図を参照しながら、第
５図を基に命令４の処理内容を以下に説明する。Finally, instruction 4 is processed, but prior to processing, all latches L100 to L115 are cleared to 0. FIG. 6 shows a part of the processing process of instruction 4 according to the numerical example of FIG. 2. The processing contents of instruction 4 will be explained below based on FIG. 5 while referring to FIG. do.

（１）ステップ１：ベクトルレジスタｖＲ１からベクト
ルインデックスＬＹ（１）を読出してレジスタＲ１１６
ヘセツトする。このとき、ラッチＬ１００〜Ｌ１１５は
全て０のため、ゲートＧ１００ＮＧ１１５は開かずゲー
）Ｇ１１６から０が出力され、ＶＥＣＲ，の先頭要素に
０が格納される。(1) Step 1: Read vector index LY(1) from vector register vR1 and register R116
to set. At this time, since the latches L100 to L115 are all 0, the gate G100NG115 is not opened and 0 is output from the gate G116, and 0 is stored in the first element of VECR.

（２）ステップ２：ベクトルレジスタＶＲＩからＬＹ（
２）を読出してレジスタＲ１１６ヘセツトすると同時に
、ラッチＬ１００を１にセットする。このとき、ゲート
Ｇ１００が開くが、レジスタ８１１６とＲ１００の値が
不一致によシ一致回路Ｃ１００からは０が出力されるた
め、ゲー）Ｇ１１６から０が出力されてＶＥＣＲの２番
目要素に０が格納される。(2) Step 2: From vector register VRI to LY(
2) is read and set in register R116, and at the same time, latch L100 is set to 1. At this time, gate G100 opens, but since the values of register 8116 and R100 do not match, 0 is output from coincidence circuit C100, so 0 is output from gate G116 and 0 is stored in the second element of VECR. be done.

（３）ステップ３：ベクトルレジスタＶＢ、１がらＬＹ
（８）を読出してレジスタＲ１１６ヘセツトすると同時
に、ラッチＬＩＯＩを１にセットする。このとき、更に
ゲー）ＧＩＯＩも開くが、−数回路Ｃ１００とＣ１０１
から０が出力されるためゲー）Ｇｌ　１６から０が出力
されでＶＥＣＲの３番目要素に０が格納される。(3) Step 3: Vector register VB, LY from 1
(8) is read and set in register R116, and at the same time, latch LIOI is set to 1. At this time, the game) GIOI also opens, but - several circuits C100 and C101
Since 0 is output from G16, 0 is output from Gl 16 and 0 is stored in the third element of VECR.

（４）ステップ４：ベクトルレジスタＶ’ＲＩからＬＹ
（４）を読出してレジスタＲ１１６ヘセツトすると同時
に、ラッチＬ１０２を１にセットする。このとき、更に
ゲートＧＬＯ２も開くが、−数回路Ｃ１００〜ＣｌＯ２
から全て０が出力されるため、ＶＥＣＲの４番目要素に
０が格納される。(4) Step 4: Vector register V'RI to LY
(4) is read and set in register R116, and at the same time, latch L102 is set to 1. At this time, the gate GLO2 is also opened, but - several circuits C100 to ClO2
Since all 0s are output from , 0 is stored in the fourth element of VECR.

（５）ステップ５：ベクトルレジスタＶＲ，１からＬＹ
（５）を読出してレジスタ８１１６ヘセツトすると同時
に、ラッチＬ１０３ｋｌにセットしゲートＧ１０３も開
く。このとき、−数回路ＣｌＯ３から１が出力されるた
めＶＥＣＲの５番目要素に１が格納される。(5) Step 5: Vector register VR, 1 to LY
(5) is read and set in register 8116, and at the same time, it is set in latch L103kl and gate G103 is also opened. At this time, since 1 is output from the minus number circuit ClO3, 1 is stored in the fifth element of VECR.

（６）ステップ６：ベクトルレジスタＶＲＩからＬＹ（
６）を読出してレジスタＲ１１６ヘセツト　　　。(6) Step 6: From vector register VRI to LY(
6) and set it in register R116.

すると同時に、ラッチＬ１０４を１にセットし、ゲー）
Ｇ１０４も開く。このとき、−数回路０１００〜Ｃ１０
４から全てＯが出力されるため、ＶＥＣＲの６番目要素
に０が格納される。At the same time, set latch L104 to 1 and
G104 also opens. At this time, - number circuits 0100 to C10
Since all O's are output from 4, 0 is stored in the 6th element of VECR.

以下同様に処理が進み、最終的には、ＶＥＣＲの先頭要
素から１５番目要素までに第２図（Ｃ）に示したような
コンフリクト有／無情報が格納される。The processing proceeds in the same manner, and finally, conflict presence/absence information as shown in FIG. 2(C) is stored from the first element to the 15th element of VECR.

なお、本実施例に於ては、専用のベクトル実行制御レジ
スタを用いているが、現状のベクトルプロセッサが保持
しているマスクレジスタを利用することも可能である。Although this embodiment uses a dedicated vector execution control register, it is also possible to use a mask register held by current vector processors.

また、異なったフンフリクト有／無情報間で（例えば論
理）演算し、この演算結果を用いることも可能であシ、
この場合、マスクレジスタと共用するのが望ましい。It is also possible to perform (for example, logical) calculations between different types of information with and without information, and use the results of this calculation.
In this case, it is desirable to share it with the mask register.

第７図はベクトル実行制御レジスタＶＥＣ３に基づく実
行制御回路である。図中、７００はベクトル実行制御レ
ジスタＶＢＣＢ％Ｌ８１０はベクトル実行制御モードラ
ッチ、Ｒ８００〜Ｂ、８０４はレジスタ、８２０はアド
レスカウントアツプ回路、８２１はグラス１回路、８２
２はサーチ回路、８２３は加算器、８２４は比較器、８
８３０〜５８３３はセレクタ、１８４０．、！８４１゜
１８５０−１８５６は信号線である。FIG. 7 shows an execution control circuit based on the vector execution control register VEC3. In the figure, 700 is a vector execution control register VBCB%L810 is a vector execution control mode latch, R800-B, 804 is a register, 820 is an address count up circuit, 821 is a glass 1 circuit, 82
2 is a search circuit, 823 is an adder, 824 is a comparator, 8
830-5833 are selectors, 1840. ,! 841°1850-1856 are signal lines.

第１図に示したプログラム例に基づくベクトル命令列（
第４図参照）および第２図に示した数値例を用いて第７
図の動作を以下に説明する。なお、処理１〜処理３のベ
クトル命令列の各ベクトル命令の処理要素数は全て１５
（即ち、ループ制御変数工が１〜１５）とし、この値は
スカラ処理装置２００が命令制御部４００にベクトル命
令列の処理開始の起動を発行するとき、信号線ｔ８４０
を介してレジスタＲ８０３にセットされ、同時に、ベク
トル命令列の先頭アドレスが信号線ｔ８４１、セレクタ
８８３０を介してレジスタＲ８００にセットされる。ま
た、このとき同時にラッチＬ８１０が０にクリアされる
。但し、レジスタＦＬ８０３にセットされる値は処理要
素数−１となる。即ち、値１４がレジスタＲ８０３にセ
ットされる。なお、処理１と処理３はリカージョン演算
が含まれていす、従来のベクトル処理が可能であるとす
る。Vector instruction sequence (based on the program example shown in Figure 1)
(see Figure 4) and the numerical example shown in Figure 2.
The operation of the figure will be explained below. Note that the number of processing elements for each vector instruction in the vector instruction sequence of processing 1 to processing 3 is all 15.
(that is, the loop control variable is 1 to 15), and this value is set on the signal line t840 when the scalar processing device 200 issues an activation command to the instruction control unit 400 to start processing the vector instruction sequence.
At the same time, the start address of the vector instruction string is set in register R800 via signal line t841 and selector 8830. Furthermore, at the same time, latch L810 is cleared to 0. However, the value set in register FL803 is the number of processing elements minus 1. That is, the value 14 is set in register R803. It is assumed that processing 1 and processing 3 include recursion calculations and that conventional vector processing is possible.

処理１の各命令と、処理２の命令１〜４の実行に於ては
次のように処理要素数が決定される。セレクタ５８３３
からは開始要素番号が、セレクタ５８３２からは終了要
素番号が各々、信号線ｔ８５５．ｔ８５４を介して命令
制御部４００へ転送されるが、今、ラッチＬ８１０が０
のため、開始要素番号として０、終了要素番号としてレ
ジスタＲ８０３の値である１４が選択される。なお、こ
こでの要素番号０はループ制御変数Ｉ＝１を。In the execution of each instruction of process 1 and instructions 1 to 4 of process 2, the number of processing elements is determined as follows. selector 5833
The starting element number is from the selector 5832, the ending element number is from the selector 5832, and the signal line t855. The command is transferred to the instruction control unit 400 via t854, but now the latch L810 is 0.
Therefore, 0 is selected as the starting element number, and 14, which is the value of register R803, is selected as the ending element number. Note that the element number 0 here is the loop control variable I=1.

要素番号１４は工＝１５に対応するものとする。It is assumed that element number 14 corresponds to engineering=15.

また、同時にベクトル命令列の先頭アドレス（処理ｌの
先頭命令のアドレス）が信号線ｔ８５３を介して命令制
御部４００へ転送される。即ち、この場合には、各命令
とも１５の要素が一度に連続してベクトル処理可能であ
ることを意味している。At the same time, the start address of the vector instruction sequence (the address of the start instruction of process 1) is transferred to the instruction control unit 400 via the signal line t853. That is, in this case, each instruction means that 15 elements can be vector-processed consecutively at once.

処理１の各命令と、処理２の命令１〜４ｔ、命令列の最
初から順次解読して実行するが、命令５を解読すると、
詳細は省略するが、それ以前の命・　　令の終了を待っ
てラッチＬ８１０を１にセットし。Each instruction of process 1 and instructions 1 to 4t of process 2 are decoded and executed sequentially from the beginning of the instruction sequence, but when instruction 5 is decoded,
Although the details are omitted, latch L810 is set to 1 after waiting for the previous instruction to finish.

また、レジスタＲ８０２を−１にセットする。なお、こ
のとき、命令制御１ｉ４００から信号線ｔ８２０を介し
てセット信号がラッチＬ８１０に転送されるものとする
。また、セット信号と同時に命令制御部４００は信号線
ｔ８５２を介して命令５のアドレスが転送される。そし
て、アドレスカウントアツプで次の命令６のアドレスを
指すようにカウントアツプして、セレクタ５８３０を介
してレジスタＲ，８００にセットする。その後、レジス
タＲ８０２にセットされた−１の値はプラス１回路８２
１で１を加えて値Ｏとなシ、レジスタＲ８０１にセット
されると同時に、サーチ回路８２２、加算器８２３へ転
送される。サーチ回路８２２では、プラス１回路８２１
から出力される値をＶＥＣＲ７００の要素番号に対応さ
せ、指定された要素番号の次の要素番号から何個連続し
て０が出現するかをカウントしてその値を加算器８２３
に出力するが、この場合、値３を出力する。そして、加
算器８２３でプラス１回路８２１から出力される値０と
サーチ回路８２２から出力される値３とが加算されて、
値３がレジスタＲ８０４にセットされる。レジスタＲ８
０４とレジスタＲ８０３の値を比較器８２４で比較し、
レジスタＲ８０４の値がレジスタＲ８０３の値よシ大き
いかまたは等しいとき信号線ｔ８５６に１を出力する。Also, register R802 is set to -1. Note that at this time, it is assumed that a set signal is transferred from the instruction control 1i400 to the latch L810 via the signal line t820. Further, at the same time as the set signal, the address of the instruction 5 is transferred to the instruction control unit 400 via the signal line t852. Then, the address is counted up to point to the address of the next instruction 6, and set in the register R, 800 via the selector 5830. After that, the value of -1 set in the register R802 is changed to the plus 1 circuit 802.
1 is added to the value O, which is set in the register R801 and simultaneously transferred to the search circuit 822 and adder 823. In the search circuit 822, the plus 1 circuit 821
The value output from the VECR 700 is made to correspond to the element number of the VECR 700, the number of successive 0's appearing from the element number next to the specified element number is counted, and the value is added to the adder 823.
In this case, the value 3 is output. Then, the adder 823 adds the value 0 output from the plus 1 circuit 821 and the value 3 output from the search circuit 822.
The value 3 is set in register R804. register R8
04 and the value of register R803 are compared by comparator 824,
When the value of register R804 is greater than or equal to the value of register R803, 1 is output to signal line t856.

この場合、比較器８２４から０が出力され、セレクタ５
８３１を介してレジスタＲ８０４の値３がレジスタＲ８
０２にセットされる。そして、ラッチＬ８１０が１にな
っているためセレクタ８８３３からは開始要素番号とし
てレジスタＲ８０１０値０が、セレクタ５８３２からは
終了要素番号としてレジスタＲ８０２の値３が出力され
命令制御部４００へ転送される。また、同時に、レジス
タＲ８００にセットされた命令６のアドレスが信号線ｔ
８５３を介して命令制御部４００へ転送され、その後、
命令制御部４００は命令６から処理を開始する。そして
、詳細は省略するが、命令１０を解読すると次命令の解
読を抑止するものとする。In this case, the comparator 824 outputs 0, and the selector 5
The value 3 of register R804 is transferred to register R8 via 831.
Set to 02. Since the latch L810 is set to 1, the selector 8833 outputs the value 0 of the register R8010 as the starting element number, and the selector 5832 outputs the value 3 of the register R802 as the ending element number, which are transferred to the instruction control unit 400. At the same time, the address of instruction 6 set in register R800 is set to signal line t.
853 to the instruction control unit 400, and then
The instruction control unit 400 starts processing from instruction 6. Although details are omitted, it is assumed that once instruction 10 is decoded, decoding of the next instruction is inhibited.

即ち、命令６〜９は制御変数Ｉ＝１〜４までは連続して
ベクトル処理可能であることを示している。That is, instructions 6 to 9 indicate that vector processing can be performed continuously for control variables I=1 to 4.

命令１０を解読する。と、それ以前の処理の終了を待っ
て次に示す動作を行なう。なお、命令１０の実行は行な
わない。前述したように、レジスタＲ８０２の値３がプ
ラス１回路８２１で１を加えられて値４となシ、レジス
タＲ８０１にセットされると同時に、サーチ回路８２２
、加算器８２３へ転送される。サーチ回路８２２から今
度は値７が加算器８２３へ出力され、プラス１回路８２
１から出力される値４と加算されて値１１がレジスタＲ
８０４にセットされる。そして、比較器８２４から前回
同様に０が出力されるため、レジスタＲ８０４の値１１
がセレクタ５８３１を介してレジスタＲ８０２ヘセツト
される。その後、ラッチＬ８１０が１になっているため
開始要素番号としてレジスタＲ８０１の値４が、終了要
素番号としてレジスタＲ８０２の値１１が命令制御部４
００へ転送され、°同時に、レジスタＲ８００にセット
された命令６のアドレスが再度命令制御部４００へ転送
されて、命令制御部４００は命令６から処理を順次開始
する。そして、命令１０を解読するとそれ以前の処理の
終了を待って次に示す動作を行なう。なお、命令１０の
実行は行なわない。レジスタＲ８０１には値１２がセッ
トされ、レジスタＲ８０４には値が１５（但し、ＶＥＣ
ＲＯ要素番号１５の値は以前の演算結果が入っているが
、ここでは０を仮定している。なお、１であっても特に
問題はない）される。比較器８２４からは今度は１が出
力されるため、レジスタＲ，８０３の値１４がセレクタ
５８３１を介してレジスタＲ，８０２にセットされる。Decoding instruction 10. Then, it waits for the previous processing to complete and then performs the following operations. Note that instruction 10 is not executed. As mentioned above, the value 3 of the register R802 is incremented by 1 in the plus 1 circuit 821 to become the value 4, and is set in the register R801, and at the same time, the search circuit 822
, are transferred to adder 823. The search circuit 822 then outputs the value 7 to the adder 823, and the plus 1 circuit 82
The value 11 is added to the value 4 output from 1 and is sent to register R.
804. Then, since the comparator 824 outputs 0 as before, the value of the register R804 is 11.
is set in register R802 via selector 5831. After that, since the latch L810 is set to 1, the value 4 of the register R801 is set as the starting element number, and the value 11 of the register R802 is set as the ending element number to the instruction control unit 4.
At the same time, the address of instruction 6 set in register R800 is transferred again to instruction control unit 400, and instruction control unit 400 sequentially starts processing from instruction 6. Then, when instruction 10 is decoded, the following operation is performed after waiting for the completion of the previous processing. Note that instruction 10 is not executed. The value 12 is set in register R801, and the value 15 is set in register R804 (however, VEC
The value of RO element number 15 contains the previous calculation result, but it is assumed to be 0 here. Note that there is no particular problem even if it is 1). Since comparator 824 outputs 1 this time, the value 14 in register R, 803 is set in register R, 802 via selector 5831.

その後、開始要素番号１２、終了要素番号１４、命令６
のアドレスが命令制御部４００へ転送され、命令６から
処理を順次開始する。そして、命令１０を解読するが、
今度は信号線ｔ８５６が１になっている（必要な要素数
を処理したことを示す）ため、本命令の処理を次のよう
に行なう。即ち、信号線ｔ８５１を介してリセット信号
をラッチＬ８１０に転送してラッチＬ８１０をＯにリセ
ットする。また、同時に、信号線ｔ８５２を介して命令
１０のアドレスをアドレスカウントアツプ回路８２０に
転送し、アドレスカウントアツプを行って次命令のアド
レス（処理３の先頭命令のアドレス）をセレクタ５８３
０を介してレジスタＲ８００ヘセツトする。その後、処
理３の命令列の実行を、前述した処理１の場合と同様に
行なう。After that, start element number 12, end element number 14, instruction 6
The address is transferred to the instruction control unit 400, and processing starts sequentially from instruction 6. Then, I decode instruction 10,
This time, since the signal line t856 is 1 (indicating that the required number of elements have been processed), the processing of this command is performed as follows. That is, a reset signal is transferred to the latch L810 via the signal line t851, and the latch L810 is reset to O. At the same time, the address of instruction 10 is transferred to the address count up circuit 820 via the signal line t852, the address is counted up, and the address of the next instruction (the address of the first instruction in process 3) is sent to the selector 583.
0 to register R800. Thereafter, the instruction sequence in process 3 is executed in the same manner as in process 1 described above.

〔Effect of the invention〕

以上説明したように、本発明によれば、例えば同−ＤＯ
ｌｖ−プ内に、リカージョン演算がある場合であっても
、リカージョン演算を含む全ＤＯループの処理を一括し
てベクトル処理できるため、従来のようにリカージョン
演算部をスカラ処理させる場合に比べて効率が良く、ま
た、リカージョン演算部を部分的ではあるが、ベクトル
処理することが可能であるため、従来のベクトルプロセ
ッサよシも高速処理が実現可能である。As explained above, according to the present invention, for example, the -DO
Even if there is a recursion operation in the lv-p, the processing of all DO loops including the recursion operation can be vector-processed at once. It is more efficient than conventional vector processors, and because the recursion calculation section can perform vector processing, albeit partially, it is possible to achieve faster processing than conventional vector processors.

[Brief explanation of the drawing]

第１０図は、本発明の詳細な説明するためのプログラム
例を示す図、第２図は第１図のプログラム列に基づく数
値列を示す図、第３図は本発明の一実施例の概略構成を
示す図、第４図は第１図のプログラム列に基づく処理の
流れ図、第５図は本発明の特徴である書込み／ｍ出しコ
ンフリクト有／無情報を作成するためのコンフリクト情
報作成回路を示す図、第６図は第５図の作成回路に基づ
く演算例を示す図、第７図は本発明の特徴である書込み
／読出しコンフリクト有／無情報に基づく実行制御回路
を示す図である。茎　１　　図石　２　図（Ｌ）（）））　　　　　　　（Ｃ）冨　３　図FIG. 10 is a diagram showing an example of a program for explaining the present invention in detail, FIG. 2 is a diagram showing a numerical sequence based on the program sequence of FIG. 1, and FIG. 3 is an outline of an embodiment of the present invention. FIG. 4 is a flowchart of processing based on the program sequence shown in FIG. 1, and FIG. 5 shows a conflict information creation circuit for creating write/m output conflict presence/absence information, which is a feature of the present invention. FIG. 6 is a diagram showing an example of calculation based on the creation circuit of FIG. 5, and FIG. 7 is a diagram showing an execution control circuit based on write/read conflict presence/absence information, which is a feature of the present invention. Stem 1 Figure stone 2 Figure (L) ())) (C) Tomi 3 Figure

Claims

[Claims] 1. A vector processor having a function of reading and writing vector data from a main memory based on a list format, which writes the vector data to the same area on the main memory, and then Detects whether to read or not for each element of the vector data and determines whether there is a write/read conflict.
Conflict generation means for outputting as no information; storage means for storing the write/read conflict presence/absence information output from the generation means; based on the write/read conflict presence/absence information stored in the storage means; , if there is a conflict in a series of vector processing related to the vector data, control is performed so that the processing is performed sequentially in a waiting state until the previous processing is completed;
A vector processor characterized in that it has vector instruction execution control means for controlling the execution of vector instructions so that they are processed continuously without being placed in a waiting state when there is no waiting state. 2. The vector instruction execution control means controls execution of a series of vector processing based on the write/read conflict presence/absence information stored in the storage means, or ignores the conflict presence/absence information. The vector processor according to the first term has a switching means for controlling execution of vector processing.