JPS61251960A

JPS61251960A - Vector instruction processor

Info

Publication number: JPS61251960A
Application number: JP6539785A
Authority: JP
Inventors: Hajime Fukuzawa; 福澤　一
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1985-03-29
Filing date: 1985-03-29
Publication date: 1986-11-08
Anticipated expiration: 2010-03-08
Also published as: JPH0721796B2

Abstract

PURPOSE:To speed up instruction processing by inhibiting reading of a corresponding vector factor while a mask bit read out in such a way that mask bit reading proceeds the reading of the corresponding vector factor is indicating the arithmetic to be restrained. CONSTITUTION:At the time of starting a vector instruction, a mask vector address register 12, an inter-vector factor distance registers 41-43 and vector factor registers 49-51 are initialized. The value of an address addition value register 13 is added to the value of the register 12 so as to read out the mask vector from a memory device 2 and stored in a mask vector register 32. A priority encoder 33 counts arithmetic inhibiting bits from the top of the mask vector until the initial arithmetic indicating bit can be found. A multiplying circuit 45 multiplies an inter-vector factor distance by the number of arithmetic inhibiting bits, an address adder circuit 53 adds a difference address to read the vector factor from a memory device 5. The read-out vector factor is transferred to an arithmetic device, thereby speeding up the instruction processing.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は、ベクトル演算により大規模で高度な技術計算
を高速に処理する演算処理装置に関する。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to an arithmetic processing device that processes large-scale, advanced technical calculations at high speed through vector operations.

特に、マスクビットによりベクトル演算の実行を制御す
るベクトル命令処理装置に関する。In particular, the present invention relates to a vector instruction processing device that controls execution of vector operations using mask bits.

〔overview〕

本発明は、演算しようとするベクトル要素の各々に対応
して設けたマスクビットの論理レベルにしたがって、演
算を実行または抑止する処理装置において、マスクビットの論理レベルが演算抑止であるベクトル要
素については記憶装置からの読み出しを禁止することに
より、記憶装置から演算装置へのデータ転送を効率的に行うよ
うにしたものである。The present invention provides a processing device that executes or inhibits an operation according to the logic level of a mask bit provided corresponding to each vector element to be operated. By prohibiting reading from the storage device, data is efficiently transferred from the storage device to the arithmetic unit.

[Conventional technology]

大規模で高度な技術計算を高速に処理する演算装置、い
わゆるスーパコンピュータでは、スカラ命令に比べて格
段に高速なベクトル命令を用いて演算を行う。このベク
トル命令で処理する部分の比率、すなわちベクトル化率
を可能な限り高めることが高速性を引き出すうえで重要
である。このため、条件文を含むループの演算を行う場
合等には、マスク付きベクトル演算を行う。Computing devices that process large-scale, highly technical calculations at high speed, so-called supercomputers, perform computations using vector instructions, which are much faster than scalar instructions. In order to achieve high speed, it is important to increase the proportion of the portion processed by vector instructions, that is, the vectorization rate, as much as possible. Therefore, when performing loop calculations including conditional statements, masked vector calculations are performed.

マスク付きベクトル演算は、演算しようとする、　ベク
トル要素（「ベクトル・オペランド」ともいう）のそれ
ぞれに対応してマスクビットを設け、このマスクビット
の論理レベルが「０」のときにはその演算を禁止する。In masked vector operations, a mask bit is provided for each vector element (also called a "vector operand") to be operated on, and when the logic level of this mask bit is "0", the operation is prohibited. .

以下では、論理レベルがｒＯＪのマスクピントを演算抑
止ビット、論理レベルが「１」のマスクビットを演算指
示ビットという。このようなマスクビットの列をマスク
ベクトルという。Hereinafter, the mask focus whose logic level is rOJ will be referred to as an operation inhibition bit, and the mask bit whose logic level is "1" will be referred to as an operation instruction bit. Such a sequence of mask bits is called a mask vector.

従来の一般的なベクトル命令処理装置では、マスクビッ
トが有効なベクトル命令を以下のように実行する。すな
わち、まず、マスクビットの値に拘わらずに記憶装置か
らベクトル要素を読み出し、ベクトル要素を一時的にバ
ッファに蓄える。この後に、マスクビットが演算指示ビ
ットであるときに、バッファに蓄えたベクトル要素を演
算装置に供給する。マスクビットが演算抑止ビットであ
るときには、ベクトル要素を演算装置に供給しない。A conventional general vector instruction processing device executes a vector instruction with valid mask bits as follows. That is, first, vector elements are read from the storage device regardless of the value of the mask bit, and the vector elements are temporarily stored in a buffer. Thereafter, when the mask bit is an operation instruction bit, the vector elements stored in the buffer are supplied to the arithmetic unit. When the mask bit is an operation inhibit bit, the vector element is not supplied to the arithmetic unit.

このようなベクトル命令処理装置を、以下では第一の従
来例という。Such a vector instruction processing device will hereinafter be referred to as a first conventional example.

このような第一の従来例では、マスクベクトル中に演算
抑止ビットが多い場合に、演算装置が演算を行っている
時間の割合が低下する。したがって、ベクトル命令が終
了するまでの演算時間に無駄が多くなる欠点があった。In such a first conventional example, when there are many operation inhibiting bits in a mask vector, the proportion of time that an arithmetic unit performs an operation decreases. Therefore, there is a drawback that a lot of calculation time is wasted until the vector instruction is completed.

このような欠点を解決するため、マスクベクトルを一定
ビット幅に区切って処理するベクトル命令処理装置が、
特願昭５６４２２０２４号に開示されている。この°よ
うな装置では、マスクベクトルを一定ビット幅に区切り
、その中のマスクビットが全て演算抑止ビットのときに
は、ベクトル要素の読み出しおよび演算を禁止する。こ
れにより、マスクベクトルが演算抑止ビットを多数含む
場合の処理時間を短縮することができる。マスクベクト
ルが演算抑止ビットを多数含む傾向があることを、スパ
ーク性が強いという。このようなベクトル命令処理装置
を以下では第二の従来例という。In order to solve these drawbacks, a vector instruction processing device that processes mask vectors by dividing them into fixed bit widths has been developed.
It is disclosed in Japanese Patent Application No. 56422024. In such a device, a mask vector is divided into a constant bit width, and when all the mask bits therein are operation inhibit bits, reading and operation of vector elements are prohibited. Thereby, processing time can be shortened when the mask vector includes a large number of operation inhibit bits. A mask vector that tends to include a large number of operation inhibit bits is said to have strong sparking properties. Such a vector instruction processing device will hereinafter be referred to as a second conventional example.

[Problem that the invention seeks to solve]

しかし、上述の第二の従来例では、一定ビツト幅に区切
られたマスクベクトル中に、演算指示ビットが少数個含
まれているときに問題がある。例えば、区切られたマス
クベクトル中に演算指示ビットが一個だけ存在し、他の
マスクビットが全て演算抑止ビットの場合にも、記憶装
置からベクトル要素を読み出し、第一の従来例と同様の
処理を行う。このため、演算時間に無駄な時間が多くな
る欠点がある。However, the above-mentioned second conventional example has a problem when a small number of operation instruction bits are included in a mask vector divided into a constant bit width. For example, even if there is only one operation instruction bit in a partitioned mask vector and all other mask bits are operation inhibit bits, the vector element is read from the storage device and the same processing as in the first conventional example is performed. conduct. Therefore, there is a drawback that a lot of time is wasted in calculation time.

すなわち、マスクベクトルを区切るビット幅を拡大する
と、その区切られたマスクベクトル中の全てのマスクビ
ットが演算抑止ビットである場合には、演算速度を短縮
することができる。しかし、このビット幅を拡大すると
、演算指示ビットを含む可能性も増加する。区切られた
マスクベクトル中に演算指示ビットを含む場合には、第
一の従来例と同じ処理を行うことになる。That is, by increasing the bit width that separates mask vectors, the calculation speed can be reduced if all mask bits in the partitioned mask vectors are operation inhibit bits. However, when this bit width is expanded, the possibility of including an operation instruction bit also increases. When the delimited mask vectors include operation instruction bits, the same processing as in the first conventional example is performed.

逆に、区切るビット幅を縮小すると、区切られたマスク
ベクトル中に演算指示ビットを含まない可能性を増加さ
せることがきる。しかし、ベクトル要素の読み出し回数
の削減、および無効な演算を実行しないことにより処理
の時間を短縮するという本来の目的を達成できない欠点
がある。On the other hand, by reducing the dividing bit width, it is possible to increase the possibility that the divided mask vectors do not include operation instruction bits. However, there is a drawback that the original purpose of reducing the number of times vector elements are read and shortening processing time by not executing invalid operations cannot be achieved.

本発明は、ベクトル要素の読み出し時間および演算装置
の無駄時間を削減することのできるベクトル命令処理装
置を提供することを目的とする。SUMMARY OF THE INVENTION An object of the present invention is to provide a vector instruction processing device that can reduce the time required to read vector elements and the wasted time of an arithmetic unit.

ｃ問題点を解決するための手段〕本発明のベクトル命令処理装置は、ベクトル演算を行う
演算装置に接続され、記憶装置からベクトル要素を読み
出すベクトル要素読み出し手段と、この手段により読み
出されたベクトル要素を演算装置に転送する手段と、記
憶装置からそれぞれのベクトル要素に対応するマスクビ
ットの列で構成されるマスクベクトルを読み出す手段と
を備えたベクトル命令処理装置において、マスクビット
の読み出しが対応するベクトル要素の読み出しに先行す
るように制御する手段と、この手段により読み出された
マスクビットが演算の抑止を指示しているときには、そ
のマスクビットに対応するベクトル要素の読み出しを禁
止する手段とを備えたことを特徴とする。Means for Solving Problem c] The vector instruction processing device of the present invention includes vector element reading means that is connected to an arithmetic unit that performs vector operations and reads vector elements from a storage device, and a vector instruction processing device that reads vector elements from a storage device. In a vector instruction processing device comprising means for transferring elements to an arithmetic unit and means for reading a mask vector composed of a sequence of mask bits corresponding to each vector element from a storage device, reading of the mask bits corresponds to means for controlling the reading of the vector element in advance of reading the vector element; and means for inhibiting the reading of the vector element corresponding to the mask bit when the mask bit read by the means instructs to inhibit an operation. It is characterized by having

[Effect]

本発明のベクトル演算処理装置は、マスクベクトル中の
演算指示ビットを捜し、それに対応するベクトル要素だ
けを記憶装置から読み出す。演算抑止ピントに対応する
ベクトル要素は読み出さないので、記憶装置からのデー
タ転送時間を削減できる。The vector arithmetic processing device of the present invention searches for an arithmetic instruction bit in a mask vector and reads out only the corresponding vector element from the storage device. Since the vector element corresponding to the arithmetic suppression focus is not read out, the data transfer time from the storage device can be reduced.

〔Example〕

第１図は本発明実施例ベクトル命令処理装置の要部ブロ
ック構成図である。FIG. 1 is a block diagram of main parts of a vector instruction processing device according to an embodiment of the present invention.

マスクベクトル読み出し回路１は、セレクタ１１、マス
クベクトル番地レジスタ１２、番地加算値レジスタ１３
および番地加算回路１４を備えている。セレクタ１１の
一方の入力および番地加算値レジスタ１３は、それぞれ
信号線ＬＯ１Ｌ１を介して図示していない制御部に接続
される。セレクタ１１の出力はマスクベクトル番地レジ
スタ１２に接続される。マスクベクトル番地レジスタ１
２および番地加算値レジスフ１３は番地加算回路１４に
接続される。番地加算回路１４の出力は、記憶装置２お
よびセレクタ１１の他方の入力に接続される。The mask vector readout circuit 1 includes a selector 11, a mask vector address register 12, and an address addition value register 13.
and an address addition circuit 14. One input of the selector 11 and the address addition value register 13 are each connected to a control section (not shown) via a signal line LO1L1. The output of selector 11 is connected to mask vector address register 12. Mask vector address register 1
2 and address addition value register 13 are connected to address addition circuit 14 . The output of the address adder circuit 14 is connected to the storage device 2 and the other input of the selector 11.

演算抑止ビット計数回路３は、セレクタ３１、マスクベ
クトルレジスタ３２、プライオリティエンコーダ３３、
シフト回路３４、加算回路３５、加算値レジスタ３６、
比較回路３７および演算抑止ビット数レジスタ３８を備
えている。記憶装置２の出力は、信号線Ｌ２を介して、
セレクタ３１の一方の入力に接続される。セレクタ３１
の出力は、プライオリティエンコーダ３３およびシフト
回路３４に接続される。プライオリティエンコーダ３３
の出力は、シフト回路３４、加算回路３５の一方の入力
および演算抑止ビット数レジスタ３８に接続される。シ
フト回路３４の出力は、信号線Ｌ３を介してセレクタ３
１の他方の入力に接続される。加算回路３５の出力は加
算値レジスタ３６に接続される。加算値レジスタ３６の
出力は、加算回路３５の他方の入力および比較回路３７
の一方の入力に接続される。比較回路３７の他方の入力
には、番地加算値レジスタ１３の出力が接続される。比
較回路３７の出力は、信号線Ｌ４を介して、セレクタ１
１．３１および加算値レジスタ３６に接続される。演算
抑止ビット数レジスタ３８は、信号線Ｌ５を介して、ベ
クトル要素番地生成回路４の乗算回路４５の一方の入力
に接続される。The operation inhibition bit counting circuit 3 includes a selector 31, a mask vector register 32, a priority encoder 33,
shift circuit 34, addition circuit 35, addition value register 36,
A comparator circuit 37 and an operation inhibition bit number register 38 are provided. The output of the storage device 2 is transmitted via the signal line L2.
It is connected to one input of the selector 31. Selector 31
The output of is connected to a priority encoder 33 and a shift circuit 34. Priority encoder 33
The output is connected to one input of the shift circuit 34 and the adder circuit 35 and to the operation inhibition bit number register 38. The output of the shift circuit 34 is sent to the selector 3 via the signal line L3.
1 is connected to the other input of 1. The output of the addition circuit 35 is connected to an addition value register 36. The output of the addition value register 36 is input to the other input of the addition circuit 35 and the comparison circuit 37.
connected to one input of the The output of the address addition value register 13 is connected to the other input of the comparison circuit 37. The output of the comparison circuit 37 is sent to the selector 1 via the signal line L4.
1.31 and the addition value register 36. The operation inhibition bit number register 38 is connected to one input of the multiplication circuit 45 of the vector element address generation circuit 4 via the signal line L5.

ベクトル要素番地生成回路４は、ベクトル要素間距離レ
ジスタ４１．４２．４３、セレクタ４４、乗算回路４５
、セレクタ４６．４７．４８、ベクトル要素番地レジス
タ４９．５０．５１、セレクタ５２および番地加算回路
５３を備えている。ベクトル要素間距離レジスタ４１．
４２．４３は、それぞれ信号線Ｌ６、Ｌｌ、Ｌ８を介し
て、図示していない制御部に接続される。ベクトル要素
間距離レジスタ４１．４２．４３の出力は、セレクタ４
４に接続される。セレクタ４４は、乗算回路４５の他方
の入力に接続される。セレクタ４６．４７．４８のそれ
ぞれの一方の入力は、それぞれ信号線Ｌ９、ＬＩＯ、Ｌ
ｌｌを介して、制御部に接続される。セレクタ４６．４
７．４８の出力は、それぞれベクトル要素番地レジスタ
４９．５０．５１に接続される。ベクトル要素番地レジ
スタ４９．５０．５１はセレクタ５２に接続される。乗
算回路４５およびセレクタ５２は、番地加算回路５３の
二つの入力にそれぞれ接続される。番地加算回路５３の
出力は、信号線Ｌ１２を介して、セレクタ４６．４７．
４８のそれぞれの他方の入力に接続される。さらに、番
地加算回路５３の出力は、記憶装置５に接続される。The vector element address generation circuit 4 includes vector element distance registers 41, 42, 43, a selector 44, and a multiplication circuit 45.
, selectors 46, 47, and 48, vector element address registers 49, 50, and 51, a selector 52, and an address addition circuit 53. Vector element distance register 41.
42 and 43 are connected to a control section (not shown) via signal lines L6, Ll, and L8, respectively. The outputs of the vector element distance registers 41, 42, and 43 are sent to the selector 4.
Connected to 4. Selector 44 is connected to the other input of multiplication circuit 45 . One input of each of the selectors 46, 47, and 48 is connected to the signal lines L9, LIO, and L
It is connected to the control unit via ll. Selector 46.4
The outputs of 7.48 are connected to vector element address registers 49.50.51, respectively. Vector element address registers 49, 50, 51 are connected to selector 52. Multiplier circuit 45 and selector 52 are connected to two inputs of address addition circuit 53, respectively. The output of the address addition circuit 53 is sent to the selectors 46, 47, . . . via the signal line L12.
48. Further, the output of the address addition circuit 53 is connected to the storage device 5.

記憶装置２はマスクベクトルを記憶し、記憶装置５はベ
クトル要素を記憶する。これらの記憶装置２．５は、同
一の記憶装置内に備えられていてもよい。Storage device 2 stores mask vectors, and storage device 5 stores vector elements. These storage devices 2.5 may be provided within the same storage device.

本ベクトル命令処理装置では、ベクトル命令開始時に、
信号線ＬＯ，Ｌｌ、Ｌ６〜Ｉ、１１およびセレクタ１１
．４６．４７．４８を介して、マスクベクトル番地レジ
スタ１２、ベクトル要素間距離レジスタ４１．４２、４
３、ベクトル要素番地レジスタ４９．５０．５１にそれ
ぞれの初期値を設定する。In this vector instruction processing device, at the start of a vector instruction,
Signal lines LO, Ll, L6 to I, 11 and selector 11
．． 46.47.48, mask vector address register 12, vector element distance register 41.42, 4
3. Set respective initial values in vector element address registers 49, 50, and 51.

マスクベクトル番地レジスタ１２には、マスクベクトル
の最初の読み出し番地が、初期値として設定される。番
地加算値レジスタ１３には、初期加算値「０」が設定さ
れる。マスクベクトル番地レジスタ１２の値と番地加算
値レジスタ１３の値を加算して、記憶装置２に記憶され
たマスクベクトルの番地を得る。In the mask vector address register 12, the first reading address of the mask vector is set as an initial value. In the address addition value register 13, an initial addition value "0" is set. The value of the mask vector address register 12 and the value of the address addition value register 13 are added to obtain the address of the mask vector stored in the storage device 2.

この番地を指定することにより、記憶装置２から信号ｍ
Ｌ２にマスクベクトルを読み出す。このマスクベクトル
を、セレクタ３１を通してマスクベクトルレジスタ３２
に格納する。この後には、番地加算値レジスタ１３の保
持する値を、マスクベクトルレジスタ３２のビット長ｎ
＋１で定まる定数に設定する。以後は、番地加算値レジ
スタ１３はこの値を保持する。By specifying this address, the signal m is sent from the storage device 2.
Read the mask vector to L2. This mask vector is passed through the selector 31 to the mask vector register 32.
Store in. After this, the value held in the address addition value register 13 is transferred to the bit length n of the mask vector register 32.
Set to a constant determined by +1. Thereafter, the address addition value register 13 holds this value.

プライオリティエンコーダ３３は、マスクベクトルの先
頭から最初の演算指示ビットが見つかるまで、演算抑止
ビットの個数を計数する。シフト回路３４は、プライオ
リティエンコーダ３３で得られた演算抑止ビットの個数
分だけマスクベクトルを左シフトさせ、ｒｌＪをシフト
インし、先頭ビットを反転させて出力する。この出力を
、セレクタ３１を介してマスクベクトルレジスタ３２に
再び格納する。したがって、プライオリティエンコーダ
３３の出力は、マスクベクトル中の演算指示ビットの間
隔を示す。プライオリティエコーダ３３の出力は、演算
抑止ビット数レジスタ３８に格納される。The priority encoder 33 counts the number of operation inhibit bits from the beginning of the mask vector until the first operation instruction bit is found. The shift circuit 34 shifts the mask vector to the left by the number of operation inhibit bits obtained by the priority encoder 33, shifts in rlJ, inverts the first bit, and outputs it. This output is stored again in the mask vector register 32 via the selector 31. Therefore, the output of the priority encoder 33 indicates the interval between operation instruction bits in the mask vector. The output of the priority echoer 33 is stored in the operation suppression bit number register 38.

加算回路３５は、プライオリティエンコーダ３３により
得られた演算抑止ビットの個数を、加算値レジスタ３６
に格納されている値に加算する。加算値レジスタ３６は
、加算回路３５により得られた加算結果を一時的に保持
する。したがって、加算値レジスタ３６は、マスクベク
トルのビット数を一時的に格納する。比較回路３７は、
番地加算値レジスタ１３に設定された値（マスクベクト
ルレジスタ３２のビット長で定まる定数ｎ＋１）と、加
算値レジスタ３６に保持された値とを比較する。The addition circuit 35 stores the number of operation inhibit bits obtained by the priority encoder 33 in an addition value register 36.
Add to the value stored in . The addition value register 36 temporarily holds the addition result obtained by the addition circuit 35. Therefore, the addition value register 36 temporarily stores the number of bits of the mask vector. The comparison circuit 37 is
The value set in the address addition value register 13 (constant n+1 determined by the bit length of the mask vector register 32) is compared with the value held in the addition value register 36.

ベクトル要素間距離レジスタ４１．４２．４３には、そ
れぞれ信号＆％Ｌ６、Ｌ７、Ｌ８を介して、各ベクトル
要素に対するベクトル要素間距離が設定される。The inter-vector element distance for each vector element is set in the inter-vector element distance registers 41, 42, and 43 via signals &%L6, L7, and L8, respectively.

ベクトル要素番地レジスタ４９．５０．５１には、初期
状態で、それぞれ信号線Ｌ３、ＬＩＯ、Ｌｌｌからベク
トル要素の番地の初期値が入力される。In the initial state, the initial values of vector element addresses are input to the vector element address registers 49, 50, and 51 from the signal lines L3, LIO, and Lll, respectively.

セレクタ４４は、これらのベクトル要素間距離レジスタ
４１．４２．４３の一つを選択し、乗算回路４５に出力
する。乗算回路４５は、セレクタ４４から入力されたベ
クトル要素間距離と、信号線Ｌ５を介して入力された演
算抑止ビット数との乗算を行い、番地差分を生成する。The selector 44 selects one of these vector element distance registers 41, 42, 43 and outputs it to the multiplication circuit 45. The multiplication circuit 45 multiplies the distance between vector elements inputted from the selector 44 by the number of arithmetic suppression bits inputted via the signal line L5, and generates an address difference.

セレクタ５２は、対応するベクトル要素番地レジスタ４
９．５０または５１を選択する。The selector 52 selects the corresponding vector element address register 4
9. Select 50 or 51.

番地加算回路５３は、番地差分を対応するベクトル要素
の番地に加算し、演算を行うべきベクトル要素が記憶さ
れている番地を得る。この番地により、記憶装置５にベ
クトル要素の読み出しを要求し、読み出したベクトル要
素を演算装置に転送する。The address addition circuit 53 adds the address difference to the address of the corresponding vector element to obtain the address where the vector element to be operated is stored. Based on this address, the storage device 5 is requested to read the vector element, and the read vector element is transferred to the arithmetic unit.

これと同時に、信号線Ｌ１２およびセレクタ４６．４７
または４８を介して、対応するベクトル要素番地レジス
タ４９．５０または５１の値を更新する。At the same time, signal line L12 and selector 46.47
or 48, the value of the corresponding vector element address register 49.50 or 51 is updated.

再び演算抑止ビット計数回路３に説明を戻す。Let us return to the explanation of the arithmetic inhibition bit counting circuit 3 again.

加算値レジスタ３６の保持されている値が、番地加算値
レジスタ１３に設定された値に等しいとき、すなわちｆ
ｉ＋ｌのときには、比較回路３７が一致信号を信号線Ｌ
４に出力する。この一致信号は、セレクタ１１．３１を
制御し、加算値レジスタ３６をリセットする。セレクタ
１１は、一致信号が入力されたことにより、番地加算回
路１４の出力した値をマスクベクトル番地レジスタ１２
に設定する。番地加算回路１４は、この値と番地加算値
レジスタ１３の値とを加算し、次のマスクベクトルの番
地を生成する。この番地に基づいて、記憶装置２がマス
クベクトルを信号線Ｌ２に出力する。セレクタ３１は、
一致信号が入力されたことにより、信号線Ｌ２を介して
入力されたマスクベクトルを、マスクベクトルレジスタ
３２に格納する。以下同様の処理を行う。When the value held in the addition value register 36 is equal to the value set in the address addition value register 13, that is, f
When i+l, the comparison circuit 37 sends the match signal to the signal line L.
Output to 4. This match signal controls selector 11.31 and resets addition value register 36. When the match signal is input, the selector 11 transfers the value output from the address adder circuit 14 to the mask vector address register 12.
Set to . The address addition circuit 14 adds this value to the value of the address addition value register 13 to generate the address of the next mask vector. Based on this address, the storage device 2 outputs a mask vector to the signal line L2. The selector 31 is
When the match signal is input, the mask vector input via the signal line L2 is stored in the mask vector register 32. Similar processing is performed below.

第２図は、演算指示ビットに対するベクトル要素の番地
の生成方法を示す説明図である。このマスクベクトルは
、マスクベクトルレジスタ３２に格納される。FIG. 2 is an explanatory diagram showing a method of generating addresses of vector elements for operation instruction bits. This mask vector is stored in the mask vector register 32.

一つのベクトル命令内では、ベクトル要素の番地Ｖ）ｌ
に一定のベクトル要素間距離ＶＩを加算すると、次のベ
クトル要素の番地が得られる。マスクベクトルの最初の
ピントに対応するベクトル要素番地をＶｌ＋、とじて説
明する。Within one vector instruction, the vector element address V)l
By adding a constant vector element distance VI to , the address of the next vector element is obtained. The vector element address corresponding to the first focus of the mask vector will be described as Vl+.

第２図（ａｌに示すように、読み出したｎ＋１のビット
長のマスクベクトルが、３．８、−１ｎ−３ビツト目に
演算指示ビットを含んでいるとする。As shown in FIG. 2 (al), it is assumed that the read mask vector with a bit length of n+1 includes an operation instruction bit at bit 3.8, -1n-3.

この個数をｍ個とする。これ以外のｎ　−ｍ　＋’　１
個のビットは、全て演算抑止ピントである。Let this number be m. Other n −m +' 1
These bits are all operation inhibit focus.

このとき、マスクベクトルの３ビツト目に対応するベク
トル要素の番地ＶＨ３は、ＶＨ２＝ＶＨＯ＋３　ｘＶＩで得られる。ここで、ベクトル要素間距離ｖ■の係数「
３」は、このマスクベクトルの先頭ビットから最初の演
算指示ビットが見つかるまで、演算抑止ビットの個数を
計数することにより求める。At this time, the address VH3 of the vector element corresponding to the third bit of the mask vector is obtained as VH2=VHO+3 xVI. Here, the coefficient of the vector element distance v■ is ``
3'' is determined by counting the number of operation inhibit bits from the first bit of this mask vector until the first operation instruction bit is found.

次に、第２図（ｂ）に示すように、マスクベクトルを上
記の係数に相当する３ビツト分だけ左シフトを行い、こ
れに伴い３ビツト目の演算指示ピントを反転させ、次に
用いるマスクベクトルとする。Next, as shown in FIG. 2(b), the mask vector is shifted to the left by 3 bits corresponding to the above coefficient, and accordingly, the calculation instruction focus of the 3rd bit is inverted, and the next mask to be used is Let it be a vector.

このとき、最後の３ビツトには、論理レベル「１」をシ
フトインする。シフトを行ったマスクベクトルに対して
、再び先頭から演算指示ビットが見つかるまで、演算抑
止ビットの個数を計数することにより、係数「５」が得
られる。これにより、最初のマスクベクトルの８ビツト
目に対応するベクトル要素の番地Ｖｌ（、は、 νＨａ　＝ＶＨ３＋５　ｘＶＩにより得られる。At this time, logic level "1" is shifted into the last three bits. A coefficient "5" is obtained by counting the number of operation inhibit bits for the shifted mask vector from the beginning until the operation instruction bit is found again. As a result, the address Vl of the vector element corresponding to the 8th bit of the first mask vector is obtained by νHa =VH3+5 xVI.

以上の操作をｍ回繰り返し、第２図（Ｃ）に示されるよ
うなマスクベクトルが得られる。このマスクベクトルは
、ｎ−３ビツト目の演算指示ビットが、反転して先頭の
ビット位置にシフトしたものである。このマスクベクト
ルに上述の操作を繰り返すことにより、ｖＨｏ’　＝ＶＨｎ−３＋４　ｘＶＩというベクトル要素の番地ＶＨ，’が得られる。この番
地νＨ０′　は、新たに連続する次のマスクベクトルの
先頭ビットに対応するベクトル要素の番地である。By repeating the above operation m times, a mask vector as shown in FIG. 2(C) is obtained. This mask vector is obtained by inverting the (n-3)th operation instruction bit and shifting it to the first bit position. By repeating the above operations on this mask vector, the address VH,' of the vector element vHo' = VHn-3+4 xVI is obtained. This address νH0' is the address of a vector element corresponding to the first bit of the next consecutive new mask vector.

以上の操作を、それぞれの操作で得られる係数を毎回加
算し、得られた加算値がマスクベクトルのビット長ｎ＋
１と一致するまで繰り返す。The above operations are performed by adding the coefficients obtained from each operation each time, and the resulting added value is the bit length of the mask vector n +
Repeat until it matches 1.

〔Effect of the invention〕

以上説明したように、本発明のベクトル命令処理装置は
、記憶装置から全てのベクトル要素を読み出すのではな
く、マスクビットの値により演算実行が指示されるベク
トル要素だけを読み出すことができる。したがって、演
算する必要のない無駄なベクトル要素を読み出す必要が
なく、記憶装置の読み出しおよび転送に必要な時間を削
減できる。本発明は、ベクトル演算の速度を高速にする
効果がある。As described above, the vector instruction processing device of the present invention does not read all vector elements from the storage device, but can read only the vector elements whose operation is instructed by the value of the mask bit. Therefore, there is no need to read useless vector elements that do not need to be calculated, and the time required for reading and transferring data from the storage device can be reduced. The present invention has the effect of increasing the speed of vector calculations.

[Brief explanation of drawings]

第１図は本発明実施例ベクトル命令処理装置の要部ブロ
ック構成図。第２図は演算指示ビットに対するベクトル要素の番地の
生成方法を示す説明図。１・・・マスクベクトル読み出し回路、２・・・記憶装
置、３・・・演算抑止ビット計数回路、４・・・ベクト
ル要素番地生成回路、５・・・記憶装置、１工・・・セ
レクタ、１２・・・マスクベクトル番地レジスタ、１３
・・・番地加算値レジスタ、１４・・・番地加算回路、
３１川セレクタ、３２・・・マスクベクトルレジスタ、
３３・・・プライオリティエンコーダ、３４・・・シフ
ト回路、３５川加算回路、３６・・・加算値レジスタ、
３７川比較回路、３８・・・演算抑止ピント数レジスタ
、４１．４２．４３・・・ベクトル要素間距離レジスタ
、４４・・・セレクタ、４５川乗算回路、４６．４７．
４８・・・セレクタ、４９．５０．５１・・・ベクトル
要素番地レジスタ、５２・・・セレクタ、５３・−・番
地加算回路。特許出願人日本電気株式会社　。代理人　弁理士　井　出　直　孝　Ｑ手続補正書昭和６１年５月２９日１、事件の表示昭和６０年特許願第６５３９７号２、発明の名称ベクトル命令処理装置住　所　　東京都港区芝五丁目３３番１号名　称　　（
４２３）日本電気株式会社代表者関本忠弘４、代理人　〒１７７意０３−９２８−５６７３住　所
　　東京都練馬区関町北二丁目２６番１８号、７、−１
氏名　弁理士（７Ｂ２３）井出直炙５５゜補正命令の日付　　　　　　　　　　　　　゛゛′
−゛自発補正６、　補正により増加する発明の数　　な　し７、補正
の対象　　　　　　　　　　　　　　　１３うしゝ゛明
細書の「発明の詳細な説明」の欄Ｗ”’≠）８、補正の
内容ｆｌ）　　明細書第８頁第１６行目「セレクタ３１の出力は、」を［セレクタ３１の出力は、マスクベクトルレジスタ３２
に接続される。マスクベクトルレジスタ３２の出力は、
」と補正する。（２）明細書第１０頁第５行目ないし第６行目［ベクト
ル要素番地レジスタ４９．５０．５１は」を「ベクトル
要素番地レジスタ４９．５０．５工の出力は、」と補正
する。（３）明細書第１０頁第７行目「セレクタ５２は、」を「セレクタ５２の出力は、」と補正する。（４）　　明細書第１０頁第１９行目ないし第２０行目
「マスクベクトル番地レジスタ１２、」を「マスクベク
トル番地レジスタ１２、番地加算値レジスタ１３、」と
補正する。（５）明細書第１３頁第４行目「信号線し３」を［信号線Ｌ９Ｊと補正する。（６）明細書第１３頁第５行目「初期値」を「初期値、すなわちベクトル要素の先頭番地」と補正す
る。（６）明細書第１４頁第６行目「制御し、加算値レジスタ３６を」を「制御し、この一方で加算値レジスタ３６を」と補正す
る。FIG. 1 is a block diagram of main parts of a vector instruction processing device according to an embodiment of the present invention. FIG. 2 is an explanatory diagram showing a method of generating addresses of vector elements for operation instruction bits. DESCRIPTION OF SYMBOLS 1...Mask vector readout circuit, 2...Storage device, 3...Arithmetic suppression bit counting circuit, 4...Vector element address generation circuit, 5...Storage device, 1...Selector, 12...Mask vector address register, 13
...Address addition value register, 14...Address addition circuit,
31 river selector, 32...mask vector register,
33... Priority encoder, 34... Shift circuit, 35 River addition circuit, 36... Addition value register,
37 River comparison circuit, 38... Calculation suppression focus number register, 41.42.43... Vector element distance register, 44... Selector, 45 River multiplication circuit, 46.47.
48... Selector, 49.50.51... Vector element address register, 52... Selector, 53... Address addition circuit. Patent applicant NEC Corporation. Agent Patent Attorney Naotaka Ide Q Procedural Amendment May 29, 1985 1. Indication of Case 1985 Patent Application No. 65397 2. Name of Invention Vector Instruction Processing Device Address 5-chome, Shiba, Minato-ku, Tokyo 33 No. 1 Name (
423) NEC Corporation Representative Tadahiro Sekimoto 4, Agent 177-03-928-5673 Address 2-26-18-7-1, Sekimachi Kita, Nerima-ku, Tokyo
Name Patent attorney (7B23) Naoaki Ide 5 5゜Date of amendment order ゛゛'
- ``Voluntary amendment 6, Number of inventions increased by amendment None 7, Subject of amendment 13 B'' Column ``Detailed description of the invention'' of the specification W"'≠) 8, Contents of amendment fl) Description No. Page 8, line 16, “The output of the selector 31 is” [The output of the selector 31 is the mask vector register 32
connected to. The output of the mask vector register 32 is
” he corrected. (2) In the fifth and sixth lines of page 10 of the specification, [the vector element address register 49.50.51 is] is corrected to "the output of the vector element address register 49.50.5 is". (3) On page 10, line 7 of the specification, "the selector 52 is" is corrected to "the output of the selector 52 is". (4) Correct "Mask vector address register 12," on the 19th to 20th lines of page 10 of the specification to "Mask vector address register 12, address addition value register 13,". (5) Correct "signal line 3" on page 13, line 4 of the specification to [signal line L9J. (6) "Initial value" on page 13, line 5 of the specification is corrected to "initial value, that is, the first address of the vector element." (6) In the 6th line of page 14 of the specification, "control and add value register 36" is corrected to "control and at the same time add value register 36".

Claims

[Claims]

(1) Vector element reading means connected to an arithmetic device that performs vector operations and reading vector elements from the storage device; means for transferring the vector elements read by this means to the arithmetic device; and vector elements from the storage device. A vector instruction processing device comprising: means for reading a mask vector consisting of a string of mask bits corresponding to an element; 1. A vector instruction processing device comprising: means for inhibiting reading of a vector element corresponding to a mask bit when a mask bit read by the mask bit instructs to inhibit an operation.