JPH0448370A

JPH0448370A - Interruption processing method for executing arithmetic operation in cascade fashion

Info

Publication number: JPH0448370A
Application number: JP15753490A
Authority: JP
Inventors: Tetsuro Saito; 斉藤　鉄郎; Yukisuke Sakota; 迫田　行介; Eiichiro Maeda; 栄一郎前田; Hiroshi Ota; 寛太田; Akira Muramatsu; 晃村松
Original assignee: Hitachi Ltd; Hitachi Nuclear Engineering Co Ltd; Hitachi Microcomputer Engineering Ltd
Current assignee: Hitachi Microcomputer System Ltd; Hitachi Ltd; Hitachi Nuclear Engineering Co Ltd
Priority date: 1990-06-18
Filing date: 1990-06-18
Publication date: 1992-02-18

Abstract

PURPOSE:To eliminate the waste time of a reception processor by performing an arithmetic operation in cascade fashion by executing prescribed first processing when a checked result shows a specific value ad executing prescribed second processing when it shows a value other than that. CONSTITUTION:Each of element processors (PE) 1-4 executes a cascade calculation execution instruction by executing processing 1, and takes synchronization for the whole by executing processing 2. The PE1 and PE3 perform transmission processing to the PE2 and PE4 immediately after executing the cascade calculation execution instruction, and perform the processing 2. The PE2 executes the processing 2 after executing the cascade calculation execution instruction, and executes the transmission processing to the PE4 on the extension of interruption processing when receiving an addition request from the PE1 during that time, and returns to the execution of the processing 2. thereby, since a transmission operation can be executed as the extension of a reception operation at a time when it is requested, the waste and idle time of the processor can be eliminated.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は、複数の処理装置が相互に結合された並列計算
機に関し、特に並列計算機で総和計算等のカスケード型
演算を実行するときの制御に適する。[Detailed Description of the Invention] [Industrial Application Field] The present invention relates to a parallel computer in which a plurality of processing units are interconnected, and particularly to control when a parallel computer executes a cascade type operation such as summation calculation. Suitable.

[Conventional technology]

以下で局所メモリというのは処理装置の内部に含まれる
記憶装置である。各処理装置は自分が保有する局所メモ
リは直接アクセス可能であるが、他の処理装置が保有す
る局所メモリは直接にはアクセスできない、ある処理装
置が他処理装置が保有する局所メモリをアクセスすると
きはアクセスを要求するパケットを該他処理装置に送り
、該他処理装置にデータの書き込みやデータの読み取り
と該データの送信を実行してもらう。このような局所メ
モリを持つ計算機として「並列計算機構成論Ｊ　（昭和
６１年１り月昭晃堂版、富田眞治著）１０２頁から１０
４頁に述べられている並列計算機システムｒＩＬＬＩＡ
ＣＩＶＪがある。In the following, local memory refers to a storage device contained within a processing device. Each processing unit can directly access its own local memory, but cannot directly access the local memory owned by other processing units.When a processing unit accesses local memory owned by another processing unit, A packet requesting access is sent to the other processing device, and the other processing device writes data, reads data, and sends the data. As a computer with such local memory, "Parallel Computer Construction Theory J (January 1986, Shokodo edition, written by Shinji Tomita), pages 102 to 10
Parallel computer system rILLIA described on page 4
There is CIVJ.

処理装置と処理装置を互いに結合する相互結合網にはパ
ス結合網、リング結合網、格子結合網。Mutual coupling networks that couple processing devices to each other include path coupling networks, ring coupling networks, and lattice coupling networks.

２分木結合網、ハイパーキューブ結合網、クロスバ−結
合網、あるいは多段スイッチを使用した結合網等がある
（例えば、前記ｒ並列計算機構成論」６９頁から９８頁
な参照）。There are binary tree connection networks, hypercube connection networks, crossbar connection networks, connection networks using multi-stage switches, etc. (see, for example, the above-mentioned ``Parallel Computer Configuration Theory'', pages 69 to 98).

カスケード型に演算を行うとはこのような局所メモリを
持つ複数の処理装置を相互に結合した並列計算機システ
ムで総和型計算（総和計算、内積計算、最大値計算、最
小値計算等）を実行する方法である。総和計算について
述べれば、これは特開平１−１４７７６７に述べている
ように０２個ずつのデータの部分和を一取る。■２個ず
つ部分和に対して再び部分和を取る。以下■を繰り返す
と総和が得られるというものである。Performing cascade-type operations means performing summation-type calculations (summation calculations, inner product calculations, maximum value calculations, minimum value calculations, etc.) on a parallel computer system in which multiple processing units with local memory are interconnected. It's a method. Regarding the total sum calculation, as described in Japanese Patent Application Laid-Open No. 1-147767, one partial sum of each 02 pieces of data is taken. ■Re-calculate the partial sums for each two pieces. By repeating the following (■), the summation can be obtained.

処理依頼機能を持つ並列計算機は特開平１−１３１９４
９　ｒ処理依頼機能を持つ並列計算機」に述べられてい
る。これはそれぞれデータ保持用のメモリを有する複数
の要素プロセッサがネットワークにより相互に交信可能
に結合されている並列計算機において、第１の要素プロ
セッサに接続された第１のメモリに保持された第１のデ
ータと、第２の要素プロセッサに接続された第２のメモ
リ内の第２のデータを指定するアドレスと、第１．第２
の要素プロセッサに接続された第２のデータについて施
すべき処理の種別とを含む処理要求を第１の要素プロセ
ッサから第２の要素プロセッサに送出する。そのアドレ
スに応答して第２の要素プロセッサに設けられた該第２
のメモリから第２のデータを読み出し、第１のデータと
第２のデータに対して処理を施し、・その処理の結果デ
ータを第２のメモリに格納する。A parallel computer with a processing request function is published in Japanese Patent Application Laid-Open No. 1-13194.
9. Parallel computer with processing request function”. In a parallel computer in which a plurality of element processors, each having a memory for data retention, are connected through a network so that they can communicate with each other, a first element processor held in a first memory connected to the first element processor is used. data, an address specifying the second data in a second memory connected to the second element processor; Second
A processing request including the type of processing to be performed on the second data connected to the first element processor is sent from the first element processor to the second element processor. the second element processor provided in the second element processor in response to the address;
Read second data from the memory, perform processing on the first data and second data, and store the resultant data of the processing in the second memory.

[Problem to be solved by the invention]

前記従来技術のうち特開平１−１４７７６７に述べる方
法では２つの処理装置の間で情報を送受信するとき同期
を取るため受信側の処理装置に時間の浪費が生じる。そ
れは受信側の処理装置は受信命令を実行するとデータを
受信するまで待ち続けるからである。Among the conventional techniques, the method described in Japanese Patent Application Laid-Open No. 1-147767 requires synchronization when transmitting and receiving information between two processing devices, which wastes time on the receiving processing device. This is because once the receiving-side processing device executes the receiving command, it continues to wait until it receives the data.

これに対して特開平１−１３１９４９に述べる処理依頼
機能を持つ並列計算機では送信する側の処理装置でデー
タとそれを加算するデータのある（受信側の処理装置の
）局所メモリアドレスと施すべき処理の種別（加算）と
を含む処理要求を受信側処理装置に送る。この処理要求
を受信した受信側処理装置は実行中の処理を中断して受
信した処理要求中のデータと受信した処理要求中の局所
メモリアドレスのデータとの間で受信した処理要求中で
指定された施すべき処理（加算）を実行して結果を前記
受信した処理要求中の局所メモリアドレスに格納する。On the other hand, in a parallel computer with a processing request function described in JP-A-1-131949, the processing unit on the sending side selects the local memory address (of the processing unit on the receiving side) where the data is located, the data to be added, and the processing to be performed. A processing request including the type (addition) is sent to the receiving processing device. Upon receiving this processing request, the receiving side processing device interrupts the process being executed and compares the data specified in the received processing request between the data in the received processing request and the data at the local memory address in the received processing request. The processing to be performed (addition) is executed and the result is stored in the local memory address in the received processing request.

従がって処理依頼機能を持つ並列計算機では受信命令を
実行することによる受信側の処理装置の時間の浪費は生
じない。Therefore, in a parallel computer having a processing request function, the time of the processing device on the receiving side is not wasted by executing the receiving command.

しかし、処理依頼機能を持つ並列計算機でも総和計算（
カスケード型計算）では別の問題が生じる。それは処理
要求を受信した処理装置は加算した結果を今度は送信す
る必要があることである（ただし、最後の結果を得る処
理装置は送信しないので除く）。問題は送信するには、
処理依頼を受信し加算が終了していることを確認しなけ
ればならない点にある。このために確認で待ちが発生す
る。However, even on a parallel computer with a processing request function, sum calculation (
Another problem arises with cascade calculations. That is, the processing device that received the processing request needs to send the added result this time (with the exception of the processing device that obtains the last result because it does not send it). To submit your problem,
The point is that it must be confirmed that the processing request has been received and the addition has been completed. This causes a wait for confirmation.

確認するとき待ちが発生しないようにするには送信側の
処理装置での前記処理要求の送出が受信側処理装置での
前記確認の前に時間的に離れて実行されれば良い、しか
しながら第１１図に示すように、カスケード型計算では
確認は送信するために前動作であり、確認時点を遅らせ
ることは送信を遅らせることになり、それは次の受信側
処理装置の確認を遅らせることになる。In order to avoid waiting during confirmation, it is sufficient that the sending of the processing request at the sending side processing device is performed at a time interval before the confirmation at the receiving side processing device. As shown in the figure, in cascaded computation, confirmation is a pre-operation to sending, and delaying the confirmation point will delay sending, which will delay the confirmation of the next receiving processing device.

第１１図で、各処理装置（以下ではＰＥ：要素プロセッ
サと略記する）は処理１を実行し、総和計算を行ったの
ち処理２を実行する。ＰＥ１とＰＥ３は処理１の実行の
後他のＰＥからの加算依頼を受信して処理したかを確認
する。ＰＥＩとＰＥ３には処理を依頼するＰＥは存在し
ないのでそれぞれ直ちにＰＥ２とＰＥ４に送信処理を実
行し、それから処理２を実行する。これに対してＰＥ２
ではＰＥＩからの加算依頼の受信と処理を確認してから
ＰＥ４に送信処理を実行し、それから処理２を実行する
。ＰＥ４ではＰＥ３とＰＥ２からの加算依頼の受信と処
理を確認してから処理２を実行する。最後の全体同期は
すべてのＰＥに同一歩調を取らせるための同期である。In FIG. 11, each processing device (hereinafter abbreviated as PE: element processor) executes process 1, performs summation calculation, and then executes process 2. After executing process 1, PE1 and PE3 check whether the addition request has been received and processed from another PE. Since there is no PE requesting processing for PEI and PE3, they immediately execute transmission processing to PE2 and PE4, respectively, and then execute processing 2. On the other hand, PE2
Then, after confirming the reception and processing of the addition request from the PEI, the transmission processing to PE4 is executed, and then processing 2 is executed. PE4 executes process 2 after confirming the reception and processing of the addition requests from PE3 and PE2. The final overall synchronization is synchronization for making all PEs take the same step.

このためカスケード型に演算を開始してから該演算の結
果を参照するまで時間的余裕があっても処理装置に遊び
、すなわち浪費時間が生じる。第１１図では確認待ちと
全体同期待ちの遊び時間が発生している。Therefore, even if there is sufficient time from the start of a cascade-type calculation to the reference to the result of the calculation, there is idle time in the processing device, that is, wasted time. In FIG. 11, there is idle time for waiting for confirmation and waiting for overall synchronization.

本発明の目的はカスケード型の演算開始から結果の参照
まで時間的余裕があれば受信のための処理装置の浪費時
間を削除できるカスケード型の演算実行方法を提供する
ことにある。SUMMARY OF THE INVENTION An object of the present invention is to provide a cascade type calculation execution method that can eliminate the wasted time of a processing device for reception if there is enough time from the start of the cascade type calculation to the reference of the result.

〔課題を解決するための手段〕上記目的を達成するために、本発明ではデータを受信し
た処理装置が該受信データと一当該処理装置の保有する
記憶装置中の第１の所定番地のデータとの間で所定の第
１の演算（カスケード型演算で実行する演算）を実行し
て演算結−果を当該処理装置の保有する記憶装置中の第
１の所定番地に格納した後、当該処理装置の保有する記
憶装置中の第２の所定番地のデータに所定の第２の演算
を実行し、該第２の演算結果が０であれば前記当該処理
装置の保有する記憶装置中の第１の所定番地のデータを
別の処理装置に送信してデータを受信する前の処理に戻
り、該第２の演算結果が０でなければ何もせずにデータ
を受信する前の処理に戻る。[Means for Solving the Problems] In order to achieve the above object, in the present invention, a processing device that receives data combines the received data with data at a first predetermined location in a storage device held by the processing device. After executing a predetermined first operation (operation executed in a cascade type operation) between A predetermined second operation is performed on the data at a second predetermined location in the storage device held by the processing device, and if the result of the second calculation is 0, the data at the first location in the storage device held by the processing device is executed. The process returns to the process before transmitting the data of the predetermined location to another processing device and receiving the data, and if the second calculation result is not 0, nothing is done and the process returns to the process before receiving the data.

[Effect]

データを受信した処理装置では計数カウンタ（第２の所
定番地のデータ）から１を減算して（第２の演算）、受
信するべき情報をすべて受信したかを確認する。前記計
数カウンターがＯになれば、すべての情報を受信したの
で自分自身が根の（最終結果が得られる）処理装置でな
ければ、前記自分の保有する記憶装置中の第１の所定番
地のデータを別の適切な処理装置に送信する。自分自身
が根の処理装置の場合はカスケード計算の終了を通知す
る。本発明では送信動作はそれが必要となった時点で受
信動作の延長として実行されるので処理装置の無駄な遊
び時間が生じない。The processing device that has received the data subtracts 1 from the counting counter (data at the second predetermined location) (second calculation) to check whether all the information to be received has been received. When the counting counter becomes O, all the information has been received, so if the processing device itself is not the root processing device (from which the final result can be obtained), the data at the first predetermined location in the storage device owned by the processing device is to another suitable processing device. If the device itself is a root processing device, it notifies the end of cascade calculation. In the present invention, the transmitting operation is performed as an extension of the receiving operation when it becomes necessary, so there is no wasted idle time of the processing device.

第１２図は本発明によるＰＥの待ち時間を示した図であ
る。第１２図では各ＰＥは処理１を実行し、カスケード
計算実行命令を実行し、処理２を実行して全体の同期を
取る。ＰＥｌとＰＥ３ではカスケード計算実行命令の実
行後、それぞれ直ちにＰＥ２とＰＥ４に送信処理を行い
、処理２を実行する。ＰＥ２はカスケード計算実行命令
の実行後、処理２を実行し、その間にＰＥＩからの加算
依頼を受信するとその割り込み処理の延長上でＰＥ４へ
の送信処理を実行し、処理２の実行に戻る。ＰＥ４はカ
スケード計算実行命令の実行後、処理２を実行し、その
間にＰＥ２とＰＥ３から加算依頼を受信する。２回目の
受信のときカスケード計算の終了を通知する。FIG. 12 is a diagram showing the waiting time of PE according to the present invention. In FIG. 12, each PE executes process 1, executes a cascade calculation execution command, and executes process 2 to achieve overall synchronization. After executing the cascade calculation execution command, PEl and PE3 immediately perform transmission processing to PE2 and PE4, respectively, and execute processing 2. After executing the cascade calculation execution command, PE2 executes process 2, and if it receives an addition request from PEI during that time, it executes a transmission process to PE4 as an extension of the interrupt process, and returns to process 2. After executing the cascade calculation execution command, PE4 executes process 2, and during that time receives addition requests from PE2 and PE3. When received for the second time, the end of cascade calculation is notified.

〔Example〕

実施例１：本発明の実施例を図を用いて説明する。まず本発明に係
わる情報処理装置を説明する。Example 1: An example of the present invention will be described with reference to the drawings. First, an information processing apparatus according to the present invention will be explained.

第２図は１個の要素プロセッサの説明図である。FIG. 2 is an explanatory diagram of one element processor.

要素プロセッサ１は結合網制御回路３を介し°〔他の要
素プロセッサ（図示略）と互いに結合している。The element processors 1 are coupled to other element processors (not shown) via a connection network control circuit 3.

第３図は複数の要素プロセッサを相互に結合した計算機
システム全体図である。第３図で２は要素プロセッサ間
の結合網で４はシステム全体を制御するホスト計算機で
ある６本実施例では結合網はハイパーキューブ（前記「
並列計算機構成論」７９頁から８０頁を参照）のトポロ
ジーを含むと仮定する。FIG. 3 is an overall diagram of a computer system in which a plurality of element processors are interconnected. In FIG. 3, 2 is a connection network between element processors, and 4 is a host computer that controls the entire system. 6 In this embodiment, the connection network is a hypercube (as described above).
It is assumed that the topology includes the topology of "Parallel Computer Configuration Theory", pages 79 to 80).

要素プロセッサ１は基本的にはフォンノイマン型計算機
であり、他に結合網制御装置３を介して他の要素プロセ
ッサ１と通信する機能を持っている１通常の演算命令や
上記通信命令等の他に本発明の実施のための幾つかの命
令を持つが詳しくは後述する。第２図では簡単のためフ
ォンノイマン型計算機の機能のうち本発明に関係しない
部分は省略しである。The element processor 1 is basically a von Neumann type computer, and also has the function of communicating with other element processors 1 via the interconnection network control device 3.It also has the function of communicating with other element processors 1 through the connection network control device 3. contains some instructions for implementing the present invention, which will be described in detail later. In FIG. 2, for the sake of simplicity, parts of the functions of the von Neumann computer that are not related to the present invention are omitted.

第２図中で５は受信フラグ、７は受信レジスタである。In FIG. 2, 5 is a reception flag and 7 is a reception register.

結合網制御回路３は受信レジスタ７？、こ情報を書き込
んだ後に受信フラグ５をＯｎにし、制御回路９は受信レ
ジスタ７の情報の処理が終了すると受信フラグ５を。ｆ
ｆにする。１３は自ＰＥ＃レジスタでこの要素プロセッ
サ１に付けられたユニークな番号の値が書かれている０
本実施例では要素プロセッサ番号は各要素プロセッサの
ハイパーキューブ上でのアドレスが使用されている。１
５はプログラムカウンターで次に実行する命令が格納さ
れている（記憶装置１７中の）アドレスが格納される。Is the connection network control circuit 3 the receiving register 7? After writing this information, the control circuit 9 turns on the reception flag 5, and when the processing of the information in the reception register 7 is completed, the control circuit 9 turns on the reception flag 5. f
Make it f. 13 is the own PE# register and the value of the unique number assigned to this element processor 1 is written 0.
In this embodiment, the address of each element processor on the hypercube is used as the element processor number. 1
A program counter 5 stores an address (in the storage device 17) where the next instruction to be executed is stored.

１９は送信レジスタで、２１は送信フラグである。19 is a transmission register, and 21 is a transmission flag.

制御回路９は送信レジスタ１９に情報を書き込むと送信
フラグ２１を。ｎにし、結合網制御回路３は送信レジス
タ１９の情報を他の要素プロセッサｌに結合した結合網
制御回路３に転送すると送信フラグ２１をｏｆｆにする
。９は要素プロセッサ１の制御回路であり、フォンノイ
マン型計算機としての主要部分である。１１はフォンノ
イマン型計算機の汎用レジスタである。When the control circuit 9 writes information to the transmission register 19, it sets the transmission flag 21. n, and the network control circuit 3 transfers the information in the transmission register 19 to the network control circuit 3 coupled to another element processor 1, and then turns off the transmission flag 21. 9 is a control circuit for the element processor 1, which is the main part of the von Neumann type computer. 11 is a general-purpose register of the von Neumann type computer.

２３は結合網制御回路３内部の受信バッファである。受
信バッファは結合網制御回路３がラインで結合している
結合網制御回路３対応に持っている。（例えば、この結
合網制御回路が７つの結合網制御回路とラインで結ばれ
ていれば７つの受信バッファを持つ、）第４図は受信レジスタ７や送信レジスタ１９゜受信バッ
ファ２３に格納される情報（以下でほこ九をパケットと
呼ぶ）の形式の説明図である。第４図（ａ）の４０１は
受信レジスタ１９と受信バッファ２３に格納される情報
（以下では出力パケットと呼ぶ）の形式である。出力パ
ケット４０１は基本的には４つのフィールドに分がれる
。４ｏ３は宛先ＰＥ番号フィールドであり、このパケッ
トを受は取るべき要素プロセッサの番号が入る。23 is a receiving buffer inside the network control circuit 3. A receiving buffer is provided corresponding to the network control circuit 3 connected to the network control circuit 3 through a line. (For example, if this network control circuit is connected to 7 network control circuits by a line, it will have 7 reception buffers.) Figure 4 shows the data stored in the reception register 7, transmission register 19, and reception buffer 23. It is an explanatory diagram of a format of information (hereinafter referred to as a packet). 401 in FIG. 4(a) is the format of information (hereinafter referred to as an output packet) stored in the reception register 19 and reception buffer 23. The output packet 401 is basically divided into four fields. 4o3 is a destination PE number field, which contains the number of the element processor that should receive this packet.

４０５はこのパケットのデータフィールド４０９の長さ
（ビット長）が入り、４０７の処理コードフィールドは
このパケットに対して行うべき処理の種別を示す、この
３つのフィールド（４０３゜４０５．４０７）は固定長
である。４ｏ９はデータフィールドであり、このフィー
ルドの長さは処環コード４０７により変わる。また４０
９のデータフィールドは処理コード４０７の値によって
は幾つかのサブフィールドに分かれる。ここでは本発明
に係わるカスケード計算処理要求パケットのサブフィー
ルドを示す。４１１は後述するカスケード制御ブロック
（以下ＣＣＢと略記する）のアドレスが入り、４１３に
はカスケード計算の対象データが入る。405 contains the length (bit length) of the data field 409 of this packet, and the processing code field 407 indicates the type of processing to be performed on this packet. These three fields (403, 405, 407) are fixed. It is long. 4o9 is a data field, and the length of this field varies depending on the processing code 407. 40 again
The data field 9 is divided into several subfields depending on the value of the processing code 407. Here, subfields of a cascade calculation processing request packet according to the present invention are shown. 411 contains the address of a cascade control block (hereinafter abbreviated as CCB), which will be described later, and 413 contains target data for cascade calculation.

次に第４図（ｂ）の４０２は受信バッファ７に格納され
る情報の形式（以下では受信パケットと呼ぶ）である、
受信パケット４０２は送信パケット４０１から宛先ＰＥ
番号のフィールドを除いたものである。４０６はデータ
長フィールド、４０ｇは処理コードフィールド、４１０
はデータフィールドである。それらの内容説明は送信パ
ケットのそれと同じなので省略する。Next, 402 in FIG. 4(b) is the format of information stored in the reception buffer 7 (hereinafter referred to as a reception packet).
The received packet 402 is sent from the transmitted packet 401 to the destination PE.
This excludes the number field. 406 is a data length field, 40g is a processing code field, 410
is a data field. The explanation of these contents is the same as that of the transmission packet, so the explanation thereof will be omitted.

結合網制御回路３が受信レジスタ７に情報を書き込むと
きに宛先ＰＥ番号フィールド４０３を取り除く。２５は
後述するｃｃＢの内容をロードするレジスタであり、２
７は本発明に係わる受信パケット４０２中のオペランド
サブフィールド４１３をロードするレジスタであり、２
９はＣＣＢのアドレスをロードするレジスタである。When the connected network control circuit 3 writes information to the reception register 7, the destination PE number field 403 is removed. 25 is a register for loading the contents of ccB, which will be described later;
7 is a register for loading the operand subfield 413 in the received packet 402 according to the present invention;
9 is a register into which the CCB address is loaded.

く結合網制御回路の機能〉第３図に見るように、要素プロセッサは結合網制御回路
３を介して他の要素プロセッサと接続されている。結合
網制御回路３は次の機能を持つ。Functions of Network Control Circuit> As shown in FIG. 3, the element processors are connected to other element processors via the network control circuit 3. The connection network control circuit 3 has the following functions.

（１）第４図に示すような送信パケット４０１を他の要
素プロセッサの結合網制御回路から受は取り受信バッフ
ァ２３に格納する。(1) A transmission packet 401 as shown in FIG. 4 is received from the connection network control circuit of another element processor and stored in the reception buffer 23.

（２）送信パケットの宛先ＰＥ番号フィールド４０３が
自分自身が接続されている要素プロセッサ１に付加され
た番号と異なる場合はそのパケットを適当な要素プロセ
ッサ１に接続された結合網制御回路３に転送する。(2) If the destination PE number field 403 of the transmitted packet is different from the number added to the element processor 1 to which it is connected, the packet is transferred to the connected network control circuit 3 connected to the appropriate element processor 1 do.

（３）送信パケットの宛先ＰＥ番号フィールド４０３が
自分自身が接続されている要素プロセッサ１に付加され
た番号と一致するとき、接続されている要素プロセッサ
１の受信フラグ５がｏｆｆになるのを待って受信レジス
タ７にその送信パケット４０１から宛先ＰＥ番号フィー
ルド４０３を除いた４０５，４０７，４０９を書き込み
、受信フラグを。ｎにする。(3) When the destination PE number field 403 of the transmitted packet matches the number added to the element processor 1 to which it is connected, wait for the reception flag 5 of the connected element processor 1 to turn off. 405, 407, and 409 obtained by removing the destination PE number field 403 from the transmission packet 401 are written in the reception register 7, and a reception flag is set. Make it n.

（４）自分自身が接続されている要素プロセッサ１の送
信フラグ５が。ｎになると送信レジスタ１９の送信パケ
ット４０１をそのパケットの宛先ＰＥ番号フィールド４
０３が指定する要素プロセッサ１に接続されている結合
網制御回路３に届くように適当な結合網制御回路３に転
送する。(4) The transmission flag 5 of the element processor 1 to which it is connected. When n is reached, the transmission packet 401 in the transmission register 19 is transferred to the destination PE number field 4 of that packet.
The data is transferred to an appropriate network control circuit 3 so that it reaches the network control circuit 3 connected to the element processor 1 specified by 03.

〈カスケード制御ブロック〉本発明の実現において演算のカスケード型実行の制御に
カスケード制御ブロック（ＣＣＢ）という名称の構造を
持つデータを使用する。本実施例ではこれは要素プロセ
ッサ１の記憶装置１７上に構成される。その構成内容を
述べておく。<Cascade Control Block> In implementing the present invention, data having a structure called a cascade control block (CCB) is used to control cascade-type execution of operations. In this embodiment, this is configured on the storage device 17 of the element processor 1. Let me explain its composition.

ＣＣＢは第５図に示すように次の６つのフィールドから
なる。The CCB consists of the following six fields as shown in FIG.

（ａ）　　カスケード計算の結果が入る変数のアドレス
２０１（ｂ）　　割り込の計数カウンタ２０３（ｃ）　　ＣＣ
Ｂの状態と属性を表すフィールド２０５（ｄ）　　ルー
プＰＥ番号２０７（ｅ）　　プログラムａのアドレス２０９（ｆ）　　プ
ログラムｂのアドレス２１１以上のフィールドについて
簡単に説明する。(a) Address 201 of the variable containing the result of cascade calculation (b) Interrupt counter 203 (c) CC
Field 205(d) representing the status and attributes of B, Loop PE number 207(e) Address 209(f) of program a, Address 211 of program b and above fields will be briefly explained.

（ａ）は説明不要であろう、（ｂ）はこの要素プロセッ
サを親とする要素プロセッサ１の個数が入る。（ｃ）は
この後で処理する。（ｄ）はカスケード計算の最終結果
を得る要素プロセッサ１の番号が入る。（ｅ）はこの要
素プロセッサ１がらこの要素プロセッサ１の親に送信パ
ケットを送るプログラムａのアドレスが入る。（ｆ）は
カスケード計算が終了したことを通知するプログラムｂ
のアドレスが入る。(a) is self-explanatory; (b) contains the number of element processors 1 that have this element processor as a parent. (c) will be processed later. (d) contains the number of element processor 1 that obtains the final result of the cascade calculation. In (e), the address of program a that sends a transmission packet from this element processor 1 to the parent of this element processor 1 is entered. (f) is a program b that notifies that the cascade calculation has finished.
The address is entered.

更に（ｃ）の属性フィールド２０５は少なくとも次の要
素を含む。Furthermore, the attribute field 205 in (c) includes at least the following elements.

（ｃｌ）　カスケード計算の種類２１３（Ｃ２）割り込
み許可フラグ２１５カスケード計算には総和計算、最大値計算、最小値計算
等がある。これらは２１３の値により区別される。２１
５は２０９のプログラムの起動許可を与えるフラグであ
る。これはこの要素プロセッサ１でカスケード計算命令
（後述する）が実行されるとセットされる。(cl) Cascade calculation type 213 (C2) Interrupt permission flag 215 Cascade calculations include summation calculation, maximum value calculation, minimum value calculation, etc. These are distinguished by a value of 213. 21
5 is a flag that gives permission to start the program 209. This is set when a cascade calculation instruction (described later) is executed in this element processor 1.

以下ではこれまで述べた回路構成の下で、どのように本
発明が実現されるかを述べる。まずカスケード計算実行
命令の書式を述べる。Below, it will be described how the present invention is realized under the circuit configuration described above. First, the format of the cascade calculation execution command will be described.

カスケード計算の実行命令は次の書式である。The execution command for cascade calculation has the following format.

■　ＣＡＳＣＲ１，Ｌ２ ■　ＣＡＳＣＲＲ１，Ｒ２ここでＲ１はカスケード計算の対象となるデータを保有
するレジスタ（汎用レジスタまたは浮動小数点レジスタ
）を指定する。形式■ではＬ２はペースレジスタ＋イン
デックスレジスタ＋オフセットで表される記憶装置上の
ＣＣＢアドレスを指定する。形式■ではＲ２はＣＣＢの
アドレスを含む汎用レジスタを指定する。■ CASCR1, L2 ■ CASCRR1, R2 Here, R1 specifies a register (general-purpose register or floating point register) that holds data to be subjected to cascade calculation. In format 2, L2 specifies the CCB address on the storage device represented by pace register + index register + offset. In format 2, R2 specifies a general-purpose register containing the address of the CCB.

くカスケード計算命令実行時の動作〉カスケード計算命令実行時の動作を第６図のフローチャ
ートに従って説明する。ここでは形式■の書式にしたが
って説明する。Operation upon Execution of Cascade Calculation Instruction> The operation upon execution of the cascade calculation instruction will be described with reference to the flowchart in FIG. Here, explanation will be given according to the format (■).

（１）ラベルＬ２のＣＣＢアドレスを計算して、ＣＣＢ
アドレスレジスタ２９に格納し、該アドレスから変数ア
ドレス２０１．計数カウンタ２０３、属性フィールドの
２０５．根のＰＥ番号２０７．プログラムａのアドレス
２０９．プログラムｂのアドレス２１１をＣＣＢレジス
タ２５にロードする。（６０１）（２）Ｒ１の内容と２００の内容との間で２１３で指定
の計算を実行して結果を２００に格納する。(1) Calculate the CCB address of label L2, and
It is stored in the address register 29, and variable address 201. Counter 203, attribute field 205. Root PE number 207. Address 209 of program a. Load address 211 of program b into CCB register 25. (601) (2) Perform the specified calculation in 213 between the contents of R1 and 200 and store the result in 200.

（３）計数カウンタ２０３がＯ以外ならば２０３を元の
ＣＣＢに格納して終了する。（６０５）（４）計数カウ
ンタ２０３が０のときは根のＰＥ番号２０７と自ＰＥ番
号レジスタ１３の内容とを比較する。（６０７）（５）前記比較の結果が一致すれば２１９のプログラム
を呼び、カスケード計算の一終了を通知し、計数カウン
タ２０３を元のＣＣＢに格納して終了する。（６０９）（６）前記比較の結果が一致しないときは２１７のプロ
グラムを呼び、親の要素プロセッサ１に送信し、計数カ
ウンタ２０３を元のＣＣＢに格納して終了する。（６１
１）次に親ＰＥへの送信処理２１７を説明する。(3) If the count counter 203 is other than O, store 203 in the original CCB and end the process. (605) (4) When the counting counter 203 is 0, the root PE number 207 and the contents of the own PE number register 13 are compared. (607) (5) If the results of the comparison match, the program 219 is called, the end of the cascade calculation is notified, the counting counter 203 is stored in the original CCB, and the process ends. (609) (6) If the results of the comparison do not match, call the program 217, send it to the parent element processor 1, store the counting counter 203 in the original CCB, and end. (61
1) Next, the transmission process 217 to the parent PE will be explained.

（１）自ＰＥ＃レジスタ１３とＣＣＢ中の根のＰＥ番号
２０７から送信先のＰＥ番号を計算する。(1) Calculate the destination PE number from the own PE# register 13 and the root PE number 207 in the CCB.

本実施例では親のＰＥ番号は１３と２０７を下位ヒツト
から比較し、最初に一致しなかったビットのあるビット
位置で１３のビットを反転して得られる。In this embodiment, the parent PE number is obtained by comparing 13 and 207 starting from the lowest hit, and inverting the bit of 13 at the bit position where the first non-matching bit exists.

（２）送信フラグ２１が。ｆｆになるまで待つ。(2) The transmission flag 21 is set. Wait until it becomes ff.

（３）送信パケット４０１を送信レジスタ１９に作成す
る。宛先ＰＥ番号フィールド４０３に前記（１）で計算
した値を書き、データ長フィールド４０５と処理コード
フィールド４０７はＣＣＢレジスタの値をそのまま転送
する。データフィールド４０９のうちアドレスサブフィ
ールド４１１にはＣＣＢアドレスレジスタ２９の内容を
転送し、オペランドサブフィールドには２００の値を書
き込む。(3) Create a transmission packet 401 in the transmission register 19. The value calculated in (1) above is written in the destination PE number field 403, and the data length field 405 and processing code field 407 transfer the values of the CCB register as they are. The contents of the CCB address register 29 are transferred to the address subfield 411 of the data field 409, and a value of 200 is written to the operand subfield.

（４）送信フラグをｏｎにして送信処理を終了する。(4) Turn on the transmission flag and end the transmission process.

３番目にカスケード型計算終了通知処理２１９を説明す
る。この処理はカスケード計算の終了をプログラムに知
らせるもので、通常、システムのマクロを使用する０本
実施例ではｐｏｓｔというマクロで同期を取る。すなわ
ちカスケード計算の終了を待つ方はｗａｉｔというマク
ロを使用し、終了を通知する方はｐｏｓｔというマクロ
を使用する。詳しくは「オペレーティングシステムの実
際Ｊ　（昭和５９年３り昭晃堂版、吉澤康文著）１４３
頁から１４４頁を参照。Third, the cascade type calculation end notification process 219 will be explained. This process notifies the program of the end of the cascade calculation, and normally uses a system macro.In this embodiment, synchronization is achieved using a macro called post. That is, those who wait for the completion of the cascade calculation use a macro called wait, and those who notify the completion use a macro called post. For details, see "Practical Operating System J (March 1980, Shokodo edition, written by Yasufumi Yoshizawa) 143.
See pages 144 to 144.

本発明のカスケード計算処理パケットを受信した場合の
処理を第１図にしたがって説明する。Processing when a cascade calculation processing packet of the present invention is received will be explained with reference to FIG.

制御回路９は１回の命令の終了毎に受信フラグ５を調べ
る。受信フラグ５がＯｎのときは、処理コードフィール
ド４０８を胴入、カスケード計算処理のコードのときは
以下の処理に移る。他のコードのときもそれぞれの処理
に移るが本発明と無関係なのでここでは省略する。The control circuit 9 checks the reception flag 5 every time one command is completed. When the reception flag 5 is on, the processing code field 408 is entered, and when it is a code for cascade calculation processing, the process moves on to the following processing. The processing for other codes will also be repeated, but since they are irrelevant to the present invention, they will be omitted here.

（１）受信パケット４０２中のアドレスサブフィールド
４１１をＣＣＢアドレスレジスタ２９にロードし、前記
アドレスサブフィールドの示すＣＣＢから変数アドレス
２０１．計数カウンタ２０３、属性フィールド２０５．
根のＰＥ番号２０７、プログラムａのアドレス２０９．
プログラムｂのアドレス２１１をＣＣＢレジスタ２５に
ロードする。＜１０１）（２）受信パケット４０２中のオペランドサブフィール
ド４１３をデータレジスタ２７にロードする。（１０３
）（３）データレジスタの内容とＣＣＢで指定の２００の
変数の内容との間で計算の識別用フィールド２１３で指
定の計算を実行し、結果を２００の変数に格納する。（
１０Ｂ）（４）計数カウンタ２０３から１を減算する。(1) Load the address subfield 411 in the received packet 402 into the CCB address register 29, and select the variable address 201. from the CCB indicated by the address subfield. Count counter 203, attribute field 205.
Root PE number 207, program a address 209.
Load address 211 of program b into CCB register 25. <101) (2) Load operand subfield 413 in received packet 402 into data register 27. (103
(3) Perform the calculation specified in the calculation identification field 213 between the contents of the data register and the contents of the 200 variables specified in the CCB, and store the result in the 200 variables. (
10B) (4) Subtract 1 from the counting counter 203.

（５）計数カウンタ２０３が０でないときは終了する。(5) If the count counter 203 is not 0, the process ends.

（１０９）（６）割り込み許可フラグ２１５がｏｆｆならば終了す
る。（１１１）（７）自ＰＥ＃レジスタ１３の内容とＣＣＢの根ＰＥ番
号フィールド２０７を比較する。　（１１３）（８）自
ＰＥ＃レジスタ１３の内容とＣＣＢの根ＰＥ番号フィー
ルド２０７が一致した場合はＣＣＢで指定のプログラム
ｂ２１９を呼び、終了する。（１１５）（９）自ＰＥＲレジスタ１３の内容とＣＣＢの根ＰＥ番
号フィールド２０７が一致しない場合はＣＣＢで指定の
プログラムａ２１７を呼び、終了する。（１１７）実施例２：次に実施例２としてＣＣＢの初期設定を最初のカスケー
ド計算の実行あるいはカスケード計算の受信時に行う例
を述べる。(109) (6) If the interrupt permission flag 215 is off, the process ends. (111) (7) Compare the contents of the own PE# register 13 and the root PE number field 207 of the CCB. (113) (8) If the contents of the own PE# register 13 match the root PE number field 207 of the CCB, the CCB calls the designated program b219 and ends. (115) (9) If the content of the own PER register 13 and the root PE number field 207 of the CCB do not match, the CCB calls the designated program a217 and ends. (117) Embodiment 2: Next, as Embodiment 2, an example will be described in which the initial setting of the CCB is performed at the time of executing the first cascade calculation or receiving the cascade calculation.

実施例として第１図が第７図に変わり、第６図が第８図
に変わるだけである。As an example, only FIG. 1 is changed to FIG. 7, and FIG. 6 is changed to FIG. 8.

まずＣＡＳＣ命令を実行した場合の動作を第８図のフロ
ーチャートにしたがって説明する。形式■のＣＡ’；Ｃ
Ｒの書式に基づいて説明する。First, the operation when the CASC instruction is executed will be explained with reference to the flowchart of FIG. Format ■CA';C
The explanation will be based on the R format.

（１）記憶装置１７上のＲ２の内容が指定する番地から
ＣＣＢをＣＣＢレジスタ２５に読み出し５Ｒ２の内容を
ＣＣＢアドレス２９にコピーする。(1) Read the CCB from the address specified by the contents of R2 on the storage device 17 to the CCB register 25 and copy the contents of 5R2 to the CCB address 29.

（２）Ｒ１の内容とＣＣＢで指定の２００の内容との間
で２１３で指定の演算を実行し、結果を２００に格納す
る。（８０３）（３）計数カウンタ２０３の値を調べ、値が正ならば終
了し、値が負ならば、（７）に分岐する。(2) Perform the specified operation in 213 between the contents of R1 and the contents of 200 specified in CCB, and store the result in 200. (803) (3) Check the value of the counting counter 203, and if the value is positive, the process ends; if the value is negative, the process branches to (7).

（４）ＣＣＢ中の根のＰＥ番号フィールド２０７と自Ｐ
Ｅ＃レジスタ１３の内容とを比較する。(4) Root PE number field 207 in CCB and own P
The contents of the E# register 13 are compared.

（５）２０７と１３が一致するときはＣＣＢのプログラ
ムｂのアドレス２１１で指定のカスケード型計算終了通
知処理を呼出し、終了する。(5) When 207 and 13 match, the designated cascade type calculation end notification process is called at address 211 of program b in the CCB, and the process ends.

（６）２０７と１３が一致しないときはＣＣＢのプログ
ラムａのアドレス２０９で指定の親ＰＥへの送信処理を
呼出し、終了する。（８１１）（７）計数カウンタ２０
３を初期設定する０本実施例では設定する値は自ＰＥ＃
レジスタ１３の内容とＣＣＢ中の根ＰＨの番号フィール
ド２０７との間で排他的論理和を取り、結果の下位ビッ
トから連続するＯの個数である。（８１３）（８）初期
設定した計数カウンタの値を調べ、０のときは（５）に
分岐し、Ｏでなければ終了する。(6) If 207 and 13 do not match, call the transmission process to the specified parent PE at address 209 of program a in the CCB, and end. (811) (7) Counter 20
In this example, the value to be set is the own PE#.
Exclusive OR is performed between the contents of the register 13 and the number field 207 of the root PH in the CCB, and this is the number of consecutive O's starting from the lower bit of the result. (813) (8) Check the value of the initialized counter, and if it is 0, branch to (5), and if it is not 0, end.

次に実施例２でカスケード型計算処理のパケットを受信
した場合の動作を第７図のフローチャートに基づいて説
明する。Next, the operation when a packet of cascade type calculation processing is received in the second embodiment will be explained based on the flowchart of FIG.

（１）受信パケット４０２中のアドレスサブフィールド
４１２をＣＣＢアドレスレジス２９にロードし、４１２
の指定する番地からＣＣＢをＣＣＢレジスタ２５にロー
ドする。（７０１）（２）受信パケット４０２中のデー
タフィールド４１３をデータレジスタ２２にロードする
。(1) Load the address subfield 412 in the received packet 402 into the CCB address register 29,
The CCB is loaded into the CCB register 25 from the address specified by . (701) (2) Load the data field 413 in the received packet 402 into the data register 22.

（３）データレジスタ２２の内容とＣＣＢで指定の２０
０のデータとの間で２１３６で指定の演算を実行し、結
果を２００に格納する。（７０５）（４）計数カウンタ
２０３が負でなれば（６）に分岐する。（７０７）（５）計数カウンタ２０３を初期設定する。　（７１９
）（６）計数カウンタ２０３から１を減算する。(3) Contents of data register 22 and 20 specified by CCB
A specified operation is performed in 2136 with the data of 0, and the result is stored in 200. (705) (4) If the count counter 203 is negative, the process branches to (6). (707) (5) Initialize the counting counter 203. (719
)(6) Subtract 1 from the count counter 203.

（７）計数カウンタ２０３が０以外ならば終了する。(7) If the count counter 203 is other than 0, the process ends.

（８）割り込み許可フラグ２１５がｏｆｆならば終了す
る。（７１３）（９）ＣＣＢ中の根のＰＥ番号フィールド２０７と自Ｐ
Ｅ＃レジスタの内容とを比較する。（７１５）（１０）
　２０７と１３の内容が一致しないときはＣＣＢのプロ
グラムａのアドレス２０９で指定の親ＰＥへの送信処理
を呼呂し、終了する。（７１７）（１１）　１３の内容
と２０７が一致する場合はＣＣＢで指定のプログラムｂ
２１９を呼び、終了する。(8) If the interrupt permission flag 215 is off, the process ends. (713) (9) Root PE number field 207 in CCB and own P
Compare with the contents of the E# register. (715) (10)
If the contents of 207 and 13 do not match, the transmission process to the designated parent PE is terminated at address 209 of program a of the CCB. (717) (11) If the contents of 13 and 207 match, program b specified by CCB
Call 219 and exit.

実施例２の効果ここで実施例２の効果を述べる。第５図のＣＣＢの構造
の中で要素プロセッサにより値が異なるものは割り込み
計数カウンタのみである。他の部分はプログラムを記憶
装置にロードするときあるいはコンパイル時に値を設定
することができる。要素プロセッサ台数が大きいときは
各要素プロセッサに異なるプログラムをロードするのは
実用的でない、実施例２ではコンパイル時に割り込み計
数カウンタに負の値を設定すれば実行時に初期化を行う
のでプログラミングが簡単にする。Effects of Example 2 The effects of Example 2 will be described here. In the structure of the CCB shown in FIG. 5, only the interrupt counter has a different value depending on the element processor. Values for other parts can be set when the program is loaded into a storage device or when compiled. When the number of element processors is large, it is impractical to load a different program to each element processor.In Example 2, if a negative value is set to the interrupt counter at compile time, the program is initialized at run time, which simplifies programming. do.

実施例３：　（本実施例では第２の演算がシフト演算で
ある）本実施例では、第１図が第９図に代わり、第６図が第１
０図に代わる。まず、カスケード型計算実行命令を実行
した場合の動作を第１０図に基づいて説明する。ここで
は形式■の命令を実行した場合について説明する。Embodiment 3: (In this embodiment, the second operation is a shift operation) In this embodiment, FIG. 1 is replaced with FIG. 9, and FIG.
Replaces figure 0. First, the operation when a cascade type calculation execution instruction is executed will be explained based on FIG. 10. Here, we will explain the case where an instruction of format (2) is executed.

（１）ラベルＬ２のアドレスを計算してＣＣＢアドレス
レジスタ２９に格納し、該アドレスからＣＣＢの内容を
ＣＣＢレジスタ２５にロードする。（１００１）（２）Ｒ１の内容とＣＣＢで指定の変数２００の内容と
の間で２１３で指定の演算を実行し、結果を２００の変
数に格納する。（１００３）（３）計数カウンタ２０３
の最下位ビットを調べ、０ならば（７）に分岐する。（
１００５）（４）自ＰＥ＃レジスタ１３の内容とＣＣＢ
中の根のＰＥ番号フィールド２０７を比較し、一致した
場合は（６）に分岐する。（１００７）（５）２０７と
１３の内容が一致しないときはＣＣＢはプログラムａの
アドレス２０９で指定の親ＰＥへの送信処理を呼出し、
終了する。　（１０１１）（６）１３の内容と２０７が
一致する場合はＣＣＢで指定のプログラムｂ２１９を呼
び、終了する。(1) Calculate the address of label L2 and store it in the CCB address register 29, and load the contents of the CCB into the CCB register 25 from the address. (1001) (2) Perform the specified operation in 213 between the contents of R1 and the contents of variable 200 specified in CCB, and store the result in variable 200. (1003) (3) Counter 203
Check the least significant bit of , and if it is 0, branch to (7). (
1005) (4) Contents of own PE# register 13 and CCB
The PE number fields 207 of the roots in the middle are compared, and if they match, the process branches to (6). (1007) (5) If the contents of 207 and 13 do not match, the CCB calls the transmission process to the specified parent PE at address 209 of program a,
finish. (1011) (6) If the contents of 13 and 207 match, the specified program b219 is called in the CCB and the process ends.

（７）ＣＣＢ中の割り込み許可フラグ２１５をセットし
て終了する。（１０１３）次に実施例３でカスケード型計算処理のパケットを受信
した場合の動作を第９図のフローチャートに基づいて説
明する。(7) Set the interrupt permission flag 215 in the CCB and end. (1013) Next, the operation when a packet of cascade type calculation processing is received in the third embodiment will be explained based on the flowchart of FIG.

（１）受信パケット中のアドレスサブフィールド４１２
をＣＣＢアドレスレジスタ２９に複写して、該アドレス
からＣＣＢの内容をＣＣＢレジスタ２５にロードする。(1) Address subfield 412 in received packet
is copied to the CCB address register 29, and the contents of the CCB are loaded into the CCB register 25 from the address.

（９０１）（２）受信パケット中のデータサブフィールド４１４を
データレジスタ２２にロードする。（９０３）（３）デ
ータレジスタの内容と前記ＣＣＢで指定の２００のデー
タとの間で２１３の演算を実行し、結果を２００に格納
する。（９０５）（４）計数カウンタ２０３を１ビツト右にシフトする。(901) (2) Load the data subfield 414 in the received packet into the data register 22. (903) (3) Perform the operation 213 between the contents of the data register and the data 200 specified by the CCB, and store the result in 200. (905) (4) Shift the counting counter 203 to the right by 1 bit.

（９０７）（５）計数カウンタの最下位ビットがＯならば終了する
。（９０９）（６）割り込み許可フラグ２１５が。ｆｆならば終了す
る。（９１１）（７）ＣＣＢ中の根のＰＥ番号２０７と自ＰＥ＃レジス
タ１３とを比較する。（９１３）（８）前記比較結果が一致した場合はＣＣＢのアドレス
ｂで指定の２１９を呼び、終了する。(907) (5) If the least significant bit of the counting counter is O, the process ends. (909) (6) The interrupt permission flag 215 is set. If it is ff, the process ends. (911) (7) Compare the root PE number 207 in the CCB and the own PE# register 13. (913) (8) If the comparison results match, the designated 219 is called at address b of the CCB and the process ends.

（９）前記比較結果が一致しない場合はＣＣＢのアドレ
スａで指定の２１７を呼び、終了する。(9) If the comparison results do not match, the designated 217 is called at address a of the CCB and the process ends.

実施例３の効果ここで実施例３の効果を述べる。実施例３の特長は計数
カウンタの初期設定が簡単になることである。実施例１
と２では計数カウンタの初期値は自ＰＥ＃レジスタ１３
の内容とＣＣＢ中の根のＰＥ番号フィールド２０７との
排他的論理和を計算し、その最下位ビットからの連続す
る０ビツトの数である。Effects of Example 3 The effects of Example 3 will be described here. The feature of the third embodiment is that the initial setting of the counting counter is simple. Example 1
and 2, the initial value of the counting counter is the own PE# register 13.
and the root PE number field 207 in the CCB, which is the number of consecutive 0 bits starting from the least significant bit.

これに対して実施例３で計数カウンタの初期値は自ＰＥ
＃レジスタ１３の内容とＣＣＢ中の根のＰＥ番号フィー
ルド２０７との排他的論理和そのものである。On the other hand, in the third embodiment, the initial value of the counting counter is
# This is the exclusive OR of the contents of the register 13 and the root PE number field 207 in the CCB.

〔Effect of the invention〕

本発明によれば、カスケード型計算においてデータの到
着待ちによる待ち時間や全体同期待ちの待ち時間を削減
できる。According to the present invention, it is possible to reduce the waiting time due to waiting for data arrival and the waiting time due to overall synchronization in cascade type calculation.

[Brief explanation of drawings]

第１図は本発明に係わるカスケード型計算処理要求パケ
ットを受信したときの要素プロセッサの動作のフローチ
ャート、第２図は１個の要素プロセッサの説明図、第３
図は要素プロセッサを結合した並列計算機システム全体
の説明図、第４図は送信パケットと受信パケットの説明
図、第５図はＣＣＢの構造説明図、第６図は本発明に係
わるカスケード型計算実行命令実行時の要素プロセッサ
の動作のフローチャート、第７図と第８図は第２実施例
でのカスケード型計算処理要求パケットを受信したとき
の要素プロセッサの動作のフローチャートとカスケード
型計算実行命令実行時の要素プロセッサの動作のフロー
チャート、第９図と第１０図は第３実施例でのカスケー
ド型計算処理要求パケットを受信したときの要素プロセ
ッサの動作のフローチャートとカスケード型計算実行命
令時の要素プロセッサの動作のフローチャート、第１１
図は従来例による処理依頼機能を持つ並列計算機でカス
ケード型に計算するときの説明図、第１２図は本発明に
よりカスケード型に計算するときの説明図である。￥　１　図第団舅図纂 ■ 冨図冨図寥国第１／図Ｈ１ＥＺＥ３ＥＩＬFIG. 1 is a flowchart of the operation of an element processor when receiving a cascade calculation processing request packet according to the present invention, FIG. 2 is an explanatory diagram of one element processor, and FIG.
The figure is an explanatory diagram of the entire parallel computer system combining element processors, Fig. 4 is an explanatory diagram of transmitting packets and receiving packets, Fig. 5 is an explanatory diagram of the structure of CCB, and Fig. 6 is a cascade type calculation execution according to the present invention. A flowchart of the operation of the element processor when executing an instruction. FIGS. 7 and 8 are a flowchart of the operation of the element processor when receiving a cascade calculation processing request packet in the second embodiment, and a flowchart of the operation of the element processor when executing the cascade calculation execution instruction. 9 and 10 are flowcharts of the operation of the element processor when receiving a cascade calculation processing request packet in the third embodiment, and a flowchart of the operation of the element processor when receiving a cascade calculation execution instruction in the third embodiment. Operation flowchart, 11th
The figure is an explanatory diagram of a cascade type calculation performed by a parallel computer having a processing request function according to a conventional example, and FIG. 12 is an explanatory diagram of a cascade type calculation performed according to the present invention. ￥ 1 Diagram No. 1 Diagram Compilation ■ Tomizu Tomizu Country No. 1/ Diagram H1 EZ E3 EIL

Claims

[Claims] 1. In a parallel computer having a processing request function, a first element processor in the parallel computer receives and executes first data and second data from a second element processor in the parallel computer. receiving as an interrupt a processing request including a type of processing to be performed; and data at a first address calculated from the first data and the second data in a storage device of the first element processor. executing a predetermined first operation between and storing the operation result in a first address calculated from the second data in a storage device of the first element processor; The data at the second address calculated from the second data in the storage device of one element processor is
executing the calculation and storing the calculation result in a second address calculated from the second data in a storage device of the first element processor; If the checked result is a specific value, a predetermined first process is executed, and if the checked result is not the specific value, a predetermined second process is executed. An interrupt processing method for executing an operation in a cascade type, characterized in that the step of executing the process executes the operation in a cascade type. 2. The method according to claim 1, wherein the first process includes the step of checking a flag bit at a third address calculated from the second data in a storage device of the first element processor; 1. An interrupt processing method for executing arithmetic operations in a cascade type, comprising the step of executing a predetermined third process when a flag bit has a predetermined value. 3. An interrupt processing method for executing operations in a cascade type in the method according to claim 1 or 2, wherein the second operation is addition or subtraction of a constant value. 4. A method according to claim 1 or 2, in which the second operation is a 1-bit shift operation, using interrupts to execute operations in a cascade type. 5. In the method according to claim 1 or 2, the second
for performing an operation in a cascade type, characterized in that when the result of the operation or the result of checking the result of the second operation is a specific value or a value in a specific range, an initial value is set at the second address. Interrupt handling method. 6. An interrupt processing method for executing an operation in a cascade type in the method according to claim 5, wherein the second operation is subtracting 1, and the specific range is negative.