JPS6315334A

JPS6315334A - Parallel processor

Info

Publication number: JPS6315334A
Application number: JP15781886A
Authority: JP
Inventors: Koichiro Omoda; 面田　耕一郎; Junji Nakakoshi; 中越　順二; Shigeo Nagashima; 長島　重夫; Teruo Tanaka; 輝雄田中; Naoki Hamanaka; 濱中　直樹
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1986-07-07
Filing date: 1986-07-07
Publication date: 1988-01-22
Anticipated expiration: 2013-08-27
Also published as: JP2791010B2

Abstract

PURPOSE:To attain the reallocation of a processing to another processor possible to operate when a processor is placed under a state impossible to operate, by inputting information from a first and a second holding means, detecting the number of the processor to which the processing can be allocated, and performing the allocation of the processing to each processor. CONSTITUTION:The processing read out from a memory device by a reading means is allocated to and executed at each processor, and at this time, a detecting means sends the number of the processor in which no fault is found, and which is not operated at present, to an allocating means, based on the state of the processor held in the first and the second holding means. The allocating means allocates the processing sequentially based on the number of the processor. Also, when the second holding means in execution on the processor, is set at a state impossible to start up, the detecting means detects the number of the processor, and sends it to the allocating means. The allocating means performs the reallocation of the processing based on the number of the processor. In this way, it is possible to allocate the processing dynamically in the execution time of a program, and furthermore, to reallocate the processing when the fault, etc., is found in the processor.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は並列処理装置に係り、特にデータ処理装置に好
適な並列処理装置に関する。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a parallel processing device, and particularly to a parallel processing device suitable for a data processing device.

[Conventional technology]

プログラムには複数の処理を並列に実行できる部分が多
数ある。この場合、並列実行部分の処理が、例えば配列
の要素明の演算のように規則的であれば、Ｃｒｏｙ−１
等のベクトルプロセッサで採用されているパイプライン
処理方式を用いで、規則正しい多数の演算を一括して連
続的に処理させることで効率良く高速化が実現できる。There are many parts of a program that can execute multiple processes in parallel. In this case, if the processing of the parallel execution part is regular, such as the operation of array elements, Croy-1
By using the pipeline processing method adopted in vector processors such as , etc., it is possible to achieve efficient high-speed processing by sequentially processing a large number of regular operations at once.

しかし、並列実行部分の処理が規則的でない場合も数多
く存在する１例えば、回路解析プログラムのデバイスモ
デル計算では多くのトランジスタに関するｆｆＸ算を並
列に実行可能であるが、トランジスタの種類等により演
算が不規則となる。However, there are many cases where the processing of the parallel execution part is not regular.1 For example, in device model calculations of circuit analysis programs, it is possible to execute ffX calculations for many transistors in parallel, but depending on the type of transistor etc. It becomes a rule.

このような場合の処理では、パイプライン処理方式の効
果が充分得られず、高速化が困難である。In processing in such a case, the effect of the pipeline processing method cannot be obtained sufficiently, and it is difficult to increase the speed.

このため、並列実行部分を多数の処理単位に分解し、多
数の処理装置で同時に実行させて高い演算性能を得る並
列処理方式の開発がさかんに行われている。For this reason, parallel processing methods are being actively developed to obtain high arithmetic performance by breaking down parallel execution parts into a large number of processing units and having them simultaneously executed by a large number of processing devices.

このとき、並列に実行できる多数の処理単位を各処理装
置へ割当てて処理させるが、従来の装置はこの割当てが
プログラムで前もって指定されていた。しかし、実行時
に動的に割当てる点については充分配慮されていなかっ
た。At this time, a large number of processing units that can be executed in parallel are assigned to each processing device for processing, but in conventional devices, this assignment is specified in advance in the program. However, sufficient consideration was not given to dynamic allocation during execution.

[Problem that the invention seeks to solve]

上記従来技術は、各処理装置への処理の割当てを実行時
に動的に割当てる点についての配慮がなされておらず次
のような問題点があった。The above-mentioned conventional technology does not take into account dynamic assignment of processing to each processing device at the time of execution, and has the following problems.

並列処理装置では、一般的に、処理装置の台数は数十か
ら数百、あるいは数千から数万台と広範囲の構成が考え
られる。このとき、各処理装置への処理の割当てがプロ
グラムで固定的に指定される場合には、プログラム作成
時点で仮定した台数と、プログラムを実際に実行する時
点での台数とが不一致のとき、処理装置の台数が不足し
ていてプログラムが実行できなかったり、あるいは台数
が多過ぎて遊んでしまう、このため、実際に実行する時
点での台数に合わせてプログラムレベルで修正する必要
があった。In general, parallel processing devices can have a wide range of configurations, ranging from tens to hundreds, or thousands to tens of thousands of processing devices. At this time, if the allocation of processing to each processing device is fixedly specified in the program, if the number of devices assumed at the time of program creation and the number of devices at the time of actual execution of the program do not match, the processing The program could not be executed due to an insufficient number of devices, or the program could not be executed because there were too many devices, so it was necessary to modify the program at the program level to match the number of devices at the time of actual execution.

また、処理装置の台数が非常に多くなった場合には故障
発生の確率が高くなるが、故障が発生した処理装置に割
当てられた処理を他の処理装置へ再割当てして処理を続
行させることができないため信頼性が低いという問題点
があった。Furthermore, when the number of processing devices becomes extremely large, the probability of a failure occurring increases, but it is possible to continue processing by reassigning the processing assigned to the failed processing device to another processing device. There was a problem that reliability was low because it could not be done.

本発明の目的は、各処理装置への処理の割当てを実行時
に動的に行い、更に、処理装置が動作中に故障等により
動作不能状態になった場合、！Ｉ？ｌＪ作可能な作動能
装置へ処理の再割当てが可能なような並列処理装置を提
供することにある。An object of the present invention is to dynamically allocate processing to each processing device at the time of execution, and furthermore, it is an object of the present invention to dynamically allocate processing to each processing device at the time of execution. I? An object of the present invention is to provide a parallel processing device that can reallocate processing to a device capable of performing IJ operations.

[Means for solving problems]

上記目的は、プログラムの指令に基づき処理に必要な情
報を記憶装置から読出す読出し手段各処理装置が処理を
割当てられて動作中かどうかの状態を保持する第１の保
持手段と、各処理装置が故障等により処理の起動が不能
かどうかの状態を保持する第２の保持手段と、各処理装
置に割当てた処理を実行するに必要な情報を保持する第
３の保持手段と、第１と第２の保持手段からの情報を入
力して各処理装置の状態を検出する検出手段と、上記検
出手段からの状態検出情報と、読出し手段からへカされ
る処理に必要な情報に基づき各処理装置へ処理の割当て
行い、また、上記検出手段からの状態検出情報と、第３
の保持手段からの処理に必要な情報に基づき処理装置へ
処理の再割当てを行う割当て手段とで構成することによ
り達成される。The above purpose is to provide a reading means for reading information necessary for processing from a storage device based on instructions of a program; a second holding means that holds the state of whether or not the processing cannot be started due to a failure or the like; a third holding means that holds information necessary to execute the processing assigned to each processing device; a detection means for detecting the state of each processing device by inputting information from the second holding means, and each processing based on the state detection information from the detection means and the information necessary for the processing that is returned from the reading means. Allocates processing to the device, and also uses the state detection information from the detection means and the third
This is achieved by comprising an allocation means for reallocating processing to a processing device based on the information necessary for the processing from the holding means.

[Effect]

読出し手段により記憶装置から読出した処理を各処理装
置へ割当てて実行するが、その際、第１と第２の保持手
段に保持された処理装置の状態に基づき、検出手段は故
障がなくかつ動作中でない処理装置番号を割当て手段に
送出する。割当て手段はこの処理装置番号に基づき順次
処理を割当てるが、このとき、処理装置へ、処理に必要
な情報を読出し手段から取出して転送すると同時に、同
じ情報を処理装置番号に対応する第３の保持手段にセッ
トしておき、また、対応する第１の保持手段を動作状態
にセットする。なお、処理装置での実行が終了すると第
１の保持手段は非動作状態にセットされ、また、実行中
に故障等があれば対応する第２の保持手段は起動不能状
態にセットされる。The processing read out from the storage device by the reading means is assigned to each processing device and executed. At this time, based on the state of the processing device held in the first and second holding means, the detection means is determined to be free of failure and operational. A processing device number that is not in the middle is sent to the allocating means. The allocation means sequentially allocates processing based on this processing device number, but at this time, the information necessary for processing is retrieved from the reading means and transferred to the processing device, and at the same time, the same information is transferred to the third holding device corresponding to the processing device number. and the corresponding first holding means is set to the operating state. Note that when the execution in the processing device is completed, the first holding means is set to a non-operating state, and if there is a failure or the like during execution, the corresponding second holding means is set to a non-startable state.

また、処理装置で実行中に第２の保持手段が起動不能状
態にセットされた場合、検出手段はその処理装置番号を
検出して割当て手段に送出する。Furthermore, when the second holding means is set to a non-startable state during execution in the processing device, the detection means detects the processing device number and sends it to the allocation means.

割当て手段はこの処理装置番号に基づき処理の再割当て
を行うが、この際の処理に必要な情報は検出された処理
装置番号に対応する第３の保持手段から取出す。The allocation means reallocates the processing based on this processing device number, and the information necessary for the processing at this time is taken out from the third holding device corresponding to the detected processing device number.

このように制御することによって、プログラム実行時に
動的に処理の割当てが可能となり、更に。By controlling in this way, it is possible to dynamically allocate processing during program execution, and furthermore.

処理装置での故障等による処理の再割当てが可能となる
。Processing can be reassigned due to a failure in a processing device or the like.

〔Example〕

以下１本発明の一実施例を図面を用いて説明する。第１
図は本発明の一実施例の概略構成図である０図中、１は
主記憶、２は記憶側＃２−１〜７は主記憶１をセフセス
するためのポート３は入出カプロセッサ、４はホストプ
ロセッサ、５はプロセッサエレメント（ＰＥ）制御回路
、５−１はブリフェッチバッファ、５−２はマイクロコ
ンピュータ、５−３は各ＰＥ毎の動作状態を保持するラ
ッチ群、５−４は各ＰＥ毎に、故障等により処理開始の
起動が不能かどうかの状態を保持するラッチ群、５−５
は、各ＰＥ毎に、ＰＥに割当てられたあるまとまった処
理単位（これをブロックと呼び各ブロックは並列に実行
できるものとする。）に関する情報を保持するレジスタ
群、５−６は各ＰＥの動作状態をサーチする回路、５−
７は起動不能状態をサーチする回路、６〜９は番号Ｏ〜
３のＰＥ、６−１〜９−１はローカル記憶、６−２〜９
−２はプロセシングユニット（ＰＵ）、Ｑ１〜Ω９．ｆ
ｌｌｏ−１−Ｄ〜３．１２１０−２．　Ｑ１０−３−０
〜３．１２１０−４−０〜３は信号線である。なお、第
１図ではプロセッサエレメント台数を４とした構成であ
るが、一般的には、４台に限定する８粟はなく実現可能
な範囲であれば何台でも良い。An embodiment of the present invention will be described below with reference to the drawings. 1st
The figure is a schematic configuration diagram of an embodiment of the present invention. In the figure, 1 is the main memory, 2 is the storage side #2-1 to 7 is the port 3 for accessing the main memory 1, and 4 is the input/output processor. 5 is a host processor, 5 is a processor element (PE) control circuit, 5-1 is a brief fetch buffer, 5-2 is a microcomputer, 5-3 is a latch group that maintains the operating state of each PE, and 5-4 is a A group of latches for each PE that maintains the state of whether processing cannot be started due to a failure or the like, 5-5
is a group of registers that holds information about a certain processing unit (this is called a block and each block can be executed in parallel) allocated to each PE, and 5-6 are registers for each PE. Circuit for searching operating status, 5-
7 is a circuit that searches for an unstartable state, and 6 to 9 are numbers O to
3 PE, 6-1 to 9-1 are local storage, 6-2 to 9
-2 is a processing unit (PU), Q1 to Ω9. f
llo-1-D~3.1210-2. Q10-3-0
~3.1210-4-0~3 are signal lines. Although FIG. 1 shows a configuration in which the number of processor elements is four, in general, there is no limit to four processor elements, and any number of processor elements may be used as long as it is within the practical range.

次に、第１図の概略構成図に基づき概略動作を説明する
が、第６図に示すような簡単な処理例を仮定して説明す
る０図中、１は主記憶、以下は実行に先だって主記憶１
に格納される情報であるが、１００−１〜ｎは入出力デ
ータ、アドレスリスト、プログラムから構成されるｎ個
のブロック、１００−１−１〜１００−ｎ−１は各ブロ
ックに対応する入出力データ、１００−１−２〜１００
−　ｎ　−２は各ブロックに対応するアドレスリストと
プログラム、１０１−１〜ｎはＭＳ　（主記憶１）読出
し先頭アドレス、ＬＳ（ローカル記憶６−１〜９−１）
書込み先頭アドレスＭＳからＬＳへ転送するデータ量を
示すサイズとから構成される各ブロックに対応するブロ
ック情報、１０２はブロック情報１０１−１〜ｎが格納
されている先頭アドレス、１０３は処理すべきブロック
数（この場合ｎ）である。Next, the general operation will be explained based on the schematic configuration diagram of FIG. 1, but the explanation will be made assuming a simple processing example as shown in FIG. Main memory 1
100-1 to n are the n blocks consisting of input/output data, address list, and program, and 100-1-1 to 100-n-1 are the inputs corresponding to each block. Output data, 100-1-2 to 100
- n -2 is an address list and program corresponding to each block, 101-1 to n are MS (main memory 1) read start address, LS (local memory 6-1 to 9-1)
Block information corresponding to each block consists of a size indicating the amount of data to be transferred from the write start address MS to LS, 102 is the start address where block information 101-1 to n is stored, 103 is the block to be processed number (in this case n).

ここで、アドレスリスＩ〜は対応するブロックのプログ
ラムで参照する入出力データのアドレスを指定するリス
トである。例えば、ＦＯＲＴＲＡＮのサブルーチンがブ
ロックに対応するものとすれば、サブルーチンの引数で
指定されるデータが上記の入出力データに相当し、この
入出力データをアクセスする場合のアドレス情報が上記
のアドレスリストに相当する。Here, the address list I~ is a list that specifies the addresses of input/output data to be referenced in the program of the corresponding block. For example, if a FORTRAN subroutine corresponds to a block, the data specified by the subroutine argument corresponds to the above input/output data, and the address information when accessing this input/output data is in the above address list. Equivalent to.

また、各ブロックに対応するアドレスリストとプログラ
ムは主記憶から１割当てられたＰＥのローカル記憶へ転
送されるが、その際、ＭＳ読出し先頭アドレスは主記憶
に格納されている対応するブロックのアドレスリストと
プログラムの先頭アドレスを指定し、また、ＬＳ書込み
アドレスはローカル記憶に格納される先頭アドレスを指
定するものとする。なお、この場合アドレスリストとプ
ログラムは主記憶上で連続して格納されているものと仮
定する。In addition, the address list and program corresponding to each block are transferred from the main memory to the local memory of one assigned PE, but at this time, the MS read start address is the address list of the corresponding block stored in the main memory. and specifies the start address of the program, and the LS write address specifies the start address stored in local storage. In this case, it is assumed that the address list and the program are stored consecutively in the main memory.

ホストプロセッサ４は処理ブロック数１０３と先頭アド
レス１０２をＰＥ制御回路５に転送して処理開始を指示
する、その後、ＰＥ制御回路５は先頭アドレス１０２に
基づき主記憶１からブロック情報１０１−１〜ｎを順次
読出して、ブロック情報１０１−１から順次各ＰＥのロ
ーカル記憶の固定アドレスに転送して起動開始信号を送
出する。起ｌｊＪ開始信号を受付けた各ＰＥは、ローカ
ル記憶の固定アドレスにセットされたブロック情報に基
づき以下のように処理を行う。即ち、ブロック情報にあ
るＭＳ読出し先頭アドレスで指定された主記憶上のアド
レスから対応するアドレスリストとプログラムを読出し
て、ブロック情報に゛あるＬＳ書込み先頭アドレスで指
定されたローカル記憶上のアドレスに書込む。なお、主
記憶′上から読出すアドレスリストとプログラムの量は
、プロツり情報にあるサイズで指定されるものとする。The host processor 4 transfers the number of blocks to be processed 103 and the start address 102 to the PE control circuit 5 and instructs it to start processing.Then, the PE control circuit 5 transfers the block information 101-1 to n from the main memory 1 based on the start address 102. are sequentially read out and sequentially transferred from the block information 101-1 to fixed addresses in the local storage of each PE, and an activation start signal is sent. Each PE that receives the start ljJ start signal performs the following processing based on the block information set in the fixed address of the local storage. That is, the corresponding address list and program are read from the address in the main memory specified by the MS read start address in the block information, and written to the address in the local memory specified by the LS write start address in the block information. It's crowded. It is assumed that the amount of address list and program to be read from the main memory is specified by the size in the plot information.

そして、ローカル記憶に読出したプログラムに従って処
理を開始するが、その際、主記憶上の対応する入出力デ
ータをアクセスする場合は、先程主記憶から読出したア
ドレスリストの情報を用いて行うものとする。なお、プ
ログラムの実行途中で得られる中間結果データはローカ
ル記憶上に格納することが考えられるが、このような場
合には、主記憶とローカル記憶を別々にアクセスする命
令をサポートする必要がある。ＰＥでは処理が終了する
とＰＥ制御回路５へ終了信号を送出する。そして、ＰＥ
制御回路５は未処理のブロックがあれば新たなブロック
のブロック情報をＰＥへ転送して起動信号を送出する。Processing is then started according to the program read out to the local memory, but at that time, when accessing the corresponding input/output data in the main memory, it is done using the information in the address list read out from the main memory earlier. . Note that intermediate result data obtained during program execution may be stored in local storage, but in such a case, it is necessary to support instructions that access main storage and local storage separately. When the processing is completed, the PE sends an end signal to the PE control circuit 5. And P.E.
If there is an unprocessed block, the control circuit 5 transfers the block information of the new block to the PE and sends out a start signal.

このようにして処理ブロック数１０３で指定された全ブ
ロックの処理を終了すると、ＰＥ制御回路５はホストプ
ロセッサへ終了信号を送出して動作を終了するものとす
る。When processing of all blocks specified by the number of blocks to be processed 103 is completed in this manner, the PE control circuit 5 sends an end signal to the host processor and ends the operation.

ここで、各ブロックの処理を各ＰＥへ割当てて処理を行
うが、この割当てに関する制御は本発明の特徴であり、
動作状態保持ラッチ群５−３、起動不能状態保持ラッチ
群５−４、ブロック情報保持レジスタ群５−５．動作状
態サーチ回路５−６゜起動不能状態サーチ回路５−７を
用いて行う。以上では、第２図〜第５図を用いて、各ブ
ロック処理のＰＥへの割当て制御を中心にして実施例を
群細に説明する。Here, the processing of each block is assigned to each PE, and control regarding this assignment is a feature of the present invention.
Operating state holding latch group 5-3, activation disabled state holding latch group 5-4, block information holding register group 5-5. This is performed using the operating state search circuit 5-6 and the inactivation state search circuit 5-7. In the above, the embodiment will be described in detail with reference to FIGS. 2 to 5, focusing on control of allocation of each block process to PE.

第２図は第１図で示したプロセッサエレメント（Ｐ　Ｅ
）制御回路５の構成例である。図中、５−１はブリフェ
ッチバッファ、５−２はマイクロプロセッサ、５−３−
０〜５−３−３は各々プロセッサエレメント（ＰＥ）番
号Ｏ〜３に対応する動作状態保持ラッチ、５−４−０〜
３は各々ＰＥ番号０〜３に対応する起動不能状態保持ラ
ッチ、５−５−〇〜３は各々ＰＥ番号Ｏ〜３に対応する
ブロック情報保持レジスタ、５−６は動作状態サーチ回
路、５−７は起動不能状態サーチ回路、５−１０．５−
１４．５−２０．５−２３〜２６，５−２７．５−２９
〜３２はセレクタ、５−１１゜５−１３．５−１５−１
〜２．５−１８．５−３３はレジスタ、５−１２はカウ
ントアツプ回路。Figure 2 shows the processor element (P E
) This is an example of the configuration of the control circuit 5. In the figure, 5-1 is a brief fetch buffer, 5-2 is a microprocessor, and 5-3-
0 to 5-3-3 are operating state holding latches corresponding to processor element (PE) numbers O to 3, respectively, and 5-4-0 to
3 is a bootable state holding latch corresponding to PE numbers 0 to 3, respectively; 5-5-0 to 3 are block information holding registers corresponding to PE numbers O to 3, respectively; 5-6 is an operation state search circuit; 5- 7 is a startup failure state search circuit, 5-10.5-
14.5-20.5-23 to 26,5-27.5-29
~32 is selector, 5-11°5-13.5-15-1
~2.5-18.5-33 is a register, and 5-12 is a count up circuit.

５−１７はカウントダウン回路、５−１６はゼロ検出回
路、５−１９．５−２１〜２２．５−２７〜２８はデコ
ーダ、　　Ａ４−１〜４．ｆ１５−１〜２゜Ｑ１０−１
−０〜３．　Ｑ１０−２．ｆｌｌＯ−３−０〜３．Ａ１
０−４−０〜３．Ａ５０−１〜４゜Ａ５０−５−１〜２
．　Ａ５０−６〜１９は信号線である。第３図は第２図
に示したＰＥ制御回路５の主要な動作を示す制御フロー
図である。また、第４図と第５図は、各々、第２図に示
した動作状態サーチ回路５−６．起動不能状態サーチ回
路５−７の構成例である。なお、第４図に於て、５−３
−〇〜３は各々、ＰＥ番号０〜３に対応する動作状態保
持ラッチ（第２図に図示）、５−４−０〜３は各々、Ｐ
Ｅ番号Ｏ〜３に対応する起動不能状態保持ラッチ（第２
図に図示）５−３４〜３５゜５−４５はＯＲ回路、５−
３６〜３９．５−４４゜５−４６〜４７．　はＡＮＤ回
路、５−４０〜４３゜５−４８〜５３はＮ　Ｏ７回路、
Ａ５０−５−１〜２．１５０−１２は信号線である。5-17 is a countdown circuit, 5-16 is a zero detection circuit, 5-19. 5-21 to 22. 5-27 to 28 are decoders, A4-1 to 4. f15-1~2゜Q10-1
-0~3. Q10-2. fllO-3-0~3. A1
0-4-0~3. A50-1~4゜A50-5-1~2
．． A50-6 to A50-19 are signal lines. FIG. 3 is a control flow diagram showing the main operations of the PE control circuit 5 shown in FIG. Further, FIGS. 4 and 5 respectively show the operating state search circuit 5-6 shown in FIG. This is an example of the configuration of a startup failure state search circuit 5-7. In addition, in Figure 4, 5-3
-〇~3 are operating state holding latches corresponding to PE numbers 0~3 (shown in Figure 2), 5-4-0~3 are P
The unstartable state holding latch (second
(Illustrated in the figure) 5-34 to 35° 5-45 is an OR circuit, 5-
36-39.5-44°5-46-47. is an AND circuit, 5-40~43°5-48~53 is an NO7 circuit,
A50-5-1 to 2.150-12 are signal lines.

また、第５図に於て、５−３−０〜３と５−４−〇〜３
は第４図と同様であり、５−５０．５−５２〜５６．５
−６０〜６１はＡＮＤ回路、５−５１．５−５９はＯＲ
回路、５−５７〜５８，５−６２はＮＯＴ回路、　Ａ５
０−７〜８．　Ａ５０−１３は信号線である。Also, in Figure 5, 5-3-0~3 and 5-4-0~3
is the same as in Figure 4, and 5-50.5-52 to 56.5
-60 to 61 are AND circuits, 5-51.5-59 are OR circuits
Circuit, 5-57 to 58, 5-62 are NOT circuits, A5
0-7~8. A50-13 is a signal line.

ホストプロセッサ４より、信号線２４−４を介して、第
６図で示したような処理を指定する命令がＰＥ制御回路
５に入力されると、デコーダ５−１９で解読され、信号
線Ｑ５０−１５を介してマイクロコンピュータ５−２に
その旨が伝達される。When a command specifying the processing shown in FIG. 6 is input from the host processor 4 to the PE control circuit 5 via the signal line 24-4, it is decoded by the decoder 5-19 and sent to the signal line Q50- 15 to the microcomputer 5-2.

すると、マイクロコンピュータ５−２はＰＥｌｉＪ号０
〜３のうち全てが起動不能かどうかを、起動不能状態サ
ーチ回路５−７から出力される信号線Ｑ５０−７に基づ
きチェックする。信号ａＱ５０−７は第５図に示すよう
に、起動不能状態保持ラッチ５−４−０〜３が全てＯＮ
のとき、即ち、全ＰＥが起動不能状態のときＯＮとなる
。Then, the microcomputer 5-2 reads PEliJ No. 0.
.about.3 are all unstartable, based on the signal line Q50-7 output from the unstartable state search circuit 5-7. As shown in FIG. 5, the signal aQ50-7 indicates that all start-up disabled state holding latches 5-4-0 to 3 are ON.
It turns ON when , that is, when all PEs are in a non-startable state.

従って、信号線ｆｉ５０−７がＯＮのときは、全ＰＥが
起動不能状態であり、ホストプロセッサ４からの命令を
実行できない。このため、マイクロコンピュータ５−２
は信号線Ｑ４−３を介して処理不可をホストプロセッサ
４へ報告して終了する。Therefore, when the signal line fi50-7 is ON, all PEs are in a non-startable state and cannot execute instructions from the host processor 4. For this reason, the microcomputer 5-2
reports the inability to process to the host processor 4 via the signal line Q4-3 and ends.

一方、信号線Ｑ５０−７がＯＦＦのときは、信号線Ｑ４
−３を介してホストプロセッサ４へ処理可能を報告する
と、ホストプロセッサ４は信号線Ｑ４−１．セレクタ５
−１４を介して、処理ブロック数１０３（第６図参照）
をレジスタ５−１５−２へ、また、信号ｍΩ４−１．セ
レクタ５−１０を介して先頭アドレス１０２（第６図参
照）をレジスタ５−１１ヘセツトする。その後、レジス
タ５−１１にセットした先頭アドレスに基づき、カウン
トアツプ回路５−１２で順次アドレス更新しながら、信
号線Ｑ５−１を介してポート２−３に主記憶１への読出
しアドレスを送出し、ブロック情報（第６図に図示した
１０１−１〜ｎ）を主記憶１から読出し、信号線Ｑ５−
２を介してブリフェッチバッファ５−１に順次格納する
。上記のブロック情報の読出しと並行して、第３図に示
した制御フローに従ってマイクロコンピュータ５−２が
制御を行う、以下では第３図に従って動作を詳細に説明
する。On the other hand, when signal line Q50-7 is OFF, signal line Q4
-3, the host processor 4 reports that processing is possible through the signal lines Q4-1. selector 5
-14, the number of processing blocks is 103 (see Figure 6)
to the register 5-15-2, and the signal mΩ4-1. The start address 102 (see FIG. 6) is set in the register 5-11 via the selector 5-10. After that, based on the start address set in the register 5-11, the read address to the main memory 1 is sent to the port 2-3 via the signal line Q5-1 while updating the address sequentially in the count-up circuit 5-12. , reads the block information (101-1 to 101-n shown in FIG. 6) from the main memory 1, and connects the signal line Q5-
2 and sequentially stored in the brief fetch buffer 5-1. In parallel with the reading of the block information, the microcomputer 5-2 performs control according to the control flow shown in FIG. 3.The operation will be described in detail below according to FIG.

まず、５１００で全ブロックに対して各ＰＥへ起動を掛
けたかどうかチックする。これは、レジスタ５−１５−
２に最初セットした処理ブロック数を、１ブロツク起動
を掛ける毎にカウントダウン回路５−１７でカウントダ
ウンして行き、値が０になったときゼロ検出回路５−１
６から出力される信号線Ｑ５０−１がＯＮとなることに
より全ブロックの起動終了を認識する。First, at 5100, a check is made to see if each PE has been activated for all blocks. This is register 5-15-
The countdown circuit 5-17 counts down the number of processing blocks initially set to 2 every time one block is activated, and when the value reaches 0, the zero detection circuit 5-1
When the signal line Q50-1 output from 6 is turned ON, it is recognized that the activation of all blocks has been completed.

（１）全ブロックの起動が終了していない場合コニの場
合は、まず、５ｌｏｔで非動作中のＰＥがあるかどうか
をチェックする。非動作中のＰＥが無ければ非動作中の
ＰＥが出現するまで新たなブロックの起動を待つが、こ
の間に、５１０４で全ＰＥが起動不能状態かどうかをチ
ェック（信号線Ｑ　５０−７で認識）する。全ＰＥが起
動不能にであれば、マイクロコンピュータ５−２は信号
線Ｑ４−２を介してホストプロセッサ４へ異常終了を報
告し、実行中の処理を中断する。(1) When activation of all blocks has not been completed In the case of Koni, first check whether there is an inactive PE in 5 lots. If there is no non-operating PE, activation of a new block is waited until a non-operating PE appears, but during this time, it is checked in 5104 whether all PEs are in the activation-incapable state (recognized by signal line Q 50-7). )do. If all PEs are unable to start, the microcomputer 5-2 reports abnormal termination to the host processor 4 via the signal line Q4-2, and interrupts the process being executed.

なお、各ＰＥはブロックの処理中にマシンチェック等の
動作不能要因を検出すると、信号線ＱＩＯ−４−０〜３
　（ＰＥ番号Ｏ〜３に対応）。Note that when each PE detects an inoperable factor such as a machine check while processing a block, it connects signal lines QIO-4-0 to QIO-3.
(Corresponding to PE numbers O to 3).

セレクタ５−２９〜３１を介して対応する起動不能状態
保持ラッチ５−４−〇〜３をＯＮにするものとする。It is assumed that the corresponding activation disabled state holding latches 5-4-0 to 3 are turned on via the selectors 5-29 to 31.

次に、非動作中のＰＥが有る場合の処理手順を記すが、
ここで、非動作中のＰＥが有るかどうかは次のようにし
て認識する。Next, we will describe the processing procedure when there is an inactive PE.
Here, whether or not there is an inactive PE is recognized as follows.

即ち、動作状態を保持ラッチと、動作状態保持ラッチと
対応する起動不能状態保持ラッチが共にＯＦＦのものが
あれば非動作中のＰＥが有るものとする。なお、動作状
態保持ラッチは対応するＰＥへの起動が掛かる（信号線
２１０−１−〇〜３を介して起動信号が送出とＯＮとな
り、ＰＥでの処理が終了する（信号線Ｑ１〇−３−〇〜
３を介して終了信号を受取る）とＯＦＦになるものとす
る。具体的には、非動作中のＰＥが有るかどうかの情報
は、第４図に図示した動作状態サーチ回路５−６に基づ
き信号線Ｑ５０−５−１を介してマイクロコンピュータ
５−２に転送されて認識される。That is, if both the operating state holding latch and the activation disabled state holding latch corresponding to the operating state holding latch are OFF, it is assumed that there is an inactive PE. Note that the operating state holding latch is activated to the corresponding PE (the activation signal is sent and turned ON via signal lines 210-1-0 to 210-3, and the processing in the PE is completed (signal line Q1-3). −〇〜
3), it will turn OFF. Specifically, information as to whether there is an inactive PE is transferred to the microcomputer 5-2 via the signal line Q50-5-1 based on the operating state search circuit 5-6 shown in FIG. be recognized.

非動作中のＰＥがあれば、まず第３図に図示した５１０
２でブリフェッチバッファ５−１に必要とするブロック
情報が前もって主記憶１から読出されていれば、引続い
て５１０３の処理を行う。前もって読出されていなけれ
ば読出されるまで待って、８１０３の処理を行う。If there is an inactive PE, first 510 shown in FIG.
If the block information required for the brief fetch buffer 5-1 has been read out from the main memory 1 in advance in step 2, then the process in step 5103 is performed. If it has not been read in advance, wait until it is read and perform the process of 8103.

次に、５１０３での処理手順を記す。まず、動作状態サ
ーチ回路５−６から信号ｍΩ５０−１２を介して出力さ
れる非動作中のＰＥ番号の（非動作中ＰＥが複数あれば
、その中から１つを選択）のエンコード情報をデコーダ
５−２１に入力する。ここで、エンコード情報は第４図
の信号線Ｑ５０−１２のビット０とビット１に図示した
ように２ビツトで作成され、ビット０＝０．ビット１＝ＯのときＰＥ番号Ｏ’　　＝
Ｏ，’　　＝１’　　ＰＥ番号１’　　＝１．　　　’
　　＝Ｏ’　　ＰＥ番号２’　　＝１．　　　’　　＝
１’　　ＰＥ番号３を指定するものとする。そして、マ
イクロコンピュータ５−２が信号、ｌ１ＩＱ５０−３を
介してイネーブル信号をセレクタ５−２１に送出すると
。Next, the processing procedure at 5103 will be described. First, the encoded information of the non-operating PE number (if there are multiple non-operating PEs, one is selected from them) output from the operating state search circuit 5-6 via the signal mΩ50-12 is decoded. Enter in 5-21. Here, the encoded information is created with 2 bits as shown in bit 0 and bit 1 of the signal line Q50-12 in FIG. 4, and bit 0=0. When bit 1 = O, PE number O' =
O,' = 1' PE number 1' = 1. '
=O' PE number 2' =1. '＝
1' PE number 3 shall be specified. Then, when the microcomputer 5-2 sends an enable signal to the selector 5-21 via the signal l1IQ50-3.

前述のエンコード情報が解読され、信号線Ｑ１０−１−
０〜３のどれかがＯＮとなり対応するＰＥへ起動信号が
送出する。このとき同時に、動作状態保持ラッチ５−３
−０〜３のうち対応するラッチをＯＮにして動作状態と
する。The aforementioned encoded information is decoded and the signal line Q10-1-
Any one of 0 to 3 turns ON and a start signal is sent to the corresponding PE. At the same time, the operating state holding latch 5-3
The corresponding latch among -0 to 3 is turned on to be in an operating state.

並行して、ブリフェッチバッファ５−１からブロック情
報を読出し、信号線Ｑ５０−１０゜セレクタ５−２０．
信号線Ｑ１０−２を介して各ＰＥへブロック情報を送出
する。このとき、同時に、上記ブロック情報をブロック
情報保持レジスタ５−５−０〜３のうち対応するレジス
タヘセットする。その後、５１００へ戻り全ブロックの
起動が終了していない場合の処理手順が終了する。In parallel, block information is read from the brief fetch buffer 5-1, and the signal line Q50-10° selector 5-20.
Block information is sent to each PE via the signal line Q10-2. At this time, at the same time, the block information is set in the corresponding register among the block information holding registers 5-5-0 to 5-3. Thereafter, the process returns to 5100 and the processing procedure when all blocks have not been activated ends.

（２）全ブロックの起動が終了した場合：この場合はま
ず、８１０６で動作中のＰＥが有るかどうかチェックす
る。これは、マイクロプロセッサ５−２が、動作状態サ
ーチ回路５−６から信号線１２５０−５−２を介し転送
される情報に基づいて認識する。(2) When activation of all blocks is completed: In this case, first, it is checked in 8106 whether there is any PE in operation. This is recognized by the microprocessor 5-2 based on information transferred from the operating state search circuit 5-6 via the signal line 1250-5-2.

このとき、信号線Ｑ５０−５−２は、第４図に図示した
ように動作状態保持ラッチ５−３−〇〜３のうち、どれ
か１つでもＯＮのものがあればＯＮとなり、動作中のＰ
Ｅが有ることを示す。At this time, the signal line Q50-5-2 is turned ON if any one of the operating state holding latches 5-3-0 to 3 is ON as shown in FIG. P of
Shows that E exists.

動作中のＰＥが無ければ、マイクロプロセッサ５−２は
全ブロックの処理が正常に終了したと認識し、信号線Ｑ
４−２を介してホストプロセッサ４へ正常終了報告を行
う。If there is no PE in operation, the microprocessor 5-2 recognizes that the processing of all blocks has been completed normally, and connects the signal line Q.
A normal completion report is sent to the host processor 4 via 4-2.

動作中のＰＥが有れば、５１０７で全ＰＥが起動不能状
態かどうかチェックする。もし、全ＰＥが起動不能状態
であれば前述したｓｌｏｇの処理を行う、全ＰＥが起動
不能状態で無ければ８１０８の処理を行う、この５１０
８ではＰＥ起動後にＰＥでマシンチェック等の動作不能
要因が発生し、対応する動作不能状態保持ラッチがＯＮ
になっているかどうかをチェックする。If there are any PEs in operation, it is checked in step 5107 whether all PEs are in an unstartable state. If all PEs are in an unstartable state, the above-mentioned slog process is performed, and if all PEs are not in an unstartable state, the process of 8108 is performed.
8, an inoperable factor such as a machine check occurs in the PE after the PE starts, and the corresponding inoperable state holding latch is turned ON.
Check whether it is.

具体的には、第５図に図示したように、起動不能状態保
持ラッチ（５−４−０〜３）がＯＮで、かつ、対応する
動作状態保持ラッチ（５−３−〇〜３）がＯＮのものが
あれば、信号ｍｕ５０−８がＯＮとなり、実行中に起動
不能ＰＥが発生したことがマイクロプロセッサ５−２へ
報告される。Specifically, as shown in FIG. 5, the unstartable state holding latches (5-4-0 to 3) are ON, and the corresponding operating state holding latches (5-3-0 to 3) are ON. If any PE is ON, the signal mu50-8 is turned ON, and the microprocessor 5-2 is notified that an unstartable PE has occurred during execution.

実行中に起動不能ＰＥが発生していなければ５１０６へ
戻り、発生していれば５１０９の処理を行う、　５１０
９では前述の８１０１の処理と同様にして、非動作中の
ＰＥが有るかどうかチェックする。If an unstartable PE has not occurred during execution, the process returns to 5106, and if it has occurred, performs the process of 5109, 510
At step 9, in the same manner as the process at step 8101 described above, it is checked whether there is an inactive PE.

非動作中のＰＥが無ければ、非動作中のＰＥが出現する
まで待つが、この間に全ＰＥが起動不能状態になってい
るかがチェックされ、もし、全ＰＥが起動不能状態であ
れば前述の３１０５の処理を行って異常終了する。If there is no non-operating PE, the system waits until a non-operating PE appears, but during this time it is checked whether all PEs are in a non-startable state, and if all PEs are in a non-startable state, the above-mentioned process is performed. Processing in step 3105 is performed and the process ends abnormally.

一方、非動作中のＰＥがあれば５ＬＩＯ。On the other hand, if there is a non-operating PE, 5 LIO.

５１１２の処理を行って３１０６に戻る。まず。After processing 5112, the process returns to 3106. first.

５Ｌｉｏでは実行中に発生した起動不能ＰＥに対応する
動作状態保持ラッチをＯＦＦにする。In 5Lio, the operating state holding latch corresponding to the unstartable PE that occurs during execution is turned OFF.

具体的には、起動不能状態サーチ回路５−７から信号線
Ａ３０−１３を介して出力される実行中に発生した起動
不能ＰＥ番号（複数あればその中から１つを選択）のエ
ンコード情報を、レジスタ５−３３にセットする。Specifically, the encoded information of the unstartable PE number (select one from among them if there is more than one) that occurred during execution is output from the unstartable state search circuit 5-7 via the signal line A30-13. , set in register 5-33.

なお、エンコード方式は前述した非動作中ＰＥ番号の場
合と同様であり、具体的な論理構成は第５図を参照され
たい。The encoding method is the same as that for the inactive PE number described above, and please refer to FIG. 5 for the specific logical configuration.

そして、レジスタ５−３３にセットしたＰＥ番号のエン
コード情報を信号線Ｑ５０−１４を介してデコーダ５−
２２に入力する。そして、マイクロコンピュータ５−２
が信号線ｆｉ５０−４を介してデコーダ５−２２にイネ
ーブル信号を送出すると、エンコード情報がデコードさ
る６そして、デコード情報がセレクタ５−２３〜２６を
介して対応するＰＥの動作状態保持ラッチのリセット端
子に入力されてＯＦＦとなる。Then, the encoded information of the PE number set in the register 5-33 is sent to the decoder 5-3 via the signal line Q50-14.
22. And microcomputer 5-2
sends an enable signal to the decoder 5-22 via the signal line fi50-4, the encoded information is decoded6, and the decoded information is transmitted via the selectors 5-23 to 5-26 to reset the operating state holding latch of the corresponding PE. It is input to the terminal and turned OFF.

続いて、５１１２の処理を行う。Subsequently, the process of 5112 is performed.

基本的な動作は前述した５１０３と同様であるが、以下
の点が異なる。即ち、５１０３に於ては、ブロック情報
をブリフェッチバッファ５−１から取出したが、５１１
２ではレジスタ５−３３にセットしたＰＥ番号のエンコ
ード情報に基づき、ブロック情報保持レジスタ５−５−
〇〜３の中から対応するブロック情報をセレクタ５−２
７を介して取出す点が異なる。The basic operation is the same as that of 5103 described above, but the following points are different. That is, in 5103, block information was extracted from the brief fetch buffer 5-1, but in 511
2, based on the encoded information of the PE number set in the register 5-33, the block information holding register 5-5-
Selector 5-2 selects the corresponding block information from 〇 to 3.
The difference is that it is taken out via 7.

以上で本発明の一実施例に基づく動作説明を終了する０
以上の説明は、第６図に示した処理例を基にした動作説
明であるが、一般的には第６図に示した処理例に限定さ
れる必要はない。This concludes the explanation of the operation based on one embodiment of the present invention.
Although the above description is an operation description based on the processing example shown in FIG. 6, it is generally not necessary to be limited to the processing example shown in FIG.

例えば、第６図では各ブロックの処理のプログラムは各
ブロック毎にＰＥで別々に主記憶からローカル記憶に読
出しているが、もし、各ブロックに共通であれば、前も
ってブロードキャストして置くことが考えられる。また
、入力データについても、共通データがあれば同様のこ
とが考えられる。For example, in Figure 6, the processing program for each block is read from the main memory to the local memory separately by the PE for each block, but if it is common to each block, it may be a good idea to broadcast it in advance. It will be done. Moreover, the same thing can be considered for input data if there is common data.

さらに、各ＰＥでのプログラムの実行途中で必要となる
データを必要に応じて主記憶から読出すのではなく、プ
ログラムの開始前に前もって必要なデータを一括して主
記憶からローカル記憶に読出しておき、プログラムの実
行中は極力主記憶をアクセスしないようにする方式も考
えられる。また、第６図では、ホストプロセッサ４が複
数のブロックをひとまとめにして、一括して処理するよ
うにプロセッサエレメントＩＩＪ御回路５に指示するが
、各ブロック毎に指示する方式も考えられる。Furthermore, instead of reading the necessary data from the main memory as needed during the program execution in each PE, the necessary data is read in advance from the main memory to the local memory in advance before the program starts. Another possible method is to avoid accessing the main memory as much as possible while the program is running. Further, in FIG. 6, the host processor 4 instructs the processor element IIJ control circuit 5 to process a plurality of blocks at once, but a method of instructing each block separately is also conceivable.

例えば、共通データをブロードキャストする場合の処理
は次のように行う。For example, the process for broadcasting common data is performed as follows.

ホストプロセッサ４より、信号線Ｑ４−４を介して、ブ
ロードキャストを指定する命令がＰＥ制御回路５に入力
されると、デコーダ５−１９で解読され、信号ｍ　Ｄ、
　５０−１６を介してマイクロコンピュータにその昏が
伝達される。同時に、ホストプロセッサ４は、信号線Ｑ
４−１を介して、読出すべき主記憶の先頭アドレスをレ
ジスタ５−１１へ、転送するデータサイズをレジスタ５
−１５−２へ、また、ローカル記憶に書込むべき先頭ア
ドレスをレジスタ５−１５−１にセットする。When a command specifying broadcast is input from the host processor 4 to the PE control circuit 5 via the signal line Q4-4, it is decoded by the decoder 5-19, and the signal mD,
The information is transmitted to the microcomputer via 50-16. At the same time, the host processor 4 connects the signal line Q
4-1, the start address of the main memory to be read is sent to register 5-11, and the data size to be transferred is sent to register 5-11.
-15-2, and sets the start address to be written to the local memory in register 5-15-1.

なお、ここでは、転送するデータのアドレス付けは主記
憶上、ローカル記憶上ともに連続アドレスを仮定する。Here, it is assumed that the addressing of the data to be transferred is continuous in both the main memory and the local memory.

その後、レジスタ５−１５−１にセットされた先頭アド
レスを、信号線１５０−９゜セレクタ５−２０．信号線
ＱＩＯ−２を介して全ＰＥへ転送する。引続いて、レジ
スタ５−１１にセットされた先頭アドレスをカウントア
ツプ回路５−１２でカウントアツプしながら、主記憶か
ら順次データをプリフェッチバッファ５−１に読出し、
レジスタ５−１５−２にセットしたデータサイズだけデ
ータを取出して信号１７１ｔＱ５０−１０゜セレクタ５
−２０、信号線ｆｉｌｏ−２を介して全ＰＥへ転送して
ローカル記憶に書込む。Thereafter, the start address set in the register 5-15-1 is transferred to the signal line 150-9° selector 5-20. Transfer to all PEs via signal line QIO-2. Subsequently, while counting up the start address set in the register 5-11 by the count-up circuit 5-12, data is sequentially read from the main memory to the prefetch buffer 5-1.
Extract data by the data size set in register 5-15-2 and send signal 171tQ50-10° selector 5
-20, transfer to all PEs via signal line filo-2 and write in local storage.

また、第２図に於て、デコーダ５−２７とデコーダ５−
２８は、ホストプロセッサ４から転送される指令に基づ
き、起動不能状態保持ラッチ５−４−０〜３を０Ｎ１０
ＦＦするときに用いられるものである。なお、ホストプ
ロセッサ４が起動不能状態ラッチの５−４−０〜３の状
態を知る必要があれば、そのための機能は容易に付加可
能であり、動作状態保持ラッチ５−３−０〜３について
も同様のことが言える。In addition, in FIG. 2, decoder 5-27 and decoder 5-
28 sets the unstartable state holding latches 5-4-0 to 3 to 0N10 based on the command transferred from the host processor 4.
This is used when performing FF. Incidentally, if the host processor 4 needs to know the status of the non-startable state latches 5-4-0 to 3, a function for that purpose can be easily added. The same thing can be said.

第７図は本発明の他の実施例の構成図である。FIG. 7 is a block diagram of another embodiment of the present invention.

第７図は第６図にベクトル処理装置を組込んだ構成であ
り、１０はベクトル処理装置、２−８はベクトル処理装
置が主記憶１をアクセスするためのボート、１１１はベ
クトル処理袋！！１０とボート２−８の間で、アドレス
および読出しデータ／書込みデータ等を転送するための
信号線、Ｑ１２はベクトル処理装置１０とホストプロセ
ッサ４の間で、制御情報（例えば、ホストプロセッサ４
がらベクトル処理装置１０への処理開始を指示する起動
信号、ベクトル処理袋［１０からホストプロセッサ４へ
の処理終了信号等）を転送するための信号線であり、そ
の他は第６図と同様である。なお、ベクトル処理装置１
０としては、Ｃｒａｙ　　１＋ＨＩＴＡＣ３−８１０等
のベクトル処理装置が考えられる。FIG. 7 shows a configuration in which a vector processing device is incorporated in the configuration shown in FIG. 6, where 10 is a vector processing device, 2-8 is a boat for the vector processing device to access the main memory 1, and 111 is a vector processing bag! ! Q12 is a signal line for transferring addresses, read data/write data, etc. between the vector processing device 10 and the host processor 4, and a signal line Q12 is used to transfer control information (for example, the host processor 4
This is a signal line for transmitting a start signal for instructing the vector processing device 10 to start processing, a vector processing signal (a processing end signal from 10 to the host processor 4, etc.), and the rest is the same as in FIG. . Note that the vector processing device 1
0, a vector processing device such as Cray 1+HITAC3-810 can be considered.

プログラムを実行する場合、並列処理に適した処理はプ
ロセッサエレメント制御回路５の制御の下に、プロセッ
サエレメント番号Ｏ〜３で並列処理させ、一方、ベクト
ル処理（パイプライン処理）に適した処理はベクトル処
理装置１０で処理させることにより、高速化、高効率化
が実現できる。When executing a program, processing suitable for parallel processing is performed in parallel by processor element numbers O to 3 under the control of the processor element control circuit 5, while processing suitable for vector processing (pipeline processing) is performed in parallel using processor element numbers O to 3. By processing with the processing device 10, high speed and high efficiency can be realized.

なお、第７図に於て、ベクトル処理装置１０は、高度の
パイプライン方式を採用した比較的大規模な処理装置で
あり、一方、プロセッサエレメント番号Ｏ〜３は比較的
小規模の処理装置である。また、プロセッサエレメント
の数は４台に限定する必要はなく、実際には数十〜数百
台で構成され、さらに、ベクトル処理装置も複数台で構
成することも考えられる。Note that in FIG. 7, the vector processing device 10 is a relatively large-scale processing device that employs an advanced pipeline system, while processor element numbers O to 3 are relatively small-scale processing devices. be. Further, the number of processor elements does not need to be limited to four; in fact, it may consist of several tens to hundreds of processor elements, and it is also conceivable that the vector processing device may also be composed of a plurality of units.

〔Effect of the invention〕

本発明によれば、各処理装置への処理の割当てを実行時
に動的に行うことが出来るため、利用者は処理装置の台
数および処理装置の番号を意識する必要がなく、また、
処理装置が動作中に故障等により動作不能状態になった
場合、動作可能な他処理装置へ処理の再割当てが可能と
なるため信頼性を向上できる並列処理装置を提供できる
という効果があり、また、ベクトル処理装置を組込み。According to the present invention, since processing can be dynamically assigned to each processing device at the time of execution, the user does not need to be aware of the number of processing devices and the number of the processing device, and
If a processing device becomes inoperable due to a failure or the like during operation, processing can be reassigned to another operational processing device, which has the effect of providing a parallel processing device that can improve reliability. , incorporates a vector processing unit.

ベクトル処理向きの処理をベクトル処理装置で実行させ
、また、並列処理向きの処理を並列処理装置で実行させ
ることによって、処理の内容に応じた負荷分散が可能と
なり、高速化・高効率化が実現できるという効果がある
。By executing processes suitable for vector processing on a vector processing device, and executing processes suitable for parallel processing on a parallel processing device, it is possible to distribute the load according to the content of the processing, achieving higher speed and efficiency. There is an effect that it can be done.

[Brief explanation of drawings]

第１図は本発明の一実施例の概略構成図、第２図は第１
図に図示したプロセッサエレメント制御回路の構成例、
第３図は本発明の一実施例の動作を説明するための制御
フロー、第４図と第５図は、各々、第２図に図示した動
作状態サーチ回路と起動不能状態サーチ回路の構成例、
第６図は本発明の一実施例の動作を説明するための処理
例、第７図は本発明の他の実施例の構成図を示す図であ
る。FIG. 1 is a schematic configuration diagram of an embodiment of the present invention, and FIG.
An example of the configuration of the processor element control circuit shown in the figure,
FIG. 3 is a control flow for explaining the operation of an embodiment of the present invention, and FIGS. 4 and 5 are configuration examples of the operating state search circuit and the inactivation state search circuit shown in FIG. 2, respectively. ,
FIG. 6 is a processing example for explaining the operation of one embodiment of the present invention, and FIG. 7 is a diagram showing a configuration diagram of another embodiment of the present invention.

Claims

[Claims] 1. Consisting of a plurality of processing devices, a control device that controls the processing devices, and a storage device shared by the processing devices and the control device, each processing device has a different in a parallel processing device capable of executing a plurality of processes in parallel;
a second holding means for holding a state as to whether or not each processing device is unable to start processing due to a failure or the like, and inputting information from the first holding means and the second holding means to allocate processing. The parallel processing apparatus is characterized by having a detecting means for detecting possible processing apparatus numbers, and an assigning means for allocating processing to each of the processing apparatuses in response to information output from the detecting means. 2. In the parallel processing device of item 1, there is also a third parallel processing device that holds information necessary to execute the processing assigned to each processing device.
, the detecting means is capable of detecting a processing device number that becomes inoperable during execution of processing, and the assigning means is capable of detecting a processing device number that becomes inoperable during execution of processing. , a parallel processing device capable of reassigning processes by extracting information necessary to execute a process from a corresponding third holding means.