JPH05134998A

JPH05134998A - Multiprocessor system

Info

Publication number: JPH05134998A
Application number: JP3300243A
Authority: JP
Inventors: Masato Nakamura; 真人中村
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1991-11-15
Filing date: 1991-11-15
Publication date: 1993-06-01

Abstract

PURPOSE: To detect faults in all processors by dividing a plurality of processors into a plurality of fault detection groups and connecting to the system bus a device that detects and processes faults of the entire system based on fault information from the fault detection processor of each group. CONSTITUTION: A plurality of processors 2 are divided into a plurality of fault detection groups 3, each consisting of functionally related processors, and the processor that forms the core of each fault detection group 3 is designated as a fault detection processor 2a that detects faults within the group. Fault information I1 from the fault detection processor 2a is sent to a fault information processing device 4 connected to the system bus 1, which detects and processes faults of the entire system. Faults in all the processors 2 can thus be detected, and the increase in load is suppressed because the device only has to process the fault information from the fault detection processor 2a of each group 3.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a multiprocessor system in which a plurality of processors, each having a self-fault diagnosis function, are coupled by a system bus, and in particular to a multiprocessor system provided with fault-tolerance functions such as system-wide fault monitoring, degraded (one-sided) operation, shutdown, and automatic collective restart.

[0002]

[Prior Art] To improve its reliability, a multiprocessor system needs not only the self-fault diagnosis function of each individual processor but also fault-tolerance functions for the system as a whole, such as fault monitoring, degraded (one-sided) operation, shutdown, and automatic collective restart.

[0003] FIG. 5 is a block diagram showing a conventional multiprocessor system provided with fault-tolerance functions of this kind (Japanese Patent Laid-Open No. 64-55669).

[0004] As shown in the figure, a plurality of processors 21, 22, 23, each having a self-fault diagnosis function, and input/output channel ports (I/O ch.) 24 to 27 for displays, printers, external storage devices and the like, which operate on commands from the processors 21, 22, 23, are coupled by a system bus 28. A system monitoring device 29 (hereinafter, monitoring device 29) is also connected to the system bus 28. The system monitoring device 29 has a fault diagnosis counter 30 (hereinafter, counter 30) and, once the system has started, increments the counter 30 at a fixed period.

[0005] In this multiprocessor system, some of the processors 21, 22, 23, for example the processors 21 and 22, are designated in advance as important processors. An important processor is a processor central to the system: as long as it is healthy, system operation can continue even if other processors fail.

[0006] In the multiprocessor system configured as above, at power-on the monitoring device 29 sends a reset signal S1 to all the processors 21, 22, 23 and the input/output channel ports 24 to 27. On receiving the reset signal S1, all the processors 21, 22, 23 initialize themselves and start operating simultaneously. The important processors 21, 22 also output an alive signal S2, indicating that they are healthy, to the monitoring device 29 at a fixed period.

[0007] The monitoring device 29 resets the counter 30 whenever it receives the alive signal S2 from the important processors 21, 22. However, if all the important processors 21, 22 fail and the alive signal S2 no longer arrives from any of them, the monitoring device 29 keeps incrementing the counter 30.

[0008] If the value of the counter 30 exceeds a preset allowable value without an alive signal S2 being received, the monitoring device 29 recognizes that a system fault has occurred, that is, a situation in which the system cannot continue to operate, and outputs the reset signal S1. As a result, all the processors 21, 22, 23 initialize themselves and restart.

[0009] However, as long as at least one of the important processors 21, 22 is healthy and outputs the alive signal S2, the monitoring device 29 resets the counter 30 periodically, so the reset signal S1 is not output and the system continues to operate.
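The watchdog behavior of the conventional monitoring device 29 described in paragraphs [0006] to [0009] can be sketched as follows. This is only a minimal illustration under stated assumptions, not the implementation of the cited publication: the class name `MonitoringDevice`, its method names, and the choice to clear the counter when the reset signal is output are hypothetical. Only the counter 30, the alive signal S2, the reset signal S1, and the preset allowable value come from the text.

```python
class MonitoringDevice:
    """Sketch of monitoring device 29 with fault diagnosis counter 30."""

    def __init__(self, allowable_value):
        self.allowable_value = allowable_value  # preset allowable value (assumed to be in ticks)
        self.counter = 0                        # counter 30
        self.resets_issued = 0                  # how many times reset signal S1 was output

    def on_alive_signal(self):
        """An important processor sent alive signal S2: reset counter 30."""
        self.counter = 0

    def tick(self):
        """Called once per fixed period; returns True if reset signal S1 is output."""
        self.counter += 1
        if self.counter > self.allowable_value:
            self.resets_issued += 1
            self.counter = 0  # assumption: counting starts over after the restart
            return True
        return False


monitor = MonitoringDevice(allowable_value=3)
for _ in range(5):
    monitor.on_alive_signal()  # a healthy important processor keeps the counter at zero
    monitor.tick()
print(monitor.resets_issued)
```

Note that a fault in a processor that never sends S2 (processor 23 in the text) has no effect on this counter, which is the weakness the invention addresses.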

[0010]

[Problems to Be Solved by the Invention] However, the conventional multiprocessor system described above has the problem that a fault occurring in the processor 23, which is not designated as an important processor, is not detected. If all the processors 21, 22, 23 were designated as important processors, the number of alive signals S2 to be processed by the monitoring device 29 would increase and the load on the monitoring device 29 would become very large.

[0011] The present invention has been made to solve the above problem. Its object is to obtain a multiprocessor system that can detect faults not only in the processors important to the system but also in the other processors, while suppressing the increase in load on the device that detects and processes faults.

[0012]

[Means for Solving the Problems] To achieve the above object, in the multiprocessor system according to the present invention, the plurality of processors are divided into a plurality of fault detection groups, each consisting of functionally related processors, and the processor that forms the core of each fault detection group is designated as a fault detection processor that detects faults within its group. In addition, a fault information processing device, which detects and processes faults of the entire system based on fault information from the fault detection processor of each fault detection group, is connected to the system bus.

[0013]

[Operation] In the multiprocessor system configured as above, faults are detected by the fault information processing device, via the fault detection processor of each fault detection group, not only for the processors important to the system that are designated as fault detection processors but also for the other processors. That is, for all the processors making up the system, faults can be detected per function-specific fault detection group.

[0014] Furthermore, since the fault information processing device that detects and processes faults only has to process the fault information from the fault detection processor of each fault detection group, the increase in load on the fault information processing device is suppressed.

[0015]

[Embodiments] Embodiments of the present invention are described below with reference to the drawings.

[0016] Embodiment 1. FIG. 1 is a block diagram showing Embodiment 1 of the multiprocessor system according to the present invention.

[0017] As shown in the figure, in this multiprocessor system a plurality of processors 2, each having a self-fault diagnosis function and coupled by a system bus 1, are divided into a plurality of fault detection groups 3, each consisting of functionally related processors. In addition, the processor that forms the core of each fault detection group 3 is designated as a fault detection processor 2a. Each fault detection processor 2a detects faults within its own fault detection group 3 and outputs fault information I1, which includes a fault occurrence signal and processor identification data.

[0018] A fault information processing device 4 is also connected to the system bus 1. The fault information processing device 4 detects and processes faults of the entire system based on the fault information I1 from the fault detection processor 2a of each fault detection group 3. As shown in the block diagram of FIG. 2, it consists of a group-specific fault information storage unit 5, a fault information collection control unit 6, and a fault information processing unit 7.

[0019] In the above configuration, the fault detection processor 2a of each fault detection group 3 detects faults within its group by polling the other processors 2 in the fault detection group 3 via the system bus 1. That is, each of the other processors 2, on receiving a polling sequence from the fault detection processor 2a, informs the fault detection processor 2a of the presence or absence of a fault by means of a polling signal S. When the fault detection processor 2a recognizes from the polling signal S that a fault has occurred, it sends fault information I1 to the fault information processing device 4. If a fault occurs in the fault detection processor 2a itself, the fault detection processor 2a immediately sends the fault information I1 to the fault information processing device 4.
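The polling scheme of paragraph [0019] can be sketched as below. This is an illustrative model under stated assumptions rather than the patent's implementation: the class names `Processor`, `FaultInfo`, and `FaultDetectionGroup`, the boolean form of the polling signal S, and the choice that a faulty fault detection processor reports only itself and skips polling are all hypothetical. The text specifies only that I1 carries a fault occurrence signal and processor identification data.

```python
from dataclasses import dataclass, field


@dataclass
class Processor:
    """Processor 2 with a self-fault diagnosis function."""
    processor_id: str
    faulty: bool = False

    def answer_poll(self):
        """Polling signal S, modeled as True when this processor reports a fault."""
        return self.faulty


@dataclass
class FaultInfo:
    """Fault information I1: a fault occurred, plus which processor it concerns."""
    group_id: str
    processor_id: str


@dataclass
class FaultDetectionGroup:
    """Fault detection group 3; `detector` plays the role of fault detection processor 2a."""
    group_id: str
    detector: Processor
    members: list = field(default_factory=list)  # the other processors 2 in the group

    def poll(self):
        """Return the I1 items to send to the fault information processing device 4."""
        # A fault in the fault detection processor itself is reported immediately.
        # Assumption: in that case it does not go on to poll the other members.
        if self.detector.faulty:
            return [FaultInfo(self.group_id, self.detector.processor_id)]
        # Otherwise poll every other member and report those that signal a fault.
        return [FaultInfo(self.group_id, p.processor_id)
                for p in self.members if p.answer_poll()]


group = FaultDetectionGroup("G1", Processor("P0"),
                            [Processor("P1"), Processor("P2", faulty=True)])
print(group.poll())
```

Only the fault detection processor talks to the fault information processing device, so the number of reporting sources equals the number of groups rather than the number of processors.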

[0020] In the fault information processing device 4, when fault information I1 arrives from the fault detection processor 2a of any fault detection group 3, the group-specific fault information storage unit 5 stores the fault information I1. The fault information collection control unit 6 then collects information on the nature of the fault directly from the faulty processor 2 through the system bus 1. Based on the collected information, the fault information processing unit 7 determines the appropriate handling for the fault and sends fault handling information I2, which includes the data required for that handling and a reset signal, to the processors 2 of each fault detection group 3 to handle the fault. The fault-tolerance function thus comes into play.
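The three-stage flow inside the fault information processing device 4 (store, collect, decide) might be modeled as follows. This is a sketch under assumptions: the class name, the `read_fault_details` callback standing in for a direct read over the system bus 1, the `severity` field, and the decision rule are all hypothetical, since the text does not say how unit 7 chooses the handling or how the reset signal is encoded in I2.

```python
class FaultInformationProcessingDevice:
    """Sketch of fault information processing device 4 (units 5, 6 and 7)."""

    def __init__(self, read_fault_details):
        # Hypothetical callback: read_fault_details(processor_id) -> dict.
        # It stands in for reading the faulty processor directly over system bus 1.
        self.read_fault_details = read_fault_details
        self.store = {}  # group-specific fault information storage unit 5

    def receive(self, group_id, processor_id):
        """Handle one item of fault information I1; return fault handling information I2."""
        # Unit 5: store I1 under the fault detection group it came from.
        self.store.setdefault(group_id, []).append(processor_id)
        # Unit 6: collect details of the fault directly from the faulty processor.
        details = self.read_fault_details(processor_id)
        # Unit 7: decide how to handle the fault and build I2.
        return self.decide(group_id, processor_id, details)

    @staticmethod
    def decide(group_id, processor_id, details):
        # Assumed rule: only a fatal fault asserts the reset signal.
        fatal = details.get("severity") == "fatal"
        return {
            "target_group": group_id,
            "processor": processor_id,
            "reset": fatal,  # reset signal carried in I2
            "data": {"action": "restart" if fatal else "isolate"},
        }


device = FaultInformationProcessingDevice(lambda pid: {"severity": "fatal"})
print(device.receive("G1", "P2"))
```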

[0021] As described above, in this multiprocessor system faults can be detected, via the fault detection processor 2a of each fault detection group 3, not only for the processors important to the system that are designated as fault detection processors 2a but also for the other processors 2. That is, for all the processors 2 making up the system, faults can be detected per function-specific fault detection group 3. Moreover, a fault in a fault detection processor 2a, which is a serious fault for the system, can be handled immediately because the fault detection processor 2a sends the fault information I1 to the fault information processing device 4 at once. In other words, a fault occurring in one processor 2 in a fault detection group 3 can be handled according to the importance of that processor 2 within the fault detection group 3.

[0022] Furthermore, since the fault information processing device 4 that detects and processes faults only has to process the fault information I1 from the fault detection processor 2a of each fault detection group 3, the increase in load on the fault information processing device 4 is suppressed.

[0023] Embodiment 2. FIG. 3 is a block diagram showing Embodiment 2 of the multiprocessor system according to the present invention.

[0024] As shown in the figure, in Embodiment 2 the processors 2 within each fault detection group 3, formed as in Embodiment 1, are connected to one another by a fault-detection dedicated bus 8. The fault detection processor 2a of each fault detection group 3 detects faults within its group by polling the other processors 2 in the fault detection group 3 via the fault-detection dedicated bus 8. Providing the fault-detection dedicated bus 8 in this way reduces how often the system bus 1 is used for fault detection and thereby limits the degradation of overall system performance.

[0025] Embodiment 3. FIG. 4 is a block diagram showing Embodiment 3 of the multiprocessor system according to the present invention.

[0026] As shown in the figure, this multiprocessor system differs from Embodiments 1 and 2 in how the fault detection groups 3 are formed. The plurality of processors 2 coupled by the system bus 1 are divided, by functionally related processors, into a plurality of fault detection groups 3 ranging from a highest-level fault detection group to a lowest-level fault detection group, such that each higher-level group contains the lower-level groups. The higher a fault detection group 3 is, the more important its function is to the system. The processor that forms the core of each fault detection group 3 is designated as the fault detection processor that detects faults within that fault detection group 3. In Embodiment 3, the fault detection groups 3 consist of a highest-level fault detection group 3a, an intermediate secondary fault detection group 3b, and a lowest-level tertiary fault detection group 3c, and one processor 2 in each of the groups 3a, 3b, 3c is designated as the fault detection processor 2a, the secondary fault detection processor 2b, and the tertiary fault detection processor 2c, respectively.

[0027] The processors 2 within each of the fault detection groups 3a, 3b, 3c are connected to one another by fault-detection dedicated buses 9, 10, 11, together with the fault detection processor of the next lower group where one exists. That is, the processors 2 in the highest-level fault detection group 3a are connected, together with the secondary fault detection processor 2b of the secondary fault detection group 3b, by the fault-detection dedicated bus 9; the processors 2 in the secondary fault detection group 3b are connected, together with the tertiary fault detection processor 2c of the tertiary fault detection group 3c, by the secondary fault-detection dedicated bus 10; and the processors 2 in the lowest-level tertiary fault detection group 3c are connected by the tertiary fault-detection dedicated bus 11.

[0028] As in Embodiments 1 and 2, a fault information processing device 4 that detects and processes faults of the entire system is connected to the system bus 1.

[0029] In the multiprocessor system configured as above, the fault detection processors 2a, 2b, 2c of the fault detection groups 3a, 3b, 3c perform polling via the fault-detection dedicated buses 9, 10, 11, so that ultimately the fault detection processor 2a of the highest-level fault detection group 3a detects the faults within each of the groups 3a, 3b, 3c and outputs fault information I1 to the fault information processing device 4. Based on the fault information I1 from the fault detection processor 2a of the highest-level fault detection group 3a, the fault information processing device 4 then sends fault handling information I2 to the processors 2 of the fault detection groups 3a, 3b, 3c, as described for Embodiment 1, to handle the fault.
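The hierarchical collection of paragraph [0029] can be pictured as a chain of groups in which each fault detection processor relays what it found to the level above. The following sketch assumes hypothetical names (`Group`, `collect_faults`), string processor identifiers, and a set of faulty identifiers as input; how the results are actually relayed over the dedicated buses 9, 10, 11 is not specified in the text.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Group:
    """A fault detection group (3a, 3b or 3c) with its detector and other members."""
    name: str
    detector: str                                  # id of this group's fault detection processor
    members: list = field(default_factory=list)    # ids of the other processors in the group
    lower: Optional["Group"] = None                # next lower group, reached via its detector


def collect_faults(group, faulty):
    """Poll one level, then recurse downward; returns [(group name, processor id), ...]."""
    found = [(group.name, pid)
             for pid in [group.detector, *group.members] if pid in faulty]
    if group.lower is not None:
        # The lower group's detector sits on this group's dedicated bus and
        # relays whatever it found within its own group.
        found.extend(collect_faults(group.lower, faulty))
    return found


g3c = Group("3c", "2c", ["p5", "p6"])
g3b = Group("3b", "2b", ["p3", "p4"], lower=g3c)
g3a = Group("3a", "2a", ["p1", "p2"], lower=g3b)

# Everything ends up at the top-level detector 2a, which alone reports to device 4.
print(collect_faults(g3a, {"p3", "p6"}))  # [('3b', 'p3'), ('3c', 'p6')]
```

Because each result carries the name of the group in which the fault was found, the handling can take into account both the level of the group and the role of the processor within it.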

[0030] In the multiprocessor system of Embodiment 3, because the function-specific fault detection groups 3a, 3b, 3c are arranged hierarchically, a fault occurring in one processor 2 in the system can be handled according to the importance within the system of the function of the group 3 to which that processor 2 belongs, and also according to the importance of that processor 2 within its group 3.

[0031] Furthermore, since the fault information processing device 4, which detects and processes faults of the entire system, only has to process the fault information I1 from the fault detection processor 2a of the highest-level fault detection group 3a, the load on the fault information processing device 4 is kept even lower than in Embodiments 1 and 2.
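The load argument can be made concrete by counting how many processors report directly to the central device in each arrangement. The function below is only an illustrative tally; its name, its dictionary keys, and the example figures (12 processors in 3 groups) are assumptions, not values from the patent.

```python
def reporting_sources(num_processors, num_groups):
    """Count the processors that report directly to the central device."""
    return {
        "conventional, all processors designated important": num_processors,
        "embodiments 1 and 2, one detector per group": num_groups,
        "embodiment 3, top-level detector only": 1,
    }


for arrangement, count in reporting_sources(12, 3).items():
    print(f"{arrangement}: {count}")
```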

[0032]

[Effects of the Invention] As described above, the multiprocessor system according to the present invention can detect faults not only in the processors important to the system but also in the other processors, while suppressing the increase in load on the device that detects and processes faults.

[0033] Furthermore, a fault occurring in one processor in the system can be handled according to the importance of that processor.

[Brief description of drawings]

FIG. 1 is a block diagram showing Embodiment 1 of the present invention.

FIG. 2 is a block diagram showing the configuration of the fault information processing device.

FIG. 3 is a block diagram showing Embodiment 2 of the present invention.

FIG. 4 is a block diagram showing Embodiment 3 of the present invention.

FIG. 5 is a block diagram showing a conventional example.

[Explanation of symbols]

1: System bus
2: Processor
2a, 2b, 2c: Fault detection processor
3 (3a, 3b, 3c): Fault detection group
4: Fault information processing device
8, 9, 10, 11: Fault-detection dedicated bus

Claims

[Claims]

1. A multiprocessor system in which a plurality of processors, each having a self-fault diagnosis function, are coupled by a system bus, characterized in that: the plurality of processors are divided into a plurality of fault detection groups, each consisting of functionally related processors; the processor that forms the core of each fault detection group is designated as a fault detection processor that detects faults within its group; and a fault information processing device that detects and processes faults of the entire system based on fault information from the fault detection processor of each fault detection group is connected to the system bus.

2. A multiprocessor system in which a plurality of processors, each having a self-fault diagnosis function, are coupled by a system bus, characterized in that: the plurality of processors are divided into a plurality of fault detection groups, each consisting of functionally related processors; the processor that forms the core of each fault detection group is designated as a fault detection processor that detects faults within its group; the processors within each fault detection group are connected to one another by a fault-detection dedicated bus; and a fault information processing device that detects and processes faults of the entire system based on fault information from the fault detection processor of each fault detection group is connected to the system bus.

3. A multiprocessor system in which a plurality of processors, each having a self-fault diagnosis function, are coupled by a system bus, characterized in that: the plurality of processors are divided, by functionally related processors, into a plurality of fault detection groups ranging from a highest-level fault detection group to a lowest-level fault detection group, such that each higher-level group contains the lower-level groups; the processor that forms the core of each fault detection group is designated as a fault detection processor that detects faults within its group; the processors within each fault detection group are connected, together with the fault detection processor of the next lower group, by a fault-detection dedicated bus; and a fault information processing device that detects and processes faults of the entire system based on fault information from the fault detection processor of the highest-level fault detection group is connected to the system bus.