JP2000353154A

JP2000353154A - Fault monitoring system

Info

Publication number: JP2000353154A
Application number: JP11163567A
Authority: JP
Inventors: Tomoaki Nagano; 知明長野
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1999-06-10
Filing date: 1999-06-10
Publication date: 2000-12-19

Abstract

PROBLEM TO BE SOLVED: To accurately grasp failures generated in a large scale parallel computer system in the order of generation time series. SOLUTION: The failure monitoring system is provided with a plurality of nodes 10 being respectively independent computers, cross bar data switches 20 for switching a signal route between the nodes, a cross bar controller 30 for controlling the drive of the switches 20, slave service processors 40 connected to the nodes 10, the switches 20 and the controller 30 and allowed to monitor failures and output prescribed failure information immediately after detecting a failure, and a master service processor 50 for monitoring the generation time series of the failure by receiving the failure information.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、障害監視システム
に関し、特に大規模並列計算機システムで使用される障
害監視システムに関するものである。The present invention relates to a fault monitoring system, and more particularly, to a fault monitoring system used in a large-scale parallel computer system.

【０００２】[0002]

【従来の技術】従来より、大規模並列計算機システムを
使って、流れの数値シミュレーション等の大規模数値計
算が行われている。すなわち、連立一次方程式等の計算
を複数の計算機（以下、ノードという）に分担させて行
うことにより、計算の高速化を図っている。このような
大規模並列計算機の一例としては、例えば日本電株式会
社製のスーパーコンピュータＳＸ−５シリーズがあり、
これは１６個のＣＰＵを搭載したノードを最大で３２個
設置することにより、４ＴＦＬＯＰＳの演算性能を実現
している。2. Description of the Related Art Conventionally, large-scale numerical calculations such as numerical simulations of flows have been performed using large-scale parallel computer systems. That is, the calculation of the simultaneous linear equations and the like is shared among a plurality of computers (hereinafter, referred to as nodes) to perform the calculation at high speed. As an example of such a large-scale parallel computer, for example, there is a supercomputer SX-5 series manufactured by Nippon Electric Co., Ltd.
This achieves the computational performance of 4TFLOPS by installing a maximum of 32 nodes equipped with 16 CPUs.

【０００３】ところで、このような大規模並列計算機シ
ステムは一般的に、システム内で発生する障害の監視お
よび復旧を行うため、サービスプロセッサと呼ばれる障
害監視装置が設置されている。サービスプロセッサは、
システム内に複数設置されるのが一般的であり、システ
ムを構成する各装置（例えばノード、クロスバデータス
イッチ、クロスバ制御装置等）毎に設けられたり、また
は、システム内の装置を複数のグループに分けたものに
それぞれ設けられたりしている。したがって、各サービ
スプロセッサは、担当する装置（または装置群）の障害
監視をそれぞれが独立して行い、個別にハードディスク
ドライブ等に障害情報を記憶保持している。Incidentally, such a large-scale parallel computer system is generally provided with a fault monitoring device called a service processor in order to monitor and recover from a fault occurring in the system. The service processor
Generally, a plurality of devices are installed in the system. Each device (for example, a node, a crossbar data switch, a crossbar control device, and the like) that configures the system is provided, or the devices in the system are divided into a plurality of groups. Each of them is provided separately. Accordingly, each service processor independently monitors a fault of a device (or a device group) in charge, and individually stores fault information in a hard disk drive or the like.

【０００４】システムで発生する障害としては、例えば
ノード内におけるＣＰＵのレジスタの故障等がある。レ
ジスタに故障が発生すると、その故障したノードにおけ
る計算が最初に破綻し、上述のとおり並列計算では各ノ
ードで行われている計算は互いに関連しているため、一
つのノードでの計算の破綻が次々その他のノードに伝搬
し、一瞬にしてシステム全体の計算が停止することにな
る。[0004] As a failure that occurs in the system, for example, there is a failure of a register of a CPU in a node. When a failure occurs in a register, the calculation at the failed node fails first, and the calculation performed at each node in the parallel calculation is related to each other as described above. It propagates to other nodes one after another, and the calculation of the entire system stops instantly.

【０００５】このような事態が生じてしまった場合、停
止した計算を復旧するためには、故障したノードを早期
に特定し、ＣＰＵの交換等を行う必要がある。When such a situation occurs, in order to recover the stopped calculation, it is necessary to identify the failed node at an early stage and replace the CPU.

【０００６】[0006]

【発明が解決しようとする課題】しかしながら、従来の
並列計算機システムでは、複数のサービスプロセッサが
独立して機能しているため、障害情報の発生時系列を一
意に判別するのが困難という問題がある。すなわち、各
サービスプロセッサ内では、障害情報に障害発生時刻を
付加したものを、ハードディスクドライブ等に記録する
のだが、各サービスプロセッサに内蔵されている時計が
完全に同期しているとは限らないため、事故の後で障害
情報を調べた際に、時間的に近接して発生した障害の発
生順序が不明になる場合がある。However, in the conventional parallel computer system, since a plurality of service processors function independently, there is a problem that it is difficult to uniquely determine a time series of occurrence of fault information. . That is, in each service processor, the fault information with the fault occurrence time added is recorded on a hard disk drive or the like, but the clocks built in each service processor are not always completely synchronized. When examining fault information after an accident, the order in which faults occurred close in time may become unknown.

【０００７】また、仮に各サービスプロセッサの時計が
完全に同期していたとしても、時間計測の分解能内で発
生した障害に関しては、障害発生の順序を正確に把握す
るのは困難である。[0007] Even if the clocks of the service processors are completely synchronized, it is difficult to accurately grasp the order of occurrence of faults that occur within the resolution of time measurement.

【０００８】本発明は、このような課題を解決するため
のものであり、大規模並列計算機システムにおいて、シ
ステム内で生じた障害を発生時系列順に正確に把握する
ことができる障害監視システムを提供することを目的と
する。SUMMARY OF THE INVENTION The present invention has been made to solve such a problem, and provides a fault monitoring system in a large-scale parallel computer system capable of accurately grasping faults that have occurred in the system in the order of occurrence in chronological order. The purpose is to do.

【０００９】[0009]

【課題を解決するための手段】このような目的を達成す
るために、本発明に係る障害監視システムは、それぞれ
が独立した計算機である複数のノードと、上記ノード間
の信号経路を切り換えるクロスバデータスイッチと、上
記クロスバデータスイッチの駆動を制御するクロスバ制
御装置と、上記ノードおよび上記クロスバデータスイッ
チおよび上記クロスバ制御装置に接続され、障害を監視
するとともに障害を検出すると直ちに所定の障害情報を
出力するスレーブ・サービスプロセッサと、上記障害情
報を受信することにより、障害の発生時系列を監視する
マスタ・サービスプロセッサとを備えたものである。In order to achieve the above object, a fault monitoring system according to the present invention comprises a plurality of nodes, each of which is an independent computer, and a crossbar data for switching a signal path between the nodes. A switch, a crossbar control device for controlling the driving of the crossbar data switch, and a node connected to the node and the crossbar data switch and the crossbar control device for monitoring a fault and outputting predetermined fault information immediately upon detecting the fault. It has a slave service processor and a master service processor that monitors the time series of the failure by receiving the failure information.

【００１０】また、本発明のその他の態様として以下の
ようなものもある。すなわち、上記スレーブ・サービス
プロセッサは、上記障害情報を記録するためのハードデ
ィスクドライブを有してもよい。また、上記マスタ・サ
ービスプロセッサは、上記障害情報を記録するためのハ
ードディスクドライブを有してもよい。また、上記障害
情報は、報告元のスレーブ・サービスプロセッサの番号
と、障害の発生した装置の番号と、報告元における障害
の登録番号とで構成されていてもよい。また、上記障害
情報は、障害の度合いを示す番号をさらに有してもよ
い。さらに、上記障害監視システムは、大規模並列計算
機システムに適用されてもよい。[0010] Other aspects of the present invention include the following. That is, the slave service processor may have a hard disk drive for recording the failure information. Further, the master service processor may include a hard disk drive for recording the failure information. Further, the failure information may include a number of a reporting slave service processor, a number of a device in which the failure has occurred, and a registration number of the failure at the reporting source. Further, the failure information may further include a number indicating a degree of the failure. Further, the fault monitoring system may be applied to a massively parallel computer system.

【００１１】[0011]

【発明の実施の形態】次に、本発明の一つの実施の形態
について図を用いて説明する。図１は、本発明の一つの
実施の形態を示すブロック図である。同図に示すよう
に、本実施の形態に係る障害監視システムは、複数のノ
ード１０と、複数のクロスバデータスイッチ２０と、１
台のクロスバ制御装置３０と、ハードディスクドライブ
４０ａを備えた複数のスレーブ・サービスプロセッサ４
０と、ハードディスクドライブ５０ａを備えた１台のマ
スタ・サービスプロセッサ５０とで構成されている。Next, one embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing one embodiment of the present invention. As shown in the figure, the fault monitoring system according to the present embodiment includes a plurality of nodes 10, a plurality of crossbar data switches 20,
Crossbar controllers 30 and a plurality of slave service processors 4 having a hard disk drive 40a
0 and one master service processor 50 having a hard disk drive 50a.

【００１２】ノード１０は、単数または複数のＣＰＵを
搭載した計算機であり、ノード毎に独立した計算機とし
て機能する。同図ではノード０〜７の計８台が設置され
ている。The node 10 is a computer equipped with one or a plurality of CPUs, and functions as an independent computer for each node. In the figure, a total of eight nodes 0 to 7 are installed.

【００１３】クロスバデータスイッチ２０は、各ノード
１０間に接続され、ノード間を伝送される信号の経路を
切り換える装置である。したがって、各ノード１０は、
クロスバデータスイッチ２０によって、ノード１０間に
おけるデータの送受信およびプロセッサ間通信を行うこ
とが可能となる。同図ではクロスバデータスイッチ０〜
３の計４台が設置されている。また、システム内に存在
する各ノード１０およびクロスバデータスイッチ２０
は、ノード１０とクロスバデータスイッチ２０とを仲介
する通信線とは独立した障害処理専用の通信線により、
スレーブＳＶＰ４０と接続されているThe crossbar data switch 20 is a device that is connected between the nodes 10 and switches the path of a signal transmitted between the nodes. Therefore, each node 10
The crossbar data switch 20 enables data transmission and reception between the nodes 10 and communication between processors. In the figure, crossbar data switches 0 to
There are a total of 4 units, 3 units. Each node 10 and the crossbar data switch 20 existing in the system
Is a communication line dedicated to failure processing that is independent of a communication line that mediates the node 10 and the crossbar data switch 20,
Connected to slave SVP40

【００１４】クロスバ制御装置３０は、各ノードに接続
され、クロスバデータスイッチ２０の駆動を制御する装
置である。The crossbar control device 30 is connected to each node and controls the driving of the crossbar data switch 20.

【００１５】ハードディスクドライブ（以下、ＨＤＤと
いう）４０ａを備えたスレーブ・サービスプロセッサ
（以下、スレーブＳＶＰという）２０は、システム内の
各装置に接続され、障害監視を行う障害処理装置であ
る。そして、障害を発見すると後述の障害情報を出力
し、ＨＤＤ４０ａに記憶保持するとともにマスタＳＶＰ
５０に対して送信する。同図ではスレーブＳＶＰ０〜４
の計５台が設置され、ノード１０およびクロスバデータ
スイッチ２０およびクロスバ制御装置３０に接続されて
いる。A slave service processor (hereinafter, referred to as a slave SVP) 20 having a hard disk drive (hereinafter, referred to as an HDD) 40a is a fault processing device that is connected to each device in the system and monitors a fault. When a failure is found, failure information to be described later is output and stored in the HDD 40a and the master SVP
Send to 50. In the figure, slaves SVP0-4
Are installed and connected to the node 10, the crossbar data switch 20, and the crossbar control device 30.

【００１６】ＨＤＤ５０ａを備えたマスタ・サービスプ
ロセッサ（以下、マスタＳＶＰという）５０は、スレー
ブＳＶＰ４０から送られてくる障害情報を受信し、シス
テム全体における障害を監視する統合障害処理装置であ
る。したがって、マスタＳＶＰ５０は、全てのスレーブ
ＳＶＰ４０に接続され、受信した障害情報をＨＤＤ５０
ａに記憶保持する。A master service processor (hereinafter, referred to as a master SVP) 50 having an HDD 50a is an integrated fault processing device that receives fault information sent from the slave SVP 40 and monitors a fault in the entire system. Therefore, the master SVP 50 is connected to all the slave SVPs 40 and transmits the received fault information to the HDD 50.
Stored in a.

【００１７】このように、本実施の形態に係る障害監視
システムは、複数のノード１０と、ノード間クロスバネ
ットワーク装置（クロスバデータスイッチ２０およびク
ロスバ制御装置３０）と、複数の障害処理装置（スレー
ブＳＶＰ４０およびマスタＳＶＰ５０）とで構成された
マルチノード型コンピュータシステムである。As described above, the fault monitoring system according to the present embodiment comprises a plurality of nodes 10, a crossbar network device between nodes (crossbar data switch 20 and crossbar control device 30), and a plurality of fault processing devices (slave SVP40). And a master SVP 50).

【００１８】また、これらシステムを構成するノード１
０、クロスバデータスイッチ２０、クロスバ制御装置３
０、スレーブＳＶＰ４０およびマスタＳＶＰ５０には、
それぞれシステム内で固有の装置番号が付与されてい
る。したがって、各スレーブＳＶＰ４０は、同一の装置
番号テーブル（表１）を用いて障害処理を行う。The nodes 1 constituting these systems
0, crossbar data switch 20, crossbar controller 3
0, slave SVP40 and master SVP50,
Each device is assigned a unique device number within the system. Therefore, each slave SVP 40 performs a failure process using the same device number table (Table 1).

【００１９】［表１］ ──────────────────── 装置名装置番号 ──────────────────── マスタＳＶＰ０００スレーブＳＶＰ００１０スレーブＳＶＰ１０１１スレーブＳＶＰ２０１２スレーブＳＶＰ３０１３スレーブＳＶＰ４０１４ノード００２０ノード１０２１ノード２０２２ノード３０２３ノード４０２４ノード５０２５ノード６０２６ノード７０２７クロスバ制御装置０３０クロスバデータスイッチ００３１クロスバデータスイッチ１０３２クロスバデータスイッチ２０３３クロスバデータスイッチ３０３４ ────────────────────[Table 1] {Device name Device number} ─ Master SVP 000 Slave SVP0 010 Slave SVP1 011 Slave SVP2 012 Slave SVP3 013 Slave SVP4 014 Node 0 020 Node 1 021 Node 2 022 Node 3 023 Node 4 024 Node 5 025 Node 6 026 Node 7 Data switch Crossbar Control 0 031 Crossbar data switch 1 032 Crossbar data switch 2 033 Crossbar data switch 3 034 ────────────────────

【００２０】次に、本発明の動作について説明する。Next, the operation of the present invention will be described.

【００２１】［単一の障害が発生した場合］ここでは、
障害情報の採取処理の流れを、単一のノードに障害が発
生した場合（図１のノード７に障害が発生したものとす
る）を例にして説明する。[When a Single Failure Occurs] Here,
The flow of the failure information collecting process will be described by taking as an example a case where a failure has occurred in a single node (it is assumed that a failure has occurred in the node 7 in FIG. 1).

【００２２】まず、ノード７の障害を監視するスレーブ
ＳＶＰ４は、ノード７で障害の発生を検出すると、マス
タＳＶＰ５０に対して障害情報を送信する。すなわち、
このスレーブＳＶＰ４の装置番号「０１４」と、障害の
発生したノード７の装置番号「０２７」と、スレーブＳ
ＶＰ４が管理する障害情報のシーケンシャルな登録番号
「００００００２５１４」と、障害の度合いを示す番号
「４」とからなる障害情報を送信する。First, the slave SVP 4 monitoring the failure of the node 7 transmits failure information to the master SVP 50 when the occurrence of the failure is detected in the node 7. That is,
The device number “014” of the slave SVP4, the device number “027” of the failed node 7 and the slave SVP
The failure information including the sequential registration number “00000000214” of the failure information managed by the VP 4 and the number “4” indicating the degree of the failure is transmitted.

【００２３】図２は、スレーブＳＶＰ４がマスタＳＶＰ
５０に対して送信する、障害の発生を通知するための通
信内容（障害情報）を示す説明図である。同図におい
て、装置番号は表１に基づいて決定され、障害の度合い
を示す番号は表２に基づいて決定される。FIG. 2 shows that the slave SVP 4 is the master SVP
FIG. 9 is an explanatory diagram showing communication contents (failure information) for notifying the occurrence of a failure, which is transmitted to the communication device 50; In the figure, the device number is determined based on Table 1, and the number indicating the degree of failure is determined based on Table 2.

【００２４】［表２］ ──────────────────────────────── 障害の度合い略称番号 ──────────────────────────────── 重度の障害ＣＨＥＣＫ０将来重度の障害となりうる障害ＣＡＵＴＩＯＮ１軽度の障害ＷＡＲＮＩＮＧ２障害以外のシグナルＡＴＴＥＮＴＩＯＮ３ ────────────────────────────────[Table 2] 度合い Degree of failure Abbreviation No. ────── ────────────────────────── Severe failure CHECK 0 Failure that could be severe in the future CAUTION 1 Mild failure WARNING 2 Signal other than failure ATTENTION 3 ────────────────────────────────

【００２５】なお、障害の発生時系列のみを知りたい場
合は、障害の度合いを示すコードを付加しなくてもよ
い。障害情報のデータサイズをなるべく小さくした方
が、障害の通知を高速で行う上で都合がよいといえる。
また、伝送上の問題等がなければ、障害情報にその他の
情報を付加してもよい。When it is desired to know only the time series of the occurrence of a fault, it is not necessary to add a code indicating the degree of the fault. It can be said that reducing the data size of the fault information as much as possible is convenient for performing fault notification at high speed.
If there is no transmission problem, other information may be added to the failure information.

【００２６】その後、障害情報を受信したマスタＳＶＰ
５は、受信した障害情報に対して、マスタＳＶＰ５０が
管理するシーケンシャルな登録番号を付与し、さらにマ
スタＳＶＰ５０が管理する時計による時間情報を障害発
生時間として付与して障害登録情報を構成し、マスタＳ
ＶＰ５に接続されているＨＤＤ５ａに記録する。Thereafter, the master SVP receiving the failure information
5 assigns a sequential registration number managed by the master SVP 50 to the received fault information, and further assigns time information by a clock managed by the master SVP 50 as a fault occurrence time to configure the fault registration information. S
Recording is performed on the HDD 5a connected to the VP5.

【００２７】障害登録情報の一例を示すと次のとおりで
ある。左から順に、障害の発生した年月日（マスタＳＶ
Ｐ５０で付加）、障害の発生した時刻（マスタＳＶＰで
付加）、マスタＳＶＰ５０における障害登録番号、障害
を検出したスレーブＳＶＰの番号、障害の発生した装置
名、スレーブＳＶＰにおける障害登録番号、障害の度合
いを示す略称である。Node***はノード１０を示し、IXS
**はクロスバデータスイッチ２０を示す。An example of the failure registration information is as follows. In order from the left, the date of failure (master SV
P50), time of failure (added by master SVP), failure registration number in master SVP 50, number of slave SVP in which failure was detected, name of device in which failure occurred, failure registration number in slave SVP, degree of failure Is an abbreviation that indicates Node *** indicates node 10, IXS
** indicates the crossbar data switch 20.

【００２８】 99-02-15 20:51:40 0000005409 SVP02 Node000 0000001027 WARNING 99-02-15 21:35:48 0000005410 SVP03 Node005 0000000873 ATTENTION 99-02-15 23:54:39 0000005411 SVP04 Node006 0000001354 WARNING 99-02-16 01:15:42 0000005412 SVP00 IXS00 0000001161 CAUTION 99-02-16 10:38:09 0000005413 SVP00 IXS11 0000001162 CHECK 99-02-16 11:22:50 0000005414 SVP03 Node004 0000000874 ATTENTION 99-02-17 11:22:47 0000005415 SVP02 Node000 0000001028 CAUTION 99-02-18 15:23:53 0000005416 SVP02 Node002 0000001029 WARNING 99-02-19 14:16:50 0000005417 SVP04 Node006 0000001355 ATTENTION99-02-15 20:51:40 0000005409 SVP02 Node000 0000001027 WARNING 99-02-15 21:35:48 0000005410 SVP03 Node005 0000000873 ATTENTION 99-02-15 23:54:39 0000005411 SVP04 Node006 0000001354 WARNING 99- 02-16 01:15:42 0000005412 SVP00 IXS00 0000001161 CAUTION 99-02-16 10:38:09 0000005413 SVP00 IXS11 0000001162 CHECK 99-02-16 11:22:50 0000005414 SVP03 Node004 0000000874 ATTENTION 99-02-17 11: 22:47 0000005415 SVP02 Node000 0000001028 CAUTION 99-02-18 15:23:53 0000005416 SVP02 Node002 0000001029 WARNING 99-02-19 14:16:50 0000005417 SVP04 Node006 0000001355 ATTENTION

【００２９】図３は、マスタＳＶＰとスレーブＳＶＰと
がそれぞれ記憶している障害情報を示す説明図である。
同図に示すように、マスタＳＶＰ５は、障害報告通知元
装置番号と報告元障害情報登録番号とに基づいて、スレ
ーブＳＶＰ４の管理する詳細な障害情報を参照すること
ができる。また、スレーブＳＶＰ４では、マスタＳＶＰ
５０への障害発生通知を行った後、ノード７から詳細な
障害内容の収集を行い、収集された詳細な障害情報をス
レーブＳＶＰ４に付属のＨＤＤ４０ａに記録する。FIG. 3 is an explanatory diagram showing fault information stored in the master SVP and the slave SVP, respectively.
As shown in the figure, the master SVP 5 can refer to detailed failure information managed by the slave SVP 4 based on the failure report notification device number and the report failure information registration number. In the slave SVP4, the master SVP
After notifying the failure occurrence to 50, detailed failure contents are collected from the node 7, and the collected detailed failure information is recorded in the HDD 40a attached to the slave SVP4.

【００３０】以上においては、単一の障害が発生した場
合における障害情報の収集手順について説明したが、複
数の装置が連続して障害を起こした場合も、同様の手順
をとることにより対処することができる。In the above description, the procedure for collecting fault information when a single fault has occurred has been described. However, even when a plurality of devices have successive faults, the same procedure can be taken. Can be.

【００３１】［複数の障害が同時に発生した場合］例え
ばノード７とクロスバデータスイッチ２とが連続して障
害を起こした場合について説明する。[Case where a Plurality of Faults Occur Simultaneously] For example, a case where a fault occurs continuously between the node 7 and the crossbar data switch 2 will be described.

【００３２】まず、ノード７に対応するスレーブＳＶＰ
４が、マスタＳＶＰ５０に対して、報告元のスレーブＳ
ＶＰ４の装置番号「０１４」と、障害の発生した装置の
番号「０２７」と、報告元の障害登録番号とを送信し、
ノード７における詳細な障害情報の収集を開始する。マ
スタＳＶＰ５０は、通知された障害情報に対して、統合
障害登録番号と通知を受け取った時刻とを付加してＨＤ
Ｄ５０ａに障害登録情報として記録する。First, the slave SVP corresponding to the node 7
4 is the slave S of the reporting source with respect to the master SVP 50.
The device number “014” of the VP4, the number “027” of the failed device, and the failure registration number of the reporting source are transmitted.
The collection of detailed fault information in the node 7 is started. The master SVP 50 adds the integrated fault registration number and the time at which the notification was received to the notified fault information, and adds
D50a is recorded as failure registration information.

【００３３】次いで、クロスバデータスイッチ２で障害
が起こり、対応するスレーブＳＶＰ２が、マスタＳＶＰ
５０に対して、報告元のスレーブＳＶＰ１の装置番号
「０１１」と、障害の発生した装置の番号「０３３」
と、報告元の障害登録番号とを送信する。マスタＳＶＰ
５０は、受信した障害情報に、マスタＳＶＰ５０内にお
ける統合障害登録番号と通知を受け取った時刻とを付加
してからＨＤＤ５０ａに記録する。Next, a failure occurs in the crossbar data switch 2, and the corresponding slave SVP2 becomes the master SVP.
For 50, the device number “011” of the reporting slave SVP1 and the number “033” of the failed device
And the fault registration number of the report source. Master SVP
50 adds the integrated fault registration number in the master SVP 50 and the time at which the notification was received to the received fault information, and records the information on the HDD 50a.

【００３４】なお、マスタＳＶＰ５０が記録する障害情
報は、図３に示すように非常にシンプルであり、そのデ
ータサイズも小さなものである。したがって、短時間で
障害情報はマスタＳＶＰ５０に送信され、またマスタＳ
ＶＰ５０は１つの障害を短時間で処理することができ
る。ノード７とクロスバデータスイッチ２が連続して障
害を起こし、二つの障害が発生する時間間隔が非常に小
さい場合においても、マスタＳＶＰ５０上で処理のバッ
ティングが起こる可能性は非常に小さく、障害発生時刻
の刻印は障害発生とほぼ同時に行われる。The fault information recorded by the master SVP 50 is very simple as shown in FIG. 3, and has a small data size. Therefore, the failure information is transmitted to the master SVP 50 in a short time,
The VP 50 can handle one failure in a short time. Even when the node 7 and the crossbar data switch 2 successively cause a failure and the time interval between the two failures is very small, the possibility that processing batting occurs on the master SVP 50 is very small, and the failure occurrence time Is performed almost simultaneously with the occurrence of a failure.

【００３５】図４は、図１に係る障害監視システムの詳
細を示すブロック図である。同図に示すように、ノード
１０は、ノード１０内に故障を検出すると１ビットの信
号を出力する故障検出回路１０ａと、レジスタ１０ｂ，
１０ｄと、オア回路１０ｃとで構成されている。スレー
ブＳＶＰ４０は、レジスタ４０ｂ，４０ｅと、ノード番
号に応じて設定された符号を出力するデコーダ４０ｃ
と、セレクタ４０ｄとで構成されている。マスタＳＶＰ
５０は、レジスタ５０ｂと、セレクタ５０ｃと、ＦＩＦ
Ｏ（First In First Out）のバッファ５０ｄとで構成さ
れている。FIG. 4 is a block diagram showing details of the fault monitoring system according to FIG. As shown in FIG. 1, a node 10 includes a failure detection circuit 10a that outputs a 1-bit signal when a failure is detected in the node 10, a register 10b,
10d and an OR circuit 10c. The slave SVP 40 includes registers 40b and 40e, and a decoder 40c that outputs a code set according to the node number.
And a selector 40d. Master SVP
Reference numeral 50 denotes a register 50b, a selector 50c,
O (First In First Out) buffer 50d.

【００３６】この障害監視システムの動作は以下のとお
りである。故障検出回路１０ａによって、ノード１０内
の故障が検出されると直ちに、１ビットの信号がレジス
タ１０ｂ、オア回路１０ｃおよびレジスタ１０ｄを介し
て出力される。出力された信号はスレーブＳＶＰ４０の
レジスタ４０ｂを介してデコーダ４０ｃに入力される。
デコーダ４０ｃは、接続されているノード毎に一意の符
号が出力されるように設定されている。The operation of this fault monitoring system is as follows. As soon as the failure detection circuit 10a detects a failure in the node 10, a 1-bit signal is output via the register 10b, the OR circuit 10c, and the register 10d. The output signal is input to the decoder 40c via the register 40b of the slave SVP 40.
The decoder 40c is set so that a unique code is output for each connected node.

【００３７】また、レジスタ４０ｂの出力は、デコーダ
４０ｃだけでなくセレクタ４０ｄの制御端子にも入力さ
れ、セレクタ４０ｄは、入力のあった経路をレジスタ４
０ｅに接続する。したがって、スレーブＳＶＰ４０に最
初に到着した信号のみが、マスタＳＶＰ５０に入力され
ることになる。スレーブＳＶＰ４０から出力された信号
は、マスタＳＶＰ５０のレジスタ５０ｂおよびセレクタ
５０ｃを介してＦＩＦＯのバッファ５０ｄに格納され
る。セレクタ５０ｃの働きは上述のセレクタ４０ｄと同
様である。また、バッファ５０ｄは、ＲＡＭ等の記憶装
置であり、ＨＤＤ５０ａよりも高速に読み書き可能であ
る。したがって、短時間の間に集中的に送られてくる故
障通知信号（障害情報）は、とりあえずバッファ５０ｄ
に書き込まれた後、ＨＤＤ５０ａに書き込まれる。The output of the register 40b is input not only to the decoder 40c but also to the control terminal of the selector 40d.
0e. Therefore, only the signal that first arrives at the slave SVP 40 is input to the master SVP 50. The signal output from the slave SVP 40 is stored in the FIFO buffer 50d via the register 50b and the selector 50c of the master SVP 50. The operation of the selector 50c is the same as that of the above-described selector 40d. The buffer 50d is a storage device such as a RAM, and can read and write at a higher speed than the HDD 50a. Therefore, the failure notification signal (failure information) intensively sent in a short time is temporarily stored in the buffer 50d.
Is written to the HDD 50a.

【００３８】なお、以上の故障検出のための構成および
手順は、クロスバデータスイッチ２０、クロスバ制御装
置３０およびシステム内のその他の回路においても同様
である。また、スレーブＳＶＰとマスタＳＶＰとを結ぶ
各線路（メタリック・ケーブルまたは光ファイバ・ケー
ブル）のクロック・スキューは、何れも等しくなるよう
に調整されている。The above-described configuration and procedure for failure detection are the same in the crossbar data switch 20, the crossbar control device 30, and other circuits in the system. The clock skew of each line (metallic cable or optical fiber cable) connecting the slave SVP and the master SVP is adjusted to be equal.

【００３９】[0039]

【発明の効果】以上説明したとおり本発明は、ノードお
よびクロスバデータスイッチおよびクロスバ制御装置に
設けられ、障害を監視するとともに障害を発見すると直
ちに障害情報を出力するスレーブ・サービスプロセッサ
と、スレーブ・サービスプロセッサからの障害情報を受
信することにより、障害の発生時系列を監視するマスタ
・サービスプロセッサとを有する。As described above, the present invention is provided in a node and a crossbar data switch and a crossbar control device, monitors a fault and outputs fault information immediately upon finding the fault, and a slave service processor. A master service processor that monitors failure occurrence time series by receiving failure information from the processor.

【００４０】このように構成することにより本発明は、
コンピュータシステム内で発生したすべての障害の登録
を最終的に一つの装置であるマスタＳＶＰが行うため、
障害発生順序が一意に判別可能な状態で保存される。With this configuration, the present invention provides:
Since the registration of all the faults that have occurred in the computer system is finally performed by one device, the master SVP,
The fault occurrence order is stored in a state where it can be uniquely determined.

【００４１】また、詳細な障害情報の収集および蓄積
を、それぞれシステムを構成する装置に接続された障害
処理装置に任すため、複数の障害が近接して発生した場
合でも、特定の障害処理装置に負荷および情報が集中す
る可能性が低くなり、負荷分散が行われる。Further, since collection and accumulation of detailed fault information are entrusted to fault processing devices connected to the respective devices constituting the system, even when a plurality of faults occur in close proximity, a specific fault processing device can be used. The possibility of concentration of load and information is reduced, and load distribution is performed.

[Brief description of the drawings]

【図１】本発明の一つの実施の形態を示すブロック図
である。FIG. 1 is a block diagram showing one embodiment of the present invention.

【図２】スレーブＳＶＰからマスタＳＶＰへ送信され
る障害情報を示す説明図である。FIG. 2 is an explanatory diagram showing fault information transmitted from a slave SVP to a master SVP.

【図３】マスタＳＶＰおよびスレーブＳＶＰ内に登録
されているデータを示す説明図である。FIG. 3 is an explanatory diagram showing data registered in a master SVP and a slave SVP.

【図４】図１に係る障害監視システムの詳細を示すブ
ロック図である。FIG. 4 is a block diagram showing details of a fault monitoring system according to FIG. 1;

[Explanation of symbols]

１０…ノード、２０…クロスバデータスイッチ、３０…
クロスバ制御装置、４０…スレーブ・サービスプロセッ
サ（スレーブＳＶＰ）、５０…マスタ・サービスプロセ
ッサ（マスタＳＶＰ）、１０ａ…故障検出回路、１０
ｂ，１０ｄ…レジスタ、１０ｃ…オア回路、４０ｂ，４
０ｅ…レジスタ、４０ｃ…デコーダ、４０ｄ…セレク
タ、５０ｂ…レジスタ、５０ｃ…セレクタ、５０ｄ…バ
ッファ、４０ａ，５０ａ…ハードディスクドライブ。10 ... node, 20 ... crossbar data switch, 30 ...
Crossbar control device, 40: slave service processor (slave SVP), 50: master service processor (master SVP), 10a: fault detection circuit, 10
b, 10d: register, 10c: OR circuit, 40b, 4
0e: register, 40c: decoder, 40d: selector, 50b: register, 50c: selector, 50d: buffer, 40a, 50a: hard disk drive.

Claims

[Claims]

A plurality of nodes each being an independent computer; a crossbar data switch for switching a signal path between the nodes; a crossbar control device for controlling driving of the crossbar data switch; and the node and the crossbar data. A slave service processor that is connected to the switch and the crossbar control device, monitors a fault, and outputs predetermined fault information immediately upon detection of the fault, and receives the fault information to monitor a time series of the fault occurrence A fault monitoring system comprising a master service processor.

2. The fault monitoring system according to claim 1, wherein the slave service processor has a hard disk drive for recording the fault information.

3. The fault monitoring system according to claim 1, wherein the master service processor has a hard disk drive for recording the fault information.

4. The system according to claim 1, wherein the fault information includes a number of a slave service processor of a report source, a number of a device in which the fault has occurred, and a registration number of the fault at the report source. Characteristic fault monitoring system.

5. The fault monitoring system according to claim 4, wherein the fault information further includes a number indicating a degree of the fault.

6. The fault monitoring system according to claim 1, wherein the fault monitoring system is applied to a large-scale parallel computer system.