JPH0756761A

JPH0756761A - Computer device

Info

Publication number: JPH0756761A
Application number: JP5201713A
Authority: JP
Inventors: Hirokazu Kawabata; 宏和川端; Masatoshi Haruta; 正俊春田
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1993-08-13
Filing date: 1993-08-13
Publication date: 1995-03-03

Abstract

PURPOSE:To minimize the influence exerted on a system by halting system management at the time of diagnosing the generation of a self-fault and outputting a command instructing a stand-by service processor(SVP) to start system management. CONSTITUTION:Each SVP previously selects a substitutive SVP to be a system SVP to be used when a fault occurs in the SVP itself from normal other SVPs. The SVP executes self-diagnosis by a self-diagnosis part 19, and when no fault occurs, continues a series of processing. If a fault occurs, a diagnostic interface 21 informs a fault processing executing part 15 of the occurance of the fault to halt system management and a substitutive SVP is recognized by referring to a substitutive system SVP management table and the substitutive system SVP is informed of a function substituting request through a transmission processing part 22.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、自己診断機能を有す
る中央処理装置のシステム接続を制御できる計算機装置
に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a computer system capable of controlling system connection of a central processing unit having a self-diagnosis function.

【０００２】[0002]

【従来の技術】図１３は例えば特開昭６４−８８６７７
号公報に示された従来の計算機装置を示す構成図であ
り、図において、１−１〜１−Ｎはシステムバス４を介
してマルチプロセッサ構成となっているＮ個のＣＰＵ
（中央処理装置）であり、自己が正常に動作しているか
否かを診断する自己診断機能を有するとともに、自己診
断の結果、障害が発生している場合には障害通知信号を
送信する機能を有している。２は自己が正常に動作して
いるか否かを診断する自己診断部、３は自己診断部２に
より障害が発生していると診断されると当該ＣＰＵの各
種処理を中止させるとともに、障害通知信号を送信し、
また、後述するサービスプロセッサ６より各種信号を受
信する診断インターフェイスである。2. Description of the Related Art FIG. 13 shows, for example, JP-A-64-88677.
It is a block diagram which shows the conventional computer apparatus shown by Unexamined-Japanese-Patent No. 1-N in the figure is N CPU of multiprocessor structure via the system bus 4.
(Central processing unit) has a self-diagnosis function for diagnosing whether or not it is operating normally, and a function for transmitting a failure notification signal when a failure occurs as a result of self-diagnosis. Have Reference numeral 2 denotes a self-diagnosis unit that diagnoses whether or not it is operating normally, and 3 stops various processing of the CPU when the self-diagnosis unit 2 diagnoses that a failure has occurred, and a failure notification signal. Send
Further, it is a diagnostic interface for receiving various signals from the service processor 6 described later.

【０００３】また、４は計算機装置のシステムバス、５
は計算機装置の診断用バス、６はＣＰＵ１−１〜１−Ｎ
から障害通知信号を受信すると、その障害通知信号を送
信したＣＰＵをシステムから切り離すシステム管理を行
うＳＶＰ（サービスプロセッサ）、７はＳＶＰ６が管理
するＣＰＵ１−１〜１−Ｎの故障情報や現在のシステム
構成等を記憶するメモリである。Reference numeral 4 denotes a computer system bus, and 5
Is a computer diagnostic bus, and 6 is CPUs 1-1 to 1-N
When a failure notification signal is received from the SVP (service processor), the CPU that sends the failure notification signal is disconnected from the system, and 7 is failure information of the CPUs 1-1 to 1-N managed by the SVP 6 and the current system. It is a memory that stores the configuration and the like.

【０００４】次に動作について説明する。ＣＰＵ１−１
〜１−Ｎは、それぞれ、自己のプログラムにしたがって
各種の処理を行うが、動作中に、実行すべき待ちタスク
がなくなると、あるいは、周期的に、自己診断部２によ
って自己診断を行うマクロ命令が実行され、自己診断が
行われる。ここで、自己診断の結果、正常と診断されれ
ば、今まで通り、各種処理を続行することになるが、障
害が発生していると診断されると、各種処理を中止し、
診断インターフェイス３がＳＶＰ６に対して診断用バス
５を介して障害通知信号を割り込み信号として送信す
る。なお、説明の便宜上、ＣＰＵ１−１に障害が発生
し、ＣＰＵ１−１の診断インターフェイス３から障害通
知信号が送信されたとする。Next, the operation will be described. CPU1-1
Each of 1 to 1-N performs various kinds of processing according to its own program, but when there are no waiting tasks to be executed during operation, or periodically, a macro instruction for performing self-diagnosis by the self-diagnosis unit 2. Is executed and self-diagnosis is performed. If, as a result of the self-diagnosis, it is diagnosed as normal, the various processes are continued as before, but if it is diagnosed that a failure has occurred, the various processes are stopped,
The diagnostic interface 3 transmits a fault notification signal as an interrupt signal to the SVP 6 via the diagnostic bus 5. For convenience of explanation, assume that a failure occurs in the CPU 1-1 and a failure notification signal is transmitted from the diagnosis interface 3 of the CPU 1-1.

【０００５】次に、ＳＶＰ６の動作を図１４のフローチ
ャート図を参照しつつ説明する。まず、ＳＶＰ６は、メ
モリ７の記憶内容に基づいてＣＰＵ１−１〜１−Ｎを管
理するが、上述したように、ＣＰＵ１−１に障害が発生
すると障害通知信号を送信されるので、ＳＶＰ６に対し
て割り込みが発生する（ステップＳＴ１）。これによ
り、ＳＶＰ６は、ＣＰＵ１−１〜１−Ｎからの割り込み
であるのか、他の機器からの割り込みであるのかを判断
する（ステップＳＴ２）。そして、当該割り込みが、他
の機器からのものであれば、所定の処理を実行し（ステ
ップＳＴ３）、当該割り込みが、ＣＰＵ１−１〜１−Ｎ
からのものであれば、割り込みの通知内容（障害通知信
号）を詳細に解析して、処理を分岐する（ステップＳＴ
４）。Next, the operation of the SVP 6 will be described with reference to the flowchart of FIG. First, the SVP 6 manages the CPUs 1-1 to 1-N based on the stored contents of the memory 7, but as described above, when a failure occurs in the CPU 1-1, a failure notification signal is transmitted, so that the SVP 6 is notified to the SVP 6. Interrupt is generated (step ST1). As a result, the SVP 6 determines whether it is an interrupt from the CPU 1-1 to 1-N or an interrupt from another device (step ST2). If the interrupt is from another device, a predetermined process is executed (step ST3), and the interrupt is the CPU 1-1 to 1-N.
If it is, the contents of the interrupt notification (fault notification signal) are analyzed in detail and the process is branched (step ST
4).

【０００６】例えば、通知内容の種別として、上述のＣ
ＰＵにおける処理実行中の自己診断エラーの他に、電源
投入時の自己診断エラー、トラップ処理の続行不能など
があり、これらの場合は、詳述しないが、割り込み原因
に応じて各処理が行われる（ステップＳＴ５〜７）。こ
れに対して、割り込みの通知内容を詳細に解析した結
果、ＣＰＵ１−１における処理実行中の自己診断エラー
であると判明すると、ＳＶＰ６は、ＣＰＵ１−１をシス
テムを切り離すための処理を行う。[0006] For example, as the type of notification content, the above-mentioned C
In addition to the self-diagnosis error during execution of processing in the PU, there are self-diagnosis errors when the power is turned on, trap processing cannot continue, etc. In these cases, although not described in detail, each processing is performed depending on the cause of interrupt. (Steps ST5-7). On the other hand, as a result of detailed analysis of the interrupt notification content, when it is determined that the self-diagnosis error is occurring during execution of processing in the CPU 1-1, the SVP 6 performs processing for disconnecting the CPU 1-1 from the system.

【０００７】即ち、ＣＰＵ１−１は、上述したように、
既に各種処理を中止しているので、メモリ７に記憶され
ているシステム構成のなかから、ＣＰＵ１−１を切り離
すとともに（ステップＳＴ８）、障害の発生していない
他のＣＰＵ１−２〜１−Ｎに対して、診断用バス５を介
してＣＰＵ１−１に障害が発生した旨を通知することに
より（ステップＳＴ９）、システムソフトウエア上での
ＣＰＵ１−１の切り離しを行い、ＣＰＵ１−１をシステ
ムから切り離す。That is, the CPU 1-1, as described above,
Since various processes have already been stopped, the CPU 1-1 is disconnected from the system configuration stored in the memory 7 (step ST8), and the other CPUs 1-2 to 1-N in which no failure has occurred On the other hand, the CPU 1-1 is disconnected from the system by notifying the CPU 1-1 that a failure has occurred through the diagnostic bus 5 (step ST9), and the CPU 1-1 is disconnected from the system. .

【０００８】[0008]

【発明が解決しようとする課題】従来の計算機装置は以
上のように構成されているので、ＣＰＵに障害が発生し
た場合には障害の発生したＣＰＵをシステムから切り離
すことができるため、稼働中のシステムに与える悪影響
を最小限に抑えることができるが、ＳＶＰに障害が発生
した場合には何ら対策が施されておらず、障害の発生し
たＳＶＰがシステムに多大なる悪影響を与えることがあ
るなどの問題点があった。Since the conventional computer system is configured as described above, when a CPU fails, the CPU with the failure can be disconnected from the system, and therefore, it is in operation. Although it is possible to minimize the adverse effect on the system, if a failure occurs in the SVP, no measures are taken and the SVP in which the failure occurs may have a great adverse effect on the system. There was a problem.

【０００９】請求項１の発明は上記のような問題点を解
消するためになされたもので、ＳＶＰに障害が発生した
場合でも、システムに与える悪影響を最小限に抑えるこ
とができる計算機装置を得ることを目的とする。The invention of claim 1 has been made to solve the above-mentioned problems, and provides a computer device capable of minimizing the adverse effect on the system even when a failure occurs in the SVP. The purpose is to

【００１０】請求項２及び請求項３の発明は、請求項１
の発明より、さらに確実に、システムに与える悪影響を
最小限に抑えることができる計算機装置を得ることを目
的とする。The inventions of claims 2 and 3 are defined by claim 1.
Another object of the present invention is to provide a computer device that can more surely minimize the adverse effects on the system.

【００１１】[0011]

【課題を解決するための手段】請求項１の発明に係る計
算機装置は、自己がシステム管理を行っている場合に
は、自己診断部により自己に障害が発生したものと診断
されるとシステム管理を中止するとともに、待機用のサ
ービスプロセッサに対してシステム管理を開始すべき旨
の指令を出力するようにしたものである。According to a first aspect of the present invention, there is provided a computer system in which, when the self-diagnosis unit diagnoses that a failure has occurred in the self-system management, the system management is performed. Is canceled and a command to the effect that system management should be started is output to the standby service processor.

【００１２】また、請求項２の発明に係る計算機装置
は、自己がシステム管理を行っている場合には、各待機
用のサービスプロセッサに対して自己診断をすべき旨の
指令を出力することにより、各待機用のサービスプロセ
ッサからの正常応答信号の有無によって各待機用のサー
ビスプロセッサの診断結果を把握しておき、自己診断部
により自己に障害が発生したものと診断されるとシステ
ム管理を中止するとともに、その診断結果に基づいて正
常な待機用のサービスプロセッサに対してシステム管理
を開始すべき旨の指令を出力するようにしたものであ
る。Further, the computer system according to the second aspect of the present invention outputs a command for self-diagnosis to each standby service processor when the system is managed by itself. , Check the diagnosis result of each standby service processor based on the presence / absence of a normal response signal from each standby service processor, and stop the system management when the self-diagnosis unit diagnoses that a self-failure has occurred. At the same time, based on the diagnosis result, a command to start system management is output to the normal service processor for standby.

【００１３】さらに、請求項３の発明に係る計算機装置
は、負荷率測定部により測定された負荷状況が所定値を
越えている場合には自己に障害が発生したものとみなす
ようにしたものである。Further, the computer apparatus according to the third aspect of the present invention is such that when the load condition measured by the load factor measuring unit exceeds a predetermined value, it is considered that a failure has occurred in itself. is there.

【００１４】[0014]

【作用】請求項１の発明における計算機装置は、自己が
システム管理を行っている場合には、自己診断部により
自己に障害が発生したものと診断されるとシステム管理
を中止するとともに、待機用のサービスプロセッサに対
してシステム管理を開始すべき旨の指令を出力する切替
制御部を設けたことにより、障害の発生したサービスプ
ロセッサがシステムから切り離され、待機用のサービス
プロセッサがシステム管理を行うようになる。In the computer system according to the present invention, when the self-diagnosis unit diagnoses that a failure has occurred in the self-system management, the computer system suspends the system management and waits. By providing a switching control unit that outputs a command to start system management to another service processor, the faulty service processor is disconnected from the system and the standby service processor performs system management. become.

【００１５】また、請求項２の発明における計算機装置
は、自己がシステム管理を行っている場合には、各待機
用のサービスプロセッサに対して自己診断をすべき旨の
指令を出力することにより、各待機用のサービスプロセ
ッサからの正常応答信号の有無によって各待機用のサー
ビスプロセッサの診断結果を把握しておき、自己診断部
により自己に障害が発生したものと診断されるとシステ
ム管理を中止するとともに、その診断結果に基づいて正
常な待機用のサービスプロセッサに対してシステム管理
を開始すべき旨の指令を出力する切替制御部を設けたこ
とにより、障害の発生したサービスプロセッサがシステ
ムから切り離され、正常な待機用のサービスプロセッサ
がシステム管理を行うようになる。Further, the computer device according to the second aspect of the present invention outputs a command for performing self-diagnosis to each standby service processor when the computer device itself manages the system. The diagnosis result of each standby service processor is known based on the presence or absence of a normal response signal from each standby service processor, and system management is stopped when the self-diagnosis unit diagnoses that a failure has occurred in itself. In addition, by providing a switching control unit that outputs a command to start system management to the normal standby service processor based on the diagnosis result, the faulty service processor is disconnected from the system. , The service processor for normal standby comes to perform system management.

【００１６】さらに、請求項３の発明における計算機装
置は、負荷率測定部により測定された負荷状況が所定値
を越えている場合には自己に障害が発生したものとみな
すようにしたことにより、稼働率の低いサービスプロセ
ッサのみが代替用のサービスプロセッサとして選定され
る。Further, the computer system according to the third aspect of the present invention considers that a fault has occurred in itself when the load condition measured by the load factor measuring unit exceeds a predetermined value. Only service processors with low operating rates are selected as alternative service processors.

【００１７】[0017]

【Example】

実施例１．以下、この発明の一実施例を図について説明
する。図１はこの発明の一実施例による計算機装置を示
す構成図であり、図において、従来のものと同一符号は
同一または相当部分を示すので説明を省略する。８−１
〜８−Ｎは自己が正常に動作しているか否かを診断する
自己診断機能を有するとともに、自己診断の結果、正常
に動作している場合には正常応答信号を送信するＣＰＵ
（中央処理装置）、９−１〜９−Ｍは各ＣＰＵ８−１〜
８−Ｎに対して自己診断をすべき旨の指令を出力し、所
定時間内に正常応答信号を送信してこないＣＰＵがあれ
ば、そのＣＰＵに障害が発生したものと判断してそのＣ
ＰＵをシステムから切り離すシステム管理を行うＳＶＰ
（サービスプロセッサ）である。Example 1. An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing a computer apparatus according to an embodiment of the present invention. In the figure, the same reference numerals as those of the conventional one indicate the same or corresponding portions, and the explanation thereof will be omitted. 8-1
8 to N have a self-diagnosis function for diagnosing whether or not the self is normally operating, and as a result of the self-diagnosis, a CPU which transmits a normal response signal when the normal operation is performed.
(Central processing unit), 9-1 to 9-M are CPUs 8-1 to 8-1.
If there is a CPU that outputs a command for self-diagnosis to 8-N and does not send a normal response signal within a predetermined time, it is determined that a failure has occurred in that CPU, and the C
SVP that manages the system by disconnecting the PU from the system
(Service processor).

【００１８】また、図２はＳＶＰ９−１〜９−Ｍの詳細
な構成を示す構成図であり、図において、１１はシステ
ムバス４とのインターフェイスとなるシステムバスイン
ターフェイス、１２は診断用バス５とのインターフェイ
スとなる診断用バスインターフェイス、１３は診断用バ
スインターフェイス１２を介して他のＳＶＰまたはＣＰ
Ｕ８−１〜８−Ｎから各種の通知を受信する受信処理
部、１４は受信処理部１３が受信した各種の通知を解析
し、例えば、受信した通知が障害情報であれば、障害の
発生したＳＶＰまたはＣＰＵを割り出す受信情報解析
部、１５は受信情報解析部１４の解析結果及び故障処理
テーブル１６に登録されている故障処理情報に基づいて
障害の発生したＳＶＰまたはＣＰＵをシステムから切り
離すとともに（後述する診断インターフェイス２１に対
して、システム構成テーブル２０に登録されているシス
テム構成を更新するよう指示を与える）、診断インター
フェイス２１より自己に障害が発生した旨の指令を受け
るとシステム管理を中止する故障処理実行部（切替制御
部）である。FIG. 2 is a block diagram showing the detailed structure of the SVPs 9-1 to 9-M. In FIG. 2, 11 is a system bus interface which is an interface with the system bus 4, and 12 is a diagnostic bus 5. Diagnostic bus interface serving as an interface of the SVP, 13 is another SVP or CP via the diagnostic bus interface 12.
The reception processing unit that receives various notifications from U8-1 to U-N, 14 analyzes various notifications received by the reception processing unit 13 and, for example, if the received notification is failure information, a failure has occurred. The reception information analysis unit 15 for identifying the SVP or CPU disconnects the SVP or CPU in which the failure has occurred from the system based on the analysis result of the reception information analysis unit 14 and the failure processing information registered in the failure processing table 16 (see below). The diagnostic interface 21 is instructed to update the system configuration registered in the system configuration table 20), and when the diagnostic interface 21 receives a command indicating that a failure has occurred, the system management is stopped. It is a process execution unit (switch control unit).

【００１９】１６は予め、システムの初期設定段階で、
システムとしての故障処理情報を登録する故障処理テー
ブル、１７は受信情報解析部１４によりあるＣＰＵに障
害が発生したと解析されると、故障処理実行部１５がそ
の受信情報解析部１４の解析結果を登録する故障ＣＰＵ
管理部、１８は受信情報解析部１４によりあるＳＶＰに
障害が発生したと解析されると、故障処理実行部１５が
その受信情報解析部１４の解析結果を登録するＳＶＰ管
理部、１９は自己が正常に動作しているか否かを診断す
る自己診断部、２０はこのシステム内に存在するＳＶＰ
及びＣＰＵのシステム構成情報を記憶するシステム構成
テーブル、２１は故障処理実行部１５の指示に従ってシ
ステム構成テーブル２０の記憶内容を更新し、また、自
己診断部１９により自己に障害が発生したものと診断さ
れると、故障処理実行部１５に対して自己に障害が発生
した旨の指令を与えるとともに、ＳＶＰ管理部１８の登
録内容に基づいて代替する他のＳＶＰを選定し、そのＳ
ＶＰに対してシステム管理を開始すべき旨の指令（以
下、機能代替依頼という）を送信処理部２２を介して送
信する診断インターフェイス（切替制御部）、２２は診
断インターフェイス２１から送信される機能代替依頼を
送信するとともに、故障処理実行部１５の指示に従って
障害の発生していないＣＰＵ及び他のＳＶＰに対して、
あるＣＰＵに障害が発生した旨の信号を送信する送信処
理部である。なお、上記にも明示したが、故障処理実行
部１５及び診断インターフェイス２１から切替制御部を
構成している。Reference numeral 16 indicates in advance the initial setting stage of the system.
A failure processing table in which failure processing information as a system is registered, and when the reception information analysis unit 14 analyzes that a CPU has failed, the failure processing execution unit 15 displays the analysis result of the reception information analysis unit 14. Faulty CPU to register
When the reception information analysis unit 14 analyzes the management unit 18 that a failure has occurred in the SVP, the failure processing execution unit 15 registers the analysis result of the reception information analysis unit 14 and the SVP management unit 19, 19 A self-diagnosis unit for diagnosing whether or not it is operating normally, 20 is an SVP existing in this system
And a system configuration table for storing system configuration information of the CPU, 21 updates the storage contents of the system configuration table 20 in accordance with an instruction from the failure processing execution unit 15, and the self-diagnosis unit 19 diagnoses that a self-failure has occurred. Then, the failure processing execution unit 15 is instructed that a failure has occurred, and another SVP to be replaced is selected based on the registered contents of the SVP management unit 18, and the S
A diagnostic interface (switch control unit) for transmitting a command to the VP to start system management (hereinafter referred to as a function substitution request) via the transmission processing unit 22, and 22 is a function substitution transmitted from the diagnostic interface 21. While transmitting the request, the failure processing executing unit 15 instructs the CPU and other SVPs in which no failure has occurred to
A transmission processing unit that transmits a signal indicating that a failure has occurred in a certain CPU. It should be noted that, as described above, the failure processing execution unit 15 and the diagnosis interface 21 constitute a switching control unit.

【００２０】また、図３はＣＰＵ８−１〜８−Ｍの詳細
な構成を示す構成図であり、図において、３１はシステ
ムバス４とのインターフェイスとなるシステムバスイン
ターフェイス、３２は診断用バス５とのインターフェイ
スとなる診断用バスインターフェイス、３３は診断用バ
スインターフェイス３８を介して現在システム管理を行
っているＳＶＰ（以下、システムＳＶＰという）から障
害の発生したＳＶＰまたはＣＰＵの情報を受信する受信
処理部、３４は受信処理部３３が受信した情報を解析
し、例えば、受信した情報がＣＰＵの障害情報であれ
ば、その情報を故障ＣＰＵ管理部３５に登録し、システ
ムＳＶＰを変更する旨の情報であれば、その情報をシス
テムＳＶＰ登録テーブル３６に登録し、システムＳＶＰ
から自己診断をすべき旨の指令であれば、その指令を診
断インターフェイス３８に与える受信情報解析部であ
る。FIG. 3 is a block diagram showing a detailed configuration of the CPUs 8-1 to 8-M. In the figure, 31 is a system bus interface which is an interface with the system bus 4, and 32 is a diagnostic bus 5. A diagnostic bus interface serving as an interface of the SVP 33, and a reception processing unit 33 for receiving information on an SVP or a CPU having a failure from an SVP currently performing system management (hereinafter referred to as a system SVP) via the diagnostic bus interface 38. , 34 analyzes the information received by the reception processing unit 33. For example, if the received information is CPU failure information, the information is registered in the failed CPU management unit 35 and the system SVP is changed. If so, the information is registered in the system SVP registration table 36, and the system SVP registration table 36 is registered.
If it is an instruction to self-diagnose, the reception information analysis section gives the instruction to the diagnostic interface 38.

【００２１】３５はどのＣＰＵに障害が発生しているか
を管理する故障ＣＰＵ管理部、３６は現在システムＳＶ
Ｐとして動作しているＳＶＰを登録するシステムＳＶＰ
登録テーブル、３７は自己が正常に動作しているか否か
を診断する自己診断部、３８は受診情報解析部３４から
自己診断をすべき旨の指令を与えられると自己診断部３
７に自己診断を行わせ、その自己診断部３７の診断結果
を、システムＳＶＰ登録テーブル３６を参照してシステ
ムＳＶＰに送信する診断インターフェイス、３９は診断
インターフェイス３８の指示に従って自己診断部３７の
診断結果をシステムＳＶＰに送信する送信処理部であ
る。Reference numeral 35 is a faulty CPU management section for managing which CPU has a failure, and 36 is the current system SV.
System SVP that registers the SVP operating as P
A registration table, 37 is a self-diagnosis unit for diagnosing whether or not the self is normally operating, and 38 is a self-diagnosis unit 3 when a command to perform self-diagnosis is given from the consultation information analysis unit 34.
7 performs a self-diagnosis, and the diagnostic result of the self-diagnosis unit 37 is transmitted to the system SVP by referring to the system SVP registration table 36. 39 is a diagnostic result of the self-diagnosis unit 37 according to the instruction of the diagnostic interface 38. To the system SVP.

【００２２】次に動作について説明する。最初に、シス
テムＳＶＰの動作を、図６のフローチャート図を参照し
つつ説明するが、説明の便宜上、ＳＶＰ９−１がシステ
ムＳＶＰであったとする。Next, the operation will be described. First, the operation of the system SVP will be described with reference to the flowchart of FIG. 6, but for convenience of explanation, it is assumed that the SVP 9-1 is the system SVP.

【００２３】まず、ＳＶＰ９−１は、立ち上げ時におい
て、ＳＶＰ管理部１８を参照することにより、システム
ＳＶＰとなっているＳＶＰを認識し（ステップＳＴ１
１）、自己がシステムＳＶＰであるか否かを判断する
（ステップＳＴ１２）。即ち、ＳＶＰ管理部１８は、そ
の構成要素としてシステムＳＶＰ登録テーブル１８ｃを
有しており、システムＳＶＰ登録テーブル１８ｃには図
５に示すように、現在のシステムにおけるシステムＳＶ
Ｐが登録されている（因に、フラグ１はＳＶＰ９−１，
フラグ２はＳＶＰ９−２，・・・，フラグＭはＳＶＰ９
−Ｍに対応している）ので、ＳＶＰ９−１は、ＳＶＰ管
理部１８を参照することにより自己がシステムＳＶＰで
あるか否かを認識することができる。First, the SVP 9-1 recognizes the SVP which is the system SVP by referring to the SVP management unit 18 at the time of startup (step ST1).
1), it determines whether or not it is the system SVP (step ST12). That is, the SVP management unit 18 has a system SVP registration table 18c as its component, and the system SVP registration table 18c includes the system SVs in the current system as shown in FIG.
P is registered (incidentally, flag 1 is SVP9-1,
The flag 2 is SVP9-2, ..., The flag M is SVP9.
(Corresponding to -M), the SVP 9-1 can recognize whether or not it is the system SVP by referring to the SVP management unit 18.

【００２４】上述したように、ＳＶＰ９−１は、システ
ムＳＶＰとして登録されているので、以下ステップＳＴ
１３以降の処理を行うが、他のＳＶＰはシステムＳＶＰ
ではないので、図６における処理は終了する。そして、
ＳＶＰ９−１は、システムＳＶＰであるため、診断イン
ターフェイス２１が、システム構成テーブル２０を参照
することにより、現在のシステムに接続されているＳＶ
Ｐ及びＣＰＵを認識し、その接続されているＳＶＰ及び
ＣＰＵに対し、自己診断をすべき旨の指令を送信処理部
２２を介して送信する。As described above, since the SVP 9-1 is registered as the system SVP, the following steps ST
13 and subsequent processes are performed, but other SVPs are system SVPs.
However, the processing in FIG. 6 ends. And
Since the SVP 9-1 is a system SVP, the diagnostic interface 21 refers to the system configuration table 20, and thus the SV connected to the current system.
The P and CPU are recognized, and a command to the effect that self-diagnosis should be performed is transmitted to the connected SVP and CPU via the transmission processing unit 22.

【００２５】そして、ＳＶＰ９−１は、自己診断をすべ
き旨の指令を送信したＳＶＰ及びＣＰＵから、自己診断
結果の応答が送られてくるのを待ち、その応答信号が送
られてくると受信処理部１３が受信して、受信情報解析
部１４がその応答信号（正常応答信号又は異常応答信
号）を解析するが、所定時間内に正常であることを示す
正常応答信号を送信してこなかったＳＶＰ及びＣＰＵに
ついては障害が発生したもの判断する。Then, the SVP 9-1 waits for a response of the self-diagnosis result from the SVP and the CPU, which have transmitted the command for the self-diagnosis, and receives it when the response signal is transmitted. The processing unit 13 receives the signal, and the reception information analysis unit 14 analyzes the response signal (normal response signal or abnormal response signal), but does not transmit the normal response signal indicating normality within a predetermined time. Regarding the SVP and CPU, it is determined that a failure has occurred.

【００２６】もし、現在のシステムに接続されているＳ
ＶＰ及びＣＰＵの中で、１つでも障害が発生していれ
ば、障害の発生したＳＶＰあるいはＣＰＵをシステムか
ら切り離す処理を行う（ステップＳＴ１５，１６，１
７）。ＳＶＰに障害が発生した場合には（例えば、ＳＶ
Ｐ９−３に障害が発生したものとする）、図５に示すよ
うに、ＳＶＰ９−１のＳＶＰ管理部１８における故障管
理テーブル１８ａのフラグ３をオンにして、ＳＶＰ９−
３に障害が発生したことを識別できるようにする（ステ
ップＳＴ１５）。そして、ＳＶＰ９−１の故障処理実行
部１５が故障処理テーブル１６を参照することにより、
診断インターフェイス２１に対して、システム構成テー
ブル２０に登録されているシステム構成を更新するよう
指示を与え、診断インターフェイス２１が、システム構
成テーブル２０に登録されているシステム構成のなかか
ら、ＳＶＰ９−３を削除する（ステップＳＴ１６）。ま
た、診断インターフェイス２１は、他の正常なＳＶＰ９
−２，９−４〜９−Ｍ及びＣＰＵ８−１〜８−Ｎに対し
て、ＳＶＰ９−３に障害が発生した旨を通知する（ステ
ップＳＴ１７）。If the S connected to the current system
If at least one of the VPs and CPUs has a failure, the processing for disconnecting the failed SVP or CPU from the system is performed (steps ST15, 16, 1).
7). In case of SVP failure (for example, SV
Assuming that a failure has occurred in P9-3), as shown in FIG. 5, the flag 3 of the failure management table 18a in the SVP management unit 18 of the SVP 9-1 is turned on, and the SVP 9-
It is made possible to identify that a failure has occurred in step 3 (step ST15). Then, the failure processing execution unit 15 of the SVP 9-1 refers to the failure processing table 16,
The diagnostic interface 21 is instructed to update the system configuration registered in the system configuration table 20, and the diagnostic interface 21 selects the SVP 9-3 from among the system configurations registered in the system configuration table 20. It is deleted (step ST16). In addition, the diagnostic interface 21 is used for other normal SVP9.
-2, 9-4 to 9-M and CPUs 8-1 to 8-N are notified that the SVP 9-3 has failed (step ST17).

【００２７】一方、ＣＰＵに障害が発生した場合には
（例えば、ＣＰＵ８−３に障害が発生したものとす
る）、図４に示すように、ＳＶＰ９−１の故障ＣＰＵ管
理部１７のフラグ３をオンにして、ＣＰＵ８−３に障害
が発生したことを識別できるようにする（ステップＳＴ
１５）。そして、ＳＶＰの場合と同様にして、システム
構成テーブル２０を更新し（ステップＳＴ１６）、他の
正常なＳＶＰ９−２〜９−Ｍ及びＣＰＵ８−１，８−
２，８−４〜８−Ｎに対して、ＣＰＵ８−３に障害が発
生した旨を通知する（ステップＳＴ１７）。On the other hand, if the CPU fails (for example, if the CPU 8-3 fails), the flag 3 of the failed CPU management unit 17 of the SVP 9-1 is set as shown in FIG. It is turned on so that it can be identified that a failure has occurred in the CPU 8-3 (step ST
15). Then, similarly to the case of SVP, the system configuration table 20 is updated (step ST16), and other normal SVPs 9-2 to 9-M and CPUs 8-1 and 8-
The CPU 8-3 is notified that the failure has occurred in the CPUs 8-3 to 2, 8-4 to 8-N (step ST17).

【００２８】次に、ＳＶＰ９−１は、自己に障害が発生
した場合に備え、予め、正常なＳＶＰの中から、自己に
障害が発生した場合にシステムＳＶＰとなる代替のＳＶ
Ｐを選定する（ステップＳＴ１８）。つまり、ＳＶＰ管
理部１８の故障管理テーブル１８ａを参照することによ
り、正常なＳＶＰを認識し（この場合、ＳＶＰ３以外は
正常であると認識する）、ある規定、例えば、診断イン
ターフェイス２１の動作プログラムに、各ＳＶＰに対す
る代替のシステムＳＶＰ候補の優先順位が定義されてお
り、これに従い、代替のシステムＳＶＰを選定する（ス
テップＳＴ１８）。Next, the SVP 9-1 is a substitute SV that becomes a system SVP when a failure occurs in itself from the normal SVP in advance, in case the failure occurs in the SVP 9-1.
P is selected (step ST18). That is, by referring to the failure management table 18a of the SVP management unit 18, a normal SVP is recognized (in this case, all other than SVP3 are recognized as normal), and a certain rule, for example, an operation program of the diagnostic interface 21 is recognized. , The priority order of the alternative system SVP candidates for each SVP is defined, and the alternative system SVP is selected accordingly (step ST18).

【００２９】ここで、代替のシステムＳＶＰがＳＶＰ９
−２であるという設定結果を得られたとしたならば、そ
の選定結果を、図５に示すように、代替システムＳＶＰ
管理テーブル１８ｂに登録する（ステップＳＴ１９）。Here, the alternative system SVP is SVP9.
If the setting result of −2 is obtained, the selection result is shown in FIG.
It is registered in the management table 18b (step ST19).

【００３０】次に、ＳＶＰ９−１は、自己診断部１９に
よって自己診断を行い（ステップＳＴ２０）、障害が発
生していなければ、ステップＳＴ１３に戻って一連の処
理を継続するが、障害が発生している場合には、診断イ
ンターフェイス２１が故障処理実行部１５に対して障害
が発生した旨の通知をすることにより、システム管理を
中止させるとともに、代替システムＳＶＰ管理テーブル
１８ｂを参照することにより、代替システムＳＶＰを認
識し、その代替システムＳＶＰ（ＳＶＰ９−２）に対し
送信処理部２２を介して機能代替依頼を通知する。ま
た、その他のＳＶＰ９−３〜９−Ｍに対しては、送信処
理部２２を介して更新依頼を通知する（ステップＳＴ２
２）。Next, the SVP 9-1 carries out self-diagnosis by the self-diagnosis unit 19 (step ST20). If no fault has occurred, the process returns to step ST13 to continue the series of processes, but a fault has occurred. In this case, the diagnosis interface 21 notifies the failure processing execution unit 15 of the occurrence of a failure, thereby stopping the system management and referring to the alternative system SVP management table 18b to perform the replacement. The system SVP is recognized, and the substitute system SVP (SVP9-2) is notified of the function substitution request via the transmission processing unit 22. Further, the other SVPs 9-3 to 9-M are notified of the update request via the transmission processing unit 22 (step ST2).
2).

【００３１】次に、待機用のＳＶＰの動作を、図７のフ
ローチャート図を参照しつつ説明する。まず、ＳＶＰ９
−２〜９−Ｍは、受信処理部１３が診断用バスインター
フェイス１２を介して割込み信号を受信し（ステップＳ
Ｔ３１）、その割込み信号はシステムＳＶＰ（ＳＶＰ９
−１）から送信された割込み信号であるか否かを、受信
情報処理部１４が解析する（ステップＳＴ３２）。Next, the operation of the standby SVP will be described with reference to the flow chart of FIG. First, SVP9
-2-9-M, the reception processing unit 13 receives the interrupt signal via the diagnostic bus interface 12 (step S
T31), the interrupt signal is the system SVP (SVP9
The reception information processing unit 14 analyzes whether or not the interrupt signal is transmitted from -1) (step ST32).

【００３２】もし、その割込み信号が、ＳＶＰ９−１以
外から送信されたものであれば、詳述しないが、その割
込み信号に応じた処理を行う（ステップＳＴ３３）。一
方、その割込み信号が、ＳＶＰ９−１から送信されたも
のであれば、さらに解析して下記のような処理を行う。If the interrupt signal is transmitted from a device other than SVP 9-1, the process according to the interrupt signal is performed (step ST33), which will not be described in detail. On the other hand, if the interrupt signal is transmitted from the SVP 9-1, further analysis is performed and the following processing is performed.

【００３３】即ち、その割込み信号が、あるＣＰＵに障
害が発生したことを示す通知であれば（ステップＳＴ３
３）、ＳＶＰ９−２〜９−Ｍにおける故障処理実行部１
５が、各故障ＣＰＵ管理部１７に障害の発生したＣＰＵ
を登録する（ステップＳＴ３４）。上述したように、こ
の例では、ＣＰＵ８−３に障害が発生したものとしてい
るので、各故障ＣＰＵ管理部１７のフラグ３をオンにす
る。そして、ＳＶＰ９−２〜９−Ｍの故障処理実行部１
５が各故障処理テーブル１６を参照することにより、診
断インターフェイス２１に対して、システム構成テーブ
ル２０に登録されているシステム構成を更新するよう指
示を与え、診断インターフェイス２１が、システム構成
テーブル２０に登録されているシステム構成のなかか
ら、ＣＰＵ８−３を削除する（ステップＳＴ３５）。That is, if the interrupt signal indicates that a failure has occurred in a certain CPU (step ST3).
3), failure processing execution unit 1 in SVP 9-2 to 9-M
5 is a CPU in which a failure has occurred in each failed CPU management unit 17.
Is registered (step ST34). As described above, in this example, the CPU 8-3 is assumed to have failed, so the flag 3 of each failed CPU management unit 17 is turned on. Then, the failure processing execution unit 1 of the SVPs 9-2 to 9-M
5 gives an instruction to the diagnostic interface 21 to update the system configuration registered in the system configuration table 20 by referring to each failure processing table 16, and the diagnostic interface 21 registers in the system configuration table 20. The CPU 8-3 is deleted from the existing system configurations (step ST35).

【００３４】次に、その割込み信号が、あるＳＶＰに障
害が発生したことを示す通知であれば（ステップＳＴ３
６）、ＳＶＰ９−３〜９−Ｍにおける故障処理実行部１
５が、各ＳＶＰ管理部１８の故障管理テーブル１８ａに
障害の発生したＳＶＰを登録する（ステップＳＴ３
７）。上述したように、この例では、ＳＶＰ９−３に障
害が発生したものとしているので、故障管理テーブル１
８ａのフラグ３をオンにする。なお、ＳＶＰ９−３は、
障害の発生したＳＶＰであるので、かかる通知はなされ
ない。そして、この場合も、ＳＶＰ９−２〜９−Ｍの故
障処理実行部１５が各故障処理テーブル１６を参照する
ことにより、診断インターフェイス２１に対して、シス
テム構成テーブル２０に登録されているシステム構成を
更新するよう指示を与え、診断インターフェイス２１
が、システム構成テーブル２０に登録されているシステ
ム構成のなかから、ＳＶＰ９−３を削除する（ステップ
ＳＴ３５）。Next, if the interrupt signal is a notification indicating that a failure has occurred in a certain SVP (step ST3).
6), failure processing execution unit 1 in SVP 9-3 to 9-M
5 registers the faulty SVP in the fault management table 18a of each SVP management unit 18 (step ST3).
7). As described above, in this example, since it is assumed that the SVP 9-3 has a failure, the failure management table 1
The flag 3 of 8a is turned on. In addition, SVP9-3 is
Since the SVP has a failure, no such notification is given. Also in this case, the failure processing execution unit 15 of the SVPs 9-2 to 9-M refers to the respective failure processing tables 16 so that the system configuration registered in the system configuration table 20 is set to the diagnostic interface 21. Diagnostic interface 21 with instructions to update
Deletes SVP9-3 from the system configurations registered in the system configuration table 20 (step ST35).

【００３５】次に、その割込み信号が、システムＳＶＰ
の更新依頼に関する通知であれば（ステップＳＴ３
８）、ＳＶＰ９−２〜９−Ｍにおける故障処理実行部１
５が、各ＳＶＰ管理部１８の、代替システムＳＶＰ管理
テーブル１８ｂに代替のシステムＳＶＰを登録する（ス
テップＳＴ３９）。上述したように、この例では、ＳＶ
Ｐ９−２を代替のシステムＳＶＰとして選定しているの
で、代替システムＳＶＰ管理テーブル１８ｂのフラグ２
をオンにする一方、現在のシステムＳＶＰであるＳＶＰ
９−１はシステムＳＶＰでなくなるので、代替システム
ＳＶＰ管理テーブル１８ｂのフラグ１をオフにする。Next, the interrupt signal is the system SVP.
If it is a notification regarding the update request (step ST3
8), failure processing execution unit 1 in SVP 9-2 to 9-M
5 registers the alternative system SVP in the alternative system SVP management table 18b of each SVP management unit 18 (step ST39). As mentioned above, in this example, SV
Since P9-2 is selected as the alternative system SVP, flag 2 of the alternative system SVP management table 18b is selected.
While turning on the SVP which is the current system SVP
9-1 is no longer the system SVP, so the flag 1 of the alternative system SVP management table 18b is turned off.

【００３６】次に、その割込み信号が、システムＳＶＰ
の機能代替依頼に関する通知であれば（ステップＳＴ４
０）、ＳＶＰ９−２〜９−Ｍにおける故障処理実行部１
５が各診断インターフェイス２１にシステムＳＶＰの機
能代替依頼があったことを通知する。これにより、各診
断インターフェイス２１は、代替システムＳＶＰ管理テ
ーブル１８ｂを参照し、代替のシステムＳＶＰはＳＶＰ
９−２であることを認識する（ステップＳＴ４１）。こ
こで、各診断インターフェイス２１が、代替システムＳ
ＶＰ管理テーブル１８ｂを参照した結果、自己が代替の
システムＳＶＰであると認識した場合に限り、図６のス
テップＳＴ１２の処理に移行し、システムＳＶＰとして
システム管理を開始する。この例では、ＳＶＰ９−２の
みが図６のステップＳＴ１２の処理に移行する。Next, the interrupt signal is the system SVP.
If it is a notification regarding the function substitution request of (step ST4
0), failure processing execution unit 1 in SVP 9-2 to 9-M
5 notifies each diagnostic interface 21 that there has been a function replacement request for the system SVP. As a result, each diagnostic interface 21 refers to the alternative system SVP management table 18b, and the alternative system SVP is the SVP.
9-2 is recognized (step ST41). Here, each diagnostic interface 21 is replaced by the alternative system S.
As a result of referring to the VP management table 18b, only when the self recognizes that it is the alternative system SVP, the process proceeds to step ST12 in FIG. 6 and system management is started as the system SVP. In this example, only the SVP 9-2 shifts to the processing of step ST12 in FIG.

【００３７】最後に、その割込み信号が、システムＳＶ
Ｐからの自己診断要求であれば（ステップＳＴ４２）、
ＳＶＰ９−２〜９−Ｍにおける故障処理実行部１５が各
診断インターフェイス２１に自己診断要求があったこと
を通知する。これにより、各診断インターフェイス２１
は、自己診断部１９に対して自己診断を行わせる（ステ
ップＳＴ４３）。自己診断の結果、障害が発生していれ
ば、システムＳＶＰに対して異常応答信号を送信し（ス
テップＳＴ４５）、障害が発生していなければ、システ
ムＳＶＰに対して正常応答信号を送信する（ステップＳ
Ｔ４６）。Finally, the interrupt signal indicates that the system SV
If it is a self-diagnosis request from P (step ST42),
The failure processing execution unit 15 in the SVPs 9-2 to 9-M notifies each diagnostic interface 21 of the self-diagnosis request. As a result, each diagnostic interface 21
Causes the self-diagnosis unit 19 to perform self-diagnosis (step ST43). If a failure has occurred as a result of the self-diagnosis, an abnormal response signal is transmitted to the system SVP (step ST45), and if no failure has occurred, a normal response signal is transmitted to the system SVP (step ST45). S
T46).

【００３８】次に、ＣＰＵの動作を、図８のフローチャ
ート図を参照しつつ説明する。まず、ＣＰＵ８−１〜８
−Ｎは、受信処理部３３が診断用バスインターフェイス
３２を介して割込み信号を受信し（ステップＳＴ５
１）、その割込み信号がシステムＳＶＰからの自己診断
要求であれば（ステップＳＴ５２）、受信情報解析部３
４が各診断インターフェイス３８に自己診断要求があっ
たことを通知する。これにより、各診断インターフェイ
ス３８は、自己診断部３７に対して自己診断を行わせる
（ステップＳＴ５３）。Next, the operation of the CPU will be described with reference to the flowchart of FIG. First, the CPUs 8-1 to 8
-N, the reception processing unit 33 receives the interrupt signal through the diagnostic bus interface 32 (step ST5).
1) If the interrupt signal is a self-diagnosis request from the system SVP (step ST52), the reception information analysis unit 3
4 notifies each diagnostic interface 38 that there is a self-diagnosis request. Thereby, each diagnosis interface 38 causes the self-diagnosis unit 37 to perform self-diagnosis (step ST53).

【００３９】自己診断の結果、障害が発生していれば、
故障ＣＰＵ管理部３５（ＳＶＰの故障ＣＰＵ管理部１７
と同様の構成）に自己に障害が発生したことを登録する
（ステップＳＴ５５）。この例では、ＣＰＵ８−３に障
害が発生することにしているので、ＣＰＵ８−３がのみ
がステップＳＴ５５の処理を行い、故障ＣＰＵ管理部３
５のフラグ３をオンにする。If a failure has occurred as a result of self-diagnosis,
Failed CPU management unit 35 (SVP failed CPU management unit 17
The fact that a failure has occurred is registered in the same configuration (step ST55). In this example, since the failure occurs in the CPU 8-3, only the CPU 8-3 performs the process of step ST55, and the faulty CPU management unit 3
The flag 3 of 5 is turned on.

【００４０】そして、各診断インターフェイス３８は、
診断結果に応じたメッセージをシステムＳＶＰに対して
送信する（ステップＳＴ５６）。具体的には、障害が発
生していれば、異常応答信号を送信し、障害が発生して
いなければ、正常応答信号を送信する。Then, each diagnostic interface 38 is
A message corresponding to the diagnosis result is transmitted to the system SVP (step ST56). Specifically, if a failure has occurred, an abnormal response signal is transmitted, and if no failure has occurred, a normal response signal is transmitted.

【００４１】次に、その割込み信号が、あるＣＰＵに障
害が発生したことを示す通知であれば（ステップＳＴ５
７）、ＣＰＵ８−１〜９−Ｎにおける受信情報解析部３
４が、各故障ＣＰＵ管理部３５に障害の発生したＣＰＵ
を登録する（ステップＳＴ５８）。上述したように、こ
の例では、ＣＰＵ８−３に障害が発生したものとしてい
るので、各故障ＣＰＵ管理部５７のフラグ３をオンにす
る。なお、ＣＰＵ８−３は、障害の発生したＣＰＵであ
るので、かかる通知はなされない。Next, if the interrupt signal is a notification indicating that a failure has occurred in a certain CPU (step ST5).
7), the reception information analysis unit 3 in the CPUs 8-1 to 9-N
4 is a CPU in which a failure has occurred in each failed CPU management unit 35
Is registered (step ST58). As described above, in this example, it is assumed that the CPU 8-3 has failed, so the flag 3 of each failed CPU management unit 57 is turned on. Since the CPU 8-3 is the CPU in which the failure has occurred, such notification is not issued.

【００４２】最後に、その割込み信号が、システムＳＶ
Ｐの更新依頼に関する通知であれば（ステップＳＴ５
９）、ＣＰＵ８−１〜８−Ｎにおける受信情報解析部３
４が、各システムＳＶＰ登録テーブル３６（ＳＶＰの代
替システムＳＶＰ管理テーブル１８ｂと同様の構成）に
代替のシステムＳＶＰを登録する（ステップＳＴ６
０）。上述したように、この例では、ＳＶＰ９−２を代
替のシステムＳＶＰとして選定しているので、システム
ＳＶＰ登録テーブル３６のフラグ２をオンにする一方、
現在のシステムＳＶＰであるＳＶＰ９−１はシステムＳ
ＶＰでなくなるので、システムＳＶＰ登録テーブル３６
のフラグ１をオフにする。Finally, the interrupt signal indicates that the system SV
If it is a notification regarding the update request of P (step ST5
9), the reception information analysis unit 3 in the CPUs 8-1 to 8-N
4 registers the alternative system SVP in each system SVP registration table 36 (the same configuration as the alternative system SVP management table 18b of the SVP) (step ST6).
0). As described above, in this example, since the SVP 9-2 is selected as the alternative system SVP, while the flag 2 of the system SVP registration table 36 is turned on,
The current system SVP SVP9-1 is the system S
Since it is no longer a VP, the system SVP registration table 36
The flag 1 of is turned off.

【００４３】なお、その割込み信号が、上記の何れにも
該当しない場合は、詳述しないが、その割込み信号に応
じた処理を行う（ステップＳＴ６１）。If the interrupt signal does not correspond to any of the above, details will not be described, but processing corresponding to the interrupt signal is performed (step ST61).

【００４４】以上の説明から明らかなように、この実施
例１によれば、あるＣＰＵに障害が発生した場合におい
て、障害の発生したＣＰＵをシステムから切り離す機能
を有するだけでなく、システムＳＶＰに障害が発生した
場合には、他のＳＶＰにシステムＳＶＰを代替する機能
を有しているので、システムＳＶＰの障害発生に伴うシ
ステムへの悪影響をもたらされることなく、安定したシ
ステムの稼働が可能になる。As is apparent from the above description, according to the first embodiment, when a certain CPU fails, it has not only a function of disconnecting the failed CPU from the system, but also a failure of the system SVP. If a system SVP occurs, it has a function of replacing the system SVP with another SVP, so that stable system operation is possible without adversely affecting the system due to the occurrence of a system SVP failure. .

【００４５】実施例２．図９はこの発明の他の実施例に
よる計算機装置のＳＶＰの詳細な構成を示す構成図であ
り、図において、４０は自己の負荷状況を測定する負荷
率測定部、４１は自己が正常に動作しているか否かを診
断するとともに、その診断の際、負荷率測定部４０によ
り測定された負荷状況が所定値を越えている場合には自
己に障害が発生したものとみなす自己診断部である。Example 2. FIG. 9 is a configuration diagram showing a detailed configuration of an SVP of a computer apparatus according to another embodiment of the present invention. In the figure, 40 is a load factor measuring unit for measuring its own load condition, and 41 is a normal operation of the self. It is a self-diagnosis unit that diagnoses whether or not the failure occurs, and at the time of the diagnosis, if the load condition measured by the load factor measurement unit 40 exceeds a predetermined value, the self-diagnosis unit determines that a failure has occurred. .

【００４６】次に動作について説明する。まず、負荷率
測定部４０は、当該ＳＶＰが所定時間内において稼働し
ている時間を測定し、その稼働している時間の比率（負
荷状況）を演算する。一方、自己診断部４１は、実施例
１における自己診断部１９と同様にして、自己が正常に
動作しているか否かを診断するとともに、その診断の
際、負荷率測定部４０により測定された負荷状況が所定
値を越えている場合には、自己に障害が発生していなく
とも、自己に障害が発生したものとみなす。例えば、自
己の負荷状況が当該ＳＶＰの稼働能力の５０％を越えて
いる場合に、自己に障害が発生したものとみなす。これ
により、診断インターフェイス２１が代替のシステムＳ
ＶＰを選定する際、負荷状況が所定値を越えているＳＶ
Ｐは、選定候補からはずされることになるので、現在負
荷状況の低いＳＶＰ、即ち、システムＳＶＰの機能を付
加してもさほど負担にならないＳＶＰのみが、システム
ＳＶＰの選定候補になる。従って、代替のシステムＳＶ
Ｐは、確実に動作できるので、実施例１のものよりも、
さらに確実に、システムに与える悪影響を最小限に抑え
ることができる。なお、負荷率測定部４０及び自己診断
部４１以外は、実施例１と同様であるため説明を省略す
る。Next, the operation will be described. First, the load factor measuring unit 40 measures the time during which the SVP is operating within a predetermined time, and calculates the ratio of the operating time (load condition). On the other hand, the self-diagnosis unit 41 diagnoses whether or not the self is operating normally in the same manner as the self-diagnosis unit 19 in the first embodiment, and at the time of the diagnosis, it is measured by the load factor measurement unit 40. If the load status exceeds a predetermined value, it is considered that a failure has occurred even if the failure has not occurred. For example, if the load condition of the self exceeds 50% of the operating capacity of the SVP, it is considered that the self has a failure. This allows the diagnostic interface 21 to replace the system S
When selecting a VP, the load condition exceeds the specified value SV
Since P is excluded from the selection candidates, only SVPs having a low load condition at present, that is, SVPs that do not cause much burden even if the function of the system SVP is added, become the selection candidates for the system SVP. Therefore, the alternative system SV
Since P can operate reliably,
Further, it is possible to surely minimize the adverse effect on the system. Note that, except for the load factor measurement unit 40 and the self-diagnosis unit 41, the description is omitted because it is the same as in the first embodiment.

【００４７】実施例３．上記実施例では、各ＳＶＰに、
故障処理テーブル１６、故障ＣＰＵ管理部１７、ＳＶＰ
管理部１８及びシステム構成テーブル２０を配置し、ま
た、各ＣＰＵに、故障ＣＰＵ管理部３５及びシステムＳ
ＶＰ登録テーブル３６を配置したものを示したが、これ
ら各テーブル及び管理部を、図１０に示すように、例え
ば、バッテリバックアップメモリなどで構成される共有
メモリ４２として構成するとともに、その共有メモリ４
２を管理するメンテナンスツール４３を設け、オンライ
ンによって情報の一元管理を行ってもよい。これによ
り、メンテナンス性とフレキシビリティがさらに向上す
る。なお、実施例３における各ＳＶＰ及びＣＰＵの詳細
な構成を図１１及び図１２に示す。Example 3. In the above embodiment, in each SVP,
Failure processing table 16, failed CPU management unit 17, SVP
The management unit 18 and the system configuration table 20 are arranged, and each CPU has a failed CPU management unit 35 and a system S.
Although the arrangement of the VP registration table 36 is shown, as shown in FIG. 10, each of these tables and the management unit is configured as a shared memory 42 including, for example, a battery backup memory, and the shared memory 4 is also provided.
It is also possible to provide a maintenance tool 43 for managing 2 and perform unified management of information online. This further improves maintainability and flexibility. The detailed configuration of each SVP and CPU in the third embodiment is shown in FIGS. 11 and 12.

【００４８】[0048]

【発明の効果】以上のように、請求項１の発明によれ
ば、自己がシステム管理を行っている場合には、自己診
断部により自己に障害が発生したものと診断されるとシ
ステム管理を中止するとともに、待機用のサービスプロ
セッサに対してシステム管理を開始すべき旨の指令を出
力するように構成したので、障害の発生したサービスプ
ロセッサがシステムから切り離され、待機用のサービス
プロセッサがシステム管理を行うようになり、サービス
プロセッサに障害が発生した場合でも、システムに与え
る悪影響を最小限に抑えることができるなどの効果があ
る。As described above, according to the first aspect of the present invention, when the system is managed by itself, if the self-diagnosis unit diagnoses that a failure has occurred, the system management is performed. Since it is configured to output the command to start system management to the standby service processor when it is stopped, the faulty service processor is disconnected from the system, and the standby service processor is managed by the system. With such a configuration, even if a failure occurs in the service processor, the adverse effect on the system can be minimized.

【００４９】また、請求項２の発明によれば、自己がシ
ステム管理を行っている場合には、各待機用のサービス
プロセッサに対して自己診断をすべき旨の指令を出力す
ることにより、各待機用のサービスプロセッサからの正
常応答信号の有無によって各待機用のサービスプロセッ
サの診断結果を把握しておき、自己診断部により自己に
障害が発生したものと診断されるとシステム管理を中止
するとともに、その診断結果に基づいて正常な待機用の
サービスプロセッサに対してシステム管理を開始すべき
旨の指令を出力するように構成したので、障害の発生し
たサービスプロセッサがシステムから切り離され、正常
な待機用のサービスプロセッサがシステム管理を行うよ
うになり、請求項１の発明より、さらに確実に、システ
ムに与える悪影響を最小限に抑えることができるなどの
効果がある。According to the second aspect of the present invention, when the system is managed by itself, by outputting a command to the self-diagnosis to each standby service processor, The diagnosis result of each standby service processor is known based on the presence / absence of a normal response signal from the standby service processor. When the self-diagnosis unit diagnoses that a failure has occurred, system management is stopped. Since it is configured to output a command to start system management to the normal standby service processor based on the diagnosis result, the faulty service processor is disconnected from the system and the normal standby The service processor for use in the system now manages the system, and more reliably than the invention of claim 1, adversely affects the system. There is an effect, such as can be kept to a minimum.

【００５０】さらに、請求項３の発明によれば、負荷率
測定部により測定された負荷状況が所定値を越えている
場合には自己に障害が発生したものとみなすように構成
したので、稼働率の低いサービスプロセッサのみが代替
用のサービスプロセッサとして選定されるようになり、
請求項１の発明より、さらに確実に、システムに与える
悪影響を最小限に抑えることができるなどの効果があ
る。Further, according to the invention of claim 3, when the load condition measured by the load factor measuring unit exceeds a predetermined value, it is considered that a failure has occurred in itself, so that the operation is performed. Only service processors with low rates are selected as alternative service processors,
According to the invention of claim 1, there is an effect that the adverse effect on the system can be more surely suppressed to the minimum.

[Brief description of drawings]

【図１】この発明の一実施例による計算機装置を示す構
成図である。FIG. 1 is a configuration diagram showing a computer apparatus according to an embodiment of the present invention.

【図２】ＳＶＰ９−１〜９−Ｍの詳細な構成を示す構成
図である。FIG. 2 is a configuration diagram showing a detailed configuration of SVPs 9-1 to 9-M.

【図３】ＣＰＵ８−１〜８−Ｍの詳細な構成を示す構成
図である。FIG. 3 is a configuration diagram showing a detailed configuration of CPUs 8-1 to 8-M.

【図４】故障ＣＰＵ管理部の構成を示す構成図である。FIG. 4 is a configuration diagram showing a configuration of a failed CPU management unit.

【図５】ＳＶＰ管理部の構成を示す構成図である。FIG. 5 is a configuration diagram showing a configuration of an SVP management unit.

【図６】システムＳＶＰの動作を示すフローチャート図
である。FIG. 6 is a flowchart showing the operation of the system SVP.

【図７】待機用のＳＶＰの動作を示すフローチャート図
である。FIG. 7 is a flowchart showing the operation of a standby SVP.

【図８】ＣＰＵの動作を示すフローチャート図である。FIG. 8 is a flowchart showing the operation of the CPU.

【図９】この発明の他の実施例による計算機装置のＳＶ
Ｐの詳細な構成を示す構成図である。FIG. 9 is an SV of a computer device according to another embodiment of the present invention.
It is a block diagram which shows the detailed structure of P.

【図１０】この発明の他の実施例による計算機装置を示
す構成図である。FIG. 10 is a block diagram showing a computer system according to another embodiment of the present invention.

【図１１】図１０の計算機装置のＳＶＰの詳細な構成を
示す構成図である。11 is a configuration diagram showing a detailed configuration of an SVP of the computer device of FIG.

【図１２】図１１の計算機装置のＣＰＵの詳細な構成を
示す構成図である。12 is a configuration diagram showing a detailed configuration of a CPU of the computer device of FIG.

【図１３】従来の計算機装置を示す構成図である。FIG. 13 is a configuration diagram showing a conventional computer apparatus.

【図１４】従来の計算機装置の動作を示すフローチャー
ト図である。FIG. 14 is a flowchart showing an operation of a conventional computer device.

[Explanation of symbols]

８−１〜８−ＮＣＰＵ（中央処理装置）９−１〜９−ＭＳＶＰ（サービスプロセッサ）１５故障処理実行部（切替制御部）１９、４１自己診断部２１診断インターフェイス（切替制御部）４０負荷率測定部 8-1 to 8-N CPU (Central Processing Unit) 9-1 to 9-M SVP (Service Processor) 15 Failure Processing Execution Unit (Switch Control Unit) 19, 41 Self-Diagnosis Unit 21 Diagnostic Interface (Switch Control Unit) 40 Load factor measurement section

Claims

[Claims]

1. A self-diagnosis function for diagnosing whether or not the self is operating normally, and a self-diagnosis result,
When operating normally, a plurality of central processing units that send normal response signals and a command to self-diagnose each of the above central processing units are output, and normal response signals are sent within a predetermined time. If there is a central processing unit that does not send, it is determined that a failure has occurred in the central processing unit, and the central processing unit is disconnected from the system. In addition to the standby service processor, a self-diagnosis unit that diagnoses whether or not it is operating normally, and a self-diagnosis unit that causes self-diagnosis when the system is managed by itself If it is diagnosed that the system management is stopped, the system management is stopped and the standby service processor is instructed to start the system management. On the other hand, each service processor is equipped with a switching controller that starts system management when it receives a command from the service processor that is performing system management to start system management A computer device characterized by the above.

2. A self-diagnosis function for diagnosing whether or not the self is operating normally, and a self-diagnosis result,
When operating normally, a plurality of central processing units that send normal response signals and a command to self-diagnose each of the above central processing units are output, and normal response signals are sent within a predetermined time. If there is a central processing unit that does not send, it is determined that a failure has occurred in the central processing unit, and the central processing unit is disconnected from the system. In addition to multiple standby service processors, a self-diagnosis unit for diagnosing whether or not the self is operating normally, and a self-diagnosis service processor for each of the above standby service processors when the system is managed by itself. By outputting a command to the effect that self-diagnosis should be performed, the standby processor Over previously know the service processor diagnostic result, with the self failure is diagnosed as occurring to stop the system managed by the self-diagnosis unit,
Based on the above diagnosis result, it outputs a command to the normal service processor for standby to start system management. On the other hand, when it is in standby, the service processor performing system management performs system management. A computer apparatus, wherein each service processor is provided with a switching control unit that starts system management when receiving a command to start the service.

3. Each of the service processors is provided with a load factor measuring unit that measures its own load condition, and the self-diagnosis unit is configured to determine whether the load condition measured by the load factor measuring unit exceeds a predetermined value. 3. The computer system according to claim 1, wherein the computer is regarded as having a failure.