JP2000322274A

JP2000322274A - Hierarchical recovery device for operating system

Info

Publication number: JP2000322274A
Application number: JP11129781A
Authority: JP
Inventors: Koji Ebara; 広治江原
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1999-05-11
Filing date: 1999-05-11
Publication date: 2000-11-24

Abstract

PROBLEM TO BE SOLVED: To obtain a hierarchical recovery device for operating system which can avoid system down and minimizes the influence of a fault. SOLUTION: If a trap handler 3 judges that restoration with a recovery routine 11 is necessary when an exception is generated due to wrong address reference of modules 1 which is modularized for each function and which are provided with the recovery routine 11, a module which is the cause of the exception is specified by a recovery manager 4, and the recovery routine 11 of the module 1 is executed, and initialization or partial disconnection of a faulty part is executed by the recovery routine 11 to try the restoration of the entire system. It restoration is impossible in this case, a recovery routine 21 of a sub-system 2 to which the module 1 belongs is executed to perform the initialization or disconnection of the faulty part.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、オペレーティン
グシステムの障害時に、障害が発生した個所から階層的
にリカバリ処理を行ない、障害による影響を最小限にと
どめつつ、システム全体がダウンするような重大な障害
を回避するようにしたオペレーティングシステムの階層
的リカバリ装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention provides a method for performing a recovery process hierarchically from a location where a failure has occurred in the event of a failure of an operating system, minimizing the influence of the failure and reducing the entire system down. The present invention relates to a hierarchical recovery device of an operating system for avoiding a failure.

【０００２】[0002]

【従来の技術】オペレーティングシステムは、周知のよ
うにコンピュータを作動させるときに必ず必要となるも
っとも基本的なシフトウェアである。このシフトウェア
は、コンピュータ使用に当たってのソフトウェア上の共
通の規格であり、コンピュータのあらゆるユーザが使用
するものであり、技術的にも高度で特殊な分野に属する
ソフトウェアである。オペレーティングシステムに、各
種のアプリケーションを適用してコンピュータを動作さ
せている。2. Description of the Related Art As is well known, an operating system is the most basic shiftware required when operating a computer. This shiftware is a common standard for software when using a computer, is used by all users of computers, and is software that belongs to a technically advanced and special field. A computer is operated by applying various applications to an operating system.

【０００３】[0003]

【発明が解決しようとする課題】従来のオペレーティン
グシステムには、次のような課題があった。すなわち、
あるモジュールが不正アドレス参照などをすると、シス
テム全体がダウンしてしまうことである。このモジュー
ルがシステムの主要なタスクに関係がなく、このモジュ
ール自体の動作が停止してもシステム全体で見れば影響
が少ないような場合でも、システム全体がダウンしてし
まい、このモジュールのためにシステムの主要なタスク
も停止させてしまうという課題があった。The conventional operating system has the following problems. That is,
When a certain module refers to an illegal address, the whole system goes down. Even if this module has nothing to do with the main task of the system, and if the operation of this module itself stops working but the whole system has little effect, the whole system will go down and the system There was a problem that also stopped the main task of.

【０００４】なお、近似技術として、たとえば、特開昭
６２−２７６６５２号公報には、端末装置を通信通信制
御プログラムにより制御する通信制御手段の誤処理によ
る異常発生時にシステムダウンを防止し、異常発生要因
を除去することにより、システムの継続運転を可能とす
る機能回復方式について開示されているが、障害の発生
箇所から階層的にリカバリ処理を行って障害による影響
を最小限にとどめつつ、システム全体のダウンの回避を
可能にすることについて教示がなされていない。As an approximation technique, for example, Japanese Patent Application Laid-Open No. 62-276652 discloses a system for preventing a system from being down when an error occurs due to erroneous processing of communication control means for controlling a terminal device by a communication communication control program. A function recovery method that enables continuous operation of the system by removing the factors is disclosed.However, the recovery processing is performed hierarchically from the location where the failure occurred, minimizing the impact of the failure, There is no teaching about enabling the avoidance of downtime.

【０００５】この発明は、上記従来の課題を解決するた
めになされたもので、従来ではシステム全体がダウンし
たような障害でも、回避することができるとともに、階
層的な復旧を行うことにより、障害の影響を最小限にと
どめることができるオペレーティングシステムの階層的
リカバリ装置を提供することを目的とする。SUMMARY OF THE INVENTION The present invention has been made to solve the above-mentioned conventional problems. In the prior art, it is possible to avoid a failure in which the entire system has been down, and to perform a hierarchical restoration to thereby prevent the failure. It is an object of the present invention to provide an operating system hierarchical recovery apparatus capable of minimizing the influence of the above.

【０００６】また、この発明は、モジュールレベルで障
害を検出した場合に、障害原因がそのモジュール内では
解決できないサブシステムレベルの障害であっても、階
層的なリカバリルーチンにより復旧が可能となるオペレ
ーティングシステムの階層的リカバリ装置を提供するこ
とを目的とする。Further, according to the present invention, when a failure is detected at a module level, even if the cause of the failure is a subsystem-level failure that cannot be resolved within the module, an operating system capable of recovering by a hierarchical recovery routine. It is an object of the present invention to provide a hierarchical recovery apparatus for a system.

【０００７】[0007]

【課題を解決するための手段】上記目的を達成するため
に、この発明のオペレーティングシステムの階層的リカ
バリ装置は、オペレーティングシステムのカーネル部に
含まれ、機能ごとにモジュール化されたモジュールで不
正アドレス参照を行って例外が発生した場合にリカバリ
ルーチンによる復旧の必要の有無を判断するトラップハ
ンドラと、上記トラップハンドラが上記復旧の必要性あ
りと判断した場合には、上記例外の原因となったモジュ
ールを特定してそのモジュールのリカバリルーチンを実
行するリカバリマネージャと、上記モジュールを含み、
上記リカバリルーチンの実行により復旧しない場合に自
己の備えるリカバリルーチンを実行するサブシステムと
を備えることを特徴とする。そのため、モジュールで不
正アドレス参照を行って例外が発生するとオペレーティ
ングシステムのカーネル部に含まれるトラップハンドラ
は、リカバリルーチンにより復旧が必要であるか、否か
の判断を行い、その判断の結果、復旧が必要な場合に
は、トラップハンドラはリカバリマネージャに制御を渡
すことにより、リカバリマネージャは例外の原因となっ
たモジュールを特定し、モジュールのリカバリルーチン
を実行し、システム全体がダウンすることを回避する復
旧を試みるが、復旧できない場合にはモジュールが属す
るサブシステムのリカバリルーチンを実行するようにし
たので、従来ではシステム全体がダウンしたような障害
でも、回避することができるとともに、障害の影響を最
小限にとどめることができる。In order to achieve the above object, a hierarchical recovery apparatus for an operating system according to the present invention is included in a kernel section of an operating system, and refers to an illegal address by a module that is modularized for each function. And the trap handler that determines whether recovery by the recovery routine is necessary when an exception occurs, and the module that caused the exception when the trap handler determines that recovery is necessary. A recovery manager that identifies and executes a recovery routine for the module, including the module,
A sub-system for executing a recovery routine provided by itself when recovery is not performed by execution of the recovery routine. Therefore, if an exception occurs due to a reference to an illegal address in the module, the trap handler included in the kernel part of the operating system determines whether or not recovery is necessary by a recovery routine, and as a result of the determination, recovery is If necessary, the trap handler passes control to the recovery manager, which identifies the module that caused the exception, executes the module's recovery routine, and prevents the entire system from going down. However, if recovery is not possible, the system executes the recovery routine of the subsystem to which the module belongs. Can be stopped.

【０００８】また、この発明のオペレーティングシステ
ムの階層的リカバリ装置は、オペレーティングシステム
のカーネル部に含まれ、機能ごとにモジュール化され、
オペレーティングシステムの障害検出時にリカバリルー
チンによる復旧の必要の有無を判断するモジュールと、
上記モジュールが上記復旧の必要性ありと判断した場合
には、上記モジュールのリカバリルーチンを実行するリ
カバリマネージャと、上記モジュールを含み、上記リカ
バリルーチンの実行により復旧しない場合に自己の備え
るリカバリルーチンを実行するサブシステムとを備える
ことを特徴とする。そのため、モジュールが障害を検出
すると、モジュールはそのリカバリルーチンにより復旧
が必要であるとの判断時には、リカバリマネージャはモ
ジュールのリカバリルーチンを実行し、障害に対する復
旧を試みるが、復旧できない場合にサブシステムのレベ
ルの障害が原因であると判断されたときには、モジュー
ルの属するサブシステムのリカバリルーチンを実行し
て、障害を回避するようにしたので、モジュールレベル
で障害を検出した場合に、障害原因がそのモジュール内
では解決できないサブシステムレベルの障害であって
も、階層的なリカバリルーチンにより復旧が可能とな
る。The hierarchical recovery apparatus for an operating system according to the present invention is included in a kernel section of the operating system and is modularized for each function.
A module for determining whether recovery by a recovery routine is necessary when an operating system failure is detected,
If the module determines that the recovery is necessary, execute the recovery manager that executes the recovery routine of the module and the recovery routine that includes the module and that is provided when the recovery is not performed by executing the recovery routine. And a subsystem that performs Therefore, when a module detects a failure, the recovery manager executes the module's recovery routine and attempts to recover from the failure when it determines that recovery is necessary according to its recovery routine. When it is determined that the failure is caused by the failure at the module level, the recovery routine of the subsystem to which the module belongs is executed to avoid the failure. Even a subsystem-level failure that cannot be resolved within the system can be recovered by a hierarchical recovery routine.

【０００９】[0009]

【発明の実施の形態】次に、この発明によるオペレーテ
ィンングシステムの階層的リカバリ装置の実施の形態に
ついて図面に基づき説明する。図１は、この発明の第１
実施の形態の構成を示す機能ブロック図である。この図
１において、オペレーティングシステムのカーネル部
に、モジュール１と、サブシステム２と、トラップハン
ドラ３と、リカバリマネージャ４とを含む。モジュール
１はモジュールのリカバリルーチン１１を備え、サブシ
ステム２はサブシステムのリカバリルーチン２１を備え
ている。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Next, an embodiment of a hierarchical recovery apparatus for an operating system according to the present invention will be described with reference to the drawings. FIG. 1 shows a first embodiment of the present invention.
FIG. 2 is a functional block diagram showing a configuration of the embodiment. In FIG. 1, the kernel part of the operating system includes a module 1, a subsystem 2, a trap handler 3, and a recovery manager 4. The module 1 has a module recovery routine 11 and the subsystem 2 has a subsystem recovery routine 21.

【００１０】モジュール１は機能ごとにモジュール化さ
れたものである。サブシステム２は関連するモジュール
をグループ化して、それらの制御を行ない、ある程度ま
とまった機能を提供する。サブシステム２には、モジュ
ール１と同じモジュールが複数個配置されている。ま
た、上記サブシステム２は、複数台配置される場合があ
り、図１に示す第１実施の形態では、３台配置されてい
る場合を例示しており、サブシステム２ａ，２ｂがサブ
システム２の他に追加されている。The module 1 is modularized for each function. The subsystem 2 groups related modules, controls them, and provides a certain set of functions. In the subsystem 2, a plurality of modules same as the module 1 are arranged. Further, there are cases where a plurality of subsystems 2 are arranged, and the first embodiment shown in FIG. 1 illustrates a case where three subsystems 2 are arranged. Others have been added.

【００１１】これらのサブシステム２ａ，２ｂはサブシ
ステム２と同一構成をなしているから、サブシステム２
の内部構成と同一部分には同一符号にアルファベットの
「ａ」，「ｂ」の添え字を付すのみにとどめ、その内部
構成の重複説明を省略する。トラップハンドラ３は例外
が発生したときに、例外の処理を行なう。リカバリマネ
ージャ４は復旧が必要な場合にトラップハンドラ３から
呼び出され、リカバリルーチン１１およびリカバリルー
チン２１の実行を制御する。Since these subsystems 2a and 2b have the same configuration as the subsystem 2, the subsystem 2
The same reference numerals as in FIG. 1 denote the same parts, and the same reference numerals are added to the letters “a” and “b”. When an exception occurs, the trap handler 3 processes the exception. The recovery manager 4 is called from the trap handler 3 when recovery is necessary, and controls the execution of the recovery routine 11 and the recovery routine 21.

【００１２】次に、この第１実施の形態の動作について
図１および図２のフローチャートを参照して詳細に説明
する。たとえば、ＬＡＮドライバであるモジュール１で
不正アドレス参照を行ない、例外が発生した場合（図２
のステップＡ１）、トラップハンドラ３はリカバリルー
チンによる復旧が必要かどうかを判断する（ステップＡ
２）。この判断の結果、復旧が必要な場合は、トラップ
ハンドラ３はリカバリーマネージャ４に制御を渡す。Next, the operation of the first embodiment will be described in detail with reference to the flowcharts of FIGS. For example, when an illegal address is referenced in module 1 which is a LAN driver and an exception occurs (see FIG. 2).
Step A1), the trap handler 3 determines whether or not recovery by the recovery routine is necessary (Step A).
2). As a result of this determination, if recovery is necessary, the trap handler 3 passes control to the recovery manager 4.

【００１３】これにより、リカバリマネージャ４は例外
の原因となったモジュールを特定し、モジュール１のリ
カバリルーチン１１を実行する（ステップＡ３）。リカ
バリルーチン１１では、初期化処理を行なうか障害個所
を部分的に切り離すかして、システム全体がダウンする
ことを回避し、復旧を試みる。この復旧の実行により、
リカバリルーチン１１で復旧できたか、否かの判断を行
い（ステップＡ４）、その判断の結果、これで復旧でき
ないと判断された場合は、モジュール１が属するサブシ
ステム２、たとえばネットワークサブシステムのリカバ
リルーチン２１を実行する（ステップＡ５）。As a result, the recovery manager 4 specifies the module that caused the exception, and executes the recovery routine 11 of the module 1 (step A3). In the recovery routine 11, an initialization process is performed or a faulty part is partially separated to prevent the whole system from going down and to attempt recovery. By performing this recovery,
A determination is made in the recovery routine 11 as to whether or not the recovery was successful (step A4). As a result of the determination, if it is determined that the recovery is not possible, the recovery routine of the subsystem 2 to which the module 1 belongs, for example, the network subsystem 21 is executed (step A5).

【００１４】リカバリールーチン２１では、初期化処理
を行なうか、障害個所を切り離すかする。たとえば、ネ
ットワーク全体の初期化やネットワークの停止を行なう
ことになる。ここまでの動作で復旧できたか、否かをリ
カバリマネージャ４で判断して、その判断の結果、復旧
したと判断すれば（ステップＡ６）、障害の影響を最小
限に押えつつ、システムダウンを回避できたことにな
り、一連の処理を終了する。また、上記判断の結果、復
旧できていないと判断された場合には、既存のトラップ
ハンドラによる実行を行い（ステップＡ７）、再び、ス
テップＡ２での処理である復旧が必要か、否のチェック
を行う処理に戻る。The recovery routine 21 performs an initialization process or separates a failed part. For example, the entire network is initialized or the network is stopped. The recovery manager 4 determines whether or not the operation has been recovered by the above operation. If it is determined that the recovery has been performed (step A6), the system is prevented from being down while minimizing the influence of the failure. Then, a series of processes is completed. If it is determined that the restoration has not been completed, the execution is performed by the existing trap handler (step A7), and it is again checked whether or not the restoration in step A2 is necessary. Return to the processing to be performed.

【００１５】次に、この発明による第２実施の形態につ
いて図面を参照して説明する。図３はこの第２実施の形
態の構成を示す機能ブロック図である。この図３に示す
第２実施の形態の場合には、リカバリマネージャ４が、
例外の延長で動作することに代わり、モジュール１が障
害を検出した延長で動作する点で異なる。その他の部分
は、図１に示す第１実施の形態と同じである。Next, a second embodiment of the present invention will be described with reference to the drawings. FIG. 3 is a functional block diagram showing the configuration of the second embodiment. In the case of the second embodiment shown in FIG. 3, the recovery manager 4
The difference is that the module 1 operates on the extension in which the failure is detected, instead of operating on the extension of the exception. Other parts are the same as those of the first embodiment shown in FIG.

【００１６】次に、この第２実施の形態の動作を図４の
フローチャートを参照して説明する。この図４におい
て、モジュール１が障害を検出した場合（図４のステッ
プＡ１１）、モジュール１はリカバリルーチン１１によ
る復旧が必要かどうかを判断する（ステップＡ１２）。
この判断の結果、復旧が必要な場合はリカバリマネージ
ャ４に制御を渡す。リカバリーマネージャ４はモジュー
ル１のリカバリルーチン１１を実行する（ステップＡ１
３）。Next, the operation of the second embodiment will be described with reference to the flowchart of FIG. In FIG. 4, when the module 1 detects a failure (step A11 in FIG. 4), the module 1 determines whether recovery by the recovery routine 11 is necessary (step A12).
As a result of this determination, if recovery is necessary, control is passed to the recovery manager 4. The recovery manager 4 executes the recovery routine 11 of the module 1 (Step A1)
3).

【００１７】リカバリールーチン１１では、初期化処理
を行なうか障害個所を部分的に切り離すかして復旧を試
みる。この試みによる結果を判断して（ステップＡ１
４）、復旧がなされていると判断した場合には、処理を
終了するが、復旧されていない場合で、サブシステムレ
ベルの障害が原因と判断した場合、モジュール１が属す
るサブシステム２のリカバリールーチン２１を実行する
（ステップＡ１５）。In the recovery routine 11, recovery is attempted by performing initialization processing or partially isolating a faulty part. The result of this attempt is determined (step A1
4) If it is determined that the recovery has been performed, the processing is terminated. If the recovery has not been performed, and if it is determined that a failure at the subsystem level is the cause, the recovery routine of the subsystem 2 to which the module 1 belongs. 21 is executed (step A15).

【００１８】リカバリルーチン２１では初期化処理を行
なうか障害個所を切り離すかする。この動作により、モ
ジュールレベルで障害を検出した場合に、障害原因がそ
のモジュール内では解決できないサブシステムレベルの
障害であっても、階層的なリカバリルーチンにより復旧
が可能になる。In the recovery routine 21, an initialization process is performed or a faulty part is separated. With this operation, when a failure is detected at the module level, even if the cause of the failure is a subsystem-level failure that cannot be resolved within that module, recovery can be performed by a hierarchical recovery routine.

【００１９】[0019]

【発明の効果】以上のように、この発明によれば、モジ
ュールで不正アドレス参照を行い、例外発生時にトラッ
プハンドラによりリカバリルーチンによる復旧の必要性
ありと判断すると、リカバリマネージャにより例外の原
因となったモジュールを特定してモジュールのリカバリ
ルーチンを実行し、復旧不可能時にサブシステムのリカ
バリルーチンを実行するようにしたので、従来のオペレ
ーティングシステムでシステム全体がダウンしていたよ
うな障害でも、復旧手段を実装することにより、それを
回避することが可能となるとともに、階層的に復旧手段
を試みていることから、障害の影響を最小限に押さえる
ことができる。As described above, according to the present invention, when an illegal address is referred to in the module and it is determined by the trap handler that recovery is required by the recovery routine when an exception occurs, the recovery manager causes the exception. The recovery routine of the module is specified and the recovery routine of the subsystem is executed when recovery is not possible. By implementing, it is possible to avoid this, and since the recovery means is tried hierarchically, the effect of the failure can be minimized.

【００２０】また、この発明によれば、モジュールが障
害を検出すると、モジュールはそのリカバリルーチンに
より復旧の必要性の判断時に、リカバリマネージャはモ
ジュールのリカバリルーチンを実行し、障害に対する復
旧を試みるが、復旧できない場合にサブシステムのレベ
ルの障害が原因であると判断されたときには、モジュー
ルの属するサブシステムのリカバリルーチンを実行し
て、障害を回避するようにしたので、モジュールレベル
で障害を検出した場合に、障害原因がそのモジュール内
では解決できないサブシステムレベルの障害であって
も、階層的なリカバリルーチンにより復旧が可能とな
る。According to the present invention, when the module detects a failure, the recovery manager executes the recovery routine of the module and attempts recovery from the failure when the recovery routine determines the necessity of recovery by the recovery routine. If it is determined that the failure is caused by a subsystem-level failure when recovery is not possible, the failure is avoided by executing the recovery routine of the subsystem to which the module belongs. Furthermore, even if the cause of the failure is a subsystem-level failure that cannot be resolved in the module, recovery can be performed by a hierarchical recovery routine.

[Brief description of the drawings]

【図１】この発明によるオペレーティングシステムの階
層的リカバリ装置の第１実施の形態の構成を示す機能ブ
ロック図である。FIG. 1 is a functional block diagram showing a configuration of a first embodiment of a hierarchical recovery device for an operating system according to the present invention.

【図２】この発明によるオペレーティングシステムの階
層的リカバリ装置の第１実施の形態の動作を説明するた
めのフローチャートである。FIG. 2 is a flowchart for explaining the operation of the first embodiment of the hierarchical recovery apparatus for an operating system according to the present invention;

【図３】この発明によるオペレーティングシステムの階
層的リカバリ装置の第２実施の形態の構成を示す機能ブ
ロック図である。FIG. 3 is a functional block diagram showing a configuration of a second embodiment of the hierarchical recovery apparatus for an operating system according to the present invention;

【図４】この発明によるオペレーティングシステムの階
層的リカバリ装置の第２実施の形態の動作を説明するた
めのフローチャートである。FIG. 4 is a flowchart for explaining the operation of the second embodiment of the hierarchical recovery apparatus for an operating system according to the present invention;

[Explanation of symbols]

１，１ａ，１ｂ……モジュール、２，２ａ，２ｂ……サ
ブシステム、３…トラップハンドラ、４……リカバリマ
ネージャ、１１，１１ａ，１１ｂ…モジュールのリカバ
リルーチン、２１，２１ａ，２１ｂ……サブシステムの
リカバリルーチン。1, 1a, 1b ... module, 2, 2a, 2b ... subsystem, 3 ... trap handler, 4 ... recovery manager, 11, 11a, 11b ... module recovery routine, 21, 21a, 21b ... subsystem Recovery routine.

Claims

[Claims]

1. A trap handler that is included in a kernel section of an operating system and that determines whether or not recovery by a recovery routine is necessary when an exception occurs by referring to an illegal address in a module modularized for each function, and If the trap handler determines that the recovery is necessary, a recovery manager that identifies the module that caused the exception and executes a recovery routine of the module; A sub-system for executing a recovery routine provided by the self-executing system when recovery is not performed by execution; and a hierarchical recovery apparatus for an operating system.

2. The hierarchical recovery apparatus according to claim 1, wherein the module is a LAN driver.

3. The operating system according to claim 1, wherein the recovery manager performs an initialization process at the time of executing a recovery routine of the module to prevent the entire system from going down, and attempts recovery. Hierarchical recovery device.

4. The recovery manager according to claim 1, wherein at the time of execution of the recovery routine of the module, a fault location is partially cut off to prevent the entire system from going down.
The hierarchical recovery apparatus for an operating system according to claim 1, wherein recovery is attempted.

5. The subsystem including the module,
The hierarchical recovery device for an operating system according to claim 1, wherein the hierarchical recovery device is a network subsystem.

6. The hierarchical recovery apparatus of an operating system according to claim 1, wherein said subsystem performs an initialization process when executing a recovery routine provided in said subsystem.

7. The operating system hierarchical recovery apparatus according to claim 1, wherein said subsystem performs initialization processing of the entire network when executing a recovery routine provided therein.

8. The hierarchical recovery apparatus for an operating system according to claim 1, wherein said subsystem separates a failure point when executing a recovery routine provided in said subsystem.

9. The hierarchical recovery system for an operating system according to claim 1, wherein the subsystem stops the network when executing the recovery routine provided in the subsystem.

10. A module that is included in a kernel part of an operating system, is modularized for each function, and determines whether or not recovery by a recovery routine is necessary when a failure of the operating system is detected. If it is determined that the recovery manager executes the recovery routine of the module, and a subsystem that includes the module and executes its own recovery routine when the recovery is not performed by executing the recovery routine. Characterized hierarchical recovery device of operating system.

11. The hierarchical recovery apparatus of an operating system according to claim 10, wherein said recovery manager performs an initialization process when executing a recovery routine of said module.

12. The recovery manager according to claim 10, wherein the recovery manager attempts recovery by partially disconnecting a failure part when executing a recovery routine of the module.
A hierarchical recovery device for the described operating system.

13. The hierarchical recovery apparatus for an operating system according to claim 10, wherein said subsystem performs initialization processing of the entire network when executing a recovery routine provided in said subsystem.

14. The hierarchical recovery system for an operating system according to claim 10, wherein said subsystem isolates a failure point when executing a recovery routine provided in said subsystem.