JP2000181823A

JP2000181823A - Fault tolerance network management system

Info

Publication number: JP2000181823A
Application number: JP10353603A
Authority: JP
Inventors: Yasuhiro Kojima; 康博小島
Original assignee: NEC Communication Systems Ltd
Current assignee: NEC Communication Systems Ltd
Priority date: 1998-12-11
Filing date: 1998-12-11
Publication date: 2000-06-30
Anticipated expiration: 2018-12-11
Also published as: JP3256506B2

Abstract

PROBLEM TO BE SOLVED: To enable a subsidiary sub-manager to economically realize vicarious execution management in the case of fault in a main sub-manager with a simple configuration and secure fault tolerance performance to the fault of the main sub-manager. SOLUTION: This system is provided with plural local area networks 1, 2 and 3 connected to a wide area network 5 and main sub-managers 11 and 12 which are connected to the local area networks 1, 2 and 3 having no redundant configuration. The plural subsidiary sub-managers 14 and 15 having the redundant configuration being smaller than that in the main sub-managers at the side of an integrated manager is allowed to vicariously execute the function of either one of the main sub-managers 11 and 12 where fault occurs.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、エージェント，
サブマネージャ，統合マネージャにより階層的に管理さ
れるフォールトトレラントネットワーク管理システムに
関する。The present invention relates to an agent,
The present invention relates to a fault-tolerant network management system that is hierarchically managed by a sub-manager and an integrated manager.

【０００２】[0002]

【従来の技術】従来、フォールトトレラントネットワー
ク管理システムでは、システムに壊滅的な障害が発生し
た時にサービスを中断することなく自動復旧を行うため
に、例えば、特開平９−１１６５３８号公報に示される
ように、主ネットワーク管理システムの一つに対し、必
ず一つの副ネットワーク管理システムを配置することに
より、フォールトトレラント性を確保していた。また、
大規模なネットワークやローカルエリアネットワーク
（以下、ＬＡＮという）の様な分散したネットワークを
管理する場合、管理パケットの削減やネットワーク管理
システムの簡略化を計るため、階層構造でエージェン
ト，サブマネージャ，統合マネージャを配置することに
より、ネットワークを管理した方が効果的な場合がある
が、このとき、前記公報に示されるような、主ネットワ
ーク管理システム一つに対し、必ず副ネットワーク管理
システムを一つ取る構成とすると、主サブマネージャｍ
個を配置して、ネットワーク管理を行う階層構造管理の
構成を取ったとき、副サブマネージャもｍ個配置する必
要がある。2. Description of the Related Art Conventionally, in a fault-tolerant network management system, when a catastrophic failure occurs in a system, in order to perform automatic restoration without interrupting service, as disclosed in Japanese Patent Application Laid-Open No. Hei 9-116538, for example. In addition, one sub-network management system is always provided for one of the main network management systems, thereby ensuring fault tolerance. Also,
When managing a distributed network such as a large-scale network or a local area network (hereinafter, referred to as a LAN), a hierarchical structure of agents, sub-managers, and integrated managers is used to reduce management packets and simplify the network management system. In some cases, it is more effective to manage a network by arranging a network. In this case, as shown in the above-mentioned publication, a configuration in which one main network management system and one sub-network management system are always used Then the main sub-manager m
When a number of sub-sub-managers are arranged and a hierarchical structure management for performing network management is adopted, m sub-sub-managers also need to be arranged.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、かかる
従来のフォールトトレラントネットワーク管理システム
では、主ネットワーク管理システムと副ネットワーク管
理システムが固定的に一対一に対応したシステム構成で
あることにより、主ネットワーク管理システムがｍ個存
在するとき、副ネットワーク管理システムも必ずｍ個必
要となり、この結果、構成が複雑になるばかりか、シス
テムが高価になるという課題があった。また、ネットワ
ーク管理システムにフォールトトレラント性を確保する
ため、主ネットワーク管理システムｍ個に対し、副ネッ
トワーク管理システムがｍ個存在しても、一対の主・副
ネットワーク管理システムが二重障害に陥ると、フォー
ルトトレラント性が確保できないという課題があった。However, in such a conventional fault-tolerant network management system, the main network management system and the sub-network management system have a fixed one-to-one system configuration. When there are m sub-network management systems, m sub-network management systems are necessarily required. As a result, not only the configuration becomes complicated, but also the system becomes expensive. Further, in order to ensure fault tolerance of the network management system, even if there are m main network management systems and m sub-network management systems, if a pair of main / sub-network management systems suffers a double failure. However, there has been a problem that fault-tolerance cannot be ensured.

【０００４】この発明は前記課題を解決するものであ
り、複数の主サブマネージャに対しこれよりも少ない副
サブマネージャに、主サブマネージャの障害時の代行管
理を簡単な構成にて、かつ経済的に実現しながら、主サ
ブマネージャの障害に対するフォールトトレラント性を
確保することができるフォールトトレラントネットワー
ク管理システムを得ることを目的とする。SUMMARY OF THE INVENTION The present invention solves the above-mentioned problems. A plurality of main sub-managers are provided with a smaller number of sub-sub-managers. It is an object of the present invention to provide a fault-tolerant network management system capable of ensuring fault-tolerance against a failure of a main sub-manager while realizing the above.

【０００５】[0005]

【課題を解決するための手段】前記目的達成のために、
請求項１の発明にかかるフォールトトレラントネットワ
ーク管理システムは、ワイドエリアネットワークにルー
タを介して接続された複数のローカルエリアネットワー
クと、これらのローカルエリアネットワークのいくつか
に接続され、これらの各ローカルエリアネットワーク内
の管理オブジェクトを管理，制御する冗長構成を持たな
い主サブマネージャとを設け、前記ローカルエリアネッ
トワークの残りに接続した冗長構成を持つ統合マネージ
ャに、前記各主サブマネージャおよび前記ワイドエリア
ネットワークを介してこれらの管理オブジェクトを管
理，制御させるようにし、該統合マネージャを接続した
ローカルエリアネットワークに接続した前記主サブマネ
ージャの総数より少ない複数の副サブマネージャに、障
害発生したいずれかの前記主サブマネージャの機能を代
行させるようにしたものである。To achieve the above object,
The fault tolerant network management system according to the first aspect of the present invention includes a plurality of local area networks connected to a wide area network via a router, and a plurality of local area networks connected to some of these local area networks. A main sub-manager that does not have a redundant configuration for managing and controlling managed objects in the local area network is provided to an integrated manager having a redundant configuration connected to the rest of the local area network via the main sub-managers and the wide area network. One of the sub-sub-managers, which is smaller than the total number of the main sub-managers connected to the local area network to which the integrated manager is connected, manages and controls these managed objects. It is obtained so as to act for the function of the main sub-manager.

【０００６】また、請求項２の発明にかかるフォールト
トレラントネットワーク管理システムは、前記副サブマ
ネージャの数を、主サブマネージャがどの程度の確率で
同時に障害となる可能性があるかにもとづいて設定した
ものである。In the fault tolerant network management system according to the second aspect of the present invention, the number of the sub-sub-managers is set on the basis of the probability with which the main sub-manager is likely to fail simultaneously. Things.

【０００７】また、請求項３の発明にかかるフォールト
トレラントネットワーク管理システムは、前記ローカル
エリアネットワークのそれぞれに、これらの資源単位に
構成情報，状態情報等の管理オブジェクトを管理，制御
するエージェントを接続したものである。In the fault tolerant network management system according to the third aspect of the present invention, an agent that manages and controls a management object such as configuration information and status information for each resource unit is connected to each of the local area networks. Things.

【０００８】また、請求項４の発明にかかるフォールト
トレラントネットワーク管理システムは、前記各主サブ
マネージャとこれに対応するエージェントとの間、前記
統合マネージャと各主サブマネージャとの間、前記各副
サブマネージャと前記各エージェントとの間、および統
合マネージャと各副サブマネージャとの間で、それぞれ
シンプルネットワークマネージメントプロトコルを使用
して前記管理オブジェクトを管理するようにしたもので
ある。The fault tolerant network management system according to the invention of claim 4 is characterized in that each of the sub-sub-managers and the corresponding agent, between the integrated manager and each of the main sub-managers, and between each of the sub-sub-managers. The management object is managed between the manager and each of the agents and between the integrated manager and each of the sub-sub-managers using a simple network management protocol.

【０００９】[0009]

【発明の実施の形態】以下、この発明の実施の一形態を
図について説明する。図１は、この発明のフォールトト
レラントネットワーク管理システムの構成を示すブロッ
ク図である。同図において、１，２，３は複数のＬＡＮ
で、これらがワイドエリアネットワーク（以下、ＷＡＮ
という）５によって、ルータ６，７，８を経由して接続
されている。また、ＬＡＮ２には、ネットワーク資源単
位に、これの構成情報，状態情報等の管理オブジェクト
を管理，制御する複数のエージェント２０，２１が接続
され、更に、これらエージェント２０，２１を介して、
ＬＡＮ２内の管理オブジェクトを管理、制御する冗長構
成を持たない主サブマネージャ１１が接続されている。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS One embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing the configuration of the fault tolerant network management system of the present invention. In the figure, 1, 2, and 3 indicate a plurality of LANs.
These are wide area networks (hereinafter referred to as WAN).
5) are connected via routers 6, 7, and 8. The LAN 2 is connected to a plurality of agents 20 and 21 for managing and controlling management objects such as configuration information and status information in units of network resources.
A main sub-manager 11 having no redundant configuration for managing and controlling the management objects in the LAN 2 is connected.

【００１０】前記同様にして、ＬＡＮ３には、ネットワ
ーク構成単位に、この構成情報，状態情報等の管理オブ
ジェクトを管理，制御する複数のエージェント３０，３
１が接続され、更に、これらエージェント３０，３１を
介して、ＬＡＮ３内の管理オブジェクトを管理，制御す
る冗長構成を持たない主サブマネージャ１２が接続され
ている。一方、ＬＡＮ１には、複数のエージェント５
０，５１が接続され、更に、これらエージェント５０，
５１の管理下の管理オブジェクトを管理、制御するとと
もに、ＷＡＮ５および主サブマネージャ１１，１２を通
じて、これら管理下の管理オブジェクトを管理，制御す
る統合マネージャ１０が接続されている。すなわち、Ｌ
ＡＮ１にはネットワーク全体の資源を管理する統合マネ
ージャ１０が接続されている。In the same manner as described above, the LAN 3 has a plurality of agents 30 and 3 for managing and controlling management objects such as configuration information and status information in network configuration units.
The main sub-manager 12 having no redundant configuration for managing and controlling the management objects in the LAN 3 is connected via these agents 30 and 31. On the other hand, the LAN 1 has a plurality of agents 5
0, 51 are connected, and these agents 50, 51
An integrated manager 10 that manages and controls the managed objects under the management of 51, and manages and controls these managed objects through the WAN 5 and the main sub-managers 11 and 12 is connected. That is, L
An integrated manager 10 that manages resources of the entire network is connected to AN1.

【００１１】なお、前記ＬＡＮ２，３は統合マネージャ
１０が管理対象とする拠点であるが、その総数はｍ個で
ある。つまり、統合マネージャ１０は主サブマネージャ
１１から１２の総数ｍ個を通じて、これら管理下の各管
理オブジェクトを管理，制御する。ここで、統合マネー
ジャ１０のみ、それ自身、一対一の冗長構成を持つシス
テムである。また、ＬＡＮ１には、ＬＡＮ２，３の総数
ｍ個の主サブマネージャ１１，１２のバックアップとな
るべき、副サブマネージャ１４，１５が接続されてい
る。この副サブマネージャ１４，１５は、主サブマネー
ジャ１１，１２のうち、任意の主サブマネージャが壊滅
的障害となった場合、その主サブマネージャ機能を代行
するバックアップ用のサブマネージャである。なお、副
サブマネージャ１４，１５は総数ｎ個存在する。このと
き、ｍ＞ｎであり、ｎの値は主サブマネージャ１１，１
２がどの程度の確率で同時に障害となる可能性があるか
で決定する。The LANs 2 and 3 are sites managed by the integrated manager 10, and the total number is m. That is, the integrated manager 10 manages and controls each of the managed objects under the management through the total number m of the main sub-managers 11 to 12. Here, only the integration manager 10 itself has a one-to-one redundant configuration. Further, the LAN 1 is connected to sub-sub-managers 14 and 15 which are to be backups of the m main sub-managers 11 and 12 of the total number of LANs 2 and 3. The sub-sub-managers 14 and 15 are backup sub-managers that take over the function of the main sub-manager when any one of the main sub-managers 11 and 12 has a catastrophic failure. Incidentally, there are a total of n sub-sub-managers 14 and 15. At this time, m> n, and the value of n is the main sub-manager 11, 1
2 is determined based on the probability of simultaneous failure.

【００１２】図２は、エージェント２０，２１，３０，
３１，主サブマネージャ１１，１２，副サブマネージャ
１４，１５および統合マネージャ１０の論理的関係を示
す説明図である。ここで、主サブマネージャ１１とエー
ジェント２０，２１との間は、通信ネットワーク管理に
関する国際的な標準規格ＩＡＢ（ＩｎｔｅｒｎｅｔＡｃ
ｔｉｖｉｔｉｅｓＢｏａｒｄ）のＳＮＭＰ（Ｓｉｍｐ
ｌｅＮｅｔｗｏｒｋＭａｎａｇｅｍｅｎｔＰｒｏ
ｔｏｃｏｌ）を使用して管理オブジェクトを管理する。
そして、主サブマネージャ１１は、管理範囲のエージェ
ント２０，２１を通じて収集した、複数の管理オブジェ
クトの集合を木構造で表現したＭＩＢ（Ｍａｎａｇｅｍ
ｅｎｔＩｎｆｏｒｍａｔｉｏｎＢａｓｅ）という形式
で保持する。FIG. 2 shows agents 20, 21, 30,
FIG. 3 is an explanatory diagram showing a logical relationship among 31, main sub-managers 11 and 12, sub-sub-managers 14 and 15, and an integrated manager 10. Here, between the main sub-manager 11 and the agents 20 and 21, an international standard IAB (Internet Ac) for communication network management.
SNMP (Simp)
le Network Management Pro
tocol) to manage managed objects.
Then, the main sub-manager 11 is a MIB (Managem) that expresses a set of a plurality of management objects collected through the agents 20 and 21 in the management range in a tree structure.
entInformation Base).

【００１３】同様にして、主サブマネージャ１２とエー
ジェント３０，３１との間も、ＳＮＭＰを使用して管理
オブジェクトを管理し、また、主サブマネージャ１１，
１２のバックアップとなる副サブマネージャ１４，１５
と各エージェント２０，２１，３０，３１との間も、Ｓ
ＮＭＰを使用して管理オブジェクトを管理する。なお、
主サブマネージャ１１，１２と統合マネージャ１０との
間、副サブマネージャ１４，１５と統合マネージャ１０
との間もＳＮＭＰを使用し管理オブジェクトを管理す
る。Similarly, between the main sub-manager 12 and the agents 30 and 31, managed objects are managed by using SNMP.
12 sub-sub-managers 14 and 15 to be backup
And between each agent 20, 21, 30, 31
Manage managed objects using NMP. In addition,
Between the main sub-managers 11 and 12 and the integrated manager 10, the sub-sub-managers 14 and 15 and the integrated manager 10
Also, the management object is managed using SNMP.

【００１４】図３および図４は、エージェント３０，３
１，主サブマネージャ１１，１２，副サブマネージャ１
４，１５，統合マネージャ１０により階層的にネットワ
ーク資源を管理するとき、主サブマネージャ１１，１２
が障害に陥り、バックアップとなるべき副サブマネージ
ャ１４，１５が起動され、ネットワーク管理システムと
してフォールトトレラント性を確保するための各システ
ムの動作シーケンスを示すものである。統合マネージャ
１０は主サブマネージャ１１，１２のｍ個の主サブマネ
ージャに対しＩＣＭＰ（ＩｎｔｅｒｎｅｔＣｏｎｔｒ
ｏｌＭｅｓｓａｇｅＰｒｏｔｏｃｏｌ）を使用して
ＩＣＭＰエコー要求を定期的に発行し（ヘルスチェッ
ク）、その結果であるＩＣＭＰエコー応答を取得するこ
とにより、各々の主サブマネージャ１１，１２の生死を
確認する。FIG. 3 and FIG.
1, main sub-managers 11 and 12, sub-sub-manager 1
4, 15 When the network resources are hierarchically managed by the integrated manager 10, the main sub-managers 11, 12
2 shows a sequence of operations of each system for ensuring fault-tolerance as a network management system when the secondary sub-managers 14 and 15 to be backed up are activated. The integrated manager 10 sends ICMP (Internet Control) to the m main sub-managers of the main sub-managers 11 and 12.
ol Message Protocol), an ICMP echo request is issued periodically (health check), and the resulting ICMP echo response is obtained, thereby confirming whether each of the main sub-managers 11 and 12 is alive or dead.

【００１５】上記のごとく、統合マネージャ１０が各々
の主サブマネージャ１１，１２にＩＣＭＰエコー要求を
発行してから一定時間経過しても、例えば主サブマネー
ジャ１２からの応答がなき場合（応答待ちタイムアウ
ト）、その主サブマネージャ１２の障害と認識する。そ
して統合マネージャ１０が主サブマネージャ１１，１２
のうちの主サブマネージャ１２の障害を認識した場合、
この統合マネージャ１０は、空きの状態の例えば副サブ
マネージャ１４に対し、障害のある主サブマネージャ１
２の代行を要求するメッセージをＳＮＭＰを用いて送信
する。この要求メッセージには、代行すべき主サブマネ
ージャ１２のＩＰアドレス，その属するＬＡＮ３のネッ
トワークアドレス，ＬＡＮ３を接続するルータアドレス
を合わせて送信する。統合マネージャ１０は、副サブマ
ネージャ１４，１５の使用中状態を管理するが、その情
報としては、使用中／未使用の状態、使用中の場合、ど
の主サブマネージャの代行であるかを示す、ＩＰアドレ
ス情報を管理する。As described above, for example, when there is no response from the main sub-manager 12 even after a certain period of time has elapsed since the integrated manager 10 issued the ICMP echo request to each of the main sub-managers 11 and 12 (response waiting timeout) ), And recognizes that the main sub-manager 12 has failed. Then, the integrated manager 10 becomes the main sub-managers 11 and 12.
When the failure of the main sub-manager 12 is recognized,
The unified manager 10 replaces the free sub-manager 14 with the failed main sub-manager 1, for example.
A message requesting the proxy of the second is transmitted using SNMP. This request message is sent together with the IP address of the main sub-manager 12 to be substituted, the network address of the LAN 3 to which the sub-manager 12 belongs, and the address of the router connecting the LAN 3. The integrated manager 10 manages the in-use state of the sub-sub-managers 14 and 15, and the information includes a used / unused state, and when in use, indicates which main sub-manager is acting on behalf of Manages IP address information.

【００１６】主サブマネージャ１２の代行要求を受け付
けた副サブマネージャ１４は、代行要求メッセージに含
まれる、代行すべき主サブマネージャ１２の属するＬＡ
Ｎ３を接続するルータ８のＩＰアドレスをもとに、ＳＮ
ＭＰによりそのルータが持つ、アール・エフ・シー・１
２１３、マネージメント・インフォメーション・ベース
・フォー・ネットワーク・マネージメント・オブ・ティ
シーピー／アイピー・ベースド・インターネット、ミブ
−ツー（ＲＦＣ１２１３、ＭａｎａｇｅｍｅｎｔＩｎ
ｆｏｒｍａｔｉｏｎＢａｓｅｆｏｒＮｅｔｗｏｒ
ｋＭａｎａｇｅｍｅｎｔｏｆＴＣＰ／ＩＰＢａ
ｓｅｄＩｎｔｅｒｎｅｔ、ＭＩＢ−II）で規定される
ドット表記、ｉｓｏ（１）．ｏｒｄ（３）．ｄｏｄ
（６）．ｉｎｔｅｒｎｅｔ（１）．ｍｇｍｔ（２）．ｍ
ｉｂ−２（１）．ａｔ（３）．ａｔＴａｂｌｅ（１）に
存在するａｔＴａｂｌｅの値を全て取得する。The sub-sub-manager 14 that has received the proxy request from the main sub-manager 12 includes the LA to which the main sub-manager 12 to be substituted belongs, which is included in the proxy request message.
Based on the IP address of the router 8 connecting N3, SN
RFC 1 that the router has by MP
213, Management Information Base for Network Management of TSP / IP Based Internet, Mib-two (RFC1213, Management In
formation Base for Network
k Management of TCP / IP Ba
dot notation defined by the sed Internet (MIB-II), iso (1). ord (3). dod
(6). internet (1). mgmt (2). m
ib-2 (1). at (3). Acquires all the values of atTable existing in atTable (1).

【００１７】前記のａｔＴａｂｌｅの値は、そのルータ
８とＡＲＰ（ａｄｄｒｅｓｓＲｅｓｏｌｕｔｉｏｎ
Ｐｒｏｔｏｃｏｌ）により、ＩＰアドレスから物理アド
レスを解決した、全てのＩＰアドレスと物理アドレスの
対応関係が保持されているため、結果として、そのルー
タ８が収容するＬＡＮ３に接続されている装置、つまり
ＬＡＮ３に接続されているエージェント３０，３１の全
てのＩＰアドレスと物理アドレスの組み合わせを取得す
る事が可能となる。The value of the above atTable is determined by the router 8 and the ARP (address Resolution).
Protocol), the physical address is resolved from the IP address, and the correspondence between all the IP addresses and the physical addresses is retained. As a result, the device connected to the LAN 3 accommodated by the router 8, that is, the LAN 3 It is possible to acquire all combinations of IP addresses and physical addresses of the connected agents 30 and 31.

【００１８】次のステップとして、副サブマネージャ１
４は、上記で取得したＩＰアドレスを基に、ＬＡＮ３に
接続されている全てのエージェント３０，３１に対し、
自副サブマネージャ１４のＩＰアドレス変更のメッセー
ジをＳＮＭＰにより送信し、各エージェント３０，３１
に対してマネージャが主サブマネージャ１２から副サブ
マネージャ１４に変更になったことを通知する。以上に
より、副サブマネージャ１４は、構成情報，状態情報等
を管理，制御すべき全てのエージェント３０，３１のＩ
Ｐアドレスを取得し、主サブマネージャ１２が障害に陥
った各エージェント３０，３１は、新しい代行マネージ
ャ、つまり副サブマネージャ１４のＩＰアドレスを取得
することが可能となる。As the next step, the sub-sub-manager 1
4 is for all the agents 30 and 31 connected to the LAN 3 based on the IP address obtained above.
A message for changing the IP address of the own sub-sub-manager 14 is transmitted by SNMP, and the agents 30 and 31
Is notified that the manager has changed from the main sub-manager 12 to the sub-sub-manager 14. As described above, the sub-sub-manager 14 manages the I / Os of all the agents 30 and 31 to manage and control the configuration information, the status information, and the like.
The agents 30 and 31 that have acquired the P address and the main sub-manager 12 has failed can acquire the IP address of the new substitute manager, that is, the sub-sub-manager 14.

【００１９】以降、統合マネージャ１０は、これまで障
害中の主サブマネージャ１２に対し送信していた各種メ
ッセージを副サブマネージャ１４に送信する。副サブマ
ネージャ１４は対応するＬＡＮ３配下の各エージェント
３０，３１にそのメッセージを送信し、副サブマネージ
ャ１４宛のトラップも必要に応じ、統合マネージャ１０
に送信することで、ネットワーク管理システム全体とし
てのフォールトトレラント性を確保する。統合マネージ
ャ１０は、管理すべきアクティブとなった副サブマネー
ジャ１４が増加したことを認識しているため、新規にア
クティブとなった副サブマネージャ１４についても、定
期的にＩＣＭＰエコー要求を送信し、主サブマネージャ
１２の場合と同様に、生死を監視する。Thereafter, the integrated manager 10 transmits various messages that have been transmitted to the failed main sub-manager 12 to the sub-sub-manager 14. The sub-sub-manager 14 transmits the message to each of the agents 30 and 31 under the corresponding LAN 3, and traps destined for the sub-sub-manager 14 are also sent as needed.
By transmitting to the network management system, the fault tolerance of the entire network management system is ensured. Since the integrated manager 10 recognizes that the number of active sub-sub-managers 14 to be managed has increased, the integrated manager 10 also periodically sends an ICMP echo request to the newly activated sub-sub-managers 14, As in the case of the main sub-manager 12, the life and death are monitored.

【００２０】従って、監視対象の副サブマネージャ１４
が統合マネージャ１０のＩＣＭＰエコー要求に、一定時
間経過しても応答しない場合は、上記と同様の手順に
て、統合マネージャ１０は、次なる空きの副サブマネー
ジャ１５に代行要求を送信し、副サブマネージャ１４の
代行を実施させる。以上により、複数の主サブマネージ
ャ１１，１２が同時に障害に陥らない限り、副サブマネ
ージャ１４，１５がｎ台存在すれば、ある一つの主サブ
マネージャに着目したとき、ｎ重障害までのフォールト
トレラント性を確保することが可能となる。一方、統合
マネージャ１０は、障害に陥った主サブマネージャ１２
にも定期的にＩＣＭＰエコー要求を送信し続け、この主
サブマネージャ１２の復旧を監視する。あるタイミング
で、障害と認識していた主サブマネージャ１２からＩＣ
ＭＰエコー要求に対する応答が送信されたとき、統合マ
ネージャ１０はこれまで障害とされた主サブマネージャ
１２が復旧したと判断する。Therefore, the sub-submanager 14 to be monitored
Does not respond to the ICMP echo request from the integrated manager 10 even after a certain period of time, the integrated manager 10 transmits a proxy request to the next free sub-sub-manager 15 in the same procedure as described above, and The sub-manager 14 is executed. As described above, as long as a plurality of main sub-managers 11 and 12 do not fail simultaneously, if there are n sub-sub-managers 14 and 15, when focusing on one certain main sub-manager, fault tolerance up to n-fold failures Can be ensured. On the other hand, the integrated manager 10
Also, the main sub-manager 12 continues to periodically transmit the ICMP echo request and monitors the recovery of the main sub-manager 12. At a certain timing, the main sub-manager 12 that has
When a response to the MP echo request is transmitted, the integrated manager 10 determines that the main sub-manager 12 that has been in failure so far has been recovered.

【００２１】この時、統合マネージャ１０は、復旧した
主サブマネージャ１２に、復旧メッセージをＳＮＭＰに
より送信し、副サブマネージャ１５からエージェント３
０，３１に対して、副サブマネージャ１５から主サブマ
ネージャ１２にマネージャＩＰアドレスが変更となった
ことを伝えるメッセージを送信することを指示する。こ
のメッセージにより該当ＬＡＮ配下の各エージェント
は、主サブマネージャが復旧したことを認識し、各シス
テムとも元の状態に復旧する。合わせて統合マネージャ
１０は、主サブマネージャ１２の代行を行っていた副サ
ブマネージャ１５に代行解除のメッセージを送信し、統
合マネージャ１０の管理する副サブマネージャの状態を
空きに管理し、次の主サブマネージャ１１，１２のいず
れかの障害に備える。At this time, the integrated manager 10 sends a restoration message to the restored main sub-manager 12 by SNMP, and the sub-
0, 31 is instructed to transmit a message from the sub-sub-manager 15 to the main sub-manager 12 indicating that the manager IP address has been changed. By this message, each agent under the LAN recognizes that the main sub-manager has been restored, and each system is restored to the original state. At the same time, the integrated manager 10 transmits a message of delegation release to the sub-sub-manager 15 that has been acting for the main sub-manager 12, manages the state of the sub-sub-manager managed by the integrated manager 10 to be empty, and Prepare for a failure of any of the sub-managers 11 and 12.

【００２２】[0022]

【発明の効果】以上のように、この発明によれば、主ネ
ットワーク管理システムと副ネットワーク管理システム
が固定的に一対一に対応していないシステム構成とした
ので、ｍ個のネットワーク管理システムのフォールトト
レラント性を確保するために、ｍより小さい値の、ｎ個
の副サブマネージャでネットワーク管理システムのフォ
ールトトレラント性を確保できるとともに、これを簡単
かつローコストの構成にて実現できるという効果が得ら
れる。また、同時に複数の主サブマネージャが障害に陥
らない限り、一つの主サブマネージャの最大ｎ重障害ま
でフォールトトレラント性を確保できるという効果が得
られる。As described above, according to the present invention, the system configuration is such that the main network management system and the sub network management system are not fixedly one-to-one, so that the faults of the m network management systems In order to ensure the tolerant property, the effect that the fault tolerant property of the network management system can be ensured by n sub-sub-managers having a value smaller than m and that this can be realized with a simple and low-cost configuration is obtained. Further, as long as a plurality of main sub-managers do not fail at the same time, the effect that fault tolerability can be ensured up to a maximum n-fold failure of one main sub-manager is obtained.

[Brief description of the drawings]

【図１】この発明の実施の一形態によるフォールトト
レラントネットワーク管理システムを示すブロック図で
ある。FIG. 1 is a block diagram showing a fault tolerant network management system according to an embodiment of the present invention.

【図２】図１におけるエージェント，サブマネージャ
および統合マネージャの論理的関係を示す説明図であ
る。FIG. 2 is an explanatory diagram showing a logical relationship among an agent, a sub-manager, and an integrated manager in FIG.

【図３】図１におけるエージェント，サブマネージャ
および統合マネージャの動作を示すシーケンス図であ
る。FIG. 3 is a sequence diagram showing operations of an agent, a sub-manager, and an integrated manager in FIG.

【図４】図３にひき続き、図１におけるエージェン
ト，サブマネージャおよび統合マネージャの動作を示す
シーケンス図である。FIG. 4 is a sequence diagram showing operations of the agent, the sub-manager, and the integration manager in FIG. 1, following FIG. 3;

[Explanation of symbols]

１，２，３ＬＡＮ（ローカルエリアネットワーク）５ＷＡＮ（ワイドエリアネットワーク）６，７，８ルータ１０統合マネージャ１１，１２主サブマネージャ１４，１５副サブマネージャ２０，２１，３０，３１，５０，５１エージェント 1,2,3 LAN (Local Area Network) 5 WAN (Wide Area Network) 6,7,8 Router 10 Integrated Manager 11,12 Main Sub-Manager 14,15 Sub-Sub Manager 20,21,30,31,50,51 Agent

─────────────────────────────────────────────────────
────────────────────────────────────────────────── ───

【手続補正書】[Procedure amendment]

【提出日】平成１２年３月１６日（２０００．３．１
６）[Submission date] March 16, 2000 (200.3.1.1)
6)

【手続補正１】[Procedure amendment 1]

【補正対象書類名】明細書[Document name to be amended] Statement

【補正対象項目名】特許請求の範囲[Correction target item name] Claims

【補正方法】変更[Correction method] Change

【補正内容】[Correction contents]

【特許請求の範囲】[Claims]

【手続補正２】[Procedure amendment 2]

【補正対象書類名】明細書[Document name to be amended] Statement

【補正対象項目名】０００５[Correction target item name] 0005

【補正方法】変更[Correction method] Change

【補正内容】[Correction contents]

【０００５】[0005]

【課題を解決するための手段】前記目的達成のために、
請求項１の発明にかかるフォールトトレラントネットワ
ーク管理システムは、ワイドエリアネットワークにルー
タを介して接続された複数のローカルエリアネットワー
クと、これらのローカルエリアネットワークのいくつか
に接続され、これらの各ローカルエリアネットワーク内
の管理オブジェクトを管理，制御する冗長構成を持たな
い主サブマネージャとを設け、前記ローカルエリアネッ
トワークの残りに接続した冗長構成を持つ統合マネージ
ャに、前記各主サブマネージャおよび前記ワイドエリア
ネットワークを介してこれらの管理オブジェクトを管
理，制御させるようにし、該統合マネージャを接続した
ローカルエリアネットワークに接続した前記主サブマネ
ージャの総数より少ない複数の副サブマネージャに、障
害発生したいずれかの前記主サブマネージャの機能を代
行させる構成としている。そして、前記統合マネージャ
が主サブマネージャの障害を認識した場合、前記統合マ
ネージャは、空きの状態の副サブマネージャに対し、障
害のある主サブマネージャの代行を要求するメッセージ
をＳＮＭＰを用いて送信すると共に、該要求メッセージ
には、代行すべき主サブマネージャのＩＰアドレス，そ
の属するＬＡＮのネットワークアドレス，該ＬＡＮを接
続するルータアドレスを合わせて送信するようにしたも
のである。To achieve the above object,
The fault tolerant network management system according to the first aspect of the present invention includes a plurality of local area networks connected to a wide area network via a router, and a plurality of local area networks connected to some of these local area networks. A main sub-manager that does not have a redundant configuration for managing and controlling managed objects in the local area network is provided to an integrated manager having a redundant configuration connected to the rest of the local area network via the main sub-managers and the wide area network. One of the sub-sub-managers, which is smaller than the total number of the main sub-managers connected to the local area network to which the integrated manager is connected, manages and controls these managed objects. It has a configuration which act for the function of the main sub-manager. And the integration manager
If the master recognizes the failure of the primary sub-manager,
The manager issues an error to the free secondary sub-manager.
Message requesting harmful primary sub-manager to act
Is transmitted using SNMP, and the request message is transmitted.
Contains the IP address of the main sub-manager to be substituted,
Network address of the LAN to which the
The next router address is transmitted together .

Claims

[Claims]

1. A plurality of local area networks connected to a wide area network via a router, and connected to some of these local area networks to manage and control a management object in each of these local area networks. A main sub-manager having no redundant configuration; an integrated manager having a redundant configuration connected to the rest of the local area network and managing and controlling these managed objects via the main sub-managers and the wide area network; A plurality of sub-sub-managers connected to the local area network to which the integrated manager is connected and substituting for the function of any of the failed main sub-managers, the number being smaller than the total number of the main sub-managers. Fall Tolerant network management system.

2. The fault according to claim 1, wherein the number of the sub-sub-managers is set based on the probability that the main sub-manager is likely to fail simultaneously. Tolerant network management system.

3. The local area network according to claim 1, wherein an agent that manages and controls a management object such as configuration information and status information for each resource unit is connected to each of the local area networks. Fault tolerant network management system.

4. Each of the main sub-managers and their corresponding agents, between the integration manager and each of the main sub-managers, between each of the sub-sub-managers and each of the agents, and between the integration manager and each of the agents. The fault tolerant network management system according to claim 1, wherein the management object is managed between the sub-manager and the sub-manager using a simple network management protocol.