JP2006203721A

JP2006203721A - Fault location finding method and apparatus

Info

Publication number: JP2006203721A
Application number: JP2005015063A
Authority: JP
Inventors: Takashi Miyamura; 崇宮村; Rie Hayashi; 理恵林; Takashi Kurimoto; 崇栗本; Akira Misawa; 明三澤
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2005-01-24
Filing date: 2005-01-24
Publication date: 2006-08-03
Anticipated expiration: 2025-01-24
Also published as: JP4377822B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a technique capable of quickly specifying faulty parts with high accuracy even when a relay node in a network has no fault detection function by using SRG (Shared Risk Group) information imparted to a resource in the network and information of a faulty path only. <P>SOLUTION: A fault location finding apparatus 3 performs the steps of; receiving a notice of first path information denoting the occurrence of a fault from a node being a start point or an end point of a path; receiving a notice of second path information denoting the occurrence of a fault caused within a prescribed time from the notice of the first path information to create a fault path list on the basis of both the first and second path information items; referencing a storage means to create a fault resource list comprising at least part of resource IDs of resources for configuring each path of the fault path list; referencing the storage means to create fault parts list comprising at least part of parts IDs of parts configuring each resource of the fault resource list; and outputting the fault parts list. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は、故障箇所発見方法および故障箇所発見装置に関する。 The present invention relates to a failure location discovery method and a failure location discovery device.

近年、インターネットやＬＡＮ（Local Area Network）の急速な普及から、ＩＰ（Internet Protocol）トラヒックを含めたデータトラヒックが指数関数的に増加している。トラヒックの増加に対応するため、バックボーンネットワークにおける高速なＩＰパケットの転送を実現するメカニズムが求められている。同時にＩＰをベースとしたアプリケーションも多様化しており、従来は電子メールやウェッブなどのネットワークに品質を要求しないアプリケーションが中心であったが、最近ではＩＰ電話や電子商取引のようにネットワークに高い品質を要求するアプリケーションも普及しはじめている。 In recent years, data traffic including IP (Internet Protocol) traffic is increasing exponentially due to the rapid spread of the Internet and LAN (Local Area Network). In order to cope with an increase in traffic, a mechanism for realizing high-speed IP packet transfer in a backbone network is required. At the same time, IP-based applications are diversifying, and traditionally applications such as e-mail and web that do not require quality have been the main focus. Recently, however, networks such as IP telephones and electronic commerce have been improved in quality. Requested applications are also becoming popular.

従来のＩＰルータ網では、中継段の各ルータで電気的なパケット処理が必要であり、またＥｎｄ−Ｅｎｄで高度な品質制御を行うメカニズムを持たないため、高速化の実現と高品質なサービスの提供を実現することが困難であった。ここで、品質としては、パケットの遅延時間や損失率という転送品質の他にも、ユーザから見えるサービス中断時間といった信頼性も重要な指標となる。 In the conventional IP router network, each router in the relay stage requires electrical packet processing, and since there is no mechanism for performing advanced quality control with End-End, it realizes high speed and high quality service. It was difficult to realize the provision. Here, as the quality, in addition to the transfer quality such as the packet delay time and loss rate, the reliability such as the service interruption time as seen by the user is an important index.

前記のＩＰルータ網に関わる問題を解決するひとつの手段として、ＭＰＬＳ（Multi-Protocol Label Switching）が提案されている。ＭＰＬＳは転送するパケットに固定長のラベルを付与し、ラベルをもとに通信網内の各ノードでは転送する方路を決定する。このＭＰＬＳをＩＰバックボーンネットワークに導入することにより高速なＩＰパケットの転送と高度なＴＥ（Traffic Engineering）が可能となる。 As one means for solving the problems related to the IP router network, MPLS (Multi-Protocol Label Switching) has been proposed. MPLS gives a fixed-length label to a packet to be transferred, and each node in the communication network determines a route to transfer based on the label. By introducing this MPLS into the IP backbone network, high-speed IP packet transfer and advanced TE (Traffic Engineering) are possible.

さらに近年、ＩＰネットワークの中継ノードに光クロスコネクトなどのレイヤ１技術を利用したネットワークアーキテクチャが提案されている。これまでに、ＡＳＯＮ（Automated Switched Optical Network）、ＧＭＰＬＳ（Generalized Multi-Protocol Label Switching）などが提案されている。これらネットワークに共通する特徴としては、パス（コネクション）がシグナリングにより動的に設定される点にある。このため、従来のレイヤ１網に比べて、パス開通までの時間を大幅に短縮でき、より効率的な網運営が期待できる。 In recent years, a network architecture using layer 1 technology such as an optical cross-connect for a relay node of an IP network has been proposed. So far, ASON (Automated Switched Optical Network), GMPLS (Generalized Multi-Protocol Label Switching), and the like have been proposed. A feature common to these networks is that a path (connection) is dynamically set by signaling. For this reason, compared with the conventional layer 1 network, the time until path establishment can be significantly shortened, and more efficient network operation can be expected.

前記のＭＰＬＳ、ＧＭＰＬＳに共通な特徴としては、ＩＰパケットを含むユーザのトラヒックを転送するにあたって、事前にパス設定を行う必要がある点があげられる。パス設定を行うためにはルーティングプロトコルなどを用いてネットワークのトポロジ情報（ノード間の接続構成に関する情報）を収集し、パスの中継経路を計算した上で、ＣＲ−ＬＤＰ（constraint-based Label Distribution Protocol）やＲＳＶＰ−ＴＥ（Resource Reservation Protocol -Traffic Engineering）などのシグナリングプロトコルによってパスの設定を行う。 A feature common to the MPLS and GMPLS is that it is necessary to set a path in advance when transferring user traffic including IP packets. In order to set a path, network topology information (information related to the connection configuration between nodes) is collected using a routing protocol, and a path relay route is calculated. Then, CR-LDP (constraint-based Label Distribution Protocol) is used. ) And RSVP-TE (Resource Reservation Protocol-Traffic Engineering).

パスの設定完了後に、該当パス上にトラヒックの転送を開始する。また、ＧＭＰＬＳとＭＰＬＳの関係であるが、ＧＭＰＬＳはＭＰＬＳの一般化であり、基本的にＭＰＬＳを包含するものである。 After the path setting is completed, traffic transfer on the corresponding path is started. Further, regarding the relationship between GMPLS and MPLS, GMPLS is a generalization of MPLS and basically includes MPLS.

次にＭＰＬＳやＧＭＰＬＳを用いて、ネットワークの高信頼化を実現する技術について説明する。ここで、例としてはＧＭＰＬＳの場合を説明するが、ＭＰＬＳの場合にも適用可能である。 Next, a technique for realizing high network reliability using MPLS or GMPLS will be described. Here, the case of GMPLS will be described as an example, but the present invention can also be applied to the case of MPLS.

ネットワークの高信頼化は、ネットワークの故障に対してネットワークを故障前の正常な状態に戻すことを指す。よって、高信頼化のための作業は、現用系から予備系への切替による一時的な復旧作業、または、故障した部品の交換による恒久的な復旧作業に分類できる。 High reliability of the network refers to returning the network to a normal state before the failure in response to the failure of the network. Therefore, the work for achieving high reliability can be classified into a temporary restoration work by switching from the active system to the standby system, or a permanent restoration work by replacing a failed part.

まず、一時的な復旧作業について、説明する。ＧＭＰＬＳにおける切替技術としては、パスプロテクションがよく知られている（非特許文献１、特許文献１）。パスプロテクションとは通信の起点となる通信ノードから終点となる通信ノード間に２本以上のパスをあらかじめ設定し、そのうちの１つを現用パスとして、通常時にユーザトラヒックを転送する。 First, temporary recovery work will be described. As a switching technique in GMPLS, path protection is well known (Non-patent Document 1, Patent Document 1). With path protection, two or more paths are set in advance between the communication node that is the starting point of communication and the communication node that is the end point, and one of them is used as a working path, and user traffic is transferred during normal times.

現用パス上のノードまたはリンクに障害が発生した際に、起点となるノード（発側）においてパス切替えを行い、ユーザトラヒックを予備パス上に転送する。現用パスの経路上に障害が発生した際に、直ちに予備パスに切替えることにより、ユーザトラヒックの不通時間を短縮し、高信頼化を実現している。予備パスは現用パス上の障害の影響を受けないようにするため、現用パスとは中継段のリンクおよびノードを共有しないように設計する必要がある。 When a failure occurs in a node or link on the working path, path switching is performed at the starting node (originating side), and user traffic is transferred onto the protection path. When a failure occurs on the path of the working path, switching to the backup path is immediately performed, thereby shortening the non-communication time of user traffic and realizing high reliability. In order to prevent the protection path from being affected by a failure on the working path, it is necessary to design the protection path so as not to share the link and node of the relay stage with the working path.

一般的に通信ネットワークは複数のレイヤで構成されている。光レイヤでは現用パス、予備パスは中継段において、ノードおよびリンクを共有していないが、物理レイヤまで考慮すると、現用パスと予備パスは１箇所で物理レイヤにおけるリンクであるファイバを共有している場合がある。このような場合に、共有しているファイバに故障が発生すると、パスプロテクションにより障害を復旧できないために著しくネットワーク信頼性、サービスの可用性が低下するという問題が生じる。 In general, a communication network is composed of a plurality of layers. In the optical layer, the working path and the protection path do not share nodes and links in the relay stage, but considering the physical layer, the working path and the protection path share a fiber that is a link in the physical layer at one location. There is a case. In such a case, if a failure occurs in the shared fiber, the failure cannot be recovered by path protection, so that there is a problem that network reliability and service availability are significantly reduced.

よって、パスプロテクションを用いても予備パス上のリソースに故障が発生した場合はサービスが中断してしまうため、故障したリソースを早期に特定し復旧することが重要となる。 Therefore, even if the path protection is used, if a failure occurs in a resource on the backup path, the service is interrupted. Therefore, it is important to identify and restore the failed resource at an early stage.

このような問題を回避する方法として、ＳＲＧ（Shared Risk Group）番号が提案されている（非特許文献２）。物理レイヤの各ファイバリンクにＳＲＧ番号を付与し、光レイヤの各リンクが収容されているファイバのＳＲＧ番号の和を光レイヤのリンクのＳＲＧ情報としてルーティングプロトコルなどで広告して、パスの経路計算に用いる。 As a method for avoiding such a problem, an SRG (Shared Risk Group) number has been proposed (Non-Patent Document 2). SRG numbers are assigned to each fiber link in the physical layer, and the route route calculation is performed by advertising the sum of the SRG numbers of the fibers that accommodate each link in the optical layer as SRG information of the optical layer link using a routing protocol. Used for.

光レイヤでパスプロテクションの経路設計を実施する際に、光レイヤのトポロジに加えて、ＳＲＧ情報を考慮することで、任意の物理レイヤリソースの故障に対しても、復旧可能なパスプロテクションの設計が可能となる。 When implementing path protection path design in the optical layer, by considering SRG information in addition to the topology of the optical layer, it is possible to design a path protection that can be recovered from any physical layer resource failure. It becomes possible.

以上説明した切替作業による復旧は、現用系から予備系に動作を切り替えることで、ネットワークの品質にかかわらず稼働させればいいという一時しのぎ程度のものである。しかし、予備系が使用可能なネットワーク資源は、現用系のものより劣っていることが多く、故障前と同などのネットワークサービスを実現するためには、故障箇所を特定し、その部品（パーツ）を交換（または修理）する必要がある。 The recovery by the switching operation described above is of a temporary level that the operation can be performed regardless of the quality of the network by switching the operation from the active system to the standby system. However, the network resources that can be used by the standby system are often inferior to those of the active system, and in order to realize the same network service as before the failure, the failure location is identified and its parts (parts) Need to be replaced (or repaired).

実際のネットワークでは、ファイバ以外の伝送装置、ＴＤＭ（Time Division Multiplexing）スイッチ、ルータ、回線カード、インターフェイスなど様々なパーツにおいて、それぞれ異なる要因で故障が発生する。 In an actual network, failures occur due to different factors in various parts such as transmission devices other than fibers, TDM (Time Division Multiplexing) switches, routers, line cards, and interfaces.

まず、光ネットワークなどではパスの終端点となる発側ノードまたは着側ノードのみにしか故障検出機能が配備されないため、パスの故障自体は検出可能だが故障箇所の特定が非常に困難であった。 First, in an optical network or the like, since a failure detection function is provided only at a source node or a destination node as a path termination point, it is possible to detect a path failure itself, but it is very difficult to specify a failure location.

そこで、故障箇所の発見方法としては特許文献２の方法が知られている。特許文献２に記載の方法により故障が発生したリンクまでは特定可能である。
特願２００２−２３５９８０号公報特開２００３−１１５８０６号公報 E. Oki他、“A Disjoint Path Selection Scheme with SRLG in GMPLS networks”、2002 WKsp.IEEE HPSR,May 2002,88-92 塩本公平著、“階層型ＳＲＧに基づくリストレーション方式”、電子情報通信学会総合大会論文集、２００３ Therefore, the method disclosed in Patent Document 2 is known as a method for finding a fault location. Even the link where the failure has occurred can be identified by the method described in Patent Document 2.
Japanese Patent Application No. 2002-235980 JP 2003-115806 A E. Oki et al., “A Disjoint Path Selection Scheme with SRLG in GMPLS networks”, 2002 WKsp. IEEE HPSR, May 2002, 88-92 Shiomoto, "Restoration method based on hierarchical SRG", IEICE General Conference Proceedings, 2003

しかし、特許文献２の特定するリンクとは、物理的なハードウェア資源としてのパーツではなく、ルーティングプロトコルが参照する論理的な回線である。よって、論理的なリンクは、実際には複数のパーツから構成されている。よって、特許文献２などの従来の故障箇所の特定技術では、物理的なパーツは特定することが困難である。そのため、現状では、作業員が現地に赴いてテスタなどで逐一部品の稼働状況を調査する必要があり、手間がかかっていた。 However, the link specified in Patent Document 2 is not a part as a physical hardware resource but a logical line referred to by a routing protocol. Therefore, the logical link is actually composed of a plurality of parts. Therefore, it is difficult to specify a physical part by a conventional technique for identifying a fault location such as Patent Document 2. Therefore, at present, it is necessary for workers to go to the site and investigate the operation status of each part with a tester or the like, which is troublesome.

そこで、本発明は、前記した問題を解決し、故障が発生したパーツを特定することを主な目的とする。 Therefore, the main object of the present invention is to solve the above-described problems and to specify a part in which a failure has occurred.

前記課題を解決するため、本発明は、パスを設定するノードを接続する通信ネットワークにおける構成要素となるパーツの故障を発見する故障箇所発見方法であって、前記パスを構成するリソースを一意に特定するリソースＩＤ、および、前記リソースを構成する前記パーツを一意に特定するパーツＩＤを格納する記憶手段にアクセスする手段を有するコンピュータが、前記パスの始点または終点となるノードから、故障が発生した第１パス情報の通知を受ける手順と、前記第１パス情報の通知から所定時間内に、故障が発生した第２パス情報の通知を受け、前記第１パス情報と前記第２パス情報とを併せて故障パスリストを作成する手順と、前記記憶手段を参照し、前記故障パスリストの各パスを構成するリソースのリソースＩＤの少なくとも一部から構成される故障リソースリストを作成する手順と、前記記憶手段を参照し、前記故障リソースリストの各リソースを構成するパーツのパーツＩＤの少なくとも一部から構成される故障パーツリストを作成する手順と、前記故障パーツリストを出力する手順と、を実行することを特徴とする。 In order to solve the above problems, the present invention provides a failure location detection method for detecting a failure of a part which is a component in a communication network connecting nodes for setting a path, and uniquely identifies a resource constituting the path. And a computer having a means for accessing a storage means for storing a part ID for uniquely identifying the part that constitutes the resource has a fault from a node that is a start point or an end point of the path. A procedure for receiving notification of 1-path information, and notification of second path information in which a failure has occurred within a predetermined time from the notification of the first path information, and combining the first path information and the second path information. The procedure for creating the fault path list and the storage means are referred to, and the resource ID of the resource constituting each path of the fault path list is reduced. A failure resource list consisting of at least a part of a part ID of a part constituting each resource of the failure resource list with reference to the storage means And a procedure for outputting the failed parts list.

これにより、出力された故障パーツリストから、故障が発生したパーツを特定することが可能となる。 As a result, it is possible to identify the part where the failure has occurred from the output failed parts list.

本発明は、前記パーツを一意に特定するパーツＩＤが、パーツの種別を示すパーツ種別ＩＤと、前記パーツ種別ＩＤが同じパーツについて、一意に識別するパーツ製造ＩＤとの組み合わせにより構成されることを特徴とする。 According to the present invention, the part ID for uniquely identifying the part is configured by a combination of a part type ID indicating a part type and a part manufacturing ID for uniquely identifying a part having the same part type ID. Features.

これにより、複数のベンダから出されたネットワークのパーツについて、パーツＩＤを衝突することなく割り当てることが可能となる。 Thereby, it becomes possible to assign part IDs without colliding with respect to parts of the network issued from a plurality of vendors.

本発明は、前記故障リソースリストを作成する手順が、所定のリソースについて、前記通信ネットワーク内に設定されたパスの中で所定のリソースを使用しているパスの本数と、故障したパスの中で所定のリソースを使用しているパスの本数との割合と、第１閾値とを比較し、前記故障パスリストの各パスを構成する全てのリソースのうち、前記第１閾値との比較結果により所定のリソースを選別して、故障リソース候補リストを作成することを特徴とする。 According to the present invention, the procedure for creating the fault resource list includes the number of paths using the predetermined resource among the paths set in the communication network for the predetermined resource, and among the faulty paths. The ratio of the number of paths using a predetermined resource to the first threshold value is compared, and among all the resources constituting each path of the failed path list, the predetermined value is determined based on the comparison result with the first threshold value. This is characterized in that a failure resource candidate list is created by selecting the resources.

これにより、第１閾値で指定された精度で、故障リソースリストを作成することが可能となる。 As a result, the failure resource list can be created with the accuracy specified by the first threshold.

本発明は、前記故障パーツリストを作成する手順が、所定のパーツについて、前記通信ネットワーク内に設定された所定のパーツを使用しているリソースの数と、前記故障リソースリストにある所定のパーツを使用しているリソースの数との割合と、第２閾値とを比較し、前記故障リソースリストの各リソースを構成する全てのパーツのうち、前記第２閾値との比較結果により所定のパーツを選別して、故障パーツリストを作成することを特徴とする。 According to the present invention, the procedure for creating the failed part list includes, for a predetermined part, the number of resources using the predetermined part set in the communication network and the predetermined part in the failed resource list. The ratio of the number of used resources and the second threshold value are compared, and a predetermined part is selected based on the comparison result with the second threshold value among all the parts constituting each resource of the failure resource list. Then, a failure parts list is created.

これにより、第２閾値で指定された精度で、故障パーツリストを作成することが可能となる。 As a result, it is possible to create a fault parts list with the accuracy specified by the second threshold.

本発明は、パスを設定するノードを接続する通信ネットワークにおける構成要素となるパーツの故障を発見する故障箇所発見装置であって、前記パスを構成するリソースを一意に特定するリソースＩＤ、および、前記リソースを構成する前記パーツを一意に特定するパーツＩＤを格納する記憶手段にアクセスする手段を有する故障箇所発見装置が、前記パスの始点または終点となるノードから、故障が発生した第１パス情報の通知を受けるネットワークインターフェイスと、前記第１パス情報の通知から所定時間内に、故障が発生した第２パス情報の通知を受け、前記第１パス情報と前記第２パス情報とを併せて故障パスリストを作成し、前記記憶手段を参照し、前記故障パスリストの各パスを構成するリソースのリソースＩＤの少なくとも一部から構成される故障リソースリストを作成し、前記記憶手段を参照し、前記故障リソースリストの各リソースを構成するパーツのパーツＩＤの少なくとも一部から構成される故障パーツリストを作成する故障パーツ発見部と、前記故障パーツリストを出力する故障パーツ出力部と、を有することを特徴とする。 The present invention is a failure location finding device for finding a failure of a part that is a component in a communication network that connects nodes that set a path, a resource ID that uniquely identifies a resource constituting the path, and The failure location detection apparatus having a means for accessing a storage means for storing a part ID for uniquely identifying the part constituting the resource is configured so that the failure of the first path information in which the failure has occurred from a node that is a start point or an end point of the path. The network interface that receives the notification, the notification of the second path information that has failed within a predetermined time from the notification of the first path information, and the failure path that combines the first path information and the second path information. Create a list, refer to the storage means, and at least resource IDs of resources constituting each path of the fault path list A faulty part list that consists of at least part of part IDs of parts that make up each resource of the faulty resource list by creating a faulty resource list that consists of parts and referring to the storage means And a faulty part output unit that outputs the faulty parts list.

これにより、出力された故障パーツリストから、故障が発生したパーツを特定することが可能となる。 As a result, it is possible to identify the part in which the failure has occurred from the output failed parts list.

本発明は、前記故障パーツ発見部が、所定のリソースについて、前記通信ネットワーク内に設定されたパスの中で所定のリソースを使用しているパスの本数と、故障したパスの中で所定のリソースを使用しているパスの本数との割合と、第１閾値とを比較し、前記故障パスリストの各パスを構成する全てのリソースのうち、前記第１閾値との比較結果により所定のリソースを選別して、故障リソース候補リストを作成することを特徴とする。 The present invention provides the number of paths in which the failed part finding unit uses the predetermined resource among the paths set in the communication network for the predetermined resource, and the predetermined resource in the failed path. The ratio of the number of paths using the first threshold is compared with a first threshold, and among all resources constituting each path of the failure path list, a predetermined resource is determined according to a comparison result with the first threshold. A failure resource candidate list is created by screening.

本発明は、前記故障パーツ発見部が、所定のパーツについて、前記通信ネットワーク内に設定された所定のパーツを使用しているリソースの数と、前記故障リソースリストにある所定のパーツを使用しているリソースの数との割合と、第２閾値とを比較し、前記故障リソースリストの各リソースを構成する全てのパーツのうち、前記第２閾値との比較結果により所定のパーツを選別して、故障パーツリストを作成することを特徴とする。 In the present invention, the failure part discovery unit uses, for a predetermined part, the number of resources using the predetermined part set in the communication network and the predetermined part in the failure resource list. The ratio of the number of resources that are present and the second threshold value are compared, and out of all the parts constituting each resource of the fault resource list, a predetermined part is selected according to the comparison result with the second threshold value, A failure parts list is created.

本発明によれば、ネットワーク内のリソースに付与されたＳＲＧ情報と故障したパスの情報のみを用いて、ネットワーク内の中継ノードに故障検出機能がない場合であっても、故障したパーツを精度よく迅速に発見可能となる。 According to the present invention, using only the SRG information assigned to the resources in the network and the information on the failed path, even if the relay node in the network does not have a failure detection function, the failed part can be accurately identified. It can be discovered quickly.

以下に、本発明が適用される通信システム１の一実施形態について、図面を参照して詳細に説明する。まず、本実施形態の通信システム１の構成について、図１から図５を参照して説明する。図１に示す通信システム１は、通信ネットワーク２内で障害が発生した場合に、その発生箇所を通信システム１の構成要素であるパーツの単位（ファイバ、装置のインターフェイスカードなど）まで特定することを特徴とする。 Hereinafter, an embodiment of a communication system 1 to which the present invention is applied will be described in detail with reference to the drawings. First, the configuration of the communication system 1 according to the present embodiment will be described with reference to FIGS. 1 to 5. The communication system 1 shown in FIG. 1 specifies that when a failure occurs in the communication network 2, the occurrence location is specified up to a unit of a part (fiber, device interface card, etc.) that is a component of the communication system 1. Features.

通信システム１は、パスＣの設定を行うノードＮ、故障の発生箇所を発見する故障箇所発見装置３を含むものとする。そして、通信システム１は、ノードＮ間を通信ネットワーク２で接続する。この、通信ネットワーク２においては、網内の各ノードＮでルーティングプロトコルを動作させて、ネットワークトポロジ情報およびリソースＲ情報の収集を行っている。ルーティングプロトコルとしてはＯＳＰＦ（Open Shortest Path First）やＩＳ−ＩＳ（Intermediate System - Intermediate System）などが通常利用されている。 The communication system 1 includes a node N that sets a path C, and a failure location finding device 3 that finds a location where a failure has occurred. The communication system 1 connects the nodes N with the communication network 2. In the communication network 2, the network protocol information and resource R information are collected by operating a routing protocol at each node N in the network. As a routing protocol, OSPF (Open Shortest Path First), IS-IS (Intermediate System-Intermediate System) and the like are usually used.

なお、通信システム１を構成する装置（ノードＮ、故障箇所発見装置３）は、それぞれ演算処理を行う際に用いられる記憶手段としてのメモリと、前記演算処理を行う演算処理装置と、通信ネットワーク２に接続するためのネットワークインターフェイスとを少なくとも備えるコンピュータとして構成される。なお、メモリは、ＲＡＭ（Random Access Memory）などにより構成される。演算処理は、ＣＰＵ（Central Processing Unit）によって構成される演算処理装置が、メモリ上のプログラムを実行することで、実現される。 Note that the devices (node N and fault location finding device 3) constituting the communication system 1 each include a memory serving as storage means used when performing arithmetic processing, an arithmetic processing device that performs the arithmetic processing, and a communication network 2. And a network interface for connecting to the computer. The memory is constituted by a RAM (Random Access Memory) or the like. Arithmetic processing is realized by an arithmetic processing unit configured by a CPU (Central Processing Unit) executing a program on a memory.

各ノードＮは、発側ノードＮＳ、中継ノードＮＭ、着側ノードＮＥに分類される。発側ノードＮＳはパスＣの開始点となるノードＮで、新たなパスＣを設定する際に設定のためのシグナリングメッセージの生成などを行う。例えばＧＭＰＬＳネットワークの場合、ＣＲ−ＬＤＰやＲＳＶＰ−ＴＥなどのシグナリングプロトコルを用いてパスＣの設定を行う。中継ノードＮＭはパスＣ上を転送されるデータの中継を主に行う。着側ノードＮＥは、パスＣの終端点となるノードＮである。また、発側ノードＮＳおよび着側ノードＮＥは、図示しないユーザ側の通信装置と接続しており、通信ネットワーク２は、２拠点間のユーザ側の通信装置を接続するコネクションを提供している。また、故障箇所発見装置３は、通信システム１内に少なくとも１つ配備され、発側ノードＮＳ、着側ノードＮＥと接続している。 Each node N is classified into a source side node NS, a relay node NM, and a destination side node NE. The originating node NS is a node N that is the starting point of the path C, and generates a signaling message for setting when setting a new path C. For example, in the case of a GMPLS network, the path C is set using a signaling protocol such as CR-LDP or RSVP-TE. The relay node NM mainly relays data transferred on the path C. The destination node NE is a node N that is a termination point of the path C. The originating node NS and the terminating node NE are connected to a user-side communication device (not shown), and the communication network 2 provides a connection for connecting the user-side communication device between two sites. Further, at least one failure location detection device 3 is provided in the communication system 1 and is connected to the originating node NS and the terminating node NE.

なお、パスＣの端点となる発側ノードＮＳおよび着側ノードＮＥは、そのパスＣの通信不通などの故障を検出するパス故障検出部１０を有している。パス故障検出部１０は、例えば、発側ノードＮＳが定期的に生存確認メッセージをパスＣに流し、正常に生存確認メッセージが着側ノードＮＥに届いたか否かによって、故障を検出する。このパス故障検出部１０は、各パスＣの故障自体は検出可能ではある。そして、故障箇所発見装置３が、パス故障検出部１０からの故障情報をもとに、発側ノードＮＳから着側ノードＮＥまでの間のどの区間で故障したのかを切り分ける。 Note that the originating node NS and the terminating node NE, which are the end points of the path C, have a path failure detection unit 10 that detects a failure such as communication failure of the path C. For example, the path failure detection unit 10 detects a failure depending on whether or not the origination node NS periodically sends a survival confirmation message to the path C and the survival confirmation message normally reaches the destination node NE. The path failure detection unit 10 can detect the failure of each path C itself. Then, based on the failure information from the path failure detection unit 10, the failure location finding device 3 determines in which section the failure occurs from the originating node NS to the terminating node NE.

次に、通信システム１の一部を示すリソースＲとパーツＰの概念を説明する。 Next, the concept of the resource R and the part P indicating a part of the communication system 1 will be described.

図２（ａ）は、通信システム１の装置間を回線（ファイバ）で接続したネットワーク構成図を示している。通信システム１には、ノードＮ１からノードＮ４へのパスＣが２本（現用パスＣと予備パスＣ）設定されている。各パスＣは、パスＣの経路上に存在するノードＮをリンクＬにより順に接続した集合として定義する。このリンクＬには、通常のルーティングプロトコルで使用される、リンクＬコストおよび帯域情報などの付加情報が付与されている。 FIG. 2A shows a network configuration diagram in which devices of the communication system 1 are connected by a line (fiber). In the communication system 1, two paths C (working path C and backup path C) from the node N1 to the node N4 are set. Each path C is defined as a set in which nodes N existing on the path C are connected in order by links L. This link L is given additional information such as link L cost and bandwidth information used in a normal routing protocol.

なお、現用パスＣ以外のパスＣを予備パスＣと呼ぶ。この予備パスＣを現用パスＣとリンクＬおよびノードＮを共有しないように設定する図２の例のように、２つのパスＣが共有するリンクＬおよびノードＮを持たないとき、ｄｉｓｊｏｉｎｔ（互いに素）なパスＣと呼ぶ。 A path C other than the working path C is called a backup path C. When the protection path C is set so as not to share the link L and the node N with the working path C, when the two paths C do not have the link L and the node N shared, disjoint ) Path C.

リソースＲは、パスＣの集合の要素であるノードＮまたはリンクＬである。よって、パスＣは、複数のリソースＲの順序集合である。なお、図２（ａ）は、ノードＮ１からスイッチＳＷに接続し、スイッチＳＷからノードＮ２またはノードＮ３に分岐している。そして、各リソースＲには、リソースＲをユニークに識別するためのリソースＩＤが割り当てられている。リソースＩＤは、例えば、ＳＲＧ番号の組み合わせであるＳＲＧ情報である。 The resource R is a node N or a link L that is an element of a set of paths C. Therefore, the path C is an ordered set of a plurality of resources R. In FIG. 2A, the node N1 is connected to the switch SW, and the switch SW branches to the node N2 or the node N3. Each resource R is assigned a resource ID for uniquely identifying the resource R. The resource ID is, for example, SRG information that is a combination of SRG numbers.

図２（ｂ）は、図２（ａ）の通信システム１において、リソースＲを抽出した図である。例えば、現用パスＣは、＜ノードＮ１→リンクＬ２→ノードＮ３→リンクＬ４→ノードＮ４＞の順序集合であり、５つのリソースＲから構成されている。同じように、予備パスＣは、＜ノードＮ１→リンクＬ１→ノードＮ２→リンクＬ３→ノードＮ４＞の順序集合であり、５つのリソースＲから構成されている。 FIG. 2B is a diagram in which the resource R is extracted in the communication system 1 in FIG. For example, the working path C is an ordered set of <node N1 → link L2 → node N3 → link L4 → node N4>, and includes five resources R. Similarly, the protection path C is an ordered set of <node N1 → link L1 → node N2 → link L3 → node N4>, and includes five resources R.

図２（ｂ）のリンクＬ１は１本であるが、このリンクＬ１を構成する要素は、図２（ａ）に示すように、２本のファイバ（＃１、＃２）である。また、リンクＬ２は１本であるが、このリンクＬ２を構成する要素は、図２（ａ）に示すように、２本のファイバ（＃１、＃３）である。よって、リンクＬ１とリンクＬ２とは、ファイバ＃１を共用している。これにより、ファイバ＃１が故障したときには、リンクＬ１とリンクＬ２とは、同時に使用ができなくなってしまう。 The link L1 in FIG. 2B is one, but the elements constituting this link L1 are two fibers (# 1, # 2) as shown in FIG. 2A. Further, although there is one link L2, the elements constituting the link L2 are two fibers (# 1, # 3) as shown in FIG. Therefore, the link L1 and the link L2 share the fiber # 1. As a result, when the fiber # 1 fails, the link L1 and the link L2 cannot be used at the same time.

図３は、リソースＲの共用を示す説明図である。図３の通信システム１には、３本のパスＣ（Ｃ１、Ｃ２、Ｃ３）が設定されており、矢印はパスＣの経路を示す。例えば、リンクＬ１５に障害が発生したときには、そのリンクＬ１５を共用している２本のパスＣ（Ｃ２、Ｃ３）が使用ができなくなるが、リンクＬ１５を使用していないパスＣ１は、引き続き使用可能である。また、リンクＬ１４に障害が発生したときには、そのリンクＬ１４を使用しているパスＣ３のみが使用ができなくなる。 FIG. 3 is an explanatory diagram showing sharing of the resource R. In the communication system 1 of FIG. 3, three paths C (C1, C2, C3) are set, and an arrow indicates the path C. For example, when a failure occurs in the link L15, the two paths C (C2, C3) sharing the link L15 cannot be used, but the path C1 not using the link L15 can continue to be used. It is. Further, when a failure occurs in the link L14, only the path C3 using the link L14 cannot be used.

このように、同じパスＣ３の故障であっても、その故障の原因となるリンクＬの位置によって、パスＣ３とは別のパスＣの故障が起きるか否かが決まる。よって、故障箇所発見装置３は、通信システム１に設定されている所定のパスＣの故障について、そのパスＣ以外のパスＣの故障を併せて検討することによって、所定のパスＣの故障原因となるリンクＬの位置を特定する。 In this way, even if a failure occurs in the same path C3, whether or not a failure occurs in a path C different from the path C3 is determined depending on the position of the link L that causes the failure. Therefore, the failure location detection device 3 considers the failure of the predetermined path C as a cause of the failure of the predetermined path C by examining the failure of the path C other than the path C for the failure of the predetermined path C set in the communication system 1. The position of the link L is specified.

図４は、パーツＩＤの割り当てを示す説明図である。パーツＰは、リソースＲを構成するネットワークまたは装置の部品（ハードウェア資源）であり、通常のルーティングプロトコルで配布されない下位レイヤの物理的な機器の構成要素を表す。パーツＰは、例えば、図４に示される回線カード、インターフェイス、ファイバなどが挙げられる。パーツＩＤは＜パーツ種別ＩＤ、パーツ製造ＩＤ＞の組合せで構成されている。パーツＩＤは、通信システム１内の各パーツＰに対してユニークに割り当てられている。パーツＩＤは、例えば、ＳＲＧ番号である。 FIG. 4 is an explanatory diagram showing assignment of part IDs. The part P is a component of the network or apparatus (hardware resource) that constitutes the resource R, and represents a component of a physical device in a lower layer that is not distributed by a normal routing protocol. Examples of the part P include a line card, an interface, and a fiber shown in FIG. The part ID is composed of a combination of <part type ID, part manufacturing ID>. The part ID is uniquely assigned to each part P in the communication system 1. The part ID is, for example, an SRG number.

まず、パーツ種別ＩＤは障害要因となるパーツＰの種類を表す。パーツＰの種類は、回線カードや、ファイバ回線などパーツＰの用途を識別する単位に設定してもよいし、同じイーサネット（登録商標）の回線カードでも、Ａ社が製造する回線カードと、Ｂ社が製造する回線カードとで、別々のパーツ種別ＩＤを割り当ててもよい。よって、通信システム１は、同じパーツ種別ＩＤを有する複数のパーツＰから構成されてもよい。 First, the part type ID represents the type of the part P that becomes a failure factor. The type of the part P may be set to a unit for identifying the use of the part P, such as a line card or a fiber line. Even if the line card of the same Ethernet (registered trademark) is used, Different part type IDs may be assigned to the line cards manufactured by the company. Therefore, the communication system 1 may be composed of a plurality of parts P having the same part type ID.

次に、パーツ製造ＩＤは、パーツ種別ＩＤごとに割り当てられるＩＤである。これは、例えば、Ａ社が製造する複数の回線カードには、それぞれユニークな（別々の）パーツ製造ＩＤが割り当てられる。例えば、パーツ製造ＩＤは、各製品にメーカが割り当てる製造番号である。よって、Ａ社とＢ社とでは、独立に製造番号を割り当てられるので、通信システム１は、同じパーツ製造ＩＤを有する複数のパーツＰから構成されてもよい。しかし、パーツ種別ＩＤとパーツ製造ＩＤとの組で構成されるパーツＩＤは、通信システム１内の各パーツＰに対してユニークに割り当てられている。 Next, the part manufacturing ID is an ID assigned for each part type ID. For example, a unique (separate) parts manufacturing ID is assigned to each of a plurality of line cards manufactured by Company A. For example, the part manufacturing ID is a manufacturing number assigned to each product by the manufacturer. Therefore, since the company A and the company B are independently assigned production numbers, the communication system 1 may be composed of a plurality of parts P having the same parts production ID. However, a part ID composed of a set of a part type ID and a part manufacturing ID is uniquely assigned to each part P in the communication system 1.

パーツＩＤに関する情報は、各ノードＮが保持し、ルーティングプロトコルにより、ネットワーク内の他のノードＮおよび故障箇所発見装置３間に配布される。 Information about the part ID is held by each node N, and distributed between the other nodes N in the network and the failure location finding device 3 by a routing protocol.

図５は、故障箇所発見装置３を示す構成図である。故障箇所発見装置３は、記憶手段（パス情報データベース２０、トポロジデータベース２２、および、故障情報データベース２４）と、制御手段（故障パーツ発見部２６、および、故障パーツ出力部２８）とを備える。 FIG. 5 is a configuration diagram illustrating the failure location device 3. The failure location discovery device 3 includes storage means (path information database 20, topology database 22, and failure information database 24) and control means (failure parts discovery unit 26 and failure part output unit 28).

まず、パス情報データベース２０は、通信ネットワーク２内のパスＣの中継経路（使用しているリソースＲ）に関する情報を記録する。この情報は、パスＩＤと、パスＩＤの始点から終点までの経路上のリソースＲのリソースＩＤとの対応情報を含む。これにより、所定のパスＩＤのパスＣが故障したときには、このパス情報データベース２０を検索することにより、その故障したパスＣを構成するリソースＲのリソースＩＤを取得することができる。 First, the path information database 20 records information related to the relay route (used resource R) of the path C in the communication network 2. This information includes correspondence information between the path ID and the resource ID of the resource R on the route from the start point to the end point of the path ID. Thereby, when a path C having a predetermined path ID fails, the resource ID of the resource R constituting the failed path C can be acquired by searching the path information database 20.

次に、トポロジデータベース２２は、ルーティングプロトコルにより配布されている経路情報などから作成されるネットワークトポロジを格納する。さらに、トポロジデータベース２２は、リソースＩＤとパーツＩＤとの対応情報を保持する。 Next, the topology database 22 stores the network topology created from the route information distributed by the routing protocol. Further, the topology database 22 holds correspondence information between resource IDs and part IDs.

リソースＩＤとパーツＩＤとの対応情報を具体的に説明する。図４の例では、１つのノードＮに、２つのリンクＬ（Ｌ１，Ｌ２）が接続されており、パーツＰは、ファイバ、インターフェイス、回線カード、装置の４つの階層で構成されている。各階層には、異なるパーツ種別ＩＤが付与されている。 The correspondence information between the resource ID and the part ID will be specifically described. In the example of FIG. 4, two links L (L1, L2) are connected to one node N, and the part P is composed of four layers of fiber, interface, line card, and apparatus. A different part type ID is assigned to each layer.

２つのリンクＬ（Ｌ１，Ｌ２）は、装置と回線カードとを共用しているが、インターフェイスとファイバは、別々のものを使用している。よって、リソースＩＤ（リンク＃Ａ）は、パーツＩＤの集合＜装置＃１、回線カード＃２、インターフェイス＃３１、ファイバ＃４１＞と対応づけられる。また、リソースＩＤ（リンク＃Ｂ）は、＜装置＃１、回線カード＃２、インターフェイス＃３２、ファイバ＃４２＞と対応づけられる。 The two links L (L1, L2) share a device and a line card, but use separate interfaces and fibers. Therefore, the resource ID (link #A) is associated with a set of part IDs <device # 1, line card # 2, interface # 31, fiber # 41>. The resource ID (link #B) is associated with <device # 1, line card # 2, interface # 32, fiber # 42>.

このリソースＩＤとパーツＩＤとの対応情報は、例えば、ルーティングプロトコルにより、通信ネットワーク２内に配布される。この情報より、リンクＬ１とリンクＬ２は、同時に故障する可能性があり、要因としては共用している装置の故障と、共用している回線カードの故障がある。一方、インターフェイスまたはファイバの故障については、１つのパーツＰだけ故障したときには、その故障に起因して発生するリンクＬの故障は片方だけであり、同時には故障しない。 The correspondence information between the resource ID and the part ID is distributed in the communication network 2 by a routing protocol, for example. From this information, there is a possibility that the link L1 and the link L2 may fail at the same time, and the cause is a failure of the shared device and a failure of the shared line card. On the other hand, regarding the failure of the interface or the fiber, when only one part P fails, only one of the links L is generated due to the failure and does not fail at the same time.

そして、故障情報データベース２４は、通信ネットワーク２の故障に関する情報を格納する。具体的には、故障箇所発見装置３は、故障が発生したときに、図６に示すフローチャートに従う動作を行うことにより、図７に示すように故障情報データベース２４を更新する。 The failure information database 24 stores information related to failures in the communication network 2. Specifically, the failure location device 3 updates the failure information database 24 as shown in FIG. 7 by performing an operation according to the flowchart shown in FIG. 6 when a failure occurs.

以下、通信システム１の動作について、図１から図５および図７を参照しつつ、図６のフローチャートに沿って説明する。図６は、故障箇所発見処理を示すフローチャートである。 Hereinafter, the operation of the communication system 1 will be described along the flowchart of FIG. 6 with reference to FIGS. 1 to 5 and FIG. FIG. 6 is a flowchart showing a failure location finding process.

まず、故障箇所発見装置３は、パスＣの故障を検出する（Ｓ１０１）。つまり、故障箇所発見装置３は、パスＣの故障を検出した発側ノードＮＳ（または着側ノードＮＥ）から、故障したパスＣを特定可能な情報（ＧＭＰＬＳではＴｕｎｎｅｌＩＤ、ＬＳＰＩＤ、終端ノードＮのアドレスなど）を含む故障発生通知を受信する。故障箇所発見装置３は、特定可能な情報からパス情報データベース２０を参照することにより、そのパスＣのパスＩＤを取得する。 First, the failure location device 3 detects a failure in the path C (S101). That is, the failure location detection device 3 can identify the failed path C from the originating node NS (or the destination node NE) that detected the failure of the path C (TunnelID, LSPID, and the address of the terminal node N in GMPLS). Etc.) is received. The failure location device 3 acquires the path ID of the path C by referring to the path information database 20 from the identifiable information.

次に、故障箇所発見装置３は、パスＣの故障通知をまとめて故障パスリストを作成する（Ｓ１０２）。つまり、故障箇所発見装置３は、Ｓ１０１の故障発生通知の内容を故障情報データベース２４に保存し、所定期間（一定時間など）のタイマを起動する。そのタイマが終了するまでの間に受信した故障発生通知は、Ｓ１０１の故障発生通知と同時に発生した故障として、故障情報データベース２４に保存される（図７（ａ）の左列を参照）。 Next, the failure location device 3 collects failure notifications for the path C and creates a failure path list (S102). That is, the failure location finding device 3 stores the content of the failure occurrence notification in S101 in the failure information database 24 and starts a timer for a predetermined period (such as a fixed time). The failure occurrence notification received until the timer expires is stored in the failure information database 24 as a failure occurring at the same time as the failure occurrence notification in S101 (see the left column in FIG. 7A).

なお、図７（ａ）の右列は、左列のパスＣが使用するリソースＩＤのリストとなっている。例えば、パス＃１は、３つのリソースＲ（Ｒ１，Ｒ２，Ｒ４）を経由するパスＣである。Ｓ１０２のタイマが終了すると、故障パーツ発見部２６は、図７（ａ）の左列を検索キーとして、パス情報データベース２０を検索することにより、図７（ａ）の右列を取得する。 The right column of FIG. 7A is a list of resource IDs used by the path C in the left column. For example, path # 1 is path C that passes through three resources R (R1, R2, and R4). When the timer of S102 expires, the failed part finding unit 26 searches the path information database 20 using the left column in FIG. 7A as a search key, thereby acquiring the right column in FIG. 7A.

そして、故障箇所発見装置３は、故障パスリスト（図７（ａ））から故障リソースリスト（図７（ｂ））を作成する（Ｓ１０３）。具体的には、各リソースＲをＲｉ（ｉ＝１、２、…、Ｎ）とし、ｉ＝１〜Ｎまでに対して以下の手順１から手順３まで適用することにより、リソースＲを選別する。
手順１：パス情報データベース２０を検索し、リソースＲｉに含まれる全てのパス数をカウントし、ＰＡとする。
手順２：図７（ａ）を参照し、Ｒｉを使用しているパスＣの数をカウントし、ＰＢとする。
手順３：条件（ＰＡ−ＰＢ）／ＰＡ≦閾値α（第１閾値）を満たす場合、リソースＲｉを故障リソースＲとして、図７（ｂ）の左列の故障リソースリストに追加する。 Then, the fault location device 3 creates a fault resource list (FIG. 7B) from the fault path list (FIG. 7A) (S103). Specifically, each resource R is set to Ri (i = 1, 2,..., N), and resource R is selected by applying the following procedure 1 to procedure 3 for i = 1 to N. .
Procedure 1: The path information database 20 is searched, and the total number of paths included in the resource Ri is counted as PA.
Procedure 2: Referring to FIG. 7A, the number of paths C using Ri is counted and set as PB.
Procedure 3: When the condition (PA−PB) / PA ≦ threshold value α (first threshold value) is satisfied, the resource Ri is added as a failure resource R to the failure resource list in the left column of FIG.

ここで閾値αは、０以上１以下の実数である。基本的には故障したリソースＲに収容される全てのパスＣはサービス断となるので、α＝０として、手順３の条件式を満たすリソースＲが故障したリソースＲである。これにより、故障したリソースＲを検出する精度が向上する。 Here, the threshold value α is a real number between 0 and 1. Basically, all paths C accommodated in the failed resource R are out of service, so that α = 0 and the resource R satisfying the conditional expression in Procedure 3 is the failed resource R. Thereby, the accuracy of detecting the failed resource R is improved.

一方、ノードＮから故障箇所発見装置３ヘの故障発生通知の到着が遅れたため、故障は発生したが故障情報データベース２４に含まれないパスＣが存在するケースもある。そのときには、例えば、α＝０．２などαを０より大きな値とする。これにより、迅速に故障が発生したリソースＲを発見できる。 On the other hand, since the arrival of the failure occurrence notification from the node N to the failure location finding device 3 is delayed, there may be a path C that has failed but is not included in the failure information database 24. In this case, for example, α is set to a value larger than 0, such as α = 0.2. Thereby, it is possible to quickly find the resource R in which the failure has occurred.

そして、故障箇所発見装置３は、故障リソースリスト（図７（ｂ））から故障パーツリスト（図７（ｃ））を作成する（Ｓ１０４）。そのために、まず、故障箇所発見装置３は、トポロジデータベース２２を検索することにより、故障リソースリスト（図７（ｂ）の左列）から、その各リソースＲが使用するパーツＰのリスト（図７（ｂ）の右列）を作成する。 Then, the failure location device 3 creates a failure parts list (FIG. 7C) from the failure resource list (FIG. 7B) (S104). For this purpose, first, the fault location device 3 searches the topology database 22 to search the list of parts P used by each resource R from the fault resource list (left column of FIG. 7B) (FIG. 7). (B) right column) is created.

故障リソースリストの作成が完了すると、リスト内のリソースＲの情報から、さらにトポロジデータベース２２を検索して、リソースＲに付与された階層化ＳＲＧ番号を検索し、該当するＳＲＧ情報を故障ＳＲＧ候補リストに追加し、故障パーツリストを作成する。 When the creation of the fault resource list is completed, the topology database 22 is further searched from the information of the resource R in the list, the hierarchical SRG number assigned to the resource R is searched, and the corresponding SRG information is stored in the fault SRG candidate list. And create a failed parts list.

次に故障パーツ発見部２６は故障が発生したパーツＰを発見する処理として、図７（ｂ）の右列に含まれる全てのパーツＰに対して以下の手順４〜手順６を適用する。 Next, the failed part finding unit 26 applies the following steps 4 to 6 to all the parts P included in the right column of FIG. 7B as a process for finding the part P in which the failure has occurred.

手順４：所定のパーツＰについてトポロジデータベース２２を検索し、所定のパーツＰを含むリソースＲの数をカウントする。このリソースＲの数をＱＡとする。
手順５：所定のパーツＰについて故障リソースリストに含まれている全リソースＲの数をカウントする。この数をＱＢとする。
手順６：条件ＱＡ／ＱＢ≧閾値β（第２閾値）を満たす場合、所定のパーツＰを故障したパーツＰとして、図７（ｃ）の故障パーツリストに追加する。 Procedure 4: The topology database 22 is searched for a predetermined part P, and the number of resources R including the predetermined part P is counted. Let QA be the number of resources R.
Procedure 5: Count the number of all resources R included in the failure resource list for the predetermined part P. This number is QB.
Procedure 6: When the condition QA / QB ≧ threshold value β (second threshold value) is satisfied, the predetermined part P is added as a failed part P to the failed part list in FIG.

ここで、閾値βは、０以上１以下の実数である。基本的には故障したパーツＰが１つであれば、β＝１として、手順６の条件式を満たすパーツＰが、故障したパーツＰである。 Here, the threshold value β is a real number between 0 and 1. Basically, if there is only one failed part P, the part P satisfying the conditional expression of Procedure 6 is β = 1 and is the failed part P.

実際には確率は低いが同時に複数のＳＲＧが故障することがあるため、例えばβ＝０．２などβを１より小さな値とすることで、多重故障が発生した場合でも故障したパーツＰを発見できる。パーツＩＤとパーツＰとは１対１に対応しているため、故障箇所発見装置３内の故障パーツリスト内に含まれるパーツＩＤにより、故障したパーツＰを発見することができる。 Actually, although the probability is low, multiple SRGs may fail at the same time. For example, by setting β to a value smaller than 1, such as β = 0.2, the faulty part P is found even when multiple failures occur it can. Since the part ID and the part P have a one-to-one correspondence, the failed part P can be found by the part ID included in the failed part list in the failed part finding device 3.

さらに、故障パーツ出力部２８は、Ｓ１０４で発見した故障パーツリストを出力する（Ｓ１０５）。出力の形式は、例えば、パーツＩＤやパーツＩＤに対応するパーツＰ名をテキスト形式で出力するシンプルなものでもよいし、ネットワークトポロジを示すマップ上に位置するパーツＩＤに対応するパーツＰのアイコンの色を赤色にするなどの視覚的にわかりやすい形式で出力してもよい。 Further, the failed parts output unit 28 outputs the failed parts list found in S104 (S105). The output format may be, for example, a simple one that outputs the part ID and the part P name corresponding to the part ID in a text format, or the icon of the part P corresponding to the part ID located on the map indicating the network topology. It may be output in a visually easy-to-understand format such as red.

以上説明した本実施形態は、通信網に収容される各コネクションの中継経路の計算を行う装置および方法に関する。本実施形態は、例えば、レイヤ３のヘッダ情報を元に転送を行なうノードＮ網、固定長のラベルを転送するパケットに付与することで通信網内を転送するＭＰＬＳ通信網、ＴＤＭのタイムスロットや光の波長を一般化ラベルとして扱うＧＭＰＬＳ通信網、ＡＴＭ（Asynchronous Transfer Mode）通信網などに適用される。これにより、障害発生がどのリンクＬ、装置で発生したかまでを発見するだけでなく、従来の方法では困難であった、そのリンクＬのファイバの断線か、そのリンクＬの両端のどちらの装置か、装置のどのインターフェイスカードか、といった詳細な障害発生箇所まで特定することが可能となった。 The embodiment described above relates to an apparatus and a method for calculating a relay route of each connection accommodated in a communication network. In the present embodiment, for example, a node N network that performs transfer based on layer 3 header information, an MPLS communication network that transfers within a communication network by attaching a fixed-length label to a packet to be transferred, a TDM time slot, The present invention is applied to GMPLS communication networks that handle light wavelengths as generalized labels, ATM (Asynchronous Transfer Mode) communication networks, and the like. As a result, not only the link L and the device in which the failure occurs, but also the device at either end of the link L or the fiber breakage of the link L, which was difficult with the conventional method, is detected. It is now possible to identify the detailed failure location such as which interface card of the device.

以上説明した本発明は、以下のようにその趣旨を逸脱しない範囲で広く変形実施することができる。 The present invention described above can be widely modified without departing from the spirit thereof as follows.

例えば、故障箇所発見装置３の記憶手段における各データベースは、故障箇所発見装置３がデータアクセスできればいいので、故障箇所発見装置３の内部に格納してもよいし、故障箇所発見装置３とは別の装置に格納してもよい。 For example, each database in the storage means of the failure location discovery device 3 may be stored inside the failure location discovery device 3 or separated from the failure location discovery device 3 as long as the failure location discovery device 3 can access data. You may store in the apparatus of.

また、本実施形態で記載されている故障箇所発見装置３は、ノードＮと別装置として説明されているが、ノードＮ内に故障箇所発見装置３を配備してもよいし、複数の装置に故障箇所発見装置３の機能を分散させてもよい。 Moreover, although the failure location discovery device 3 described in the present embodiment has been described as a device separate from the node N, the failure location discovery device 3 may be provided in the node N, or a plurality of devices may be provided. The functions of the failure location detection device 3 may be distributed.

さらに、パーツＩＤは、パーツＰをユニークに識別するように設定されているとしたが、複数のパーツＰが同時に故障する可能性があるときには、それらの複数のパーツＰをまとめて、１つのパーツＩＤを割り当ててもよい。 Furthermore, although the part ID is set so as to uniquely identify the part P, when there is a possibility that a plurality of parts P may fail at the same time, the plurality of parts P are combined into one part. An ID may be assigned.

本発明の一実施形態に関する通信システムを示す構成図である。It is a block diagram which shows the communication system regarding one Embodiment of this invention. 本発明の一実施形態に関するリソースの概念を示す説明図である。It is explanatory drawing which shows the concept of the resource regarding one Embodiment of this invention. 本発明の一実施形態に関するリソースの共用を示す説明図である。It is explanatory drawing which shows the sharing of the resource regarding one Embodiment of this invention. 本発明の一実施形態に関するパーツＩＤの割り当てを示す説明図である。It is explanatory drawing which shows allocation of parts ID regarding one Embodiment of this invention. 本発明の一実施形態に関する故障箇所発見装置を示す構成図である。It is a block diagram which shows the failure location discovery apparatus regarding one Embodiment of this invention. 本発明の一実施形態に関する故障箇所発見処理を示すフローチャートである。It is a flowchart which shows the failure location discovery process regarding one Embodiment of this invention. 本発明の一実施形態に関する故障パーツリストの作成を示す説明図である。It is explanatory drawing which shows preparation of the failure parts list regarding one Embodiment of this invention.

Explanation of symbols

Ｃパス
Ｌリンク
Ｎノード
Ｐパーツ
Ｒリソース
１通信システム
２通信ネットワーク
３故障箇所発見装置
１０パス故障検出部
２０パス情報データベース
２２トポロジデータベース
２４故障情報データベース
２６故障パーツ発見部
２８故障パーツ出力部 C path L link N node P part R resource 1 communication system 2 communication network 3 failure location detection device 10 path failure detection unit 20 path information database 22 topology database 24 failure information database 26 failure part discovery unit 28 failure part output unit

Claims

A failure location finding method for finding a failure of a part that is a component in a communication network connecting nodes for setting a path,
A computer having a means for accessing a storage means for storing a resource ID for uniquely identifying a resource constituting the path and a part ID for uniquely identifying the part constituting the resource,
A procedure of receiving notification of first path information in which a failure has occurred from a node that is a start point or an end point of the path;
Receiving a notification of the second path information in which a failure has occurred within a predetermined time from the notification of the first path information, and creating a failure path list by combining the first path information and the second path information;
A procedure for creating a fault resource list including at least a part of resource IDs of resources constituting each path of the fault path list with reference to the storage means;
A procedure for referring to the storage means and creating a fault parts list composed of at least part of part IDs of parts constituting each resource of the fault resource list;
A procedure for outputting the failed parts list;
A fault location finding method characterized by executing

The part ID for uniquely identifying the part is configured by a combination of a part type ID indicating a part type and a part manufacturing ID for uniquely identifying a part having the same part type ID. Item 8. The method for finding a fault location according to Item 1.

The procedure for creating the fault resource list is as follows:
Ratio of the number of paths using the predetermined resource among the paths set in the communication network and the number of paths using the predetermined resource among the failed paths for the predetermined resource And the first threshold value,
The failure resource candidate list is created by selecting a predetermined resource based on a result of comparison with the first threshold value from all resources constituting each path of the failure path list. Item 3. A fault location finding method according to Item 2.

The procedure for creating the failed parts list is as follows:
For a predetermined part, a ratio between the number of resources using the predetermined part set in the communication network and the number of resources using the predetermined part in the fault resource list; Compare with the threshold,
The failure part list is created by selecting a predetermined part based on a comparison result with the second threshold value from all the parts constituting each resource of the failure resource list. 2. The fault location finding method according to 2.

A failure location finding device that finds a failure of a part that is a component in a communication network that connects nodes that set a path,
A failure location finding apparatus having means for accessing a storage means for storing a resource ID for uniquely identifying a resource constituting the path and a part ID for uniquely identifying the part constituting the resource,
A network interface that receives notification of first path information in which a failure has occurred from a node that is a start point or an end point of the path;
The notification of the second path information in which a failure has occurred is received within a predetermined time from the notification of the first path information, and a failure path list is created by combining the first path information and the second path information. A failure resource list composed of at least a part of resource IDs of resources constituting each path of the failure path list is created with reference to the means, and the storage means is referenced to configure each resource of the failure resource list. A faulty part finding unit that creates a faulty parts list composed of at least a part of part IDs of parts to be
A faulty part output unit for outputting the faulty parts list;
A fault location finding device characterized by comprising:

The part ID for uniquely identifying the part is configured by a combination of a part type ID indicating a part type and a part manufacturing ID for uniquely identifying a part having the same part type ID. Item 6. A fault location detection apparatus according to Item 5.

The failed part discovery unit
Ratio of the number of paths using the predetermined resource among the paths set in the communication network and the number of paths using the predetermined resource among the failed paths for the predetermined resource And the first threshold value,
6. The failure resource candidate list is created by selecting a predetermined resource from all resources constituting each path of the failure path list based on a comparison result with the first threshold value. Item 7. A failure location detection apparatus according to Item 6.

The failed part discovery unit
For a predetermined part, a ratio between the number of resources using the predetermined part set in the communication network and the number of resources using the predetermined part in the fault resource list; Compare with the threshold,
The failure part list is created by selecting a predetermined part from all the parts constituting each resource of the failure resource list according to a comparison result with the second threshold value. 6. The fault location device according to 6.